11.08.2013 Views

CATS Proceedings Printout - Graz University of Technology

CATS Proceedings Printout - Graz University of Technology

CATS Proceedings Printout - Graz University of Technology

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

th<br />

The 15 International IGTE Symposium<br />

on Numerical Field Calculation in Electrical Engineering<br />

Institute for Fundamentals and Theory<br />

in Electrical Engineering - IGTE<br />

<strong>Proceedings</strong><br />

Sept. 17 - 19, 2012<br />

Hotel Novapark, <strong>Graz</strong>, Austria<br />

ISBN: 978-3-85125-258-3<br />

Verlag der Technischen Universität <strong>Graz</strong><br />

www.ub.tugraz.at/Verlag<br />

<strong>Graz</strong> <strong>University</strong><br />

<strong>of</strong> <strong>Technology</strong>


The 15th International IGTE Symposium on Numerical Field Calculation in Electrical<br />

Engineering is sponsored and supported by:


.<br />

- I - 15th IGTE Symposium 2012<br />

Table <strong>of</strong> Contents<br />

Multi Domain Multi Scale Problems in the High Frequency Finite 1<br />

Element Method (FEM)<br />

Istvan Bardi, Kezhong Zhao, Rickard Petersson, John Silvestro, Nancy Lambert<br />

A parallel-TLM algorithm. Modelling the Earth-ionosphere waveguide 7<br />

Sergio Toledo-Redondo, Alfonso Salinas, Jesús Fornieles, Jorge Portí,<br />

Bruno Besser, Herbert I. M. Lichtenegger<br />

A Novel Parametric Model Order Reduction Approach with Applications 13<br />

to Geometrically Parameterized Microwave Devices<br />

Stefan Burgard, Ortwin Farle, Romanus Dyczij-Edlinger<br />

Efficient Finite-Element Computation <strong>of</strong> Far-Fields <strong>of</strong> Phased Arrays by 19<br />

Order Reduction<br />

Alexander Sommer, Ortwin Farle, Romanus Dyczij-Edlinger<br />

Nanoparticle device for biomedical and optoelectronics applications 25<br />

Renato Iovine, Luigi La Spada, Lucio Vegni<br />

Validation <strong>of</strong> measurements with conjugate heat transfer models 31<br />

Maximilian Schrittwieser, Oszkár Bíró, Ernst Farnleitner, Gebhard Kastner<br />

Computing the shielding effectiveness <strong>of</strong> waveguides using FE-mesh 37<br />

truncation by surface operator implementation<br />

Christian Tuerk, Werner Renhart, Christian Magele<br />

Heat Transfer Analysis on End Windings <strong>of</strong> a Hydro Generator using a 41<br />

Stator-Slot-Sector Model<br />

Stephan Klomberg, Ernst Farnleitner, Gebhard Kastner, Oszkár Bíró<br />

Numerical Investigation <strong>of</strong> Linear Systems Obtained by Extended 47<br />

Element-Free Galerkin Method<br />

Taku Itoh, Soichiro Ikuno, Atsushi Kamitani<br />

Electromagnetic Wave Propagation Simulation in Corrugated 53<br />

Waveguide using Meshless Time Domain Method<br />

Soichiro Ikuno, Yoshihisa Fujita, Taku Itoh, Susumu Nakata, Atsushi Kamitani<br />

Optimization <strong>of</strong> Permanent Magnet Linear Actuator for Braille Screen 59<br />

Ivan Yatchev, Iosko Balabozov, Krastio Hinov, Vultchan Gueorgiev,<br />

Dimitar Karastoyanov<br />

3D Finite Element Analysis <strong>of</strong> Induction Heating System for High 63<br />

Frequency Welding<br />

Ilona Iatcheva, Georgi Gigov, Georgi Kunov, Rumena Stancheva<br />

Optimization Algorithms in the View <strong>of</strong> State Space Concepts 67<br />

Markus Neumayer, Daniel Watzenig, Gerald Steiner, Bernhard Brandstätter<br />

Quasi TEM Analysis <strong>of</strong> 2D Symmetrically Coupled Strip Lines with Finite 73<br />

Grounded Plane using HBEM<br />

Saša Ilić, Mirjana Perić, Slavoljub Aleksić, Nebojsa Raicevic


- II - 15th IGTE Symposium 2012<br />

Design Approach for a Line-Start Internal Permanent Magnet 78<br />

Synchronous Motor<br />

Vera Elistratova, Michel Hecquet, Pascal Brochet, Darius Vizireanu,<br />

Maxime Dessoude<br />

Speed-up <strong>of</strong> Nonlinear Electromagnetic Field Analysis using Fixed-Point 84<br />

Method<br />

Norio Takahashi, Kouske Shimoyama, Daisuke Miyagi, Hiroyuki Kaimori<br />

S<strong>of</strong>tware agent based domain decomposition method 89<br />

Matthias Jüttner, André Buchau, Michael Rauscher, Wolfgang M. Rucker,<br />

Peter Göhner<br />

Stochastic Jiles-Atherton model accounting for s<strong>of</strong>t magnetic material 95<br />

variability<br />

Rindra Ramarotafika, Abdelkader Benabou, Stéphane Clénet<br />

Human exposure to the magnetic field produced by MFDC spot welding 101<br />

systems<br />

Davide Bavastro, Aldo Canova, Luca Giaccone, Michele Manca, Marco Simioli<br />

A Circuital Approach for Eddy Currents Fast Evaluation in Beam-like 108<br />

Structures<br />

Alessandro Formisano, Raffaele Martone<br />

Convergence Characteristics <strong>of</strong> Preconditioned MRTR Method with 113<br />

Eisenstat’s Technique in Real Symmetric Sparse Matrix<br />

Yoshifumi Okamoto, Tomonori Tsuburaya, Koji Fujiwara, Shuji Sato<br />

High Frequency Mixing Rule Based Effective Medium Theory <strong>of</strong> 119<br />

Metamaterials<br />

Zsolt Szabo<br />

Enhancement <strong>of</strong> Maximum Starting Torque and Efficiency in Permanent 125<br />

Magnet Synchronous Motors<br />

Jawad Faiz, Vahid Ghorbanian, Bashir Mahdi Ebrahimi<br />

Core Losses Estimation Techniques in Electrical Machines with 131<br />

Different Supplies-A Review<br />

Jawad Faiz, Amir Masoud Takbash, Bashir Mahdi Ebrahimi<br />

Fast Computation <strong>of</strong> Inductances and Capacitances <strong>of</strong> High Voltage 137<br />

Power Transformer Windings<br />

Tomislav Župan, Željko Štih, Bojan Trkulja<br />

Numerical and Experimental Investigations <strong>of</strong> the Structural 144<br />

Characteristics <strong>of</strong> Stator Core Stacks<br />

Mathias Mair, Bernhard Weilharter, Siegfried Rainer, Katrin Ellermann,<br />

Oszkár Bíró<br />

Proper Location <strong>of</strong> the Regulating Coil in Transformers from 154<br />

Short-circuit Forces Point <strong>of</strong> View<br />

Oluş Sonmez, Bilal Düzgün, Güven Kömürgöz<br />

Robust Design <strong>of</strong> IPM motors using Co-Evolutionary Algorithms 160<br />

Min Li, Andre Ruela, Frederico Guimaraes, Jaime Ramirez, David Lowther


- III - 15th IGTE Symposium 2012<br />

Free-form optimization for magnetic design 167<br />

Zoran Andjelic, Salih Sadovic<br />

Optimization for ECT treatment planning 171<br />

Paolo Di Barba, Luca Giovanni Campana, Fabrizio Dughiero, Carlo Riccardo Rossi,<br />

Elisabetta Sieni<br />

Investigation <strong>of</strong> the Electroporation Effect in a Singel Cell 175<br />

Jaime Ramirez, William Figueiredo, Joao Francisco Vale, Isabela Metzker,<br />

Rafael Santos, Elizabeth Silva, David Lowther<br />

Anisotropic Model for the Numerical Computation <strong>of</strong> Magnetostriction 181<br />

in Steel Sheets<br />

Manfred Kaltenbacher, Adrian Volk, Michael Ertl<br />

Analytic Approximation Solution for the Schwarz-Christ<strong>of</strong>fel Parameter 186<br />

Problem<br />

Norbert Eidenberger, Bernhard G. Zagar<br />

Additional Eddy Current Losses in Induction Machines Due to 190<br />

Interlaminar Short Circuits<br />

Paul Handgruber, Andrej Stermecki, Oszkár Bíró, Georg Ofner<br />

Evaluating the influence <strong>of</strong> manufacturing tolerances in permanent 198<br />

magnet synchronous machines<br />

Isabel Coenen, Thomas Herold, Christelle Piantsop Mboo, Kay Hameyer<br />

Eddy current analysis <strong>of</strong> a PWM controlled induction machine 204<br />

Hai Van Jorks, Erion Gjonaj, Thomas Weiland<br />

Computation <strong>of</strong> end-winding inductances <strong>of</strong> rotating electrical 208<br />

machinery through three-dimensional magnetostatic integral FEM formulation<br />

Flavio Calvano, Giorgio Dal Mut, Fabrizio Ferraioli, Alessandro Formisano,<br />

Fabrizio Marignetti, Raffaele Martone, Guglielmo Rubinacci,<br />

Antonello Tamburrino, Salvatore Ventre<br />

Magnetomechanical Coupled FE Simulations <strong>of</strong> Rotating Electrical 214<br />

Machines<br />

Anouar Belahcen, Katarzyna Fonteyn, Reijo Kouhia, Paavo Rasilo, Antero Arkkio<br />

Saturable Model <strong>of</strong> Squirrel-cage Induction Motors under Stator 220<br />

Inter-turn Fault<br />

Jawad Faiz, Mansour Ojaghi, Mahdi Sabouri<br />

Accurate Magnetostatic Simulation <strong>of</strong> Step-Lap Joints in Transformer 226<br />

Cores Using Anisotropic Higher Order FEM<br />

Andreas Hauck, Michael Ertl, Joachim Schöberl, Manfred Kaltenbacher<br />

Finite Element Based Modeling <strong>of</strong> Wound Rotor Induction Machines 232<br />

Martin Mohr, Oszkár Bíró, Andrej Stermecki, Franz Diwoky<br />

Post Insulator Optimization Based on Dynamic Population Size 238<br />

Peter Kitak, Arnel Glotic, Igor Ticar<br />

Simulation <strong>of</strong> the Absorbing Clamp Method for Optimizing the 242<br />

Shielding <strong>of</strong> Power Cables<br />

Szabolcs Gyimóthy, József Pávó, Péter Kiss, Tomoaki Toratani, Ryuichi Katsumi,<br />

Gábor Varga


- IV - 15th IGTE Symposium 2012<br />

A Neural Network Approach to Sizing an Electrical Machine 248<br />

Steven Bielby, David Lowther<br />

Exploring and Exploiting Parallelism in the Finite Element Method on 254<br />

Multi-core Processors: an Overview<br />

Hussein Moghnieh, David Lowther<br />

Diagnosis <strong>of</strong> real cracks from the three spatial components <strong>of</strong> the eddy 262<br />

current testing signals<br />

Milan Smetana, Ladislav Janousek, Mihai Rebican, Tatiana Strapacova,<br />

Anton Duca, Gabriel Preda<br />

Adaptive Galaxy-Based Search Approach Applied to Loney’s Solenoid 267<br />

Benchmark Problem<br />

Leandro dos Santos Coelho, Teodoro Cardoso Bora, Piergiorgio Alotto<br />

Implementation <strong>of</strong> a 3D magnetic circuit model for automotive 271<br />

applications<br />

Ioannis Anastasiadis, Andreas Buchinger, Tobias Werth, Lukas Bellwald,<br />

Kurt Preis<br />

Robust Optimization <strong>of</strong> Passive RFID Antennas Loaded by Non-linear 276<br />

Circuits<br />

Yuta Watanabe, Hajime Igarashi<br />

Mixed Order Edge-based Finite Element Analysis by Means <strong>of</strong> 282<br />

Nonconforming Connection<br />

Yoshifumi Okamoto, Shuji Sato<br />

Topology Optimization Using Parallel Search Strategy for Magnetic 288<br />

Devices<br />

Takumi Nagano, Shogo Yasukawa, Shinji Wakao, Yoshifumi Okamoto<br />

Modeling <strong>of</strong> the Road Influence on the Grounding System in its Vicinity 294<br />

Dragan Vuckovic, Nenad Cvetkovic, Dejan Krstic, Miodrag Stojanovic<br />

Interaction Magnetic Force Calculation <strong>of</strong> Axial Passive Magnetic 300<br />

Bearing Using Magnetization Charges and Discretization Technique<br />

Saša Ilić, Ana Vuckovic, Slavoljub Aleksić<br />

Consideration <strong>of</strong> erroneous magnets in the electromagnetic field 305<br />

simulation<br />

Peter Offermann, Isabel Coenen, Kay Hameyer<br />

Potential <strong>of</strong> Spheroids in a Homogeneous Magnetic Field in Cartesian 310<br />

Coordinates<br />

Markus Kraiger, Bernhard Schnizer<br />

Application <strong>of</strong> Signal Processing Tools for Fault Diagnosis in Induction 315<br />

Motors-A Review<br />

Jawad Faiz, Amir Masoud Takbash, Bashir Mahdi Ebrahimi, Subhasis Nandi<br />

Experimental Calibration <strong>of</strong> Numerical Model <strong>of</strong> Thermoelastic Actuator 321<br />

Lukas Voracek, Vaclav Kotlan, Bohus Ulrych


- V - 15th IGTE Symposium 2012<br />

Scattering Calculations <strong>of</strong> Passive UHF-RFID Transponders 327<br />

Thomas Bauernfeind, Gergely Koczka, Kurt Preis, Oszkár Bíró<br />

Simulation <strong>of</strong> a high speed Reluctance Machine including hysteresis 331<br />

and eddy current losses<br />

Bernhard Schweigh<strong>of</strong>er, Hannes Wegleiter, Manes Recheis, Paul Fulmek<br />

An Iterative Domain Decomposition Method for Solving Wave 337<br />

Propagation Problems<br />

Koczka Gergely, Thomas Bauernfeind, Kurt Preis, Oszkár Bíró<br />

On Effectiveness <strong>of</strong> Model Reduction for Computational 340<br />

Electromagnetism<br />

Yuki Sato, Hajime Igarashi<br />

Calculation <strong>of</strong> eddy-current probe signal for a volumetric defect using 346<br />

global series expansion<br />

Sandor Bilicz, József Pávó, Szabolcs Gyimóthy<br />

Bodies motion computation using eddy-current integral equation 352<br />

Mihai Maricaru, Ioan R. Ciric, Horia Gavrila, George-Marian Vasilescu,<br />

Florea I. Hantila<br />

Adaptive Inductance Computation on GPU’s 357<br />

Andrea Gaetano Chiariello, Alessandro Formisano, Raffaele Martone<br />

The reduced-basis method applied to transport equations <strong>of</strong> a 362<br />

lithium-ion battery<br />

Stefan Volkwein, Andrea Wesche<br />

Surrogate Parameter Optimization based on Space Mapping for 368<br />

Lithium-Ion Cell Models<br />

Matthias Scharrer, Bettina Suhr, Daniel Watzenig<br />

Large Scale Energy Storage with Redox Flow Batteries 374<br />

Piergiorgio Alotto, Massimo Guarnieri, Federico Moro, Andrea Stella<br />

Model Order Reduction for a Lithium-Ion Cell 380<br />

Bettina Suhr, Jelena Rubesa<br />

Automatic domain detection for a meshfree post-processing in 386<br />

boundary element methods<br />

André Buchau, Matthias Jüttner, Wolfgang M. Rucker<br />

Efficient modeling <strong>of</strong> coil filament losses in 2D 392<br />

Leena Lehti, Janne Keränen, Saku Suuriniemi, Timo Tarhasaari, Lauri Kettunen<br />

Optimization <strong>of</strong> Energy Storage Usage 398<br />

Arnel Glotic, Peter Kitak, Igor Ticar, Adnan Glotic<br />

Adaptive Surrogate Approach for Bayesian Inference in Inverse 403<br />

Problems<br />

Markus Neumayer, Helcio R.B. Orlande, Marcello J. Colaco, Daniel Watzenig,<br />

Gerald Steiner, Bernhard Brandstätter, George S. Dulikravich


- 1 - 15th IGTE Symposium 2012<br />

Multi-Domain Multi-Scale Problems in High<br />

Frequency Finite Element Methods<br />

Istvan Bardi, Kezhong Zhao, Rickard Petersson, John Silvestro and Nancy Lambert<br />

ANSYS Inc., 225 W Station Square Drive, Pittsburgh, PA 15219, U.S.A.<br />

E-mail: steve.bardi@ansys.com<br />

Abstract—This paper presents Domain Decomposition Methods to overcome the challenges posed by multi-domain, multi-scale<br />

high frequency problems. By decomposing large electromagnetic regions into smaller domains, the Finite Element Method can<br />

cope with the simulation <strong>of</strong> electrically large problems. A hybrid Finite Element and Boundary Integral procedure is also<br />

presented that allows for domains to employ different solution methods in different subdomains. The Robin Transmission<br />

Condition (RTC) is applied to link the domains and preserve field continuity on interfaces. Real life examples demonstrate the<br />

accuracy and efficiency <strong>of</strong> the new method.<br />

Index Terms—FEM, hybrid FEM and boundary element method, multi scale problems, Robin Transmission Condition<br />

I. INTRODUCTION<br />

The finite element method (FEM) is a powerful tool<br />

for simulating high frequency structures. There are<br />

several features <strong>of</strong> the method that have become expected<br />

elements <strong>of</strong> a successful commercial simulator. These<br />

elements include spurious mode free hierarchical, higher<br />

order vector basis functions, curvilinear elements,<br />

automatic/adaptive meshing, transfinite elements, mesh<br />

truncation methods, broad band frequency sweeps,<br />

parameterization and preconditioned iterative solvers.<br />

However, new challenges have emerged in recent years:<br />

the simulation tools need to cope with multi-scale<br />

problems that start on the chip level, couple to the<br />

package and board levels, and encompass the platform<br />

and antenna levels. Each component in a multi-scale<br />

problem can require millions <strong>of</strong> unknowns to simulate.<br />

Chip complexity rises to billions <strong>of</strong> circuit elements;<br />

packages involve large numbers <strong>of</strong> ports; printed circuit<br />

boards (PCBs) <strong>of</strong>ten contain thousands <strong>of</strong> traces on many<br />

layers; and platforms and antennas <strong>of</strong>ten involve<br />

dimensions <strong>of</strong> hundreds <strong>of</strong> wavelengths.<br />

While recent advances in High Performance<br />

Computing (HPC) hardware greatly accelerate numerical<br />

computations, new algorithms are needed to exploit the<br />

new HPC environment. In particular, efficient and<br />

effective physics-based parallelization is required to<br />

address the challenges <strong>of</strong> multi-scale and multi-domain<br />

simulation. This paper presents an overview <strong>of</strong> domain<br />

decomposition methods that exploits the physical nature<br />

<strong>of</strong> multi-scale, multi-domain problems to tackle<br />

heret<strong>of</strong>ore impossibly complex high frequency problems.<br />

II. BASICS OF DOMAIN DECOMPOSITION METHOD<br />

A basic characteristic <strong>of</strong> HPC is the use <strong>of</strong> multiple<br />

processors to perform computations in parallel. An<br />

algebraic approach to using HPC is to partition large<br />

matrices into smaller sub-matrices. In some cases, this is<br />

inefficient even when iterative solvers are employed.<br />

Physics-based domain decomposition is typically more<br />

efficient because the subdomains exploit the geometry<br />

and the field. In this case, both the solution domain and<br />

the mesh are partitioned into smaller subdomains and<br />

meshes. The mesh <strong>of</strong> the sub-domains can be<br />

overlapping, conformal touching, non-conformal<br />

touching or even non-touching [1-3].<br />

Another important advantage for DDM is the ability to<br />

link differing solution methods and physics. For instance,<br />

the finite element method is better at simulating complex<br />

geometries, while the boundary element method copes<br />

better with electrically large but simple, smooth<br />

structures. This paper presents hybrid finite element–<br />

boundary integral (FE-BI) methods that allows nonconformal<br />

touching domains and disjoint regions.<br />

Figure 1: Incident surface electric and magnetic current<br />

densities impinging on an FEM domain<br />

A. Single FE domain with surface electric and<br />

magnetic current excitations<br />

Consider a computational domain where an incident<br />

field impinges on a section <strong>of</strong> a boundary as illustrated in<br />

Figure 1. The wave equation to be solved is<br />

2<br />

imp<br />

1 r<br />

E1<br />

ko1rE1 jkoJ<br />

1 (1)<br />

imp<br />

where J1 is the impressed current density. The total<br />

field description is used inside the domain, while the<br />

scattered field description is used outside<br />

sc inc<br />

sc inc<br />

E1 E1<br />

E1<br />

; H1 H1<br />

H1<br />

(2)<br />

For the sake <strong>of</strong> simplicity, a Dirichlet boundary condition<br />

is used on 1 \ 12<br />

<br />

n 1 E 0<br />

(3)<br />

, both the electric and the magnetic field jump<br />

On 12


inc<br />

inc<br />

with n1 E1<br />

and n2 H2<br />

, respectively [4].<br />

Consequently, an absorbing boundary condition is<br />

sc<br />

sc<br />

required for E and H . Assume we know an operator<br />

that provides perfect absorption:<br />

sc<br />

sc<br />

n1 H1<br />

ABC ( n1<br />

n1<br />

E1<br />

)<br />

(4)<br />

Since, the total field description is used in the<br />

computational domain, the scattered field variables can<br />

be eliminated using (2). Then, the Neumann boundary<br />

condition for the total magnetic field is<br />

n1<br />

1r<br />

E1<br />

<br />

(5)<br />

inc inc<br />

jo ( ABC ( e1)<br />

ABC ( e1<br />

) J1<br />

/ )<br />

where the J electric and the magnetic current densities e<br />

is introduced as<br />

J n<br />

H and e n<br />

n<br />

E<br />

(6)<br />

where is the wave impedance. Introducing the first<br />

order ABC generates the simple form<br />

inc inc<br />

n1 1r<br />

E1<br />

jko( e1<br />

e1<br />

J1<br />

)<br />

(7)<br />

Equations (5) and (7) are Robin transmission boundary<br />

conditions. They generalize the Neumann boundary<br />

condition to include the incident fields. Thus, to excite<br />

the computational domain by an external incident field,<br />

the electric and magnetic surface current densities inc<br />

J<br />

inc<br />

and e need to be specified. The transmission<br />

conditions are first order when the first order ABC is<br />

used and higher order when higher order ABC’s are used.<br />

B. Multiple FE domains with surface electric and<br />

magnetic current coupling<br />

Now consider a computational domain that is<br />

subdivided into two subdomains (Fig. 2).<br />

Figure 2: Decomposition into two non-overlapping<br />

subdomains<br />

The boundary value problem (BVP) for the first domain<br />

is<br />

2<br />

imp<br />

1 r<br />

E1<br />

ko1rE1 jkoJ<br />

1 in 1 (8)<br />

inc inc<br />

n1 1r<br />

E1<br />

jkoJ1 jko(<br />

e1<br />

e1<br />

J1<br />

) on 12 <br />

(9)<br />

and similarly for the second domain<br />

2<br />

imp<br />

<br />

2r<br />

E2 ko 2rE2<br />

jkoJ<br />

2 in 2 (10)<br />

inc inc<br />

n2 2r<br />

E2<br />

jkoJ 2 jko(<br />

e2<br />

e2<br />

J2<br />

) on 12 <br />

(11)<br />

The incident field for the first domain is the field <strong>of</strong> the<br />

second domain and vice versa<br />

inc<br />

e1 e2<br />

; e2 e1<br />

inc (12)<br />

inc<br />

J J<br />

J J<br />

inc<br />

<br />

(13)<br />

1<br />

2 ; 2 1<br />

- 2 - 15th IGTE Symposium 2012<br />

Applying this to Equations (9) and (11), we get<br />

n1 1r<br />

E1<br />

jkoJ1<br />

jko(<br />

e1<br />

e2<br />

J2<br />

) (14)<br />

n2 2r<br />

E2<br />

jkoJ<br />

2 jko(<br />

e2<br />

e1<br />

J1)<br />

(15)<br />

The right hand sides <strong>of</strong> Eqs. (14) and (15) are the<br />

Neumann Boundary conditions for domain 1 and 2 ,<br />

respectively. They will be included into the finite element<br />

formulation as natural boundary conditions. Since J1 and<br />

J 2 were introduced, Eqs. (14), (15) must be prescribed<br />

explicitly as well<br />

J1 e1<br />

e2<br />

J2<br />

(16)<br />

J2 e2<br />

e1<br />

J1<br />

(17)<br />

It can be proved ([1]) that solution <strong>of</strong> the differential<br />

equations (8) and (10) are unique applying natural and<br />

essential interface conditions (14), (15) and (16), 17),<br />

respectively. Applying Galerkin’s method, the bilinear<br />

form and the essential boundary condition for domain 1 <br />

is the following:<br />

b(<br />

, E ) jk v , e jk v , e jk v , J<br />

v1 1 o 1 1 o 1 2<br />

12<br />

o 1 2<br />

12<br />

12<br />

imp v1J11 jk <br />

(18)<br />

o<br />

jk o(<br />

w1, e1<br />

<br />

12<br />

w1,<br />

J1<br />

<br />

12<br />

w1,<br />

e2<br />

<br />

12<br />

w1,<br />

J2<br />

) 0<br />

12<br />

(19)<br />

The same applies to 2 . Note, that the testing functions<br />

w should be orthogonal to those <strong>of</strong> v. Discretizing the<br />

scalar products, yields the matrix equation [1]<br />

K1<br />

<br />

<br />

G 21<br />

where<br />

G12<br />

u1<br />

y1<br />

<br />

<br />

<br />

K<br />

<br />

2 u<br />

2<br />

y1<br />

<br />

(20)<br />

Ek<br />

bk<br />

<br />

u <br />

<br />

k <br />

ek<br />

<br />

; y<br />

<br />

k <br />

<br />

0<br />

<br />

; k=1,2<br />

<br />

J k <br />

0 <br />

(21)<br />

A<br />

k<br />

T<br />

K k <br />

<br />

Ck<br />

<br />

0<br />

Ck<br />

vv<br />

Bk<br />

jkoTkk<br />

wv<br />

jkoTkk<br />

0 <br />

0<br />

<br />

<br />

ww<br />

jk oTkk<br />

<br />

(22)<br />

0<br />

G<br />

<br />

12 G 21 <br />

<br />

0<br />

<br />

0<br />

0<br />

vv<br />

jkoT12<br />

wv<br />

jkoT12<br />

0 <br />

vw<br />

jk<br />

<br />

oT12<br />

<br />

ww<br />

jk oT12<br />

<br />

(23)<br />

, v n v n<br />

vv<br />

;<br />

ww<br />

n<br />

w , n<br />

w<br />

j<br />

12<br />

Tij i Tij i j<br />

12<br />

(24)<br />

vw<br />

Tij n<br />

vi<br />

, n<br />

w j<br />

<br />

(25)<br />

12<br />

Matrices A, B, C and b are the same as in the case <strong>of</strong><br />

standard FE discretization and can be found in [1] along<br />

with the definitions <strong>of</strong> the scalar products. The structure<br />

<strong>of</strong> Eq. (18) shows, that the variables <strong>of</strong> the FE domains<br />

are coupled just via the surface electric and magnetic<br />

current variables, which are called cement variables.<br />

C. Hybrid FE - BI domains with surface electric and<br />

magnetic current coupling<br />

Fig. 3 shows two separated domains 1 and 2 . The<br />

fields in these domains are coupled via the free space<br />

domain ext . The FEM is used in 1 and 2 , while<br />

Boundary Integral Method (BI) is used in ext .


Figure 3: Decomposition into two FEM and one BI<br />

subdomains<br />

The boundary value problem for the finite element<br />

domains is similar to that in section B<br />

2<br />

imp<br />

<br />

ir<br />

Ei ko irEi<br />

jkoJ<br />

i in i (26)<br />

<br />

ni 1i<br />

i o i o i i i<br />

<br />

<br />

<br />

<br />

E jk J jk ( e e J ) on i (27)<br />

<br />

Ji ei<br />

ei<br />

Ji<br />

on i (28)<br />

Eqs. (27) and (28) are the Neumann and the Robin<br />

transmission boundary conditions, respectively.<br />

For the unbounded subdomain ext , the boundary<br />

integral equation representation is used, based on<br />

Stratton-Chu [2]. The boundary integral equation for the<br />

electric and the magnetic current densities are<br />

1<br />

2<br />

inc <br />

<br />

ei ei<br />

{ nk<br />

( C(<br />

nk<br />

ek<br />

)) jk onk<br />

( A(<br />

Jk<br />

)) <br />

2<br />

k1<br />

1 <br />

<br />

( jk o)<br />

<br />

(<br />

Jk<br />

)} on i (29)<br />

jk<br />

2<br />

o inc<br />

<br />

J i Ji<br />

{ jkonk<br />

nk<br />

( C(<br />

Jk<br />

)) <br />

2<br />

k1<br />

2 <br />

<br />

<br />

jkonk nk (<br />

A(<br />

nk<br />

ek<br />

)) <br />

(<br />

nk<br />

ek<br />

)} on i (30)<br />

and the Robin transmission boundary condition is:<br />

<br />

<br />

Ji ei<br />

ei<br />

Ji<br />

on i (31)<br />

where<br />

'<br />

'<br />

'<br />

' '<br />

A ( x)<br />

xgds ; (<br />

x ) ( x)<br />

gds ; C (x)<br />

x <br />

gds<br />

<br />

<br />

<br />

(32)<br />

Applying Galerkin’s method again, the matrix equation to<br />

be solved is<br />

K<br />

1 N12x<br />

1<br />

y1<br />

<br />

(33)<br />

T<br />

N12<br />

K 2 x<br />

2<br />

y1<br />

where<br />

AII<br />

A 0 0<br />

0<br />

I<br />

<br />

<br />

<br />

<br />

A A T D T D <br />

I <br />

<br />

<br />

T<br />

T<br />

K 0 D T D T <br />

i<br />

<br />

<br />

<br />

<br />

<br />

T<br />

T<br />

<br />

0 T D Q T P<br />

<br />

ii <br />

ii <br />

<br />

T<br />

T<br />

T<br />

<br />

0 D T P Q T <br />

<br />

<br />

ii<br />

ii <br />

I<br />

0<br />

0 0 0 0 Ei<br />

y i<br />

<br />

<br />

0 0 0 0 0 e<br />

<br />

i 0 <br />

;<br />

<br />

N12<br />

0<br />

0 0 0 0 x i J<br />

;<br />

i y (34)<br />

i 0<br />

<br />

inc<br />

E <br />

0<br />

0 0 Q12<br />

P12<br />

ei<br />

y<br />

i <br />

T<br />

<br />

inc<br />

<br />

0<br />

0 0 P <br />

12 Q12<br />

<br />

J<br />

i <br />

H <br />

y<br />

i <br />

Further details are provided in [1].<br />

This general hybrid finite element-boundary integral<br />

equation method (hybrid FE-BI) is very flexible. The<br />

subdomains can be FEM, BEM or any other numerical<br />

method. If just one FEM subdomain exists, it provides a<br />

perfect absorbing boundary condition (FE-BI).<br />

- 3 - 15th IGTE Symposium 2012<br />

III. SOLVING THE MATRIX EQUATIONS<br />

In this section, the solution <strong>of</strong> the matrix equations is<br />

presented via a stationary alternating Schwartz algorithm<br />

based on Jacobi Splitting. The idea is to eliminate the<br />

internal variables and solve for the surface current<br />

densities also called cement variables. Performing this<br />

process iteratively is called domain iteration. Partitioning<br />

the variables accordingly<br />

e<br />

k Ek<br />

<br />

c k ; u k <br />

J<br />

; (35)<br />

k c<br />

k <br />

Eq. (20) for the k-th domain is<br />

Ek<br />

bk<br />

0 <br />

Kk <br />

c j<br />

c<br />

<br />

g<br />

<br />

k 0<br />

<br />

kj<br />

(36)<br />

g kj can be read from Eq.(23) and k and j are the domain<br />

indices. Note, that the internal variables are expressed by<br />

the cement variables. Supposing, the inverse matrix <strong>of</strong> the<br />

k-th domain is known and also partitioned to internal and<br />

cement variable blocks, we get:<br />

E E<br />

E c<br />

k Ek<br />

k k<br />

P b P<br />

g <br />

k k k k kj<br />

c<br />

j<br />

c<br />

<br />

(37)<br />

c<br />

c c<br />

k E<br />

k k<br />

k <br />

P b <br />

<br />

Pk<br />

g<br />

k k<br />

kj <br />

where<br />

Ek<br />

Ek<br />

Ek<br />

ck<br />

P<br />

<br />

k Pk<br />

K k c <br />

k Ek<br />

ck<br />

ck<br />

Pk<br />

Pk<br />

<br />

1<br />

(38)<br />

These equations allow domain iterations to be applied<br />

k E P b P g c<br />

n1<br />

k<br />

E Ek<br />

k<br />

c E<br />

k<br />

k<br />

Ek<br />

ck<br />

k<br />

n<br />

kj j<br />

(39)<br />

n1<br />

k k ck<br />

ck<br />

n<br />

ck P bk<br />

Pk<br />

gkjc<br />

j<br />

(40)<br />

where the superscript n provides the iteration number.<br />

cc<br />

The matrix P k is called the numerical Green’s function<br />

and quantities c k and c j in Eq. (40) are called the<br />

cement variables. The domain iteration works with the<br />

cement variables only, but it needs blocks <strong>of</strong> the inverse<br />

<strong>of</strong> the system matrix <strong>of</strong> the internal variables. Eq. (39)<br />

provides the update for the internal variables, which are<br />

not included in the domain iteration because they do not<br />

needed to be updated unless the right-hand-side changes.<br />

Other, popular methods also can be applied, such as<br />

GMRES, a Krylov Subspace Method. The domain<br />

iteration needs to invert the subdomain matrices in each<br />

iteration. For this purpose, either a multifrontal direction<br />

solver can be used or a p-type multiplicative Schwarz<br />

preconditioner (pMUS) iterative solver. The iteration<br />

matrix<br />

cc<br />

Akj Pk<br />

(41)<br />

is dense but it can be replaced by a sparse matrix using<br />

Adaptive Cross Approximation (ACA)<br />

~ mn<br />

kj<br />

mn<br />

mr<br />

rn<br />

Akj<br />

A Ukj<br />

Vkj<br />

(42)<br />

where m and n are the row and column numbers and r is<br />

the rank <strong>of</strong> the matrix.<br />

The domain iteration method presented above was<br />

derived for the case when the solution domain is<br />

partitioned into two sub-domains with one coupling<br />

surface interface. If the solution domain is partitioned<br />

into multiple domains with multiple coupling surfaces,<br />

the number <strong>of</strong> the cement variables increases but the


essence remains the same: the subdomain variables are<br />

expressed in terms <strong>of</strong> cement variables and the domain<br />

iteration is set up for the cement variables. The same<br />

applies when the subdomains are coupled via BI domains.<br />

The convergence <strong>of</strong> the domain iteration depends on<br />

the order <strong>of</strong> the Robin transmission boundary conditions.<br />

For simplicity, first order Robin boundary conditions<br />

were used in the above derivations. Higher order<br />

conditions are also available in [5]. Higher order<br />

transmission condition enforce the requirement that the<br />

eigenvalues <strong>of</strong> the system matrix be inside the unit circle.<br />

This is a necessary condition for a good domain iteration<br />

convergence. A second order transmission boundary<br />

condition can be realized as in Eqs. (16) and (17)<br />

J j Aje j Bj<br />

<br />

e<br />

j Jk<br />

Ake<br />

k Bk<br />

<br />

ek<br />

(43)<br />

J A e B <br />

e<br />

J<br />

A e B <br />

e<br />

k k k k k j j j j <br />

(44)<br />

where k and j are the indices <strong>of</strong> the neighboring domains<br />

and denotes the surface operator. Constants k A , k B , Aj<br />

and B j can be optimized for convergence. Figure 4<br />

shows the improvement in convergence provided by the<br />

second order Robin Transmission Condition (RTC).<br />

Figure 4: Convergence with first and second order RTC<br />

III. REPETITIVE STRUCTURES<br />

If identical substructures exist in the computational<br />

domain, the computational effort <strong>of</strong> storing and solving<br />

the equations is dramatically reduced. Repeated<br />

identical substructures are called unit cells and have<br />

the same mesh. Only one unit cell is stored physically<br />

in the computer; the other unit cells are virtual. The<br />

physically-stored unit cell is called the parent, while<br />

the virtual ones are called children. A structure can<br />

have multiple parents. In the case <strong>of</strong> non-conformal<br />

domain decomposition, no constraints are applied to<br />

the mesh. In the case <strong>of</strong> conformal DDM, the parent<br />

mesh must be constrained so that it matches with the<br />

surface meshes <strong>of</strong> the children.<br />

For the sake <strong>of</strong> simplicity, assume that the entire<br />

computational domain consists <strong>of</strong> just one repeated<br />

structure. This is usually the case when finite antenna<br />

arrays are simulated. Figure 5 shows a single parent<br />

case with an internal block and matrices <strong>of</strong> repetitive<br />

unit cells. Here there is one system matrix and two<br />

coupling matrices. Thus, only three <strong>of</strong> the sixteen<br />

- 4 - 15th IGTE Symposium 2012<br />

j<br />

matrix blocks need to be stored and matrix block A<br />

must be factored once instead <strong>of</strong> 4 times.<br />

Figure 5: Internal blocks and corresponding matrices<br />

<strong>of</strong> repetitive unit cells<br />

IV. MULTI DOMAIN DDM WITH FE-BI<br />

As it has been shown, DDM is based on a divide-andconquer<br />

philosophy. Instead <strong>of</strong> tackling a large and<br />

complex problem directly as a whole, the original<br />

problem is partitioned into smaller, possibly repetitive,<br />

and easier to solve sub-domains. In this paper, DDM is<br />

used as an effective FEM preconditioner, where a higher<br />

order Robin’s transmission condition (RTC) is devised to<br />

enforce the continuity <strong>of</strong> electromagnetic fields between<br />

adjacent sub-domains and accelerates the convergence <strong>of</strong><br />

the iterative process. DDM is also employed to provide a<br />

hybrid FEM-BEM approach where the treatment <strong>of</strong> the<br />

radiation condition is exact. The hybrid finite elementboundary<br />

integral (FE-BI) method allows FEM-domains<br />

to be disconnected with the coupling between disjoint<br />

domains provided via Green’s functions. The advantages<br />

<strong>of</strong> DDM-based FE-BI compared to traditional FE-BI<br />

include modularity <strong>of</strong> FEM and BI domains in terms <strong>of</strong><br />

mesh and basis functions. This “non-conformal” ability<br />

significantly simplifies the integration <strong>of</strong> existing state<strong>of</strong>-art<br />

FEM and BEM solvers. The continuity<br />

enforcement through Robin’s RTC naturally renders<br />

present FE-BI free <strong>of</strong> internal resonance issue. Since<br />

domains are allowed to be disjoint, if one or more subdomains<br />

are purely metallic or highly conducting, DDM<br />

can allow the integral equation method to be applied to<br />

these sub-domains directly to reduce memory<br />

consumption.<br />

V. APPLICATIONS<br />

To illustrate the effectiveness and accuracy <strong>of</strong> DDM,<br />

an array <strong>of</strong> tapered slot antennas is considered. The<br />

antenna element is <strong>of</strong> the Vivaldi type. The antenna is<br />

similar to the one described in [8]. The rectangular array<br />

spacing is 34 mm along y and 36mm along x. The εr = 6<br />

substrate is 0.02 λ0 thick and the height and opening size<br />

<strong>of</strong> the slot is ≈0.5 and 0.45λ0 respectively. To show the<br />

accuracy <strong>of</strong> the simulation an array <strong>of</strong> 81 elements (9x9)<br />

was analyzed using DDM. For comparison, a full array<br />

model <strong>of</strong> the 81 elements with a slightly different edge<br />

treatment was also created and simulated using FEM in a<br />

single domain. The model simulated without using DDM<br />

will be referred to as the explicit model. The two patterns<br />

for the φ=0° cut (perpendicular to the slot faces) for all<br />

elements excited with equal amplitude and 0° phase shift<br />

are shown in Figure 6. Excellent agreement is obtained.<br />

In addition to being able to compute the field patterns, the<br />

full scattering matrix can also be extracted from the DDM<br />

simulation. To verify the accuracy <strong>of</strong> this computation,<br />

consider the data shown in Figure 7. This plot compares


the refection coefficient <strong>of</strong> the center element (element<br />

#41) in the array and also the coupling terms (S41,-- dB)<br />

for the coupling between the center element and the next<br />

4 elements along the same row <strong>of</strong> slots. Again agreement<br />

between the two sets <strong>of</strong> data is excellent.<br />

Another infinite array simulation was performed using<br />

linked boundary conditions and the active element pattern<br />

was computed [9]. The active element pattern is the<br />

radiation pattern for an infinite array <strong>of</strong> elements where<br />

only a single element is excited. Finite arrays <strong>of</strong> 9x9 and<br />

21x21 elements were simulated using DDM. The<br />

radiation pattern with only the center element excited was<br />

computed for each <strong>of</strong> these arrays. For comparison a<br />

single antenna element on a finite ground plane was also<br />

analyzed. The normalized φ=90° patterns for these 4<br />

antennas is shown in Figure 8. The agreement with the<br />

infinite array active element pattern improves as the array<br />

size increases from 1 to 9x9 to 21x21 elements. This<br />

demonstrates the accuracy <strong>of</strong> the DDM simulation<br />

procedure for large arrays. As a final test, a 15x15 array<br />

<strong>of</strong> Vivaldi elements was simulated using DDM. In this<br />

case, the radiation pattern for 0° scan angle was<br />

calculated. The 3D polar <strong>of</strong> this pattern is shown in<br />

Figure 9.<br />

All simulations were run on a Linux cluster. Each<br />

machine in the cluster had 12 CPUs and 96 GB memory.<br />

The explicit 81 element model was run on a single<br />

machine. For the DDM simulation, the domain<br />

simulations were distributed over several machines and<br />

CPUs. For the 9x9 array, 62 domains were used and<br />

21GB Ram was required; for the 15x15 array, 68<br />

domains were used and the total memory required was<br />

≈28GB. The latter simulation shows the power <strong>of</strong> this<br />

approach – even though the number <strong>of</strong> tetrahedra<br />

increased significantly, the memory usage was still less<br />

than 30GB.<br />

Figure 6: Comparison <strong>of</strong> the φ=0° patterns for all<br />

elements excited with equal phase and magnitude for 9x9<br />

array. The DDM data is the solid black line and the<br />

explicit model data is the dashed red line.<br />

Table I shows a comparison <strong>of</strong> solver statistics <strong>of</strong><br />

different element sizes and methods.<br />

TABLE I<br />

COMPARISON OF MEMORY AND SOLUTION TIME OF<br />

DIFFEREN METHODS/ARRAYS<br />

Time Number<br />

<strong>of</strong> tets<br />

Memory<br />

Explicit (9 x 9) 190 min 1.7 m 50 GB<br />

DDM (9 x 9) 90 min 1.6 m 21 GB<br />

DDM (15 x 15) 300 min 4.3 m 28 GB<br />

- 5 - 15th IGTE Symposium 2012<br />

Figure 7. S41,-where element 41 is the center element <strong>of</strong><br />

the array and elements 42-45 are the remaining elements<br />

along the middle row computed using two different<br />

approaches.<br />

Figure 8: Phi =90 °element patterns calculated using the<br />

infinite array approximation (Element_pattern) and from<br />

a 9x9 and 21x21 element array compared to the pattern<br />

for a single isolated element (iso).<br />

Figure 9: 3D polar plot <strong>of</strong> the radiation pattern for the<br />

15x15 array where all elements are excited with equal<br />

amplitude and phase<br />

The next example demonstrates the efficiency <strong>of</strong> FE-<br />

BI. Figure 10 shows an Apache helicopter with a<br />

conformal FE-BI surface and it has been simulated at 900<br />

MHz. Table II shows a comparison with pure FEM and<br />

IE methods. The results show the superiority <strong>of</strong> FE-BI,<br />

due to its conformal mesh truncation capability.


Figure 10: Apache helicopter with conformal FE-BI<br />

boundary<br />

TABLE II<br />

COMPARISON OF MEMORY AND SOLUTION TIME OF<br />

DIFFEREN METHODS<br />

Number<br />

<strong>of</strong> cores<br />

Memory Time<br />

FEM (PML box) 12 300 GB 330 min<br />

IE 12 83 GB 328 min<br />

FE-BI<br />

(conformal)<br />

12 21 GB 63 min<br />

VI. CONCLUSION<br />

The proliferation <strong>of</strong> High Performance Computing<br />

(HPC) has made parallelization a basic requirement for<br />

simulation codes today. Computational tasks can be<br />

distributed on different machines (nodes) or cores<br />

(distributed or shared memory). DDM is an ideal<br />

procedure for achieving high HPC efficiency. The<br />

subdomain solutions are fully independent <strong>of</strong> each other,<br />

so they can be evaluated in parallel, either by using<br />

distributed or shared memory. Subdomain solvers also<br />

exploit multi-processing and iterative solution methods.<br />

Both the Schwartz or Krylov domain iteration methods<br />

distribute tasks with high parallelism. The standard<br />

Message Passing Interface (MPI) can be used to control<br />

the data exchange between the nodes and cores. As<br />

demonstrated in the examples, the hybridized FE and BI<br />

DDM procedure provides a flexible and efficient tool to<br />

solve multi scale multi domain problems.<br />

- 6 - 15th IGTE Symposium 2012<br />

[1]<br />

REFERENCES<br />

K. Zhao, V. Rawat, S. Lee and J.F Lee, "A Domain Decomposition<br />

Method with Nonconformal Meshes for Finite Periodic and<br />

Semi-Periodic Structures," IEEE Transactions on Antennas and<br />

Propagation, vol. 55, pp. 2559 - 2570, September, 2007.<br />

[2] K. Zhao, V. Rawat and J.F Lee, "A Domain Decomposition<br />

Method for Electromagnetic Radiation and Scattering Analysis <strong>of</strong><br />

Multi-Target Problems," IEEE Transactions on Antennas and<br />

Propagation, vol. 56, pp. 2211 - 2221, August 2008.<br />

[3] I. Bardi, Zs. Badics and Z. Cendes, "Total and Scattered Field<br />

Formulations in the Transfinite Element Method," IEEE<br />

Transactions on Magnetics, vol. 44, pp. 778-781, June, 2008.<br />

[4] R. F. Harrington, Time–Harmonic Electromagnetic Fields, John<br />

Wiley & Sons, Inc. New York, 2000.<br />

[5] Y. Shao, Z. Peng and J.F Lee, "Full-Wave Real-Life 3-D Package<br />

Signal Integrity Analysis Using Nonconformal Domain<br />

[6]<br />

Decomposition Method," IEEE Transactions on Nicrowave<br />

Theory and Techniques, vol. 59, pp. 230 - 241, February 2011.<br />

W. C. Chew and C.C. Lu, "The use <strong>of</strong> Huygens’ equivalence<br />

principle for solving the volume integral equation for scattering,"<br />

IEEE Transactions on Antennas and Propagation, vol. 41, pp. 897<br />

- 904, July 1993.<br />

[7] Y.J Li and J.M. Jin, "A New Dual–Primal Domain Decomposition<br />

Approach for Finite Element Simulation <strong>of</strong> 3-D Large – Scale<br />

Electromagnetic Problems," IEEE Transactions on Antennas and<br />

Propagation, vol. 55, pp. 2803 – 2810, October 2007.<br />

[8] L.E. R. Petersson and J-M Jin, “Analysis <strong>of</strong> periodic structures via<br />

a time-domain finite-element formualiton with a Floquet abc,”<br />

IEEE Trans. AP, pp. 933-944, Mar. 2009.<br />

[9] J. Manges, J. Silvestro and R. Petersson, “Accurate and Efficient<br />

Extraction <strong>of</strong> Antenna Array Performance from Numerical Unit-<br />

Cell Data,” 2011 European Microwave Conference


- 7 - 15th IGTE Symposium 2012<br />

Parallelization <strong>of</strong> the Transmission Line Matrix<br />

method. Modelling Schumann Resonances and<br />

Atmospherics<br />

S. Toledo-Redondo∗ ,A.Salinas∗ , J. Fornieles∗ ,J.Portí † ,B.Besser ‡ , and H.I.M. Lichtenegger ‡<br />

∗Department <strong>of</strong> Electromagnetism and Matter Physics, <strong>University</strong> <strong>of</strong> Granada, Spain.<br />

† Department <strong>of</strong> Applied Physics, <strong>University</strong> <strong>of</strong> Granada, Spain.<br />

‡ Space Research Institute, Austrian Academy <strong>of</strong> Sciences, <strong>Graz</strong>, Austria<br />

E-mail: sergiotr@ugr.es<br />

Abstract—In this paper, a parallelization <strong>of</strong> the Transmission-Line Modelling (TLM) method is presented. It is intended to<br />

work efficiently regardless <strong>of</strong> the spatial topology <strong>of</strong> the problem, by transforming the initial topology into a one-dimensional<br />

structure. It is designed for shared memory environments, and its implementation is carried out using OpenMP directives.<br />

The algorithm is applied to find the first cut-<strong>of</strong>f frequency <strong>of</strong> the Earth-ionosphere waveguide by solving two models <strong>of</strong><br />

the real system. The performance <strong>of</strong> the algorithm for the mentioned problem is studied in terms <strong>of</strong> speedup over two<br />

different platforms. Relative speedups <strong>of</strong> up to 16 are achieved with the use <strong>of</strong> 32 CPUs. Finally, the whole Earth-ionosphere<br />

cavity is simulated, with an accuracy <strong>of</strong> 5 km grid size, leading to an error <strong>of</strong> less than 1.5% in the Schumann Resonance<br />

frequencies. The spatial resolution achieved also enables for the first time the possibility <strong>of</strong> using this model to study the<br />

global effects generated by local phenomena in the Earth-ionosphere cavity.<br />

Index Terms—Earth-ionosphere waveguide, Schumann resonances, shared memory, speedup, TLM.<br />

I. INTRODUCTION<br />

Numerical methods are a tool for embracing scientific<br />

and technological problems which are difficult or even<br />

impossible to solve analytically. In addition, simulation is<br />

<strong>of</strong>ten an intermediary step between design and construction<br />

<strong>of</strong> prototypes in industry. High performance computers<br />

are one <strong>of</strong> the keys <strong>of</strong> the present importance <strong>of</strong> these<br />

methods, because they allow simulating more and more<br />

complex situations as the technology evolves. However,<br />

the top speed <strong>of</strong> processors seems to have reached its top<br />

[1], and the tendency <strong>of</strong> CPU manufacturers is to ship<br />

multi-core processors instead <strong>of</strong> building faster single<br />

CPUs [2].<br />

The Transmission-Line Modelling method [3] is employed<br />

for the simulation <strong>of</strong> electromagnetic problems<br />

since 1971 [4], although it can simulate other problems<br />

as well, such as heat or particle diffusion, acoustic propagation,<br />

deformation in electric solids, waves in fluids,<br />

etc. [5] [6]. It has been used previously, in 2D form,<br />

for the study <strong>of</strong> atmospheric phenomena, e.g., Schumann<br />

Resonances [7], which is a problem similar to the one<br />

that will be addressed in this paper.<br />

The propagation <strong>of</strong> atmospherics in the Earthionosphere<br />

waveguide is a complex problem which involves<br />

several natural media (ground, oceans, atmosphere,<br />

ionospheric plasma) as well as the phenomena <strong>of</strong><br />

lightning [8], [9]. A parallel-TLM algorithm is employed<br />

to model the propagation <strong>of</strong> these natural signals and<br />

allows finding the first cut-<strong>of</strong>f frequency under two<br />

different approximated models <strong>of</strong> the natural waveguide<br />

formed by the ground and the ionosphere.<br />

Programming efficient algorithms with these relatively<br />

new hardware solutions is not straightforward. Different<br />

approaches must be taken into account according to<br />

the kind <strong>of</strong> hardware used. For instance, the way <strong>of</strong><br />

designing a parallel code for a Graphical Processing Unit<br />

(GPU) [10] will be different than for a multi-core system<br />

with shared memory access [11]. Programming shared<br />

memory environments is probably the most similar to<br />

traditional computing, i.e., not parallel, but still there is<br />

a great difference in the way we should conceive the<br />

algorithms [12].<br />

In Section 2, the TLM method is briefly introduced, as<br />

well as the Symmetric Condensed Node (SCN). Section<br />

3 describes the approach employed to parallelize the<br />

method. In Section 4, the Earth-ionosphere waveguide is<br />

introduced, and it is solved by means <strong>of</strong> the proposed<br />

algorithm. The model is benchmarked and its performance<br />

in terms <strong>of</strong> speedup is presented. In Section 5,<br />

the model is employed to solve the whole lossless Earthionosphere<br />

cavity, obtaining its Schumann Resonances.<br />

A brief summary as well as the main conclusions <strong>of</strong> the<br />

paper are detailed in Section 6.<br />

II. THE TLM METHOD<br />

TLM is a numerical method intended for simulation <strong>of</strong><br />

propagation problems which are governed by differential<br />

equations. Problems which have to deal with electromagnetics,<br />

heat diffusion, gravity waves, acoustic waves,<br />

etc. are suitable to be modelled with this technique. The


Fig. 1. Scheme <strong>of</strong> the Symmetric Condensed Node with 12 link lines.<br />

idea behind the method is to build a circuit based on<br />

transmission lines which behaves in analogous form as<br />

the problem we want to implement, i.e., the governing<br />

equations are the same for the circuit and for the physical<br />

problem. In this work, TLM will be used to solve<br />

Maxwell equations and the study <strong>of</strong> the Earth-ionosphere<br />

waveguide.<br />

The TLM method discretizes both time and space and<br />

sets up an iterative process in which the six components<br />

<strong>of</strong> the electromagnetic field evolve in time from a known<br />

initial situation. Therefore, the fields radiated and/or<br />

propagated in the space are simulated with arbitrary<br />

accuracy, which is constrained by the size <strong>of</strong> the space<br />

discretization. A thumbnail rule is that the minimum<br />

wavelength (λ) <strong>of</strong> interest must be ten times larger than<br />

the size <strong>of</strong> the cell (Δl), i.e., Δl ≤ λ/10 [13].<br />

Depending on the problem we want to model, each<br />

independent cell will be simulated by a different set up<br />

<strong>of</strong> transmission lines and node. For 3D electromagnetic<br />

problems, the most used circuit since it was formulated is<br />

the Symmetrical Condensed Node (SCN) [14], together<br />

with its variations. In this paper, the SCN with stubs for<br />

conductivity [15] will be used. In Fig. 1, a scheme <strong>of</strong> the<br />

transmission lines arrangement is shown. With its 12 link<br />

lines, the node is capable <strong>of</strong> modelling the behavior <strong>of</strong><br />

Maxwell’s equations for a differential <strong>of</strong> volume, ΔV .<br />

Regardless <strong>of</strong> the node employed for modelling each<br />

ΔV , the process iteration <strong>of</strong> TLM is always the same.<br />

For each node and time step n, t = nΔt (where Δt ≤<br />

Δl/2c for a cubic SCN, being c the speed <strong>of</strong> light in the<br />

medium), there is a set <strong>of</strong> incident pulses or voltages Vi,<br />

one for each transmission line <strong>of</strong> the node. During the<br />

time step they travel along the line, and they are either<br />

reflected and or transmitted to other lines, depending on<br />

the node structure. The transmitted/reflected pulses, or<br />

simply the scattered pulses Vr, are related to Vi by the<br />

scattering matrix S:<br />

Vr = S · Vi . (1)<br />

At the next time step, the scattered voltages from<br />

each node are converted into incident voltages <strong>of</strong> the<br />

- 8 - 15th IGTE Symposium 2012<br />

nearby node, thus propagating the pulses along the entire<br />

network. It is important to fix time synchronism in a<br />

manner that all pulses in the mesh are simultaneously<br />

incident at the center <strong>of</strong> their respective node at each<br />

time nΔt.<br />

III. PARALLELIZATION OF TLM<br />

TLM method is a time and memory consuming application,<br />

when applied to large problems. In its most<br />

basic form, at least 12 floating point numbers (floats)<br />

are needed in order to store the 12 voltages <strong>of</strong> each<br />

node. Six more floats become necessary if either nodes<br />

<strong>of</strong> variable size, permittivity (εr), or permeability (μr) <strong>of</strong><br />

the medium are required. Finally, three more floats must<br />

be used for adding electric conductivity. The problems<br />

solved in this work make use <strong>of</strong> 15 different transmission<br />

lines per node, 12 required by the basic SCN configuration,<br />

plus 3 for modelling the electric conductivity.<br />

The operation which consumes most <strong>of</strong> the computational<br />

time is reflected in Eq. 1, since this matrix<br />

multiplication must be performed at each node for each<br />

time step. For the case involved in this paper, the matrix<br />

multiplication has been reduced to 18 floating point<br />

multiplications and 36 additions, at the cost <strong>of</strong> executing<br />

different portion <strong>of</strong> code for the nodes which are at the<br />

border <strong>of</strong> the spatial distribution. The number <strong>of</strong> nodes<br />

for the problems involved in this work are on the order <strong>of</strong><br />

10 6 , and all these computations can be done concurrently<br />

for different nodes.<br />

Parallelizing the independent matrix multiplications<br />

<strong>of</strong> Eq. 1 requires doubling the minimum memory size<br />

required to hold the problem. Since the input voltage<br />

for one node is the output voltage for another node,<br />

Vi and Vr must be stored in different variables. On a<br />

sequential implementation <strong>of</strong> the model, i.e., not parallel,<br />

the order <strong>of</strong> execution is known, and the calculated Vr can<br />

overwrite Vi, if its value is previously stored on a local<br />

variable which can be erased when all related Vr have<br />

been calculated. Since the order in which the matrix multiplications<br />

will be performed is not known for a parallel<br />

version, doubling the minimum required memory size is<br />

a non-avoidable penalty <strong>of</strong> engaging parallelization. In<br />

the described TLM implementation, each node has the<br />

requirement <strong>of</strong> 30 floating-point variables (120 bytes) for<br />

storing the line voltages.<br />

The algorithm has been designed to be independent<br />

<strong>of</strong> its spatial topology, and it should provide the same<br />

performance regardless <strong>of</strong> the arrangement <strong>of</strong> the nodes<br />

in the space. The motivation for this constraint is to have<br />

a very flexible tool for solving different problems. In<br />

order to work in the same way regardless <strong>of</strong> the spatial<br />

geometry, the algorithm is split into two different steps:<br />

the pre-processing and the TLM computation itself.<br />

The pre-processing is a fast (when compared to TLM<br />

computation) operation which is in charge <strong>of</strong> transforming<br />

an arbitrary topology to a common one which can<br />

accommodate any kind <strong>of</strong> spatial distribution. The idea


is to assign a unique identifier to each node <strong>of</strong> the initial<br />

topology and to store a vector with the unique identifiers<br />

<strong>of</strong> the adjacent nodes. In this way, the result can be seen<br />

as a one-dimensional vector <strong>of</strong> nodes where each one<br />

knows which others are their neighbors. Any complex<br />

distribution <strong>of</strong> nodes can be simplified to this unified<br />

arrangement, regardless <strong>of</strong> the arbitrary initial geometry.<br />

On the other hand, abstracting the initial topology to<br />

this new paradigm brings a penalty <strong>of</strong> 6 integers per<br />

node to store the neighbor identifier at each direction<br />

(for 3D topologies) plus another integer to mark the kind<br />

<strong>of</strong> medium that the node belongs to. Therefore, the total<br />

amount <strong>of</strong> RAM memory required for each node is 152<br />

bytes.<br />

The TLM computation loop employed is shown in<br />

high-level pseudo-code below, where the OpenMP directives<br />

have been included:<br />

#pragma omp parallel private(private variables)<br />

{<br />

for(t=0..TotalTime)<br />

{<br />

#pragma omp single<br />

{<br />

//The reflected pulses become the incident at new time step<br />

Vi = Vr;<br />

//system feeding<br />

for(i=0..NumberOfFeeds) V[i]= feeding;<br />

//store the relevant output<br />

for(i=0..NumberOfOutputs) output=V[i];<br />

}<br />

#pragma omp for schedule (static)<br />

for(i=0..NumberOfNodes) Vr[i]=S*Vi[i];<br />

}end for(t)<br />

}end pragma parallel<br />

The main loop <strong>of</strong> the code is inside a #pragma omp. In<br />

this way, the overhead <strong>of</strong> creating (and destroying) new<br />

threads needs to be computed only once for all the execution.<br />

It mainly consists <strong>of</strong> iteration over time steps, which<br />

are not parallelizable, and which need synchronization<br />

<strong>of</strong> the threads which work inside each iteration. Each<br />

time step iteration is divided into two blocks; a sequential<br />

block and a parallel block. The sequential block performs<br />

three different operations:<br />

• Swap Vi by Vr. As we mentioned before, Vi and<br />

Vr must be stored in separate memory addresses in<br />

order to enable parallelization. At the beginning <strong>of</strong><br />

a time step, the reflected pulses from the previous<br />

iteration become the incident pulses on the neighbor<br />

nodes. In this implementation, the pointers <strong>of</strong> the<br />

vectors Vi and Vr are only exchanged, and the<br />

complexity <strong>of</strong> neighbor swapping the pulses is done<br />

implicitly in the matrix calculations, avoiding extra<br />

reading and writing to memory, although adding a<br />

small penalty in processing and complexity to the<br />

code.<br />

• System feeding. The initial electromagnetic problem<br />

may have sources on its initial definition. These<br />

sources bring external voltage pulses to the system,<br />

which are added in this part <strong>of</strong> the code.<br />

• Output storage. Some key nodes are marked as<br />

output and, therefore, the temporal evolution <strong>of</strong><br />

their voltages is necessary to reconstruct the fields’<br />

- 9 - 15th IGTE Symposium 2012<br />

evolution afterwards. All the line voltages at each<br />

time step from these output nodes are stored in<br />

memory.<br />

The parallel block is in charge for the matrix multiplication<br />

<strong>of</strong> each node. It is composed <strong>of</strong> a parallel for.<br />

Since the Vr calculation can be performed independently<br />

for each node, the OpenMP directive is in charge to<br />

distribute the computations between the available number<br />

<strong>of</strong> threads. Therefore, each thread will compute a portion<br />

<strong>of</strong> the total range <strong>of</strong> i. Since the clause schedule<br />

(static) is present, all the available threads will<br />

iterate the same portion <strong>of</strong> the i range.<br />

In order to reduce the total time <strong>of</strong> computation,<br />

several optimizations have been included here, which<br />

make the real code hard to read. One <strong>of</strong> them reduces<br />

the number <strong>of</strong> multiplications, by identifying operations<br />

which are performed several times for the calculation <strong>of</strong><br />

different reflected pulses. Another, the most complex one,<br />

deals with the neighboring <strong>of</strong> the nodes. It is implemented<br />

in such a way that the nodes on the edges <strong>of</strong> the initial<br />

geometry are treated in a different manner than the<br />

internal nodes are. The code is different but the amount<br />

<strong>of</strong> computation remains similar, except for the nodes<br />

which are edge in more than one <strong>of</strong> its sides. In this case<br />

the computations are larger. The number <strong>of</strong> nodes being<br />

edge for more than one side is usually small on most<br />

geometries. This is true for the models considered in this<br />

work due to the fact that scheduling the parallel for<br />

as static improves the performance <strong>of</strong> the algorithm,<br />

although few nodes require a bit more computation than<br />

others.<br />

IV. MODELLING THE EARTH-IONOSPHERE<br />

WAVEGUIDE<br />

The algorithm described above has been employed to<br />

simulate the Earth-ionosphere waveguide. The surface <strong>of</strong><br />

Earth behaves like a good conductor in the Very Low<br />

Frequency range (VLF, i.e., in the order <strong>of</strong> kHz), with<br />

conductivity ∼10−2 S/m for ground and ∼3.2 S/m for<br />

sea water [16]. There is air above the ground, which is<br />

<strong>of</strong> dielectric nature. As the altitude increases, the number<br />

<strong>of</strong> free electrons increases too, the density <strong>of</strong> neutral<br />

decreases, and the air starts behaving like a conductor.<br />

A typical conductivity pr<strong>of</strong>ile with altitude dependence<br />

is shown in Fig. 2 (top) [17].<br />

The excitation sources <strong>of</strong> the waveguide are lightning<br />

strokes [18]. They generate a broadband signal which<br />

differs in orientation, strength and duration depending <strong>of</strong><br />

its nature (cloud to ground, cloud to cloud, Q-bursts, etc.).<br />

A typical stroke in positive cloud to ground lightning<br />

is depicted in Fig. 2 (bottom) [19], which has been<br />

employed as excitation in the problem considered.<br />

The signal originated by the stroke travels a certain distance,<br />

guided between the two plates before extinguishing<br />

due to losses. On a first approximation, the system<br />

can be regarded as the infinite parallel-plate waveguide.<br />

According to [20], the cut-<strong>of</strong>f frequencies for a lossless<br />

waveguide <strong>of</strong> this geometry are located at:


Fig. 2. Conductivity pr<strong>of</strong>ile <strong>of</strong> the atmosphere with altitude (top),<br />

extracted from [17], and typical current for cloud to ground lightning<br />

(bottom).<br />

Amplitude [a.u.]<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

Vertical electric field as a function <strong>of</strong> frequency<br />

lossy waveguide<br />

lossless waveguide<br />

0<br />

1000 2000 3000 4000 5000 6000 7000 8000<br />

Frequency (Hz)<br />

Fig. 3. Detail <strong>of</strong> the first and second cut-<strong>of</strong>f frequencies for the lossless<br />

and the lossy Earth-ionosphere waveguide.<br />

fn = nc<br />

(2)<br />

2h<br />

where c is the speed <strong>of</strong> light in vacuum, h is the<br />

distance between the parallel plates, n is the mode<br />

number, and fn the associated cut-<strong>of</strong>f frequency <strong>of</strong> the<br />

mode.<br />

The problem has been modelled with the algorithm,<br />

both for a lossless and for a lossy waveguide. For the<br />

lossless waveguide, the conductivity is supposed to be<br />

zero in the dielectric. For the lossy waveguide, the night<br />

conductivity pr<strong>of</strong>ile from Fig. 2 (top) is applied. The<br />

parallel plates are taken as perfect conductors in both<br />

cases. Details <strong>of</strong> the first and second cut-<strong>of</strong>f frequencies<br />

are depicted in Fig. 3, which correspond to electric<br />

field in the z-direction, at a distance <strong>of</strong> 45 km in ydirection<br />

from the source (see Fig. 4 for definition <strong>of</strong><br />

the directions). It is interesting to note the effect <strong>of</strong><br />

the conductivity, which increases the value <strong>of</strong> the cut<strong>of</strong>f<br />

frequencies, being equivalent to having a narrower<br />

waveguide.<br />

A. Algorithm Benchmarking<br />

The total execution time <strong>of</strong> the waveguide model over<br />

different computers has been measured, using a different<br />

- 10 - 15th IGTE Symposium 2012<br />

Fig. 4. Spatial arrangement <strong>of</strong> the waveguide model.<br />

total time <strong>of</strong> execution (s)<br />

2000<br />

1000<br />

500<br />

200<br />

100<br />

Scalability <strong>of</strong> TLM algorithm<br />

Absolute time <strong>of</strong> execution Relative speedups<br />

SM32<br />

SM32 round-robin<br />

SM8<br />

SM8 round-robin<br />

0 5 10 15 20 25 30<br />

Number <strong>of</strong> CPUs<br />

speedup = time 1core / time nCores<br />

18<br />

16<br />

14<br />

12<br />

10<br />

8<br />

6<br />

4<br />

2<br />

SM32<br />

SM32 round-robin<br />

SM8<br />

SM8 round-robin<br />

0<br />

0 5 10 15 20<br />

Number <strong>of</strong> CPUs<br />

25 30<br />

Fig. 5. Total time <strong>of</strong> execution (left) and relative speedups (right) for<br />

the different platforms.<br />

number <strong>of</strong> CPUs, in order to determine the scalability <strong>of</strong><br />

our algorithm. Two different computers have been used<br />

in the benchmarking process:<br />

• SuperMicro8 (SM8). Server with 2 AMD opteron<br />

quad-core processors 2.0 GHz and 32 GB RAM, in<br />

Not Uniform Memory Access (NUMA) configuration.<br />

The OS is OpenSUSE 11.4 and the compiler<br />

employed is opencc 4.2.4 (level 2 <strong>of</strong> optimization).<br />

• SuperMicro32 (SM32). Server with 4 AMD opteron<br />

eight-core processors 2.0 GHz and 96 GB RAM, in<br />

Not Uniform Memory Access (NUMA) configuration.<br />

The OS is OpenSUSE 11.4 and the compiler<br />

employed is opencc 4.2.4 (level 2 <strong>of</strong> optimization).<br />

The problem benchmarked makes use <strong>of</strong> symmetry<br />

and the initial grid is two-dimensional. According to<br />

Fig. 4, the symmetry is applied in the x-direction. The<br />

conductivity pr<strong>of</strong>ile is extended along z, and the output<br />

measured at a certain distance along the y-direction. The<br />

node size is 1.5 km, the time step is 2.5 μs, the number<br />

<strong>of</strong> time steps is 7,500, and the total number <strong>of</strong> nodes is<br />

∼106 (67 nodes in z, 15,000 nodes in y). The excitation is<br />

placed next to the ground, at the center <strong>of</strong> the waveguide.<br />

With this configuration, a total <strong>of</strong> 7.5·109 step-node<br />

computations must be performed to solve the problem.<br />

The total execution time and relative speedups are shown<br />

in Fig. 5, for both platforms. The relative speedup is<br />

defined as the execution time using n cores divided by<br />

the execution time using 1 core. A maximum speedup <strong>of</strong><br />

6 is achieved with SM8, when making use <strong>of</strong> its 8 CPUs.<br />

On SM32, a maximum speedup <strong>of</strong> 16 is obtained when<br />

using 30 CPUs.<br />

As it can be observed in Fig. 5, two benchmarks<br />

have been measured for each computer. The aim is to<br />

compare the performance when using or not Round-<br />

Robin memory allocation policy. This policy consists in<br />

requesting the operative system to balance the memory<br />

reservation equally among the different portions <strong>of</strong> RAM.


This can be accomplished via the numactl tool [21].<br />

If this policy is not enabled, the memory reservation<br />

will be performed sequentially, and only some memory<br />

blocks will be used. Since the computers employed have<br />

NUMA architecture, each processor can access faster to<br />

a certain RAM circuit, while the access to the others is<br />

slower. Moreover, if the Round-Robin policy is not set,<br />

the different CPUs will have to compete to gain access<br />

to the particular RAM circuit, slowing down the overall<br />

computation. This technique is especially effective for a<br />

large number <strong>of</strong> CPUs (see Fig. 5).<br />

V. MODELLING THE WHOLE EARTH-IONOSPHERE<br />

CAVITY<br />

In this section, the whole cavity is considered in the<br />

simulation, leading to a much more time and RAM memory<br />

consuming model. The cavity has been considered as<br />

the space between two concentric spheres <strong>of</strong> 6,370 and<br />

6,470 km, with perfect conducting walls at the borders<br />

and no conductivity at the interior. The spherical shell<br />

has been modeled by cubic nodes, in this case <strong>of</strong> Δl=5<br />

km <strong>of</strong> size. The total number <strong>of</strong> nodes is ∼4.14·108 ,and<br />

the amount <strong>of</strong> RAM required is ∼61.5 GBytes. Around<br />

1.1 GBytes are employed for storing the outputs. For a<br />

spatial grid with a 5 km resolution, the time step required<br />

is 8.34 μs. The number <strong>of</strong> time iterations calculated was<br />

2.4·105 , and therefore the simulated time length is ∼2<br />

s. A frequency resolution <strong>of</strong> 0.5 Hz is achieved when<br />

the FFT is computed with these parameters. The total<br />

execution time required when using 32 cores on SM32<br />

(see Section IV-A) is roughly 6.0·105 s, i.e., around seven<br />

days.<br />

The excitation source <strong>of</strong> the cavity has been located at<br />

θ=0 and r=6,372 km, i.e., the North Pole. The excitation<br />

corresponds to a vertical positive Cloud to Ground (+CG)<br />

lightning, and its current is shown in Fig. 2 (bottom). This<br />

stroke starts at t=0, and lasts for 500 μs.<br />

With this spatial arrangement, the problem has symmetry<br />

over the φ coordinate, and therefore the outputs had<br />

been located all φ=0. A total <strong>of</strong> 101 nodes are marked as<br />

output, and they are equally spaced along the coordinate<br />

θ, from 0 to π, for r=6,370 km, i.e., at the surface,<br />

because it is the common location for SR measurements.<br />

As SR analytical models state [22] [23], the two<br />

relevant components <strong>of</strong> the electromagnetic field are Er<br />

and Hφ. In Figure 6, the six components <strong>of</strong> the output<br />

corresponding to θ=π/4 have been plotted, in order to<br />

show this fact. The other output nodes show similar<br />

results, where the two components mentioned are much<br />

greater than the rest.<br />

In order to corroborate the results from the simulations,<br />

the relationship between the modal amplitude <strong>of</strong> the six<br />

first SR and the angular distance to the source for the<br />

101 nodes marked as output (θ=0, θ=π/100,..., θ=π) has<br />

been plotted in Figure 7. This result is in agreement<br />

with analytical model results [22] [23], which show the<br />

amplitude dependence <strong>of</strong> SR modes with the distance to<br />

the source.<br />

- 11 - 15th IGTE Symposium 2012<br />

Fig. 6. Spectra <strong>of</strong> Electric (left) and Magnetic (right) field components.<br />

The relevance <strong>of</strong> Er and Hφ can be observed.<br />

Hphi [T/sqrt(Hz)]<br />

6e-09<br />

5e-09<br />

4e-09<br />

3e-09<br />

2e-09<br />

1e-09<br />

Dependence <strong>of</strong> SR amplitude with distance to the source (θ), lossless cavity<br />

SR1<br />

SR2<br />

SR3<br />

0<br />

0 π/4 π/2<br />

θ [rad]<br />

3π/4 π<br />

Hphi [T/sqrt(Hz)]<br />

1.2e-08<br />

1e-08<br />

8e-09<br />

6e-09<br />

4e-09<br />

2e-09<br />

SR4<br />

SR5<br />

SR6<br />

0<br />

0 π/4 π/2<br />

θ [rad]<br />

3π/4 π<br />

Fig. 7. Dependence <strong>of</strong> SR modal amplitude with θ, for the lossless<br />

cavity.<br />

The simulation has been repeated changing only the<br />

size <strong>of</strong> the spatial grid to Δl=10 km. Doubling the size<br />

<strong>of</strong> the nodes reduces by a factor <strong>of</strong> eight the number<br />

<strong>of</strong> nodes, at the cost <strong>of</strong> a poorer fitting <strong>of</strong> the spherical<br />

geometry and worse spatial resolution. The maximum<br />

valid frequency is also reduced by a factor <strong>of</strong> two, but<br />

this is not important for the study <strong>of</strong> SR, because the<br />

top frequency is still 3 kHz (the condition is λ ≥ 10Δl).<br />

The amount <strong>of</strong> memory required is reduced to 9.1 GBytes<br />

(with 1.1 GBytes for storing the results). The execution<br />

time, again with 32 cores in SM32, is reduced to 7.6·10 4<br />

s, i.e., roughly 21 hours. The magnetic fields in φ direction<br />

at an angular distance <strong>of</strong> π/4 <strong>of</strong> the two simulations<br />

are compared in Figure 8.<br />

The six maxima from each spectra <strong>of</strong> Hφ have been<br />

extracted and averaged, with the aim <strong>of</strong> using them as a<br />

proxy <strong>of</strong> the resonance position. The results are shown<br />

in Table I, for both simulations.<br />

It can be observed that the results for the central frequencies<br />

<strong>of</strong> the six SR are similar in the two simulations<br />

and with the results from the analytical solution. For the<br />

case <strong>of</strong> the 10 km size simulation, the errors for the<br />

central frequencies are always under 3%. This error is


Amplitude [a.u.]<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

10 km<br />

5 km<br />

Comparison <strong>of</strong> H φ at θ=π/4<br />

0<br />

0 10 20 30 40 50<br />

Frequency (Hz)<br />

Fig. 8. Comparison <strong>of</strong> Hφ at θ=π/4, for the two simulations (5 km<br />

and 10 km).<br />

TABLE I<br />

SR CENTRAL VALUES IN HZ, LOSSLESS CAVITY.<br />

1st SR 2nd SR 3rd SR 4th SR 5th SR 6th SR<br />

10 km 10.24 17.74 24.98 32.35 39.63 46.93<br />

5km 10.47 17.99 25.48 32.96 39.98 47.47<br />

Analytical 10.51 18.20 25.75 33.24 40.71 48.17<br />

reduced to less than 1.5% for the 5 km simulation.<br />

VI. CONCLUSION<br />

The TLM is briefly described and the inherent parallel<br />

areas <strong>of</strong> the algorithm are pointed out. It has been<br />

parallelized for shared memory architectures, by using<br />

OpenMP. The solution obtained has been employed to<br />

simulate the Earth-ionosphere waveguide, and to observe<br />

the changes produced by the conductivity pr<strong>of</strong>ile over<br />

the cut-<strong>of</strong>f frequency, by comparing the results with<br />

the lossless waveguide. The effect <strong>of</strong> this conductivity<br />

is to increase the value <strong>of</strong> the cut-<strong>of</strong>f frequencies,<br />

being equivalent to having a narrower waveguide. The<br />

algorithm has been run on two different platforms and<br />

benchmarked. The algorithm scales up to a speedup <strong>of</strong><br />

16 by using 30 CPUs. In order to obtain the maxima<br />

speedups, it is necessary to set a policy <strong>of</strong> Round-Robin<br />

memory allocation, in order to minimize the effects <strong>of</strong><br />

the NUMA architecture. Finally, a huge simulation (the<br />

whole Earth-ionosphere cavity with a 5 km resolution)<br />

has been performed for validation <strong>of</strong> the parallel algorithm.<br />

The lossless version <strong>of</strong> the cavity is solved, and<br />

the electromagnetic fields obtained are consistent with<br />

the analytical solution <strong>of</strong> the cavity. Since a Cartesian<br />

grid <strong>of</strong> only 5 km size per cell was employed, the<br />

errors are lower than 1.5% for the Schumann resonance<br />

frequencies. The spatial resolution achieved enables the<br />

possibility <strong>of</strong> using this model to study the global effects<br />

generated by local phenomena in the Earth-ionosphere<br />

cavity.<br />

Acknowledgments: This work was supported by the<br />

Consejería de Innovación, Ciencia y Empresa <strong>of</strong> Andalusian<br />

Government and Ministerio de Ciencia e Innovación<br />

- 12 - 15th IGTE Symposium 2012<br />

<strong>of</strong> Spain under projects with references PO7-FQM-03280<br />

and FIS2010-15170, co-financed with FEDER funds <strong>of</strong><br />

the European Union.<br />

REFERENCES<br />

[1] Flynn, L.J.: Intel halts development <strong>of</strong> 2 new microprocessors, The<br />

New York Times, May 8, 2004, retrieved on March 2, 2011 (2004)<br />

[2] Yu, W., Yang X., Liu, Y., Ma, L.-C., Su, T., Huang, N.-T., Mittra,<br />

R., Maaskant, R., Lu, Y., Che, Q., Lu, R., Su, Z.: A new direction<br />

in computational electromagnetics: solving large problems using<br />

the parallel FDTD on the BlueGene/L Supercomputer providing<br />

teraflop performance, IEEE antennas and Propag. Mag., 50(2), 26–<br />

44 (2008).<br />

[3] Christopoulos, C.: The Transmission-Line Modelling Method,<br />

TLM, IEEE Press, Piscataway, N.J. (1995)<br />

[4] Johns, P.B, Beurle, R.L.: Numerical solution <strong>of</strong> 2-dimensional<br />

scattering problems using a transmission-line matrix, Proc. Inst.<br />

Elec. Eng., 118(9): 1203–1208, 1971.<br />

[5] De Cogan, D., Pulko, S.H., O’Connor, W.J.: Transmission-Line<br />

Matrix in computational mechanics, CRC Press, Boca Raton, Fla.<br />

(1995)<br />

[6] Enders, P., Pulko, S.H., Stubbs, D.M.: TLM for diffusion: consistent<br />

first time step. Two-dimensional case, International Journal <strong>of</strong><br />

numerical modelling-electronic networks devices and fields, 15(3),<br />

251–259 (2002)<br />

[7] Morente, J.A., Molina-Cuberos, G.J., Portí, J., Besser, B.P., Salinas,<br />

A., Schwingenschuch, K., Lichtenegger, H.: A numerical simulation<br />

<strong>of</strong> Earth’s electromagnetic cavity with the Transmission Line<br />

Matrix method: Schumann resonances, J. Geophys, Res., 108(A5),<br />

1195–1205 (2003)<br />

[8] S. Toledo-Redondo, Parrot, M., and Salinas, A., “Variation <strong>of</strong> the<br />

first cut-<strong>of</strong>f frequency <strong>of</strong> the Earth-ionosphere waveguide observed<br />

by DEMETER”, J. Geophys. Res., vol. 117, pp. A04321, 2012.<br />

[9] Cummer, S.A.:Modeling electromagnetic propagation in the Earthionosphere<br />

waveguide, IEEE Trans. ant. Propag., 48(9), 1420–1432<br />

(2000)<br />

[10] Kirk, D.B., Hwu, W.W.: Programming massively parallel processors,<br />

a hands-on approach, Morgan Kaufmann, Burlington, M.A.<br />

(2010)<br />

[11] Chapman, B., Jost, G., van der Pas, R.: Using OpenMP: Portable<br />

Shared Memory Parallel Programming, The MIT Press, Cambridge,<br />

Massachussets (2007)<br />

[12] Breshears, C.: The art <strong>of</strong> concurrency, a thread monkey’s guide<br />

to writing parallel applications, O’Reilly, Sebastopol, CA (2009)<br />

[13] Morente, J.A., Jiménez, G., Portí, J., Khalladi, M.: Dispersion<br />

analysis for TLM mesh <strong>of</strong> symmetrical condensed nodes with<br />

stubs, IEEE Trans. Microwave Theory Tech. 43(2), 452–456 (1995)<br />

[14] Johns, P.B.: A symmetrical condensed node for the TLM method,<br />

IEEE Trans. Microwave Theory Tech. 35(4), 370–377 (1987)<br />

[15] Naylor, P., Desai, R.A.: New three dimensional symmetrical condensed<br />

lossy node for solution <strong>of</strong> electromagnetic wave problems<br />

by TLM, Electron. Lett., 26(7), 492-494 (1990)<br />

[16] Rycr<strong>of</strong>t, M.J., Harrison, R.G., Nicoll, K.A., Mareev, E.A.: An<br />

overview <strong>of</strong> Earth’s global electric circuit and atmospheric conductivity,<br />

Space Sci. Rev., 137, 83–105 (2008)<br />

[17] Pechony, O., Price, C.: Schumann resonance parameters calculated<br />

with a partially uniform model on Earth, Venus, Mars, and<br />

Titan, Radio Sci., 39, RS5007 (2004)<br />

[18] Storey, L.R.O.: An investigation <strong>of</strong> whistling atmospherics, Philosophical<br />

transactions <strong>of</strong> the royal society <strong>of</strong> London series A -<br />

Mathematical and physical sciences, 246(908), 113–141 (1953)<br />

[19] Baba, Y., Rakov, V.A.: Present understanding <strong>of</strong> the lightning return<br />

stroke, in Lightning: Principles, instruments and applications,<br />

Springer (2009)<br />

[20] Cheng, D.K.: Field and wave electromagnetics, Addison-Wesley<br />

(1989)<br />

[21] Linux Manual pages, numactl(8),<br />

http://linuxmanpages.com/man8/numactl.8.php<br />

[22] Sentman, D.D., Schumann Resonances, in Handbook <strong>of</strong> atmospheric electrodynamics,<br />

CRC Press, Boca Raton, Fla, (1995)<br />

[23] Toledo-Redondo, S., Salinas, A., Portí, J., Morente, J.A., Fornieles, J.,<br />

Méndez, A., Galindo-Zaldívar, J., Pedrera, A., Ruiz-Constán, A., and Anahnah,<br />

F., Study <strong>of</strong> Schumann resonances based on magnetotelluric records<br />

from the western Mediterranean and Antarctica, J. Geophys. Res., 115, D22,<br />

114, (2010).


- 13 - 15th IGTE Symposium 2012<br />

A Novel Parametric Model Order Reduction<br />

Approach with Applications to Geometrically<br />

Parameterized Microwave Devices<br />

Stefan Burgard∗ , Ortwin Farle∗ , and Romanus Dyczij-Edlinger∗ ∗Chair for Electromagnetic Theory, Saarland <strong>University</strong>, D-66123 Saarbrücken, Germany<br />

E-mail: edlinger@lte.uni-saarland.de<br />

Abstract—Methods <strong>of</strong> model-order reduction approximate the transfer behavior <strong>of</strong> a given high-dimensional system by that<br />

<strong>of</strong> a low-order one, which is much faster to evaluate. In the parametric case, the system features additional parameters,<br />

such as material properties or geometric design variables. The parametric order-reduction methods available today still<br />

exhibit a number <strong>of</strong> limitations, particularly with respect to convergence rates and the size <strong>of</strong> the reduced-order model.<br />

This contribution presents a novel technique based on affine parameter reconstruction and parameter-dependent projection<br />

matrices. It features high rates <strong>of</strong> convergence, supports local adaptation, and yields reduced-order models that are <strong>of</strong> very<br />

low dimension and thus fast to evaluate.<br />

Index Terms—Computer-aided engineering, geometric parameters, parametric model order reduction, parametric models.<br />

I. INTRODUCTION<br />

This paper addresses microwave components with linear<br />

time-invariant system properties. Since most practical<br />

structures possess complicated shape and inhomogeneous<br />

material properties, their fields-level analysis requires numerical<br />

methods, such as the finite-element (FE) method.<br />

FE discretization in the frequency domain results in<br />

systems <strong>of</strong> linear equations which are characterized by<br />

sparse matrices <strong>of</strong> high dimension. While solving a FE<br />

system at one single operating frequency may not be particularly<br />

demanding on modern computers, the analysis<br />

<strong>of</strong> broad frequency bands still tends to be very timeconsuming.<br />

The situation gets even worse when multiple<br />

parameters, such as material properties or geometric<br />

design variables, are present, and entire response surfaces<br />

are to be computed.<br />

Methods <strong>of</strong> model-order reduction (MOR) address this<br />

issue by approximating the behavior <strong>of</strong> the original system<br />

by a reduced-order model (ROM) that is very cheap<br />

to solve. As long as the frequency is the sole parameter,<br />

powerful single-point [1], [2] or multi-point [3], [4]<br />

algorithms are readily available. The incorporation <strong>of</strong><br />

additional parameters, especially those <strong>of</strong> geometric type,<br />

still poses challenges, with respect to convergence rates,<br />

computing times, and model dimension. One particular<br />

difficulty with geometric parameters is that they enter the<br />

FE matrices in the form <strong>of</strong> multivariate rational functions<br />

<strong>of</strong> complicated structure. The authors use the technique<br />

<strong>of</strong> [5] and [6] for affine geometry approximation.<br />

Present parametric model-order reduction (PMOR)<br />

techniques fall under two categories: The one class comprises<br />

methods [7], [5] that employ one global projection<br />

space for all parameters, including the frequency. It is<br />

characteristic <strong>of</strong> such entire-domain methods that the<br />

ROM dimension is large and rises quickly with increasing<br />

size <strong>of</strong> the parameter domain. The other class includes<br />

methods that instantiate frequency-domain ROMs for<br />

a set <strong>of</strong> sampling points in the domain <strong>of</strong> geometric<br />

parameters and employ interpolation over sub-domains<br />

to account for geometry variations. Thanks to their local<br />

nature, the resulting sub-domain ROMs are <strong>of</strong> low dimension.<br />

While existing techniques [8] - [11] interpolate<br />

the frequency-domain ROMs directly, the method proposed<br />

in this paper interpolates projection matrices. One<br />

particular advantage <strong>of</strong> this approach is to decouple the<br />

approximation <strong>of</strong> the effects <strong>of</strong> geometric parameters on<br />

the FE matrices, which may be the dominant source <strong>of</strong><br />

error, from the actual PMOR process.<br />

The remainder <strong>of</strong> the paper is organized as follows:<br />

Section II presents the underlying parameter-dependent<br />

FE system. The treatment <strong>of</strong> geometric parameters is<br />

reviewed in Section III. The new PMOR approach is<br />

developed in Section IV. This constitutes the main contribution<br />

<strong>of</strong> the paper. Section V gives numerical examples<br />

that demonstrate the benefits <strong>of</strong> the suggested approach.<br />

A brief summery in Section VI closes the paper.<br />

II. ORIGINAL SYSTEM<br />

We consider a time-harmonic electromagnetic FE system<br />

<strong>of</strong> dimension N which possesses Q inputs and outputs,<br />

respectively, and depends on the frequency f ∈ R<br />

and a vector p ∈P⊂RP <strong>of</strong> P geometric parameters.<br />

The input vector is denoted by u, the generalized state<br />

by x(f,p), and the output by y(f,p). The system is<br />

assumed to be <strong>of</strong> the form<br />

I<br />

J<br />

<br />

φi(f)Ai(p) x(f,p) = θj(f)Bj u, (1a)<br />

i=1<br />

j=1<br />

J<br />

y(f,p) = ηj(f)B<br />

j=1<br />

T <br />

j x(f,p), (1b)<br />

wherein Bj ∈ RN×Q , and the functions φi,θj,ηj : R →<br />

C and Ai : RP → RN×N are continuous. Eq. (1) implies


that the topology <strong>of</strong> the FE mesh, i.e. the number and<br />

connectivity <strong>of</strong> the FE nodes, must remain the same over<br />

the whole parameter domain P. Meshes <strong>of</strong> this kind<br />

can be constructed for a wide class <strong>of</strong> parameterized<br />

geometries by, e.g., the morphing method <strong>of</strong> [12].<br />

III. GEOMETRY INTERPOLATION<br />

In many cases, the parameter-dependent matrices<br />

Ai(p) are just multivariate rational functions. Nevertheless,<br />

their explicit representation [13] is quite complex<br />

in practice, because it requires tracking the effects <strong>of</strong><br />

all geometric parameters from the solid model through<br />

the mesh generation process to the FE matrix generation<br />

stage. Therefore, the present paper follows the suggestion<br />

<strong>of</strong> [5] and [6] to approximate Ai(p) by a function <strong>of</strong><br />

simpler structure. We set<br />

Ai(p) ≈ <br />

Γβ(p)A<br />

β<br />

β<br />

i for p ∈P, (2)<br />

wherein β =[β1,...,βP ] is a multi-index, A β<br />

i ∈ CN×N ,<br />

and Γβ : R P ↦→ R is a suitable interpolation function.<br />

Thus, the approximate system reads:<br />

<br />

φi(f)<br />

i<br />

<br />

Γβ(p)A<br />

β<br />

m <br />

i x ′ (f,p)= <br />

θj(f)Bju, (3a)<br />

y<br />

j<br />

′ <br />

(f,p) =<br />

<br />

x ′ (f,p), (3b)<br />

ηj(f)B<br />

j<br />

T j<br />

The interpolation functions Γβ are obtained as follows:<br />

For each parameter p, choose a set Np <strong>of</strong> interpolation<br />

points ψ k p,<br />

Np = ψ k p ∈ R <br />

k =1,...,Kp , (4)<br />

and associated interpolation functions γ k p : R → R with<br />

γ k p (ψ l p)=δkl for ψ l p ∈Np. (5)<br />

Next construct a tensor-grid G = N1 × ... ×NP for<br />

the domain P. Then the interpolation point pβ and<br />

interpolation function Γβ are given by<br />

pβ =[ψ β1<br />

1 ,...,ψβP P ], (6a)<br />

Γβ(p) =γ β1<br />

1 (ψ1) · ...· γ βP<br />

P (ψP ). (6b)<br />

By (6) and (2), the interpolation matrices A β<br />

i<br />

are given<br />

by the system matrices Ai <strong>of</strong> the original FE system (1)<br />

at the interpolation point pβ:<br />

A β<br />

i = Ai(pβ). (7)<br />

IV. PARAMETRIC ORDER REDUCTION<br />

We construct the parametric ROM by replacing the<br />

test and trial space, respectively, <strong>of</strong> the interpolated FE<br />

system (3) by an n dimensional subspace S(p) which<br />

depends continuously on the parameter vector p. For<br />

this purpose, a Galerkin procedure based on a parameterdependent<br />

projection matrix V(p) :RP → RN×n , with<br />

S(p) =range {V(p)} , (8)<br />

- 14 - 15th IGTE Symposium 2012<br />

1 1<br />

1 2<br />

[ , ]<br />

1 2<br />

1 2<br />

[ , ]<br />

1 3<br />

1 2<br />

1,1<br />

1,2<br />

2 1<br />

1 2<br />

[ , ]<br />

2 2<br />

1 2<br />

2,1<br />

2,2<br />

3 1<br />

1 2<br />

[ , ]<br />

3 2<br />

[ , ] [ , ]<br />

1 2<br />

2 3 3 3<br />

[ , ] [ , ] [ , ]<br />

Fig. 1. Hypercube topology<br />

1 2<br />

1 2<br />

is applied to (3). The resulting ROM is <strong>of</strong> the form<br />

<br />

φi(f)<br />

i<br />

<br />

Γβ(p)<br />

β<br />

Ãβi<br />

(p)<br />

<br />

˜x = <br />

θj(f)<br />

j<br />

˜ Bj(p)u, (9a)<br />

<br />

˜y(f,p) = ηj(f) ˜ B T <br />

j (p) ˜x(f,p), (9b)<br />

with<br />

j<br />

à β<br />

i (p) =VT (p)A β<br />

i V(p), (10a)<br />

˜Bj(p) =V T (p)Bj. (10b)<br />

As long as n ≪ N, the frequency response <strong>of</strong> the ROM<br />

can be computed much more efficiently than the original<br />

one.<br />

A. Parameter dependent projection matrix<br />

We start by computing n dimensional one-parameter<br />

ROMs with respect to frequency at all interpolation<br />

points pβ ∈ G. The resulting projection matrices are<br />

denoted by ˆ Vβ ∈ C N×n .<br />

The interpolation points pβ ∈Gsubdivide the param-<br />

eter domain into hypercubes H β ⊂ R P . Based on the<br />

line segments L k p = ψ k p,ψ k+1<br />

p<br />

H β = L β1<br />

1<br />

,wehave<br />

× ...×LβP P . (11)<br />

Fig. 1 illustrates the setting in the two-dimensional case.<br />

Starting from one-dimensional hat functions ξ k p : R → R,<br />

ξ k p (ψ) =<br />

⎧<br />

ψ k−1<br />

p −ψ<br />

for ψ ∈Lk−1 p ,<br />

⎪⎨ ψ<br />

⎪⎩<br />

k−1<br />

p −ψk p<br />

ψ k+1<br />

p −ψ<br />

ψ k+1<br />

p −ψk for ψ ∈L<br />

p<br />

k p,<br />

0 else,<br />

(12)<br />

we construct piecewise multi-linear interpolation functions<br />

Ξβ : R P → R <strong>of</strong> compact support:<br />

Ξβ(p) =ξ β1<br />

1 (ψ1) · ...· ξ βP<br />

P (ψP ). (13)<br />

Within a given hypercube Hα , the parameterdependent<br />

projection matrix V(p) is defined by<br />

V(p) = <br />

Ξβ(p) ˆ VβT α β for p ∈H α . (14)<br />

pβ∈H α


Herein, the matrices T α β ∈ Rn×n are provided in order<br />

to conduct state transformations. They are constructed<br />

as follows: Following [8] and [9], a singular value<br />

decomposition [14] is performed,<br />

<br />

= U diag σW H , (15)<br />

<br />

ˆVβ1 ,..., ˆ Vβ (2P )<br />

to determine a basis Rα ∈ RN×n for the n dimensional<br />

subspace <strong>of</strong> highest energy over the hypercube Hα , i.e.,<br />

the subspace corresponding to the n largest singular<br />

values:<br />

Rα = U(:, 1:n). (16)<br />

For any relevant state Rα˜x, we require the ROM state<br />

at the interpolation point pβ, ˆ VβTα β ˜x, to be as close as<br />

possible:<br />

!<br />

=min ∀˜x ∈ C n , (17a)<br />

Rα˜x − ˆ VβT α β ˜x2<br />

⇒ ˜x − R H α ˆ VβT α β ˜x = 0 ∀˜x ∈ Cn . (17b)<br />

Thus,<br />

T α β =<br />

<br />

R T α ˆ −1 Vβ . (18)<br />

Eq. (18) underlines that interpolating the bases ˆ Vβ directly,<br />

which is equivalent to taking Tα β = I, may cause<br />

gross error.<br />

B. Assembly<br />

Plugging (14) into (10) leads to the following representation<br />

<strong>of</strong> the reduced matrices within the hypercube Hα :<br />

à β<br />

<br />

i (p) =<br />

pγ∈Hα <br />

pδ∈Hα Ξγ(p)Ξδ(p) A β<br />

i,γ,δ , (19a)<br />

˜Bj(p) = <br />

Ξγ(p)Bj,γ, (19b)<br />

wherein<br />

pγ∈H α<br />

A β<br />

i,γ,δ =(Tα γ ) T V T γ A β<br />

i VδT α δ , (20a)<br />

Bj,γ =(T α γ ) T V T γ Bj. (20b)<br />

Note that all the coefficient matrices in (20) are <strong>of</strong><br />

reduced size and can be computed in advance. No O(N)<br />

operations are required during the solution process.<br />

V. NUMERICAL EXAMPLES<br />

In the examples below, the single-parameter ROMs<br />

with respect to frequency at the interpolation points are<br />

computed by means <strong>of</strong> the single-point algorithm <strong>of</strong> [1].<br />

A. Dielectric Post<br />

Fig. 2 shows the H plane filter <strong>of</strong> [16]. It consists <strong>of</strong><br />

a dielectric post at the center <strong>of</strong> an air-filled rectangular<br />

waveguide. The model features two parameters: the operating<br />

frequency f ∈ [15, 25] GHz and the geometric<br />

parameter p ∈ [−1.5, 1.5] mm which defines the radius r<br />

<strong>of</strong> the post according to<br />

r =2.5 mm + p. (21)<br />

Fig. 3 shows instantiations <strong>of</strong> the parametric mesh [12]<br />

for p ∈{−1.5, 0, 1.5} mm.<br />

- 15 - 15th IGTE Symposium 2012<br />

10<br />

Γ (1)<br />

WG<br />

Ω<br />

10<br />

ɛd<br />

20<br />

μd<br />

Γ (2)<br />

d<br />

ɛr = μr =1<br />

Γ (1)<br />

d<br />

2r<br />

Γ (2)<br />

WG<br />

5<br />

Fig. 2. Structure <strong>of</strong> rectangular waveguide filter [16]. All dimensions<br />

are in mm. Material properties <strong>of</strong> rod: relative electric permittivity ɛd =<br />

4, relative magnetic permeability μd =1. Waveguide ports are denoted<br />

by Γ (1)<br />

WG and Γ(2)<br />

WG , respectively.<br />

Fig. 3. Instantiations <strong>of</strong> the parametric FE mesh for different values<br />

<strong>of</strong> the geometry parameter: p ∈ {−1.5, 0, 1.5} mm. Note that the<br />

meshes share the same topology.<br />

1) Response Surface and Errors: Fig. 4 presents the<br />

response surface <strong>of</strong> the magnitude <strong>of</strong> the reflection coefficient<br />

|S11|. The parametric ROM is based on M =5<br />

interpolation points placed at the locations <strong>of</strong> the zeros<br />

<strong>of</strong> the fifth-order Chebyshev polynomial <strong>of</strong> the first kind.<br />

The expansion frequency for the single-parameter ROMs<br />

is set at the center <strong>of</strong> the frequency band, f exp =20GHz.<br />

We define the error in S11 by<br />

ES11 (f,p) = S11(f,p) − S11(f,p), (22)<br />

wherein ˜ S11 denotes the PMOR result, and S11 is the<br />

reference solution, which is computed by conventional<br />

FE analysis, using the same mesh. The complete error<br />

surface is given in Fig. 5. It can be seen that errors are<br />

in the order <strong>of</strong> 10 −3 , which is below the typical level <strong>of</strong><br />

the FE discretization error. Note that calculating the error<br />

surface is only possible for very simple structures, like<br />

the present filter, because each <strong>of</strong> the 101×101 sampling<br />

points in f − p space requires a separate FE run.


Fig. 4. Dielectric post: Response surface <strong>of</strong> the magnitude <strong>of</strong> the<br />

reflection coefficient S11 as a function <strong>of</strong> operating frequency f and<br />

radius variation p.<br />

Fig. 5. Dielectric post: Error surface <strong>of</strong> |S11|.<br />

Computational data for conventional FE analysis and<br />

two different PMOR models, ROM3 and ROM5, using<br />

M =3and M =5interpolation points, respectively,<br />

are given in Table I. It can be seen that, even though the<br />

dimension <strong>of</strong> the original FE system is very small, the<br />

larger <strong>of</strong> the two models, ROM5, is still 150 times faster<br />

to evaluate.<br />

2) Analysis <strong>of</strong> Suggested Procedure: Our first goal<br />

is to compare the proposed method, which employs<br />

TABLE I<br />

COMPUTATIONAL DATA FOR DIELECTRIC POST.<br />

Model ROM5 ROM3 FE<br />

Number <strong>of</strong> grid points 5 3 -<br />

Moment-matching order 10 7 -<br />

Dimension 22 16 5616<br />

Model generation (s) ∗ 365.8 177.7 -<br />

Evaluations per s∗ 481.0 757.3 3.2<br />

|Average error in S11| 4.99 · 10−4 5.93 · 10−3 0<br />

∗ MATLAB code on Intel Pentium 4 (3GHz), one thread used.<br />

- 16 - 15th IGTE Symposium 2012<br />

|E |<br />

S11<br />

10 0<br />

10 −1<br />

10 −2<br />

10 −3<br />

10 −4<br />

Direct interpolation <strong>of</strong> ROM bases<br />

Present method<br />

10<br />

−1.5 −1 −0.5 0 0.5 1 1.5<br />

−5<br />

Radius variation (mm)<br />

Fig. 6. Dielectric post: Magnitude <strong>of</strong> error in reflection coefficient S11<br />

as a function <strong>of</strong> radius variation p at f =25GHz. Note the positive<br />

effects <strong>of</strong> state transformations in the present method.<br />

state transformations, to direct interpolation <strong>of</strong> the ROM<br />

bases ˆ Vβ. For this purpose, we consider a ROM based<br />

on M =5equidistant sampling points. Fig. 6 presents<br />

the error in S11 (22) as a function <strong>of</strong> radius variation p<br />

at f =25GHz: The necessity <strong>of</strong> proper state transformations<br />

is evident.<br />

The next test addresses the rate <strong>of</strong> convergence <strong>of</strong> the<br />

proposed method. We start from 3 equidistant interpolation<br />

points at refinement level r =1, and refine the grid<br />

recursively. Thus, the total number <strong>of</strong> points at refinement<br />

level r is<br />

|G| =2 r +1. (23)<br />

Our measure <strong>of</strong> error is ĒS11 , the average error in S11<br />

at f =25GHz,<br />

ĒS11<br />

= 1<br />

Ns<br />

Ns <br />

n=1<br />

| S11(pn) − S11(pn)|, (24)<br />

based on Ns = 257 equally spaced sampling points,<br />

pn ∈ [−1.5, 1.5] mm. Fig. 7 presents the magnitude <strong>of</strong><br />

the average error as a function <strong>of</strong> refinement level for<br />

direct ROM interpolation, a variant <strong>of</strong> the present method<br />

which uses piecewise linear geometry interpolation, and<br />

the suggested approach, employing global polynomial<br />

interpolation for the geometry. The results <strong>of</strong> Fig. 7<br />

show that the total error is dominated by the effects<br />

<strong>of</strong> geometry interpolation: The suggested method clearly<br />

outperforms competing approaches.<br />

B. Mitered Microstrip Bend<br />

Our second example, the mitered microstrip bend<br />

shown in Fig. 8, is a truly three-dimensional structure<br />

with more than one million FE unknowns. The<br />

model features two parameters, the operating frequency<br />

f ∈ [1, 10] GHz and a geometric parameter p ∈<br />

[−0.7, 0.7] mm which controls the width t <strong>of</strong> the mitered


|Average error in S 11 |<br />

10 0<br />

10 −1<br />

10 −2<br />

10 −3<br />

10 −4<br />

10 −5<br />

10 −6<br />

Proposed approach − global polynomial<br />

Proposed approach − piecewise linear<br />

ROM interpolation − piecewise linear<br />

0 1 2<br />

Refinement level<br />

3 4<br />

Fig. 7. Dielectric post: Magnitude <strong>of</strong> average error versus grid<br />

refinement level r at f = 25 GHz. The proposed method benefits<br />

from interpolating the FE matrices by polynomials <strong>of</strong> higher-order.<br />

μr,ɛr<br />

2.413<br />

t<br />

60<br />

60 0.794<br />

Fig. 8. Structure <strong>of</strong> a mitered microstrip bend. Dimensions are in mm.<br />

Material properties <strong>of</strong> substrate: ɛr =2.2, μr =1.<br />

bend. We have:<br />

t =1.7062 mm + p. (25)<br />

Again, the parametric ROM is based on M = 5<br />

interpolation points placed at the locations <strong>of</strong> the zeros<br />

<strong>of</strong> the fifth-order Chebyshev polynomial <strong>of</strong> the first kind.<br />

The expansion frequency for the single-parameter ROMs<br />

is set to f exp =5GHz.<br />

Fig. 9 shows the response surface <strong>of</strong> the magnitude <strong>of</strong><br />

the reflection coefficient S11, calculated by the proposed<br />

method. Fig. 10 presents |S11| and the corresponding<br />

error |ES11 | (22) with respect to conventional FE simulations<br />

versus frequency for the case p =0.2 mm. The<br />

fact that the error is always more than 25 dB below the<br />

signal level underlines the high quality <strong>of</strong> the ROM.<br />

Table II provides computational data for conventional<br />

FE simulation and the ROM. It can be seen that it<br />

takes more than 2 hours to build the parametric ROM.<br />

However, once the ROM is available, it can be evaluated<br />

more than 2200 times per second. For comparison, one<br />

- 17 - 15th IGTE Symposium 2012<br />

Fig. 9. Mitered microstrip bend: Response surface <strong>of</strong> the magnitude<br />

<strong>of</strong> the reflection coefficient |S11| as a function <strong>of</strong> operating frequency<br />

f and miter parameter p.<br />

|S 11 | (dB)<br />

−30<br />

−40<br />

−50<br />

−60<br />

−70<br />

−80<br />

−90<br />

−100<br />

Proposed approach<br />

Error <strong>of</strong> proposed approach<br />

2 4 6<br />

Frequency (GHz)<br />

8 10<br />

Fig. 10. Mitered microstrip bend: |S11| and error |ES11 | versus<br />

frequency. Parameter: p =0.2 mm.<br />

conventional FE solution takes 180 s, which is more than<br />

400 000 times longer!<br />

VI. CONCLUSIONS<br />

This paper has presented a PMOR methodology for<br />

FE models with geometric parameters. It is characteristic<br />

<strong>of</strong> the new approach that geometry approximation<br />

is separated from the actual ROM generation process.<br />

Moreover, the suggested method incorporates state transformations<br />

that improve the quality <strong>of</strong> the interpolated<br />

projection matrices. In consequence, the present PMOR<br />

method reaches higher rates <strong>of</strong> convergence than previous<br />

approaches. Since the resulting parametric models are <strong>of</strong><br />

small dimension, they are very fast to evaluate.<br />

TABLE II<br />

COMPUTATIONAL DATA FOR MICROSTRIP BEND.<br />

Model ROM FE<br />

Number <strong>of</strong> grid points 5 -<br />

Moment-matching order 20 -<br />

Dimension 42 1,175,382<br />

Model generation (s) ∗ 7513.2 -<br />

Evaluations per s ∗ 2267.9 5.56 · 10 −3<br />

|Avr. error in S11| at p =0.2 mm 1.06 · 10 −4 0<br />

∗ MATLAB code on Intel Xeon E5620, one thread used.


REFERENCES<br />

[1] R. D. Slone, R. Lee, and J. F. Lee, “Broadband Model Order<br />

Reduction <strong>of</strong> Polynomial Matrix Equation using Single-Point<br />

Well-Conditioned Asymptotic Waveform Evaluation: Dreivations<br />

and Theory,” Int. J. Numer. Meth. Eng., vol. 58, pp. 2325 – 2342,<br />

Dec. 2003.<br />

[2] Y. Zuh, A. C. Cangellaris, “Finite Element-Based Model Order<br />

Reduction <strong>of</strong> Electromagnetic Devices,” Int. J. Numer. Model.,<br />

vol. 15, pp. 73 – 92, 2002.<br />

[3] R. D. Slone, J.-F. Lee, R. Lee, “Automating Multipoint Galerkin<br />

AWE for a FEM Fast Frequency Sweep,” IEEE Trans. Magn.,<br />

vol. 38, no. 3, pp. 637 – 640, March 2002.<br />

[4] A. Schultschik, O. Farle, R. Dyczij-Edlinger, “An Adaptive Multi-<br />

Point Fast Frequency Sweep for Large-Scale Finite Element<br />

Models,” IEEE Trans. Magn., vol. 45, no. 3, pp. 1108 – 1111,<br />

March 2009 .<br />

[5] R. Dyczij-Edlinger and O. Farle, “Finite element analysis <strong>of</strong><br />

linear boundary value problems with geometrical parameters,”<br />

COMPEL, vol. 28, no. 4, pp. 779 – 794, 2009.<br />

[6] O. Farle, S. Burgard, and R. Dyczij-Edlinger “Passivity Preserving<br />

Parametric Model-Order Reduction for Non-affine Parameters,”<br />

Math. Comp. Model. Dyn. Sys., vol. 17, no. 3, pp. 279 – 294,<br />

2011.<br />

[7] J.R. Phillips, “Variational interconnect analysis via PMTBR,”<br />

ICCAD, pp. 872 – 879, 7-11 Nov. 2004.<br />

[8] B. Lohmann, R. Eid, “Efficient Order Reduction <strong>of</strong> Parametric and<br />

Nonlinear Models by Superposition <strong>of</strong> Locally Reduced Models,”<br />

Methoden und Anwendungen der Regelungstechnik, pp. 27 – 36,<br />

Aachen:Shaker-Verlag, 2009.<br />

[9] H. Panzer, J. Mohring, R. Eid, and B. Lohmann, “Parametric<br />

Model Order Reduction by Matrix Interpolation,” at - Automatisierungstechnik,<br />

vol. 58, no. 8, pp. 475 – 484, 2010.<br />

[10] O. Farle and R. Dyczij-Edlinger, “Numerically Stable Moment<br />

Matching for Linear Systems Parameterized by Polynomials in<br />

Multiple Variables with Applications to Finite Element Models <strong>of</strong><br />

Microwave Structures,” IEEE Trans. Antennas Propag., vol. 58,<br />

no. 11, pp. 3675 – 3684, Sep. 2010.<br />

[11] D. Amsallem, J. Cortial, K. Carlberg, C. Farhat, “A method<br />

for interpolating on manifolds structural dynamics reduced-order<br />

models,” Int. J. Numer. Meth. Eng., vol. 80, no. 9, pp. 1241 –<br />

1258, Nov. 2009.<br />

[12] S. Burgard, Morphing von Finite-Elemente-Netzen, Studienarbeit,<br />

Lehrstuhl für Theoretische Elektrotechnik, Universität des Saarlandes,<br />

2008. In German.<br />

[13] J. Pomplun and F. Schmidt, “Accelerated a Posteriori Error<br />

Estimation for the Reduced Basis Method with Application to<br />

3D Electromagnetic Scattering Problems,” SIAM J. Sci. Comput.,<br />

vol. 32, no. 2, pp. 498 – 520, 2010.<br />

[14] G. H. Golub and C. F. Van Loan, Matrix Computations, Baltimore:Johns<br />

Hopkins <strong>University</strong> Press, pp. 69 – 75, 1996.<br />

[15] J. Rubio, J. Arroyo, and J. Zapata, “SFELP - An Efficient Methodology<br />

for Microwave Circuit Analysis,” IEEE Trans. Microw.<br />

Theory Techn., vol. 49, no. 3, pp. 509 – 516, Mar. 2001.<br />

[16] J. P. Webb, “Finite Element Analysis <strong>of</strong> H-plane Rectangular<br />

Waveguide Problems,” Microw. Antennas Propag., IEE <strong>Proceedings</strong><br />

H, vol. 133, no. 2, pp. 91 – 94, April 1986.<br />

- 18 - 15th IGTE Symposium 2012


- 19 - 15th IGTE Symposium 2012<br />

Efficient Finite-Element Computation <strong>of</strong><br />

Far-Fields <strong>of</strong> Phased Arrays by Order Reduction<br />

A. Sommer∗ , O. Farle∗ , and R. Dyczij-Edlinger∗ ∗Chair for Electromagnetic Theory, Saarland <strong>University</strong>, D-66123 Saarbrücken, Germany<br />

E-mail: edlinger@lte.uni-saarland.de<br />

Abstract—This paper presents an efficient numerical method for computing the far-fields <strong>of</strong> phased antenna arrays over<br />

broad frequency bands as well as wide ranges <strong>of</strong> steering and look angles. The suggested approach combines finiteelement<br />

analysis, projection-based model-order reduction, and empirical interpolation. Numerical results demonstrate that<br />

evaluation times are reduced by orders <strong>of</strong> magnitude, compared to traditional methods.<br />

Index Terms—Empirical interpolation, far field computation, finite-element method, and model order reduction.<br />

I. INTRODUCTION<br />

In many areas <strong>of</strong> application, such as radar or wireless<br />

communications, phased antenna arrays need to be analyzed<br />

over broad frequency bands as well as wide ranges<br />

<strong>of</strong> steering and look angles. Finite-element (FE) based<br />

analysis <strong>of</strong> such structures involves two major steps:<br />

First, the near field is computed as a function <strong>of</strong> angular<br />

frequency ω and steering angles (θs,φs). Second, a<br />

discrete near-field-to-far-field (NF-FF) operator is applied<br />

to determine the far field as a function <strong>of</strong> frequency and<br />

look angles (θ,φ). Using conventional approaches, this<br />

procedure tends to be very time-consuming. The reasons<br />

are as follows: Since typical antenna arrays are electrically<br />

large and consist <strong>of</strong> high numbers <strong>of</strong> radiators, the<br />

corresponding FE systems are <strong>of</strong> very large dimension.<br />

Moreover, broadband analysis requires large numbers <strong>of</strong><br />

sampling frequencies. At each <strong>of</strong> them, the large-scale<br />

FE system has to solved, by matrix factorization or some<br />

iterative method. In addition, wide variations in steering<br />

angles imply large numbers <strong>of</strong> sampling angles (θs,φs).<br />

Each <strong>of</strong> them leads to separate excitation and right-hand<br />

side (RHS), respectively. Finally, wide variations in look<br />

angles result in large numbers <strong>of</strong> sampling points (θ,φ).<br />

Each <strong>of</strong> them requires a separate NF-FF transformation<br />

at each operating point (θs,φs; ω) <strong>of</strong> the antenna array.<br />

To reduce computational efforts, we propose a twostep<br />

approach: We first construct a reduced-order model<br />

(ROM) for the near fields which is very cheap to solve<br />

at any value <strong>of</strong> the parameter triple (θs,φs; ω). For<br />

this purpose, a multi-point model-order reduction (MOR)<br />

method with self-adaptive expansion point selection [1],<br />

[2] is applied. The second step utilizes the empirical<br />

interpolation (EI) method [3], [4] to construct an affine<br />

approximation to the NF-FF operator as a function <strong>of</strong><br />

frequency and look angles. It, too, is very fast to evaluate<br />

at any value <strong>of</strong> the parameter triple (θ,φ; ω). Combining<br />

both steps yields a highly efficient numerical model<br />

for computing the far-fields as a function <strong>of</strong> the five<br />

parameters (θ,φ; θs,φs; ω), which is ideally suited for<br />

fast online evaluation: In Section VI, computing a farfield<br />

pattern based on 4830 look angles takes only 2.4 s,<br />

on a personal computer executing plain MATLAB code.<br />

The construction <strong>of</strong> the model by MOR and EI is<br />

much more time-consuming and must be performed in<br />

advance, in an <strong>of</strong>fline step. The most expensive procedure<br />

in the MOR algorithm is the FE analysis <strong>of</strong> the array at a<br />

number <strong>of</strong> expansion points (θs,φs; ω), which are chosen<br />

adaptively. Note that, as long as a direct solver is used,<br />

changes in the three parameters are not equally expensive:<br />

Since ω affects the FE matrix, each frequency value<br />

requires a new matrix factorization, which is computationally<br />

expensive. On the other hand, the steering angles<br />

(θs,φs) enter the RHS only. Thus, changes in angle just<br />

require additional forward-back substitutions, which are<br />

much cheaper. Therefore, the adaptive point placement<br />

strategy <strong>of</strong> the MOR method ought to vary ω as rarely<br />

as possible. The new frequency-slicing greedy method<br />

presented in Section IV-B implements this strategy in<br />

a systematic fashion. For the highest-accuracy ROM <strong>of</strong><br />

Section VI, it improves computing time by a factor <strong>of</strong> 7<br />

at the cost <strong>of</strong> increasing ROM size by 5%, compared to<br />

state-<strong>of</strong>-the-art methods [1], [2].<br />

The numerical experiments <strong>of</strong> Section VI indicate that<br />

both MOR and EI feature exponential convergence, in<br />

accordance with theoretical results [5]. Our results for<br />

a real-world example [6] demonstrate that the suggested<br />

two-step approach for the broadband analysis <strong>of</strong> radiation<br />

patterns <strong>of</strong> phased arrays achieves high accuracy and reduces<br />

evaluation times by orders <strong>of</strong> magnitude, compared<br />

to conventional approaches.<br />

II. FAR-FIELD COMPUTATION<br />

By the vector Huygens principle in the frequency<br />

domain [7], the radiation vector F <strong>of</strong> an arbitrary antenna<br />

array, which is enclosed by a surface S, isgivenby<br />

<br />

ω j ˆr·r c F (ˆr,ω)=ˆr × e 0<br />

S<br />

′<br />

J s (r ′ ,ω)dS ′ × ˆr<br />

+ 1<br />

<br />

ω j c ˆr·r<br />

e 0<br />

η0 S<br />

′<br />

M s (r ′ ,ω)dS ′ × ˆr. (1)<br />

Here c0 and η0 denote the vacuum speed <strong>of</strong> light and<br />

characteristic impedance, respectively, and ˆr is the unit<br />

vector in the direction <strong>of</strong> the observer. The equivalent<br />

electric and magnetic surface current densities, J s and


M s, are given in terms <strong>of</strong> the electric and magnetic nearfields<br />

E and H by<br />

J s (r ′ ,ω)=ˆn × H (r ′ ,ω) with r ′ ∈ S, (2a)<br />

M s (r ′ ,ω)=−ˆn × E (r ′ ,ω) with r ′ ∈ S, (2b)<br />

wherein ˆn stands for the outward-pointing unit normal<br />

vector on the Huygens surface S. The far-fields EF and<br />

HF are obtained from the radiation vector (1) by<br />

ω<br />

c r<br />

e−j 0<br />

EF (r,ω)=−jμ0ω F (ˆr,ω) , (3a)<br />

4πr<br />

HF (r,ω)=−j ω<br />

ω −j c r<br />

e 0<br />

ˆr × F (ˆr,ω) (3b)<br />

c0 4πr<br />

and the directive gain [8] is given by<br />

2 μ0ω F (ˆr,ω)<br />

D (ˆr,ω)=<br />

8πc0<br />

2<br />

2 , (4)<br />

P (ω)<br />

wherein μ0 describes the vacuum permeability. In (4),<br />

the total radiated power P (ω) is determined from<br />

P (ω) = 1<br />

2 ℜ<br />

<br />

ˆn × E (r<br />

S<br />

′ ,ω) · H (r ′ ,ω)dS ′<br />

<br />

. (5)<br />

III. FE MODEL AND MULTI-POINT MOR METHOD<br />

FE analysis <strong>of</strong> the near fields <strong>of</strong> a phased array <strong>of</strong> L<br />

antennas leads to a linear system <strong>of</strong> the form<br />

(A0 + ωA1+ω 2 L<br />

A2) x (p) =ω up (p) bp, (6a)<br />

p=1<br />

y (p) = C0 + ω −1 <br />

C1 x (p) , (6b)<br />

P (p) =ω −1 x ∗ (p) Dx (p) . (6c)<br />

Herein, A0, A1, A2 ∈ CN×N are the stiffness, damping,<br />

and mass matrices, respectively, x denotes the solution<br />

vector in terms <strong>of</strong>E, p = (θs,φs,ω) ∈ R3 the parameter<br />

vector, and N the dimension <strong>of</strong> the FE system. The<br />

output vector y ∈ C6H holds the electric and the magnetic<br />

near-field values E and H, respectively, sampled<br />

at H points on the Huygens surface S. In (6b), the<br />

matrices C0 ∈ C6H×N and C1 ∈ C6H×N carry out<br />

the sampling process and magnetic field computation on<br />

S. Furthermore, the Hermitian matrix D ∈ CN×N <strong>of</strong> the<br />

bilinear form (6c) represents the computation <strong>of</strong> the total<br />

radiated power (5). It can be seen that the system matrix<br />

<strong>of</strong> (6a) depends on the angular frequency ω only, while<br />

the RHS also depends on the steering angles θs and φs.<br />

Note that the RHS is constructed by a superposition <strong>of</strong><br />

L linearly independent vectors bp ∈ CN with parameterdependent<br />

weights up (p).<br />

To obtain the near-fields vector y and the total radiated<br />

power P , the large-scale system (6a) has to be solved for<br />

each parameter vector p <strong>of</strong> interest. Our goal is to bypass<br />

this time-consuming procedure. Since the FE system (6)<br />

exhibits affine parameter dependence [1], it is well-suited<br />

for projection-based MOR. The idea is to approximate<br />

the FE solution x (p) in a low dimensional subspace<br />

according to<br />

x (p) ≈ ˆx (p) =V˜x (p) (7)<br />

- 20 - 15th IGTE Symposium 2012<br />

with ˆx ∈ CN , ˜x ∈ Cn , V ∈ CN×n , and n ≪ N for<br />

all p ∈ D. Here, D denotes the considered parameter<br />

domain. For numerical stability, the columns <strong>of</strong> the trial<br />

matrix V are chosen to be orthogonal. Substituting the<br />

approximation (7) for x (p) in (6a) and testing with V∗ leads to the ROM:<br />

2<br />

ω q L<br />

Ãq ˜x (p) =ω up (p) ˜ bp, (8a)<br />

q=0<br />

ˆy (p) =<br />

p=1<br />

1<br />

ω −r Cr˜x ˜ (p) , (8b)<br />

r=0<br />

ˆP (p) =ω −1˜x ∗ (p) ˜ D˜x (p) , (8c)<br />

wherein the reduced matrices and vectors are given by<br />

Ãq = V ∗ AqV with Ãq ∈ C n×n , (9)<br />

˜bp = V ∗ bp with bp<br />

˜ ∈ C n , (10)<br />

˜Cr = CrV with Cr<br />

˜ ∈ C 6H×n , (11)<br />

˜D = V ∗ DV with D˜ n×n<br />

∈ C . (12)<br />

Using a multi-point (MP) MOR method, the trial matrix<br />

V is constructed from FE solutions on a discrete set<br />

De ⊂D<strong>of</strong> expansion points pi ∈ De such that<br />

range V =span{x (p1) ,...,x (pn)} . (13)<br />

As long as n ≪ N, the ROM (8) can be solved much<br />

more efficiently than the original system (6). Thus, the<br />

computational costs for determining both the near-field<br />

values and the total radiated power can be kept very low.<br />

IV. SELF-ADAPTIVE EXPANSION POINT SELECTION<br />

The residual r <strong>of</strong> ˆx with respect to (6a) takes the form<br />

r (p) =<br />

2<br />

ω q L<br />

(AqV) ˜x (p) − ω up (p) bp. (14)<br />

q=0<br />

Thus, the computation <strong>of</strong> its 2-norm,<br />

r (p) 2<br />

2 =<br />

2 2<br />

p=1<br />

ω<br />

q1=0 q2=0<br />

q1+q2 ˜x (p) ∗ V ∗ A ∗ q1Aq2V ˜x (p)<br />

− 2ℜ<br />

+ ω 2<br />

L<br />

p=0 q=0<br />

L<br />

p1=0 p2=0<br />

2<br />

ω q+1 b ∗ pAqV <br />

˜x (p)<br />

L<br />

up1(p)up2(p) b ∗ p1bp2 <br />

, (15)<br />

just involves matrices and vectors <strong>of</strong> the ROM dimension<br />

n ≪ N and is therefore very fast. This motivates the use<br />

<strong>of</strong> a residual-based error indicator ρ(Dds) in the pointplacement<br />

strategy:<br />

ρ(Dds) = max r(p)2 . (16)<br />

p∈Dds<br />

Herein, Dds stands for a dense sampling <strong>of</strong> the considered<br />

domain.


Algorithm 1 Conventional Greedy Algorithm.<br />

Given: Dds, p1 ∈ Dds, and ɛ.<br />

n =0. {Initialize ROM dimension.}<br />

repeat<br />

n ← n +1.<br />

ωc = ω(pn).<br />

Compute LU factorization <strong>of</strong> A(ωc).<br />

Determine x (pn) by forward-back substitution.<br />

Construct ROM by (8a).<br />

Compute residual r(p) 2 for all p ∈ Dds.<br />

Place expansion point pn+1 using (17).<br />

until ρn(Dds)


Let P denote the number <strong>of</strong> far-field look angles for<br />

which (23) is to be evaluated. It can be seen that, although<br />

the solution ˜x (p) <strong>of</strong> the ROM (8) is used, the computational<br />

effort for merely one single operating-point p ∈ D<br />

is still <strong>of</strong> complexity O (PH + Hn), i.e., the far-field<br />

computation itself is expensive, too. Our solution to this<br />

problem is to adopt an idea from [2] and employ the<br />

EI method [3], [4] to construct an affine decomposition<br />

<strong>of</strong> the exponential function (22). The <strong>of</strong>fline part <strong>of</strong> this<br />

method uses a greedy strategy to determine a set <strong>of</strong> M<br />

basis functions {qm} M<br />

m=1 , interpolation points {r′ m} M<br />

m=1<br />

and parameter values {d ′ m} M<br />

m=1 such that the interpolant<br />

ê (d, r ′ ) defined by<br />

ê (d, r ′ M<br />

)= αm (d) qm (r ′ ) (24)<br />

m=1<br />

approximates (22) for all (r ′ , d) ∈ S ×M. Having<br />

constructed the interpolation matrix<br />

⎡<br />

⎤<br />

⎢<br />

BM = ⎣<br />

. ..<br />

⎥<br />

⎦ (25)<br />

q1 (r ′ 1)<br />

.<br />

q1 (r ′ M ) ... qM (r ′ M )<br />

<strong>of</strong>fline, the parameter-dependent coefficients<br />

{αm (d)} M<br />

m=1 are obtained online, by solving the<br />

lower triangular system<br />

⎡ ⎤ ⎡<br />

α1 (d) e (r<br />

⎢<br />

[BM ]<br />

. ⎥ ⎢<br />

⎣ . ⎦ = ⎣<br />

αM (d)<br />

′ ⎤<br />

1, d)<br />

. ⎥<br />

. ⎦ . (26)<br />

, d)<br />

e (r ′ M<br />

Substituting the empirical interpolant (24) for e (d, r ′ ) in<br />

(23) results in<br />

Ix (˜x (p) , ê (d, r ′ ) ,ω)=ω −1 M<br />

ΔS αm (d)<br />

H<br />

h=1<br />

m=1<br />

qm (r ′ h) ˆn (r ′ h) × ˜ C1 (r ′ h) ˜x (p) . (27)<br />

Under the precondition that the sampling points on the<br />

Huygens surface S remain constant for all M steps <strong>of</strong> the<br />

EI method, the online part <strong>of</strong> (24) can be implemented<br />

such that it takes only O (M) operations. Thus, the<br />

computational efforts for computing P far-field values<br />

by (27) for a given operating-point p ∈ D are only <strong>of</strong><br />

order O (PM + Mn). Since, in practice, M ≪ H, the<br />

costs <strong>of</strong> the far-field computation are greatly reduced.<br />

VI. NUMERICAL RESULTS<br />

In the following, we consider the FE model <strong>of</strong> a<br />

dual-polarized tapered slot antenna array (TSAA) [6]<br />

consisting <strong>of</strong> L = 40 antennas, whose geometry is<br />

depicted in Fig. 1. The frequency band is given by<br />

f ∈ [2, 4] GHz, and the scan angles <strong>of</strong> interest are in<br />

the range <strong>of</strong> (θs,φs) ∈ 0, π<br />

2<br />

3 × [0, 2π) rad .Weuse<br />

#Dds = 17040 training points in the <strong>of</strong>fline part <strong>of</strong><br />

the self-adaptive multi-point method <strong>of</strong> Section IV and<br />

construct the ROM (8) by both the conventional greedy<br />

algorithm and the new FSG approach <strong>of</strong> Section IV-B.<br />

- 22 - 15th IGTE Symposium 2012<br />

Fig. 1. Geometry <strong>of</strong> the TSAA [6]. Dimensions: length l =8cm,<br />

width w = 8 cm, height h = 7 cm, and displacement <strong>of</strong> adjacent<br />

antennas s =2cm.<br />

Maximum norm: local error indicator<br />

10 0<br />

10 −2<br />

10 −4<br />

10 −6<br />

10 −8<br />

Conventional greedy method<br />

FSG method<br />

50 100 150 200 250 300 350<br />

ROM dimension n<br />

Fig. 2. Normalized error indicator (16) versus ROM dimension n<br />

for the conventional and the new FSG method. Circles mark changes<br />

in expansion-point frequency in the FSG method, requiring matrix<br />

factorization.<br />

A. Properties <strong>of</strong> FSG method<br />

Fig. 2 presents the behavior <strong>of</strong> the normalized error<br />

indicator (16) as a function <strong>of</strong> ROM dimension n. It<br />

can be seen that the standard method achieves nearly<br />

constant rates <strong>of</strong> convergence, whereas the FSG approach<br />

converges rather slowly during the early stages <strong>of</strong> the<br />

iteration. This behavior is expected because, early on, the<br />

frequency sampling <strong>of</strong> the FSG method is very poor. On<br />

the other hand, the standard procedure must factorize the<br />

FE matrix at each iteration, whereas the FSG method requires<br />

factorizations only when the expansion frequency<br />

changes, i.e., at the iterations marked by circles in Fig. 2.<br />

Thus, to compare overall computational efficiency, we<br />

have measured computing times for the same threshold<br />

ɛ <strong>of</strong> the error indicator. Table I presents the results. It<br />

can be seen that, depending on the threshold level, the<br />

proposed FSG method is 5 to 7.5 times faster.


Relative error e n<br />

TABLE I<br />

TSAA: COMPUTING TIME FOR ROM CONSTRUCTION (8)<br />

Residual Time t ∗ Speed-up Dimension n<br />

threshold ɛ Alg. 1 Alg. 2 factor Alg. 1 Alg. 2<br />

2.7 e−3 92.16 h 18.34 h 5.025 153 285<br />

4.6 e−5 164.98 h 24.25 h 6.803 265 333<br />

1.3 e−6 221.32 h 29.46 h 7.513 346 363<br />

∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />

10 0<br />

10 −2<br />

10 −4<br />

10 −6<br />

10 −8<br />

Conventional greedy method<br />

FSG method<br />

50 100 150 200 250 300 350<br />

ROM dimension n<br />

Fig. 3. Relative error in near-fields (28) versus ROM dimension n<br />

for the conventional approach and the FSG method. Parameter: p =<br />

( π π<br />

rad, − rad, 3.645 GHz) /∈ Dds.<br />

4 6<br />

B. Error in near-fields<br />

Our measure for the error in the near-fields at a given<br />

parameter vector p is the relative error e(p) defined by<br />

en (p) = x (p) − ˆxn (p)2 .<br />

x (p)2 (28)<br />

To investigate the convergence behavior <strong>of</strong> the<br />

projection-based MOR method, we choose a representative<br />

parameter vector, p = π π<br />

4 rad, − 6 rad, 3.645 GHz /∈<br />

Dds, and evaluate (28) as a function <strong>of</strong> ROM dimension<br />

n. Fig. 3 shows that both the conventional approach<br />

and the FSG method exhibit exponential convergence.<br />

C. Error in far-fields<br />

The following tests are based on #Mds = 28380<br />

training points in the <strong>of</strong>fline part <strong>of</strong> the EI method. The<br />

considered look angles are in the range <strong>of</strong> (θ,φ) ∈<br />

π<br />

2<br />

0, × [0, 2π) rad .<br />

2<br />

We first investigate the error <strong>of</strong> the empirical interpolant<br />

êm (d, r ′ ) <strong>of</strong> (24) with respect to the true value<br />

<strong>of</strong> the exponential function e (d, r ′ ) <strong>of</strong> (22). For this<br />

purpose, we choose a representative parameter vector,<br />

d =(3.789 GHz, − π π<br />

4 rad, 5 rad) /∈ Mds and monitor the<br />

relative error em(d),<br />

<br />

<br />

<br />

<br />

em (d) = max <br />

<br />

<br />

, (29)<br />

r ′ ∈Sh<br />

e (d, r ′ ) − êm (d, r ′ )<br />

e (d, r ′ )<br />

as a function <strong>of</strong> the number <strong>of</strong> EI coefficients m. The<br />

results shown in Fig. 4 demonstrate that the EI method<br />

leads to exponential convergence.<br />

- 23 - 15th IGTE Symposium 2012<br />

Relative error e m<br />

10 2<br />

10 0<br />

10 −2<br />

10 −4<br />

10 −6<br />

10<br />

0 200 400 600 800<br />

−8<br />

Number <strong>of</strong> coefficients m<br />

Fig. 4. Relative error in exponential function (29) versus number <strong>of</strong><br />

EI coefficients m. Parameter: d = 3.789 GHz, − π<br />

4<br />

rad, π<br />

5 rad .<br />

TABLE II<br />

AVERAGE ERROR IN DIRECTIVE GAIN.<br />

Steering angles<br />

(θs,φs)<br />

Average error eD (30)<br />

2.57 GHz 3.30 GHz 3.95 GHz<br />

( π π<br />

rad, 4 2 rad) 1.2 × 10−3 1.7 × 10−3 2.6 × 10−3 ( π π<br />

rad, 4 4 rad) 1.1 × 10−3 1.8 × 10−3 2.8 × 10−3 ( π<br />

6 rad, 0 rad) 1.3 × 10−3 1.9 × 10−3 3.3 × 10−3 ( π π<br />

rad, − 3 3 rad) 2.2 × 10−3 2.0 × 10−3 3.1 × 10−3 In our final test, we consider the error in radiation<br />

pattern <strong>of</strong> the phased antenna array, by measuring the<br />

average error in directive gain eD,<br />

eD (p) = 1<br />

<br />

P <br />

D<br />

(p, dp) −<br />

<br />

P <br />

p=1<br />

ˆ <br />

D (p, dp)<br />

<br />

<br />

. (30)<br />

D (p, dp) <br />

Table II presents error values for 12 different parameter<br />

vectors p /∈ Dds, corresponding to the far-field plots in<br />

Fig. 5 – Fig. 7. It can be seen that the results <strong>of</strong> the<br />

suggested MOR approach are in very good agreement<br />

with reference data. Computational parameters for this<br />

test can be found in Table III and Table IV.<br />

D. Overall runtime performance<br />

Fig. 5 – Fig. 7 show three-dimensional radiation patterns<br />

<strong>of</strong> the TSAA for different operating frequencies<br />

and four steering angles per frequency. Computational<br />

data <strong>of</strong> the original FE model and the ROM are given in<br />

Table III and Table IV, respectively. Without doubt, the<br />

<strong>of</strong>fline part <strong>of</strong> the algorithm leads to some one-time costs<br />

for constructing the ROM and the affine approximation<br />

to the NF-FF operator. However, once they are available,<br />

computing time for the near-fields improves by a factor <strong>of</strong><br />

68000, compared to conventional FE analysis. Moreover,<br />

post-processing time for one radiation pattern based on<br />

P =4, 830 look angles reduces by a factor <strong>of</strong> 12. Thus,<br />

the total speed-up factor for computing one near-field<br />

solution plus the corresponding far-field pattern is 910.<br />

VII. CONCLUSIONS<br />

An efficient two-step MOR method for computing<br />

the far-field patterns <strong>of</strong> phased antenna arrays has been


(a) θs = π<br />

4<br />

rad, φs = π<br />

2<br />

π<br />

π<br />

rad. (b) θs = rad, φs = 4 4 rad.<br />

(c) θs = π<br />

π<br />

π<br />

rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />

Fig. 5. Radiation patterns <strong>of</strong> the TSAA at f =2.57 GHz, determined<br />

by the two-step MOR method using P =4, 830 look angles.<br />

(a) θs = π<br />

4<br />

rad, φs = π<br />

2<br />

π<br />

π<br />

rad. (b) θs = rad, φs = 4 4 rad.<br />

(c) θs = π<br />

π<br />

π<br />

rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />

Fig. 6. Radiation patterns <strong>of</strong> the TSAA at f =3.30 GHz, determined<br />

by the two-step MOR method using P =4, 830 look angles.<br />

presented. Thanks to the new FSG technique, evaluation<br />

times for a real-world example [6] improve by a factor<br />

<strong>of</strong> 5 to 7.5 over earlier MOR approaches, and by a factor<br />

<strong>of</strong> 910 compared to conventional FE analysis.<br />

REFERENCES<br />

[1] V. de la Rubia, U. Razafison, and Y. Maday, ”Reliable fast<br />

frequency sweep for microwave devices via the reduced-basis<br />

method”, IEEE Trans. Microw. Theory Techn., vol. 57, pp. 2923-<br />

2937, Dec. 2009.<br />

[2] M. Fares, J. S. Hesthaven, Y. Maday, and B. Stamm, ”The reduced<br />

basis method for the electric field integral equation”, J. Comput.<br />

Phys., vol. 230, pp. 5532-5555, 2011.<br />

- 24 - 15th IGTE Symposium 2012<br />

(a) θs = π<br />

4 rad, φs = π<br />

2 rad. (b) θs = π<br />

4 rad, φs = π<br />

4 rad.<br />

(c) θs = π<br />

π<br />

π<br />

rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />

Fig. 7. Radiation patterns <strong>of</strong> the TSAA at f =3.95 GHz, determined<br />

by the two-step MOR method using P =4, 830 look angles.<br />

TABLE III<br />

COMPUTATIONAL DATA OF ORIGINAL FE MODEL OF TSAA.<br />

Parameters: θs = π<br />

π<br />

rad, φs = rad, f = 2.57 GHz.<br />

4 2<br />

FE dimension N 2, 553, 439<br />

Number <strong>of</strong> near-field points H 12, 800<br />

Number <strong>of</strong> look angles P 4, 830<br />

Time for solving FE system (6a) 2192.4 s∗ Time for computing radiation pattern 28.9 s∗ ∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />

TABLE IV<br />

COMPUTATIONAL DATA OF REDUCED-ORDER MODEL OF TSAA.<br />

Parameters: θs = π<br />

π<br />

rad, φs = rad, f = 2.57 GHz.<br />

4 2<br />

ROM dimension n 300<br />

Number <strong>of</strong> EI coefficients m 350<br />

Number <strong>of</strong> look angles P 4, 830<br />

Offline time for generating ROM (8) 20.36 h∗ Offline time for EI method 33.97 h∗ Online time for solving ROM (8a) 0.0321 s∗ Online time for radiation pattern 2.4087 s∗ ∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />

[3] M. Barrault, Y. Maday, N. C. Nguyen, and A. T. Patera, ”An<br />

’empirical interpolation’ method: application to efficient reducedbasis<br />

discretisation <strong>of</strong> partial differential equations”, C. R. Acad.<br />

Sci. Paris, Ser. I 339, pp. 667-672, 2004.<br />

[4] M. A. Grepl, Y. Maday, N. C. Nguyen, A. T. Patera, ”Efficient reduced<br />

basis treatment <strong>of</strong> nonaffine and nonlinear partial differential<br />

equations”, M2AN Math. Model. Numer. Anal. 41, pp. 575605,<br />

2007.<br />

[5] P. Binev, A. Cohen, W. Dahmen, R. DeVore, G. Petrova, and P.<br />

Wojtaszczyk, ”Convergence rates for greedy algorithms in reduced<br />

basis methods”, SIAM J. Math. Anal., vol. 43, pp. 1457-1472,<br />

2011.<br />

[6] T.-H. Chio, and D. H. Schaubert, ”Parameter study and design <strong>of</strong><br />

wide-band widescan dual-polarized tapered slot antenna arrays”,<br />

IEEE Trans. Antennas Propag., vol. 48, pp. 879-886, June 2000.<br />

[7] E. J. Rothwell and M. J. Cloud, ”Electromagnetics”, CRC Press,<br />

2009<br />

[8] S. J. Orfanidis, ”Electromagnetic Waves and Antennas”,<br />

http://www.ece.rutgers.edu/ orfanidi/ewa.


- 25 - 15th IGTE Symposium 2012<br />

Nanoparticle device for biomedical and<br />

optoelectronics applications<br />

R. Iovine, L. La Spada and L. Vegni<br />

Department <strong>of</strong> Applied Electronics, <strong>University</strong> <strong>of</strong> Roma Tre, Via della Vasca Navale 84, 00146 Rome, Italy<br />

E-mail: riovine@uniroma3.it<br />

Abstract—In this contribution a nanoparticle device, operating in the visible regime based on the Localized Surface Plasmon<br />

Resonance (LSPR) phenomenon, is presented. The nanoparticle electromagnetic properties are evaluated by a new analytical<br />

model and compared to the results obtained by numerical analysis. A near-field enhancement is obtained by arranging the<br />

nanoparticles in a linear array. Analytical formulas, describing such enhancement, are presented. The structure can find<br />

application for medical diagnostics and optoelectronics applications.<br />

Index Terms— LSPR, Medical diagnostics, Nanoparticle, Near-field Enhancement, Optoelectronics Applications<br />

I. INTRODUCTION<br />

In the last few years, several researches have paid<br />

attention to gold nanoparticles optical properties relate to<br />

the interaction <strong>of</strong> these structures with electromagnetic<br />

field at Visible (VIS) and Near Infrared Region (NIR)<br />

[1].<br />

When the electromagnetic field interacts with small metal<br />

particles the conduction electrons start oscillating<br />

collectively. This phenomenon is now well known and<br />

called Localized Surface Plasmon (LSP) [2]. If the<br />

frequency <strong>of</strong> the incident field matches the natural<br />

frequency oscillation <strong>of</strong> the electrons cloud the resonance<br />

condition is established with a strong dependence on the<br />

shape, size, composition <strong>of</strong> the nanoparticles as well as<br />

on the dielectric properties <strong>of</strong> the background<br />

environment [3].<br />

The analytical closed form electromagnetic solution to<br />

evaluate the electromagnetic behavior <strong>of</strong> metal<br />

nanoparticles exists only for the spherical shapes [4]. The<br />

possibility to predict the electromagnetic properties <strong>of</strong><br />

different kind <strong>of</strong> shapes is now very important due to the<br />

fact that the progress in nan<strong>of</strong>abrication technology<br />

allows to realize many shapes <strong>of</strong> particles [5] suitable for<br />

several application field such as biomedical sensing [6]<br />

and thin film solar cells [7].<br />

For example in [8] the possibility to control the<br />

enhancement <strong>of</strong> the Surface Enhanced Raman Scattering<br />

(SERS) using gold nanoparticles in the field <strong>of</strong> diagnostic<br />

oncology is reported. In [9] the possibility to use gold<br />

nanoparticles to produce in an efficient way heat energy<br />

from absorbed light energy that may be employed for<br />

selective PhotoThermal Therapy (PTT) is referred.<br />

The aim <strong>of</strong> this contribution is to propose the design <strong>of</strong> a<br />

nanostructure device consisting in a gold linear chain<br />

array <strong>of</strong> nanocubes, deposited on a silica substrate.<br />

For the cube particle a new analytical quasi static model<br />

describing its resonant behavior in terms <strong>of</strong> absorption<br />

and scattering cross section is presented. The results<br />

obtained by the analytical model are compared to the<br />

other ones performed through proper full-wave<br />

simulations [10] and by using the boundary integral<br />

method approach [11].<br />

The electromagnetic behavior <strong>of</strong> the device is evaluated<br />

for different inter-particle distance. In particular, the far<br />

field properties and the near electric field distribution are<br />

numerically obtained and the performances <strong>of</strong> the<br />

structure are analyzed for possible optoelectronics<br />

applications (design <strong>of</strong> absorbing layers) and for<br />

biosensing applications (refractive index measurements).<br />

II. QUASI STATIC ANALYTICAL MODEL FOR THE<br />

CUBE PARTICLE<br />

In general the nanoparticles have a size smaller compared<br />

to wavelength (e.g. at optical frequencies) so, it is<br />

possible to assume that all the conduction electrons in a<br />

nanoparticle see the same field at a given time (quasi –<br />

static approximation).<br />

Figure 1: Scheme <strong>of</strong> interaction between electromagnetic field and<br />

small particles compared to wavelength.<br />

The displacement <strong>of</strong> the electrons by incident<br />

electromagnetic field induces a dipolar charge separation<br />

(positive nuclei – free electrons) generating a restoring<br />

force which conflicts with incident field. The electron<br />

position is determined by the following equation:<br />

<br />

<br />

(1)<br />

<br />

where is the electron mass, is the electron damping<br />

coefficient and is the restoring coefficient.<br />

The relation (1) is a second order inhomogeneous<br />

differential equation with the following solution for<br />

harmonic excitation:<br />

<br />

<br />

(2)<br />

where <br />

is the natural frequency <strong>of</strong> the system.


This model is equivalent to a classical mechanical<br />

oscillator and represents a good physical interpretation to<br />

understand the Localized Surface Plasmon Resonance<br />

(LSPR) phenomenon. The resonance condition is<br />

established for and the denominator <strong>of</strong> (2) tends<br />

to zero and the coefficient and are very difficult to<br />

evaluate and are implicitly related to<br />

geometry/electromagnetic properties <strong>of</strong> the particles and<br />

permittivity value <strong>of</strong> the dielectric environment.<br />

However, exploiting the limit <strong>of</strong> electrically small<br />

particles it is possible to evaluate the resonant behavior <strong>of</strong><br />

the cube nanoparticle in accurate way. In order to study<br />

such electromagnetic properties, in terms <strong>of</strong> scattering<br />

and absorption cross-section, the following assumptions<br />

will be done:<br />

the particle is homogeneous and the surrounding<br />

material is a homogeneous, isotropic and nonabsorbing<br />

medium.<br />

The impinging plane wave has the electric field E<br />

parallel and the propagation vector k perpendicular<br />

to the nanoparticle principal axis, as depicted in<br />

Figure 2.<br />

Figure 2: Geometrical sketch <strong>of</strong> the gold nanocube particle.<br />

Under such conditions, we can relate the macroscopic<br />

nanoparticle properties to the polarizability <strong>of</strong> the<br />

nanoparticle.<br />

It is well known that [12], in case <strong>of</strong> an arbitrary shaped<br />

particle, its polarizability can be expressed as:<br />

(3)<br />

where is the volume <strong>of</strong> the particle, the surrounding<br />

dielectric environment permittivity, the inclusion<br />

dielectric permittivity and is the depolarization factor.<br />

The nanoparticle polarizability strongly depends on the<br />

inclusion geometry, its metallic electromagnetic<br />

properties , and the permittivity <strong>of</strong> the surrounding<br />

dielectric environment . In particular, the factor <strong>of</strong><br />

a nanoparticle plays a critical role in the polarizability<br />

resonant behaviour for the LSPR strength.<br />

Starting from [12], it is possible to develop new<br />

analytical closed-form formulas for the scattering and<br />

absorption cross-section <strong>of</strong> the aforementioned particles.<br />

The general corresponding expressions read, respectively:<br />

<br />

<br />

(4)<br />

- 26 - 15th IGTE Symposium 2012<br />

where is the wavenumber, is the<br />

wavelength and is the refractive index <strong>of</strong> the<br />

surrounding dielectric environment. Im stands for<br />

"Imaginary part".<br />

By considering the electric field polarization <strong>of</strong> the<br />

impinging plane wave, the absorption cross-section reads<br />

[13]:<br />

<br />

<br />

<br />

<br />

where is:<br />

<br />

<br />

<br />

<br />

<br />

III. BOUNDARY ELEMENT METHOD APPROACH<br />

Under quasi - static approximation the electric field can<br />

be expressed through the scalar potential as:<br />

(5)<br />

(6)<br />

(7)<br />

For homogeneous isotropic frequency-dispersive media<br />

can be determined easily from the Laplace equation:<br />

<br />

(8)<br />

In fact, by assuming an impulsive source the solution <strong>of</strong><br />

(8) is well known through the Green function <br />

as:<br />

<br />

<br />

<br />

<br />

(9)<br />

where and are the position vector and source vector,<br />

respectively.<br />

However, if we have an inhomogeneous medium such as<br />

a nanoparticle embedded in a dielectric environment<br />

(Figure 3) the solutions (9) are also valid but need to be<br />

satisfied by appropriate boundary conditions.<br />

Figure 3: Gold nanocube particle embedded in a dielectric environment.


In [11] it is possible to evaluate the scalar potential for<br />

the inhomogeneous medium:<br />

<br />

<br />

<br />

(10)<br />

by adding an artificial charge distribution at the boundary<br />

<strong>of</strong> discontinuity, determined from the continuity <strong>of</strong><br />

the tangential electric field and normal component <strong>of</strong> the<br />

dielectric displacement [14].<br />

The expression (10) can be converted from boundary<br />

integrals to bounday elements. Following the procedure<br />

reported in [15] it is possible to discretize the particle<br />

boundary into small surface by assuming that surface<br />

charges are located at the center <strong>of</strong> the surface element.<br />

In this way, it is possible to obtain numerically for a<br />

given external excitation the surface charge density <br />

and, consequently, the near electric field distribution and<br />

the far field properties in terms <strong>of</strong> absorption, scattering<br />

and extinction cross sections.<br />

IV. RESULTS FOR THE SINGLE PARTICLE<br />

The electromagnetic properties for the cube particle are<br />

evaluated using the quasi static analytical model,<br />

boundary element method (BEM) approach [15] and are<br />

compared to the results obtained with full-wave<br />

numerical simulations [10].<br />

We have assumed that the structure is excited by an<br />

impinging plane wave as shown in Figure 2. In addition:<br />

for the cube particle, experimental values [16] <strong>of</strong> the<br />

complex permittivity function have been inserted;<br />

the surrounding dielectric medium is vacuum.<br />

Far field properties in terms <strong>of</strong> absorption and scattering<br />

cross - section are shown in Figure 4 and Figure 5.<br />

Figure 4: Absorption and scattering cross section spectra obtained with<br />

the analytical model (l=50 nm).<br />

- 27 - 15th IGTE Symposium 2012<br />

Figure 5: Absorption and scattering cross section spectra obtained<br />

through full-wave simulations (l=50nm).<br />

There is a good agreement among the results obtained<br />

with the analytical model (Figure 4) and full-wave<br />

simulations (Figure 5).<br />

Full-wave simulations are also compared with the<br />

numerical results obtained with the BEM as shown in<br />

Figure 6.<br />

Figure 6: Comparison between extinction spectra obtained with BEM<br />

and full-wave simulations (l=50nm).<br />

The difference among the results shown in Figure 6 could<br />

be associated to the different discretization <strong>of</strong> the edge <strong>of</strong><br />

the particle with these two approaches.<br />

Near electric field distribution is obtained through fullwave<br />

simulation as depicted in Figure 7.<br />

Figure 7: Near electric field distribution for a single nanocube particle<br />

(l=50nm). The incident electric field amplitude is 1 V/m.<br />

In Figure 7 is clearly shown the dipolar charge repartition<br />

according to the quasi-static approach.<br />

V. LSPR DEVICE<br />

To enhance the mechanism <strong>of</strong> the LSPR it is possible the<br />

use <strong>of</strong> inter-coupling among nanoparticles. Such effect


originates from the charge induction among two or more<br />

nanoparticles which interact stronger as they get closer to<br />

each other [17].<br />

To use this enhancement mechanism we propose a<br />

structure consisting in a linear chain <strong>of</strong> gold nanocubes<br />

deposited on a silica substrate, excited by a plane wave as<br />

depicted in Figure 8.<br />

Figure 8: Linear chain <strong>of</strong> gold nanocubes on silica substrate with<br />

a=500nm, b=100nm, l=50 nm, l/8


Figure 11: Absorption and scattering cross section spectra obtained with<br />

the full-wave simulations (d=l= 50nm).<br />

VII. BIOSENSING APPLICATION OF THE DEVICE<br />

By using very small inter-particle distance among the<br />

nanoparticles it is possible to obtain high scattering and<br />

low absorption efficiencies (Figure 12, TABLE I). These<br />

properties are very important for biosensing applications.<br />

In fact high absorption efficiency could heat the<br />

biological sample invalidating medical diagnosis.<br />

Figure 12: Absorption and scattering cross section spectra obtained with<br />

the full-wave simulations (d=l/8= 6.25nm).<br />

For biosensing application we suppose that the device<br />

(grey) is in direct contact with the biological sample<br />

under test (green) as depicted in Figure 13. The sensor<br />

behavior is related to the effective refractive index<br />

variation <strong>of</strong> the overall system "LSPR device - biological<br />

compound".<br />

Once the biological compound is placed on the device,<br />

the system "sensor-biological compound" is illuminated<br />

by an optical electromagnetic field (Figure 13). The<br />

detected signal has a new frequency position and its<br />

magnitude and amplitude width are both dependent on<br />

the different characteristics <strong>of</strong> the biological compound.<br />

- 29 - 15th IGTE Symposium 2012<br />

Figure 13: The sensing system operation scheme<br />

The biological sample used to test this device is an insilico<br />

replica with values or Refractive Index (RI) taken<br />

from the literature. In particular the RI values <strong>of</strong> rat<br />

mammary adipose and tumor tissue have been considered<br />

[18]. These data (TABLE II) were acquired using an<br />

interferometric imaging system (Optical Coherence<br />

Tomography - OCT technique).<br />

TABLE II<br />

Tissue type Refractive<br />

index<br />

(mean value)<br />

Tumor 1.39<br />

Adipose 1.467<br />

The data show that a difference exists between the RI <strong>of</strong> a<br />

adipose tissue and that <strong>of</strong> tumor tissue.<br />

The electromagnetic sensor response is evaluated in terms<br />

<strong>of</strong> extinction cross-section through full-wave simulations<br />

[10] as depicted in Figure 14.<br />

Figure 14: Extinction spectra for rat mammary cancer (RI=1.39) and<br />

adipose tissue (RI=1.467).<br />

As shown in Figure 14 the resonant peak shifts from 634<br />

nm for a tumor tissue to 650 nm for a regular adipose<br />

tissue. Sensitivity is evaluated as S=Δλ/Δn expressed in<br />

nm/RIU (Refractive Index Unit). In this case sensitivity<br />

reached 207nm/RIU.


Near electric field distribution obtained for this sensing<br />

platform (Figure 15) is less concentrated compared to the<br />

other one obtained for d=l (Figure 10).<br />

Figure 15: Near electric field distribution for d=l/8= 6.25 nm. The<br />

incident electric field amplitude is 1 V/m.<br />

This result is in accord to the prevailing scattering<br />

phenomenon (TABLE I).<br />

VIII. CONCLUSION<br />

In this paper a nanostructure device operating in the<br />

visible regime was proposed. The device consisting in a<br />

gold linear chain array <strong>of</strong> nanocubes, deposited on a silica<br />

substrate. In this way a near-field enhancement is<br />

obtained and analytical formulas to describe this<br />

phenomenon are presented.<br />

For the single nanoparticle good agreement among<br />

analytical results and numerical solutions was achieved.<br />

Exploiting electromagnetic properties <strong>of</strong> the device it was<br />

shown that the proposed structure could be successfully<br />

used as a biomedical sensor or as an optoelectronic<br />

device.<br />

[1]<br />

REFERENCES<br />

A. Moores and F. Goettmann, "The plasmon band in noble metal<br />

nanoparticles: an introduction to theory and applications," New<br />

Journal <strong>of</strong> Chemistry, vol. 30, pp. 1121-1132, 2006.<br />

[2] E. Hutter and J.H. Fendler, "Exploitation <strong>of</strong> Localized Surface<br />

Plasmon Resonance," Advanced Materials, vol. 16, pp. 1685-<br />

1706, 2004.<br />

[3] L.J. Sherry, S.-H. Chang, G.C. Schatz and R.P. Van Duyne,<br />

"Localized Surface Plasmon Resonance Spectroscopy <strong>of</strong> Single<br />

Silver Nanocubes," Nano Lett., vol. 5, pp. 2034–2038, 2005.<br />

[4] G. Mie, "Contributions to the optics <strong>of</strong> turbid media, particularly<br />

<strong>of</strong> colloidal metal solutions," Ann. Phys., vol. 25, pp. 377-445,<br />

1908.<br />

[5] M. Tréguer-Delapierre, J. Majimel, S. Mornet, E. Duguet and S.<br />

Ravaine, "Synthesis <strong>of</strong> non-spherical gold nanoparticles," Gold<br />

Bulletin, vol. 41, pp. 195-207, 2008.<br />

[6] W. Cai, T. Gao, H. Hong and J. Sun, “Application <strong>of</strong> gold<br />

nanoparticles in cancer nanotechnology,” Nanotechnology,<br />

[7]<br />

Science and Application, vol. 1, pp. 17-32, 2008.<br />

K.R. Catchpole and A. Polman, “Plasmonic solar cells,” Optics<br />

Express, vol. 16, pp. 21793-21800, 2008.<br />

[8] D.S. Grubisha, R.J. Lipert, H.-Y. Park, J. Driskell and M.D.<br />

Porter, "Femtomolar Detection <strong>of</strong> Prostate-Specific Antigen: An<br />

Immunoassay Based on Surface - Enhanced Raman Scattering and<br />

Immunogold Labels," Anal. Chem., vol. 75, pp. 5936-5943, 2003.<br />

[9] S. Kessentini, D. Barchiesi, T. Grosges and M. Lamy de la<br />

Chapelle, "Selective and Collaborative Optimization Methods for<br />

Plasmonics: A Comparison," PIERS Online, vol. 7, pp. 291-295,<br />

2011.<br />

[10] CST Computer Simulation <strong>Technology</strong>, www.cst.com<br />

- 30 - 15th IGTE Symposium 2012<br />

[11] U. Hohenester and J. Krenn, "Surface plasmon resonances <strong>of</strong><br />

single and coupled metallic nanoparticles: A boundary integral<br />

method approach," Phys. Rev. B, vol. 72, pp.195429, 2005.<br />

[12] A. Sihvola, "Electromagnetic Mixing Formulas and<br />

Applications," The Instution <strong>of</strong> Engineering and <strong>Technology</strong> -<br />

London, 2008.<br />

[13] L. La Spada, R. Iovine and L. Vegni, "Nanoparticle<br />

Electromagnetic Properties for Sensing Applications," Advances<br />

in Nanoparticles, vol. 1, pp. 9-14, 2012.<br />

[14] F.J. Garcìa de Abajo, "Retarded field calculation <strong>of</strong> electron<br />

energy loss in inhomogeneous dielectrics," Physical Review B,<br />

vol. 65, pp. 115418.1-115418.17, 2002.<br />

[15] U. Hohenester and A. Trugler, "MNPBEM- A Matlab toolbox for<br />

the simulation <strong>of</strong> plasmonic nanoparticles," Computer Physics<br />

Communications, vol. 183, pp. 370-381, 2012.<br />

[16] P.B. Johnson and R.W. Christy, “Optical Constants <strong>of</strong> the Noble<br />

Metals,” Phys. Rev. B, vol. 6, pp.4370-4379, 1972.<br />

[17] T. Chung, S.-Y. Lee, E.Y. Song, H. Chun and B. Lee, "Plasmonic<br />

Nanostructures for Nano-Scale Bio - Sensing," Sensors, vol. 11,<br />

pp. 10907-10929, 2011.<br />

[18] A.M. Zisk, E.J. Chaney and S.A. Boppart, "Refractive index <strong>of</strong><br />

carciogen-induced rat mammary tumours," Phys. Med. Biol., vol.<br />

51, pp. 2165-2177, 2006.


- 31 - 15th IGTE Symposium 2012<br />

Validation <strong>of</strong> measurements with conjugate heat<br />

transfer models<br />

M. Schrittwieser 1, 2 , O. Bíró 1, 2 , E. Farnleitner 3 , and G. Kastner 3<br />

1 Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />

2 Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />

Innfeldgasse 18 A-8010 <strong>Graz</strong>, Austria<br />

3 Andritz Hydro GmbH, Dr. Karl- Widdmann- Strasse 5, A-8160 Weiz, Austria<br />

E-mail: schrittwieser@tugraz.at<br />

Abstract— The paper presents a comparison <strong>of</strong> thermal measurements on three stator duct models <strong>of</strong> an electrical machine.<br />

These models differ from each other by the slot section components. The measurements show the advantages and<br />

disadvantages <strong>of</strong> different variations. In order to study the measurement results in detail, a comparison with Computational<br />

Fluid Dynamics (CFD) was conducted, where it was useful to apply the Conjugate Heat Transfer (CHT) method, because it<br />

takes the convection and conduction into account. Therefore the conditions for the numerical heat transfer model can be<br />

determined more realistically, especially for the temperature rise in the solid domains caused by losses.<br />

Index Terms— Fluid Flow, Measurement, Stators, Thermal Analysis<br />

I. INTRODUCTION<br />

Hydro generators located in water power plants<br />

produce electric power in the range <strong>of</strong> more than 10<br />

MVA. The arising losses lead to a temperature rise in the<br />

electrical machine. The temperature rise is caused by<br />

copper, hysteresis, eddy current and mechanical losses<br />

during the generator operating. The heat has to be<br />

discharged to ensure the operating characteristics and this<br />

is the purpose <strong>of</strong> the cooling scheme. For designing the<br />

cooling <strong>of</strong> a generator, thermal and air flow networks are<br />

mostly used. Therefore the used parameters have to be<br />

established theoretically, by measurements or by CFD.<br />

The temperature rise has to be handled by solving the<br />

energy equation with the focus on heat convection and<br />

heat conduction. The convective heat transfer coefficient<br />

(HTC) is one <strong>of</strong> the most important parameters <strong>of</strong> these<br />

networks and must be known accurately. Examples <strong>of</strong><br />

networks are presented in [1] and [2].<br />

In the last years several investigations have been<br />

carried out on the topic <strong>of</strong> heat transfer, especially for low<br />

power electrical machines. Two different methods have<br />

emerged to get information about the HTC. One uses<br />

thermal resistances, defined with the aid <strong>of</strong> temperatures<br />

gained by measurements [3], [4] and [5]. The other<br />

employs CFD calculations combined with measurements<br />

[6], [7] and [8].<br />

The convective HTC has been calculated for large<br />

numerical models with CFD at different parts in [9] and<br />

[10] where the numerical effort is very high due to the<br />

large number <strong>of</strong> nodes in the model. A special set-up <strong>of</strong><br />

boundary conditions has been tried to reduce the section<br />

to be analyzed for comparable results with special<br />

attention payed to the rotor stator interaction. Only the<br />

fluid material properties are significant in these CFD<br />

simulations and the temperatures have been defined at the<br />

walls as a boundary condition from measurements. The<br />

refinement <strong>of</strong> the mesh near the wall for calculating an<br />

exact heat transfer is very important. An indicator <strong>of</strong> the<br />

mesh density is the dimensionless wall distance y + which<br />

should be about y 1<br />

+ ≤ [11]. The primary reason for this<br />

is that the HTC is a function <strong>of</strong> the dimensionless wall<br />

distance [12].<br />

The heat transfer caused by conduction has been<br />

considered in several papers by the finite element method<br />

(FEM) [13], [14] and [15]. The advantage <strong>of</strong> CFD over<br />

FEM is the consideration <strong>of</strong> the actual wall heat transfer<br />

coefficient. The disadvantage <strong>of</strong> the CFD is that the<br />

losses cannot be considered, while FEM is capable <strong>of</strong><br />

this. Therefore the copper and iron losses have to be<br />

implemented differently in CFD e.g. using the conjugate<br />

heat transfer (CHT) method. The sources can be defined<br />

in the solid domains and the material properties play an<br />

important role for the CHT solution.<br />

This paper presents a mutual validation <strong>of</strong> calorimetric<br />

measurements and a numerical calculation. The CHT<br />

method (fluid and solid heat transfer) is applied to a stator<br />

duct model. The losses have been defined as sources in<br />

the solid domains. The main objective is to evaluate the<br />

slot geometries with different winding assemblies. All<br />

three models have been measured at 5 flow rate points to<br />

pinpoint their thermal characteristics.<br />

II. MEASUREMENT<br />

A simplified model <strong>of</strong> a stator section has been under<br />

experimental investigation at the ANDRITZ Hydro. The<br />

main objective <strong>of</strong> the measurements has been to find and<br />

compare the thermal characteristics <strong>of</strong> different winding<br />

assemblies.<br />

Air has been used as cooling fluid for the experimental<br />

set-up.<br />

A. Investigated model<br />

The laboratory model and the cooling scheme are<br />

shown in detail in Fig. 1. The cooling fluid streams from<br />

the Inlet through the measuring nozzle (a) to the<br />

temperature probe (b) and from there through rectangular<br />

channels <strong>of</strong> wood (c) into the stator duct model (d). After<br />

heat exchange the warm air streams through wood ducts<br />

to an outlet channel, which contains resistance thermo<br />

elements (f). The outer surface <strong>of</strong> the model has been


insulated (e) for reduction <strong>of</strong> secondary heat flux.<br />

Fig. 1: Calorimetric measurement and experimental set-up <strong>of</strong> the stator<br />

laboratory model; (a) measuring nozzle, (b) Pt-100 temperature probe,<br />

(c) wood channels; (d) stator duct model; (e) insulation <strong>of</strong> the model and<br />

(f) resistance thermo elements<br />

B. Measuring physical parameters<br />

The measuring nozzle defines the volume flow rate Vin<br />

immediately in front <strong>of</strong> the model inlet.<br />

The fluid temperature T and density has been<br />

measured at the inlet and outlet <strong>of</strong> the stator duct model.<br />

These calorimetric measurement data allow calculating<br />

the heat flux after reaching steady state. The energy<br />

exchange occurs in the stator duct model. Therefore, it is<br />

important to calculate also the solid temperature and fluid<br />

temperature in the ducts. Fig. 2 shows the positions <strong>of</strong> the<br />

temperature probes in the iron domain.<br />

Fig. 2: Position <strong>of</strong> measurement probes in the iron; (a) 1 st stator core, (b)<br />

2 nd stator core and (c) heating rod<br />

The stator model consists <strong>of</strong> a section including 5 slots<br />

in circumferential direction and 3 ventilation ducts with<br />

distance bars between the laminated iron sheets in axial<br />

direction. The temperature has been measured in two<br />

stator cores. Therefore, fifteen Pt-100 resistance<br />

thermometers with 20 mm probe length have been<br />

positioned at each stator core.<br />

The heat sources have been simulated with heating<br />

rods positioned in the winding bars made <strong>of</strong> solid copper.<br />

The source has been induced with heating rods positioned<br />

in a hole in the middle <strong>of</strong> the copper bars, see Fig. 3. The<br />

- 32 - 15th IGTE Symposium 2012<br />

length <strong>of</strong> the rod has been 100 mm, with a diameter <strong>of</strong> 6<br />

mm and a constant heat output. The upper and lower bars<br />

have been heated up to reach steady state. The heat output<br />

has been constant during the whole experiment.<br />

Fig. 3: Position <strong>of</strong> temperature measurement devices for the cooling<br />

fluid; solid temperature positions for measuring (a) copper temperature<br />

and (b) spacer temperature; position <strong>of</strong> (c) the heating rod<br />

Thereupon the temperatures have been measured in<br />

each copper bar with two NiCrNi thermocouples 60 mm<br />

in length. The temperature in the spacer has been<br />

measured by a Pt-100.<br />

C. Results <strong>of</strong> measurements<br />

Table I shows the measurement data obtained for the 5<br />

different operating points for each model under<br />

investigation. The temperature differences have been<br />

normalized by the fluid inlet temperature.<br />

Model A<br />

Model B<br />

Model C<br />

TABLE I<br />

CALORIMETRIC MEASUREMENT RESULTS<br />

Vin in Tout − TinTcopper<br />

−Tin<br />

m Tin<br />

Tin<br />

3 /s kg/m 3<br />

T − T<br />

iron in<br />

Tin<br />

0.080 1.130 0.12 1.39 0.27<br />

0.060 1.129 0.16 1.60 0.36<br />

0.040 1.138 0.26 1.99 0.55<br />

0.025 1.134 0.41 2.35 0.81<br />

0.015 1.133 0.68 2.96 1.30<br />

0.079 1.155 0.15 1.80 0.48<br />

0.061 1.154 0.21 2.04 0.60<br />

0.041 1.152 0.31 2.40 0.83<br />

0.025 1.154 0.51 2.94 1.22<br />

0.015 1.152 0.81 3.55 1.80<br />

0.078 1.155 0.15 1.80 0.51<br />

0.060 1.149 0.20 2.03 0.64<br />

0.041 1.147 0.31 2.34 0.85<br />

0.025 1.149 0.51 2.84 1.23<br />

0.015 1.151 0.81 3.39 1.77<br />

III. MODEL GEOMETRIES<br />

The measurement set-up has been implemented in<br />

ANSYS CFX [11].<br />

The whole numerical model is shown in Fig. 4. The<br />

cooling scheme is the same as during the measurements<br />

i.e. the wood channels have also been modeled. Adiabatic<br />

walls have been defined at the top and the bottom <strong>of</strong> the


numerical model in z-direction.<br />

In addition to this simulation, a pperiodic<br />

boundary<br />

condition has been defined at the surfaaces<br />

normal to the<br />

x-direction. The goal <strong>of</strong> this is to reduce<br />

the section to be<br />

analyzed (less number <strong>of</strong> nodes togeether<br />

with smaller<br />

elements).<br />

Fig. 4: CHT model<br />

Fig. 5 visualizes the numerical statoor<br />

model in detail.<br />

For the calculation, one slot section hass<br />

been investigated<br />

due to the inlet condition, which is the same for each slot<br />

section as in the measurement. The wwinding<br />

assemblies<br />

are nearly the same, i.e. copper bars (d) and (e) with<br />

insulation (f), spacer (g) between the wwinding<br />

bars and,<br />

for positioning in radial direction, the sllot<br />

wedge (h). The<br />

iron (b) and (c) is located at the top aand<br />

bottom <strong>of</strong> the<br />

fluid (a).<br />

The difference in the models is thee<br />

contact between<br />

insulation and the iron.<br />

Fig. 5: Numerical stator model consists <strong>of</strong> the (a) fluid in the stator duct,<br />

(b) iron teeth, (c) iron yoke, (d) top copper bar, (e) bottom copper bar,<br />

(f) insulation, (g) spacer between bars and (h) slott<br />

wedge<br />

The following models differ from eacch<br />

other in the type<br />

<strong>of</strong> the winding assembly. There are diffferent<br />

options for<br />

mounting the winding, which will be eexplained<br />

in detail<br />

for each model.<br />

A. Model A<br />

This is a model with an air gap (white) between<br />

insulation and iron teeth, see Fig. 6. TThis<br />

air gap has a<br />

constant length. The cooling fluid caan<br />

stream in axial<br />

direction from one duct to another due tto<br />

the air gap.<br />

Fig. 6: Model A with air gap (white)<br />

- 33 - 15th IGTE Symposium 2012<br />

B. Model B<br />

A ripple spring (white dashhed)<br />

is positioned on one<br />

side between the iron teeth annd<br />

the insulation instead <strong>of</strong><br />

the air gap, as shown in Fig. 7. This ripple spring has had<br />

a corrugation in diagonal direcction.<br />

This corrugation has<br />

been smoothed along the surfaace.<br />

The implementation <strong>of</strong><br />

this has been done with a thermmal<br />

resistance at the contact<br />

interface. This causes an asymmmetric<br />

energy transport and<br />

the fluxes are higher at the sidee<br />

without a ripple spring.<br />

Fig. 7: Model B with ripple spring (whhite<br />

dashed)<br />

C. Model C<br />

This model is similar to moodel<br />

A, with the difference<br />

that epoxy resin (white dotted) ) is present. This is shown<br />

in Fig. 8. In this case the air caan<br />

stream in axial direction<br />

through the air gap (white), likee<br />

in model A.<br />

Fig. 8: Model C with epoxy resin (whitte<br />

dotted) and air gap (white)<br />

IV. NUMERICAAL<br />

METHOD<br />

The material properties havee<br />

a significant influence on<br />

the numerical solution. In CFDD,<br />

the fluid properties play<br />

the most important role and tthe<br />

solid domains are not<br />

taken into account. Only the coonvection<br />

has an influence<br />

in such calculations and the connduction<br />

is not considered.<br />

The conjugate heat transferr<br />

method differs from the<br />

conventional CFD simulation iin<br />

the consideration <strong>of</strong> the<br />

heat conduction in the energyy<br />

equation. Therefore, the<br />

thermal conductivity has to bbe<br />

known and defined for<br />

each medium in the CFD code [16].<br />

A. Turbulence Model<br />

Computational Fluid Dynammics<br />

uses the Finite Volume<br />

Method for solving the transporrt<br />

equations:<br />

∂ ρ ∂ρ<br />

u j<br />

+<br />

∂t ∂x<br />

j<br />

= 0<br />

(1)<br />

∂ρu ∂ρuu<br />

i i j ∂ p ∂ρτ<br />

ij<br />

+ = + + ρ fi<br />

∂t ∂xj∂ x xi ∂x<br />

j<br />

(2)<br />

∂ρet ∂ρue i t<br />

+<br />

∂t ∂x ∂uip<br />

∂ ∂uiτij ∂qj<br />

=− + + ρuf<br />

i i + + Q(3)<br />

∂x<br />

∂x ∂x<br />

j j<br />

j j<br />

These equations can be solveed<br />

for laminar flows. If the<br />

velocity and all other parametters<br />

vary in a random and<br />

chaotic way, the regime is calleed<br />

turbulent [17]. For most<br />

problems, it is unnecessary to resolve the detailed<br />

turbulent fluctuations and it is sufficient to calculate the<br />

time averaged properties <strong>of</strong> the flow. Therefore, the


Reynolds Averaged Navier Stokes (RANS) equations<br />

have been used. The Reynolds Stress Tensor is another<br />

unknown variable and further equations must be defined<br />

for the solution to calculate the unknown parameters [18].<br />

In the present case the Shear Stress Transport (SST)<br />

turbulence model [19] has been used. The advantage <strong>of</strong><br />

the SST turbulence model is that it combines the<br />

advantages <strong>of</strong> the k- turbulence model in the free stream<br />

and the advantage <strong>of</strong> the k- turbulence model near the<br />

wall [20], [21].<br />

B. Model Configuration and Boundary Conditions<br />

The mass flow rate and the inlet temperature have been<br />

defined as measured, see Table I. The pressure at the<br />

outlet has been defined as ambient pressure.<br />

The heat output from the heating element has been<br />

defined on a length <strong>of</strong> 100 mm in the middle <strong>of</strong> the<br />

copper bars with a constant value gained from the<br />

measurement.<br />

C. Fluid Properties<br />

The specific heat capacity cp, the dynamic viscosity <br />

and the thermal conductivity have been defined as the<br />

following constant values in Table II.<br />

TABLE II<br />

AIR IDEAL GAS MATERIAL PARAMETERS<br />

cp <br />

J/kgK Pa s W/mK<br />

1004.4 1.831·10 -5 2.61·10 -5<br />

For an ideal gas, the density is calculated with the ideal<br />

gas equation [16]:<br />

n⋅p ρ =<br />

R ⋅T<br />

0<br />

abs<br />

- 34 - 15th IGTE Symposium 2012<br />

(4)<br />

dh = cp dT<br />

(5)<br />

Here, n is the molecular weight, pabs is the absolute<br />

pressure, T is the temperature, R0 is the universal gas<br />

constant and is the density.<br />

These material properties from Table 2 have been<br />

adapted to implement the temperature dependence <strong>of</strong> the<br />

streaming fluid. In this case the specific heat capacity cp<br />

is expressed by the zero pressure polynomial [11]<br />

cp<br />

2 3 4<br />

= a1+ a2T + aT 3 + a4T + aT 5 (6)<br />

R<br />

S<br />

with the temperature T in Kelvin and the gas constant for<br />

air Rs = 287.058 J/kgK and the following coefficients:<br />

a1 = 3.57 , a2 = -4.3·10 -4 K -1 ,<br />

a3 = -4.2·10 -8 K -2 , a4 = 3.1·10 -9 K -3 ,<br />

a5 = -2.4·10 -12 K -4 .<br />

The values for the viscosity are approximated by the<br />

Sutherlands formula<br />

nμ<br />

μ T0+ Sμ T <br />

= ,<br />

(7)<br />

μ0<br />

T + SμT0 <br />

similarly to the conductivity <br />

λ<br />

λ<br />

T + S T <br />

+ <br />

0 λ<br />

= <br />

0 T SλT0 In these formulas, S and S stand for the Sutherland<br />

constant and n and n for the appropriate exponents. For<br />

the reference viscosity and reference conductivity the<br />

following values has been chosen from a material<br />

property table [22] at the reference temperature T0=325 K<br />

which is close to the mean operating temperature <strong>of</strong> the<br />

cooling fluid.<br />

S=77.80 K , 0=1.97·10 -5 Pa s , n=1.57 ,<br />

S=60.71 K , 0=2.82·10 -3 W/mK , n=1.66 .<br />

The material properties are accurately approximated in<br />

the temperature range from about 260 K to 670 K with<br />

this approach. It is not recommended to use the same<br />

parameters outside this range <strong>of</strong> temperature [22].<br />

D. Solid Properties<br />

The CHT method solves the following transport<br />

equation in the solid domains:<br />

nλ<br />

∂ρh ∂ρu h ∂ ∂T<br />

+ = λ+ S<br />

∂t ∂x ∂x <br />

∂x<br />

<br />

t s t<br />

j E<br />

<br />

j j j<br />

The important parameter in this equation is the thermal<br />

conductivity , which has a great influence on the results<br />

<strong>of</strong> the heat conduction and have to be known exactly.<br />

These parameters have been defined as isotropic for the<br />

copper, insulation, spacer, slot wedge, ripple spring and<br />

epoxy resin and as anisotropic for the iron.<br />

V. COMPARING NUMERICAL RESULTS WITH<br />

MEASUREMENTS<br />

The following figures show a comparison <strong>of</strong><br />

measurement data (dashed line) and CHT solution data<br />

(solid line) for the three different parameters. The<br />

temperature values have been normalized in the following<br />

figures.<br />

A. Copper temperature<br />

The temperature difference in Fig. 9 is calculated with<br />

an average value <strong>of</strong> the copper temperature in the top and<br />

bottom bar and the fluid temperature at the inlet. The<br />

deviation is due to the copper temperature because the<br />

inlet temperature has been defined from the<br />

measurements for the CFD calculation and is exactly the<br />

same like in the measurement.<br />

An average deviation has been calculated with 1.98 %<br />

for model A, 0.95 % K for model B and 1.06 % for model<br />

C. The diagram shows that the differences between the<br />

models become smaller with a higher flow rate for the<br />

measurements contrary to the calculation. The difference<br />

<strong>of</strong> the results at the highest flow rate is calculated for<br />

model A with 3.54 %, for model B with 1.34 % and for<br />

model C with 1.57 %.<br />

.<br />

.<br />

(8)<br />

(9)


Normalized temperature difference<br />

1,1<br />

1,0<br />

0,9<br />

0,8<br />

0,7<br />

0,6<br />

Measurement A CHT A<br />

Measurement B CHT B<br />

Measurement C CHT C<br />

0,5<br />

0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />

Volume flow rate in m 3 /s<br />

Fig. 9: Normalized temperature difference between copper temperature<br />

and fluid inlet temperature for the three stator duct models<br />

The distribution <strong>of</strong> the temperature is plotted in Fig. 10<br />

for stator duct model A and B. The surface is defined at<br />

the middle <strong>of</strong> the first stator core (see Fig. 2) and the<br />

temperature is shown at the whole solid domains.<br />

Fig. 10: Temperature distribution through the middle <strong>of</strong> stator core 1<br />

with all parts; (a) model A, (b) model B and (c) model C<br />

The highest copper temperatures are found in model A<br />

(a). The asymmetric temperature in model B (b) is also<br />

- 35 - 15th IGTE Symposium 2012<br />

recognizable in the iron; the temperature in the iron is<br />

higher on the opposite side <strong>of</strong> the ripple spring (bottom<br />

side) caused by the higher heat flux. The epoxy resin in<br />

model C (c) contributes a lower temperature in the<br />

insulation than along the air gap (see detailed in Fig. 10<br />

c). This will have a positive effect on the properties <strong>of</strong> the<br />

insulation during the aging.<br />

B. Iron temperature<br />

The iron temperature has been calculated as an average<br />

value <strong>of</strong> all measuring points (Fig. 2). The difference to<br />

the fluid inlet temperature has been plotted as before. The<br />

average deviation is 13.20 % for model A, 6.04 % for<br />

model B and 3.52 % for model C. It is worth noting that<br />

model A has the highest deviation for the iron<br />

temperature, see Fig. 11. The deviation for each winding<br />

assembly decreases with a higher volume flow rate.<br />

Normalized temperature difference<br />

1,2<br />

1,1<br />

1,0<br />

0,9<br />

0,8<br />

0,7<br />

0,6<br />

0,5<br />

0,4<br />

0,3<br />

0,2<br />

0,1<br />

Measurement A CHT A<br />

Measurement B CHT B<br />

Measurement C CHT C<br />

0,0<br />

0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />

Volume flow rate in m<br />

Fig. 11: Temperature difference between iron temperature and fluid<br />

inlet temperature for the three stator duct models<br />

3 /s<br />

Normalized temperature difference<br />

C. Fluid temperatures<br />

The fluid temperature rise is shown in Fig. 12.<br />

1,2<br />

1,1<br />

1,0<br />

0,9<br />

0,8<br />

0,7<br />

0,6<br />

0,5<br />

0,4<br />

0,3<br />

0,2<br />

0,1<br />

Measurement A CHT A<br />

Measurement B CHT B<br />

Measurement C CHT C<br />

0,0<br />

0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />

Volume flow rate in m<br />

Fig. 12: Difference between fluid inlet and outlet temperature<br />

3 /s


The temperature difference has been calculated with the<br />

inlet and outlet temperature <strong>of</strong> the air. The difference<br />

decreases with the volume flow rate. The average value<br />

<strong>of</strong> the difference is 0.97 % for model A, 1.09 % for model<br />

B and 0.99 % for model C. The highest deviation at the<br />

lowest flow rate is about 2.84 % for model A, 3.47 % for<br />

model B and 3.10 % for model C.<br />

VI. CONCLUSION<br />

The paper has described the conjugate heat transfer<br />

method for a stator model example. The advantage <strong>of</strong><br />

using CHT is that the heat transfer coefficient is<br />

inherently solved and needs not be defined as constant at<br />

the surfaces.<br />

The comparison <strong>of</strong> the numerical solution shows a<br />

good agreement with measurements for each stator duct<br />

model. The average deviation <strong>of</strong> the temperature<br />

difference between copper and fluid inlet temperature has<br />

been less than about 1.4 % for all models. The<br />

temperature difference has been calculated between the<br />

iron and fluid inlet temperature. Therefore, the average<br />

deviation is under 8 %. The heating up <strong>of</strong> the air has been<br />

calculated with a difference less than 1.5 % and at the<br />

lowest operating point the difference reaches the maximal<br />

deviation <strong>of</strong> 3.1 % and the slightest deviation with the<br />

highest flow rate <strong>of</strong> about 0.2 %. This can be explained<br />

by the fact that steady state is reached faster for lower<br />

flow rates than for higher ones. Based on these results,<br />

the conclusion can be drawn that the best agreement is<br />

obtained for model C and the worst for model A. These<br />

investigations provide a determination <strong>of</strong> proper model<br />

conditions in the slot region, which can be used for<br />

further CHT researches.<br />

VII. ACKNOWLEDGMENT<br />

This work has been supported by the Christian Doppler<br />

Laboratory for Multiphysical Simulation, Analysis and<br />

Design <strong>of</strong> Electrical Machines (MuSicEl) and ANDRITZ<br />

Hydro GmbH.<br />

REFERENCES<br />

[1] E. Farnleitner and G. Kastner, “Contemporary methods <strong>of</strong><br />

ventilation design for pumped storage generators,“ e&I, vol. 127,<br />

no. 1-2, pp. 24-29, 2010, DOI: 10.1007/s00502-010-0711-8.<br />

[2] G. Traxler-Samek, R. Zickermann and A. Schwery, “Cooling<br />

airflow, losses, and temperatures in large air-cooled synchronous<br />

machines,“ IEEE Transactions on Industrial Electronics, vol. 57,<br />

no. 1, pp. 172-180, Jan. 2010.<br />

[3] C. Kral, T. G. Habetler, R. G. Harley, F. Pirker, G. Pasoli, H.<br />

Oberguggenberger and C. J. M. Fenz, “Rotor temperature<br />

estimation <strong>of</strong> squirrel-cage induction motors by means <strong>of</strong> a<br />

combined scheme <strong>of</strong> parameter estimation and a thermal<br />

equivalent model,“ IEEE Transactions on Industry Applications,<br />

vol. 40, no. 4, July-Aug. 2004.<br />

[4] D. Staton, A. Boglietti, and A. Cavagnio, ”Solving the motor<br />

difficult aspects <strong>of</strong> electric motor thermal analysis in small and<br />

medium size industrial induction motors,” IEEE Transactions on<br />

Energy Conversion, vol. 20, no. 3, Sept. 2005, DOI:<br />

10.1109/TEC.2005.847979.<br />

[5] A. Boglietti and A. Cavagnino, “Analysis <strong>of</strong> the endwinding<br />

cooling effects in TEFC induction motors,” IEEE Transactions in<br />

Industry Applications, vol. 43, no. 5, pp. 1214-1222, 2007, DOI:<br />

10.1109/TIA.2007.904399.<br />

[6] B.D.J. Maynes, R.J. Kee, C.E. Tindall and R.G. Kenny,<br />

“Simulation <strong>of</strong> airflow and heat transfer in small alternators using<br />

- 36 - 15th IGTE Symposium 2012<br />

CFD,” IEE <strong>Proceedings</strong>- Electric Power Applications, vol. 150,<br />

no. 2, pp. 146-152, 2003, DOI: 10.1049/ip-epa:20020754.<br />

[7] C. Kral, A. Haumer, M. Haigis, H. Lang and H. Kapeller,<br />

“Comparison <strong>of</strong> a CFD analysis and a thermal equivalent circuit<br />

model <strong>of</strong> a TEFC induction machine with measurements,“ IEEE<br />

Transactions on Energy Conversion, vol. 24, no. 4, pp. 809-818,<br />

Dec. 2009, DOI: 10.1109/TEC.2009.2025428.<br />

[8] M. Hettegger, B. Streibl, O. Biro and H. Neudorfer,<br />

“Measurements and simulations <strong>of</strong> the convective heat transfer<br />

coefficients on the end windings <strong>of</strong> an electrical machine,“ IEEE<br />

Transactions on Industrial Electronics, vol. 59, no. 5, pp. 2299-<br />

2308, May 2012, DOI: 10.1109/TIE.2011.2161656.<br />

[9] M. Schrittwieser, A. Marn, E. Farnleitner and G. Kastner,<br />

“Numerical analysis <strong>of</strong> heat transfer and flow <strong>of</strong> stator duct<br />

models,” XX th International Conference on Electrical Machines,<br />

Sept. 2012.<br />

[10] S. Klomberg, E. Farnleitner, G. Kastner and O. Bìrò, “Heat<br />

transfer analysis on end windings <strong>of</strong> a hydro generator using a<br />

stator-slot-section model,” 15 th IGTE Symposium, Sept. 2012<br />

[11] ANSYS Inc., “ANSYS CFX- Solver Modeling Guide”, Release<br />

13.0, ANSYS Inc.<br />

[12] W. Vieser, T. Esch and F. Menter, “Heat transfer predictions<br />

using advanced two equation turbulence models,“ CFX Technical<br />

Memorandum, 2002.<br />

[13] C.C. Hwang, S. Wu and Y. Jiang, “Novel approach to the solution<br />

<strong>of</strong> temperature distribution in the stator <strong>of</strong> an induction motor,”<br />

IEEE Transactions on Energy Conversion, vol. 15, no. 4, pp. 401-<br />

406, Dec. 2000, DOI: 10.1109/60.900500.<br />

[14] S. Mezani, N. Takorabet and B. Laporte, “A combined<br />

electromagnetic and thermal analysis <strong>of</strong> induction motors,” IEEE<br />

Transactions on Magnetics, vol. 41, no. 5, pp. 1572-1575, May<br />

2005, DOI: 10.1109/TMAG.2005.845044.<br />

[15] L. Weili, C. Guan and P. Zheng, “Calculation <strong>of</strong> a Complex 3-D<br />

Model <strong>of</strong> a turbogenerator with end region regarding electrical<br />

losses, cooling, and heating,” IEEE Transactions on Energy<br />

Conversion, vol. 26, no. 4, pp. 1073-1080, Dec. 2011, DOI:<br />

10.1109/TEC.2011.2161610.<br />

[16] ANSYS Inc., “ANSYS CFX- Solver Theory Guide,” Release<br />

13.0, ANSYS Inc.<br />

[17] F. Kreith and M. S. Bohn, Principles <strong>of</strong> Heat Transfer, 6 th ed.<br />

Southbank, Australia: Thomson Learn., 2001.<br />

[18] H. K. Versteeg and W. Malalasekera, “An Introduction to<br />

Computational Fluid Dynamics – The finite volume method”,<br />

[19] F. R. Menter, “Two- equation eddy- viscosity turbulence models<br />

for Engineering Applications,” AIAA Journal, vol. 32, pp. 1598-<br />

1605, Aug. 1994.<br />

[20] D. C. Wilcox, Turbulence Modeling for CFD: Solutions Manual.<br />

2 nd Edition, La Canada, CA: DCW Industries Inc., 1994.<br />

[21] B. Launder and D. Spalding, Mathematical models <strong>of</strong> turbulence.<br />

London, U.K.: Academic Press, 1972.<br />

[22] VDI Heat Atlas, 10 th ed. Berlin, Germany: Springer- Verlag, 2006.


- 37 - 15th IGTE Symposium 2012<br />

Computing the shielding effectiveness <strong>of</strong> waveguides using FE-mesh<br />

truncation by surface operator implementation<br />

C. Tuerk∗ , W. Renhart † , and C. Magele †<br />

∗Armament and Defence <strong>Technology</strong> Agency, Ministry <strong>of</strong> Defence and Sports, Rossauer Laende 1, A-1090 Vienna<br />

† Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong><br />

E-mail: christian.tuerk@bmlvs.gv.at<br />

Abstract—A plane wave incident perpendicular to one open end <strong>of</strong> a conductive tube, as part <strong>of</strong> a honeycomb-structure, is<br />

attenuated on its way through it. In order to calculate its total attenuation for various frequencies the FE-method will be<br />

used. This requires a reflectionless truncation <strong>of</strong> the FE-mesh for which a Surface Operator Boundary Condition (SOBC)<br />

will be employed. In order to show the accuracy and applicability <strong>of</strong> the FEM with SOBC, the results will be compared<br />

to entirely analytical solutions as well as to easy-to-use engineering formulae.<br />

Index Terms—Finite Element Method (FEM), Shielding, Surface Operator Boundary Condition (SOBC), Waveguide<br />

I. INTRODUCTION<br />

Previous works i.e. [1] have shown the implementation<br />

<strong>of</strong> a surface operator boundary condition derived from<br />

an analytical model into the FE-mesh. Honeycombs can<br />

be considered waveguides-beyond-cut<strong>of</strong>f (WBC) and are<br />

therefore employed as vents for large shielded enclosures,<br />

like shielded rooms, while maintaining a certain degree<br />

<strong>of</strong> attenuation <strong>of</strong> a plane wave incident on it.<br />

The resulting attenuation imposed by a single conductive<br />

tube will be calculated under different ratios <strong>of</strong><br />

length-to-diameter <strong>of</strong> the tube and at selected frequencies.<br />

Existing literature like [2] provide engineering rules for<br />

designing waveguides-beyond-cut<strong>of</strong>f (WBC) as a shielding<br />

component whereas others like [3] present analytical<br />

details on the physics <strong>of</strong> the transmission <strong>of</strong> electromagnetic<br />

power in waveguides <strong>of</strong> various cross-sections. The<br />

results, obtained through numerical computation in the<br />

frequency range <strong>of</strong> 3GHz to 18GHz for a practical<br />

design <strong>of</strong> a real life waveguide are then compared to<br />

both approaches and subsequently discussed.<br />

II. MODELLING<br />

A. Surface Operator Boundary Conditions (SOBC)<br />

Fig. 1 shows the setup used for the computation <strong>of</strong> the<br />

plane wave field strength incident on the tube and exiting<br />

it. The right-hand-side boundary is modelled by means<br />

<strong>of</strong> SOBCs (Γtr) matching the impedance <strong>of</strong> free space<br />

between the tube and the termination <strong>of</strong> the problem area.<br />

A plane wave travelling along the x-axis will experience<br />

a certain degree <strong>of</strong> attenuation by the waveguide as long<br />

as the waveguide-beyond-cut<strong>of</strong>f (WBC) condition is met.<br />

A fraction <strong>of</strong> the initial power <strong>of</strong> the wave penetrates the<br />

waveguide and is terminated reflectionless at Γtr.<br />

Based on results obtained through [5] and [4] the implementation<br />

<strong>of</strong> this surface operator boundary condition<br />

for the truncation surface Γtr <strong>of</strong> the FE-mesh can be<br />

directly derived from the Maxwell equations<br />

∇× E = −jωμ H, ∇× H = jωɛ E. (1)<br />

After splitting the field vectors E and H as well as the<br />

∇-operator into their normal and orthogonal tangential<br />

Fig. 1. Modelling the Waveguide<br />

components as in 2<br />

E = Et + nEn, H = Ht + nHn, ∇ = ∇t + ∂<br />

n (2)<br />

∂n<br />

the Maxwell Equations can be reformulated as follows:<br />

n Hn = − 1<br />

jωμ ∇t × Et<br />

(3)<br />

n En = 1<br />

jωɛ ∇t × Ht. (4)<br />

With these relations the normal components <strong>of</strong> the field<br />

components En and Hn can be eliminated in equation 1.<br />

A couple <strong>of</strong> mathematical operations finally yield<br />

∂(n × Et)<br />

= −jωμ<br />

∂n<br />

Ht − 1<br />

jωɛ∇t × (∇t × Ht) (5)<br />

∂(n × Ht)<br />

= jωɛ<br />

∂n<br />

Et + 1<br />

jωμ∇t × (∇t × Et). (6)<br />

These equations are commonly valid, consequently on the<br />

truncation surface (see fig. 1) too. On Γtr the situation<br />

is as shown in fig. 2 in a local coordinate system.<br />

The propagation <strong>of</strong> the wave can be represented by the<br />

wave vector k. Due to the knowledge <strong>of</strong> the angle <strong>of</strong><br />

incidence on Γtr it can be decomposed into its normal<br />

and tangential components as given in the following set<br />

<strong>of</strong> equations:<br />

k = kt<br />

+ <br />

β, β = ± k2 − kt 2 , k = ω √ μɛ. (7)


Fig. 2. Wave at any point on Γtr<br />

The surface normal n is represented by the local coordinate<br />

ζ. In order to get rid <strong>of</strong> the ∂<br />

∂n term on the left-handside<br />

<strong>of</strong> equation 5 and equation 6 an integration along ζ<br />

over the half-space must be performed. Assuming a lossy<br />

media, the field components must decay to zero at infinity<br />

which allows for<br />

∞<br />

Ht0e <br />

ζ=0<br />

−jβζ dζ = 1<br />

jβ Ht0<br />

(8)<br />

∞<br />

Et0e −jβζ dζ = 1<br />

jβ Et0.<br />

ζ=0<br />

<br />

Ht0 and <br />

Et0 are the tangential field vectors at ζ =0.<br />

Together with equation 7, relations 5 and 6 can now be<br />

rewritten as<br />

n × <br />

Et0 =<br />

n × <br />

Ht0 =<br />

−ωμHt0 <br />

<br />

k2 − kt 2 + ∇t × (∇t × Ht0)<br />

ωɛ k2 − kt 2<br />

(9)<br />

ωɛEt0 <br />

<br />

k2 − kt 2 − ∇t × (∇t × Et0)<br />

ωμ k2 − kt 2<br />

. (10)<br />

Transverse components <strong>of</strong> the outgoing wave may be<br />

transformed into the Fourier domain, only to see, that its<br />

tangential derivatives can be expressed as ∇t = −j kt.<br />

Substitution in equation 9 and 10 leads to<br />

n × <br />

Et0 =<br />

n × <br />

Ht0 =<br />

−ωμHt0 <br />

<br />

k2 − kt 2 − kt × ( kt × Ht0)<br />

ωɛ k2 − kt 2<br />

(11)<br />

ωɛEt0 <br />

<br />

k2 − kt 2 + kt × ( kt × Et0)<br />

ωμ k2 .<br />

2<br />

− kt<br />

(12)<br />

These relations between the tangential components <strong>of</strong><br />

Et0<br />

and Ht0 can now be used to model the so called<br />

surface operator boundary conditions (SOBC) on Γtr.<br />

Equations 11 and 12 allow for any angle <strong>of</strong> incidence <strong>of</strong><br />

the plane wave on a truncating surface Γtr. Since only<br />

perpendicular incidence on the waveguide and on Γtr are<br />

considered, the use <strong>of</strong> a first-order SOBC is reasonable<br />

- kt =0.<br />

Application <strong>of</strong> the Galerkin method to the well-known<br />

A, v-formulation makes use <strong>of</strong> the n × Ht on the Neu-<br />

- 38 - 15th IGTE Symposium 2012<br />

mann Boundary (see [6]).<br />

− <br />

+ <br />

Ω<br />

+ <br />

ΓH<br />

Ω<br />

∇× Ni · 1<br />

μ ∇× AdΩ<br />

Ni · (n × ( 1<br />

μ ∇× A)) dΓ<br />

<br />

n× H<br />

Ni · (σ + jωɛ)jω( A −∇v)dΩ =0. (13)<br />

On the Neumann boundary (ΓH) the underbraced term<br />

in equation 13 is substituted by the Fourier transformed<br />

integral <strong>of</strong> equation 6 which prescribes the truncation <strong>of</strong><br />

the FE-mesh directly.<br />

B. Surface Impedance Boundary Conditions (SIBC)<br />

An increased incident angle results always in a larger<br />

wave vector kt and obviously the curl curl-terms in<br />

equations 11 and 12 become more and more relevance<br />

to achieve accurate boundary conditions. If the wave<br />

propagates perpendicularly to Γtr, the vector kt equals<br />

zero. This is the considered case for all results presented<br />

herein. Hence the second term on the right-hand-side in<br />

equations 11 and 12 equal zero and first order SIBCs<br />

remain:<br />

n × <br />

<br />

−ωμHt0 μ<br />

Et0 = = −<br />

k<br />

ɛ Ht0 = −Z0 Ht0 (14)<br />

n × <br />

ωɛ <br />

<br />

Et0 ɛ<br />

Ht0 = = −<br />

k μ Et0 = 1<br />

Et0<br />

(15)<br />

Z0<br />

The impedance <strong>of</strong> the mesh-terminating plane Γtr can<br />

now be directly prescribed.<br />

III. SETUP<br />

Fig. 1 shows the setup used for the computation <strong>of</strong> the<br />

plane wave field strength incident on the tube and exiting<br />

it. The right-hand-side <strong>of</strong> the problem area is terminated<br />

by means <strong>of</strong> the introduced SOBC. A plane wave originating<br />

from the stimulus plane penetrates the tube. Only<br />

a fraction <strong>of</strong> the incident power ”leaks” through it, since<br />

at the frequencies considered it represents a waveguidebeyond-cut<strong>of</strong>f<br />

(WBC). This small fraction <strong>of</strong> the incident<br />

wave is terminated reflectionless at Γtr. The detail <strong>of</strong> the<br />

aluminium tube with a square cross-section and lengths<br />

ranging from 20mm ... 80mm is shown in fig. 3.<br />

The grid shown in figure 3 represents the macro<br />

elements used for modelling only.<br />

IV. RESULTS<br />

A. Finite Element Method with SOBC<br />

Since frequencies above 1GHz are <strong>of</strong> interest, simulations<br />

at distinct frequencies in the range <strong>of</strong> 3 ... 18GHz at<br />

a stepwidth <strong>of</strong> 3GHz are considered. At each frequency<br />

the length <strong>of</strong> the tube is stepped through by 10mm in<br />

the range between 20mm and 80mm. The cross-section<br />

<strong>of</strong> the waveguide is kept constant. Fig. 4 shows the<br />

resulting attenuation <strong>of</strong> a plane wave on its way through


Fig. 3. Details <strong>of</strong> the waveguide-beyond-cut<strong>of</strong>f<br />

the WBC. At 15GHz the attenuation <strong>of</strong> the incident<br />

wave starts to approach zero and the tube becomes a<br />

waveguide as known from RF-applications and has also<br />

been described in [3]. As long as the frequencies are<br />

Fig. 4. Attenuation <strong>of</strong> a plane wave at distinct lengths and frequencies<br />

below the cut<strong>of</strong>f-frequency, the attenuation does not only<br />

depend on the ratio between f, the frequency used, and<br />

the cut<strong>of</strong>f-frequency fc <strong>of</strong> the structure, but also depends<br />

on the length <strong>of</strong> the tube. The relationship is non-linear<br />

and therefore clearly contrasting the engineering rules<strong>of</strong>-thumb<br />

as provided in the following section.<br />

The following figure (Fig. 5) shows the computation<br />

<strong>of</strong> the field strengths on either side <strong>of</strong> the waveguidebeyond-cut<strong>of</strong>f.<br />

It is operated at 9GHz and the righthand-side<br />

is terminated by means <strong>of</strong> the SOBC described<br />

before. The colors in the figure refer to the absolute value<br />

<strong>of</strong> the field strengths <strong>of</strong> the electrical component <strong>of</strong> the<br />

plane wave at a particular moment. Due to the necessity<br />

<strong>of</strong> a fine mesh for the computation <strong>of</strong> fields along the<br />

waveguide (coloured grey), no field strengths are visible.<br />

Following the general formula <strong>of</strong> the power density <strong>of</strong> a<br />

plane wave<br />

S = 1<br />

2 Re( E × H∗ ) (16)<br />

and the impedance <strong>of</strong> free space <strong>of</strong><br />

Z0 =<br />

μ0<br />

ɛ0<br />

- 39 - 15th IGTE Symposium 2012<br />

Fig. 5. A waveguide 30mm in length, operated at 9GHz<br />

the attenuation <strong>of</strong> the power through the waveguide can<br />

be calculated. With<br />

at =20lg |Emaxin|<br />

|Emaxout|<br />

(17)<br />

the degree <strong>of</strong> the attenuation (at)[dB] can be determined<br />

based on the field strength <strong>of</strong> the electrical component<br />

<strong>of</strong> the plane wave on the left-hand-side <strong>of</strong> the tube<br />

(|Emaxin|) and on the right-hand-side (|Emaxout|). The<br />

maxima <strong>of</strong> the respective field strengths are taken from<br />

a line parallel to the x-axis along the centre <strong>of</strong> the tube.<br />

B. Engineering Rules<br />

For applications using frequencies below approximately<br />

1GHz [2] proposes the use <strong>of</strong> simple ”design<br />

rules”:<br />

fc = 150<br />

b , fc[GHz], diameter[mm] (18)<br />

at = 27.3<br />

b l, at[dB],<br />

b =<br />

diameter, length[mm] (19)<br />

√ 2a, forsquare cross − section[mm]<br />

f ≤ fc<br />

10 , usablefrequencyf (20)<br />

with fc being the cut<strong>of</strong>f-frequency in [GHz], at representing<br />

the shielding effectiveness in [dB] and any<br />

dimension given in [mm]. Formulae 18 to 20 show that<br />

the cut<strong>of</strong>f-frequency only depends on the diameter <strong>of</strong> the<br />

tube which is, to some degree, in accordance with [3].<br />

It has to be distinguished whether a square, rectangular<br />

or circular waveguide is used. As for the rectangular<br />

cross-sections [3] reads that the larger dimension governs<br />

the cut<strong>of</strong>f-frequency fc. For circular shapes the diameter<br />

counts. One may also have noticed that the engineering<br />

rules do not account for any matter in the waveguide<br />

but free space. Since the WBC is used as a vent with<br />

shielding properties its cut<strong>of</strong>f-wavelength follows<br />

λc = c0<br />

fc<br />

≈ 2a. (21)<br />

This is in line with [3] and equation 22 if ɛ = ɛ0 and<br />

μ = μ0. Waveguides filled with dielectric matter for<br />

transmission properties are beyond the scope <strong>of</strong> this work<br />

since they are neither useful as vents nor as a shielding<br />

component.


As long as the frequency <strong>of</strong> interest is below the<br />

highest usable frequency as given in equation 20 the<br />

tube yields an attenuation according to equation 19.<br />

Application <strong>of</strong> this set <strong>of</strong> formulae to the waveguide<br />

under consideration at 12GHz provides the following<br />

graph (fig. 6):<br />

Fig. 6. Engineering rules applied at 12GHz<br />

Figure 6 shows the application <strong>of</strong> the engineering rules<br />

at the cut<strong>of</strong>f-frequency fc = 12GHz. The calculation<br />

<strong>of</strong> the shielding effectiveness with the engineering rules<br />

(blue dashed line) naturally exceed the limits obtained<br />

by means <strong>of</strong> the numerical value since equation 20 has<br />

not been considered so far. This equation is obviously a<br />

very rough estimate <strong>of</strong> the maximum usable frequency.<br />

It requires this waveguide not to be used above 1.2GHz.<br />

This is very conservative, since the green solid line<br />

(the uppermost line) shows the course <strong>of</strong> the shielding<br />

effectiveness at 9GHz <strong>of</strong> this particular waveguide. The<br />

engineering rules yield similar results, but on the safe<br />

side. Since it is not clear which limit in terms <strong>of</strong> shielding<br />

effectiveness underlies this set <strong>of</strong> easy-to-use engineering<br />

rules, one has to be very careful with its application. Even<br />

if it was possible to adjust equation 20 to this result, the<br />

behaviour <strong>of</strong> a waveguide may render this unreliable due<br />

to its nonlinear attenutation <strong>of</strong> a plane wave as fig. 4<br />

clearly shows.<br />

C. Analytical Approach<br />

When considering a waveguide-beyond-cut<strong>of</strong>f (WBC)<br />

for shielding purposes, the lowest mode <strong>of</strong> a TE or TMwave<br />

propagating through it is <strong>of</strong> interest. It represents<br />

the cut<strong>of</strong>f-frequency fc. For waveguides with a square<br />

cross-section [3] reads for TE10-mode<br />

fc = 1<br />

2 √ 1<br />

(22)<br />

ɛμ a<br />

with a being the length <strong>of</strong> the edge <strong>of</strong> the square. For a<br />

waveguide as used for this work, fc =14.99GHz which<br />

matches the result shown in figure 4. With increasing<br />

frequencies the attenuation <strong>of</strong> the plane waves vanishes<br />

above approximately 15GHz regardless <strong>of</strong> the length <strong>of</strong><br />

it. In other words, illuminating this particular waveguide<br />

at frequencies ≥ 15GHz will render it useless as a shield.<br />

- 40 - 15th IGTE Symposium 2012<br />

Since waveguides are generally used for transmission<br />

<strong>of</strong> electromagnetic energy there are, apart from the engineering<br />

rules above, no analytical formulations available<br />

to determine the attenuation <strong>of</strong> a plane wave penetrating a<br />

waveguide below its cut<strong>of</strong>f-frequency - there is no distinct<br />

mode <strong>of</strong> energy flow in the waveguide. For the same<br />

reason there are no analytical formulations known for<br />

plane waves penetrating a waveguide at other angles than<br />

perpendicular to the cross-section <strong>of</strong> it (see section V).<br />

V. CONCLUSION<br />

This paper shows how Surface Operator Boundary<br />

Conditions (SOBC) can be implemented in an A − v<br />

formulation to be used with the Galerkin method. The<br />

SOBC are used to model a Neumann Boundary Condition<br />

which allows for reflectionless termination <strong>of</strong> a problem<br />

area. The use <strong>of</strong> the SOBC allows for a significant<br />

speed-up <strong>of</strong> the computation <strong>of</strong> the problem because the<br />

absorbing boundary is only a single term which does not<br />

require additional finite elements to be modelled. For the<br />

construction <strong>of</strong> vents in a shielded room, waveguides below<br />

their cut<strong>of</strong>f-frequencies are employed. The described<br />

model has been used for the computation <strong>of</strong> the shielding<br />

effectiveness <strong>of</strong> waveguides at frequencies exceeding<br />

1GHz and compared and contrasted to an analytical<br />

approach and a set <strong>of</strong> easy-to-use engineering rules. It<br />

can now clearly be shown, that well known and verified<br />

analytical solutions can be met by numerical models<br />

as far as the cut<strong>of</strong>f-frequency <strong>of</strong> square waveguides is<br />

concerned. By the same token, it can be shown that<br />

simple design rules are very conservative i.e. delivering<br />

smaller numbers <strong>of</strong> shielding attenuation than actually<br />

can be yielded in real designs. It can not be said, that this<br />

set <strong>of</strong> easy-to-use rules are valid only below ≈ 1GHz.<br />

So far, only plane waves incident perpendicular to<br />

an open end <strong>of</strong> the waveguide have been modelled and<br />

computed. Future efforts will be put on different angles<br />

<strong>of</strong> incidence. There exist hints, that stacked arrays <strong>of</strong><br />

waveguides (honeycomb structures) suffer a deterioration<br />

<strong>of</strong> total shielding effectiveness compared to the attenuation<br />

provided by a single tube. This behaviour may also<br />

be investigated in the future.<br />

REFERENCES<br />

[1] W. Renhart, C. Magele and C. Tuerk, ”Thin Layer Transition<br />

Matrix Description Applied to the Finite Element Method”,IEEE<br />

Trans on Magn., Vol. 45, No. 3, 2009, pp. 1638- 1641.<br />

[2] Louis T. Gnecco, ”The Design <strong>of</strong> Shielded Enclosures”, Newnes<br />

Press, ISBN 0-7506-7270-6<br />

[3] Karoly Simonyi, ”Theoretische Elektrotechnik”, 6. Auflage, VEB<br />

Deutscher Verlag der Wissenschaften, Berlin 1977<br />

[4] W. Renhart, C. Magele and C. Tuerk, ”Improved FE-mesh truncation<br />

by surface operator implementation to speed up antenna<br />

design” (unpublished).<br />

[5] Sergei Tretyakov, Analytical Modeling in Applied Electromagnetics,<br />

1st ed. Artech House, chapters 2, 3, 2003.<br />

[6] O. Biro, ”Edge element formulations <strong>of</strong> eddy current problems”,<br />

Computer methods in applied mechanics and engineering, vol. 169,<br />

pp. 391-405, 1999.


- 41 - 15th IGTE Symposium 2012<br />

Heat Transfer Analysis on End Windings <strong>of</strong> a Hydro<br />

Generator using a Stator-Slot-Sector Model<br />

1, 2 Stephan Klomberg, 3 Ernst Farnleitner, 3 Gebhard Kastner, 1, 2 Oszkár Bíró<br />

1 Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />

Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />

2 Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />

3 Andritz Hydro GmbH, Dr.-Karl-Widdmann-Strasse 5, A-8160 Weiz, Austria<br />

E-mail: stephan.klomberg@tugraz.at<br />

Abstract — An accurate evaluation <strong>of</strong> the convective heat transfer coefficient on end windings needs usually large numerical<br />

models. These calculations involve an enormous amount <strong>of</strong> time and are not feasible for finding correlations between the<br />

convective heat transfer coefficient, massflow, rotational speed and geometry. On the basis <strong>of</strong> a parameter study this paper<br />

shows that a simplified stator-slot-sector model is equally accurate as a pole-sector-model but more practicable and faster.<br />

Index Terms—Cooling, Electric machines, Fluid dynamics,<br />

Heat transfer.<br />

I. INTRODUCTION<br />

Large hydro generators are working with a high<br />

efficiency nevertheless the losses <strong>of</strong> these machines can<br />

reach up to several MW’s. These heat losses must be<br />

dissipated from the generator. Designing the cooling <strong>of</strong> a<br />

generator is nowadays well-engineered. The use <strong>of</strong> flow<br />

and thermal networks in this subject is state <strong>of</strong> the art [1].<br />

Flow networks decompose the complex geometry in<br />

discrete network elements to calculate the air flow<br />

through a machine. They include pressure generating<br />

elements (active) like fans and poles or other rotating<br />

components and passive elements like ducts, inlets or<br />

outlets. The fundamentals <strong>of</strong> these components are<br />

determined theoretically, by measurements or<br />

computational fluid dynamics (CFD). The computation <strong>of</strong><br />

the temperature rise in the active parts is handled with<br />

thermal networks where the convective heat transfer and<br />

heat conduction coefficients and a reference air<br />

temperature are major parameters. Examples about flow<br />

and thermal networks are found in [1] and [2].<br />

Strictly speaking, the most important factor, the<br />

convective heat transfer coefficient, has to be calculated<br />

with large numerical generator models. Analyzing these<br />

models needs much time and is inappropriate for<br />

parameter studies. The need <strong>of</strong> coefficients for the<br />

networks makes the consideration <strong>of</strong> an equivalent<br />

smaller model enabling a faster calculation rational.<br />

The standard equation for the convective wall heat<br />

transfer coefficient is<br />

q W<br />

=<br />

(1)<br />

TW- Tref According to [3] this equations is applicable for forced<br />

convection if q W is the wall heat flux density, TWall the<br />

wall temperature and Tref a reference temperature in a<br />

properly control surface in the calculating volume.<br />

In the last years, several investigations have been<br />

carried out on the topic <strong>of</strong> heat transfer on end windings<br />

<strong>of</strong> electrical machines especially for totally enclosed fan<br />

cooled induction motors. The most development has been<br />

on smaller machines in a power class <strong>of</strong> a few kVA. One<br />

method obtains the heat transfer coefficients by<br />

measuring temperatures and implements these<br />

temperatures in thermal resistances [4], [5]. A second<br />

kind <strong>of</strong> approach involves CFD calculations combined<br />

with measurements [6], [7]. The end winding heat<br />

transfer <strong>of</strong> large hydro generators have not yet been<br />

investigated.<br />

The large hydro generator presented in this paper is air<br />

ventilated by a motor-driven fan in radial direction. A<br />

longitudinal section <strong>of</strong> the investigated generator is<br />

shown in Figure 1. The fluid enters the machine at the<br />

inlet (a) without a spin, flows through the end winding<br />

bars (d, e) in the pole area (f) and through the stator ducts<br />

(h) to the outlet (j).<br />

b<br />

c<br />

d<br />

e<br />

axis <strong>of</strong> rotation<br />

a j<br />

Figure 1: Pr<strong>of</strong>ile <strong>of</strong> the investigated hydro generator<br />

showing the (a) inlet, (b) bearing support, (c) support<br />

ring, (d) bottom bar, (e) top bar, (f) salient pole, (g) airgap,<br />

(h) stator ducts, (i) outlet area, (j) outlet, (k) shaft<br />

The purpose <strong>of</strong> this paper is to develop a so called slotsector<br />

model which is smaller and faster to calculate than<br />

k<br />

h<br />

i<br />

f<br />

g


a standard model with all components and an enormous<br />

number <strong>of</strong> elements. The slot-sector model should be<br />

investigated and optimized for calculating an accurate<br />

heat transfer coefficient.<br />

II. NUMERICAL SIMULATION OF THE HEAT FLUX<br />

Turbulence models are one <strong>of</strong> the most important parts<br />

in numerical fluid simulation. Therefore the two most<br />

commonly used models, the standard k- model by Jones<br />

and Launder [8] and the shear stress transport model by<br />

Menter [9] have been compared.<br />

The fundamental equation to calculate the heat flux at<br />

the wall is<br />

W= cpu *<br />

T + (TW -T) (2)<br />

where is the density and cp is the specific heat capacity.<br />

It should be pointed out that the two turbulence models<br />

apply different approaches for the dimensionless near<br />

wall velocity u * and the dimensionless temperature at the<br />

wall T + . Vieser et al tested and explained these heat<br />

transfer predictions in [10] for different test cases.<br />

A strong impact on the wall treatment has the density<br />

<strong>of</strong> the used mesh near walls described by the<br />

dimensionless distance from the wall<br />

y + = u y<br />

<br />

(3)<br />

This parameter depends especially on the height <strong>of</strong> the<br />

first element adjacent to the wall y. The other<br />

parameters are the friction velocity u and the kinematic<br />

viscosity . The smaller y is chosen, the more accurate<br />

the calculated heat transfer coefficient becomes. A<br />

parameter study in section IV will show this correlation.<br />

A short overview <strong>of</strong> the influence <strong>of</strong> y + on the<br />

convective heat transfer coefficient is<br />

T + =fy + <br />

=fW W=fT + , u * y + =f(y)<br />

u * =fy + (4)<br />

<br />

The commercial s<strong>of</strong>tware package ANSYS-CFX-13<br />

[11] has been used for the numerical simulations<br />

described in this paper. There are two main calculation<br />

methods for a rotor stator simulation, the transient and the<br />

steady-state approach. A transient calculation needs large<br />

computing resources and takes a long time. Therefore the<br />

steady-state method has been chosen. There are two<br />

variants, the stage method and the frozen rotor method.<br />

These steady-state approaches are only approximations<br />

because the transient terms in the flow equations are<br />

neglected. Nevertheless, their balance <strong>of</strong> computational<br />

efficiency and accuracy is ideal for parameter studies<br />

with many working points. They differ in the treatment <strong>of</strong><br />

the interface between two components. The stage method<br />

averages the fluxes in circumferential direction on bands<br />

and transmits these fluxes to the downstream component.<br />

- 42 - 15th IGTE Symposium 2012<br />

Only one passage per component has to be modeled, and<br />

furthermore, it can be used for large pitch ratios which<br />

highly reduce the number <strong>of</strong> elements.<br />

The frozen approach works with a frame change at the<br />

interface without averaging the fluxes. Therefore, it needs<br />

to model more passages per component. The conservation<br />

equations for the rotor are solved in a rotating system, the<br />

equations <strong>of</strong> the stator in a static frame <strong>of</strong> reference. The<br />

consistency <strong>of</strong> speed and pressure is combined at the<br />

interface. These relations are illustrated in Figure 2 and<br />

explained in detail in [11].<br />

Figure 2: Steady-state methods: stage and frozen rotor<br />

R1/ R2 and S1/ S2 are rotational periodicities; pR/ pS are<br />

pitch ratios<br />

The standard setting in ANSYS-CFX-13 for ideal gas<br />

is temperature independent, i.e. the thermal conductivity<br />

, the specific heat capacity cp and the dynamic viscosity<br />

are constant. This is a simplification <strong>of</strong> reality which<br />

may make the results more inaccurate. An ideal gas with<br />

temperature dependence has been modeled as a<br />

consequence, and compared to a temperature independent<br />

ideal gas.<br />

The dynamic viscosity (5) and the thermal conductivity<br />

(6) have been modeled with the Sutherland’s formula<br />

[11]. The reference temperature Tref has been set to 325<br />

K. The reference molecular viscosity o, the reference<br />

molecular conductivity o, the Sutherland constants S/ S<br />

and the temperature exponents n/ n are listed below for<br />

both equations.<br />

<br />

0<br />

<br />

0<br />

S = 77.8 K<br />

0 = 1.972 10 -5 Pa s<br />

n = 1.574<br />

= Tref + S <br />

T + S T<br />

n<br />

<br />

Tref (5)<br />

= Tref + S <br />

T + S T<br />

n (6)<br />

Tref S = 60.7 K<br />

0 = 2.82 10 -2 W / m K<br />

n = 1.676<br />

As illustrated in equation (7), the specific heat capacity<br />

has been calculated with the zero pressure polynomial<br />

[11].<br />

cp RS = t 1 +t 2 T+t 3 T 2 +t 4T 3 +t 5T 4 (7)


RS = 287.058 J / kg K<br />

t1 = 3.574<br />

t2 = -4.2691 10 -4<br />

t3 = -4.1854 10 -8<br />

t4 = 3.0986 10 -9<br />

t5 = -2.3848 10 -12<br />

All physical values have been found by automatically<br />

adjusting them to measured thermodynamic properties <strong>of</strong><br />

dry air gathered in [12]. These values are valid in a<br />

temperature range from 260 K to 670 K.<br />

III. EXPLANATION OF THE 3D MODEL<br />

This chapter shows the structure <strong>of</strong> the reference model<br />

called pole-sector model (PSM). Four different slot-sector<br />

models (SSM) will be explained, too.<br />

The reference model has been reduced to one pole<br />

sector <strong>of</strong> the whole circumference <strong>of</strong> the generator. A<br />

rotational periodic condition is given at both<br />

circumferential sides. A symmetry condition in axial<br />

direction further reduces the numerical effort. The model<br />

is shown in Figure 3. It is simulated with the frozen rotor<br />

approach and the number <strong>of</strong> elements is about 30 million.<br />

The calculation time <strong>of</strong> the PSM is longer than a week<br />

because <strong>of</strong> this large amount <strong>of</strong> elements. Nevertheless,<br />

the mesh <strong>of</strong> this model is rather coarse over the whole<br />

volume. This fact is especially important near the wall <strong>of</strong><br />

the end windings.<br />

c<br />

d<br />

b<br />

a<br />

Figure 3: Pole-sector model: (a) inlet, (b) bearing support,<br />

(c) support ring, (d) bottom bar, (e) top bar, (f) salient<br />

pole, (g) air-gap, (h) stator ducts, (i) outlet area, (j) outlet<br />

Measuring temperatures at walls in a large hydro<br />

generator demands a high effort and a long preparation<br />

time. The measuring sensors must be fixed during the<br />

construction <strong>of</strong> the components, which makes such<br />

investigations complicated and expensive. Not least due<br />

to these facts, the calculated wall heat transfer<br />

coefficients (WHTC) <strong>of</strong> the PSM have been taken as<br />

e<br />

j<br />

f<br />

i<br />

h<br />

g<br />

- 43 - 15th IGTE Symposium 2012<br />

reference values. By means <strong>of</strong> simulating several working<br />

points with different volume flow rates and rotational<br />

speeds, a wide scope has been covered. The volume flow<br />

rate is set as the inlet boundary condition and the static<br />

pressure as the outlet condition. All walls, especially the<br />

end winding walls, <strong>of</strong> the model have a fixed<br />

temperature. Conduction in solids is not considered.<br />

The aim <strong>of</strong> the investigations is developing a<br />

simplified model with acceptable computational demands<br />

for a numerical parameter study. The best fitting<br />

computational approach for this issue is the stage model.<br />

The question is how much can the PSM be reduced by<br />

maintaining similar accuracy. To clarify this, four<br />

different simplifications have been modeled.<br />

The first idea was to reduce the model as much as<br />

possible. In order to achieve this, the whole generator has<br />

been reduced to a circumferential section <strong>of</strong> one slot.<br />

Furthermore, the rotor, the stator ducts and the outlet area<br />

are not considered. The interface between the rotor<br />

domain and the inlet domain as well as the air gap serves<br />

as a simplified outlet. Due to this, it is difficult to find an<br />

appropriate boundary condition at the simplified outlet.<br />

Only one end winding bar is considered and a diffuser<br />

has been put in front <strong>of</strong> the inlet to get a radial inflow<br />

onto the end winding area. The number <strong>of</strong> elements is<br />

only about 0.6 million due to all these reductions. This<br />

slot-sector model is named SSM_1.<br />

The next step was extending the model SSM_1 with<br />

the rotor domain to get the second model (SSM_2). The<br />

outlet is moved to the symmetry plane <strong>of</strong> the rotor. The<br />

interface between the rotor and the stator ducts is<br />

assumed to be a fixed wall. The number <strong>of</strong> elements is<br />

less than 1 million.<br />

The third model (SSM_3) is enhanced with the stator<br />

ducts and the outlet area. These parts have also a<br />

circumferential extension <strong>of</strong> one slot only. Because <strong>of</strong><br />

this, the same boundary conditions as in the PSM are<br />

possible. The number <strong>of</strong> elements rises to 1.5 million.<br />

The last remaining problem has been the inflow. A slot<br />

section <strong>of</strong> the inlet domain doesn’t allow a three<br />

dimensional spreading <strong>of</strong> the flow onto the end windings.<br />

The consideration <strong>of</strong> the entire inlet area leads to the last<br />

simplified slot-sector model called SSM_4. The final<br />

model includes all components, but it contains only one<br />

slot with a pitch ratio <strong>of</strong> 22.5, i.e. one end winding bar<br />

and its surrounding stator ducts are modeled. This model<br />

has the best features for using the steady-state approach<br />

stage and a circumferential averaging <strong>of</strong> the WHTC is<br />

expected to be appropriate.<br />

The number <strong>of</strong> elements has been reduced to 2 million.<br />

On the one hand, only one slot section has been modeled,<br />

and on the other hand, the rotor and the inlet domains<br />

have been geometrically simplified and meshed coarser<br />

than the components <strong>of</strong> the PSM. The meshes <strong>of</strong> the end<br />

winding bars <strong>of</strong> the PSM and <strong>of</strong> all four SSM have the<br />

same structure and mesh density.<br />

An accurate prediction <strong>of</strong> the heat transfer coefficient<br />

is possible with a fine near wall mesh only. By means <strong>of</strong><br />

a parameter study, the influence <strong>of</strong> the mesh density on<br />

the WHTC has been investigated. The end windings’<br />

domain is meshed starting from an extremely coarse grid


to a very fine one. These various meshes are illustrated in<br />

Table 1.<br />

TABLE I<br />

DIFFERENT MESHES OF THE END WINDING BAR<br />

y 1.element number <strong>of</strong> <br />

[mm] elements<br />

<br />

5,00 45.000 <br />

3,00 65.000 <br />

2,00 81.000 <br />

1,00 146.000 <br />

0,50 318.000 <br />

0,25 693.000 <br />

0,12 989.000 <br />

0,06 1.492.000 <br />

0,05 1.682.000 <br />

The focus has been on a defined height <strong>of</strong> the first<br />

element at the walls. The rest <strong>of</strong> the volume is<br />

automatically meshed with a defined ratio <strong>of</strong> growth and<br />

a Poisson distribution normal to the wall [13].<br />

IV. RESULTS<br />

The evaluation has been carried out by calculating the<br />

WHTC at the end windings as defined in (1). Further<br />

results are normalized to the reference values for a better<br />

overview. The end winding bar is split into 5 zones to get<br />

the variation <strong>of</strong> the WHTC along the bar. Figure 4 shows<br />

the zones, beginning with T1 and T2 on the top bar. TB3<br />

is the junction from the top to the bottom bar and it is<br />

followed by B2 and B1.<br />

Figure 4: End winding bar with 5 zones<br />

Figure 5 shows the comparison <strong>of</strong> the WHTCs<br />

obtained by the four slot-sector models along an end<br />

winding bar. As a criterion for an acceptable agreement<br />

between the PSM and the SSMs, a ratio PSM / SSM in<br />

the range <strong>of</strong> 0.8 - 1.2 has been chosen. The first 3 slotsector<br />

models cannot fulfill this target, especially the area<br />

TB3 at the end <strong>of</strong> the bar is too inaccurate. The version<br />

SSM_4 is just in the range, except in zone T1. This area<br />

- 44 - 15th IGTE Symposium 2012<br />

<strong>of</strong> the top bar is located at the beginning <strong>of</strong> the air gap<br />

and the rotor and is highly influenced by the motion <strong>of</strong><br />

the rotor. This can be also seen in Figure 6, detail x and y,<br />

where the velocity is very high.<br />

PSM / SSM<br />

2,0<br />

1,8<br />

1,6<br />

1,4<br />

1,2<br />

1,0<br />

0,8<br />

0,6<br />

0,4<br />

0,2<br />

SSM_1 SSM_2<br />

SSM_3 SSM_4<br />

0,0 T1 T2 TB3 B2 B1<br />

Figure 5: Comparison <strong>of</strong> the four slot-sector models<br />

Figure 6 shows the comparison <strong>of</strong> the velocity contours<br />

in the symmetry plane <strong>of</strong> the inlet for the models SSM_4<br />

and SSM_3. The contours <strong>of</strong> SSM_1 and SSM_2 are<br />

similar to SSM_3. These pictures essentially show that<br />

the inflow velocity is higher if the whole inlet is used for<br />

the calculations. Hence the ratios <strong>of</strong> the slot-sector<br />

models 1-3 in the zones TB3, B2 and B1 in Figure 5 are<br />

out <strong>of</strong> the range <strong>of</strong> 0.8 - 1.2.<br />

x y<br />

Figure 6: Velocity in the symmetry plane <strong>of</strong> the inlet<br />

domain <strong>of</strong> a) SSM_4 and b) SSM_3<br />

The interaction between the rotor and the end windings is<br />

illustrated in Figure 7. The turbulent kinetic energy at the<br />

interfaces between the inlet and the rotor as well as<br />

between the top bar and the inlet is shown. There are<br />

vortices with high energy at the PSM contour seen in<br />

Figure 7a. The computation with the model SSM_4


generates the well known circumferential bands (see<br />

Figure 7b) characteristic <strong>of</strong> the stage method. In other<br />

words, by averaging the physical values on<br />

circumferential bands it is not possible to calculate local<br />

vortices. Therefore the use <strong>of</strong> a slot-sector model<br />

underestimates the rotor stator interaction.<br />

Inlet – Top bar<br />

Inlet – Top bar<br />

Inlet - Rotor<br />

Inlet - Rotor<br />

Figure 7: Turbulent kinetic energy on selected interfaces<br />

in a) PSM, b) SSM_4<br />

The graph in Figure 8 shows the heat transfer<br />

coefficient in dependence on the dimensionless distance<br />

from the wall at the top bar. The SST and the k-<br />

turbulence models have been used for this study. The<br />

WHTC increases with decreasing dimensionless distance<br />

from the wall. The coefficient reaches its peak at about y +<br />

= 5 and fluctuates around the maximum value. The factor<br />

y + is very sensitive to varying the near wall velocity due<br />

to a different volume flow rate or rotational speed with<br />

the same mesh and geometry. This mesh refinement study<br />

confirms the investigations <strong>of</strong> [14].<br />

- 45 - 15th IGTE Symposium 2012<br />

y=min<br />

1,1<br />

1,0<br />

0,9<br />

0,8<br />

0,7<br />

0,6<br />

0 1<br />

y<br />

10 100<br />

+ [-]<br />

Figure 8: Mesh refinement study in the zones T1, T2,<br />

TB3 with the k- and the SST turbulence model<br />

Depending on the previous investigations, a parameter<br />

study with various working points has been carried out.<br />

The slot-sector model SSM_4 has been used with<br />

different operating conditions but the SST turbulence<br />

model has always been applied. The results have been<br />

averaged and a standard deviation has been calculated.<br />

Figure 9 shows the results obtained.<br />

PSM / SSM<br />

1,4<br />

1,2<br />

1,0<br />

0,8<br />

0,6<br />

0,4<br />

0,2<br />

T1 SST T1 k-e<br />

T2 SST T2 k-e<br />

TB3 SST TB3 k-e<br />

averaged ratio (y+ < 30, ideal gas temperature independent)<br />

averaged ratio (y+ < 1, ideal gas temperature independent)<br />

averaged ratio (y+ < 1, ideal gas temperature dependent)<br />

standard deviation<br />

0,0 T1 T2 TB3 B2 B1<br />

Figure 9: Normalized WHTC in dependence <strong>of</strong> y + and the<br />

type <strong>of</strong> ideal gas<br />

First, the SSM has been calculated with the same mesh<br />

density near the end winding walls as the PSM. The<br />

results are located in the given range <strong>of</strong> 0.8 – 1.2. The<br />

second simulation run has been done with the finest mesh<br />

<strong>of</strong> the end winding domain. The curve is decreasing<br />

nearly parallel to the first one. The reference model has<br />

been only simulated with a coarse mesh near the end<br />

winding bars, hence an estimation <strong>of</strong> the accuracy is not<br />

possible. These two curves have been calculated with a<br />

temperature independent ideal gas. Therefore, the last<br />

curve has been simulated with an ideal gas with<br />

temperature dependency. The ratio <strong>of</strong> the heat transfer<br />

coefficients increases about 5% with temperature<br />

independence assumed.<br />

Regarding these findings, it can be stated that a slotsector<br />

model with a very fine mesh near walls and an<br />

adjusted ideal gas leads to sufficiently accurate results.


V. CONCLUSION<br />

The comparison <strong>of</strong> a pole-sector model with various<br />

slot-sector models shows that a simplification with less<br />

numerical effort is possible. An extreme reduction <strong>of</strong> the<br />

generator is not recommended, because all components<br />

have to be considered. Due to modeling one slot only, the<br />

stage approach is an adequate and fast calculating method<br />

for this kind <strong>of</strong> model structure. The averaged deviation<br />

<strong>of</strong> the wall heat transfer coefficient from the reference<br />

values is about 12%. Possible improvements <strong>of</strong> the slotsector<br />

model are the use <strong>of</strong> an adjusted ideal gas and a<br />

fine mesh near walls. Furthermore, the influence <strong>of</strong> the<br />

dimensionless distance from the wall has been confirmed.<br />

ACKNOWLEDGMENT<br />

This work has been supported by the Christian<br />

Doppler Research Association (CDG) and by the Andritz<br />

Hydro GmbH.<br />

- 46 - 15th IGTE Symposium 2012<br />

REFERENCES<br />

[1] E. Farnleitner and G. Kastner, "Moderne Methoden der<br />

Ventilationsauslegung von Pumpspeichergeneratoren," e&i, vol.<br />

127, pp. 24-29, 2010.<br />

[2] G. Traxler-Samek, R. Zickermann and A. Schwery, "Cooling<br />

airflow, losses, and temperatures in large air-cooled synchronous<br />

machines," IEEE Transactions on Industrial Electronics, vol. 57,<br />

no. 1, pp. 172-180, Jan. 2010.<br />

[3] H. Herwig, "Kritische Anmerkungen zu einem weitverbreiteten<br />

Konzept: der Wärmeübergangskoeffizient a," Forschung im<br />

Ingenieurwesen, vol. 63, pp. 13-17, 1997.<br />

[4] A. Boglietti and A. Cavagnino, "Analysis <strong>of</strong> the endwinding<br />

cooling effects in TEFC induction iotors," IEEE Transactions on<br />

Industry Applications, vol. 43, no. 5, pp. 1214-1222, Sept.-Oct.<br />

2007.<br />

[5] A. Boglietti, A. Cavagnino, D. Staton and M. Popescu,<br />

"Experimental assessment <strong>of</strong> end region cooling arrangements in<br />

induction motor endwindings," IET Electric Power Applications,<br />

vol. 5, no. 2, pp. 203-209, Feb. 2011.<br />

[6] C. Micallef, S. Pickering, K. Simmons and K. Bradley, "Improved<br />

cooling in the end region <strong>of</strong> a strip-wound totally enclosed fancooled<br />

induction electric machine," IEEE Transactions on<br />

Industrial Electronics, vol. 55, no. 10, pp. 3517-3524, Oct. 2008.<br />

[7] M. Hettegger, B. Streibl, O. Bíró and H. Neudorfer, "Identifying<br />

the heat transfer coefficients on the end-winding <strong>of</strong> an electrical<br />

machine by measurements and simulations," in 19th ICEM, Rome,<br />

2010.<br />

[8] W. P. Jones and B. E. Launder, "The prediction <strong>of</strong> laminarization<br />

with a two-equation model <strong>of</strong> turbulence," International Journal <strong>of</strong><br />

Heat and Mass Transfer, vol. 15, no. 2, pp. 301-314, Feb. 1972.<br />

[9] F. R. Menter, "Two-equation eddy-viscosity turbulence models for<br />

engineering applications," AIAA Journal, vol. 32, pp. 1598-1605,<br />

1994.<br />

[10] W. Vieser, T. Esch and F. Menter, "Heat transfer predictions using<br />

advanced two-equation turbulence models," CFX Technical<br />

Memorandum, vol. CFX-VAL10/0602, 2002.<br />

[11] "ANSYS 13.0 documentation," ANSYS, Inc., Canonsburg, 2010.<br />

[12] F. Kreith, R. M. Manglik and M. S. Bohn, Principles <strong>of</strong> Heat<br />

Transfer, 7 ed., Stamford: Cengage Learning, 2011.<br />

[13] "ANSYS ICEM CFD 13.0 documentation," ANSYS, Inc.,<br />

Canonsburg, 2010.<br />

[14] M. Schrittwieser, A. Marn, E. Farnleitner and G. Kastner,<br />

"Numerical analysis <strong>of</strong> heat transfer and flow <strong>of</strong> stator duct<br />

models," in 20th ICEM, Marseille, 2012.


- 47 - 15th IGTE Symposium 2012<br />

Numerical Investigation <strong>of</strong> Linear Systems Obtained<br />

by Extended Element-Free Galerkin Method<br />

Taku Itoh∗ , Soichiro Ikuno∗ , and Atsushi Kamitani †<br />

∗ Tokyo <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, 1404-1 Katakura, Hachioji, Tokyo 192-0982, Japan<br />

† Yamagata <strong>University</strong>, 4-3-16 Johnan, Yonezawa, Yamagata, 992-8510, Japan<br />

E-mail: taku@m.ieice.org<br />

Abstract—To impose not only the essential boundary condition but also the natural one without any integrations, the<br />

Element-Free Galerkin method (EFG) has been reformulated, and this method is called an eXtended EFG (X-EFG). A<br />

linear system obtained by the X-EFG becomes an asymmetric saddle point problem. Numerical experiments show that,<br />

by using IC-Bi-CGSTAB and IC-GMRES(m), the linear system can be solved more than 9 times faster than the LU<br />

factorization in relatively large problem. However, there are some cases where these iterative methods sometimes do not<br />

converge regardless <strong>of</strong> the condition number. To avoid these cases, and to stably solve the linear system as fast as possible,<br />

a flow chart for choosing an appropriate solver has been constructed by using the results <strong>of</strong> the numerical experiments.<br />

Index Terms—Meshless methods, Element-free Galerkin methods, Saddle point problems, Asymmetric linear systems<br />

I. INTRODUCTION<br />

Meshless methods such as the Element-Free Galerkin<br />

method (EFG) [1] and the Meshless Local Petrov-<br />

Galerkin method (MLPG) [2] have widely been applied<br />

to numerical simulations in a lot <strong>of</strong> fields, including<br />

electromagnetics [3], [4], [5], [6], [7]. In the meshless<br />

methods, elements <strong>of</strong> a geometrical structure are no<br />

longer necessary.<br />

Especially in the EFG, the Lagrange multiplier [1] is<br />

employed for imposing the essential boundary condition.<br />

Recently, the EFG has been reformulated together with<br />

anewimposing method <strong>of</strong> the essential boundary condition<br />

[8]. In this EFG, the essential boundary condition<br />

can be satisfied without using any integrations. However,<br />

it must be noted here that, in this EFG, the natural<br />

boundary condition is imposed by evaluating integrations.<br />

Especially for a three-dimensional (3D) problem, surface<br />

integrals must be evaluated for imposing the natural<br />

boundary condition. For this reason, if there exists an<br />

easier method for imposing the natural boundary condition,<br />

the method is helpful for developing a numerical<br />

code based on the EFG.<br />

The purpose <strong>of</strong> the present study is to reformulate<br />

a 3D EFG so that not only the essential boundary<br />

condition but also the natural one can be imposed without<br />

any integrations. The reformulated method is called an<br />

eXtended EFG (X-EFG). A linear system obtained by<br />

the X-EFG becomes a saddle point problem, and the<br />

coefficient matrix is asymmetric. For the purpose <strong>of</strong><br />

stably obtaining a numerical solution as fast as possible,<br />

appropriate solvers for the asymmetric linear system are<br />

also investigated numerically.<br />

II. EXTENDED ELEMENT-FREE GALERKIN METHOD<br />

A. New Reformulation<br />

In this section, we describe a new reformulation <strong>of</strong><br />

EFG which is different from that described in [8]. For<br />

simplicity, we consider a 3D Poisson problem:<br />

−Δu = f in V, (1)<br />

u =ū on SD, (2)<br />

∂u<br />

∂n =¯q on SN, (3)<br />

where V is a region bounded by a simple closed surface<br />

∂V that consists <strong>of</strong> both SD and SN. Here, SD and SN<br />

satisfy SD ∪ SN = ∂V and SD ∩ SN = φ. In addition, ū<br />

and ¯q are known functions on SD and SN, respectively,<br />

and n is an outward unit normal on ∂V . Furthermore,<br />

f(x) is a given function on V , and x =[x, y, z] T .<br />

From (1), the weak form is derived as<br />

<br />

<br />

∀<br />

w s.t. w∂V : ∇w ·∇u d<br />

=0 3 <br />

x = wf d 3 x . (4)<br />

V<br />

where w(x) is a test function. Note that the constraint <strong>of</strong><br />

w(x) in (4) is different from that in [8].<br />

To discretize (4), the nodes, x1, x2,...,xN are first<br />

placed both in V and on ∂V , and shape functions<br />

φ1(x),φ2(x),...,φN(x) are determined by using the<br />

Moving Least-Squares (MLS) approximation [1], [2], [7].<br />

Here, N is the number <strong>of</strong> nodes in V ∪ ∂V . In the<br />

following, M denotes the number <strong>of</strong> nodes on ∂V .In<br />

addition, the orthonormal system in R N and that in R M<br />

are denoted by {e1, e2,...,eN } and {ē1, ē2,...,ēM },<br />

respectively.<br />

Let us first discretize the weak form (4). To this end,<br />

we assume that u and w can be expanded with φi(x)(i =<br />

1, 2,...,N) as follows:<br />

N<br />

N<br />

u(x) = ûiφi(x), w(x) = ˆwiφi(x). (5)<br />

i=1<br />

By substituting (5) into (4), we obtain<br />

where<br />

i=1<br />

( ˆw,Aû − f) =0, (6)<br />

ˆw =[ˆw1, ˆw2,..., ˆwN ] T , (7)<br />

V


and<br />

û =[û1, û2,...,ûN] T . (8)<br />

In addition, A and f are defined as<br />

N N<br />

<br />

A ≡ ∇φi ·∇φj d 3 xeie T j , (9)<br />

f ≡<br />

i=1 j=1<br />

N<br />

<br />

V<br />

φif d 3 xei . (10)<br />

i=1 V<br />

Next, the constraint w| ∂V =0 in (4) is discretized. To<br />

this end, the constraint is rewritten as the equivalent<br />

proposition:<br />

<br />

∀<br />

β(s, t) :<br />

∂V<br />

β(s, t)w(x(s, t)) dS =0, (11)<br />

and an arbitrary function β(s, t) is assumed to be<br />

contained in span(N1,N2,...,NM ), where N1(s, t),<br />

N2(s, t),...,NM (s, t) are linearly independent functions<br />

on ∂V . Here, s and t are parameters for representing<br />

∂V . By using N1(s, t),N2(s, t),...,NM (s, t), (11) can<br />

be discretized as<br />

( ˆw, ck) =0(k =1, 2,...,M), (12)<br />

where ck (k =1, 2,...,M) are defined as<br />

N<br />

<br />

ck ≡ Nk(s, t)φi(x(s, t)) dS ei. (13)<br />

i=1<br />

∂V<br />

Note that (12) indicate ˆw ∈ V ⊥ , where<br />

V = span(c1, c2,...,cM ). (14)<br />

Hence, the weak form (4) can be discretized as<br />

∀ ˆw ∈ V ⊥ :(ˆw,Aû − f) =0. (15)<br />

Since (V ⊥ ) ⊥ = V, (15) can be written as<br />

Aû − f ∈ V. (16)<br />

Therefore, there exists ˆv ∈ R M such that<br />

Aû + C ˆv = f, (17)<br />

where C ≡ [c1, c2,...,cM ] and ˆv ≡ [ˆv1, ˆv2,...,ˆvM ] T .<br />

Finally, the essential boundary condition (2) and<br />

the natural one (3) are simultaneously discretized. By<br />

the similar procedures for discretizing the constraint<br />

w| ∂V =0 , (2) and (3) can be discretized as<br />

D T û = g, (18)<br />

where D ≡ [d1, d2,...,dM ], and<br />

⎧<br />

N<br />

<br />

Nk(s, t)φi(x(s, t)) dS ei,<br />

⎪⎨ i=1 ∂V<br />

for xk ∈ SD,<br />

dk ≡ N<br />

<br />

Nk(s, t)<br />

⎪⎩ i=1 ∂V<br />

∂φi<br />

(x(s, t)) dS ei,<br />

∂n<br />

for xk ∈ SN<br />

(k =1, 2,...,M). (19)<br />

- 48 - 15th IGTE Symposium 2012<br />

In addition, g ≡ [g1,g2,...,gM ], and<br />

⎧ <br />

⎪⎨ Nk(s, t)ū(s, t) dS, for xk ∈ SD,<br />

SD<br />

gk ≡ <br />

⎪⎩ Nk(s, t)¯q(s, t) dS, for xk ∈ SN<br />

SN<br />

(k =1, 2,...,M). (20)<br />

Note that, for xk ∈ SD, dk is exactly the same asck.<br />

Equations (17) and (18) can be written in the form,<br />

<br />

A C<br />

DT <br />

û f<br />

= . (21)<br />

O ˆv g<br />

Equation (21) is a discretized form <strong>of</strong> the Poisson problem.<br />

Throughout this subsection, the reformulation <strong>of</strong><br />

EFG is finished. In the following, the reformulated EFG<br />

is referred to as an eXtended EFG (X-EFG).<br />

B. Selection <strong>of</strong> linearly independent functions<br />

As mentioned above, the linearly independent functions<br />

Nk (k =1, 2,...,M) are required for discretizing<br />

the essential and natural boundary conditions. Here, the<br />

δ-functions defined on ∂V are employed as Nk (k =<br />

1, 2,...,M) so that the these boundary conditions may<br />

be satisfied exactly. The explicit form <strong>of</strong> Nk(s, t) is given<br />

as<br />

Nk(s, t) = δ(s − sk)δ(t − tk)<br />

<br />

<br />

<br />

∂x ∂x <br />

(k =1, 2,...,M).<br />

× <br />

∂s ∂t <br />

(22)<br />

Note that, on ∂V , the kth boundary node xk is represented<br />

by sk and tk, i.e., xk = x(sk,tk). By using (22),<br />

C, dk and gk(k =1, 2,...,M) can be rewritten as<br />

C =<br />

N<br />

M<br />

φi(x(sk,tk))eiē<br />

i=1 k=1<br />

T<br />

k , (23)<br />

⎧<br />

N<br />

⎪⎨ φi(x(sk,tk))ei, for xk ∈ SD,<br />

dk = i=1<br />

N ∂φi<br />

⎪⎩<br />

∂n<br />

i=1<br />

(x(sk,tk))ei,<br />

(24)<br />

for xk ∈ SN,<br />

<br />

ū(sk,tk), for xk ∈ SD,<br />

gk =<br />

(25)<br />

¯q(sk,tk), for xk ∈ SN.<br />

It must be noted here that, in the X-EFG, the coefficient<br />

matrix is not symmetric except for the case where<br />

∂V = SD. However, the essential and natural boundary<br />

conditions can easily be imposed, since C, D and g can<br />

be evaluated without any integrations.<br />

III. SOLVING LINEAR SYSTEM (21)<br />

A linear system that has a coefficient matrix <strong>of</strong> a 2 ×<br />

2 block structure as in (21) are called a saddle point<br />

problem. In this section, we consider solving the saddle<br />

point problem (21). For the following discussion, (21) is<br />

rewritten as<br />

Aˆx = b, (26)


where<br />

A≡<br />

A C<br />

D T O<br />

<br />

û<br />

, ˆx ≡<br />

ˆv<br />

<br />

f<br />

, and b ≡<br />

g<br />

<br />

. (27)<br />

A. Direct solvers<br />

As a direct solver for saddle point problems, there is a<br />

method that utilizes the 2 × 2 structure <strong>of</strong> A [9]. In this<br />

method, A is decomposed by the Cholesky factorization.<br />

Since A is a main part <strong>of</strong> A, the computational cost may<br />

be decreased by using this method in comparison with<br />

that <strong>of</strong> the Gaussian elimination. However, we do not<br />

employ this method for solving (21). This is because<br />

there were some cases that the Cholesky factorization<br />

<strong>of</strong> A was failed in preliminary numerical experiments.<br />

Thus, we consider that the method in [9] is not stable<br />

for solving (21) in this problem.<br />

It must be noted here that A can be decomposed by the<br />

LU factorization. Hence, we adopt the LU factorization as<br />

a direct solver. In addition, an ordering method is used to<br />

decrease fill-ins before the LU factorization is executed.<br />

B. Iterative Schemes for Solving Saddle Point Problems<br />

As an iterative scheme for solving saddle point problems,<br />

Uzawa’s method [10] is well known. Starting with<br />

initial guesses û0 and ˆv0, Uzawa’s method consists <strong>of</strong><br />

the following coupled iteration:<br />

Aûk+1 = f − C ˆvk, (28)<br />

ˆvk+1 = ˆvk + ω(D T ûk+1 − g), (29)<br />

where ω>0 is a relaxation parameter. In (28), a linear<br />

system depending on the size <strong>of</strong> A must be solved.<br />

Hence, if the size <strong>of</strong> A is large, the computational cost<br />

for solving (28) may be expensive.<br />

On the other hand, the Arrow-Hurwicz method [10]<br />

is also well known as an inexpensive iterative scheme<br />

in comparison with the Uzawa’s method. Starting with<br />

initial guesses û0 and ˆv0, the Arrow-Hurwicz method<br />

consists <strong>of</strong> the following coupled iteration:<br />

ûk+1 = ûk + α(f − Aûk − C ˆvk), (30)<br />

ˆvk+1 = ˆvk + ω(D T ûk+1 − g), (31)<br />

where α is also a relaxation parameter. The Arrow-<br />

Hurwicz method is useful for the case where the size <strong>of</strong><br />

A is large. This is because a linear system do not exist<br />

in this iteration. This iteration can be written in terms<br />

<strong>of</strong> a matrix splitting A = P−Q, i.e., as the fixed-point<br />

iteration,<br />

P ˆxk+1 = Qˆxk + b, (32)<br />

where<br />

<br />

1<br />

P≡ αI O<br />

DT − 1<br />

ω I<br />

<br />

1<br />

, Q≡ αI − A −C<br />

O − 1<br />

ω I<br />

<br />

, (33)<br />

and ˆx T k ≡ ûT k ˆvT <br />

k . In 3D problems, the size <strong>of</strong> A tends<br />

to be large. Thus, we adopt the Arrow-Hurwicz method<br />

as an iterative scheme for solving (21).<br />

- 49 - 15th IGTE Symposium 2012<br />

C. Preconditioned Krylov Subspace Methods<br />

For asymmetric linear systems, the incomplete LU<br />

factorization (ILU) [10] is well known as a preconditioner<br />

for Krylov subspace methods. Although ILU can be applied<br />

to (21), we do not employ ILU. This is because the<br />

coefficient matrix <strong>of</strong> (21) is almost symmetric. Namely,<br />

we consider utilizing the matrix property.<br />

To utilize “almost symmetric”, we adopt the incomplete<br />

Cholesky factorization (IC) [11] as a preconditioner<br />

for Krylov subspace methods. To this end, we propose<br />

a strategy for generating preconditioned matrices. In<br />

this strategy, preconditioned matrices LDLT <strong>of</strong> IC are<br />

generated as<br />

<br />

A D<br />

DT <br />

LDL<br />

O<br />

T , (34)<br />

where L is a lower triangular matrix, and D is a diagonal<br />

matrix. In (34), we assume<br />

<br />

A C<br />

A =<br />

DT <br />

A D<br />

<br />

O DT <br />

. (35)<br />

O<br />

As mentioned above, if xk ∈ SD, the kth column <strong>of</strong>D<br />

is exactly the same as that <strong>of</strong> C. In addition, there is<br />

no difference between the matrix A <strong>of</strong> (21) and that <strong>of</strong><br />

(34). Hence, we consider that the assumption (35) can<br />

be acceptable. Note that we adopt an algorithm <strong>of</strong> IC in<br />

which matrices LDL T are as sparse as the matrix A [11].<br />

Even we use IC as a preconditioner, Krylov subspace<br />

methods for asymmetric linear systems have to be chosen.<br />

As Krylov subspace methods for asymmetric linear<br />

systems, Bi-CGSTAB [12] and GMRES(m) [13] are well<br />

known and these iterative methods have produced a lot<br />

<strong>of</strong> attractive results [14]. Here, m is some fixed integer<br />

parameter, and GMRES(m) restarts every m steps [13].<br />

For solving (21), we adopt both methods with IC. In the<br />

following, Bi-CGSTAB with IC and GMRES(m) with IC<br />

are referred to as IC-Bi-CGSTAB and IC-GMRES(m),<br />

respectively.<br />

IV. NUMERICAL EXPERIMENTS<br />

In this section, some numerical solvers as chosen<br />

in Section III are applied to a linear system (21). To<br />

generate the linear system (21), a 3D Poisson problem is<br />

discretized by using the X-EFG.<br />

Throughout the present section, the region V is assumed<br />

as V =(−0.5, 0.5) × (−0.5, 0.5) × (−0.5, 0.5).<br />

In addition, the natural boundary condition is imposed<br />

on the surface SN defined as −0.25 ≤ x ≤ 0.25,<br />

−0.25 ≤ y ≤ 0.25 and z = 0.5, and the essential<br />

boundary condition is imposed on SD ≡ ∂V − SN.<br />

Moreover, the functions f(x), ū and ¯q are determined<br />

so that the analytic solution <strong>of</strong> the 3D Poisson problem<br />

may be u =exp(−x 2 − y 2 − z 2 ).<br />

The boundary nodes x1, x2,...,xM are uniformly<br />

placed on ∂V , and the nodes xM+1, xM+2,...,xN are<br />

also uniformly placed in V . In addition, the exponential


Fig. 1. Dependence <strong>of</strong> the relative error on the size <strong>of</strong> coefficient<br />

matrix. In this figure, u e and u n are exact and numerical solutions,<br />

respectively.<br />

weight function [1],<br />

⎧<br />

⎨exp[−(r/c)<br />

w(r) ≡<br />

⎩<br />

2 ] − exp[−(R/c) 2 ]<br />

1 − exp[−(R/c) 2 (r ≤ R),<br />

]<br />

(36)<br />

0 (r>R),<br />

is adopted for the MLS approximation. Here, R denotes a<br />

support radius, and c is a user-specified parameter. We set<br />

R =1.9h and c = h, where h is the minimum distance<br />

between two nodes.<br />

In the MLS approximation, the shape functions<br />

φi(x) (i =1, 2,...,N) can be determined by<br />

φi(x) =p T (x)B −1 (x)bi(x), (37)<br />

where p T (x) =[1xyz]. In addition, the matrix B(x)<br />

and the vector bi(x) are defined as<br />

B(x) =<br />

N<br />

wk(x)p(xk)p T (xk), (38)<br />

k=1<br />

bi(x) =wi(x)p(xi), (39)<br />

where wi(x) =w(|x−xi|). In (9), the partial derivatives<br />

<strong>of</strong> φi(x) by X(= x, y, and z) can be obtained as<br />

where<br />

φi,X(x) =p T X(x)B −1 (x)bi(x)<br />

+p T (x)[B −1<br />

X (x)bi(x)+B −1 (x)bi,X(x)], (40)<br />

B −1<br />

X (x) =−B−1 (x)BX(x)B −1 (x). (41)<br />

For evaluating (9) and (10), a cubic cell structure being<br />

independent <strong>of</strong> the nodes is used [1], [7], and the Gauss-<br />

Legendre quadrature is employed. The number NQ <strong>of</strong><br />

quadrature points depends on the number m <strong>of</strong> nodes in<br />

a cell. Throughout this section, NQ is handled on the<br />

similar criterion in [1], i.e., NQ = nQ × nQ × nQ, where<br />

nQ = ⌊ √ m +0.5⌋ +2. In addition, the number NC <strong>of</strong><br />

cells is set as NC = mC × mC × mC, where mC =<br />

⌊N 1/3 ⌋.<br />

As a LU factorization, we adopt the sequential SuperLU<br />

[15]. In addition, the Column Approximate Minimum<br />

Degree Ordering (COLAMD) [16] is employed<br />

- 50 - 15th IGTE Symposium 2012<br />

Fig. 2. Dependence <strong>of</strong> the computational time for solving (21) on the<br />

size <strong>of</strong> coefficient matrix.<br />

as an ordering method. This ordering method can easily<br />

be used by setting options.ColPerm = COLAMD in<br />

the SuperLU. For the Arrow-Hurwicz method, we set<br />

α =1.5 and ω =0.05. In addition, for IC-GMRES(m),<br />

we set m = 200. For all iterative solvers, an initial guess<br />

<strong>of</strong> ˆx in (26) is set as ˆx0 = 0.<br />

Computations were performed on a computer equipped<br />

with a 2.66GHz Intel Core i7 920 processor, 24GB RAM,<br />

Ubuntu Linux ver. 12.04, and g++ ver. 4.6.3. Note that we<br />

only used a single core <strong>of</strong> this processor in the following<br />

experiments. Compiler options were set as “-O3 -Wall<br />

-m64” for all solvers.<br />

A. Determining εtol for Iterative Solvers<br />

For the Arrow-Hurwicz method, the iteration is repeated<br />

until ||ˆxk+1 − ˆxk||/||ˆxk+1|| ≤ εtol is satisfied,<br />

where k is the iteration number and ˆxk is the approximate<br />

solution in the kth iteration. Also, for IC-Bi-<br />

CGSTAB and IC-GMRES(m), the iteration is repeated<br />

until ||rk+1||/||b|| ≤ εtol is satisfied, where rk+1 is the<br />

(k +1)th residual vector that can be obtained in the<br />

algorithm <strong>of</strong> Krylov subspace methods. Note that the<br />

maximum norm is adopted for the definition <strong>of</strong> ||·||.<br />

To determine εtol, the dependence <strong>of</strong> relative error on<br />

the size <strong>of</strong> coefficient matrix is shown in Fig. 1. Here,<br />

the relative error ε ≡||ue−un ||/||ue ||, where ue and un are the exact and numerical solutions <strong>of</strong> u, respectively.<br />

In addition, by the first equation <strong>of</strong> (5), un is evaluated<br />

with û that is determined by the LU factorization. From<br />

Fig. 1, we see ε>10−4 . Hence, we consider that, in this<br />

problem, εtol =10−8is sufficient for obtaining un that<br />

has almost the same accuracy shown in Fig. 1.<br />

B. Performance <strong>of</strong> Direct and Iterative Solvers<br />

Let us first investigate the performance <strong>of</strong> the LU factorization,<br />

the Arrow-Hurwicz method, IC-Bi-CGSTAB<br />

and IC-GMRES(m). To this end, the dependence <strong>of</strong><br />

the computational time for solving (21) by using these<br />

methods on the size <strong>of</strong> coefficient matrix is shown in


(a)<br />

(b)<br />

Fig. 3. Histories <strong>of</strong> the relative residual for IC-Bi-CGSTAB and IC-<br />

GMRES(m), and those <strong>of</strong> the relative error for the Arrow-Hurwicz<br />

method. (a) and (b) are for N + M = 19083 and 42083, respectively.<br />

Fig. 2. We see from this figure that the computational<br />

time <strong>of</strong> the LU factorization is less than that <strong>of</strong> the<br />

Arrow-Hurwicz method. In addition, from this figure,<br />

there is no obvious difference between the computational<br />

time <strong>of</strong> IC-Bi-CGSTAB and that <strong>of</strong> IC-GMRES(m), and<br />

the computational time <strong>of</strong> both methods is less than that<br />

<strong>of</strong> the LU factorization. Especially for the case where the<br />

size N + M <strong>of</strong> the coefficient matrix is relatively large,<br />

the computational time can be decreased by using both<br />

methods, e.g., for N + M = 42083, IC-BiCGSTAB and<br />

IC-GMRES(m) are about 9 and 15 times faster than the<br />

LU factorization. From these results, we consider that the<br />

strategy described in (34) works well for solving (21).<br />

It must be noted here that the Arrow-Hurwicz method<br />

does not converge for N +M = 42083. Similarly, IC-Bi-<br />

CGSTAB and IC-GMRES(m) do not converge for N +<br />

M = 19083. Thus, in Fig. 2, there is no data for these 3<br />

cases. Hence, the iterative solvers are not always stable<br />

in this problem.<br />

To investigate behavior <strong>of</strong> the iterative solvers for<br />

N + M = 19083 and 42083, histories <strong>of</strong> the relative<br />

residual ||rk+1||/||b|| for Krylov subspace methods, and<br />

those <strong>of</strong> the relative error ||ˆxk+1 − ˆxk||/||ˆxk+1|| for<br />

- 51 - 15th IGTE Symposium 2012<br />

Fig. 4. Dependence <strong>of</strong> the condition number <strong>of</strong> A on the size <strong>of</strong><br />

coefficient matrix.<br />

the Arrow-Hurwicz method are shown in Fig. 3. We<br />

see from Fig. 3(a) that IC-BICGSTAB rapidly diverges<br />

and IC-GMRES(m) oscillates. In addition, the Arrow-<br />

Hurwicz method converges though the convergence speed<br />

is slow. Indeed, the iteration number for the Arrow-<br />

Hurwicz method is 35324 when ||ˆxk+1 − ˆxk||/||ˆxk+1||<br />

is satisfied. From Fig. 3(b), we see that IC-Bi-CGSTAB<br />

and IC-GMRES(m) converge rapidly. In addition, the<br />

relative residual <strong>of</strong> IC-GMRES(m) is stably decreased<br />

until restarting. For N +M = 42083, the Arrow-Hurwicz<br />

method does not converge, even after more than 500000<br />

iterations.<br />

Next, we investigate a property <strong>of</strong> the coefficient matrix<br />

<strong>of</strong> (21). To this end, the dependence <strong>of</strong> the condition<br />

number <strong>of</strong> A on the size <strong>of</strong> coefficient matrix is shown<br />

in Fig. 4. We see from this figure that the condition<br />

numbers are not very large, even for N + M = 19083<br />

and 42083. Hence, from the condition numbers, it is<br />

difficult to obtain the reason why the Krylov subspace<br />

methods and the Arrow-Hurwicz method do not converge<br />

for N + M = 19083 and 42083, respectively.<br />

From these results, it is difficult that we recognize an<br />

appropriate solver in advance. Hence, to stably solve (21)<br />

as fast as possible, we suggest that IC-GMRES(m) is<br />

first used in order to choose an appropriate solver. After<br />

ˆn iterations <strong>of</strong> IC-GMRES(m), if<br />

||rk+1||/||b|| ≤ ˆεtol<br />

(42)<br />

is not satisfied, then it is recognized that IC-GMRES(m)<br />

will not converge. In this case, the iteration <strong>of</strong> IC-<br />

GMRES(m) is finished, and (21) is solved by the LU<br />

factorization. If (42) is satisfied after ˆn iterations, then it<br />

is recognized that IC-GMRES(m) will converge. Hence,<br />

in this case, the iteration <strong>of</strong> IC-GMRES(m) is continued.<br />

We consider that ˆεtol =10 −2 and ˆn = max(30, (N +<br />

M)/500) are reasonable choice for this problem.<br />

Although the convergence speed is slow, the Arrow-<br />

Hurwicz method may work for the case where not only<br />

Krylov subspace methods do not converge but also the<br />

LU factorization cannot execute. This may occur when<br />

N + M is too large.


The above suggestions for choosing solvers are summarized<br />

as a flow chart shown in Fig. 5. Note that, in<br />

this flow chart, IC-Bi-CGSTAB can be used instead <strong>of</strong><br />

IC-GMRES(m) by setting ˆεtol and ˆn appropriately. This<br />

is because, in Fig. 2, when IC-GMRES(m) converges,<br />

IC-Bi-CGSTAB also converges, and there is no obvious<br />

difference between the computational time <strong>of</strong> IC-<br />

GMRES(m) and that <strong>of</strong> IC-Bi-CGSTAB.<br />

V. CONCLUSION<br />

To impose not only the essential boundary condition<br />

but also the natural one without any integrations, the EFG<br />

has been reformulated, and this method is called a X-<br />

EFG. A linear system obtained by the X-EFG becomes<br />

an asymmetric saddle point problem. To investigate appropriate<br />

solvers for this problem, the linear system that<br />

is obtained from a 3D Poisson problem discretized by<br />

the X-EFG has been solved by the LU factorization,<br />

the Arrow-Hurwicz method, IC-Bi-CGSTAB, and IC-<br />

GMRES(m) in the numerical experiments. Conclusions<br />

obtained in the present study are summarized as follows:<br />

1) By using the X-EFG, the essential and natural<br />

boundary conditions can be imposed without any<br />

integrations.<br />

2) Although the linear system obtained by the X-<br />

EFG has the asymmetric coefficient matrix, the<br />

incomplete Cholesky factorization works well as a<br />

preconditioner for Bi-CGSTAB and GMRES(m).<br />

3) By using IC-BiCGSTAB and IC-GMRES(m), the<br />

linear system can be solved faster than the LU<br />

factorization in relatively large problems. However,<br />

these iterative methods sometimes do not converge<br />

regardless <strong>of</strong> the condition number.<br />

4) To stably solve the linear system as fast as possible,<br />

an appropriate solver can be chosen by the flow<br />

chart shown in Fig. 5.<br />

As future work, the X-EFG will be applied to more<br />

practical problems in various fields, including electromagnetics.<br />

ACKNOWLEDGMENTS<br />

This work was partially supported by JSPS KAKENHI<br />

Grant Numbers 24700053 and 22360042.<br />

REFERENCES<br />

[1] T. Belytschko, Y. Y. Lu, and L. Gu, “Element-free Galerkin<br />

methods,” Int. J. Numer. Methods Eng., vol. 37, pp. 229–256,<br />

1994.<br />

[2] S. N. Atluri and T. Zhu, “A new meshless local Petrov-Galerkin<br />

(MLPG) approach in computational mechanics,” Comput. Mech.,<br />

vol. 22, pp. 117–127, 1998.<br />

[3] A. Manzin, D. P. Ansalone, and O. Bottauscio, “Numerical<br />

modeling <strong>of</strong> biomolecular electrostatic properties by the elementfree<br />

Galerkin method,” IEEE Trans. on Magn., vol. 47, no. 5, pp.<br />

1382–1385, 2011.<br />

[4] S. Ikuno, T. Hanawa, T. Takayama, and A. Kamitani, “Evaluation<br />

<strong>of</strong> parallelized meshless approach: Application to shielding<br />

current analysis in HTS,” IEEE Trans. on Magn., vol. 44, pp.<br />

1230–1233, 2008.<br />

- 52 - 15th IGTE Symposium 2012<br />

✓<br />

Start<br />

✒<br />

✏<br />

✑<br />

❄<br />

Iteration <strong>of</strong> IC-GMRES(m) is repeated<br />

until iteration number k =ˆn.<br />

❄<br />

✟ ✟✟✟✟✟<br />

❍<br />

❍<br />

❍<br />

❍<br />

||rk+1|| ❍<br />

❍ ≤ ˆεtol? ❍Yes<br />

❍❍❍❍❍ ||b|| ✟<br />

✟<br />

✟<br />

✟<br />

✟<br />

✟<br />

No ✓ ❄<br />

IC-GMRES(m)<br />

✒<br />

❄<br />

✟ ✟✟✟✟✟<br />

❍<br />

❍<br />

❍<br />

❍<br />

❍<br />

❍ Is N + M too large? ❍Yes<br />

✟<br />

❍❍❍❍❍<br />

✟<br />

✟<br />

✟<br />

✟<br />

✟ No ✓ ❄<br />

✓ ❄<br />

Arrow-Hurwicz<br />

✒<br />

✏<br />

LU factorization<br />

✒<br />

✑<br />

✏<br />

✑<br />

✏<br />

✑<br />

Fig. 5. A flow chart for choosing an appropriate solver. Note that<br />

IC-Bi-CGSTAB can be used instead <strong>of</strong> IC-GMRES(m) in this flow<br />

chart.<br />

[5] G. F. Parreira, E. J. Silva, A. Fonseca, and R. Mesquita, “The<br />

element-free Galerkin method in three-dimensional electromagnetic<br />

problems,” IEEE Trans. on Magn., vol. 42, no. 4, pp. 711–<br />

714, 2006.<br />

[6] G. Ni, S. L. Ho, S. Yang, and P. Ni, “Meshless local Petrov-<br />

Galerkin method and its application to electromagnetic field<br />

computations,” International Journal <strong>of</strong> Applied Electromagnetics<br />

and Mechanics, vol. 19, pp. 111–117, 2004.<br />

[7] G. R. Liu, Meshfree Methods: Moving beyond the Finite Element<br />

Method (2nd Edition). Boca Raton: CRC Press LLC, 2009.<br />

[8] A. Kamitani, T. Takayama, T. Itoh, and H. Nakamura, “Extension<br />

<strong>of</strong> meshless Galerkin/Petrov-Galerkin approach without using<br />

Lagrange multipliers,” Plasma and Fusion Research, vol. 6, no.<br />

2401074, 2011.<br />

[9] J. Zhao, “The generalized Cholesky factorization method for<br />

saddle point problems,” Applied Mathematics and Computation,<br />

vol. 92, pp. 49–58, 1998.<br />

[10] Y. Saad, Iterative Methods for Sparse Linear Systems, 2nd Edition.<br />

Philadelphia: SIAM, 2003.<br />

[11] G. H. Golub and C. F. Van Loan, Matrix Computations, 3rd<br />

Edition. Baltimore and London: Johns Hopkins <strong>University</strong> Press,<br />

1996.<br />

[12] H. A. van der Vorst, “Bi-CGSTAB: A fast and smoothly converging<br />

variant <strong>of</strong> Bi-CG for the solution <strong>of</strong> nonsymmetric linear<br />

systems,” SIAM J. Sci. Stat. Comput., vol. 13, no. 2, pp. 631–644,<br />

1992.<br />

[13] Y. Saad and M. H. Schultz, “GMRES: A generalized minimal<br />

residual algorithm for solving nonsymmetric linear systems,”<br />

SIAM J. Sci. Stat. Comput., vol. 7, no. 3, pp. 856–869, 1986.<br />

[14] H. A. van der Vorst, Iterative Krylov Methods for Large Linear<br />

Systems (Cambridge Monographs on Applied & Computational<br />

Mathematics). Cambridge: Cambridge <strong>University</strong> Press, 2003.<br />

[15] J. W. Demmel, S. C. Eisenstat, J. R. Gilbert, X. S. Li, and J. W. H.<br />

Liu, “A supernodal approach to sparse partial pivoting,” SIAM J.<br />

Matrix Analysis and Applications, vol. 20, pp. 720–755, 1999.<br />

[16] T. A. Davis, J. R. Gilbert, S. Larimore, and E. Ng, “A column<br />

approximate minimum degree ordering algorithm,” ACM Trans.<br />

Mathematical S<strong>of</strong>tware, vol. 30, pp. 353–376, 2004.


- 53 - 15th IGTE Symposium 2012<br />

Electromagnetic Wave Propagation Simulation<br />

in Corrugated Waveguide using Meshless Time<br />

Domain Method<br />

Soichiro Ikuno∗ , Yoshihisa Fujita∗ , Taku Itoh∗ , Susumu Nakata † and Atsushi Kamitani ‡<br />

∗Tokyo <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, 1404-1 Katakura, Hachioji, Tokyo 192-0982, Japan<br />

† Ritsumeikan <strong>University</strong>, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan<br />

‡ Yamagata <strong>University</strong>, 4-3-16 Johnan, Yonezawa, Yamagata, 992-8510, Japan<br />

E-mail: s.ikuno@ieee.org<br />

Abstract—The simulation <strong>of</strong> the electromagnetic wave propagation in complex shaped corrugated waveguide using Meshless<br />

Time Domain Method (MTDM) based on the Radial Point Interpolation method is numerically investigated. MTDM does<br />

not require finite elements or meshes <strong>of</strong> a geometrical structure as well as other meshless method. In MTDM, only the<br />

necessary information is the location <strong>of</strong> nodes, and the arrangement <strong>of</strong> the node structure <strong>of</strong> electric fields and magnetic<br />

fields. By using the simulation code for analyzing a magnetic wave propagation phenomenon in a complex shaped waveguide,<br />

the influence <strong>of</strong> node alignment on a wave propagation is numerically evaluated. Moreover, the influence <strong>of</strong> frequencies<br />

and pitch <strong>of</strong> corrugate on the dumping rate is evaluated. The results <strong>of</strong> computation show that the node alignment based<br />

on the staggered grid that is generally used in standard FDTD is suitable for the numerical calculation. In addition, the<br />

relationship between a pitch <strong>of</strong> corrugate and a frequency is numerically evaluated.<br />

Index Terms—FDTD, RPIM, wave propagation, corrugated waveguide<br />

I. INTRODUCTION<br />

In the Large Helical Device (LHD), the electron cyclotron<br />

heating device is used for plasma heating. The<br />

electrical power that is made by the gyrotron system<br />

transmits to LHD by using long corrugated waveguide.<br />

However, it is not clear that the shape <strong>of</strong> curvature <strong>of</strong> the<br />

waveguide or transmission gain <strong>of</strong> electromagnetic wave<br />

propagation theoretically.<br />

Finite Difference Time Domain (FDTD) method has<br />

provided the solution <strong>of</strong> Maxwell’s equation directly,<br />

and the method is applied for electromagnetic wave<br />

propagation simulation frequently [1], [2]. Furthermore,<br />

FDTD method has great advantages in terms <strong>of</strong> parallelization<br />

and treatment <strong>of</strong> problems and so on. However,<br />

the numerical domain should be divided into rectangle<br />

meshes if FDTD method is applied for the simulation,<br />

and it is difficult to treat the problem in the complex<br />

domain.<br />

As is well known that the meshless approach does<br />

not require finite elements or meshless <strong>of</strong> a geometrical<br />

structure. And various meshless approaches such as the<br />

diffuse element method [3], the element-free Galerkin<br />

(EFG) method [4] and the meshless local Petrov-Galerkin<br />

(MLPG) [5] method and the radial point interpolation<br />

method (RPIM) has been developed [6]. And these<br />

methods are applied to a variety <strong>of</strong> engineering fields<br />

and the fields <strong>of</strong> computational magnetics [7], [8], [9].<br />

Particularly, meshless approaches based on RPIM are<br />

applied to time dependent problems [10]. Meshless Time<br />

Domain Method (MTDM) [11] does not require finite<br />

elements or meshes <strong>of</strong> a geometrical structure as well as<br />

other meshless method. In MTDM, only the necessary<br />

information is the location <strong>of</strong> nodes, and the arrangement<br />

<strong>of</strong> the node structure <strong>of</strong> electric fields and magnetic<br />

fields. Thus, MTDM can be easily applied for the time<br />

dependent simulation <strong>of</strong> the problem in the complex<br />

shaped domain.<br />

The purpose <strong>of</strong> the present study is to develop the<br />

numerical code for analyzing electromagnetic wave propagation<br />

in corrugated waveguide, and to investigate the<br />

optimal shape <strong>of</strong> corrugated waveguide.<br />

II. SHAPE FUNCTION BASED ON RPIM<br />

First, we scatter N nodes x1, x2, ··· , xN in the target<br />

domain and the boundary, and assign the Radial Basis<br />

Function (RBF) w1(x),w2(x), ··· ,wN (x) with compact<br />

support to the nodes. Then, the solution u(x) can<br />

be expanded as<br />

u(x) =[w(x) T , p(x) T ]G −1<br />

<br />

u<br />

= φ(x)u, (1)<br />

0<br />

where the vector w(x), p(x), u(x) and φ(x) are defined<br />

by<br />

w(x) =[w1(x),w2(x), ··· ,wN (x)] T , (2)<br />

p(x) =[p1(x),p2(x), ··· ,pM(x)] T , (3)<br />

u =[u1,u2, ··· ,uN ] T , (4)<br />

φ(x) =[φ1(x),φ2(x), ··· ,φN(x)] T . (5)<br />

where φi(x) denotes a shape function on i−th node.<br />

The components <strong>of</strong> the vector p(x) are monomials <strong>of</strong><br />

the space variables. For example, p(x) T =[1,x,y] and<br />

p(x) T =[1,x,y,x2 ,xy,y2 ] are monomials for the linear<br />

and the quadratic approximation. Furthermore, the matrix<br />

G is defined by following equation.<br />

<br />

W P<br />

G =<br />

P T <br />

, (6)<br />

O


Here, the matrices W and P are defined by following<br />

equations.<br />

W =[w(x1), w(x2), ··· , w(xn)] T , (7)<br />

P =[p(x1), p(x2), ··· , p(xn)] T . (8)<br />

In the present study, following three functions are<br />

adopted for the weight function.<br />

⎧<br />

⎨ e<br />

wi(xj) =<br />

⎩<br />

−(r/c)2 − e−(R/c)2 1.0 − e−(R/c)2 , r < R,<br />

(9)<br />

0, r ≥ R,<br />

<br />

r<br />

2 wi(xj) =1.0− 6.0<br />

R<br />

<br />

r<br />

3 <br />

r<br />

4 +8.0 − 3.0<br />

(10)<br />

R R<br />

r<br />

<br />

2<br />

−0.5<br />

wi(xj) = +1.0<br />

(11)<br />

R<br />

Here, R denotes a support radius <strong>of</strong> the influence domain<br />

and c denotes a constant. Moreover, r is defined by<br />

r = |x − xi|. Under the above assumptions, the shape<br />

function and its derivative can be expressed as<br />

N<br />

M<br />

φk(x) = wi(x)gi,k + pj(x)gN+j,k, (12)<br />

∂φk<br />

∂x =<br />

∂φk<br />

∂y =<br />

N<br />

i=1<br />

N<br />

i=1<br />

i=1<br />

∂wi(x)<br />

∂x gi,k +<br />

∂wi(x)<br />

gi,k +<br />

∂y<br />

j=1<br />

M<br />

j=1<br />

M<br />

j=1<br />

∂pj(x)<br />

∂x gN+j,k, (13)<br />

∂pj(x)<br />

gN+j,k, (14)<br />

∂y<br />

where, gi,j denotes the (i, j) element <strong>of</strong> matrix G −1 .<br />

Note that the shape function satisfy the Kronecker<br />

delta function property, i.e.<br />

<br />

1, i = j,<br />

φi(xj) =<br />

(15)<br />

0, i = j.<br />

From this property the function can be expanded by using<br />

the shape function based on RPIM as follows.<br />

u(xi) =<br />

N<br />

φi(xj)ui = ui. (16)<br />

j=1<br />

In the next section, Meshless Time Domain Method is<br />

formulated by using above shape function.<br />

III. MESHLESS TIME DOMAIN METHOD<br />

In the present study, 2D electromagnetic wave propagation<br />

<strong>of</strong> TM mode is adopted for the evaluation. The<br />

governing equation <strong>of</strong> the problem is defined by<br />

ε ∂Ez<br />

∂t = −σEz + ∂Hy<br />

∂x<br />

μ ∂Hx<br />

∂t<br />

μ ∂Hy<br />

∂t<br />

∂Hx<br />

− , (17)<br />

∂y<br />

= −∂Ez , (18)<br />

∂y<br />

∂Ez<br />

= , (19)<br />

∂x<br />

- 54 - 15th IGTE Symposium 2012<br />

where, Hx and Hy denote the magnetic field <strong>of</strong> x and<br />

y component, and Ez denotes the electric field <strong>of</strong> z<br />

component. In addition, ε, σ and μ denote permitivity,<br />

permeability and electroconductivity, respectively.<br />

The system is discretized with respect to time by<br />

applying Leap Frog Method, and it is transformed to<br />

following equations.<br />

ε n+1<br />

Ez − E<br />

Δt<br />

n z<br />

+ σE n+ 1<br />

2<br />

z<br />

= ∂Hn+ 1<br />

1<br />

2<br />

2<br />

y ∂Hn+ x<br />

− ,<br />

∂x ∂y<br />

μ<br />

<br />

H<br />

Δt<br />

(20)<br />

n+1/2<br />

x − H n−1/2<br />

<br />

x = − ∂En z<br />

,<br />

∂y<br />

μ<br />

<br />

H<br />

Δt<br />

(21)<br />

n+1/2<br />

y − H n−1/2<br />

<br />

y = ∂En z<br />

.<br />

∂x<br />

(22)<br />

As we mentioned above, the shape function <strong>of</strong> RPIM<br />

has the Kronecker delta function property (15). By using<br />

the shape function and the property, the system can be<br />

discretized with respect to space as follows.<br />

E n+1<br />

<br />

ε σ<br />

<br />

z,i = α − E<br />

Δt 2<br />

n z,i<br />

⎤<br />

N 1 n+ 2<br />

+ H<br />

N 1 n+ 2 H ⎦ , (23)<br />

H<br />

H<br />

1 n+ 2<br />

x,i<br />

1 n+ 2<br />

y,i<br />

j=1<br />

y,j<br />

1<br />

2 = Hn− x,i<br />

1<br />

2 = Hn− y,i<br />

Here, φ E i and φH i<br />

∂φ H j<br />

∂x −<br />

Δt<br />

−<br />

μ<br />

Δt<br />

+<br />

μ<br />

N<br />

j=1<br />

N<br />

j=1<br />

j=1<br />

x,j<br />

E n ∂φ<br />

z,j<br />

E j<br />

∂y<br />

E n ∂φ<br />

z,j<br />

E j<br />

∂x<br />

∂φ H i<br />

∂y<br />

, (24)<br />

. (25)<br />

denote the shape function for electric<br />

field and magnetic field, and the parameter α is defined<br />

as following equation.<br />

Note that, the average <strong>of</strong> E n z,i<br />

1<br />

α = ε σ . (26)<br />

+<br />

Δt 2<br />

and En+1 z,i is adopted for<br />

E n+1/2<br />

z,i . By solving (23), (24) and (25) alternately in<br />

each time step, we can obtain the result that describes<br />

the time dependent behavior <strong>of</strong> the electromagnetic wave<br />

propagation in various shape <strong>of</strong> wave guide.<br />

In the present study, the Perfectly Matched Layer<br />

(PML) and the Perfect Magnetic Conductor (PMC) are<br />

used for absorbing boundary condition and boundary<br />

condition. The electric field <strong>of</strong> z component is divided<br />

into<br />

Ez = Ezx + Ezy, (27)<br />

where components are governed by following equations.<br />

jωεEzx + σxEzx = ∂Hy<br />

, (28)<br />

∂x<br />

jωεEzy + σyEzx = − ∂Hx<br />

. (29)<br />

∂x


Here, j denotes a imaginary unit, and ω denotes a angular<br />

frequency. By using (27), the basic governing equation<br />

<strong>of</strong> PML is written as follows.<br />

ε ∂Ezx<br />

∂t = −σxEzx + ∂Hy<br />

, (30)<br />

∂x<br />

ε ∂Ezy<br />

∂t = −σyEzy − ∂Hx<br />

, (31)<br />

∂x<br />

μ ∂Hx<br />

∂t = −σ∗ yHx − ∂Ez<br />

, (32)<br />

∂y<br />

μ ∂Hy<br />

∂t = −σ∗ yHy + ∂Ez<br />

. (33)<br />

∂x<br />

Here, μ denotes permeability. Taking into account the<br />

delta function property <strong>of</strong> the shape function based on<br />

RPIM, and discretizing respect to time using the Leap-<br />

Flog method, we can obtain following discretized equations<br />

for PML<br />

E n zx,m =<br />

E n zy,m =<br />

H<br />

H<br />

1 n+ 2<br />

x,m =<br />

1 n+ 2<br />

y,m =<br />

ε<br />

Δt<br />

ε<br />

Δt<br />

<br />

σx<br />

− E<br />

2<br />

n−1<br />

zx,m +<br />

N<br />

H<br />

i=1<br />

ε σy<br />

+<br />

Δt 2<br />

<br />

σx<br />

− E<br />

2<br />

n−1<br />

zy,m +<br />

N<br />

H<br />

i=1<br />

ε σy<br />

+<br />

Δt 2<br />

<br />

μ<br />

Δt − σ∗ <br />

1<br />

y n− 2 Hx,m −<br />

2<br />

μ<br />

Δt + σ∗ y<br />

2<br />

<br />

μ<br />

Δt − σ∗ <br />

1<br />

x n− 2 Hy,m +<br />

2<br />

μ<br />

Δt + σ∗ x<br />

2<br />

N<br />

i=1<br />

N<br />

i=1<br />

n− 1<br />

2<br />

y,i<br />

n− 1<br />

2<br />

x,i<br />

∂φ H i<br />

∂x<br />

∂φ H i<br />

∂y<br />

E n ∂φ<br />

z,i<br />

E i<br />

∂y<br />

E n ∂φ<br />

z,i<br />

E i<br />

∂x<br />

, (34)<br />

, (35)<br />

, (36)<br />

, (37)<br />

where Δt denotes a step size <strong>of</strong> time and superscript n<br />

denotes number <strong>of</strong> steps.<br />

In MTDM, nodes for electric field and magnetic field<br />

should be separated, and following four types <strong>of</strong> node<br />

alignment is adopted for accuracy evaluation as shown<br />

in Fig. 1.<br />

Fist alignment type is based on normal meshless<br />

method that means a node for electric field and magnetic<br />

field is located same position as shown in Fig. 1 (a). The<br />

node for magnetic field located a center <strong>of</strong> diagonally <strong>of</strong><br />

nodes for electric field in second type as shown in Fig.<br />

1 (b). Third type is based on staggered grid which is<br />

generally used in standard FDTD, and fourth type is a<br />

mixed version <strong>of</strong> second and third type as shown in Fig.<br />

1 (c) and (b), respectively.<br />

IV. INFLUENCE OF NODE ALIGNMENT<br />

As is well known that FDTD is an explicit method.<br />

Thus, the method must be satisfies the Courant condition,<br />

- 55 - 15th IGTE Symposium 2012<br />

(a) (b)<br />

(c) (d)<br />

Fig. 1. The schematic view <strong>of</strong> four types <strong>of</strong> node alignment <strong>of</strong> electric<br />

field and magnetic field.<br />

i.e.,<br />

Δt < 1<br />

v<br />

1<br />

<br />

2 1<br />

+<br />

Δx<br />

<br />

1<br />

Δy<br />

<br />

,<br />

2<br />

(38)<br />

where Δx and Δy denote a division size <strong>of</strong> x and y<br />

direction, and v denotes a wave speed. On the other<br />

hand, MTDM has not concept <strong>of</strong> mesh as we mentioned<br />

above. Therefore, following criterion is derived for stable<br />

calculation [11].<br />

min |xi − x|<br />

i<br />

Δt <<br />

. (39)<br />

v<br />

Here, min |xi − x| denotes a distance <strong>of</strong> neighboring<br />

i<br />

node, and the step size <strong>of</strong> time Δt is determined so as<br />

to satisfy the criterion (39).<br />

To evaluate the influence <strong>of</strong> node alignment, value <strong>of</strong><br />

the dumping rate RD is introduced.<br />

<br />

RD =<br />

<br />

Γout<br />

Γin<br />

Pz dl<br />

Pz dl<br />

(40)<br />

Here, Pz denote a pointing vector P = B × E <strong>of</strong> z<br />

component, and Γin, Γout denote a the source input line<br />

and the observation line, respectively. We can see from<br />

above equation, if the value <strong>of</strong> RD satisfies RD =1,<br />

the waveguide regards as an ideal zero loss waveguide.<br />

In addition, physical parameters for the calculation are<br />

shown in Table I


TABLE I<br />

PHYSICAL PARAMETERS FOR THE CALCULATION. HERE λ DENOTES<br />

A WAVE LENGTH.<br />

Damping rate, R D<br />

Input Wave Sine wave<br />

Amplitude 1.0 [V/m]<br />

Frequency 1.0, 15.0, 30.0 [GHz]<br />

Wave speed 3.0 × 10 8 [m/s]<br />

Distance <strong>of</strong> neighboring node 20/λ<br />

Number <strong>of</strong> layer for PML 16<br />

Dimension <strong>of</strong> PML 4<br />

Reflectance factor <strong>of</strong> PML −80 [dB]<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

(a)<br />

(b)<br />

(c)<br />

0<br />

0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />

Support radius, R<br />

Fig. 2. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />

support radius R in case <strong>of</strong> the first type node alignment as shown in<br />

Fig. 1 (a). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />

(9), (10) and (11), respectively.<br />

In Fig. 2, 3, 4 and 5, we show the influence <strong>of</strong><br />

support radius R on dumping rate RD with various<br />

weight functions. Note that a normal line waveguide is<br />

adopted for the evaluation, and same value <strong>of</strong> support<br />

radius R is adopted for electric field shape function and<br />

magnetic field shape function. In addition, a frequency<br />

<strong>of</strong> input wave is fixed as 1 [GHz]. We can see from<br />

these figures that the values <strong>of</strong> dumping rate RD are not<br />

strictly stable in case <strong>of</strong> spline weight function is adopted<br />

for the shape function construction. On the other hand,<br />

if the Gauss type weight function is adopted for weight<br />

function the value <strong>of</strong> dumping rate RD generally continue<br />

to be flat around unit value in case <strong>of</strong> all the types<br />

<strong>of</strong> node alignment. From this result, Gauss type weight<br />

function (9) is suitable for MTDM weight function, and<br />

for the rest <strong>of</strong> this Gauss type weight function is adopted<br />

for following calculation. Furthermore, we can see from<br />

these figures that node alignments <strong>of</strong> third type (see Fig.<br />

1 (c)) lead us stable calculation. Thus, in the following<br />

calculation node alignment <strong>of</strong> third type is adopted.<br />

V. WAVE PROPAGATION SIMULATION IN<br />

CORRUGATED WAVEGUIDE<br />

Let us first show the distribution <strong>of</strong> electric field in<br />

curved corrugated waveguide. The schematic view <strong>of</strong><br />

the curved corrugated waveguide which is used in the<br />

calculation is shown in Fig. 6 (a). The pitch <strong>of</strong> the<br />

- 56 - 15th IGTE Symposium 2012<br />

Damping rate, R D<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

(a)<br />

(b)<br />

(c)<br />

0<br />

0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />

Support radius, R<br />

Fig. 3. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />

support radius R in case <strong>of</strong> the second type node alignment as shown<br />

in Fig. 1 (b). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />

(9), (10) and (11), respectively.<br />

Damping rate, R D<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

(a)<br />

(b)<br />

(c)<br />

0<br />

0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />

Support radius, R<br />

Fig. 4. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />

support radius R in case <strong>of</strong> the third type node alignment as shown in<br />

Fig. 1 (c). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />

(9), (10) and (11), respectively.<br />

Damping rate, R D<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

(a)<br />

(b)<br />

(c)<br />

0<br />

0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />

Support radius, R<br />

Fig. 5. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />

support radius R in case <strong>of</strong> the fourth type node alignment as shown<br />

in Fig. 1 (d). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />

(9), (10) and (11), respectively.


y(m)<br />

0.4<br />

0.35<br />

0.3<br />

0.25<br />

0.2<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

0 0.05 0.1 0.15 0.2<br />

x(m)<br />

(a) (b)<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

E z<br />

Fig. 6. (a): The schematic view <strong>of</strong> the analytic region and (b): the<br />

distribution <strong>of</strong> electric field Ez in corrugated waveguide in case <strong>of</strong><br />

W =50mm.<br />

(a) (b)<br />

Fig. 7. Analytic models for evaluating dumping rate RD. (a): Line<br />

wave guide, (b): Curved wave guide<br />

corrugate made at regular intervals for straight part, and<br />

unequally-spaced gaps are made on a curved part as<br />

shown in Fig. 6 (a). The distribution <strong>of</strong> electric field<br />

<strong>of</strong> z component Ez in case <strong>of</strong> W = 50 mm is also<br />

shown in Fig. 6 (b). In this figure, the reflected wave<br />

is observed at the curved part <strong>of</strong> waveguide. Note that<br />

the reflected wave increase as the width <strong>of</strong> waveguide<br />

W increases. In other words, the damping rate increase<br />

as the value <strong>of</strong> W increase, and this phenomenon also<br />

relevant to wavelength and curvature <strong>of</strong> waveguide.<br />

Next, we evaluate the influence <strong>of</strong> frequencies and<br />

pitch <strong>of</strong> corrugate on the dumping rate RD. The analytic<br />

models for evaluating dumping rate RD are line corrugated<br />

waveguide (see Fig. 7 (a)) and curved corrugated<br />

waveguide (see Fig. 7 (b)). The pitch <strong>of</strong> corrugate shape<br />

is Cλ where C denotes a constant, and the pitch <strong>of</strong><br />

the corrugate made at regular intervals for straight part<br />

and unequally-spaced gaps are made on a curved part<br />

in curved corrugated waveguide as well as previous<br />

evaluation.<br />

By using the analytic models, the influence <strong>of</strong> frequen-<br />

-2.5<br />

- 57 - 15th IGTE Symposium 2012<br />

Damping rate, R D<br />

2<br />

1.5<br />

1<br />

0.5<br />

1GHz<br />

15GHz<br />

30GHz<br />

0<br />

0λ 0.2λ 0.4λ 0.6λ 0.8λ 1λ 1.2λ<br />

Pitch <strong>of</strong> Corrugated waveguide<br />

Fig. 8. The influence <strong>of</strong> frequencies and pitch <strong>of</strong> corrugate on dumping<br />

rate RD in the line waveguide.<br />

Damping rate, R D<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

1GHz<br />

15GHz<br />

30GHz<br />

0<br />

0λ 0.2λ 0.4λ 0.6λ 0.8λ 1λ 1.2λ<br />

Pitch <strong>of</strong> Corrugated waveguide<br />

Fig. 9. The influence <strong>of</strong> frequencies and pitch <strong>of</strong> corrugate on dumping<br />

rate RD in the curved waveguide.<br />

cies and pitch <strong>of</strong> corrugate D on dumping rate RD in the<br />

corrugated waveguide is evaluated, and the results are<br />

shown in Fig. 8 and 9. In the line waveguide, the magnetic<br />

wave propagates stationary in case <strong>of</strong> 0.0λ


• The values <strong>of</strong> dumping rate RD are not strictly stable<br />

in case <strong>of</strong> spline weight function is adopted for the<br />

shape function construction.<br />

• On the other hand, if the Gauss type weight function<br />

is adopted for weight function the value <strong>of</strong> dumping<br />

rate RD generally continue to be flat around unit<br />

value in case <strong>of</strong> all the types <strong>of</strong> node alignment.<br />

• The node alignment based on the staggered grid that<br />

is generally used in standard FDTD should be used<br />

for MTDM simulation.<br />

• The reflected wave increase as the width <strong>of</strong> waveguide<br />

W increases. In other words, the damping rate<br />

increase as the value <strong>of</strong> W increase, and this phenomenon<br />

also relevant to wavelength and curvature<br />

<strong>of</strong> waveguide.<br />

• In the line corrugated waveguide, the magnetic wave<br />

propagated stationary in case <strong>of</strong> 0.0λ


- 59 - 15th IGTE Symposium 2012<br />

Optimization <strong>of</strong> Permanent Magnet Linear Actuator<br />

for Braille Screen<br />

*Ivan S. Yatchev, *Iosko S. Balabozov, *Krastio L. Hinov, *Vultchan T. Gueorgiev and<br />

**Dimitar N. Karastoyanov<br />

* Faculty <strong>of</strong> Electrical Engineering, Technical <strong>University</strong> <strong>of</strong> S<strong>of</strong>ia, 8, Kliment Ohridsky Blvd., 1000 S<strong>of</strong>ia, Bulgaria<br />

** Institute <strong>of</strong> Information and Communication Technologies, Bulgarian Academy <strong>of</strong> Sciences, Acad. G. Bonchev St.,<br />

Block 2, 1113 S<strong>of</strong>ia, Bulgaria<br />

E-mail: yatchev@tu-s<strong>of</strong>ia.bg<br />

Abstract—Permanent magnet linear actuator intended for driving a needle in Braille screen has been optimized. The mover<br />

<strong>of</strong> the actuator is a combined one - it consists <strong>of</strong> permanent magnet and ferromagnetic discs. The optimization is carried out<br />

with respect to minimal magnetomotive force ensuring required minimum electromagnetic force on the mover. The<br />

optimization factors are dimensions <strong>of</strong> the cores and mover parts under additional constraint for overall dimension <strong>of</strong> the<br />

actuator. Finite element analysis, response surface methodology and design <strong>of</strong> experiments have been employed for the<br />

optimization. The obtained optimal solution is verified again by finite element analysis.<br />

Index Terms—actuators, Braille screen, optimization, secondary models.<br />

I. INTRODUCTION<br />

Application <strong>of</strong> permanent magnets in the constructions<br />

<strong>of</strong> different actuators has been intensively increased in<br />

recent years. One <strong>of</strong> the reasons for their application is<br />

the possibility for development <strong>of</strong> energy efficient<br />

actuators. New constructions <strong>of</strong> permanent magnet<br />

actuators are employed for different purposes. One such<br />

purpose is the facilitation <strong>of</strong> perception <strong>of</strong> images by<br />

visually impaired people using the so called Braille<br />

screens. Recently, different approaches have been utilized<br />

for the actuators used to move Braille dots [1]-[6].<br />

Typical view <strong>of</strong> a Braille screen is shown in Fig. 1.<br />

Figure 1: Braille screen with needles (dots) driven by<br />

linear actuators.<br />

In the present paper, recently developed permanent<br />

magnet linear actuator for driving a needle (dot) in Braille<br />

screen is optimized using response surface methodology<br />

(RSM) and design <strong>of</strong> experiments (DoE).<br />

The nature <strong>of</strong> the main application puts very firm<br />

requirements about the driver <strong>of</strong> the Braille screen<br />

needles. These requirements can be summarized as<br />

follows:<br />

- firm dimension constraints-especially in radial<br />

direction: outer diameter <strong>of</strong> the driver 3-6 mm;<br />

- holding force 02-05 N;<br />

- minimum energy consumption.<br />

The minimum energy consumption can be achieved by<br />

polarized construction <strong>of</strong> the driving electromagnet<br />

actuator because no power will be consumed at steady<br />

state.<br />

II. ACTUATOR CONSTRUCTION<br />

The principal actuator construction is shown in Fig. 2.<br />

The moving part is axially magnetized cylindrical<br />

permanent magnet with two ferromagnetic discs on both<br />

sides.<br />

The two coils are connected in series in such way that<br />

they create magnetic flux <strong>of</strong> opposite directions in the<br />

region <strong>of</strong> the permanent magnet. In this way, depending<br />

on the polarity <strong>of</strong> the power supply, the permanent<br />

magnet will move either up or down. When motion up is<br />

needed, the upper coil should create flux in the air gap<br />

coinciding with the flux <strong>of</strong> the permanent magnet. Lower<br />

coil at the same time will create opposite flux and the<br />

permanent magnet will move in upper direction. When<br />

motion down is needed, the polarity <strong>of</strong> the power supply<br />

is reversed. The motion is transferred to the Braille dot<br />

using the non-magnetic shaft.<br />

Figure 2: Principal construction <strong>of</strong> the studied actuator.<br />

1–upper shaft; 2–upper core; 3–outer core; 4–upper coil; 5-upper disc;<br />

6–magnet; 7–lower disc; 8–lower coil; 9–lower core; 10–lower shaft


The actuator features increased energy efficiency, as<br />

the power supply is needed only during the switching<br />

between the two end positions <strong>of</strong> the mover. In each end<br />

position, the permanent magnet creates holding force,<br />

which keeps the mover in this position.<br />

III. STATIC FORCE CHARACTERISTICS<br />

Static magnetic field <strong>of</strong> the actuator is modeled using<br />

the finite element method and the program FEMM [7].<br />

Axisymmetric model is adopted as the actuator features<br />

rotational symmetry. The electromagnetic force acting on<br />

the mover is obtained using the weighted stress tensor<br />

approach.<br />

Typical static force characteristics <strong>of</strong> the actuator are<br />

shown in Fig. 3. The stroke <strong>of</strong> the actuator, denoted with<br />

x, is set to zero when the shaft is situated symmetrically<br />

between the upper and lower cores.<br />

c1=-1,c2=1<br />

c1=1,c2=-1<br />

1.2<br />

1<br />

F, N<br />

c1=0,c2=0<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

-0.6 -0.4 -0.2 -0.2 0 0.2 0.4 0.6<br />

-0.4<br />

-0.6<br />

-0.8<br />

-1<br />

-1.2<br />

x, mm<br />

Figure 3: Typical force-stroke characteristics <strong>of</strong> the<br />

studied construction.<br />

c1 and c2 show the direction <strong>of</strong> MMF in both coils <strong>of</strong> the construction.<br />

c1=-1, c2=1 – upward movement <strong>of</strong> the shaft;<br />

c1=1, c2=-1 –downward movement <strong>of</strong> the shaft; c1=0, c2=0 –non<br />

energized coils, the force is due to the permanent magnet only.<br />

The upper and lower curves in Fig. 3 represent the<br />

force when the shaft is moving in upward and downward<br />

direction. The middle curve shows the force when no<br />

current flows in the coils. In that case the force is due to<br />

the magnetic flux <strong>of</strong> the permanent magnet. The<br />

characteristic is symmetrical towards the origin <strong>of</strong> the<br />

force-stroke coordinate system and its final values (when<br />

the shaft is close to upper or lower cores) is called<br />

holding force – Fh. This is the only force that keeps the<br />

shaft in both stable position – upper and lower and it<br />

should resist to the force created by the touching fingers<br />

and the mover’s own weight.<br />

The starting force – Fs is the initial force that acts on<br />

the shaft when it is in its final upper position and both<br />

coils are energized in such a manner to create force in<br />

downward direction or the opposite – the shaft is in final<br />

lower position and force is acting upward.<br />

The construction should guarantee overcoming <strong>of</strong> the<br />

holding force, created by the permanent magnet, when<br />

the coils are properly energized.<br />

The upper coil excites in the upper core magnetic flux<br />

that is equal or bigger than the flux <strong>of</strong> the permanent<br />

magnet but contrary directed. At the same time, the flux<br />

excited by the lower coil is coincident with the flux <strong>of</strong> the<br />

- 60 - 15th IGTE Symposium 2012<br />

permanent magnet.<br />

The construction minimizes the requirements towards<br />

the starting force and guarantees that it will start moving<br />

even for small value <strong>of</strong> the starting force if only it<br />

exceeds the own weight <strong>of</strong> the shaft.<br />

IV. SECONDARY MODELS<br />

Finite element method, DoE and RSM have been used<br />

for creation <strong>of</strong> the secondary models. Full factorial design<br />

has been applied.<br />

The fixed geometric parameters are shown in Fig. 4 and<br />

their values are given in Table 1.<br />

Figure 4. Fixed parameters <strong>of</strong> the actuator.<br />

TABLE I<br />

FIXED GEOMETRIC PARAMETERS<br />

Dimension<br />

Designation<br />

(in Fig. 4)<br />

Value<br />

(in mm)<br />

Outer core diameter D 5<br />

Outer magnet diameter Dm 2<br />

Inner coil diameter Dw1 2.4<br />

Outer coil diameter Dw2 4<br />

Shaft diameter Ds 1<br />

Inner core diameter Dc 1.2<br />

Core thickness hc 2<br />

The varied parameters are:<br />

- The length <strong>of</strong> the upper and lower cores - hw,<br />

- The axial dimension <strong>of</strong> the ferromagnetic disks -<br />

hd,<br />

- The length <strong>of</strong> the permanent magnet - hm,<br />

- The current density in the coils - J.<br />

The varied parameters with geometric representations<br />

are shown in Fig. 5.


Figure 5: Varied geometric parameters <strong>of</strong> the actuator.<br />

The DoE methodology has been used for varied<br />

parameters to create polynomial secondary models. For<br />

each combination <strong>of</strong> values <strong>of</strong> the varied parameters a<br />

family <strong>of</strong> static force-stroke characteristics was obtained.<br />

Based on them secondary models for holding force Fh,<br />

starting force Fs and ampere-turns <strong>of</strong> the coils – NI have<br />

been made.<br />

The precision <strong>of</strong> secondary models has been estimated<br />

by the relative error between the value obtained by te<br />

secondary model and corresponding value obtained by<br />

the FEM model. The difference between secondary and<br />

FEM models for 27 calculation points is given in Fig.6.<br />

relative error, %<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

-0.05<br />

-0.1<br />

-0.15<br />

-0.2<br />

-0.25<br />

0 5 10 15 20 25 30<br />

number <strong>of</strong> calculation points<br />

Figure 6: Relative error between secondary and FEM<br />

models.<br />

V. OPTIMIZATION<br />

The objective function is minimal magnetomotive force<br />

<strong>of</strong> the coils. The optimization parameters are dimensions<br />

<strong>of</strong> the permanent magnet, ferromagnetic discs and the<br />

cores. As constraints, minimal electromagnetic force<br />

acting on the mover, minimal starting force and overall<br />

outer diameter <strong>of</strong> the actuator have been set. The<br />

Fs<br />

Fh<br />

- 61 - 15th IGTE Symposium 2012<br />

optimization is carried out using sequential quadratic<br />

programming.<br />

The canonic form <strong>of</strong> the optimization problem is:<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

where:<br />

- NI — ampere-turns — minimizing energy<br />

consumption with satisfied force requirements;<br />

- Fh — holding force — mover (shaft) in upper<br />

position, no current in the coils;<br />

- Fs — starting force — mover (shaft) in upper or<br />

lower position and energized coils;<br />

- J — coils current density;<br />

- hw, hm, hd—geometric dimensions according to<br />

the sketch in Fig. 6.<br />

Minimization <strong>of</strong> magneto-motive force NI is direct<br />

subsequence <strong>of</strong> the requirement for minimum energy<br />

consumption.<br />

Constraints for Fs and Fh have already been discussed.<br />

The lower bounds for the dimensions are imposed by the<br />

manufacturing limits and the upper bound for the current<br />

density is determined by the thermal balance <strong>of</strong> the<br />

actuator.<br />

The radial dimensions <strong>of</strong> the construction are directly<br />

dependent by the outer diameter <strong>of</strong> the core – D which<br />

fixed value was discussed earlier. The influence <strong>of</strong> those<br />

parameters on the behavior <strong>of</strong> the construction have been<br />

studied in previous work [8] that make clear that there is<br />

no need radial dimensions to be included in the set <strong>of</strong><br />

optimization parameters.<br />

The optimization is carried out by sequential quadratic<br />

programming. The optimization results are as follows:<br />

<br />

<br />

<br />

<br />

<br />

The optimal parameters were set as input values to the<br />

FEM model. The force-stroke characteristics <strong>of</strong> the<br />

optimal actuator is shown in Fig.7 and Fig.8.<br />

Fh, N<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

-0.1<br />

-0.2<br />

-0.3<br />

Fh - holding force in lower position <strong>of</strong> the shaft<br />

Fh - holding force in upper position <strong>of</strong> the shaft<br />

-0.4 -0.3 -0.2 -0.1 0<br />

x, mm<br />

0.1 0.2 0.3 0.4<br />

Figure 7: Force-stroke characteristic <strong>of</strong> the optimal<br />

actuator. The force is created by the permanent magnet<br />

only (no current in the coils).


F, N<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

F - final force (shaft imoved in lower position, coils still energized)<br />

Fs - starting force (shaft in upper position)<br />

Fh - force switches to the holding force when current is ceased<br />

-0.4 -0.3 -0.2 -0.1 0<br />

x, mm<br />

0.1 0.2 0.3 0.4<br />

Figure 8. Force-stroke characteristic <strong>of</strong> the optimal<br />

actuator. Coils are energized. The shaft is displaced from<br />

final upper to final lower position.<br />

In Figs. 9 and 10, the magnetic field <strong>of</strong> the optimal<br />

actuator is plotted for two cases.<br />

Figure 9: Magnetic field <strong>of</strong> the optimal actuator with<br />

shaft in upper position and coils energized to create<br />

downward force.<br />

Figure 10: Magnetic field <strong>of</strong> the optimal actuator with no<br />

current in the coils.<br />

- 62 - 15th IGTE Symposium 2012<br />

The force constraints for Fs and Fh are active which<br />

can be expected when minimum energy consumption is<br />

required. The active constraint for hw is also expected<br />

because longer upper and lower cores size which<br />

respectively means longer coils will increase the leakage<br />

coil flux and corrupted coil efficiency.<br />

VI. CONCLUSION<br />

The employed approach has confirmed its robustness<br />

for solution to the optimization problem for the actuator.<br />

The obtained optimal solution satisfies the specific<br />

requirements for actuators for Braille screen.<br />

[1]<br />

REFERENCES<br />

Nobels T., F. Allemeersch, K. Hameyer, “Design <strong>of</strong> a high power<br />

density electromagnetic actuator for a portable Braille display.“<br />

Int. Conf. EPE-PEMC 2002, Dubrovnik & Cavtat, 2002.<br />

[2] Kawaguchi Y., K. Ioi, Y. Ohtsubo, “Design <strong>of</strong> new Braille display<br />

using inverse principle <strong>of</strong> tuned mass damper.” Proc.<strong>of</strong> SICE<br />

annual conference 2010, Taipei, Taiwan, Aug. 18-21, pp. 379-383.<br />

[3] Kwon, H.J., Lee, S.W., Lee, S. Braille code display device with a<br />

PDMS membrane and thermopneumatic actuator. IEEE<br />

[4]<br />

international conference on micro electro mechanical systems<br />

(XXI), MEMS, Tucson, 2008, pp. 527-530.<br />

Chaves, D., Peixoto, I., Lima, A., Vieira, M., de Araujo, C.<br />

Microtuators <strong>of</strong> SMA for Braille display system. IEEE<br />

international workshop on medical measurements and<br />

[5]<br />

applications, MeMeA Cetraro, Italy, May 20-30, 2009, pp. 64-68.<br />

Hernandez, H., Preza, E., Velazquez, R. Characterization <strong>of</strong> a<br />

piezoelectric ultrasonic linear motor for braille displays.<br />

Electronics, robotics and automotive mechanics conference<br />

CERMA Cuernavaca, Mexico, Sep. 22-25, 2009, pp. 402-407.<br />

[6] Cho, H.C., Kim, B.S., Park, J.J., Song, J.B. (2006) Development<br />

<strong>of</strong> a Braille display using piezoelectric linear motors.<br />

[7]<br />

International joint conference SICE-ICASE, 2006, Busan, Korea,<br />

Oct. 18-21, pp. 1917-1921.<br />

D. Meeker, Finite element method magnetics version 3.4, 2005.<br />

[8] Yatchev I., K. Hinov, V. Gueorgiev, D. Karastoyanov, I.<br />

Balabozov, Force characteristics <strong>of</strong> an electromagnetic actuator<br />

for Braille screen, <strong>Proceedings</strong> <strong>of</strong> Thirteenth International<br />

Conference on Electrical Machines, Drives and Power Systems<br />

ELMA 2011, 21-22 October 2011, Varna, Bulgaria, pp. 338-341.


- 63 - 15th IGTE Symposium 2012<br />

3D Finite Element Analysis <strong>of</strong> Induction Heating<br />

System for High Frequency Welding<br />

*Ilona I. Iatcheva, *Georgi H. Gigov , *Georgi C. Kunov and *Rumena D. Stancheva<br />

*Technical <strong>University</strong> <strong>of</strong> S<strong>of</strong>ia, Kliment Ohridski 8, S<strong>of</strong>ia 1000, Bulgaria<br />

E-mail: iiach@tu-s<strong>of</strong>ia.bg<br />

Abstract—The aim <strong>of</strong> the work is investigation <strong>of</strong> induction heating system used for longitudinal, high frequency pipe<br />

welding. The problem was considered as 3D coupled electromagnetic and temperature field problem and has been solved<br />

using finite element method and COMSOL 4.2 s<strong>of</strong>tware package. Time harmonic electromagnetic and transient thermal fields<br />

have been studied in order to estimate system efficiency and factors influencing on the quality <strong>of</strong> the welding process and<br />

required energy.<br />

Index Terms— finite element method, high frequency welding, 3D coupled field analysis.<br />

small scale, carbon steel tubes and pipes. It consists <strong>of</strong><br />

spiral inductor, which induced a voltage across the edges<br />

<strong>of</strong> the moving open pipe material. The induced voltage<br />

causes high frequency currents, concentrated on the<br />

surface layer due to the skin and proximity effects. The<br />

currents flow along the two edges in opposite directions<br />

in so called “V”-zone (Fig.2) to the point where they<br />

meet, causing rapid heating <strong>of</strong> the metal and surface<br />

melting. The weld squeeze rolls are used to apply<br />

pressure, which forces the heated metal into contact and<br />

forms welding bond.<br />

I. INTRODUCTION<br />

The induction heating is widely used in the heat<br />

treatment <strong>of</strong> conducting details due to its advantages:<br />

high quality and efficiency <strong>of</strong> the heating processes, good<br />

accuracy in heating <strong>of</strong> certain zones in a short time and<br />

clean operating conditions [ 1]-[4].<br />

The aim <strong>of</strong> the present research is investigation <strong>of</strong><br />

induction heating system used for high frequency<br />

longitudinal pipe welding. The main task is to determine<br />

optimal factors and parameters influencing on quality <strong>of</strong><br />

the welding process and required energy: welding<br />

frequency, welding speed, ‘vee’ angle, presence <strong>of</strong> the<br />

ferrite impeder (inner and outer), tube thickness and etc.<br />

The solution <strong>of</strong> the problem is based on the precise 3Dmodelling<br />

and FEM analysis <strong>of</strong> the electromagnetic and<br />

thermal processes, taking place in the investigated system.<br />

Detailed determination <strong>of</strong> the electromagnetic and<br />

temperature field distribution and its dependence on the<br />

mentioned above parameters is important condition for<br />

effective control and management <strong>of</strong> the welding process.<br />

II. INVESTIGATED INDUCTION HEATING SYSTEM<br />

The principal geometry <strong>of</strong> the investigated system is<br />

shown in Fig.1.<br />

squeeze<br />

point <strong>of</strong> roll<br />

closure<br />

direction <strong>of</strong><br />

movement<br />

impeder<br />

core<br />

inductor<br />

steel pipe<br />

cooling water<br />

welded<br />

bond<br />

Figure 1: Geometry <strong>of</strong> the investigated induction system<br />

The system is designed for high frequency welding <strong>of</strong><br />

Figure 2: In the “V”-zone HF currents flow along the two edges in<br />

opposite directions.<br />

The system includes also inner ferrite impeder, which<br />

concentrates magnetic flux and improves the welding<br />

efficiency. The cooling water flows inside the inductor<br />

and impeder for system cooling.<br />

As it can be seen from the geometry in Fig.1 the<br />

impeder is located not along the pipe axe, but moved<br />

closer to the welded region - i.e. the system is not<br />

axesymmetric and has to be analysed as three<br />

dimensional.<br />

The system has been investigated and electromagnetic<br />

and thermal processes have been modelled for the<br />

parameters shown in Table I.<br />

TABLE I<br />

PARAMETERS OF THE SYSTEM<br />

Parameter Value<br />

Applied current I 1000 A<br />

Voltage U 500 V<br />

cos 0,1<br />

Frequency f 200kHz 500kHz<br />

End heating temperature 13001450 0 C<br />

Cooling water<br />

temperature<br />

40 0 C


III. MATHEMATICAL MODEL OF THE COUPLED FIELD<br />

PROBLEM<br />

Mathematical modeling <strong>of</strong> the processes in the<br />

investigated system for high frequency welding are based<br />

on the analysis <strong>of</strong> coupled – electromagnetic and<br />

temperature field distribution in the considered device.<br />

As it has been already mention the geometry <strong>of</strong> the object<br />

is a complex, nonsymmetric and electromagnetic and<br />

thermal field have to be studied as three-dimensional.<br />

The present work deals with modeling <strong>of</strong> the 3D time<br />

harmonic electromagnetic field. The eddy current losses,<br />

obtained in electromagnetic field analysis are field<br />

sources in modeling <strong>of</strong> the transient thermal field<br />

The electromagnetic field problem has been studied not<br />

only in the system elements, but also in wide buffer zone<br />

around the devise. It helps to define correct boundary<br />

conditions in field modeling. In Fig.3 is shown<br />

investigated region, used in electromagnetic field<br />

modeling. It includes domains: 1- inductor; 2impeder;<br />

3- welded pipe; 4- cooling water; 5- buffer<br />

zone with air.<br />

1<br />

4<br />

Figure 3: Investigation domains<br />

2<br />

5<br />

3<br />

Electromagnetic field distribution can be described with<br />

equations (1) and (2):<br />

<br />

<br />

-1<br />

A<br />

<br />

( A)<br />

J e <br />

(1)<br />

t<br />

<br />

E j<br />

A V<br />

(2)<br />

where A is magnetic vector potential , J is current<br />

density, E is electrical strength , V is scalar electric<br />

potential, is electric conductivity and is magnetic<br />

permeability.<br />

The boundary conditions are A 0<br />

<br />

for the buffer zone<br />

boundaries.<br />

The time varying electromagnetic field produces eddy<br />

currents:<br />

- 64 - 15th IGTE Symposium 2012<br />

<br />

J jA<br />

(3)<br />

and corresponding Joule losses – source <strong>of</strong> the heating in<br />

the region:<br />

<br />

1<br />

*<br />

[ ] JJ<br />

Q <br />

2<br />

<br />

(4)<br />

The transient thermal field is modeled by equation:<br />

T<br />

. C (<br />

kT<br />

) Q (5)<br />

t<br />

where k is thermal conductivity , T is temperature, is<br />

density, C is heat capacity and Q is heat source, obtained<br />

in electromagnetic field analysis.<br />

IV. FEM ANALYSIS - 3D COUPLED PROBLEM<br />

Numerical simulation <strong>of</strong> the coupled - electromagnetic<br />

and thermal fields was carried out using FEM and<br />

COMSOL 4.2 package [4].<br />

In Fig.4 is shown investigated system with the buffer<br />

zone around it and Fig.5 presents FEM mesh, used in<br />

solving the problem.<br />

Figure 4: Investigated system with the buffer zone around it.<br />

Figure 5: FEM mesh.


Some results, obtained in solving the problem for<br />

frequency 300 KHz are shown in Fig. 6, Fig. 7, Fig. 8,<br />

Fig. 9 and Fig. 10.<br />

The analysis <strong>of</strong> electromagnetic field distribution<br />

indicates that maximal value <strong>of</strong> the magnetic flux density<br />

is about 0.19T. These values are reached in the “V” zone<br />

and around the inductor. Two different cross sections<br />

illustrate distribution <strong>of</strong> the magnetic flux density in the<br />

system in Fig.6 and Fig.7.<br />

Figure 6: Distribution <strong>of</strong> magnetic flux density in the<br />

investigated region, f= 300 KHz.<br />

Figure 7: Distribution <strong>of</strong> magnetic flux density along the<br />

“V” zone, f= 300 KHz.<br />

The results, obtained for current density distribution in<br />

the entire region are shown in Fig.8. Two specific for the<br />

problem cross sections - around “point <strong>of</strong> closure” and<br />

spiral inductors are picking out. The maximal value is<br />

- 65 - 15th IGTE Symposium 2012<br />

1,13x10 9 A/m 2 . Current density distribution around the<br />

“point <strong>of</strong> closure” is shown in Fig.9 and in Fig.10 around<br />

the spiral inductor.<br />

.<br />

Figure 8: Current density distribution in the entire region<br />

Figure 9: Current density distribution around “point <strong>of</strong><br />

closure”<br />

V. CONCLUSION<br />

3D-coupled electromagnetic and temperature field<br />

problem and has been solved using finite element method<br />

and COMSOL 4.2 s<strong>of</strong>tware package in order to<br />

investigate induction heating system for high frequency


pipe welding. The obtained temperature value around the<br />

“point <strong>of</strong> closure” is about 1400 0 C.<br />

REFERENCES<br />

[1] R.Baumer, Y.Adonyi ”Transient High-Frequency Welding<br />

[2]<br />

Simulations <strong>of</strong> Dual-Phase Steels”, Welding Journal, October<br />

2009, vol. 88, pp. 193 – 201<br />

D.Kim, T. Kim, Y.Park, K.Sung, M.Kang, C.Kim, I.Lee and<br />

S.Rhee, “Estimation <strong>of</strong> weld quality in high-frequency electric<br />

resistance welding”, Welding Journal, March 2007, pp. 27 – 31.<br />

[3] A. Shamov, I. Lunin, V. Ivanov, High frequency metal welding,<br />

[4]<br />

Leningrad, ‘Mashinostroenie”1977 (In Russian).<br />

COMSOL Version 4.2 User’s Guide, 2011.<br />

- 66 - 15th IGTE Symposium 2012


- 67 - 15th IGTE Symposium 2012<br />

Optimization Algorithms in the View <strong>of</strong> State<br />

Space Concepts<br />

M. Neumayer∗ , D. Watzenig∗ , G. Steiner∗ , and B. Brandstätter †<br />

∗Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing, <strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>,<br />

Kopernikusgasse 24/4, A-8010 <strong>Graz</strong>, Austria, E-mail: neumayer@TU<strong>Graz</strong>.at<br />

† Elin Motoren GmbH, Elinmotorenstrasse 1, A-8160 Preding/Weiz, Austria<br />

Abstract—The working principles <strong>of</strong> optimization algorithms <strong>of</strong>fer several characteristics which naturally arise in state<br />

estimation, or more generally when dealing with state space systems. In this paper we will treat similarities between the<br />

two disciplines and show how concepts <strong>of</strong> state estimation, including the incorporation <strong>of</strong> model uncertainty information,<br />

can be used in optimization.<br />

Index Terms—optimization, state space methods<br />

I. INTRODUCTION<br />

Numerical optimization is generally referred to as<br />

solving a problem <strong>of</strong> form [1]<br />

x ∗ = argminΨ(x) (1)<br />

s.t. C(x) ≤ 0, (2)<br />

where Ψ:R N → R 1 is called the objective function<br />

and the vector x ∈ R N contains the variables <strong>of</strong> interest.<br />

Possible constraints on the vector x are formulated by<br />

the vectorial function C(x) as a set <strong>of</strong> equalities and<br />

inequalities. By this the space <strong>of</strong> the feasible solutions<br />

becomes a subspace <strong>of</strong> R N .<br />

An enormous variety <strong>of</strong> algorithms and solution<br />

strategies for such problems has been the output <strong>of</strong><br />

research activities in the past years. Yet it has to be<br />

mentioned that the presented form <strong>of</strong> the optimization<br />

problem is only a part <strong>of</strong> actual problems. I.e. the<br />

discipline <strong>of</strong> multi objective optimization asks for an<br />

optimal solution given several objective functions Ψi [2].<br />

Such formulations are <strong>of</strong> importance in multi physical<br />

problem scenarios. Another important class are robust<br />

optimization approaches which aim for a stable solution<br />

under the scenario <strong>of</strong> uncertainty or tolerances in the<br />

objective function [3]. Thus, existing manufacturing<br />

tolerances can be incorporated to optimization based<br />

design process. A general distinction between the number<br />

<strong>of</strong> the different (iterative) optimization algorithms used<br />

today can be made by separating them into deterministic<br />

and stochastic methods.<br />

Deterministic methods most <strong>of</strong>ten make use <strong>of</strong> gradient<br />

and curvature information <strong>of</strong> the objective function Ψ<br />

in order to efficiently detect the minimum. Hereby efficiency<br />

is typically defined by the number <strong>of</strong> evaluations<br />

<strong>of</strong> Ψ. Classical first and second order deterministic methods<br />

try to minimize Ψ by determining a descent direction<br />

out <strong>of</strong> gradient or gradient and Hessian information. I.e.<br />

the classical steepest descent method uses the iteration<br />

xk+1 = xk − sg(xk), (3)<br />

to find x∗ in a step by step approach. Hereby<br />

g(xk) = ∇Ψ is defined as the gradient <strong>of</strong> Ψ with<br />

respect to the elements <strong>of</strong> xk. Due to this nature the<br />

result <strong>of</strong> deterministic methods can be strongly affected<br />

by the starting point x0. Also local minima <strong>of</strong> Ψ will<br />

result in a termination <strong>of</strong> the algorithm before the global<br />

minima is found. Stochastic methods rely on some<br />

sort <strong>of</strong> randomness to explore the parameter space in<br />

search for the minimum. A main difference with respect<br />

to deterministic methods is their ability to overcome<br />

local minima <strong>of</strong> the objective function. For stochastic<br />

optimization algorithms it has become common to let<br />

them run for a certain time or a number <strong>of</strong> evaluations <strong>of</strong><br />

Ψ. Of course, hybrid algorithms have been proposed to<br />

combine the advantages <strong>of</strong> the two classes <strong>of</strong> algorithms.<br />

Although we have only pointed out the basics <strong>of</strong> some<br />

fundamental concepts <strong>of</strong> optimization we can observe,<br />

that in all algorithms some kind <strong>of</strong> evolution from the<br />

vector xk to the vector xk+1 occurs. Modern system<br />

theory uses so called state space models as a unified<br />

framework to describe dynamical systems [4]. The general<br />

form <strong>of</strong> a discrete time, nonlinear, time-variant state<br />

space model is given by<br />

xk+1 = F k(xk)+Bk(uk)+wk, (4)<br />

yk = Hk(xk)+vk, (5)<br />

where F k : R N → R N presents the system dynamics,<br />

Bk : R L → R N describes the affect onto xk+1 due<br />

to an input term uk ∈ R L , and Hk : R N → R M<br />

describes a measurement process. The terms wk ∈ R N<br />

and vk ∈ R M are referred to as process noise and<br />

measurement noise. We observe some similarities<br />

between the state space concept and the topics discussed<br />

in concern with optimization. Yet we have to say that<br />

state space methods and models follow a quite organized


scheme.<br />

In this paper we will point out similarities between optimization<br />

techniques and state space models and methods.<br />

The paper is structured as follows. In section II we<br />

give a short introduction about the state space concept.<br />

In section II we review optimization techniques in the<br />

sense <strong>of</strong> state space methods and present similarities<br />

as well as mathematical tools potential for a general<br />

description. Section IV lists several state space techniques<br />

which relate with topics <strong>of</strong> optimization and thus could<br />

potentially be used to improve optimization. Finally we<br />

present an exemplary hybrid optimization scheme which<br />

we derive from the state space view and demonstrate its<br />

behavior using some <strong>of</strong> the suggested approaches.<br />

II. THE STATE SPACE CONCEPT IN MORE DETAIL<br />

uk Bk<br />

wk<br />

xk+1<br />

z −1<br />

F k<br />

Hk<br />

Fig. 1. Diagram <strong>of</strong> the state space model given by equation (4) and (5).<br />

Figure 1 depicts the structure <strong>of</strong> a state space model<br />

given by equations (4) and (5). The core purpose <strong>of</strong> a<br />

state space model is to describe the evolution <strong>of</strong> the<br />

state vector x over the time or the discrete time steps,<br />

respectively. As can be observed by the equations (4)<br />

and (5) or by figure 1, this evolution is determined<br />

by a deterministic drift due to the dynamics <strong>of</strong> F k<br />

and the input uk and a stochastic diffusion due to<br />

the process noise wk. The system is referred to be an<br />

autonomous system if B is zero. The function Hk<br />

provides a deterministic measure about the internal state.<br />

In addition the measurement noise vk acts as an additive<br />

disturber. We can already observe that the state space<br />

concept is able to provides several aspects which we<br />

pointed out in the introductional part about optimization<br />

algorithms in a natural way.<br />

It should be mentioned that a state space model is<br />

referred to be linear if all system components are matrices.<br />

This class is <strong>of</strong> large importance as many technical<br />

processes can be described by this.<br />

A. State Space Methods<br />

In this part <strong>of</strong> section II we want to give a<br />

short introduction about two important disciplines in<br />

association with state space models. These are state<br />

estimation and state control.<br />

State estimation is referred to the task to find an<br />

estimate ˆxk <strong>of</strong> the state xk using the measurements y k ,<br />

the input uk and the model.<br />

vk<br />

y k<br />

- 68 - 15th IGTE Symposium 2012<br />

State control is a special kind <strong>of</strong> feedback control<br />

where the system input uk is formed as a function <strong>of</strong><br />

the state vector xk. A notable controller out <strong>of</strong> this class<br />

is the dead beat control system. This approach enables<br />

a control system to reach the steady state within a finite<br />

number <strong>of</strong> iterations.<br />

III. OPTIMIZATION AND STATE SPACE METHODS<br />

Looking onto all points discussed so far we can consider<br />

a relation between the measurement function H k<br />

and the objective function Ψ. I.e. for a design problem<br />

where one is interested to meet a desired output yd, Ψ<br />

could be <strong>of</strong> form<br />

Ψ(x) =(H k(x) − y d ) T W (Hk(x) − y d ) . (6)<br />

Hereby the positive definite matrix W presents a weighting<br />

matrix. From a system theoretic point <strong>of</strong> view we<br />

could consider the function Ψ as a (nonlinear) control<br />

plant <strong>of</strong> MISO (multiple input single output) type.<br />

A. Classical Deterministic Methods Reviewed<br />

Classical deterministic optimization methods like the<br />

already mentioned steepest descent method (see equation<br />

(3)) take use <strong>of</strong> local gradient or curvature information<br />

<strong>of</strong> the function Ψ. While the steepest descent algorithm<br />

just takes use <strong>of</strong> the gradient information the well known<br />

Gauss-Newton (GN) method defined by<br />

xn+1 = xn + sG −1<br />

k gk, (7)<br />

takes use <strong>of</strong> the Hessian G matrix which provides curvature<br />

information about Ψ to improve the convergence<br />

behavior. In both schemes, the steepest descent method<br />

and the GN method, the system matrix F isgivenbythe<br />

identity matrix I. For objective functions Ψ <strong>of</strong> form (6),<br />

the practical realization <strong>of</strong> the GN method is given by<br />

xn+1 = xn − s(JJ T ) −1 Jr (8)<br />

where J is the Jacobian <strong>of</strong> the system H with respect<br />

to the state vector x. Herebyr =(y − y d) defines the<br />

residual vector <strong>of</strong> the output <strong>of</strong> F with respect to y d .<br />

The gradient g = ∇xΨ <strong>of</strong> the objective function (6) with<br />

respect to x (to keep the notation short we set the matrix<br />

W to be the identity matrix I) isgivenasg = J(y−y d)<br />

and (JJ T ) approximates the Hessian G [1].<br />

<br />

JJ T −1<br />

z −1<br />

I<br />

−sJ<br />

H k<br />

Fig. 2. State space representation <strong>of</strong> a second order scheme.<br />

y k<br />

−y d<br />

Figure 2 depicts the GN scheme as a control system for<br />

the objective function (plant) Ψ following equation (8).<br />

For the steepest descent method the matrix B is replaced<br />

by the identity matrix I. Note, that all matrices depend


on the iteration index k. A control system with this<br />

property is referred to as a time varying control system.<br />

We observe that neither the steepest descent algorithm<br />

nor the GN method are state space control systems as<br />

these methods do not take use <strong>of</strong> the state vector x.<br />

However, we can observe a closed loop scheme in figure<br />

2. It is hard to argue whether we see the steepest descent<br />

algorithm as a drive system or as a closed loop control<br />

system as for B = −g and s replacing the input u<br />

(scalar) no closed loop is required. However, with respect<br />

to the different input matrix B the powerfulness <strong>of</strong> the<br />

GN-method becomes clear from a system theoretic point<br />

<strong>of</strong> view.<br />

B. Stochastic Methods Reviewed<br />

With the availability <strong>of</strong> more and more computational<br />

power stochastic optimization methods have become<br />

<strong>of</strong> increased interest for many practical problems.<br />

Interesting issues for the application <strong>of</strong> stochastic<br />

methods is their ability to overcome local minima, and<br />

the not given necessity for derivative information. This<br />

is <strong>of</strong> concern for not differentiable or not continuous<br />

problems<br />

In contrast to deterministic methods, stochastic<br />

methods most <strong>of</strong>ten rely on a set <strong>of</strong> N individual vectors<br />

x N which explore the objective function on their own.<br />

Over the time a mutual exchange <strong>of</strong> information from<br />

the the different realizations x N is performed which<br />

mixes the individuals. Concepts about the individual<br />

exploration <strong>of</strong> each individual on Ψ as well as the<br />

exchange <strong>of</strong> mutual information between the individuals<br />

is <strong>of</strong>ten based on concepts <strong>of</strong> nature like evolution<br />

principles resulting in the class <strong>of</strong> genetic algorithms<br />

(GA). I.e. certain elements <strong>of</strong> two arbitrarily selected<br />

vectors xi and xj are exchanged, replaced by a weighted<br />

mean or just individually disturbed by a random variable.<br />

A contrastable aspect with respect to the behavior <strong>of</strong><br />

deterministic methods is the fact, that the combining<br />

principles do not automatically remove the weakest<br />

individual (the realization with the highest value <strong>of</strong> Ψ).<br />

Instead also the strongest individual can be removed by<br />

some random procedure. Exactly this property enables<br />

the behavior that stochastic methods can overcome local<br />

minima. Other well known strategies for stochastic<br />

optimization are particle swarm optimization (PSO),<br />

nitching evolution techniques or differential evolution<br />

(DE) [5]. Also the behavior <strong>of</strong> an ant colony or bacteria<br />

in a nutrient solution [6] have been used as strategies to<br />

find a solution minimizing Ψ.<br />

The enormous variety <strong>of</strong> differently labeled stochastic<br />

algorithms [7] makes it <strong>of</strong>ten hard to distinguish the<br />

differences between. More important it is hard to<br />

charge the efficiency <strong>of</strong> the different methods and their<br />

suitability for different applications. In the following we<br />

will provide an approach to present several aspects <strong>of</strong><br />

- 69 - 15th IGTE Symposium 2012<br />

stochastic optimization within the unified framework <strong>of</strong><br />

state space techniques.<br />

In state space models randomness has the unified<br />

entrance into the system formulated by the process noise<br />

w. By setting the deterministic input vector u to zero the<br />

resulting system becomes an autonomous system. While<br />

different stochastic optimization strategies are originated<br />

by more or less random inspiritments, system theory<br />

takes use <strong>of</strong> probabilistic methods to describe the behavior<br />

in concern with randomness. Hereby any random<br />

process is described by a probability density function<br />

(pdf) denoted by π(·). The mathematical framework used<br />

to describe stochastic behavior is based on Bayes law<br />

π(x|y) = π(y|x)π(x)<br />

∝ π(y|x)π(x), (9)<br />

π(y)<br />

where π(y|x) is referred to as the likelihood function<br />

and π(x) is referred to as prior. The evidence π(y) has<br />

the role <strong>of</strong> a normalization constant and can be skipped<br />

leading to the right hand side formula <strong>of</strong> the posterior<br />

distribution π(x|y). The likelihood function provides a<br />

probability measure for x originating a certain output y.<br />

The prior π(x) gives a probability statement about x.<br />

We can already link this concepts to the optimization<br />

problem given by equation (1) and the constraints<br />

formulated in equation (2), as the likelihood function<br />

obviously is able to express C(x) by becoming zero for<br />

infeasible solutions. However, this concept also enables<br />

the possibility <strong>of</strong> a continuous measure for the state x,<br />

i.e. we can incorporate ”gray regions” for the solution.<br />

The understanding <strong>of</strong> the likelihood is maybe not that<br />

obvious. For easier explanation we write the likelihood<br />

corresponding to the objective function (6) as<br />

<br />

π(yd |x) ∝ exp − (Hk(x) − yd ) T <br />

W (H k(x) − yd ) .<br />

(10)<br />

The likelihood function is the exponential <strong>of</strong> the negative<br />

objective function but it states Ψ as a probability measure.<br />

It has to be noted that 0 < <br />

N<br />

exp(−Ψ(x))dx < ∞<br />

has to hold in the Lebesgue sense, to form a likelihood<br />

function from an objective function. Such a formulation<br />

is known from simulated annealing (SA). Hereby a<br />

stochastic algorithm seeks for the modes (maxima) <strong>of</strong><br />

the function exp(−Ψ(x)/T ), where T is an artificial<br />

temperature which decreases over time. Note, that due<br />

to the temperature T , SA is different with respect to<br />

Bayesian inference as the likelihood has a physical<br />

meaning where no term like T occurs. Given all these<br />

aspects stochastic optimization can be fully seen in the<br />

context <strong>of</strong> state estimation and we can work out some<br />

conceptual ideas that are used in state estimation in the<br />

next section.<br />

The exchange <strong>of</strong> mutual information depends on a so<br />

called resampling scheme which stays outside the state<br />

space model. While different stochastic methods have<br />

brought up a variety <strong>of</strong> exchange schemes also state


estimation methods have brought up unified methods like<br />

residual, stratified, or systematic resampling, etc. [8]. We<br />

will not focus on these aspects <strong>of</strong> stochastic optimization<br />

methods at this point, but we will provide a description<br />

about the stochastic diffusion <strong>of</strong> states in the state space<br />

view <strong>of</strong> optimization.<br />

While deterministic methods select the state update<br />

from gradient or curvature information in order to decrease<br />

Ψ and thus follow strictly deterministic rules,<br />

the probabilistic change is summarized by means <strong>of</strong><br />

pdf’s π(·). State space theorists have developed the<br />

ChapmanKolmogorov equation<br />

<br />

π(xk|yd )= π(xk|xk−1)π(xk−1|yd )dxk−1, (11)<br />

R N<br />

to provide a probabilistic measure about the state evolution<br />

given the current state and its posterior. While<br />

equation (11) is not hard to derive using the mathematical<br />

tool <strong>of</strong> marginalization, it provides two interesting insides<br />

about the update in stochastic optimization methods.<br />

• The state update is described by π(xk|xk−1) and<br />

does not depend on the current value <strong>of</strong> the objective<br />

function.<br />

• The update probability depends on π(xk−1|y d ),but<br />

there is no guarantee that xk−1 will be changed.<br />

The transition kernel π(xk|xk−1) describes the probability<br />

<strong>of</strong> the state exchange from state xk−1 to the state xk.<br />

A remarkable point about this formulation is the fact, that<br />

the update is independent from the current value <strong>of</strong> Ψ or<br />

the posterior. This is an important fact that explains the<br />

powerfulness <strong>of</strong> stochastic methods. If the kernel would<br />

depend on Ψ, stochastic methods would end with the<br />

same stalling behavior in local minima as deterministic<br />

methods do, as then a deterministic drift is present.<br />

In most cases the kernel π(xk|xk−1) is even reduced<br />

to π(xk). The pdf π(xk−1|y d) in equation (11) induces<br />

another important principle in stochastic optimization<br />

which can be directly connected to the mutual information<br />

exchange. It states, that the update <strong>of</strong> the state due<br />

to the proposal kernel is not guaranteed. Instead we can<br />

see the result π(xk|y d) only provides a relative number<br />

for the new state π(xk) to be accepted.<br />

IV. STATE SPACE METHODS FOR OPTIMIZATION<br />

In this section we want to discuss some more state<br />

space concepts and their use for stochastic optimization.<br />

We have selected these methods as we see them<br />

to be important with nowadays needs. State estimation<br />

techniques are among the algorithms which have seen<br />

one <strong>of</strong> the strongest developments in the past decades.<br />

The early origin was given by the Apollo space flight<br />

programm in the 1960’s where the Kalman filter has<br />

seen it’s breakthrough. Since then both, single point<br />

and population-based methods have been developed, to<br />

regain knowledge from the hidden states <strong>of</strong> a system<br />

given the actually observed function values for an optimal<br />

designed objective function in order to recover x from<br />

- 70 - 15th IGTE Symposium 2012<br />

noisy observations. In this sense we first have to discuss<br />

the meaning <strong>of</strong> the likelihood function π(x|y d) in more<br />

detail. Following the definition <strong>of</strong> a multivariate Gaussian<br />

random variable y<br />

y ∝ exp −(y − μ) T Σ −1 (y − μ) , (12)<br />

where μ expresses the mean and Σ is the covariance<br />

matrix, we observe, that the likelihood function has the<br />

mean <strong>of</strong> a Gaussian distribution expressing uncertainty<br />

about y. In this sense the measurement noise v becomes<br />

relevant for a first as the likelihood function expressed<br />

this noise in terms <strong>of</strong> a probability measure. This will<br />

lead us directly to the aspects brought in the following<br />

subsection.<br />

The consequent use <strong>of</strong> this approach brought up powerful<br />

stochastic state estimation algorithms like state observers,<br />

sequential Monte Carlo methods like the already<br />

mentioned Kalman filter or Particle filters, or even more<br />

powerful Markov chain Monte Carlo (MCMC) methods.<br />

A. Enhanced Error Model<br />

A matter <strong>of</strong> concern with the solution <strong>of</strong> physical<br />

motivated optimization problems are the computational<br />

costs in concern with the evaluation <strong>of</strong> the objective<br />

function Ψ. This especially holds if the underlying problem<br />

requires the solution <strong>of</strong> partial differential equations<br />

(PDE’s) which has to be done by numerical methods<br />

like the finite element method (FEM). Recently the<br />

use <strong>of</strong> approximation techniques has become popular in<br />

both, state estimation and optimization [9], [10]. Hereby<br />

the computational costly evaluation <strong>of</strong> H k is replaces<br />

by a cheap approximation or surrogate function H ∗ k.<br />

Subsequently this leads to the cost function Ψ∗ due to<br />

the approximation error<br />

e = H ∗ k − H k. (13)<br />

We can reformulate the relation between H k and H ∗ k to<br />

H ∗ k = Hk +(H ∗ k − H k) =Hk + e. (14)<br />

This is an interesting formulation as we can look on<br />

the approximation error e as an additive error similar to<br />

the measurement noise v depicted in figure 1. Although<br />

the approximation error e depends on the state x, and<br />

thus is a deterministic error, we can think about a<br />

probabilistic description about e in the concept <strong>of</strong> a<br />

Gaussian distribution. This is an approach <strong>of</strong>ten taken<br />

in several fields <strong>of</strong> state estimation and system theory.<br />

It ends up exactly in the idea covered by the so called<br />

enhanced error model [11]. Although the approximation<br />

error e is <strong>of</strong> deterministic nature a probabilistic model is<br />

built from samples about the state space R N . Then the<br />

likelihood function π ∗ (y d|x) becomes<br />

π ∗ (y d|x) ∝ exp −(y ∗ − y d + μ e) T Σ −1<br />

e (y ∗ − y d + μ e) ,<br />

(15)<br />

and the optimization can be performed on this<br />

computational less costly function. Given the degree


<strong>of</strong> accuracy <strong>of</strong> the approximation H ∗ k the solution can<br />

be seen as good as a solution obtained by Hk, orthe<br />

approximation approach can be used to find a good<br />

initial solution which can be refined in less optimization<br />

steps using the accurate model.<br />

In general the determination <strong>of</strong> the mean μ e and the<br />

covariance matrix Σe requires a large number <strong>of</strong> samples.<br />

However, during the setup and model testing phase for the<br />

optimization problem typically enough data is generated<br />

to describe e in the presented way.<br />

B. Hybrid Schemes<br />

Another useful aspect about the use <strong>of</strong> state space<br />

schemes for optimization is the natural possibility to incorporate<br />

both, deterministic and stochastic methods for<br />

building hybrid optimization schemes. This can be easily<br />

done by enabling the input vector uk and building an<br />

outer feedback system as discussed in subsection III-A.<br />

The natural representation <strong>of</strong> the interaction between<br />

the deterministic drift and the stochastic interaction is<br />

therefore <strong>of</strong> interest, as it illustrates the powerfulness<br />

<strong>of</strong> the combination. I.e. if only some elements <strong>of</strong> the<br />

gradient g are available because the function Ψ is not<br />

steady with respect to this variables, we can only use<br />

the available gradient information for the input vector<br />

u. The other components <strong>of</strong> x are updated by the<br />

stochastic algorithm. In addition, an outer resampling<br />

scheme retains the property <strong>of</strong> a stochastic optimization<br />

scheme to overcome local minima.<br />

C. Robust Schemes<br />

Uncertainty is in many aspects a concerning topic in<br />

state estimation. This is given due to the fact, that models<br />

<strong>of</strong>ten do not cover all physical aspects due to reduction.<br />

Also optimization engineers have developed robust target<br />

functions in order to find solutions insensitive with<br />

respect to parameter variations <strong>of</strong> x [3]. Such robust<br />

objective functions are typically <strong>of</strong> form<br />

min max Ψ(x, ξ), (16)<br />

x ξ<br />

where ξ describes an immanent given uncertainty in the<br />

parameters. Most <strong>of</strong>ten the absolute value <strong>of</strong> ξ is limited.<br />

Robust state estimation has brought up the H∞ concept<br />

[4], where the estimation error e = x − ˆx is minimized<br />

using an approach <strong>of</strong> form<br />

min max J (x, ˆx, v, w). (17)<br />

ˆx v,w<br />

Hereby no limitations about the process noise w and<br />

the measurement noise v are assumed. The H∞ filter<br />

seeks for the best estimate under worst case conditions.<br />

Mostly game theoretic approaches are used to formulate<br />

the function J . We pointed this out, as control scientist<br />

have gained a lot <strong>of</strong> experience in the field and there<br />

might be useful aspects for optimization.<br />

- 71 - 15th IGTE Symposium 2012<br />

V. A NUMERICAL EXAMPLE<br />

To provide a numerical example about the presented<br />

considerations <strong>of</strong> state space methods for optimization<br />

we want to present a simple optimization problem<br />

consisting <strong>of</strong> an inverse problem for a resistor network<br />

example. Figure 3(a) depicts the resistor network under<br />

investigation. The black lines illustrate resistors with<br />

a value <strong>of</strong> R1 = 1Ω. The gray colored lines mark a<br />

circular disk <strong>of</strong> radius r where resistors with a value <strong>of</strong><br />

R2 are placed. Hereby the mapping between the circle<br />

radius and the resistor values is discontinuous by the<br />

way, that the resistor has to be fully placed inside the<br />

circle. It is now aim to find the radius r <strong>of</strong> the circle<br />

and the resistor value R2 from some electrical boundary<br />

measurements. These measurement built the vector yd. A problem <strong>of</strong> this kind is a classical inverse problem<br />

where we aim on the determination <strong>of</strong> the state vector<br />

x = T r R2 from measurements yd.<br />

Figure 3 exemplary depicts a part <strong>of</strong> the cost function.<br />

Hereby the R2 and r were set to R2 =0.5Ω and r =<br />

0.5m. The corner length was set to r =1mand was<br />

discretized by 40 resistors. A current is injected at the<br />

upper left corner and 5 equidistant measurement points<br />

(ampere meters) are connected to the lower edge.<br />

(a) Resistor network.<br />

0.7<br />

0.6<br />

0<br />

x 10<br />

8<br />

7<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

−4<br />

Fig. 3. Test example and objective function.<br />

Ψ<br />

0.5<br />

0.4<br />

r (m)<br />

0.55<br />

0.5<br />

0.45<br />

0.3<br />

0.2 0.4<br />

R (Ω)<br />

2<br />

(b) Objective function Ψ.<br />

We now want to apply a hybrid optimization approach<br />

where the corner points about the algorithm can be stated<br />

by the following:<br />

• We use a population based scheme.<br />

• We use gradient information about the resistor<br />

value R2.<br />

• We work on a reduced model (only half the number<br />

<strong>of</strong> resistor elements per edge).<br />

In state estimation such an algorithm belongs to the class<br />

<strong>of</strong> sequential Monte Carlo (SMC) methods and is mostly<br />

referred to as Particle filter (PF) [12].<br />

Arguable one <strong>of</strong> the most interesting points in this list<br />

is the use <strong>of</strong> a reduced model to solve the optimization<br />

problem. Figure 4(a) depicts the objective function (6)<br />

(W was set to be the identity matrix) when using the<br />

reduced model for solving the optimization problem with<br />

data from the fine model. One can obtain, that the depicted<br />

part <strong>of</strong> the objective function does not even include<br />

a minima. Figure 4(b) depicts the likelihood <strong>of</strong> form (15),<br />

using an enhanced error model. As we can see, the point


where the likelihood function has its maxima presents<br />

the true solution. Thus, if our optimization algorithm is<br />

designed to minimize the corresponding objective function<br />

is should be possible to find the solution although<br />

working on the reduced model. Figure 5 depicts the<br />

Ψ *<br />

0.2<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

r (m)<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

R (Ω)<br />

2<br />

(a) Objective function Ψ ∗ .<br />

π(r,R 2 )<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

r (m)<br />

0.3<br />

0.4<br />

R 2 (Ω)<br />

(b) Posteriori probability π ∗ (r, R2).<br />

Fig. 4. Determination <strong>of</strong> r and R2 using a reduced model.<br />

behavior and the result <strong>of</strong> the proposed hybrid scheme<br />

for the given problem using the reduced model for the<br />

solution. Figure 5(a) depicts the state <strong>of</strong> the population.<br />

As can be seen, the population is clustered around the<br />

correct solution. Hereby the background color depicts<br />

the objective function for the fine model but as stated<br />

the coarse model is used! Figure 5(b) and figure 5(c)<br />

depict the decrease <strong>of</strong> the objective function and the<br />

increase <strong>of</strong> the likelihood function, respectively. The dots<br />

illustread the spread <strong>of</strong> the population. As can be seen<br />

both, the likelihood function and the objective function<br />

can become smaller or larger, respectively. Thus, the<br />

property <strong>of</strong> stochastic methods is given.<br />

Ψ(r,R 2 )<br />

0.035<br />

0.03<br />

0.025<br />

0.02<br />

0.015<br />

0.01<br />

0.005<br />

r (m)<br />

0.7<br />

0.65<br />

0.6<br />

0.55<br />

0.5<br />

0.45<br />

0.4<br />

0.35<br />

0<br />

1 2 3 4 5 6<br />

Iteration<br />

7 8 9 10<br />

(b) Objective function.<br />

0.3<br />

0.3 0.4 0.5 0.6 0.7<br />

R (Ω)<br />

2<br />

(a) Particles.<br />

Fig. 5. Output <strong>of</strong> the particle filter.<br />

π(r,R 2 )<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

1 2 3 4 5 6<br />

Iteration<br />

7 8 9 10<br />

(c) Posteriori probability.<br />

VI. OUTLOOK<br />

In this paper we demonstrated a state space view on<br />

optimization algorithms. Both, deterministic and stochastic<br />

methods were exploited in the content <strong>of</strong> the unified<br />

state space representation. We demonstrated that<br />

0.5<br />

0.6<br />

0.7<br />

- 72 - 15th IGTE Symposium 2012<br />

deterministic approaches can be considered as standard<br />

feedback systems, whereas stochastic methods can be<br />

directly linked to state estimation. We explored features<br />

<strong>of</strong> stochastic state estimation using Bayes law and subsequently<br />

demonstrated the usefulness <strong>of</strong> state estimation<br />

techniques for optimization. All our considerations are<br />

summarized in a hybrid optimization algorithm working<br />

on a reduced model where we demonstrated the natural<br />

interaction <strong>of</strong> deterministic and stochastic methods using<br />

state space descriptions. Further research will focus in<br />

two directions. First, we would extend the presented<br />

hybrid scheme using some more sophisticated methods.<br />

Second we consider work on a formal description <strong>of</strong><br />

different stochastic algorithms using methods from probability<br />

theory.<br />

REFERENCES<br />

[1] R. Fletcher, Practical Methods <strong>of</strong> Optimization; (2nd Ed.), Wiley-<br />

Interscience, New York, USA, 1987.<br />

[2] L. dos Santos Coelho and P. Alotto, Multiobjective Electromagnetic<br />

Optimization Based on a Nondominated Sorting Genetic Approach<br />

With a Chaotic Crossover Operator, IEEE Transactions on Magnetics,<br />

vol.44, no.6, pp.1078-1081, 2008.<br />

[3] P. Alotto, C. Magele, W. Renhart, A. Weber, G. Steiner Robust<br />

target functions in electromagnetic design, COMPEL: The International<br />

Journal for Computation and Mathematics in Electrical<br />

and Electronic Engineering, Vol. 22 Iss: 3, pp.549 - 560, 2003.<br />

[4] D. Simon, Optimal state estimation, Kalman, H∞ and nonlinear<br />

approaches, Wiley - Interscience, John Wiley & Sons, Inc., New<br />

Jersey, 2006.<br />

[5] R. Storn and K. Price, Differential evolution - a simple and efficient<br />

heuristic for global optimization over continuous spaces, Journal<br />

<strong>of</strong> Global Optimization 11: pp.341-359, 1997.<br />

[6] L. dos Santos Coelho, C. da Costa Silveira, C.A. Sierakowski, and<br />

P. Alotto, Improved Bacterial Foraging Strategy Applied to TEAM<br />

Workshop Benchmark Problem, IEEE Transactions on Magnetics,<br />

vol.46, no.8, pp.2903-2906, Aug. 2010.<br />

[7] O. Hajji, S. Brisset, and P. Brochet, Comparing stochastic optimization<br />

methods used in electrical engineering, Systems, Man<br />

and Cybernetics, 2002 IEEE International Conference on , vol.7,<br />

no., pp. 6 pp. vol.7, 6-9 Oct. 2002.<br />

[8] R. Douc, O. Cappe, and E. MoulinesComparison <strong>of</strong> resampling<br />

schemes for particle filtering, In 4th International Symposium on<br />

Image and Signal Processing and Analysis (ISPA), pp.64-69, 2005.<br />

[9] Albunni, M.N.; Rischmuller, V.; Fritzsche, T.; Lohmann, B.; ,<br />

Multiobjective Optimization <strong>of</strong> the Design <strong>of</strong> Nonlinear Electromagnetic<br />

Systems Using Parametric Reduced Order Models, IEEE<br />

Transactions on Magnetics, vol.45, no.3, pp.1474-1477, March<br />

2009.<br />

[10] A. I. Forrester, A. Sóbester and A. J. Keane, Engineering Design<br />

via Surrogate Modelling A Practical Guide, Wiley, 2008.<br />

[11] J. P. Kaipio and E. Somersalo, Statistical and computational<br />

inverse problems, New York: Applied Mathematical Sciences,<br />

Springer, 2004.<br />

[12] M.S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp,<br />

A tutorial on particle filters for online nonlinear/non-Gaussian<br />

Bayesian tracking IEEE Transactions on Signal Processing 50 (2),<br />

pp.174188, 2002.


- 73 - 15th IGTE Symposium 2012<br />

Quasi TEM Analysis <strong>of</strong> 2D Symmetrically Coupled<br />

Strip Lines with Finite Grounded Plane using HBEM<br />

*Saša S. Ilić, *Mirjana T. Perić, *Slavoljub R. Aleksić and *Nebojša B. Raičević<br />

*<strong>University</strong> <strong>of</strong> Niš, Faculty <strong>of</strong> Electronic Engineering <strong>of</strong> Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia<br />

E-mail: sasa.ilic@elfak.ni.ac.rs<br />

Abstract—The hybrid boundary element method (HBEM), based on combination between equivalent electrodes method<br />

(EEM) and boundary element method (BEM), is applied for characteristic parameters determination <strong>of</strong> symmetrically coupled<br />

strip lines with a finite width grounded plane. Even and odd modes are considered in the paper. All results for the characteristic<br />

impedance and the effective dielectric permittivity are compared with the finite element method (FEM).<br />

Index Terms—Characteristic impedance, Equivalent Electrodes Method (EEM), Finite Element Method (FEM), Hybrid<br />

Boundary Element Method (HBEM).<br />

strip lines parameters, when the strip line is above an infinite-width<br />

grounded plane [14]. The HBEM can be also<br />

applied to analysis <strong>of</strong> corona effects [15] and metamaterial<br />

structures [16]. A problem <strong>of</strong> symmetrically<br />

coupled strip line placed above infinite grounded plane is<br />

investigated in [17].<br />

The HBEM is applied, in this paper, to calculate the<br />

even- and odd- mode characteristic impedance <strong>of</strong> 2D<br />

symmetrically coupled strip lines with finite grounded<br />

plane, shown in Fig. 1. The quasi TEM analysis is used in<br />

this paper.<br />

I. INTRODUCTION<br />

Over the years, many authors have analyzed<br />

symmetrically and asymmetrically coupled or ordinary<br />

strip lines with width-limited dielectric substrate using<br />

numerous numerical and analytical methods [1]-[9]. The<br />

variational method [1], the Garlekin’s method, the method<br />

<strong>of</strong> moments [2]-[4], the boundary element method [5], the<br />

conformal mapping, the moving perfect electric wall<br />

method [6]-[9] etc. are some <strong>of</strong> the commonly used<br />

methods. On the other side, the problem <strong>of</strong> the widthlimited<br />

microstrip grounded plane has not been so <strong>of</strong>ten<br />

investigated, although these forms <strong>of</strong> microstrips are<br />

typical in practice. In [7]-[9] the microstrip line with<br />

finite-width dielectric and grounded plane was analyzed.<br />

A so-called moving perfect electric wall method (MPEW)<br />

in conjunction with the conformal mapping method<br />

(CMM) was applied in those papers.<br />

An application <strong>of</strong> boundary element method (BEM)<br />

usually contains singular and nearly singular integrals<br />

whose evaluation is difficult although original problems<br />

are not singular. In order to avoid numerical integrations,<br />

it is possible to substitute small boundary segments by<br />

total charges placed at their centres. The Green’s function<br />

for the electric scalar potential <strong>of</strong> the charges, placed in<br />

the free space at the boundary <strong>of</strong> two dielectrics, is used<br />

and the proposed method is called the hybrid boundary<br />

element method (HBEM) [10-17].<br />

This method presents a combination <strong>of</strong> BEM and<br />

equivalent electrodes method (EEM). The basic idea is in<br />

replacing an arbitrary shaped electrode by equivalent<br />

electrodes (EEs), and an arbitrary shaped boundary<br />

surface between any two dielectric layers by discrete<br />

equivalent total charges per unit length placed in the air.<br />

The basic Green’s function for the electric scalar potential<br />

<strong>of</strong> the charges placed in the free space at the boundary<br />

surface <strong>of</strong> two dielectrics is used. The method is based on<br />

the EEM, on the point-matching method (PMM) for the<br />

potential <strong>of</strong> the perfect electric conductor (PEC)<br />

electrodes and for the normal component <strong>of</strong> the electric<br />

field at the boundary surface between any two dielectric<br />

layers.<br />

The HBEM is applied, until now, to solving<br />

multilayered electromagnetic problems [10], grounding<br />

systems [12], electromagnetic field determination in vicinity<br />

<strong>of</strong> cable terminations [13], as well as to calculation <strong>of</strong><br />

Figure 1: Symmetrically coupled strip line with finite grounded plane.<br />

Symmetrically coupled strip lines can be used as basic<br />

elements for filters, phase shifters, directional couplers,<br />

baluns and combiners [18].<br />

II. THEORETICAL BACKGROUND<br />

The HBEM is applied and corresponding model is formed,<br />

Fig. 2.<br />

Figure 2: Corresponding HBEM model.


Using the existing symmetry, the electric scalar potential<br />

<strong>of</strong> whole system from Fig. 2 is determined:<br />

(e (e, o)<br />

2<br />

B ln l<br />

4<br />

B ln l<br />

3<br />

B ln l<br />

Ki<br />

A<br />

i 1 k 1<br />

( x<br />

Ki<br />

i 3 k 1<br />

( x<br />

Mi<br />

i 1 m 1<br />

( x<br />

q<br />

0<br />

d<br />

ik<br />

2<br />

q<br />

2<br />

2<br />

dik<br />

a<br />

ik<br />

q<br />

x<br />

x<br />

0<br />

a ik<br />

t<br />

im<br />

x<br />

0<br />

t im<br />

)<br />

)<br />

ln<br />

)<br />

2<br />

2<br />

2<br />

ln<br />

ln<br />

( x<br />

( y<br />

( x<br />

( y<br />

( y<br />

( x<br />

x<br />

y<br />

dik<br />

y<br />

y<br />

dik<br />

x<br />

a ik<br />

)<br />

t im<br />

)<br />

a ik<br />

x<br />

)<br />

)<br />

2<br />

2<br />

)<br />

2<br />

t im<br />

)<br />

2<br />

2<br />

2<br />

,<br />

( y<br />

( y<br />

( y<br />

y<br />

dik<br />

y<br />

)<br />

a ik<br />

y<br />

2<br />

)<br />

t im<br />

where the coefficients A and B have following values:<br />

0,<br />

odd(o)<br />

mode;<br />

A<br />

1,<br />

even(e)<br />

mode.<br />

B<br />

1,<br />

odd(o)<br />

mode;<br />

1,<br />

even(e)<br />

mode .<br />

The electric field is E grad( g ( ) . The total number <strong>of</strong><br />

unknowns N tot , will be denoted by:<br />

4<br />

3<br />

N K M A .<br />

tot<br />

i<br />

i 1 i 1<br />

A relation between the normal component <strong>of</strong> the<br />

electric field and the total surface charges is given with<br />

Eq. (2):<br />

n ˆi ( 0<br />

Eim<br />

)<br />

0<br />

( 0<br />

)<br />

t<br />

im i , t im i<br />

q<br />

t im i<br />

,<br />

l im<br />

(2)<br />

where i M m , , 1 , 3 , 2 , i 1 , nˆ i ( nˆ 1 nnˆ<br />

ˆ2<br />

yyˆ<br />

ˆ , nˆ 3 xxˆ<br />

ˆ )<br />

are unit normal vectors oriented from the layer<br />

the layer 0 .<br />

towards<br />

Using the PMM for the potential <strong>of</strong> the perfect<br />

conductors given by (1), the PMM for the normal<br />

component <strong>of</strong> the electric field (2), and the electrical<br />

neutrality condition (3) (only for the even mode!), it is<br />

possible to determine unknown free charges per unit<br />

length on conductors, the total charges per unit length on<br />

the boundary surfaces between two dielectric layers and<br />

the unknown constant 0 .<br />

The electrical neutrality condition is:<br />

2<br />

Ki<br />

4 Ki<br />

q d dik<br />

qa<br />

a ik 0 0.<br />

(3)<br />

i 1 k 1 i 3 k 1<br />

After solving the system <strong>of</strong> linear equations, it is<br />

possible to calculate the capacitance per unit length <strong>of</strong> the<br />

i<br />

2<br />

)<br />

2<br />

(1)<br />

- 74 - 15th IGTE Symposium 2012<br />

strip line given by (4):<br />

K1<br />

K3<br />

(e, o) 1<br />

C qd1k<br />

qa<br />

3k<br />

. (4)<br />

U<br />

k 1 k 1<br />

With the developed program code, the characteristic<br />

impedance <strong>of</strong> the symmetrically coupled strip line is<br />

calculated as<br />

(e, o) (e, o) eff ef (e, o)<br />

Z c Zc<br />

0 / r ,<br />

where<br />

ef eff (e, o) ( (e, o) ( (e, o)<br />

r<br />

C<br />

/C 0<br />

(e, o)<br />

is the effective dielectric permittivity, and Z c0<br />

is the<br />

characteristic impedance <strong>of</strong> the symmetrically coupled<br />

strip line without dielectric layer (free space), for even (e)<br />

and odd (o) modes, respectively.<br />

In order to verify the obtained numerical results for the<br />

characteristic impedance and the effective dielectric<br />

permittivity, the finite element method (FEM) [19] is<br />

used.<br />

III. RESULTS<br />

The results convergence and computation time for the<br />

even and odd modes can be noticed from Table I, for<br />

parameters: r 3 , d / w1<br />

4 , h / d 0 0.<br />

5 , t 1 / w1<br />

0<br />

. 1 ,<br />

s / w1<br />

1<br />

. 0 , w 2 / w1<br />

6 6.<br />

0 and t 2 / t1<br />

2<br />

. 0 , where N tot<br />

is the total number <strong>of</strong> unknowns.<br />

N tot<br />

TABLE I<br />

CONVERGENCE OF RESULTS AND CPU TIME<br />

Even mode Odd mode<br />

eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ]<br />

t(s)<br />

298 2.119 158.860 1.817 77.293 4.4<br />

370 2.120 158.871 1.823 77.207 6.9<br />

444 2.121 158.901 1.827 77.162 9.7<br />

585 2.123 158.906 1.832 77.093 16.7<br />

655 2.123 158.917 1.833 77.077 20.9<br />

726 2.124 158.905 1.835 77.048 25.7<br />

800 2.124 158.916 1.836 77.039 31.5<br />

872 2.124 158.916 1.837 77.026 38.8<br />

940 2.125 158.905 1.838 77.007 45.4<br />

1014 2.125 158.914 1.839 77.003 51.4<br />

1085 2.125 158.904 1.840 76.987 58.1<br />

1155 2.125 158.911 1.840 76.986 65.4<br />

1225 2.125 158.917 1.840 76.984 74.1<br />

1296 2.126 158.909 1.841 76.973 86.5<br />

1370 2.125 158.915 1.841 76.972 97.6<br />

First, a very good convergence <strong>of</strong> values <strong>of</strong> both<br />

parameters is achieved for the both modes. Second, a<br />

computation time was much shorter comparing to the time<br />

required by FEM: we needed up to 97.6 seconds for the<br />

system <strong>of</strong> 1370 unknowns, while FEM for solving the<br />

same problem took about 15 minutes with a few hundreds<br />

<strong>of</strong> thousands <strong>of</strong> finite elements.<br />

Equipotential contours and distributions <strong>of</strong> polarized


charges per unit length along boundary surface are shown<br />

in Figs. 3-6 (even and odd modes, respectively) for para-<br />

meters:<br />

r 3 , d / w1<br />

4 , h / d 0 0.<br />

5,<br />

t 1 / w1<br />

0 0.<br />

1,<br />

s / w1<br />

1<br />

. 0 , w 2 / w1<br />

6<br />

. 0 and t 2 / t1<br />

2<br />

. 0 .<br />

Figure 3: Equipotential contours (Even mode).<br />

Figure 4: Equipotential contours (Odd mode).<br />

t 2<br />

t1<br />

1<br />

2<br />

3<br />

4<br />

- 75 - 15th IGTE Symposium 2012<br />

Figure 5: Distribution <strong>of</strong> polarized charges per unit length along<br />

boundary surface (Even mode).<br />

Figure 6: Distribution <strong>of</strong> polarized charges per unit length along<br />

boundary surface (Odd mode).<br />

TABLE II<br />

COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 2 1 t t AND h d FOR PARAMETERS:<br />

r 3 , d / w1<br />

4 , t 1/<br />

w1<br />

0 0.<br />

05 , s / w1<br />

1<br />

. 0 AND w 2 / w1<br />

6 6.<br />

0 .<br />

h<br />

d<br />

Even mode Odd mode<br />

HBEM FEM HBEM FEM<br />

eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ]<br />

0.2 2.3967 84.950 2.3969 84.864 2.0470 64.218 2.0558 63.942<br />

0.4 2.2182 138.000 2.2185 137.829 1.9009 77.198 1.9120 76.805<br />

0.6 2.0856 182.116 2.0862 181.851 1.8642 81.055 1.8757 80.626<br />

0.8 1.9849 220.428 1.9863 220.066 1.8550 82.450 1.8676 81.983<br />

1.0 1.9059 254.371 1.9076 253.892 1.8533 83.020 1.8666 82.318<br />

0.2 2.3938 84.935 2.3965 84.783 2.0470 64.215 2.0555 63.948<br />

0.4 2.2144 137.901 2.2172 137.624 1.9008 77.192 1.9119 76.801<br />

0.6 2.0817 181.901 2.0844 181.526 1.8642 81.048 1.8757 80.618<br />

0.8 1.9810 220.107 1.9844 219.602 1.8543 82.457 1.8676 81.981<br />

1.0 1.9021 253.950 1.9056 253.368 1.8533 83.016 1.8670 82.545<br />

0.2 2.3923 84.907 2.3957 84.741 2.0469 64.213 2.0553 63.949<br />

0.4 2.2122 137.786 2.2156 137.492 1.9008 77.188 1.9119 76.794<br />

0.6 2.0793 181.675 2.0827 181.265 1.8641 81.043 1.8755 80.620<br />

0.8 1.9785 219.777 1.9826 219.241 1.8543 82.453 1.8676 81.978<br />

1.0 1.8997 253.521 1.9038 252.901 1.8532 83.013 1.8666 82.554<br />

0.2 2.3912 84.878 2.3942 84.739 2.0468 64.212 2.0557 63.939<br />

0.4 2.2105 137.677 2.2139 137.412 1.9007 77.184 1.9118 76.792<br />

0.6 2.0775 181.465 2.0809 181.083 1.8641 81.039 1.8756 80.600<br />

0.8 1.9766 219.468 1.9809 218.918 1.8543 82.449 1.8675 81.973<br />

1.0 1.8978 253.120 1.9021 252.483 1.8532 83.010 1.8666 82.548


- 76 - 15th IGTE Symposium 2012<br />

TABLE III<br />

COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 2 1 w w FOR PARAMETERS:<br />

r 3 , d / w1<br />

4 , t 1/<br />

w1<br />

0 0.<br />

05 , h / d 0<br />

. 5 , s / w1<br />

1<br />

. 0 AND t 2 / t1<br />

2<br />

. 0 .<br />

Even mode Odd mode<br />

w 2 HBEM FEM HBEM FEM<br />

w1<br />

eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ]<br />

4.5 2.2100 168.392 2.2120 168.080 1.8799 80.064 1.8915 79.655<br />

5.0 2.1823 165.314 2.1845 165.004 1.8784 79.885 1.8899 79.479<br />

5.5 2.1605 162.805 2.1630 162.490 1.8772 79.743 1.8888 79.344<br />

6.0 2.1430 160.759 2.1458 160.438 1.8749 79.656 1.8876 79.242<br />

8.0 2.0981 155.589 2.1026 155.229 1.8718 79.427 1.8858 78.982<br />

10.0 2.0753 152.991 2.0805 152.606 1.8670 79.352 1.8848 78.898<br />

15.0 2.0492 150.340 2.0575 149.885 1.8598 79.343 1.8598 79.343<br />

TABLE IV<br />

COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 1 1 w t FOR PARAMETERS:<br />

r 3 , d / w1<br />

4 , w 2 / w1<br />

6 6.<br />

0 , h / d 0 0.<br />

5 , s / w1<br />

1<br />

. 0 AND t 2 / t1<br />

2<br />

. 0 .<br />

Even mode Odd mode<br />

t1<br />

HBEM FEM HBEM FEM<br />

w1<br />

eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ] eff ef<br />

r<br />

Zc<br />

[ ]<br />

0.01 2.1768 162.282 2.1632 162.257 1.9031 82.453 1.9251 81.744<br />

0.02 2.1600 161.998 2.1583 161.804 1.8971 81.628 1.9147 81.052<br />

0.03 2.1526 161.595 2.1537 161.316 1.8899 80.913 1.9051 80.416<br />

0.04 2.1474 161.167 2.1496 160.857 1.8827 80.257 1.8963 79.801<br />

0.05 2.1430 160.759 2.1458 160.438 1.8749 79.656 1.8876 79.242<br />

0.06 2.1388 160.362 2.1422 160.021 1.8679 79.074 1.8800 78.670<br />

0.07 2.1352 159.976 2.1388 159.641 1.8610 78.519 1.8754 78.222<br />

0.08 2.1319 159.605 2.1356 159.271 1.8543 77.985 1.8645 77.640<br />

0.09 2.1288 159.250 2.1264 158.700 1.8447 77.470 1.8577 77.130<br />

0.10 2.1258 158.909 2.1293 158.592 1.8413 76.973 1.8507 76.655<br />

To validate the accuracy <strong>of</strong> the presented method, a<br />

comparison is made with the finite element method results<br />

obtained using the FEM s<strong>of</strong>tware [19]. Those results are<br />

shown in Tables II-IV.<br />

As presented in the tables, numerical results for the<br />

effective dielectric permittivity and the characteristic<br />

impedance obtained using the HBEM are obviously in<br />

very good agreement with the FEM values (with few<br />

hundreds <strong>of</strong> thousands finite elements) with divergence<br />

less than 0.4% for the most <strong>of</strong> the cases.<br />

Distributions <strong>of</strong> characteristic impedance versus s / w1<br />

for different values <strong>of</strong> dielectric permittivity r are<br />

shown in Figs. 7 and 8, for:<br />

d / w1<br />

4 , h / d 0 0.<br />

5,<br />

t 1 / w1<br />

0 0.<br />

1 , w 2 / w1<br />

6 6.<br />

0<br />

and t 2 / t1<br />

2 2.<br />

0 .<br />

Fig.7 shows that increasing the values <strong>of</strong> parameter<br />

s / w1,<br />

decreasing the characteristic impedance for even<br />

mode. But, for the odd mode, Fig. 8, increasing the<br />

parameter s / w1,<br />

increasing the characteristic impedance<br />

too. The lowest values for the characteristic impedance<br />

are obtained for the highest value <strong>of</strong> dielectric<br />

permittivity.<br />

The obtained values are compared with the FEM<br />

results, also. A very good results agreement is obtained.<br />

Figure 7: Distribution <strong>of</strong> characteristic impedance versus s / w1<br />

for<br />

different values <strong>of</strong> dielectric permittivity (Even mode).<br />

IV. CONCLUSION<br />

A newly developed hybrid boundary element method is<br />

applied to quasi TEM analysis <strong>of</strong> 2D symmetrically<br />

coupled strip lines with finite grounded plane. Two quasistatic<br />

parameters are calculated: effective dielectric<br />

permittivity and characteristic impedance <strong>of</strong> the line. We<br />

have compared the values <strong>of</strong> parameters with those<br />

obtained by the finite element method. A very good<br />

agreement <strong>of</strong> the results is achieved: maximal relative


error <strong>of</strong> the characteristic impedance is less than 0.4%.<br />

Figure 8: Distribution <strong>of</strong> characteristic impedance versus s / w1<br />

for<br />

different values <strong>of</strong> dielectric permittivity (Odd mode).<br />

All calculations were performed on computer with dual<br />

core INTEL processor 2.8 GHz and 4 GB <strong>of</strong> RAM.<br />

This method can be successfully applied to static,<br />

stationary and quasi-stationary electromagnetic fields, as<br />

well as to the analysis <strong>of</strong> the fields in mechanics, fluid<br />

dynamics, conductive heat flow etc.<br />

Acknowledgement<br />

This research was partially supported by funding from<br />

the Serbian Ministry <strong>of</strong> Education and Science in the<br />

frame <strong>of</strong> the project TR 33008.<br />

REFERENCES<br />

[1] T. Fukuda, T. Sugie, K. Wakino, Y.-D. Lin, and T. Kitazawa,<br />

“Variational method <strong>of</strong> coupled strip lines with an inclined<br />

dielectric substrate,” in Asia Pacific Microwave Conference –<br />

APMC 2009, December 7-10, 2009, pp. 866-869.<br />

[2] R. F. Harrington, Field computation by Moment Methods. New<br />

York: Macmillan, 1968.<br />

[3] T. G. Bryant and J. A. Weiss, “Parameters <strong>of</strong> microstrip<br />

transmission lines and <strong>of</strong> coupled pairs <strong>of</strong> microstrip lines,” IEEE<br />

Trans. Microwave Theory Tech., vol. MMT-16, pp. 1021-1027,<br />

Dec. 1968.<br />

[4] A. Farrar and A. T. Adams, “Characteristic impedance <strong>of</strong><br />

microstrip by the method <strong>of</strong> moments,” IEEE Trans. Microwave<br />

Theory Tech., vol. MMT-18, pp. 65-66, Jan. 1970.<br />

[5] K. Li, and Y. Fujii, “Indirect boundary element method <strong>of</strong> applied<br />

- 77 - 15th IGTE Symposium 2012<br />

to generalized microstrip line analysis with applications to sideproximity<br />

effect in MMICs,” IEEE Trans. Microwave Theory and<br />

Techniques, vol. 40, pp. 237–244, Feb. 1992.<br />

[6] C.E. Smith, and R.S. Chang, “Microstrip transmission line with<br />

finite width dielectric,” IEEE Trans. Microwave Theory and<br />

Techniques, vol. 28, pp. 90–94, Feb. 1980.<br />

[7] J. Svacina, “Analytical models <strong>of</strong> width-limited microstrip lines,”<br />

Microwave and Optical <strong>Technology</strong> Letters, vol. 36, pp. 63–65,<br />

Jan. 2003.<br />

[8] J. Svacina, “New method for analysis <strong>of</strong> microstrip with finitewidth<br />

ground plane”, Microwave and Optical <strong>Technology</strong> Letters,<br />

Vol. 48, No. 2, pp. 396-399, Feb. 2006.<br />

[9] C.E. Smith, and R.S. Chang, “Microstrip transmission line with<br />

finite width dielectric and ground plane,” IEEE Trans. Microwave<br />

Theory and Techniques, vol. 33, pp. 835–839, Sept. 1985.<br />

[10] N. B. Raičević, S. R. Aleksić and S. S. Ilić, “A hybrid boundary<br />

element method for multilayer electrostatic and magnetostatic<br />

problems,” J. Electromagnetics, No. 30, pp. 507-524, 2010.<br />

[11] N. B. Raičević, S. R. Aleksić, “One method for electric field determination<br />

in the vicinity <strong>of</strong> infinitely thin electrode shells,”<br />

Journal Engineering Analysis with Boundary Elements, Elsevier,<br />

No. 34, pp. 97-104, 2010.<br />

[12] S. S. Ilić, N. B. Raičević, and S. R. Aleksić, “Application <strong>of</strong> new<br />

hybrid boundary element method on grounding systems,” in 14th<br />

International IGTE'10 Symp., <strong>Graz</strong>, Austria, Sept. 19-22, 2010,<br />

pp. 160-165.<br />

[13] N. B. Raičević, S. S. Ilić, and S. R. Aleksić, “Application <strong>of</strong> new<br />

hybrid boundary element method on the cable terminations,” in<br />

14th International IGTE'10 Symp., <strong>Graz</strong>, Austria, Sept. 19-22,<br />

2010, pp. 56-61.<br />

[14] S. S. Ilić, S. R. Aleksić, and N. B. Raičević, “TEM analysis <strong>of</strong><br />

strip line with finite width <strong>of</strong> dielectric substrate by using new<br />

hybrid boundary element method,” in 10-th International Conf.<br />

on Applied Electromagnetics ПЕС 2011, Niš, Serbia, September<br />

25-29, Sept. 2011, CD Proc. О8-4.<br />

[15] B. Petković, S. Ilić, S. Aleksić, N. Raičević, and D. Antić, “A<br />

novel approach to the positive DC nonlinear corona design,” J.<br />

Electromagnetics, vol. 31, no. 7, pp. 505-524, Oct. 2011.<br />

[16] N. B. Raicevic, and S. S. Ilic, “One hybrid method application on<br />

complex media strip lines determination,” in 3rd International<br />

Congress on Advanced Electromagnetic Materials in Microwaves<br />

and Optics, METAMATERIALS 2009, London, United Kingdom,<br />

2009, pp. 698-700.<br />

[17] S. S. Ilić, M. T. Perić, S. R. Aleksić, and N. B. Raičević, “Quasi<br />

TEM analysis <strong>of</strong> 2D symmetrically coupled strip lines with<br />

infinite grounded plane using HBEM,” in Proc. XVII-th<br />

International Symposium on Electrical Apparatus and<br />

Technologies SIELA 2012, Bourgas, Bulgaria, 28–30 May, 2012,<br />

pp.147-155.<br />

[18] A. M. Abbosh, “Analytical closed-form solutions for different<br />

configurations <strong>of</strong> parallel-coupled microstrip lines”, in IET<br />

Microwaves, Antennas & Propagation, Vol. 3, Iss. 1, pp. 137-<br />

147, 2009.<br />

[19] D. Meeker, FEMM 4.2, Available:<br />

http://www.femm.info/wiki/Download


- 78 - 15th IGTE Symposium 2012<br />

Design Approach for a Line-Start Internal Permanent<br />

Magnet Synchronous Motor<br />

1,2 V. Elistratova, 1 M. Hecquet, 1 P. Brochet, 2 D. Vizireanu and 2 M. Dessoude<br />

1 L2EP, Ecole Centrale de Lille, Cité Scientifique - BP 48 - 59651 Villeneuve d'Ascq, France<br />

2 EDF R&D, 1 avenue du Général de Gaulle, 92141 Clamart Cedex, France<br />

E-mail: vera.elistratova@ec-lille.fr<br />

Abstract—The work described in this paper deals with the analytical design and optimization <strong>of</strong> a line-start permanent<br />

magnet synchronous motor (LSPM) with radial magnet configuration. The design approach considers a LSPM as an<br />

induction motor (IM) combined with a permanent magnet rotor arrangement and takes into account the characteristics <strong>of</strong><br />

both asynchronous and synchronous regimes and the motor thermal behavior.<br />

Index Terms — LSPM, Eco-design, Optimization, Multi-physical model.<br />

I. INTRODUCTION<br />

A large amount <strong>of</strong> the primary energy resources are<br />

converted into electric energy. As the main portion <strong>of</strong><br />

greenhouse gases is produced by fossil fuels, electricity<br />

generation is responsible for the worldwide air pollution<br />

and global warming [1].<br />

Electric motors are one <strong>of</strong> the main sources <strong>of</strong><br />

electricity consumption (Fig.1) and this expense is up to<br />

70% in the industrial processes in Europe [2]. As so, the<br />

electric motors are responsible for a huge share <strong>of</strong><br />

emission <strong>of</strong> CO2. Moreover, there are a total potential <strong>of</strong><br />

improving the energy efficiency <strong>of</strong> applications using<br />

electric motors in the range <strong>of</strong> 20-30%. The main factors<br />

<strong>of</strong> such improvements are the use <strong>of</strong> variable speed drives<br />

and the use <strong>of</strong> energy efficient motors. Therefore, electric<br />

motor optimization for a better efficiency is essential for<br />

energy saving and the reduction <strong>of</strong> CO2 emissions.<br />

Figure 1: Distribution <strong>of</strong> the industrial electric consumption [2]<br />

Until now low- and medium - power induction motors<br />

(IMs) are widely used in many industrial applications,<br />

such as pumps and fans. In spite <strong>of</strong> the low cost, IMs<br />

normally suffer from relatively poor operational<br />

efficiency and power factor [3]. Although a permanent<br />

magnet synchronous machine (PMSM) can achieve high<br />

operational efficiency and power factor, it lacks the<br />

starting capability <strong>of</strong> the IM. For last few decades the<br />

line-start permanent magnet synchronous motor (LSPM)<br />

has been designed, constructed and tested. Compared<br />

with an IM, a LSPM has a lot <strong>of</strong> advantages: synchronous<br />

speed, higher power factor and efficiency, small size, etc.<br />

Besides it has an ability to start when connected directly<br />

to the mains.<br />

II. PERFORMANCE DESIGN<br />

The objective <strong>of</strong> this research is to find an analytical<br />

model for the LSPM with different magnet<br />

configurations. In the present paper the LSPM with radial<br />

magnet arrangement (Fig.2) is designed. This<br />

configuration has a number <strong>of</strong> advantages: simple and<br />

robust structure, better protection <strong>of</strong> buried permanent<br />

magnets (PMs) from demagnetization. Moreover, in<br />

comparison with the other topologies this one has one <strong>of</strong><br />

the best asynchronous loading capabilities [4, 10].<br />

Figure 2: LSPM architecture under study<br />

An analytical model <strong>of</strong> a PMSM with the same rotor<br />

architecture may be found in [6, 11]. In our case this<br />

model can be applied under the assumption that during<br />

the steady-state regime a LSPM operates as a PMSM.<br />

In general, the structure <strong>of</strong> a LSPM is similar to an IM<br />

but the rotor includes both cage and inserted permanent<br />

magnets. Hence, the LSPM combines IMs and PMSM<br />

structure features: the LSPM will start due to the resultant<br />

<strong>of</strong> two torque components i.e. the asynchronous torque<br />

and magnet opponent torque (braking torque). If for the<br />

entire speed range during starting the asynchronous<br />

torque is higher than the sum <strong>of</strong> the braking and load<br />

torque, the motor will reach the synchronous regime [4].<br />

Therefore, the design <strong>of</strong> a LSPM has to take into account<br />

both types <strong>of</strong> performances: starting capacity and<br />

efficiency in steady state regime.<br />

To simplify the LSPM design process, the proposed<br />

procedure applied in this article treats separately the<br />

running modes: the motor can be considered as an IM<br />

during its start and synchronization and as a PMSM<br />

during the steady-state regime. Figure 3 shows the<br />

workflow diagram applied for the design approach.


Figure 3: Diagram summarizing the design procedure<br />

At the first stage <strong>of</strong> the design we enter the data<br />

concerning the power and the stator parameters. At this<br />

stage the same design methodology as for an IM could be<br />

applied. For economic reasons, the stator <strong>of</strong> the LSPM is<br />

identical to the IM <strong>of</strong> the same power.<br />

The stage 2 is to choose a squirrel cage that gives the<br />

value <strong>of</strong> the rotor cage resistance, the level <strong>of</strong> saturation<br />

in a rotor tooth and the number <strong>of</strong> rotor bars.<br />

At the next stage a configuration <strong>of</strong> the permanent<br />

magnets has to be chosen. As soon as we know the<br />

geometry <strong>of</strong> the rotor and the PMs, the d- and q-axis<br />

reactances Xd and Xq and the no-load EMF could be<br />

computed. Using these parameters, it can be simulated<br />

the start <strong>of</strong> the LSPM taking into account the braking<br />

torque caused by PMs. If the motor is not able to start, the<br />

designer has to go back either to the Stage 2 in order to<br />

change the rotor squirrel cage to improve the starting<br />

torque or to the Stage 3 to change the configuration <strong>of</strong><br />

PMs and to reduce the breaking torque.<br />

Finally, after the successful start <strong>of</strong> the LSPM (Stage<br />

5), performances and steady-state characteristics are<br />

calculated. If they are acceptable, the design solution is to<br />

be considered as one for the optimization procedure (see<br />

chapters III, IV). Otherwise, the design procedure is to be<br />

repeat starting from the Stage 2.<br />

Each stage <strong>of</strong> the diagram will be further detailed.<br />

A. Asynchronous Torque<br />

The electromagnetic design <strong>of</strong> induction motor is a<br />

well-known problem. In this paper the asynchronous part<br />

design is based on the methodology proposed in [8].<br />

The classical expression for the asynchronous torque<br />

can be written as follows:<br />

- 79 - 15th IGTE Symposium 2012<br />

'<br />

2 R2<br />

3pV<br />

Tc<br />

<br />

s<br />

'<br />

R2<br />

2 ' 2<br />

2 f ( Rsc ) ( X1 cX2)<br />

s<br />

<br />

<br />

where V is the phase RMS voltage, p is the number <strong>of</strong><br />

poles, c=1+X1σ/Xm, X1σ , X`2σ are the stator and rotor<br />

leakage reactances, Xm is the magnetizing reactance.<br />

B. Braking torque<br />

The braking torque is found as a function <strong>of</strong> the back-<br />

EMF and the stator resistance [5]:<br />

T<br />

br<br />

2<br />

2 2 2<br />

(1 s)( Rs Xsq(1 s)<br />

)<br />

s<br />

s<br />

2<br />

RsXsd Xsq 2 2<br />

s<br />

3p<br />

ER<br />

<br />

2 ( (1 ) )<br />

where Rs is the stator resistance, ωs is synchronous<br />

electrical speed, E is the RMS value <strong>of</strong> back-EMF, s is<br />

the slip, Xsd, Xsq are the direct and quadrature<br />

synchronous reactances respectively.<br />

The braking torque peaks the maximum at low speed<br />

and declines near synchronous speed.<br />

C. Steady state regime<br />

During the steady state regime the rotor <strong>of</strong> LSPM<br />

rotates at synchronous speed and its cage has no<br />

influence. In this state the performance <strong>of</strong> the machine<br />

could be calculated as for PMSM.<br />

As stated in [10] the RMS armature current is a<br />

function <strong>of</strong> the motor equivalent electrical parameters:<br />

2 2<br />

Ia I ad Iaq,<br />

(3)<br />

where the axis currents are<br />

I<br />

ad<br />

I<br />

V( X cos R sin )<br />

EX <br />

ad<br />

sq s<br />

2<br />

Xsq Xsd Rs<br />

sq<br />

V( R cos X sin )<br />

ER <br />

,<br />

s sd<br />

2<br />

Xsq Xsd Rs<br />

s<br />

where δ is the load angle.<br />

The input power <strong>of</strong> the motor is<br />

2<br />

in 3[ aq ad aq( sd sq) s a ],<br />

,<br />

(1)<br />

(2)<br />

(4)<br />

(5)<br />

P I EI I X X R I<br />

(6)<br />

Neglecting the stator core losses the electromagnetic<br />

power is<br />

P 3[ I EI I ( X X )].<br />

(7)<br />

elm aq ad aq sd sq<br />

The electromagnetic torque developed by a PMSM is<br />

Pelm<br />

Telm<br />

,<br />

(8)<br />

2<br />

ns<br />

where ns is the synchronous speed <strong>of</strong> the rotating<br />

magnetic field.


Taking into consideration the Joule losses Pj, the<br />

mechanical losses Pm and the stator core losses Ps, the<br />

efficiency can be expressed as<br />

Pin PjPsPm .<br />

(9)<br />

P<br />

in<br />

D. Calculation <strong>of</strong> the back-EMF and the direct and<br />

quadrature synchronous reactances<br />

The analytical model <strong>of</strong> the d-q machine parameters is<br />

interesting as it provides the fast evaluation <strong>of</strong> LSPM<br />

performances at steady-state regime, the obtainment <strong>of</strong> all<br />

the characteristics and their integration into the<br />

optimization procedure. For example, it permits us to find<br />

the optimal volume <strong>of</strong> magnets in terms <strong>of</strong> the improved<br />

efficiency and reduced braking torque.<br />

To compute the d- and q-axis reactances Xd and Xq and<br />

the back-EMF, the finite element simulation or<br />

experimental testing are usually used [3-5, 9, 12].<br />

However, there are a number <strong>of</strong> papers where all these<br />

parameters are analytically expressed as function <strong>of</strong> the<br />

studied machine geometry [6, 10, 11].<br />

According to the model in [6] the flat-topped value <strong>of</strong><br />

the flux density in the air gap is<br />

B<br />

ag<br />

2Br<br />

emehhmp <br />

(4eagehhmmp2eagempRh ,<br />

e e R 2 e e pR e e R )<br />

ag m h m h r m h r<br />

(10)<br />

where em is the magnet width, hm is the magnet height, eag<br />

is the air gap length, eh is the hub thickness, Rh is the<br />

external hub radius (in our case as the shaft is made <strong>of</strong> the<br />

non-magnetic material eh = Rh), Rr is the rotor radius, μ0<br />

is the permeability <strong>of</strong> vacuum, μm is the magnet relative<br />

permeability, Br is the remanent magnetization,<br />

α=em/(2∙Rh), β=/(2·Rr) are geometrical coefficients.<br />

The first harmonic <strong>of</strong> the flux density in the air gap is<br />

4 i Bag1 Bag<br />

sin ,<br />

(11)<br />

2<br />

where αi=2/π is the ratio <strong>of</strong> the average-to-maximum<br />

value <strong>of</strong> the normal component <strong>of</strong> the air gap magnetic<br />

flux density.<br />

The first harmonic <strong>of</strong> the back EMF can be expressed<br />

32RbLact f NskwBremehhmsin( p)<br />

E <br />

,<br />

pe ( (4 eh pe( 2 p) R) ehR)<br />

ag h m m m h h m r<br />

(12)<br />

where Lact is the rotor active length, Ns is the number <strong>of</strong><br />

turns in series per phase, kw is the global winding<br />

coefficient.<br />

The direct and quadrature synchronous inductances<br />

could be found as a function <strong>of</strong> the machine geometry<br />

and winding arrangement:<br />

- 80 - 15th IGTE Symposium 2012<br />

2<br />

<br />

4sin( p<br />

)<br />

2p <br />

p<br />

<br />

<br />

<br />

Ld<br />

<br />

<br />

2 eagempRhemeagRh) <br />

<br />

<br />

<br />

2 2<br />

6Lact0kwNsRb<br />

Lq <br />

2p sin(2 p)<br />

2 2<br />

eag p <br />

<br />

<br />

2(6eag Rb2(4 eag Rb)cos(<br />

)<br />

<br />

<br />

(2 eag Rb)(cos(2<br />

) sin(2 )))<br />

<br />

.<br />

2<br />

p(2 eag Rb)<br />

eag<br />

<br />

<br />

<br />

2 2<br />

6Lact0kNR w s b sin(2 p) 8eeR<br />

m h r sin( p)<br />

,<br />

2 2<br />

eag p p(4eagehhmpehemRr (13)<br />

(14)<br />

III. OPTIMIZATION PROBLEM<br />

The goal <strong>of</strong> optimization process consists in finding the<br />

set <strong>of</strong> optimal configurations R * taking into account<br />

parameters and constraints imposed by the design<br />

specification. Table I presents the specification for the<br />

studied LSPM. In the presented study the dependency<br />

between the efficiency and the magnet braking torque is<br />

analyzed.<br />

Table I. Specification <strong>of</strong> the designed LSPM machine<br />

Parameter Value/Feasible interval<br />

Power, [kW] 7.5<br />

Voltage LL, [V] 400<br />

Supply frequency f, [Hz] 50<br />

Rated speed, [rpm] 1500<br />

Height <strong>of</strong> the shaft axe, [mm] 132<br />

Rated Torque, [Nm] 47.75<br />

Overload conditions, Tmax/Trated<br />

≥1.6<br />

Ambient temperature, Tamb [°C] [-10; 40]<br />

Stator winding temperature rise<br />

average, [K]<br />

80<br />

Stator winding hot spot, [K] 90<br />

Load torque, [p.u.]<br />

Linearly from zero to nominal<br />

speed, starting from 0.8pu to 1<br />

p.u.<br />

Power factor, cosφ ≥0.8<br />

Efficiency η, [%] To be maximized<br />

Geometry <strong>of</strong> permanent magnets Radial magnet configuration<br />

The design vector X =[x1, x2,…, xn] T identifies the set<br />

<strong>of</strong> design variables. The design variables can be freely<br />

varied by the designer to define a designed object [7].<br />

The permanent magnet geometry is analytically<br />

predetermined form the imposed specification (Table I).<br />

Consequently, the design vector <strong>of</strong> the studied problem is<br />

composed <strong>of</strong> 3 variables: x1 – length <strong>of</strong> the air gap eag; x2<br />

– magnet height hm; x3 – magnet width em. According to<br />

equations (12-14) these 3 parameters are sufficient to<br />

compute the d- and q-axis reactances Xd and Xq and the<br />

back-EMF. Due to manufacturing constraints all <strong>of</strong> the<br />

components <strong>of</strong> design vector X are discontinuous and<br />

standardized.


Formally, the problem is expressed as follows:<br />

<br />

minimize1<br />

η, Tbr ,<br />

X<br />

<br />

(15)<br />

subject to GX ( )= g 1( X), g 2( X),..., g n(<br />

X)<br />

0,n<br />

=2.<br />

Electromagnetic constraints <strong>of</strong> the problem G(X) are<br />

specified in the Table II.<br />

Table II. Constraints <strong>of</strong> the optimization problem<br />

Function Constraint level<br />

Power factor cosφ, p.u. ≥0.8<br />

≥1.6<br />

Tmax/Trated<br />

Where {Tmax/Trated, cosϕ} are the feasible domains for<br />

the maximum torque ratio for synchronous operation and<br />

power factor.<br />

Taking into consideration the fact that all the<br />

components are discrete, in order to find R* a lot <strong>of</strong><br />

configurations have to be investigated.<br />

IV. OPTIMIZATION TECHNIQUE<br />

The optimization method applied for the considering<br />

problem (15) is the exhaustive enumeration (EE) [7, 13].<br />

It is an exact method with evaluations <strong>of</strong> all possible<br />

combinations <strong>of</strong> the PM dimensions and air gap length.<br />

The method doesn’t have any heuristic rules at all.<br />

Because <strong>of</strong> the presence <strong>of</strong> several objective functions,<br />

the aim <strong>of</strong> multi-objective evolutionary algorithms is to<br />

find compromise solutions rather than a single optimal<br />

point as in scalar optimization problems [14].<br />

These trade<strong>of</strong>f solutions are usually called Pareto<br />

optimal solutions. The EE was applied in order to obtain<br />

a genuine Pareto-Front. The method is not pretended to<br />

be the best one in terms <strong>of</strong> total time <strong>of</strong> calculation, but<br />

on the other hand, it gives reliable results.<br />

Input parameters were:<br />

Design vector:<br />

X =[x1, x2,…, xn] T in our case n = 3.<br />

Objective functions:<br />

F(X) = {f1(X),f2(X),…, fm(X)} in our case m=2<br />

and F(X) = {(1- η), Tbr}.<br />

Constraints:<br />

G(X) = {g1(X), g2(X),…, gk(X)} in our case k =2.<br />

The feasible set Ω= {ω1, ω2,…, ωn},where ωi is the<br />

subset which contains all feasible values for the<br />

component xi <strong>of</strong> the design vector, for i=1…n. In<br />

our case n = 3. As all <strong>of</strong> the components <strong>of</strong> design<br />

vector X are discrete, Ω is a finite set that is<br />

composed <strong>of</strong> the possible standardized values.<br />

Output parameters:<br />

The set <strong>of</strong> optimal solutions:<br />

R * = {Xi * X 0 |G (Xi * ) ≤, for i=1,..m}, where<br />

Xi * is the degenerate interval, and each component<br />

*<br />

<strong>of</strong> X is a Pareto optimal solution. Therefore Xi<br />

has following features:<br />

- 81 - 15th IGTE Symposium 2012<br />

*<br />

fl( ) fl( i) for l 1...<br />

m,<br />

*<br />

f j( ) f j( i)<br />

for at least one index j.<br />

The problem (15) was treated and a total <strong>of</strong> 360<br />

combinations has been enumerated. Among these 360<br />

combinations there are 160 that belong to the feasible<br />

domain defined by optimization constraints. In Fig.4 the<br />

feasible set <strong>of</strong> solutions for the EE and the Pareto frontier<br />

are presented.<br />

Figure 4: Pareto front <strong>of</strong> efficiency versus braking torque<br />

V. DESIGN RESULTS<br />

A boundary point <strong>of</strong> the maximal efficiency from the<br />

Pareto frontier has been chosen for a deeper investigation.<br />

Based on this optimal solution and solving the set <strong>of</strong><br />

equations (1-14) all the characteristics for steady-state<br />

regime and optimal dimensions <strong>of</strong> permanent magnets<br />

and air gap length have been found (Table III).<br />

Table III. Optimal solution for <strong>of</strong> the designed LSPM<br />

Parameter Value<br />

Efficiency η, [%] 91.2<br />

Braking torque, Nm 11.66<br />

Power factor cosφ, p.u. 0.983<br />

Air gap length eag, mm 0.7<br />

Magnet height hm, mm 29.0<br />

Magnet width em, mm 15.0<br />

Overload condition, Tmax/Trated<br />

1.783<br />

Table III shows that the efficiency <strong>of</strong> the designed<br />

LSPM compared with a premium efficiency class<br />

induction motor (PEIM) is greater than 0.8% [20]. The<br />

power factor <strong>of</strong> the PEIM (0.9 p.u.) is much lower<br />

compared with the designed LSPM (0.983 p.u.). It means<br />

the LSPM can achieve a very high power factor in a wide<br />

output power range. This feature assists in saving energy<br />

when the motor is running at different loads.<br />

Based on the designed data the static and dynamic<br />

characteristics were obtained. Figures 5-7 show that the<br />

designed LSPM is able to start and synchronize even at<br />

85% <strong>of</strong> the rated voltage.


Figure 5: Torque versus speed curve <strong>of</strong> the studied motor<br />

with supplied phase voltage equal 231V<br />

Figure 6: Torque versus speed curve <strong>of</strong> the studied motor<br />

with supplied phase voltage equal 85% * 231V<br />

Figure 7: Motor speed during transient start<br />

supplied with different voltages<br />

VI. THERMAL MODEL<br />

An increase in motor temperature can cause the stator<br />

winding insulation degradation and permanent magnet<br />

material decreased performances. According to the design<br />

specification (Table I), the acceptable heating in the<br />

LSPM doesn’t have to exceed 90K. To predict the motor<br />

transient thermal behavior an analytical model based on<br />

the general cylindrical component [21] was developed<br />

(Fig. 8).<br />

Figure 8: A simplified model <strong>of</strong> LSPM as the heating body<br />

- 82 - 15th IGTE Symposium 2012<br />

This model corresponds to the system <strong>of</strong> equations:<br />

dcu<br />

cu 12 ( ) 1<br />

<br />

P A cu st A cu C1 ,<br />

dt<br />

<br />

dst<br />

P 2 12<br />

st A st A ( cu st ) C2 .<br />

<br />

dt<br />

(16)<br />

where ∆θcu and ∆θst are the average heating in the copper<br />

and respectively the stator laminations, A1, A2, A12 are<br />

the heat transfer coefficients, C1 and C2 represent the heat<br />

capacities <strong>of</strong> stator core and stator winding. In order to<br />

determine the temperature, a simplified equivalent<br />

thermal network (ETN) model <strong>of</strong> the LSPM is considered<br />

(Fig. 9).<br />

Figure 9: Simplified equivalent thermal network <strong>of</strong> LSPM<br />

(<br />

cu, a cu, c ) cu -cu, a a, in - cu, c s, st Pcu,<br />

<br />

(<br />

cu, a rot, c c, f ) s, st cu, c curot, c rot <br />

<br />

c, ff Ps,<br />

st,<br />

(17)<br />

<br />

<br />

(<br />

rot, a rot, c ) s, st rot, a a, in rot, s s, st Prot<br />

<br />

<br />

( cu, a rot, a a, f ) a, in cu,<br />

a curot, a rot <br />

a, ff Pa,<br />

in,<br />

<br />

( c, fa, f f) fc, fs, sta, fa, in0.<br />

where Δθcu is the heating in the copper winding; Δθs,st -<br />

heating in the steel stator pack, Δθrot - heating in the rotor;<br />

Δθa,in - heating in the air gap; Δθf - heating in the motor<br />

case, Рcu - source <strong>of</strong> losses in the copper winding, Рs,st -<br />

source <strong>of</strong> losses in the stator pack, Рrot - source <strong>of</strong> losses<br />

in the rotor, Рa,in - source <strong>of</strong> mechanical and additional<br />

losses, Λcu,c - thermal conductivity between the slot<br />

winding and stator core, Λcu,a – thermal conductivity<br />

between the winding and the air gap, Λrot,a – thermal<br />

conductivity between the rotor and the air gap, Λrot,с –<br />

thermal conductivity between the rotor and the stator<br />

core, Λa,f – thermal conductivity between the air inside<br />

the motor and the motor case, Λс,f – thermal conductivity<br />

between the stator core and the motor frame; Λf – thermal<br />

conductivity between the motor frame and the external<br />

air.<br />

The solution <strong>of</strong> the systems (16, 17) enabled us to<br />

model overheating in the main parts <strong>of</strong> the designed<br />

motor (Figs. 10, 11).


Figure 10: The increase <strong>of</strong> winding temperature<br />

Figure 11: The increase <strong>of</strong> rotor core temperature<br />

According to figures 10, 11 maximal overheating in<br />

winding is 79.4°C, maximal overheating in rotor is about<br />

<strong>of</strong> 26.3°C that is in compliance with the specification<br />

requirements (Table I).<br />

VII. CONCLUSION<br />

A design method for a LSPM motor considering the<br />

asynchronous starting capacity and the synchronous<br />

steady state performances is proposed in order to find out<br />

an optimal design solution for the given motor topology.<br />

The approach is based on the design <strong>of</strong> an asynchronous<br />

machine incorporating the effect <strong>of</strong> magnets. The present<br />

analytical model takes into account the radial magnet<br />

topology and is to be extended for the other LSPM<br />

architectures.<br />

It has been shown that the efficiency and the power<br />

factor <strong>of</strong> the designed LSPM is greater compared with a<br />

PEIM <strong>of</strong> the same power.<br />

Thereafter, an analytical thermal model was developed.<br />

The proposed thermal model allows predicting the<br />

overheating in the main parts <strong>of</strong> the motor. During the<br />

design process the thermal model didn’t take part in<br />

optimization procedure.<br />

In future investigations it might be possible to combine<br />

the electro-magnetic and thermal optimization problems<br />

in order to integrate them into optimization procedure.<br />

The verification <strong>of</strong> the analytical approach will be<br />

provided by both finite element and experimental models.<br />

- 83 - 15th IGTE Symposium 2012<br />

REFERENCES<br />

[1] Key world energy statistics. International Energy Agency, 2010.<br />

[2] La rentabilité énergétique les entrainements, Mesures 803, Mars<br />

2008, www.mesures.com.<br />

[3] Jian Li and Jungtae Song and Yunhyun Cho. A High-Performance<br />

Line-Start Permanent Magnet Synchronous Motor Amended From<br />

a Small Industrial Three-Phase Induction Motor. In Industrial<br />

Electronics, 2010 IEEE International Symposium, pp. 1308 -1313.<br />

[4] T. Ruan, H. Pan, Y. Xia « Design and Analysis <strong>of</strong> Two Different<br />

Line-Start PM Synchronous Motors», Artificial Intelligence,<br />

Management Science and Electronic Commerce (AIMSEC), 2011.<br />

[5] Soulard, J.; Nee, H.-P.; , "Study <strong>of</strong> the synchronization <strong>of</strong> linestart<br />

permanent magnet synchronous motors," Industry<br />

Applications Conference, 2000. Conference Record <strong>of</strong> the 2000<br />

IEEE , vol.1, no., pp.424-431 vol.1, 2000.<br />

[6] X. Jannot, J.-C. Vannier, J. Saint-Michel and M. Gabsi, An<br />

Analytical Model for Interior Permanent-Magnet Synchronous<br />

Machine with Circumferential Magnetization Design, IEEE,<br />

10.1109/ELECTROMOTION.2009.5259155, July 2009.<br />

[7] P.Venkataraman, Applied Optimization with<br />

Matlab Programming, A Wiley - Interscience publication, John<br />

Wiley & Sons, New York, 2001.<br />

[8] I.P. Kopylov, Electric Machines: M., Energoatomizdat, 1986.<br />

[9] K. Kurihara, M. Azizur Rahman, High Efficiency Line-Start<br />

Interior Permanent Magnet Synchronous Motors, IEEE Trans.<br />

Industry Applications, Vol. 40 Issue 3, May 2004.<br />

[10] J.F.Gieras, M. Wing, Permanent Magnet Motor <strong>Technology</strong>,<br />

USA, Marcel Dekker, 2002.<br />

[11] D.Fodorean, A. Miraoui, Dimensionnement rapide des machines<br />

synchrones à aimants permanents (MSAP), Techniques de<br />

l’ingénieur, Nov. 10, 2009.<br />

[12] H-P. Nee, L. Lefevre, P. Thelin, J. Soulard, Determination <strong>of</strong> d<br />

and q reactances <strong>of</strong> permanent magnet synchronous motors<br />

without measurements <strong>of</strong> the rotor position, IEEE Trans. on<br />

Industry Applications, Vol. 36, No. 5, 1330-1335, Oct. 2000.<br />

[13] D. Samarkanov, F. Gillon, P.Brochet, D. Laloy , Optimal design<br />

<strong>of</strong> induction machine using interval algorithms, COMPEL: The<br />

International Journal for Computation and Mathematics in<br />

Electrical and Electronic Engineering, Vol. 31, N°.5, pages. 1492 -<br />

1502, ISBN. 0332-1649, 8-2012.<br />

[14] P. Alotto, U. Baumgartner, F. Freschi, M. Jaindl, A. Köstinger,<br />

Ch. Magele, W. Renhart, and M. Repetto, SMES Benchmark<br />

Extended: Introducing Pareto Optimal Solutions Into TEAM22,<br />

IEEE Transactions on Magnetics, Vol. 44, No.6, pp. 1066-1069,<br />

2008.<br />

[15] Mellor, P.H.; Roberts, D.; Turner, D.R.; , "Lumped parameter<br />

thermal model for electrical machines <strong>of</strong> TEFC design," Electric<br />

Power Applications, IEE <strong>Proceedings</strong> B , vol.138, no.5, pp.205-<br />

218, Sep 1991.<br />

[16] IEC 60034-30, Standard on efficiency classes for low voltage AC<br />

motors, 2008.<br />

[17] D. Stoia, M. Antonoaie, D. Ilea, M. Cernat, Design <strong>of</strong> Line Start<br />

PM Motors with High Power Factor, Proc. POWERENG 2007,<br />

Setubal, Portugal, 12-14 April, 2007, published on CD-Rom,<br />

IEEE Catalog Number 07EX1654C, ISBN: 1-4244-0895-4, paper<br />

186.<br />

[18] T. Miller, Synchronization <strong>of</strong> line-start permanent magnet AC<br />

motor, IEEE Trans. Power Apparatus and Systems, vol. PAS-103,<br />

July 1984, pp 1822-1828.<br />

[19] T. Tran, S. Brisset, P. Brochet, A Benchmark for Multi-objective,<br />

Multi-Level and Combinatorial Optimizations <strong>of</strong> a Safety<br />

Isolating Transformer, COMPUMAG 2007, Aachen, Germany,<br />

6- 2007<br />

[20] X. Feng, L. Liu, J. Kang, Y. Zhang, Super Premium Efficient<br />

Line Start-up Permanent Magnet Synchronous Motor, Proc. Of<br />

XIX International Conference on Electrical Machines, ICEM2010,<br />

Roma, Italy, Sept. 6-8, 2010.<br />

[21] A.I. Borisenko Cooling <strong>of</strong> industrial electrical machinery,<br />

Energoatomizdat, 1983.


- 84 - 15th IGTE Symposium 2012<br />

Speed-up <strong>of</strong> Nonlinear Magnetic Field Analysis using a Modified<br />

Fixed-Point Method<br />

Norio Takahashi 1 , Kousuke Shimomura 1 , Daisuke Miyagi 2 and Hiroyuki Kaimori 3<br />

1 Dept. Electrical and Electronic Eng., Okayama <strong>University</strong>, Okayama 700-8530 Japan<br />

2 Dept. Electrical Eng., Tohoku <strong>University</strong>, Sendai 980-8579 Japan<br />

3 Science Solutions Int. Lab., Inc., Tokyo 153-0065 Japan<br />

The nonlinear finite element analysis <strong>of</strong> magnetic fields using the Fixed-Point method (FPM) requires a number <strong>of</strong> iterations and<br />

long CPU time compared with those using the Newton-Raphson method (NRM). On the other hand, the Fixed-Point method has an<br />

advantage that the convergence can be obtained even for a complicated nonlinear anisotropy problem, <strong>of</strong> which the convergence is<br />

very difficult using a conventional Newton-Raphson method. Moreover, it has an advantage that a s<strong>of</strong>tware can be easily obtained by<br />

slightly modifying a linear FEM s<strong>of</strong>tware. We then achieved the speed-up <strong>of</strong> the Fixed-Point method by updating the reluctivity at each<br />

iteration (This is called a modified Fixed-Point method). It is shown that the formulation <strong>of</strong> the Fixed-Point method using the<br />

derivative <strong>of</strong> reluctivity is almost the same as that <strong>of</strong> the Newton-Raphson method. The convergence properties <strong>of</strong> these methods are<br />

compared. It is shown that the modified Fixed-Point method has an advantage that the programming is easy and it has a similar<br />

convergence property to the Newton-Raphson method for an isotropic nonlinear problem.<br />

Index Terms—finite element method, Fixed-Point method, Newton-Raphson method, nonlinear electromagnetic analysis<br />

I. INTRODUCTION<br />

The Fixed-Point method [1,2] has an advantage that the<br />

convergence can be obtained even for a complicated nonlinear<br />

problems [3] such as the analysis considering vector magnetic<br />

properties treating an anisotropic material [4, 5], in which the<br />

convergence is sometimes difficult. In addition, it has an<br />

advantage that the s<strong>of</strong>tware for nonlinear analysis can be<br />

easily obtained by adding a small change to that for linear<br />

analysis. But, the Fixed-Point method requires a number <strong>of</strong><br />

iterations and long CPU time compared with those <strong>of</strong> the<br />

Newton-Raphson method [6]. It is reported that the CPU time<br />

can be reduced by using a constant reluctivity in the<br />

beginning <strong>of</strong> nonlinear iterations [7,8 ]. However, nearly ten<br />

times longer CPU time is still necessary compared with the<br />

Newton-Raphson method.<br />

In this paper, a modified Fixed-Point method, which<br />

updates the derivative <strong>of</strong> reluctivity at each iteration, is<br />

proposed. Furthermore, it is pointed out that the formulation <strong>of</strong><br />

the Fixed-Point method using the derivative <strong>of</strong> reluctivity is<br />

the same as the Newton-Raphson method. The convergence<br />

characteristic <strong>of</strong> the newly proposed Fixed-Point method is<br />

compared with those <strong>of</strong> the Newton-Raphson method.<br />

II. FORMULATION OF NRM AND FPM<br />

A. Newton-Raphson Method<br />

There are two kinds <strong>of</strong> methods which deal with the<br />

nonlinearity in the Newton-Raphson method (NRM). One is<br />

the method A (NRM(B 2 )) which uses ν-B 2 curve. In this<br />

method, the magnetic field strength H is given by<br />

2<br />

H ( B ) B<br />

(1)<br />

B is the flux density. The reluctivity ν is given by<br />

2 H(<br />

B)<br />

( B ) <br />

(2)<br />

B<br />

The other is the method B (NRM(B)) which uses the B-H<br />

curve directly. In this method, the magnetic field strength H is<br />

given by<br />

B<br />

H H(<br />

B )<br />

(3)<br />

B<br />

1) Method A (NRM(B 2 )<br />

The static magnetic field equation can be written as follows<br />

in the case <strong>of</strong> the Newton-Raphson method using the -B 2<br />

curve:<br />

H <br />

( <br />

A)<br />

J<br />

(4)<br />

0<br />

where, A is the magnetic vector potential. J0 is the forced<br />

current density. The Galerkin equation G * i(A (k) ) <strong>of</strong> (4) is given<br />

by<br />

* ( k )<br />

( k 1)<br />

( k 1)<br />

Gi ( A ) <br />

N i ( A ) dV N iJ<br />

0dV<br />

(5)<br />

where, Ni is the interpolation function <strong>of</strong> the edge element.<br />

The residual Gi(A) at the k-th nonlinear iteration is given by<br />

( k ) * ( k )<br />

( k )<br />

G ( A ) G i ( A ) G<br />

( A )<br />

i<br />

( k )<br />

j<br />

i<br />

( k 1)<br />

( k 1)<br />

N i ( A ) dV N iJ<br />

0dV<br />

<br />

( k 1)<br />

( k 1)<br />

N i ( A ) dV A<br />

A<br />

* ( k )<br />

G i ( A ) N ( <br />

<br />

i<br />

( k 1)<br />

Ν ) dV<br />

( k 1)<br />

<br />

( k 1)<br />

( k )<br />

N dV<br />

i<br />

A A<br />

( k )<br />

i<br />

A<br />

j<br />

2<br />

<br />

B<br />

<br />

B<br />

2B<br />

(7)<br />

2<br />

2<br />

Aj<br />

B<br />

Aj<br />

B<br />

Aj<br />

where, A, ν etc. in (6) and (7) are values at the k-th iteration.<br />

∂ν/∂B 2 is the term which represents nonlinear magnetic<br />

properties. The process <strong>of</strong> calculation is as follows:<br />

1) The initial value <strong>of</strong> ν is determined.<br />

2) δA (0) is set to zero.<br />

3) A is updated by A (k) =A (k-1) +δA (k) using δA (k) calculated by<br />

(6).<br />

j<br />

( k )<br />

i<br />

1<br />

(6)


4) ν (k) is calculated using the ν-B 2 curve from B obtained by<br />

A (k) .<br />

5) The process from 3) to 5) is repeated.<br />

6) It is judged to be converged if δB(A (k) ) is less than a<br />

specified small value.<br />

2) Method B (NRM(B))<br />

The static magnetic field equation can be written as follows<br />

in the case <strong>of</strong> the Newton-Raphson method using the B-H<br />

curve:<br />

H J<br />

(8)<br />

0<br />

The Galerkin equation G * i(A) <strong>of</strong> (8) is given by<br />

<br />

* ( k )<br />

G i ( A ) <br />

N i HdV<br />

N iJ<br />

0dV<br />

(9)<br />

The residual Gi(A) at the k-th nonlinear iteration is given by<br />

( k ) * ( k )<br />

( k )<br />

G ( A ) G i ( A ) G<br />

( A )<br />

i<br />

i<br />

( k 1)<br />

( k )<br />

N dV dV dV<br />

i H N iJ<br />

N i H<br />

0<br />

( k 1)<br />

* ( k )<br />

H<br />

( B ) ( k ) (10)<br />

G i ( A ) N <br />

dV<br />

i<br />

Bi<br />

B<br />

( k 1)<br />

* ( k )<br />

H<br />

( B )<br />

( k )<br />

G i ( A ) N i A<br />

dV<br />

B<br />

∂H(B)/∂B is the term which represents nonlinear magnetic<br />

properties.<br />

The process <strong>of</strong> calculation is as follows:<br />

1) The initial value <strong>of</strong> ∂H(B (0) )/∂B is determined.<br />

2) δA (0) is set to zero.<br />

3) A is updated by A (k) =A (k-1) +δA (k) using δA (k) calculated by<br />

(10).<br />

4) H (k) is calculated using the B-H curve from B obtained by<br />

A (k) .<br />

5) The process from 3) to 5) is repeated.<br />

6) It is judged to be converged if δB(A (k) ) is less than a<br />

specified small value.<br />

B. Fixed-Point Method<br />

In the Newton-Raphson method, the reluctivity is updated<br />

in each nonlinear iteration as explained above. In the Fixed-<br />

Point method, the reluctivity is fixed at the first step and it is<br />

not changed during the nonlinear iterations.<br />

According to the concept <strong>of</strong> the Fixed-Point method [1], the<br />

magnetic field strength is given by<br />

H( B)<br />

ν B H<br />

(11)<br />

FP<br />

FP<br />

where, FP is the Fixed-Point reluctivity which is constant<br />

during the nonlinear iterations, HFP is an additional magnetic<br />

field strength.<br />

The static magnetic field equation can be written as follows<br />

in the case <strong>of</strong> the Fixed-Point method:<br />

FP 0 )<br />

( J H B FP<br />

(12)<br />

where, HFP (k) at the k-th nonlinear iteration can be obtained by<br />

the following equation:<br />

( k )<br />

( k1)<br />

( k1)<br />

H FP H(<br />

B ) νFPB<br />

(13)<br />

where, H(B (k-1) ) is the magnetic field strength vector on the B-<br />

H curve corresponding to the flux density B (k-1) at the (k-1)-th<br />

nonlinear iteration. HFP (k) converges to some value after<br />

iterations. The residual Gi(A) <strong>of</strong> (12) is given by<br />

<br />

- 85 - 15th IGTE Symposium 2012<br />

<br />

<br />

( k )<br />

Gi<br />

( A) <br />

( k )<br />

N i ( FP<br />

A ) dV N iJ<br />

0dV<br />

(14)<br />

<br />

( k )<br />

N i H FP dV<br />

By substituting HFP (k) in (13) into HFP (k) in (14), we obtain<br />

( k ) * ( k )<br />

G ( A ) G ( A ) N H<br />

i<br />

i<br />

)<br />

dV<br />

* ( k )<br />

( k 1)<br />

( k 1)<br />

G ( A ) N ( H(<br />

B ) <br />

B ) dV<br />

i<br />

<br />

<br />

<br />

* ( k )<br />

( k )<br />

G ( A ) N ( A<br />

) dV<br />

i<br />

i<br />

i<br />

i<br />

<br />

<br />

FP<br />

FP<br />

FP<br />

i<br />

i<br />

i<br />

FP<br />

( k<br />

FP<br />

( k )<br />

( k 1)<br />

N ( A ) dV N J dV N H ( B ) dV<br />

( k 1)<br />

N ( A ) dV<br />

i 0<br />

( k 1)<br />

( k 1)<br />

N ( A<br />

) dV N J dV N H(<br />

B ) dV<br />

where, δA (k) =A (k) -A (k-1) .<br />

Gi * (A (k) ) is given by<br />

<br />

<br />

*<br />

Gi i FP<br />

i 0<br />

<br />

<br />

i<br />

0<br />

FP<br />

<br />

<br />

<br />

<br />

i<br />

i<br />

2<br />

(15)<br />

( k )<br />

( k )<br />

( A ) <br />

N (<br />

<br />

A ) dV N J dV<br />

(16)<br />

In the actual calculation, (14) is used in the Fixed-Point<br />

method.<br />

The process <strong>of</strong> calculation is as follows:<br />

1) The initial value <strong>of</strong> FP is determined.<br />

2) HFP (0) is set to zero.<br />

3) B (k) is obtained from A (k) which is calculated by (14).<br />

4) HFP (k) is obtained by (13).<br />

5) The right hand side <strong>of</strong> (14) is updated and the process from<br />

3) to 5) is repeated.<br />

6) It is judged to be converged if the change <strong>of</strong> B (k) is less<br />

than the specified small value.<br />

According to (15), we found that HFP (k) is given by<br />

H <br />

A<br />

H H<br />

(17)<br />

( k )<br />

( k )<br />

( k )<br />

( k1)<br />

FP FP<br />

FP<br />

FP<br />

(17) means that the difference HFP (k) is the same as H in (9)<br />

<strong>of</strong> the Newton-Raphson method and it can be used as the<br />

judgment <strong>of</strong> the convergence.<br />

Fig.1 shows the concept <strong>of</strong> the nonlinear magnetic field<br />

analysis using the Fixed-Point method. A white circle on the B<br />

axis is a convergence target. In this method, the reluctivity FP<br />

shown in Fig.1 is given as an initial value, and FP is not<br />

changed during the iterations. The flux density B (1) is obtained<br />

by the linear magnetic field analysis. Next, the HFP (1) which<br />

corresponds to the flux density B (1) on the B-H curve and<br />

FPB (1) on the line <strong>of</strong> FP shown in Fig.1(a) is obtained. During<br />

iterations, HFP (k) becomes the same value, which means the<br />

difference HFP (k) becomes almost zero. Then, the converged<br />

result can be obtained.<br />

C. Modified Fixed-Point Method<br />

In the modified Fixed-Point method, the derivative <strong>of</strong><br />

reluctivity is updated at each iteration. In this expression, the<br />

HFP (k) at the k-th nonlinear iteration in (13) can be rewritten by<br />

the following equation:<br />

( k 1)<br />

( k )<br />

( k 1)<br />

H(<br />

B ) ( k 1)<br />

H H(<br />

B ) B<br />

FP<br />

(18)<br />

B<br />

The residual Gi(A (k) ) is given by<br />

k<br />

k<br />

k<br />

k<br />

k<br />

Gi<br />

G i i<br />

dV <br />

( 1)<br />

( ) * ( )<br />

( 1)<br />

H(<br />

B ) ( 1)<br />

<br />

( A ) ( A ) N H(<br />

B ) B (19)<br />

<br />

B<br />

<br />

(19) can be written as follows:


H<br />

FPB (1) FPB (1)<br />

HB (1) HB νFP<br />

(1) νFP<br />

H (1)<br />

FP<br />

H<br />

FPB (2) FPB (2)<br />

HB (2) HB (2) <br />

H (1)<br />

FP<br />

H (2)<br />

FP<br />

H<br />

FPB (2) FPB (2)<br />

HB (3) HB (3) <br />

H (2)<br />

FP<br />

H (3)<br />

FP<br />

ν<br />

ν<br />

FP<br />

ν<br />

FP<br />

FP<br />

ν<br />

ν<br />

FP<br />

FP<br />

(a)<br />

B (1) B (1)<br />

B (2) B (2)<br />

(b)<br />

<br />

H<br />

B (3) B (3)<br />

<br />

H<br />

( 3 )<br />

FP<br />

<br />

H<br />

( 2 )<br />

FP<br />

( 1 )<br />

FP<br />

B-H curve<br />

( k 1)<br />

( k ) H<br />

( B )<br />

( k ) <br />

G ( A ) NAdV NJdV<br />

i<br />

i<br />

i 0<br />

B<br />

<br />

( k 1)<br />

( k 1)<br />

H<br />

( B )<br />

( k 1)<br />

<br />

<br />

N i H(<br />

B ) dV <br />

N i<br />

A dV<br />

B<br />

(20)<br />

( k 1)<br />

( k 1)<br />

H<br />

( B )<br />

( k ) <br />

<br />

N H(<br />

B ) dV N J dV <br />

N A<br />

dV<br />

i<br />

i 0<br />

i<br />

B<br />

<br />

( k 1)<br />

* ( k ) H<br />

( B )<br />

( k ) <br />

G ( A ) <br />

N A<br />

dV<br />

i<br />

i<br />

B<br />

<br />

(10) and (20) denote that the formulation <strong>of</strong> the modified<br />

Fixed-Point method is the same as that <strong>of</strong> the Newton-<br />

Raphson method.<br />

In the actual calculation <strong>of</strong> the modified Fixed-Point<br />

method, (19) is used.<br />

The process <strong>of</strong> calculation is as follows:<br />

1) The initial value <strong>of</strong> ∂H(B (0) )/∂B is determined.<br />

2) HFP (0) is set to zero.<br />

3) B (k) is obtained from A (k) which is calculated by (19).<br />

4) HFP (k) is obtained by (18).<br />

B<br />

B-H curve<br />

B-H curve<br />

(c)<br />

Fig. 1 Conceptual diagram <strong>of</strong> Fixed-Point method. (a) 1 st step. (b) 2 nd step.<br />

(c) 3 rd step.<br />

B<br />

- 86 - 15th IGTE Symposium 2012<br />

5) The right hand side <strong>of</strong> (19) is updated and the process from<br />

3) to 5) is repeated.<br />

6) It is judged to be converged if the change <strong>of</strong> B (k) is less<br />

than the specified small value.<br />

Fig.2 shows the concept <strong>of</strong> the nonlinear magnetic field<br />

analysis using the modified Fixed-Point method. In this<br />

method, the reluctivity νFP shown in Fig.2 (a) is given as an<br />

initial value, and the derivative ∂H/∂B is updated at each<br />

iteration. At the initial iteration, the linear magnetic field<br />

analysis is carried out using the given ∂H/∂B, and the flux<br />

density B (1) is obtained. Next, H(B (1) ) corresponding to the<br />

flux density B (1) on the B-H curve and<br />

∂H(B (1) )/∂B·B (1) =VFPB (1) on the line ∂H(B (1) )/∂B shown in<br />

Fig.2(a) is obtained. At the first step, HFP (k) =HFP (1) following<br />

the definition <strong>of</strong> HFP (1) in (17). The iteration is carried out<br />

H(B (1) H(B ) (1) )<br />

H<br />

FPB (1) FPB (1)<br />

H (1)<br />

FP<br />

0<br />

H<br />

H(B (2) H(B ) (2) )<br />

FPB (2) FPB (2)<br />

H (1)<br />

FP<br />

0<br />

H<br />

H(B (3) H(B ) (3) )<br />

FPB (3) FPB (3)<br />

H (2)<br />

FP<br />

0<br />

( 1 )<br />

H ( B )<br />

<br />

B<br />

( 2 )<br />

B (1) B (1)<br />

(a)<br />

H ( B )<br />

( 2 )<br />

H FP<br />

<br />

B<br />

<br />

H<br />

<br />

H<br />

( 1 )<br />

FP<br />

( 2 )<br />

FP<br />

B (2) B<br />

( 2)<br />

H(<br />

B )<br />

B<br />

(2)<br />

( 2)<br />

H(<br />

B )<br />

B<br />

(b)<br />

( 3 )<br />

H ( B )<br />

( 3 )<br />

H FP<br />

<br />

B<br />

B (3) B (3)<br />

( 3 )<br />

H ( B )<br />

<br />

B<br />

H ( B )<br />

<br />

B<br />

<br />

H<br />

B-H curve<br />

( 1 )<br />

B<br />

B-H curve<br />

B<br />

B-H curve<br />

( 3 )<br />

FP<br />

B<br />

(c)<br />

Fig. 2 Conceptual diagram <strong>of</strong> Modified Fixed-Point method. (a) 1 st step. (b)<br />

2 nd step. (c) 3 rd step.<br />

3


until δHFP becomes near to zero. H (which corresponds to<br />

HFP in (17)) can be directly obtained by using the Newton-<br />

Raphson method as shown in (10). The modified Fixed-Point<br />

method needs two steps (Eqs. (18) and (19)), but the concept<br />

is the same as that <strong>of</strong> the Newton- Raphson method.<br />

III. ANALYZED MODEL<br />

The modified Fixed-Point method is applied to the analysis<br />

<strong>of</strong> the magnetic field in the billet heater model [9] shown in<br />

Fig.3. Analysis domain <strong>of</strong> the model is 1/8. The material <strong>of</strong><br />

the yoke is 35A230(non-oriented electrical steel). The material<br />

<strong>of</strong> the billet is S45C(carbon steel). The numbers <strong>of</strong> elements<br />

and nodes are 107632 and 115101, respectively. The ampere<br />

turns <strong>of</strong> the coil are set as 70000AT (60Hz). The CPU time<br />

and number <strong>of</strong> iteration <strong>of</strong> the Fixed-Point method (FPM),<br />

modified Fixed-Point method (MFPM), Newton-Raphson<br />

method using ν-B 2 curve (NRM(B 2 )), and Newton-Raphson<br />

method using B-H curve (NRM(B)) are compared. For<br />

simplicity, only the calculation <strong>of</strong> the 1st step <strong>of</strong> the step by<br />

step method for the nonlinear eddy current analysis is carried<br />

out in order to compare the performance <strong>of</strong> each method. As<br />

the total CPU time is almost equal to the multiple <strong>of</strong> number<br />

<strong>of</strong> steps, the comparison <strong>of</strong> only the 1st step is sufficient for<br />

the comparison <strong>of</strong> each method.<br />

IV. RESULTS AND DISCUSSION<br />

Fig.4 shows an example <strong>of</strong> distribution <strong>of</strong> flux density <strong>of</strong><br />

NRM(B) and MFPM. The results <strong>of</strong> NRM(B 2 ) and FPM are<br />

also the same as Fig.4. The comparison <strong>of</strong> the CPU time and<br />

the number <strong>of</strong> iterations are shown in Table I. The<br />

convergence property is shown in Fig.5. The convergence<br />

criterion is B(A) < 2.010 -3 . The convergence criterion<br />

( )<br />

G n<br />

/ G<br />

( 0)<br />

<strong>of</strong> the ICCG method is chosen as less than 10 -5 .<br />

Intel Core2 Duo E8400@ 3.16GHz, 3GB RAM is used. These<br />

results suggest that the convergence property <strong>of</strong> MFPM is near<br />

to that <strong>of</strong> NRM. Especially, MFPM is faster than NRM (B). It<br />

is also clarified that NRM(B 2 ) is faster than NRM(B).<br />

fire-resistant material<br />

billet<br />

y<br />

150<br />

z<br />

x<br />

unit:mm<br />

200<br />

100 100<br />

<br />

yoke<br />

15 15 2510 2510 50 50<br />

adiabator<br />

billet<br />

iron core<br />

<br />

V. CONCLUSIONS<br />

z<br />

x<br />

unit:mm<br />

The obtained results can be summarized as follows:<br />

(a) The formulation <strong>of</strong> the modified Fixed-Point method<br />

y<br />

10<br />

200<br />

25<br />

200<br />

300<br />

15<br />

50<br />

(a) (b)<br />

Fig. 3 Analyzed model <strong>of</strong> billet heater. (a) bird’s eye biew (1/8 region). (b) xy<br />

plane.<br />

- 87 - 15th IGTE Symposium 2012<br />

148<br />

TABLE I<br />

COMPARISON OF CPU TIME AND ITERATIONS<br />

Method CPU Time (sec) Iterations<br />

NRM(B2) 370.08 13<br />

NEM(B) 654.83 28<br />

FPM 3432.24 101<br />

MFPM 781.69 22<br />

PC performance : Intel Core2 Duo E8400@ 3.16GHz, 3GB RAM<br />

Flux density B[T]<br />

3.16<br />

3.24<br />

2.88<br />

2.52<br />

2.16<br />

1.80<br />

1.44<br />

1.08<br />

0.72<br />

0.36<br />

0.00<br />

y<br />

x<br />

(a) (b)<br />

Fig.4 Comparison <strong>of</strong> numerical results <strong>of</strong> Flux distribution using NRM(B)<br />

and MFPM. (a) NRM(B). (b) MFPM.<br />

Number <strong>of</strong> nonconverged<br />

elements<br />

<br />

<br />

<br />

<br />

<br />

<br />

Fig. 5 Convergence Property.<br />

NRM(B2 NRM(B )<br />

NRM(B)<br />

FPM<br />

MFPM<br />

2 )<br />

NRM(B)<br />

FPM<br />

MFPM<br />

<br />

<br />

Iterations<br />

(MFPM) using the derivative <strong>of</strong> reluctivity is almost the<br />

same as that <strong>of</strong> the Newton-Raphson method (NRM).<br />

(b) The modified Fixed-Point method (MFPM) has an<br />

advantage that the CPU time is less than that <strong>of</strong> the<br />

Newton-Raphson method (NRM) in some condition, or<br />

MFPM has almost the same performance as NRM.<br />

Moreover,the programming is easy compared with NRM. <br />

[1]<br />

REFERENCES<br />

F.I.Hantila, G.Preda, and M.Vasiliu : “Polarization method for static<br />

fields”, IEEE Trans. Magn., vol.36, no.4, pp.672-675, 2000.<br />

[2] M.Chiampi, D.Chiarabaglio, and M.Repetto: “A Jiles-Atherton and<br />

fixed-point combined technique for time periodic magnetic field<br />

problems with hysteresis” , IEEE Trans. Magn., vol.31, no.6, pp.4306-<br />

4311, 1995.<br />

[3] D.Miyagi, K.Shimomura, N. Takahashi, H. Kaimori: “Usefulness <strong>of</strong><br />

fixed point method in electromagnetic field analysis in consideration <strong>of</strong><br />

nonlinear magnetic anisotropy”, Digest <strong>of</strong> IEEE CEFC, 2012.<br />

[4] S.Urata, M.Enokizono, T.Todaka, and H.Shimoji: “Magnetic<br />

[5]<br />

characteristic analysis <strong>of</strong> the motor considering 2-D vector magnetic<br />

property”, IEEE Trans. Magn., vol.42, no.4, pp.615-618, 2006.<br />

K.Fujiwara, T.Adachi, and N.Takahashi: “A proposal <strong>of</strong> finite-element<br />

analysis considering two-dimension magnetic properties”, IEEE Trans.<br />

Magn., vol.38, no.2, pp.889-892, 2002.<br />

[6] T.Nakata, N.Takahashi, K.Fujiwara, and N.Okamoto: “Improvements <strong>of</strong><br />

convergence characteristics <strong>of</strong> Newton-Raphson method for nonlinear<br />

4


magnetic field analysis”, IEEE Trans. Magn., vol.28, no.2, pp.1048-<br />

1051, 1992.<br />

[7] E.Dlala, A.Belahcen, and A.Arkkio : “Locally convergent fixed-point<br />

method for solving time-stepping nonlinear field problems”, IEEE Trans.<br />

Magn., vol.43, no.11, pp.3969-3975, 2007.<br />

[8] E.Dlala, A.Belahcen, and A.Arkkio : “A fast fixed-point method for<br />

solving magnetic field problems in media <strong>of</strong> hysteresis”, IEEE Trans.<br />

Magn., vol.44, no.6, pp. 1214 -1217, 2008.<br />

[9] N.Takahashi, S.Nakazaki, D.Miyagi, N.Uchida, K.Kawanaka, and<br />

H.Namba: “3-D optimal design <strong>of</strong> laminated yoke <strong>of</strong> billet heater for<br />

rolling wire rod using ON/OFF method”, Archives <strong>of</strong> Electrical<br />

Engineering, vol.61, no.1, pp.115-123, 2012.<br />

- 88 - 15th IGTE Symposium 2012<br />

5


- 89 - 15th IGTE Symposium 2012<br />

S<strong>of</strong>tware Agent Based Domain Decomposition Method<br />

1) M. Jüttner, 1) A. Buchau, 2) M. Rauscher, 1) W. M. Rucker, and 2) P. Göhner<br />

1) Institute for Theory <strong>of</strong> Electrical Engineering, Pfaffenwaldring 47, D-70569 Stuttgart, Germany<br />

2) Institute <strong>of</strong> Industrial Automation and S<strong>of</strong>tware Engineering, Pfaffenwaldring 47, D-70569 Stuttgart, Germany<br />

E-mail: ite@ite.uni-stuttgart.de<br />

Abstract—A workbench is described, able to divide complex coupled three dimensional multiphysics simulations into<br />

smaller parts based on existing domain decomposition techniques. These parts are calculated by s<strong>of</strong>tware agents allowing to<br />

widely distributes the calculation over multiple distributed computers and even into the cloud to speed up the performance,<br />

to make larger simulations possible and to actively manipulate and control the strategy and the process <strong>of</strong> solving.<br />

Index Terms— cloud computing, coupled multiphysics problems, domain decomposition, s<strong>of</strong>tware agents<br />

different resources like idle workstations, laptops or<br />

smartphones and even online resources located within the<br />

cloud. These resources can be used by established and<br />

multifunctional solving methods like FEM or BEM. FEM<br />

is able to solve non-linear and anisotropic material effects<br />

and lead to large systems <strong>of</strong> non-linear equations. An<br />

alternative to the FEM is the BEM. At BEM only the<br />

surface <strong>of</strong> a model needs to be discretised. This leads to a<br />

much smaller system <strong>of</strong> equations represented in a dense<br />

matrix. The calculation time for the matrix gets<br />

acceptable if we use matrix compression. Therefore the<br />

fast multipole method (FMM) and the adaptive cross<br />

approximation (ACA) can be used [7], allowing to<br />

calculate hysteresis effects for magnetic fields [8].<br />

I. INTRODUCTION<br />

Finding a solution for complex three dimensional<br />

coupled field problems more efficiently is the goal <strong>of</strong> this<br />

approach. Therefore, established methods for numerical<br />

solutions like finite element methods (FEM) and<br />

boundary element methods (BEM) are combined with the<br />

idea <strong>of</strong> s<strong>of</strong>tware agents.<br />

S<strong>of</strong>tware agents are a way to develop flexible and<br />

efficient s<strong>of</strong>tware based on the concept <strong>of</strong> agent oriented<br />

s<strong>of</strong>tware development [1]. Therefore, the system is<br />

divided into autonomous and self-organized agents.<br />

These agents are independent <strong>of</strong> each other and capable<br />

to make decisions within there possibilities. Therefore<br />

they are able to interact with each other via messages and<br />

data exchange. Based on this the agents negotiate to reach<br />

their individual goals. The communication between the<br />

agents also allows a dynamic handling <strong>of</strong> multiple<br />

situations per agent as well as for the global system to<br />

grant dynamic and well fitting agent behaviour. Within<br />

the context <strong>of</strong> agent based systems a systematic<br />

distribution <strong>of</strong> the functions, necessary for solving a<br />

problem, to different agents grant a limited coupling<br />

between different agents and results into an even more<br />

flexible and manageable system. This flexibility and<br />

dynamic allow the approach described in section II to<br />

perfectly handle systems with multiple boundary<br />

conditions and to solve weak coupled systems. The<br />

approach <strong>of</strong> s<strong>of</strong>tware agents is currently well established<br />

in automation technology and used for example for selfmanagement<br />

in automation systems [2], modelling smart<br />

grids [3], prioritization <strong>of</strong> test cases [4] or optimising<br />

electromagnetic field problems [5].<br />

Because <strong>of</strong> the big influence <strong>of</strong> available computer<br />

resources for solving numerical problems nowadays<br />

workstations including multiple multicore CPUs and a<br />

relatively large RAM could be used. To handle these<br />

resources modern programming languages are available<br />

and grant a quite good usage <strong>of</strong> all <strong>of</strong> these resources.<br />

Based on the increase <strong>of</strong> calculation power the problems<br />

getting larger and coupled effects are considered as well.<br />

Nowadays large simulation problems are mostly solved<br />

on huge computer clusters with identical computers. The<br />

usage <strong>of</strong> temporary available resources grant, due to<br />

modern operation systems a large performance alternative<br />

[6]. Products like the Micros<strong>of</strong>t High Performance<br />

Computing Server or the Enterprise Linux Cluster from<br />

Redhat or Suse <strong>of</strong>fer a simple way to spread tasks to<br />

II. COUPLING SOFTWARE AGENTS AND SOLVER<br />

Considering a large coupled simulation, the creation <strong>of</strong><br />

equations including all effects is mostly not reasonable.<br />

Splitting the problem into smaller parts that can be solved<br />

iteratively can reduce the total expense <strong>of</strong> the large nonlinear<br />

problem [9], especially when small changes in the<br />

partial problems can be ignored and do not lead to further<br />

iterations <strong>of</strong> the calculation. Therefore a so called<br />

coordination agent is created. This agent splits the<br />

coupled problem into partial problems. Examples for<br />

different classes <strong>of</strong> partial problems are different single<br />

physic problems as well as geometry or material based<br />

partial problems. All functions <strong>of</strong> the coordination agent<br />

are described in section II.B. Then, the partial problems<br />

are assigned to different calculation agents. Fig. 1<br />

describes the cooperation <strong>of</strong> s<strong>of</strong>tware agents.<br />

Fig. 1: Concept <strong>of</strong> cooperation agents


Each calculation agent solves its problem with an<br />

optimized approach for its partial problem. Due to that a<br />

combination <strong>of</strong> multiple methods like FEM or BEM for<br />

different partial problems are possible. This allows a<br />

combination <strong>of</strong> the advantages <strong>of</strong> FEM and BEM for<br />

multiple different calculation resources. For the<br />

calculation agents there is no need to be within one<br />

system. Different resources can be used if the calculation<br />

agents are distributed to multiple computers or even the<br />

cloud as displayed in Fig. 2. The calculation agents are<br />

described in detail in section II.C. The collection <strong>of</strong> these<br />

s<strong>of</strong>tware agents is able to solve coupled problems. The<br />

necessity <strong>of</strong> this new approach handling multiple physics<br />

and large systems can be seen in [10]. The simulation was<br />

only possible with height effort to reach convergence.<br />

<br />

Fig. 2: Distributed Agents<br />

A. Steps to a successful simulation<br />

Setting the approach into its context and describing the<br />

process <strong>of</strong> creating and solving a complex coupled<br />

problem with this approach is topic <strong>of</strong> this subsection.<br />

The process is visualized in Fig. 3.<br />

Build a finer<br />

mesh<br />

Modelling the system with a FEM-s<strong>of</strong>tware<br />

Including mesh and boundary conditions<br />

Export mesh and boundary conditions<br />

Divide mesh into smaller parts<br />

Mapping the boundary conditions<br />

Mesh-management within the agent-system<br />

Calculation Agent 1<br />

Solve partial problem<br />

Parallel and independent<br />

Exchange <strong>of</strong> boundary<br />

conditions<br />

Exchange <strong>of</strong> status<br />

Exchange <strong>of</strong> results<br />

no<br />

status,<br />

boundary<br />

conditions<br />

convergence<br />

yes<br />

Combine solutions<br />

Calculation<br />

Agent<br />

2<br />

...<br />

conceivable<br />

Calculation<br />

Agent<br />

n<br />

Fig. 3: The process for a simulation<br />

- Initially a model estimating the actual problem<br />

needs to be created. This approach does not set any<br />

special requirements to the model itself. So the model can<br />

be created with commonly used CAD-s<strong>of</strong>tware tools. The<br />

same holds for the creation <strong>of</strong> the mesh <strong>of</strong> the model, so<br />

common meshing-tools can be used. For complex<br />

coupled problems it is important to consider all effects<br />

within one single mesh because the calculation is<br />

influenced by the geometry <strong>of</strong> all physics as well as their<br />

coupling.<br />

- To allow any solver to create a suitable solution, the<br />

boundary conditions for the given mesh including all<br />

coupled physics has to be set. The boundary conditions as<br />

well as the previously created mesh have to be exported<br />

in a way it can be handed over to the coordination agent.<br />

- 90 - 15th IGTE Symposium 2012<br />

- The mesh and the boundary conditions are now<br />

handed to a coordination agent. The coordination agent is<br />

responsible for finding a solution for the given problem.<br />

Therefore it splits the problem into smaller parts. The<br />

quantity <strong>of</strong> smaller parts depends on the number <strong>of</strong><br />

available calculation agents or is defined manually. In<br />

case <strong>of</strong> a resource based splitting the assignment <strong>of</strong> the<br />

partial problems to the different calculation agents is<br />

intuitive. In case <strong>of</strong> a manual quantity <strong>of</strong> partial problems<br />

each partial problem is solved iteratively according to<br />

availability <strong>of</strong> resources. The partial problems are now<br />

distributed to the available calculation agents.<br />

- The calculation agents now start solving their<br />

allocated partial problem. Because <strong>of</strong> each calculation<br />

agent running on its own hardware the agent is able to use<br />

all resources <strong>of</strong> this hardware to solve its partial problem<br />

fast and highly parallel. If a new calculation agent,<br />

including new hardware resources, appears in the system<br />

the coordination agents has to determine if the resource<br />

should used and which agent splits its partial problem. In<br />

case <strong>of</strong> a drop out <strong>of</strong> a calculation agent the coordination<br />

agent needs to attach the partial problem to another<br />

calculation agent. This behaviour allows a dynamic<br />

adaption <strong>of</strong> the system. To do so, information based on<br />

status <strong>of</strong> different calculation agents has to be distributed.<br />

- To allow the system <strong>of</strong> agents to solve a coupled<br />

problem, the calculation agents has to exchange results<br />

between each other as soon as they are available. If a<br />

calculation agent is able to understand and interpret the<br />

results <strong>of</strong> another calculation agent, new boundary<br />

conditions can be derived and integrated into the own<br />

calculation. This process continues until all calculation<br />

agents finished. If some partial problems do not reach<br />

convergence the calculation can be interrupted and<br />

reinitialised with a new set <strong>of</strong> partial problems without<br />

recalculating successfully solved parts.<br />

- Finally the coordination agent combines all results<br />

depending on the way they were split before and returns<br />

the result to the user.<br />

An example describing the advantages <strong>of</strong> this approach<br />

is shown in section II.E.<br />

B. The Coordination Agent<br />

The coordination agent is an independent program with a<br />

small set <strong>of</strong> functions. It’s visible to the in- and outside<br />

and represents the interface between the users and the<br />

calculation agents. The graphical user interface (GUI)<br />

provides the interface to the user. The GUI is controls all<br />

functions described below. The internal interface is<br />

realised via a message system handling different types <strong>of</strong><br />

messages received from other agents. In addition to that<br />

the coordination agent <strong>of</strong>fers process variables like the<br />

convergence criteria and the overall progress, so each<br />

problem needs to be assigned to at least one coordination<br />

agent. The different functions <strong>of</strong> the coordination agent<br />

are summarized in Fig. 4 and described in detail in the<br />

following.<br />

- Via its GUI the coordination agent <strong>of</strong>fers the<br />

interface to load a problem. A second problem can only<br />

be loaded, if the first is solved and the results are either<br />

collected by the user or the actual calculations are<br />

interrupted and possible results are dropped. The GUI is<br />

additionally used to display calculated results.


- Solving a model only gets possible if calculation<br />

agents are available. These agents need to be able to solve<br />

all different classes <strong>of</strong> the actual problem. Therefore the<br />

coordination agent collects and manages information<br />

about all available calculation agents and their<br />

possibilities to solve problems. In this context economical<br />

aspects can also be considered within the process <strong>of</strong><br />

solving by the possibility to weight different agents. In<br />

case <strong>of</strong> multiple coordination agents working in the same<br />

surrounding it’s necessary to care about the status <strong>of</strong><br />

agents to avoid multiple tasks for the same agent.<br />

- To instruct a calculation agent to solve a partial<br />

problem, the partial problem must be created. In case <strong>of</strong> a<br />

simple problem and a single calculation agent able to<br />

solve the problem, the partial problem can be the problem<br />

and can directly be handed over to the calculation agent.<br />

In all other cases the problem must be split.<br />

An obvious splitting for weak coupled systems is based<br />

on the different types <strong>of</strong> physics. Further opportunities<br />

for splitting results out <strong>of</strong> the method <strong>of</strong> BEM-FEM<br />

coupling (combining the positive effects on both methods<br />

by calculating non-linear equations with FEM while the<br />

linear once are calculated with BEM). An approach<br />

splitting the different physics and considering BEM/FEM<br />

coupling is realised for electromagnetic field problems in<br />

[11]. There it was shown that iterative coupling <strong>of</strong> BEM<br />

and FEM results in an increase <strong>of</strong> convergence compared<br />

to a strong coupling for the different physics. So this<br />

segmentation based on the different type <strong>of</strong> equations and<br />

physics is used.<br />

Another way to decompose different domains is based<br />

on regions solvable with the same numerical method. The<br />

regions are usually segmented by borders <strong>of</strong> the different<br />

materials. This is especially useful for distributed<br />

calculation and for different discretisation size within one<br />

model. The idea as well as a domain decomposition based<br />

on the number <strong>of</strong> available resources is realised for FEM<br />

in FETI [12] and for BEM in BETI [13].<br />

Another idea is based on overlapping regions only<br />

considering Dirichlet boundary conditions [14]. This is <strong>of</strong><br />

special interest, if we take a look at the amount <strong>of</strong> data<br />

exchanged between different agents.<br />

All mentioned methods for domain decomposition<br />

have in common, that the decomposition has to be done<br />

before the actual solving is done. A flexible or dynamic<br />

adaption to results or partial solutions gets possible with<br />

this agent based approach. This rapidly increases the<br />

speed <strong>of</strong> convergence for a complex simulation. So this<br />

approach gets more flexible, more dynamic and more<br />

adapted to available resources compared to existing<br />

domain decomposition algorithms.<br />

- In a next step the partial problems need to be<br />

distributed to the different calculation agents. Therefore<br />

the coordination agent reserves required calculation<br />

agents. Further it shares all necessary information<br />

including the actual partial problem and initialises the<br />

solving process.<br />

If a calculation agent finishes, a notification is received<br />

by the coordination agent. The coordination agent then<br />

updates its progress variables and checks, if all other<br />

agents working on the same problem have finished. In<br />

- 91 - 15th IGTE Symposium 2012<br />

this case the overall solution is available. If other agents<br />

are still working, only a partial solution can be <strong>of</strong>fered. In<br />

case <strong>of</strong> no solution can be found a creation <strong>of</strong> a finer<br />

mesh and an initialisation <strong>of</strong> the splitting process are done<br />

in hope to find solvable partial problems. The user finally<br />

gets informed about these circumstances.<br />

Fig. 4: The Coordination agent<br />

C. The Calculation Agent<br />

Calculation agents are independent programs. At their<br />

start up all necessary parameters are set independent from<br />

a usage by a coordination agent. The functions <strong>of</strong> the<br />

calculation agent are summarized in Fig. 5 and described<br />

in detail in the following.<br />

- Before a calculation agent can <strong>of</strong>fer its service to a<br />

coordination agent, it has to complete its description.<br />

Therefore a unique name must be set for each calculation<br />

agent while it’s initialised. Also the status the agent is<br />

currently in and the problem classes the calculation agent<br />

is capable <strong>of</strong> solving needs to be specified as well. The<br />

problem classes the calculation agent is able to solve<br />

depend on the specific solver the calculation agent is<br />

connected to. To reach a flexible system and to allow<br />

calculation agents to connect to different solvers, a solver<br />

interface is created capable <strong>of</strong> handling all data and<br />

information connected to the solver. The solver interface<br />

is described in detail in section II.D. A important<br />

information is the commissioner <strong>of</strong> the actual tasks the<br />

calculation agent is working for. Therefore the name <strong>of</strong><br />

the coordination agent is stored within each agent to send<br />

notifications to the commissioner if this gets necessary.<br />

- To manage the solver interface and to satisfy its<br />

needs is the major task for a running calculation agent.<br />

Therefore the agent provides all information requested by<br />

a solver and pass them onto the solver interface.<br />

Examples are the initial boundary conditions that are<br />

received from the coordination agent and the tolerances<br />

for the solver.<br />

The calculation agent also has to make sure, when ever<br />

another calculation agent reports an available result, the<br />

calculation agent has to check whether the result does<br />

influence the own calculation or not. In case <strong>of</strong> an<br />

influence, a re-initialisation <strong>of</strong> the solver process is<br />

necessary. This includes a stop <strong>of</strong> the actual solver<br />

process, an update <strong>of</strong> the boundary conditions after<br />

calculation agent has received the result from the other<br />

calculation agent and a start <strong>of</strong> the new initialized solver.<br />

In case <strong>of</strong> a successful calculated partial solution the<br />

calculation agent notify all calculation agents connected<br />

to the problem about this result and distribute the result<br />

about the new boundary conditions if they are requested.<br />

Also the coordination agent needs to be informed about


the successful calculation and the availability <strong>of</strong> the<br />

results. In case <strong>of</strong> a failure or a not converging solving<br />

algorithm chosen by the calculation agent, the<br />

coordination agent also has to be informed. The same<br />

holds for a drop <strong>of</strong> available solver resources.<br />

- Whenever a calculation agent is started a GUI<br />

provided by each calculation agent is displayed. This GUI<br />

allows setting the calculation agent parameters as well as<br />

connecting it to a solver interface includes setting<br />

necessary parameters therefor. Examples for these<br />

parameters are the host the solver is running on, the port<br />

this host allows to establish a connection and the problem<br />

classes that could be solved with the given solver. All<br />

functions the GUI provides are needed whenever the<br />

coordination agent instructs a calculation agent to solve a<br />

problem. Therefore the GUI provides functions like<br />

loading a model, starting the calculation, extracting the<br />

results and a possibility to cancel the actual solving<br />

process. These functions can also being used without a<br />

connection to a coordination agent. In this case the<br />

calculation agent solves simple problems on its own.<br />

- To track the actual process <strong>of</strong> solving a partial<br />

problem and to understand the behaviour <strong>of</strong> a calculation<br />

agent each calculation agent <strong>of</strong>fers a separate function <strong>of</strong><br />

writing a log file and displaying it within the GUI.<br />

To grant the availability for the coordination agent during<br />

the complete process <strong>of</strong> solving and to allow the<br />

calculation agent to react flexible to information from<br />

other agents at least two threads are created within the<br />

calculation agent. The first thread represents the<br />

functionality <strong>of</strong> the calculation agent. Further threads are<br />

used by the solver and its interface to solve the partial<br />

problem. Only the realisation with multiple threads<br />

allows the calculation agent to handle requests form the<br />

coordination agent as well as checking for changes in the<br />

boundary setting and interrupt in case <strong>of</strong> necessity while<br />

the solver is calculating a solution for the problem. A<br />

quick and efficient message exchange allows sending and<br />

receiving as well as processing messages with very little<br />

delays is another important part to grant the flexibility <strong>of</strong><br />

the system. Consequences for the solver <strong>of</strong> received<br />

messages are handled by the solver interface.<br />

Fig. 5: The Calculation Agent<br />

D. The Solver Interface<br />

The solver interface is part <strong>of</strong> the calculation agent. It is<br />

the bridge between the calculation agent and a solver. It<br />

allows the calculation agent to solve at least on problem<br />

class. The functions <strong>of</strong> the solver interface are described<br />

in the following and summarize Fig. 6.<br />

- To establish a connection between the calculation<br />

agent and different solvers the solver interface can be<br />

understood as a collection <strong>of</strong> libraries controlling a<br />

variety <strong>of</strong> solvers. While starting the calculation agent the<br />

specific type <strong>of</strong> solver must be selected and parameters<br />

- 92 - 15th IGTE Symposium 2012<br />

for the reachability <strong>of</strong> the solver must be set. Examples<br />

are the host name and the network port the solver can be<br />

reached or the local running solver application. In case <strong>of</strong><br />

multiple solver interfaces managed by a single calculation<br />

agent it gets possible to create calculation agents with the<br />

possibility to solve multiclass problems.<br />

- If a connection to a solver is established the<br />

necessary parameters need to be set. Therefore the partial<br />

problem received by the calculation agent must be<br />

translated to a form the solver does understand and<br />

passed to the solver.<br />

- In the next step the solver interface starts the solver.<br />

- While the solver is calculating the major task <strong>of</strong> the<br />

solver interface is to control the solver and manage its<br />

output. This includes monitoring all information created<br />

by the solver as well as interrupting the solver for<br />

checking possible changes due to results <strong>of</strong> other agents.<br />

Additional information like the convergence behaviour <strong>of</strong><br />

the actual partial problem, the availability <strong>of</strong> the solver,<br />

its resources and a guess for the remaining calculation<br />

time are collected by the solver interface and passed to<br />

the calculation agent. The analyses <strong>of</strong> the information<br />

allow a quick reaction from the calculation agent and also<br />

the coordination agent to any changes <strong>of</strong> the system. An<br />

example is a temporary unavailable solver. Due to that<br />

act the calculation agent gets temporary unable to solve<br />

problems and has to disconnect itself from the<br />

coordination agent and the problem. Then the<br />

coordination agent has to find an alternative for the<br />

calculation agent to successfully solve the problem.<br />

Another example concerns the possibility <strong>of</strong> convergence.<br />

If the calculation agent recognizes a convergence is<br />

unlikely, the calculation agent has to reconsider the<br />

chosen form <strong>of</strong> the solver or in the worst case a message<br />

has to be sent to the coordination agent to replace the<br />

actual splitting by a different on.<br />

- After a successful calculation the solver interface<br />

notifies the calculation agent to inform other agents about<br />

the available result. All information connected to the<br />

message exchange between different agents is handled by<br />

the calculation agent. The solver interface only takes care<br />

about the information directly connected to the solver.<br />

If the result is requested, the solver interface extracts the<br />

result and translates it into a form that can be shared with<br />

other agents. In case the result or the solver is no longer<br />

needed the solver interface detach the connection to the<br />

solver, release reserved resources and initialise the<br />

deregistering process <strong>of</strong> the calculation agent.<br />

Fig. 6: The solver interface<br />

E. Processing Details<br />

The way a problem is solved do significantly depend<br />

on the timing <strong>of</strong> the agents finishing their calculations<br />

and informing others agents about their results. So in<br />

coupled systems not every effect has the same meaning at<br />

each moment within the process <strong>of</strong> solving. The time to


calculate a partial result for two physics depends for an<br />

identic mesh on the linearity <strong>of</strong> the materials for the<br />

different physics as well as on the resources each agent is<br />

able to use. Therefore the dependent partial results are<br />

<strong>of</strong>fered at a different time and the timing issue to the<br />

global system needs to be taken special care <strong>of</strong>.<br />

As an example the temperature at a circuit board after a<br />

certain time should be calculated like it’s shown in Fig. 7.<br />

The system consists <strong>of</strong> one coordination agent and two<br />

calculation agents. The first calculation agent calculates<br />

the electric field and as a side effect, the resistive losses.<br />

The second agent takes care <strong>of</strong> the calculation <strong>of</strong> the heat<br />

conduction from the transistor. In the described approach<br />

the recalculation <strong>of</strong> the overall temperature simulation<br />

will automatically be initialised if the result <strong>of</strong> the electric<br />

simulation including the resistive losses is present and do<br />

significantly change the result <strong>of</strong> the temperature<br />

simulation. Because <strong>of</strong> the parallel calculation <strong>of</strong> the<br />

circuit board it’s only necessary to recalculate some<br />

values <strong>of</strong> the matrix. Fig. 8 and Fig. 9 show two different<br />

scenarios for handling the different calculation times.<br />

Fig. 7: Coupled Problem<br />

In Fig. 8 a calculation procedure is assumed where the<br />

calculation agent responsible for solving the electric field<br />

problem has finished its calculation first. In that case the<br />

calculation agent responsible for the temperature has to<br />

check whether the heat radiated from the electric current<br />

does significantly change the own result or can be<br />

ignored. In the example the heat has to be considered.<br />

Therefore the temperature calculation agent has to update<br />

its calculation based on the result <strong>of</strong> the electric<br />

calculation agent. Therefore it adapts its boundary<br />

conditions to the result and recalculates again. If the<br />

temperature calculation agent also finishes and in case <strong>of</strong><br />

no more calculation agents working on the problem the<br />

- 93 - 15th IGTE Symposium 2012<br />

coordination agent requests all calculated partial results<br />

and combines them to a single result that is finally<br />

<strong>of</strong>fered to the user.<br />

Fig. 8: Solution case I<br />

In Fig. 9 the opposite case is considered. The<br />

calculation agent responsible for solving the temperature<br />

problem finishes first. In this simplified case the<br />

temperature does not have any influence on the electric<br />

conductivity so the electric calculation agent responses a<br />

“not acknowledge” (NACK) to the given information<br />

about a result. This NACK means that there is no<br />

influence expected from the partial result <strong>of</strong> the<br />

temperature calculation agent to the electric calculation<br />

agent. Then the electric calculation agent passes on and<br />

finishes its calculation regularly. If the temperaturecalculation<br />

agent obtains the result <strong>of</strong> the electriccalculation<br />

agent it checks its result and recalculates it as<br />

in the previous case. The following continues equally.<br />

Fig. 9: Solution case II


III. IMPLEMENTATION ENVIRONMENT<br />

A. The Agent System<br />

The actual implementation is using the Java Agent<br />

Development Framework (JADE) as a middle-ware to<br />

implement the mentioned agents. JADE is a Java agent<br />

based development framework. It is distributed by<br />

Telecom Italia and currently available under Lesser<br />

General Public License Version 2 (LGPLv2) with the<br />

latest version 4.2.0 and a release on 26 th June 2012. The<br />

implementation in Java, grants a system and operating<br />

independent environment for usage. The minimal system<br />

requirement is a running Java 1.4 runtime-environment<br />

available for mostly every system and even smartphones.<br />

The agent communication is based on a protocol<br />

containing seven layers that ensure the correct<br />

transmission and reception <strong>of</strong> message from different<br />

types. The complete system is thereby based on a<br />

standard <strong>of</strong>fered by the “Foundation for Intelligent<br />

Physical Agents” (FIPA) that was inherited by IEEE in<br />

2005<br />

B. The Solver<br />

Each calculation agent can connect itself to two different<br />

solvers. As a FEM-s<strong>of</strong>tware, COMSOL Multiphysics<br />

[15] <strong>of</strong>fers a bench <strong>of</strong> modern algorithms able to solve<br />

coupled problems. It also includes the creation <strong>of</strong> a mesh<br />

with multiple mesh types. In this context it is quite useful<br />

that all elements available in the s<strong>of</strong>tware can be reached<br />

via the Java-API COMSOL Multiphysics <strong>of</strong>fers. It is also<br />

possible to run the s<strong>of</strong>tware as a server and connect to the<br />

server via an <strong>of</strong>fered jar library for Java-programs. The<br />

library also passes results that can be visualised within a<br />

GUI. The prototype <strong>of</strong> the GUI for the calculation agent<br />

is based on the example for the usage <strong>of</strong> the COMSOL<br />

API. Within the API the GUI and the solver are already<br />

realised in parallel threats and notifications are send when<br />

a task starts or finishes. This helps initialising further<br />

events. Necessary functions to control the solver and its<br />

behaviour are also implemented. As a BEM-s<strong>of</strong>tware the<br />

calculation agents are prepared to connect to FAMU [8].<br />

IV. CONCLUSION<br />

This approach will efficiently solve highly complex<br />

three dimensional coupled field problems based on the<br />

idea <strong>of</strong> s<strong>of</strong>tware agents spread onto multiple distributed<br />

computers including the cloud. The distributed computers<br />

run a so called calculation agent, able to solve smaller<br />

problems. The total expenditure for finding a solution is<br />

reduced by the creation <strong>of</strong> multiple smaller equation<br />

systems. The complex coupled problem is split by a<br />

variation <strong>of</strong> already established domain decomposition<br />

methods into these smaller problems. The domain<br />

decomposition used, is based on different physics and<br />

different material properties as well as geometrical<br />

aspects. Every partial problem is then assigned to a<br />

calculation agent and handled for its own. Finding a<br />

solution <strong>of</strong> a coupled problem only gets possible by the<br />

communication and negotiation between the different<br />

agents. The communication also allows to dynamically<br />

adapt the system to new surrounding and to find<br />

convergence in coupled systems<br />

- 94 - 15th IGTE Symposium 2012<br />

An additional approach to reduce the effort for finding<br />

a solution is reached by the independent decision agents<br />

are allowed to take. This concerns especially the way the<br />

calculation agents solving the given partial problem. This<br />

includes the decision for a numerical method like FEM or<br />

BEM and the reaction on changed boundary conditions.<br />

Meaning, slightly modified boundary conditions are<br />

skipped for the partial result if no change in the result is<br />

expected instead <strong>of</strong> initialisation a new calculation cycle.<br />

The domain decomposition and the coordination <strong>of</strong> the<br />

interworking <strong>of</strong> different calculation agents are handled<br />

by a so called coordination agent. In this paper the<br />

realisation <strong>of</strong> a calculation agent as well as the realisation<br />

<strong>of</strong> a coordination agent with their different functions and<br />

their interworking is described.<br />

REFERENCES<br />

[1] N. Jennings, "Agent-Oriented S<strong>of</strong>tware Engineering," in Multi-<br />

Agent System Engineering, Berlin, Springer, 1999, pp. 1-7.<br />

[2] H. Mubarak and P. Göhner, "An agent-oriented approach for selfmanagement<br />

<strong>of</strong> industrial automation systems," 8th International<br />

Conference on Industrial Informatics, pp. 721-726, 2010.<br />

[3] M. Pipattanasomporn, H. Feroze and S. Rahman, "Multi-agent<br />

systems in a distributed smart grid: Design and implementation,"<br />

Power Systems Conference and Exposition, pp. 1-8, 2009.<br />

[4] C. Malz, N. Jazdi and P. Göhner, "Prioritization <strong>of</strong> Test Cases<br />

Using S<strong>of</strong>tware Agents and Fuzzy Logic," 5th Conference on<br />

S<strong>of</strong>tware Testing, Verification and Validation, pp. 483-486, 2012.<br />

[5] D. G. Lymperopoulos, N. L. Tsitsas and D. I. Kaklamani, "A<br />

Distributed Intelligent Agent Platform for Genetic Optimization in<br />

CEM: Applications in a Quasi-Point Matching Method,"<br />

Transactions on Antennas and Propagation, vol. 55, no. 3, pp.<br />

619-628, 2007.<br />

[6] A. Buchau, S. M. Tsafak, W. Hafla and W. M. Rucker,<br />

"Parallelization <strong>of</strong> a Fast Multipole Boundary Element Method<br />

with Cluster OpenMP," Transactions on Magnetics, vol. 44, no. 6,<br />

pp. 1338-1341, 2008.<br />

[7] A. Buchau, W. M. Rucker, O. Rain, V. Rischmuller, S. Kurz and<br />

S. Rjasanow, "Comparison between different approaches for fast<br />

and efficient 3-D BEM computations," Transactions on Magnetics,<br />

vol. 39, no. 3, pp. 1107- 1110, 2003.<br />

[8] A. Buchau, W. Hafla, F. Groh and W. M. Rucker, "Fast multipole<br />

method based solution <strong>of</strong> electrostatic and magnetostatic field<br />

problems," Computing and Visualization in Science, vol. 8, no. 3,<br />

pp. 137-144, 2005.<br />

[9] V. Rischmuller, S. Kurz and W. M. Rucker, "Parallelization <strong>of</strong><br />

coupled differential and integral methods using domain<br />

decomposition," Transactions on Magnetics, vol. 38, no. 2, pp.<br />

981-984, 2002.<br />

[10] P. Alotto, M. Guarnieri and F. Moro, "A Fully Coupled Three-<br />

Dimensional Dynamic Model <strong>of</strong> Polymeric Membranes for Fuel<br />

Cells," Transactions on Magnetics, vol. 46, no. 8, pp. 3257-3260,<br />

2010.<br />

[11] J. Albert, R. Banucu, W. Hafla and W. M. Rucker, "Simulation<br />

Based Development <strong>of</strong> a Valve Actuator for Alternative Drives<br />

Using BEM-FEM Code," Transactions on Magnetics, vol. 45, no.<br />

3, pp. 1744-1777, 2009.<br />

[12] C. Farhat and F.-X. Roux, "A method <strong>of</strong> finite element tearing and<br />

interconnecting and its parallel solution algorithm," International<br />

Journal for Numerical Methods in Engineering, vol. 32, no. 6, pp.<br />

1205-1227, 1991.<br />

[13] U. Langer and O. Steinbach, "Boundary Element Tearing and<br />

Interconnecting Methods," Computing, vol. 71, no. 3, pp. 205-228,<br />

2003.<br />

[14] D. Lavers, I. Boglaev and V. Sirotkin, "Numerical solution <strong>of</strong><br />

transient 2-D eddy current problem by domain decomposition<br />

algorithms," Transactions on Magnetics, vol. 32, no. 3, pp. 1413-<br />

1416, 1996.<br />

[15] COMSOL AB, Tegnérgatan 23, SE-111 40 Stockholm.


- 95 - 15th IGTE Symposium 2012


- 96 - 15th IGTE Symposium 2012


- 97 - 15th IGTE Symposium 2012


- 98 - 15th IGTE Symposium 2012


- 99 - 15th IGTE Symposium 2012


- 100 - 15th IGTE Symposium 2012


- 101 - 15th IGTE Symposium 2012<br />

Human exposure to the magnetic field<br />

produced by MFDC spot welding systems<br />

D. Bavastro ∗ , A. Canova ∗ , L. Giaccone ∗ , M. Manca ∗ , M. Simioli †<br />

∗ Politecnico di Torino - Dipartimento Energia, C.so Duca degli abruzzi 24, 10129 Torino, Italy<br />

† KGR S.p.a, via Nicolao Cena 65, 10032 Brandizzo, Italy<br />

E-mail: luca.giaccone@polito.it<br />

Abstract—In this paper the magnetic field emission <strong>of</strong> Medium Frequency Direct Current (MFDC) spot<br />

welding system is analyzed with reference to the exposure <strong>of</strong> working population. In the first part <strong>of</strong> the<br />

paper experimental measurements have been carried out in order to get the magnetic field emission <strong>of</strong><br />

a selected MFDC system. The measurement is performed in time domain acquiring the waveform <strong>of</strong> the<br />

magnetic field. In the second part <strong>of</strong> the paper the methodologies suggested by the ICNIRP guidelines have<br />

been adopted for the analysis <strong>of</strong> the waveforms: the equivalent frequency method, the multiple frequency method<br />

and the weighted multiple frequency method. An exhaustive comparison <strong>of</strong> the possible methodology suggested<br />

by the guidelines is given and contextualized in the regulatory framework. The emission <strong>of</strong> MFDC spot<br />

welding system has been completely characterized. By means <strong>of</strong> the spectral analysis it is found that the<br />

overcoming <strong>of</strong> the limit is mainly due to the 2000 Hz and 4000 Hz components. This result is useful for<br />

manufacturers because, in order to minimize the overall emission, it is possible to think about a mitigation<br />

system that encloses only the related internal components.<br />

Index Terms—pulsed magnetic fields, quasi-static magnetic fields, spot welding, MFDC<br />

I. INTRODUCTION<br />

Protection <strong>of</strong> the working population against<br />

the possible effects <strong>of</strong> extremely low frequency<br />

(ELF) electromagnetic fields is a concern <strong>of</strong> the<br />

European Community, which has published 2004<br />

Directive 2004/40/EC [1]. The Directive refers to<br />

the risk to the health and safety <strong>of</strong> workers due<br />

to known short-term adverse effects in the human<br />

body caused by the circulation <strong>of</strong> induced currents,<br />

by energy absorption, and contact currents. One <strong>of</strong><br />

the most important points stated in the Directive is<br />

the rationale <strong>of</strong> exposure at low frequency which<br />

is defined in accordance with ICNIRP 1998 guidelines<br />

[2]. These Guidelines report that in the ELF<br />

frequency range, the risk to the health and safety <strong>of</strong><br />

workers is due to known short-term adverse effects<br />

caused by the circulation <strong>of</strong> induced currents in the<br />

human body.<br />

INCIRP guidelines and directive 2004/40/EC<br />

provide a definition <strong>of</strong> reference or action values<br />

(i.e., values which can be directly measured like<br />

magnetic flux density) and exposure limit values<br />

(i.e., limit which are based directly on established<br />

health effects and biological considerations like<br />

current density).<br />

In this paper the exposure to the magnetic field<br />

produced by Medium Frequency Direct Current<br />

(MFCD) spot welding devices is analyzed. For this<br />

application the magnetic field waveform is pulsed<br />

and non-sinusoidal. While continuous wave mode<br />

<strong>of</strong> exposure is strictly defined in ICNIRP guidelines,<br />

the evaluation for pulsed or non-sinusoidal<br />

magnetic field waveforms is still an open question<br />

[3], [4], [5], [6], [7], [8], [9]. In the ICNIRP guidelines<br />

(year 1998), the problem <strong>of</strong> non-sinusoidal<br />

waveforms was tackled by means <strong>of</strong> superposition<br />

<strong>of</strong> harmonic values. This approach, even if possible,<br />

has been highly criticized afterwards because<br />

<strong>of</strong> an excessive conservative estimates <strong>of</strong> exposure<br />

levels. Due to the increasing importance <strong>of</strong> nonsinusoidal<br />

sources <strong>of</strong> magnetic fields, in 2003<br />

ICNIRP has published a new guideline for pulsed<br />

and complex non-sinusoidal waveforms [10]. This<br />

document is strongly based on the result obtained<br />

from K. Jokela [11]. It addresses the exposure evaluation<br />

in non-sinusoidal conditions by means <strong>of</strong><br />

proper weighting factors to be applied to different<br />

harmonic components <strong>of</strong> the waveform spectrum.<br />

In 2010 the ICNIRP published a new set <strong>of</strong><br />

guidelines [13]. There are two main differences<br />

between this document and the older one: 1) the<br />

dosimetric quantity used for ELF electromagnetic<br />

field is E (V/m) instead <strong>of</strong> J (A/m2 ). 2) The limits<br />

imposed on action values have been increased<br />

as can be observed in Fig. 1.<br />

From the analysis <strong>of</strong> the current literature, several<br />

papers analyze the same problem by computing<br />

the spectrum <strong>of</strong> the measured welding current.


Fig. 1. Reference levels for occupational exposure to time<br />

varying magnetic field. Comparison between 1998 and 2010<br />

values.<br />

Afterward, the welder is usually modeled as a<br />

coil and, simulations are performed in frequency<br />

domain for each spectral component <strong>of</strong> the current<br />

in order to derive the current density inside a<br />

human model or a simplified and standardized<br />

model [8], [9], [12]. Even if these kind <strong>of</strong> simulations<br />

are not an easy task, the procedure is <strong>of</strong>ten<br />

preferred because it is easier to measure the current<br />

in time domain rather than the magnetic field,<br />

excpecially for quasi-rectangular waveform [14].<br />

The drawback <strong>of</strong> this procedure is that assuming<br />

a spectrum for the current, the generated magnetic<br />

field is characterized by the same spectrum in all<br />

the surrounding point due to the neglection <strong>of</strong> the<br />

nonlinear electrical devices inside the body <strong>of</strong> the<br />

welder.<br />

In this paper the different methodologies provided<br />

by the ICNIRP to analyze pulsed and nonsinusoidal<br />

magnetic field will be applied to the<br />

MFCD spot welding devices. The actual waveform<br />

<strong>of</strong> the magnetic field have been measured taking<br />

care to the possible measurement problem [14].<br />

Finally, in order to compute the exposure level<br />

the limit provided by the ICNIRP 1998 has been<br />

employed due to the fact that currently the Italian<br />

regulation framework refers to those guidelines.<br />

II. MFDC SYSTEMS<br />

In Fig. 2 the conversion chain <strong>of</strong> the MFDC<br />

system is represented. The supply power, taken<br />

from the three-phase system at 50 Hz, is driven<br />

by means <strong>of</strong> a rectifier to an IGBT switch. The<br />

waveform at the input/output <strong>of</strong> the transformer is<br />

characterized by a frequency <strong>of</strong> 1000 Hz and, after<br />

a full-wave rectification (f = 2000 Hz), is applied<br />

to the welder terminals. The welder terminal can<br />

be considered as a R-L load. The switching <strong>of</strong><br />

the IGBT bridge is controlled so that the welding<br />

current reaches a desired (constant) value. The<br />

- 102 - 15th IGTE Symposium 2012<br />

welding process can be performed by means <strong>of</strong><br />

a single impulse or with more the one impulse.<br />

In Fig. 3 the waveform <strong>of</strong> the weld current is<br />

shown. As it can be observed, the actual current<br />

is not perfectly rectangular because the conversion<br />

system is not able to nullify completely a ripple at<br />

2000 Hz (and higher harmonics) that is superposed<br />

to the weld current. The main weld parameter are:<br />

the current peak (Ip) that is usually in the order<br />

<strong>of</strong> some kA, the weld time that is the duration <strong>of</strong><br />

the single pulse and the hold time that is a period<br />

when the current is zero after the welding, but the<br />

electrodes are still applied to the sheet to chill the<br />

weld.<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Fig. 2. MFDC spot welding device under analysis. The<br />

reference system for the measurement is placed in the center<br />

<strong>of</strong> the welding coil.<br />

current<br />

peak<br />

weld time hold time<br />

Fig. 3. Weld current and its main parameters. Weld time =<br />

140 ms, hold time = 40 ms.<br />

In the spot welding sector the MFDC welder are<br />

<strong>of</strong>ten preferred to the AC ones because <strong>of</strong> some<br />

benefit: a shorter weld times is required due to the<br />

the DC output current, hence significant energy<br />

saving is obtained. Moreover, MFDC systems are<br />

very stable in working condition far from the rating<br />

power (useful range: 20-95%). Conversely, AC<br />

systems are unstable and inefficient when used<br />

outside the 70-90% <strong>of</strong> the rating power.<br />

III. EXPERIMENTAL MEASUREMENT<br />

In this paper the magnetic field emission <strong>of</strong><br />

a MFDC welder produced by KGR S.p.A. is<br />

analyzed. In Fig. 4 the layout with dimensions


is shown. The MWG model is a manual welder,<br />

therefore the operator is quite close to the device<br />

during the welding operation. In Fig. 5 a classical<br />

working configuration is reported in frontal and<br />

lateral view.<br />

Several observation points have been defined<br />

in order to evaluate the human exposure in the<br />

working configuration represented in Fig. 5. It<br />

has to be stressed that the considered working<br />

configuration is also the most critical one because<br />

the operator is faced to the welder coil.<br />

The measurements points are summarized in Table<br />

I. The coordinates are related to the reference<br />

system in Fig. 6.<br />

Fig. 4. Weld current and its main parameters. Weld time =<br />

140 ms, hold time = 40 ms.<br />

(a) (b)<br />

Fig. 5. working configuration: front view (a) and side view<br />

(b)<br />

TABLE I<br />

FIELD POINTS<br />

field point x (m) y (m) z (m)<br />

P1 0 0 0.28<br />

P2 0 0 0.5<br />

P3 0.5 0 0.28<br />

P4 0.5 0 0.5<br />

P5 0.8 0 0.28<br />

P6 0.8 0 0.5<br />

Al the measurement will be referred to the<br />

current represented in Fig. 3: current peak (Ip)<br />

equal to 12 kA, wled time equal to 140 ms and<br />

hold time equal to 40 ms.<br />

For each measurement point the waveform <strong>of</strong><br />

the magnetic field has been measured along the<br />

three axis (x, y and z). Finally the rms values<br />

- 103 - 15th IGTE Symposium 2012<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Fig. 6. Reference system for measurement points definition<br />

has been computed. The total field waveforms for<br />

each observation points are shown in Fig. 7. By<br />

comparing those waveforms it is possible to derive<br />

some considerations: 1) as expected, the maximum<br />

value is observed for the field points P1 and P2<br />

that are faced to the welder coil. 2) P1, P3 and P5<br />

present a higher peak values with respect to P2,<br />

P5 and P6 because they are closer to the welder.<br />

3) by defining two groups <strong>of</strong> points with the same<br />

distance from the welder [P1, P3, P5] and [P2,<br />

P4, P6] it is possible to note that the peak value<br />

decreases by moving far from the welder coil. 4)<br />

the decreasing low for the peak value is not true<br />

for the medium frequency ripple superposed to the<br />

waveforms. It seems that the observation points<br />

P3 and P4 are characterized by the higher ripple.<br />

This consideration will be better investigated in the<br />

following by analyzing the spectrum components<br />

<strong>of</strong> all the waveforms. At this stage, a qualitative<br />

explanation can be given considering that P3 and<br />

P4 are located in front <strong>of</strong> the full wave rectifier<br />

connected to the secondary winding <strong>of</strong> the medium<br />

frequency transformer. Hence, P3 and P4 are the<br />

points more influenced by the conversion system.<br />

IV. HUMAN EXPOSURE EVALUATION<br />

The magnetic field produced by MFDC spot<br />

welding system is a pulsed magnetic field (see<br />

Fig. 7). With reference to the ICNIRP guidelines<br />

[2], [10], [11], it can be analyzed with three different<br />

approaches: the equivalent frequency method,<br />

the multiple frequency method and the weighted<br />

multiple frequency method.<br />

A. Equivalent frequency method<br />

The equivalent frequency method is introduced<br />

with the note 4 <strong>of</strong> Table 4 and 6 in the ICNIRP<br />

guidelines [2] . Afterward, it is better detailed in<br />

the guidelines focused on pulsed magnetic fields<br />

[10]. The method simply takes into account the<br />

pulse duration (tp) and maximum value <strong>of</strong> the<br />

waveform during impulse. Finally it refers to the<br />

equivalent and continuous sinusoidal field with a<br />

frequency feq =1/tp in order to test compliance<br />

with the reference levels.


(a)<br />

(b)<br />

(c)<br />

Fig. 7. P1 and P2 (a) P3 and P4 (c) P5 and P6<br />

Fig. 8. Equivalent frequency method: application to the<br />

measurement point P1<br />

If a single pulse <strong>of</strong> the magnetic field measured<br />

in the observation point P1 is analyzed by means<br />

<strong>of</strong> the equivalent frequency method the result <strong>of</strong><br />

Fig. 8 is obtained. The single pulse is characterized<br />

- 104 - 15th IGTE Symposium 2012<br />

by tp = 140 ms that leads to a continuous sinusoidal<br />

field with frequency equal to feq =3.5 Hz.<br />

The amplitude <strong>of</strong> the sinusoidal field is imposed<br />

to the maximum valued observed during the single<br />

pulse, i.e. 6298.3 μT.<br />

It is now simple to identify the reference level<br />

for a 3.5 Hz sinusoidal field from Fig. 1, that is<br />

16327 μT. It must be stressed that reference level<br />

is a rms value, therefore it must be compared with<br />

6298.3/ √ 2 = 4453.5 μT. Finally, by means <strong>of</strong> the<br />

equivalent frequency method, the welder produces<br />

a magnetic field that is 3.5 times lower than the<br />

applicable reference level in the observation point<br />

with the higher emission.<br />

B. Multiple frequency method<br />

The multiple frequency method is suggested for<br />

non-coherent waveform, i.e. waveform that can not<br />

be measured with repeatability property because <strong>of</strong><br />

their intrinsic variation in time.<br />

The procedure is summarized in the following<br />

steps:<br />

• selection <strong>of</strong> the signal to be analyzed<br />

• perform the Fourier Transform <strong>of</strong> the signal<br />

• computing the global indicator defined as:<br />

where:<br />

Is =<br />

65 kHz<br />

Bj<br />

BL,j<br />

j=1 Hz<br />

+<br />

10MHz j>65 kHz<br />

Bj<br />

b<br />

(1)<br />

• Bj is the magnetic flux density at frequency<br />

j;<br />

• BL,j is the magnetic flux density reference<br />

level;<br />

• b is 30.7 μT (rms) for occupational exposure<br />

The exposure is compliant with the limit if Is < 1.<br />

In order to test the compliance with this method<br />

a single impulse in each observation point has<br />

been selected in order to perform the Fourier<br />

Transform. Finally, in order to better point out the<br />

rationale <strong>of</strong> this methodology, the Is representation<br />

is provided:<br />

• graphically: in the x-axis is represented<br />

the frequency and in the y-axis the ration<br />

Bj/BL,j. With this it is possible to understand<br />

what are the frequency components that<br />

exceed the relative limit.<br />

• numerically: computation <strong>of</strong> Is with (1)<br />

For the sake <strong>of</strong> brevity the graphical result are<br />

shown just for the measurement points P1 and P3.<br />

In Fig. 9 it is possible to see the spectrum <strong>of</strong> the P1<br />

and P3 waveforms. It is possible to observe that the<br />

harmonic content is mainly located in the range 0-<br />

100 Hz with significant values <strong>of</strong> DC component.<br />

The spectrum is also characterized from a 2000 Hz


(a)P1:x=0m,y=0m,z=0.28 m<br />

(b)P3:x=0.5m,y=0m,z=0.28 m<br />

Fig. 9. Spectral analysis <strong>of</strong> the measured waveform<br />

(a)P1:x=0m,y=0m,z=0.28 m<br />

(b)P3:x=0.5m,y=0m,z=0.28 m<br />

Fig. 10. Computation <strong>of</strong> the ICNIRP limit by means <strong>of</strong> the<br />

multiple frequency method<br />

- 105 - 15th IGTE Symposium 2012<br />

component and its multiple. As explained before<br />

those components are generated from the final<br />

full wave rectifier. In fact, the observation points<br />

P3 that is faced to the rectifier, present in its<br />

spectrum the higher harmonic content for this set<br />

<strong>of</strong> frequencies.<br />

From the analysis <strong>of</strong> Fig. 10 it comes out that<br />

none <strong>of</strong> the observation points can be considered<br />

complaint if the multiple frequency method is<br />

adopted because the Is values exceed significantly<br />

the unitary reference level (see also Table II for the<br />

other measurement points). Remembering that the<br />

y-axis <strong>of</strong> Fig. 10 represents the summation terms<br />

<strong>of</strong> (1) it is possible to observe that the overcoming<br />

<strong>of</strong> the limit is mainly due to the 2000 Hz and 4000<br />

Hz components. For point P3 those harmonics are<br />

about 5 times higher than the relative reference<br />

level. It is worth noting that reference levels become<br />

stricter for higher frequencies. Moreover,<br />

above 820 Hz eq. (1) imposes a constant limit<br />

equal to 30.7 μT even if in Fig. 1 the trend is<br />

still decreasing above 65 kHz.<br />

Finally, it must be stressed that this procedure<br />

is based on the assumption that the spectral components<br />

add in phase, i.e., all maxima coincide at<br />

the same time and results in a sharp peak. This is<br />

a realistic assumption when the number <strong>of</strong> spectral<br />

components is limited and their phases are not<br />

coherent, i.e., they vary randomly. For fixed coherent<br />

phases the assumption may be unnecessarily<br />

conservative [2], [11], [13].<br />

C. Weighted multiple frequency method<br />

The weighted multiple frequency method is proposed<br />

for coherent waveform, i.e. waveform that<br />

can be measured with repeatability property. For<br />

these signals the multiple frequency method always<br />

leads to a conservative exposure assessments.<br />

This modification provides an alternative method<br />

based on weighted peak that more closely recognizes<br />

the nature <strong>of</strong> biological interactions.<br />

The procedure is summarized in the following<br />

steps:<br />

• selection <strong>of</strong> the signal to be analyzed<br />

• perform the Fourier Transform <strong>of</strong> the signal<br />

• computing the global indicator defined as:<br />

<br />

<br />

<br />

<br />

<br />

<br />

Iw = max <br />

(WF) j<br />

Bj cos (2πfjt + θj + ϕj) <br />

<br />

j<br />

<br />

(2)<br />

where:<br />

• Bj is the amplitude <strong>of</strong> the j-th frequency<br />

component;


• WFj is the weighting function where the<br />

magnitude is equal with the inverse <strong>of</strong> the<br />

peak reference level at j-th frequency<br />

• θj is the phase <strong>of</strong> the i-th component <strong>of</strong> B<br />

• ϕj is the phases <strong>of</strong> the weighting function<br />

<strong>of</strong> the i-th component. It should satisfy the<br />

conditions:<br />

– ϕ(f) =π/2 if ffc<br />

• fc = 820 Hz for occupational exposure<br />

The exposure is compliant with the limit if Iw < 1.<br />

Conversely from the multiple frequency method<br />

here the spectrum <strong>of</strong> the waveform is weighted by<br />

a complex function. It is possible to observe that<br />

the ICNIRP definition <strong>of</strong> the weighting function<br />

for magnetic flux density waveform is a high-pass<br />

filter with cut-<strong>of</strong>f frequency equal to fc = 820 Hz<br />

[10], [11].<br />

In order to better point out the rationale <strong>of</strong> this<br />

methodology, the Iw representation is provided:<br />

• graphically: it is provided the<br />

graph <strong>of</strong> the function <strong>of</strong> time:<br />

<br />

j (WF) j Bj cos (2πfjt + θj + ϕj)<br />

• numerically: computation <strong>of</strong> Iw with (2)<br />

From the analysis <strong>of</strong> the results presented in<br />

Fig. 11 it is possible to observe that even applying<br />

the weighting methods none <strong>of</strong> the observation<br />

points can be considered compliant with<br />

the unitary reference level (see also Table II for<br />

the other measurement points). In spite <strong>of</strong> this<br />

result, the weighting procedure allows a significant<br />

reduction for some observation points (see P1<br />

and P3 values). This is in accordance to the fact<br />

that for pulsed fields associated to spot welding<br />

machine the multiple frequency method is a too<br />

conservative assessment procedure.<br />

V. CONCLUSIONS<br />

In this paper the magnetic field generated from<br />

MFDC spot welding systems have been analyzed.<br />

All the possible methodology suggested in the<br />

international guidelines provided by the ICNIRP<br />

have been employed: the equivalent frequency<br />

method, the multiple frequency method and the<br />

weighted multiple frequency method.<br />

The main conclusion is that the three methodologies<br />

lead to contrasting result. By means <strong>of</strong><br />

equivalent frequency method the emission is compliant<br />

with the reference level even in the maximum<br />

emission point (3.5 times lower than the<br />

limit). On the other hand, the other two approaches<br />

provide opposite results, i.e., none <strong>of</strong> the surrounding<br />

points is compliant with the limit.<br />

In Table II the results for weighted and nonweighted<br />

multiple frequency method are summarized.<br />

For most <strong>of</strong> the observation points, a high<br />

- 106 - 15th IGTE Symposium 2012<br />

(a)P1:x=0m,y=0m,z=0.28 m<br />

(b)P3:x=0.5m,y=0m,z=0.28 m<br />

Fig. 11. Computation <strong>of</strong> the ICNIRP limit by means <strong>of</strong> the<br />

wighted multiple frequency method<br />

difference in the result is observed. In our opinion<br />

for the spot welding process the multiple frequency<br />

method is too conservative because the waveform<br />

is clearly coherent due to the supply control system.<br />

However, result for weighted multiple frequency<br />

method are still quite far from the unity<br />

limit.<br />

The definition <strong>of</strong> exposure limits for pulsed<br />

magnetic fields is still an open problem. Therefore<br />

no Standard and guidelines define precisely<br />

what is the constraint and the procedure to be<br />

observed for spot welding machine as well as other<br />

technologies characterized by similar emission <strong>of</strong><br />

pulsed magnetic field (e.g MRI devices). One <strong>of</strong><br />

the consequence is that the Directive 2004/40/EC<br />

has been modified extending the deadline for the<br />

application <strong>of</strong> the reference levels [15], [16] that<br />

now is fixed for October 31th 2013. Concerning<br />

the EU directives, it must be stressed that reference<br />

levels are always related to acute effects and not to<br />

possible long term effect. In the literature, as well<br />

as in the real life applications, there is no evidence<br />

<strong>of</strong> acute effects. For possible long term effects, the<br />

epidemiological studies recognizes that it is hard<br />

to estimate the exposure because may arise other<br />

type <strong>of</strong> exposure in the same workplace which are<br />

correlated to the same EMF exposure and which<br />

may affect the health <strong>of</strong> workers. An example<br />

concerns exposure to welding fumes which may


increase lung cancer risks among welders [17],<br />

[18].<br />

TABLE II<br />

COMPARISON BETWEEN WEIGHTED AND NON-WEIGHTED<br />

MULTIPLE FREQUENCY METHOD<br />

field point Is Iw<br />

P1 99.68 16.77<br />

P2 13.60 4.23<br />

P3 125.28 36.41<br />

P4 18.23 7.51<br />

P5 23.64 7.80<br />

P6 6.38 4.61<br />

From the technical point <strong>of</strong> view it is quite impossible<br />

to apply a mitigation system to the welder<br />

coil for (obvious) operating reason. The analysis <strong>of</strong><br />

Table II together with Fig. 9 allows to understand<br />

that the overcoming <strong>of</strong> the limit is mainly due to<br />

the 2000 Hz and 4000 Hz components. For point<br />

P3 those harmonics are about 5 times higher than<br />

the relative reference level. Therefore, the future<br />

development <strong>of</strong> this work is to design a shielding<br />

case for the transformer and the full wave rectifier.<br />

Obviously the mitigation system will not reduce<br />

the maximum value <strong>of</strong> the magnetic field (that is<br />

generated from the welder coil). The aim is just to<br />

reduce as much as possible the medium frequency<br />

components that seem to play a significant role in<br />

equations (1) and (2).<br />

REFERENCES<br />

[1] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council<br />

<strong>of</strong> 29 april 2004 on the minimum health and safety<br />

requirements regarding the exposure <strong>of</strong> workers to the<br />

risks arising from physical agents (electromagnetic fields)<br />

european parliament and council.<br />

[2] ICNIRP. Guidelines for limiting exposure to time varying<br />

electric, magnetic and electromagnetic fields (up to 300<br />

GHz). Health Phys, 74(4):494–522, 1998.<br />

[3] H. Heinrich. Assessment <strong>of</strong> non-sinusoidal, pulsed or<br />

intermittent exposure to low frequency electric and magnetic<br />

fields. Health Phys, 96(6):541–546, 2007.<br />

[4] R. Scorretti, N. Burais, A. Fabregue, and O. Nicolas, and<br />

L. Nicolas. Computation <strong>of</strong> the induced current density<br />

into the human body due to relative LF magnetic field<br />

generated by realistic devices. IEEE Transactions on<br />

Magnetics, 40(2):643–646, 2004.<br />

[5] D. Desideri and A. Maschio. Magnetic field emissions<br />

up to 400 kHz from a welding equipment. In Proc.<br />

Int. Symp. Electromagnetic Compatibility, pages 151–<br />

156, Barcelona, 2006.<br />

[6] D. Desideri, A. Maschio, and P. Mattavelli. Human<br />

exposure topulsed current waveforms below 100 kHz.<br />

In 391-396, editor, Proc. Int. Symp. Electromagnetic<br />

Compatibility, Hamburg, Sep. 8–12, 2008.<br />

[7] G. Kang and O. Gandhi. Comparison <strong>of</strong> various safety<br />

guidelines for electronic article surveillance devices with<br />

pulsed magnetic fields. IEEE Transactions on Biomedical<br />

Engineering, 50(1), 2003.<br />

[8] A. Canova, F. Freschi, and M. Repetto. Evaluation <strong>of</strong><br />

workers expo- sure to magnetic fields. The European<br />

Physical Journal Applied Physics, 52(2), 2010.<br />

- 107 - 15th IGTE Symposium 2012<br />

[9] A. Canova, F. Freschi, L. Giaccone, and M. Repetto. Exposure<br />

<strong>of</strong> working population to pulsed magnetic fields.<br />

IEEE Transaction on Magnetics, 46(8):2819–2822, 2010.<br />

[10] ICNIRP. Guidance on determining compliance <strong>of</strong> exposure<br />

to pulsed and complex non-sinusoidal waveform<br />

below 100 kHz with icnirp guidelines. Health Phys,<br />

84(3):383–387, 2003.<br />

[11] K. Jokela. Restricting exposure to pulsed and broadband<br />

magnetic fields. Health Phys, 79(4):373–388, 2000.<br />

[12] F. Dughiero, M. Forzan, and E. Sieni. A numerical<br />

evaluation on electromagnetic fields exposure on real<br />

human body models until 100 khz. COMPEL, 29:1552–<br />

1561, 2010.<br />

[13] ICNIRP. Guidelines for limiting exposure to time-varying<br />

electric and magnetic fields (1 Hz to 100 kHz). Health<br />

Phys, 99(6):818–836, 2010.<br />

[14] G. Crotti and D. Giordano. Problems in the detection<br />

<strong>of</strong> quasi-rectangular magnetic flux density waveforms. In<br />

18th Symposium IMEKO TC4, Natal (Brasil), September<br />

2001.<br />

[15] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council <strong>of</strong><br />

23 april 2008 amending directive 2004/40/ec on minimum<br />

health and safety requirements regarding the exposure<br />

<strong>of</strong> workers to the risks arising from physical agents<br />

(electromagnetic fields) (18th individual directive within<br />

the meaning <strong>of</strong> article 16(1) <strong>of</strong> directive 89/391/eec).<br />

[16] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council <strong>of</strong><br />

19 april 2012 amending directive 2004/40/ec on minimum<br />

health and safety requirements regarding the exposure<br />

<strong>of</strong> workers to the risks arising from physical agents<br />

(electromagnetic fields) (18th individual directive within<br />

the meaning <strong>of</strong> article 16(1) <strong>of</strong> directive 89/391/eec).<br />

[17] R.M. Sterns. Cancer incidence among welders: possible<br />

effects <strong>of</strong> exposure to extremely low frequency<br />

electromagnetic radiation (elf) and to welding fumes.<br />

Environmental Health Perspectives, 76:221–229, 1987.<br />

[18] Review <strong>of</strong> the scientific evidence for limiting exposure to<br />

electromagnetic fields (0-300 ghz). Technical Report Vol.<br />

15 N.3, National Radiological Protection Board, 2004.


- 108 - 15th IGTE Symposium 2012<br />

A Circuital Approach for Eddy Currents Fast<br />

Evaluation in Beam-like Structures<br />

A. Formisano<br />

Dipartimento di Ingegneria Industriale e dell’Informazione<br />

Seconda Università di Napoli, via Roma 29, I-81031 Aversa (CE), Italy<br />

E-mail: Alessandro.Formisano@unina2.it<br />

Abstract — The electromagnetic analysis <strong>of</strong> mechanic or civil structures composed by an interconnection <strong>of</strong> beam-like<br />

elements, mechanically interconnected to create a structural mesh, can be formulated in terms <strong>of</strong> an equivalent lumped<br />

elements electric network. This is the case <strong>of</strong> the eddy currents evaluation in a truss bridge or a bridge crane exposed to a<br />

time varying electromagnetic field or in the reinforcement <strong>of</strong> buildings concrete. In such cases, if compatible with the<br />

accuracy needs, the network approach can be very effective thanks to the strong reduction <strong>of</strong> the model complexity. The<br />

paper proposes such a kind <strong>of</strong> formulation, based on concept <strong>of</strong> partial inductance. Advantage is taken from automated treebuilding<br />

algorithms for electric networks, and on minimum order formulations based on loop currents to further reduce<br />

computational complexity.<br />

Index Terms— Eddy Currents, Electric Circuit Theory, Filamentary Structures<br />

I. INTRODUCTION<br />

The use <strong>of</strong> metallic materials in mechanical and civil<br />

engineering is a common practice, due to the extremely<br />

favorable behavior <strong>of</strong> such materials with respect to<br />

stresses and mechanical solicitations. On the other hand,<br />

when exposed to time varying electromagnetic fields,<br />

metallic structures react by generating a field due to<br />

induced currents in their volume. The effect <strong>of</strong> such fields<br />

may reveal critical in some particular applications, such<br />

as when forces on the structures must be taken under<br />

control, or when aging phenomena are facilitated by<br />

electric currents in the metal, or when the electromagnetic<br />

field map must be strictly controlled in critical regions<br />

not far from the structures (e.g. to reduce interference on<br />

electronic devices or to avoid impact on physical<br />

phenomena requiring controlled field maps, such as<br />

Nuclear Magnetic Resonance, or finally to limit human<br />

exposure to electromagnetic energy).<br />

In these cases, the possible interactions with<br />

surrounding electromagnetic field sources must be<br />

considered in the design phase. Unfortunately, fully 3D<br />

numerical analysis would usually be required, since no<br />

particular symmetry or simplification can be expected to<br />

reduce model complexity. On the other hand, such a<br />

model would require a quite large computational effort,<br />

although just an estimate <strong>of</strong> the electromagnetic effects<br />

due to the mechanical structure are <strong>of</strong>ten enough in the<br />

design step.<br />

In the particular cases when the mechanical structures<br />

can be modeled using interconnects <strong>of</strong> beam-like<br />

elements (e.g. truss bridges, bridge cranes, iron rebar in<br />

reinforced concrete, etc.), the electromagnetic analysis, in<br />

the magneto-quasi-static limit and assuming linear<br />

behavior <strong>of</strong> all the materials, can be formulated in terms<br />

<strong>of</strong> an equivalent electric network, composed by lumped<br />

elements. The current in each branch <strong>of</strong> the network is<br />

related to the current density in the corresponding beamlike<br />

element <strong>of</strong> the structure thanks to a filamentary<br />

current element approximation <strong>of</strong> the actual beam. Each<br />

current element, or current stick, is defined by stick tips,<br />

and a (scalar) stick currents.<br />

The interconnection <strong>of</strong> sticks is defined through a<br />

suitable incidence matrix, defined from the actual 3D<br />

topology. The lumped network is created by associating<br />

to each single beam <strong>of</strong> the original structure a lumped<br />

parameter model, falling in the typical circuit classes. It<br />

follows that a resistive parameter has to be used to model<br />

the Ohmic behavior <strong>of</strong> the metallic element. In addition, a<br />

set <strong>of</strong> inductances should be used to represent the<br />

induction phenomena among sticks and their capability to<br />

accumulate magnetic energy. Finally, an electromotive<br />

force (typical <strong>of</strong> the voltage sources in the circuits) can be<br />

used to represent the induction phenomena from assigned<br />

external currents.<br />

In principle, also capacitive parameters should be<br />

considered to take into account the capability to<br />

accumulate electric energy, but in the range <strong>of</strong><br />

frequencies here considered the impact <strong>of</strong> capacitive<br />

phenomena can be neglected.<br />

The mathematical tools able to face such a class <strong>of</strong><br />

systems are the well known Kirchh<strong>of</strong>f laws, replacing the<br />

most general Maxwell ones, significantly reducing the<br />

complexity <strong>of</strong> the model.<br />

Although the lumped network approach to treat similar<br />

structures is quite diffused in the electromagnetic analysis<br />

<strong>of</strong> mechanical structures [1-4], the network analysis is<br />

usually performed using standard computer codes, not<br />

necessarily guaranteeing the minimum computational<br />

effort. In this paper, an automated fundamental loop<br />

method is proposed to achieve a minimum complexity<br />

resolution <strong>of</strong> the network.<br />

Once currents in each branch are known, simple closed<br />

formulas can be used to estimate the field produced by<br />

each stick, forces on the structure elements, and Ohmic<br />

losses due to induction phenomena. This approach is<br />

particularly useful when a quick, yet not accurate<br />

estimation <strong>of</strong> the impact <strong>of</strong> metallic structures, is


equired.<br />

II. MATHEMATICAL MODELING<br />

Let's consider a structure composed <strong>of</strong> Nb metallic<br />

beams, connected in a general way at their tips in Nn<br />

nodes.<br />

The equivalent lumped network will be composed <strong>of</strong><br />

Nb branches, with the same topology as the mechanical<br />

structure. According to the geometrical characteristics <strong>of</strong><br />

the beam and, in addition, to the electromagnetic<br />

characteristics <strong>of</strong> the materials, each branches is endowed<br />

with a suitable set <strong>of</strong> circuit elements, including a<br />

resistance, an inductance, some mutual inductances, and a<br />

suited number <strong>of</strong> voltage sources.<br />

A very simple example is reported in Fig. 1(a), while<br />

in Fig. 1(b) the equivalent electric network is sketched.<br />

Since the assembly is immersed in a time varying field,<br />

we will assume that each stick, arbitrarily oriented,<br />

carries a current ik, k=1, 2...Nb, that represents the<br />

unknown to be determined. The currents are induced by<br />

an external field, but their value depends also on the<br />

structure itself, through material properties and<br />

geometrical relationships.<br />

voltage<br />

source<br />

+ -<br />

(a)<br />

branch<br />

resistance<br />

branch<br />

current ik<br />

(b)<br />

self inductance<br />

and mutual with<br />

all other branches<br />

Figure 1: (a) An example <strong>of</strong> mechanical interconnect <strong>of</strong> beam like<br />

metallic elements (the structure <strong>of</strong> a truss bridge); (b) its representation<br />

as an electric network.<br />

Each stick can be characterized by a resistance Rk.<br />

depending from its resistivity k, length Lk and cross<br />

section Sk. If assuming that the penetration depth at the<br />

highest frequency involved is smaller than the transverse<br />

dimension <strong>of</strong> the beam modeled by stick, and that both<br />

- 109 - 15th IGTE Symposium 2012<br />

resistivity and cross section are uniform along the beam,<br />

the resistance associated the k-th stick can be computed<br />

as [5]:<br />

Rk=k*Lk/Sk<br />

Of course, if any <strong>of</strong> the above exposed hypotheses<br />

falls, the more general expression using the line integral<br />

along the beam axis <strong>of</strong> k(l)/Sk(l) can be used.<br />

The resistances are then assembled into a diagonal<br />

resistance matrix R.<br />

In addition, if assuming a linear magnetic behavior, the<br />

sticks assembly is characterized also by an inductance<br />

matrix Mb, whose elements describe the mutual<br />

inductance between sticks or, on the diagonal, their self<br />

inductance. Under the same assumptions used for (1)<br />

about skin depth, the self inductance Mkk <strong>of</strong> the k-th stick<br />

can be computed using [6]:<br />

4 2Lk <br />

Mkk 210 Lkln<br />

1 r<br />

<br />

Lk<br />

where r is the geometric mean distance and is the<br />

arithmetic mean distance on the corresponding k-th beam<br />

cross section. (2) provides self inductance in Henry is Lk<br />

is in meters.<br />

The mutual inductance Mjk between the j-th and k-th<br />

stick can be computed using formulas from [6]; as a<br />

possible alternative, assuming a limited dimension <strong>of</strong> the<br />

cross section, the mutual inductance can be evaluated also<br />

by line integrating (numerically) the vector potential<br />

Ak(x) <strong>of</strong> stick k on the axis <strong>of</strong> stick j:<br />

ˆ<br />

M A x tˆdl<br />

jk k j dl j<br />

j<br />

where j is the centerline along the j-th beam, xj is a<br />

generic point along j, and ˆt is the centerline tangent unit<br />

vector. The following expression has been used in this<br />

study for Ak [5]:<br />

j <br />

where a, b and c are defined in Fig. 2, â is the unit<br />

vector along the k-th stick, and suitable countermeasures<br />

have been taken to avoid singularities when sticks are<br />

aligned [7].<br />

(1)<br />

(2)<br />

(3)<br />

I<br />

k 0 ˆ<br />

cba A x a ln<br />

4<br />

<br />

cba <br />

(4)<br />

<br />

c<br />

Ak b<br />

xj<br />

Figure 2: Basic elements form computation <strong>of</strong> vector potential using<br />

(3).<br />

Note that eq. (3) can be easily generalized to massive<br />

a


conductors, if the thin beam approximation may reveal<br />

too crude for the analysis, while this is not the case for<br />

closed form expressions found in [6].<br />

The structure is supposed to be immersed in the timevarying<br />

magnetic field produced by another set <strong>of</strong> Ne<br />

external currents ie, linked with the sticks by means <strong>of</strong> a<br />

mutual inductance matrix Me.<br />

Elements <strong>of</strong> Me can be easily evaluated using<br />

expressions based on (3). For the particular shape <strong>of</strong> field<br />

source, suitable analytical (possibly approximate)<br />

expression are available; e.g., the mutual inductance <strong>of</strong> a<br />

stick and a power line can be easily computed using<br />

formulas from [6]. Of course, the more general procedure<br />

based on suitable decomposition in elementary sticks and<br />

a numerical evaluation <strong>of</strong> (3) can be used.<br />

If assuming that external sources are given (because<br />

not influenced by eddy currents induced in the structure<br />

or some other factor) in each branch, the induced emf can<br />

be circuitally described as a voltage source. The set <strong>of</strong> the<br />

voltages is given by<br />

e= Me die/dt + d Me /dt ie<br />

where the last term vanishes in case <strong>of</strong> time invariance <strong>of</strong><br />

the matrix Me.<br />

Within these hypotheses, the system can be regarded as<br />

a R-L circuit, where sticks play the role <strong>of</strong> branches and<br />

Nn nodes represent the stick tips.<br />

The network topology is described by the incidence<br />

matrix (Nn rows and Nb column) providing, for each<br />

branch, the couple <strong>of</strong> starting-ending nodes. The<br />

incidence matrix can be easily recovered from CAD<br />

schemes for the mechanical assembly, where available, or<br />

by survey <strong>of</strong> the drawings.<br />

It is well known that the rank <strong>of</strong> incidence matrix is<br />

lower than the number <strong>of</strong> nodes and its value, for a<br />

connected network is Nn–1; consequently, <strong>of</strong>ten the<br />

“reduced” incidence matrix A is used, as it will be done in<br />

the following. The graph theory guarantees that<br />

independent columns in an incidence matrix do not form<br />

loops. It follows that a basis <strong>of</strong> the columns set defines a<br />

set <strong>of</strong> branches able to connect all the nodes <strong>of</strong> the<br />

network, i.e. a tree <strong>of</strong> the network<br />

A fast and effective way to search for independent<br />

columns is to determine the echelon form <strong>of</strong> A [8] and<br />

select the branch corresponding to the leading<br />

coefficients Of course, in general several trees can be<br />

defined for an assigned network; the choice <strong>of</strong> the<br />

extracted tree among all the possible ones can be<br />

controlled by ordering the columns <strong>of</strong> the incidence<br />

matrix in such a way that column corresponding to<br />

favourite branches are the leftmost ones.<br />

In order to simplify a number <strong>of</strong> automatic treatment <strong>of</strong><br />

the network topology, it is recommended to rearrange the<br />

branch numbering <strong>of</strong> the matrix in such a way to include<br />

in the first NT positions (Nn-1 in case <strong>of</strong> connected<br />

networks), the columns <strong>of</strong> the tree branches. Then, the<br />

incidence matrix A can be partitioned as:<br />

(5)<br />

A =(AT; AC) (6)<br />

- 110 - 15th IGTE Symposium 2012<br />

where, AT is the NTxNT non singular matrix corresponding<br />

to tree branches, and AC the NTxNb matrix corresponding<br />

to co-tree branches.<br />

Several effective methods can be used to face with the<br />

analysis <strong>of</strong> this network.<br />

One <strong>of</strong> the most effective and popular is the nodal<br />

technique whose unknowns are the nodes potential set vn.<br />

Here, for simplicity just the formulation in case <strong>of</strong> linear,<br />

memory free, voltage controlled components is<br />

highlighted:<br />

Yn vn = Jn<br />

where Yn and Jn are the nodal matrix and the nodal drive<br />

equivalent current vector, respectively. Both can be easily<br />

evaluated by the branch parameters. In particular<br />

Yn = A Yb A T , where Yb is the NbxNb matrix with the<br />

conductances (self or mutual) <strong>of</strong> the branches. The<br />

method can be extended to circuits with linear current<br />

controlled components or linear dynamical components<br />

and, in addition, also in presence <strong>of</strong> non linear<br />

components. Of course, in any case, the existence and the<br />

uniqueness <strong>of</strong> solution has to be assessed.<br />

Here the classical dual formulation is proposed, whose<br />

unknowns are the principal loop currents IL [9]. The<br />

model, for general dynamic systems, can be stated in time<br />

domain; here, taking advantage from linearity, the more<br />

compact formulation in the Laplace space is used:<br />

M<br />

s s s <br />

L M<br />

(7)<br />

Z I E (8)<br />

where IL and EM are the arrays <strong>of</strong> the principal loop<br />

currents and driven voltages, respectively, and s is the<br />

complex frequency.<br />

In (8) for simplicity, the hypotheses <strong>of</strong> linear, voltage<br />

controlled components has been assumed.<br />

T<br />

The loop impedance matrix BZ<br />

M b B<br />

Z can be<br />

deduced from the branch impedance matrix Zb = (Rb+sMb)<br />

and from the topological NLxNb matrix B <strong>of</strong> the principal<br />

loops related to the tree [8], where NL = Nb-NT is the<br />

maximum number <strong>of</strong> independent loops. Similar<br />

expressions hold for loop voltage sources EM, driven by<br />

external currents.<br />

It should be noticed that the partitioned form <strong>of</strong> B<br />

B = B ; 1 includes an identity matrix for the co-tree<br />

<br />

T L<br />

columns. In addition, the first partition B can be easily<br />

T<br />

deduced by the reduced incidence matrix:<br />

T -1<br />

T T L<br />

B =- A A<br />

(9)<br />

Once loop currents are known, currents in each branch<br />

are easily computed as I = B IL.<br />

From branch currents, estimates <strong>of</strong> the other electrical<br />

quantities can be easily recovered using closed form<br />

expression for stick currents. This is the case, for<br />

example, <strong>of</strong> the flux density produced in any points <strong>of</strong>


space [5], or the total Ohmic power, or, finally, the net<br />

force acting on the structure.<br />

III. NUMERICAL EXAMPLES<br />

In this section, firstly a simple example is discussed to<br />

illustrate the various steps <strong>of</strong> the proposed method; then a<br />

more complex case is presented to show the effectiveness<br />

<strong>of</strong> the approach.<br />

1. As a first example, the field produced by a reinforced<br />

concrete beam near a power line is considered. The<br />

beam is 2.4 m long, with a transverse dimension <strong>of</strong> 30<br />

cm, a reinforcement diameter <strong>of</strong> 2 cm, a resistivity <strong>of</strong><br />

5x10 -5 m and is 5 m away from the line. The line<br />

carries 100 A <strong>of</strong> current at 50 Hz, and is assumed 20<br />

m long.<br />

The networks has 16 nodes; the 15 tree branches are<br />

branches 1-15 in Fig. 3, and the non-trivial part <strong>of</strong> the<br />

fundamental loop matrix BT is reported in table I.<br />

The currents in each branch can easily be computed<br />

using a linear system with 13 equations, and then<br />

post-processed to obtain estimates <strong>of</strong> the desired<br />

quantities.<br />

The highest currents are in the “longitudinal”<br />

branches #17 and #19 (1.4 mA), and #21 and #23 (1.5<br />

mA).<br />

A 2D FEM model, neglecting connecting elements in<br />

the mesh, and correcting conductivity to take into<br />

account the finite length <strong>of</strong> actual geometry, provides<br />

a current <strong>of</strong> 1.5 mA in the four “longitudinal” beams.<br />

The "disturbance" magnetic field produced by the<br />

beam is 1.72 nT at a point 10 cm away from the<br />

power line. Note that in the 2D FEM model, the<br />

reinforcement contribution was hidden by the<br />

numerical errors.<br />

2. The second example compares computational<br />

complexity in the case <strong>of</strong> a fully 3D geometry either<br />

using a commercial FEM package (COMSOL<br />

Multiphysics Ver. 4.2a, [10]) or using the proposed<br />

approach.<br />

The aim is to estimate total Ohmic losses in the<br />

metallic structure <strong>of</strong> the truss beam depicted in Fig. 1.<br />

The bridge is 16 m long, 4 m large and 4 m high.<br />

Each beam is a square with a 0.4 m side.<br />

The bridge is made <strong>of</strong> non magnetic structural steel,<br />

with a conductivity <strong>of</strong> 4x10 6 S/m.<br />

The excitation field is provided by a circular coil<br />

radius 5 m, hanging 10 m above the bridge. Of course<br />

such excitation is not realistic, but has been adopted<br />

for its ease <strong>of</strong> modelling with FEM package.<br />

The FEM model is solved with a mesh composed <strong>of</strong><br />

34,000 2 nd order tetrahedral elements, giving<br />

300,000 unknowns, and requires 165 s to be solved<br />

on a i7-based PC, with 4GB RAM.<br />

The total Ohmic losses are 4 mW using FEM model<br />

and 3.5 mW using the proposed method. A map <strong>of</strong><br />

losses density is reported in Fig.4.<br />

- 111 - 15th IGTE Symposium 2012<br />

0.3 m<br />

Figure 3: Graph <strong>of</strong> the equivalent network for case 1: capital letters<br />

indicate nodes, number indicate branches, dashed lines are co-tree<br />

branches<br />

Loops<br />

C<br />

3<br />

2<br />

B<br />

1<br />

16<br />

A<br />

19<br />

D<br />

18<br />

2.4 m<br />

G<br />

7<br />

6<br />

4<br />

F<br />

5<br />

17<br />

23<br />

H<br />

22<br />

20<br />

E<br />

TABLE I<br />

NON TRIVIAL PART OF THE FUNDAMENTAL LOOP MATRIX<br />

Branches<br />

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />

1 -1 -1 -1 0 0 0 0 0 0 0 0 0 0 0 0<br />

2 -1 -1 -1 -1 1 1 1 0 0 0 0 0 0 0 0<br />

3 0 -1 -1 -1 0 1 1 0 0 0 0 0 0 0 0<br />

4 0 0 -1 -1 0 0 1 0 0 0 0 0 0 0 0<br />

5 0 0 0 0 -1 -1 -1 0 0 0 0 0 0 0 0<br />

6 0 0 0 0 -1 -1 -1 -1 1 1 1 0 0 0 0<br />

7 0 0 0 0 0 -1 -1 -1 0 1 1 0 0 0 0<br />

8 0 0 0 0 0 0 -1 -1 0 0 1 0 0 0 0<br />

9 0 0 0 0 0 0 0 0 -1 -1 -1 0 0 0 0<br />

10 0 0 0 0 0 0 0 0 -1 -1 -1 -1 1 1 1<br />

11 0 0 0 0 0 0 0 0 0 -1 -1 -1 0 1 1<br />

12 0 0 0 0 0 0 0 0 0 0 -1 -1 0 0 1<br />

13 0 0 0 0 0 0 0 0 0 0 0 0 -1 -1 -1<br />

Figure 4: Ohmic Losses density plot for FEM solution <strong>of</strong> example 2.<br />

IV. CONCLUSIONS<br />

A prompt method to assess the effect <strong>of</strong> mechanical<br />

structures composed by interconnected conducting<br />

beams, exposed to low frequency magnetic fields, has<br />

been presented. The method is based on equivalent<br />

electric network analysis, and adopts a minimum order<br />

formulation to solve the network.<br />

ACKNOWLEDGEMENTS<br />

11<br />

L 26<br />

O<br />

15<br />

14<br />

P<br />

12<br />

N 28<br />

13 M<br />

The author wishes to thank pr<strong>of</strong>. R. Martone for<br />

fruitful discussions and for his support.<br />

This work has been partly supported by Seconda<br />

Università di Napoli, Italy, under PRIST grant<br />

“Generazione distribuita di energia da fonti tradizionali e<br />

8<br />

21<br />

K<br />

10<br />

J<br />

9<br />

27<br />

24<br />

I<br />

25


innovabili: aspetti ingegneristici e giuridici-economiciambientali”<br />

REFERENCES<br />

[1] A. Ruehli, "Equivalent Circuit Models for Three-Dimensional<br />

Multiconductor Systems", IEEE Trans. on Microwave Th. and<br />

Tech., vol. MTT-22, pp. 216-221, 1974.<br />

[2] W. Pinello, A. Ruehli, “Time Domain Solutions for Coupled<br />

Problems using PEEC Models with Waveform Relaxation”,<br />

<strong>Proceedings</strong> <strong>of</strong> Antennas and Propagation Society International<br />

Symposium AP-S. Digest, pp. 2118-2121, 1996.<br />

[3] A.Y. Wu, K.S. Sun, “Formulation and implementation <strong>of</strong> the<br />

current filament method for the analysis <strong>of</strong> current diffusion and<br />

heating in railguns and homopolar generators”, IEEE Trans. on.<br />

Mag., vol. 25, pp. 610-615, 1989.<br />

[4] B. Azzerboni, E. Cardelli, M. Raugi, ”Network mesh model for<br />

flux compression generators analysis, IEEE Trans. on Magn., vol.<br />

27, pp. 3951-3954, 1991.<br />

[5] H. A. Haus, J. R. Melcher, Electromagnetic Fields and Energy,<br />

Englewood Cliffs, NJ: Prentice Hall, 1989.<br />

[6] F. Grover, Inductance Calculation, New York: Van Nostrand,<br />

1946.<br />

[7] J. D. Hanson, S. P. Hirshman, “Compact expressions for the Biot–<br />

Savart fields <strong>of</strong> a filamentary segment”, Physics <strong>of</strong> Plasmas, vol.<br />

9, pp. 4410-4412, 2002.<br />

[8] L. Chua, I. Lin, Computer-Aided Analysis <strong>of</strong> Electronic Circuits,<br />

3rd ed., vol. 2. Oxford: Clarendon Press, 1982.<br />

[9] J. Nilsson, S. Riedel. Electric Circuits, Englewood Cliffs, NJ:<br />

Prentice Hall, 2010.<br />

[10] www.Comsol.com, last visited on Sept., 4 th 2012.<br />

- 112 - 15th IGTE Symposium 2012


- 113 - 15th IGTE Symposium 2012<br />

Effectiveness <strong>of</strong> the Preconditioned<br />

MRTR Method Supported by Eisenstat’s Technique<br />

in Real Symmetric Sparse Matrices<br />

*Yoshifumi Okamoto, *Tomonori Tsuburaya, † Koji Fujiwara, and *Shuji Sato<br />

*Department <strong>of</strong> Electrical and Electronic Systems Engineering, Utsunomiya <strong>University</strong><br />

7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />

† Department <strong>of</strong> Electrical Engineering, Doshisha <strong>University</strong><br />

1-3 Tataramiyakodani, Kyotanabe, Kyoto 610-0321, Japan<br />

E-mail: okamotoy@cc.utsunomiya-u.ac.jp<br />

Abstract—The Incomplete Cholesky Conjugate Gradient (ICCG) method is widely used to solve the indefinite algebraic<br />

equations obtained from the edge-based finite element method. However, when a linear solver based on the minimum<br />

residual without residual oscillations is used, there is a possibility <strong>of</strong> the elapsed time being shortened. This paper shows the<br />

effectiveness <strong>of</strong> the preconditioned Minimized Residual method based on the Three-term Recurrence formula <strong>of</strong> the CG-type<br />

(MRTR) method with Eisenstat’s technique by making a comparison with ICCG method for real symmetric sparse matrices.<br />

Index Terms— Eisenstat’s technique, ICCG method, MRTR method, split preconditioning.<br />

A x b,<br />

(1)<br />

I. INTRODUCTION<br />

where A is a large sparse n-by-n matrix, x is a solution n-<br />

The ICCG method [1] is widely used as a solver for a vector, and b is a n-vector. Now, suppose that the<br />

real symmetric indefinite linear equation derived from the diagonal scaling has already been applied to (1).<br />

edge-based finite element method. While the behavior <strong>of</strong> The recurrence formula for x in MRTR method is<br />

a residual in CG iterations is oscillatory, a monotonic designed using the expression<br />

decrease in the residual is mathematically ensured in<br />

MRTR method [2], which is an algorithm identical to that<br />

<strong>of</strong> Orthomin(2) [3] and which is based on the minimum<br />

residual. Therefore, there is a possibility that MRTR can<br />

solve linear equations faster than the CG method.<br />

IC factorization is widely recognized as a powerful<br />

preconditioner for a symmetric linear system. However, it<br />

may not necessarily be powerful in various magnetic<br />

field problems. Some <strong>of</strong> the split preconditioners such as<br />

symmetric Gauss-Seidel (SGS) and diagonal IC<br />

factorization (DIC) might be capable <strong>of</strong> improving the<br />

convergence characteristics <strong>of</strong> linear solvers.<br />

SGS and DIC preconditioners, in which the <strong>of</strong>fdiagonal<br />

components in the original linear system are<br />

directly used in the preconditioned matrix, can utilize the<br />

Eisenstat’s technique [4], in which the matrix-vector<br />

product can be replaced by forward and backward<br />

substitutions. Therefore, there is a possibility that SGS<br />

and DIC preconditioners supported by Eisenstat’s<br />

technique can reduce the elapsed time by reducing the<br />

computational cost <strong>of</strong> an iterative process.<br />

This paper shows the effectiveness <strong>of</strong> preconditioned<br />

MRTR method supported by Eisenstat’s technique for the<br />

xk 1 x0<br />

zk<br />

1,<br />

zk<br />

1<br />

K k ( A;<br />

r0<br />

) ,<br />

(2)<br />

where xk+1 is the solution vector in step (k + 1) in the<br />

iterative process and Kk(A;r0) is the Krylov subspace<br />

spanned by A and the initial residual vector r0. The<br />

residual vector rk+1 is comprehended in Kk+1(A;r0) as<br />

follows:<br />

rk 1<br />

b Axk 1<br />

r0<br />

Azk<br />

1<br />

K k 1(<br />

A;<br />

r0<br />

) . (3)<br />

Furthermore, the approximate solution (x0 + z) in step (k<br />

+ 1) satisfies the minimization condition as follows:<br />

min || b A(<br />

x0<br />

z)<br />

|| 2 min || r0<br />

Az<br />

|| 2 , (4)<br />

zS<br />

k<br />

zS<br />

k<br />

where Sk is a subspace comprehended by Kk(A;r0). Using<br />

a three-term recurrence formula involving Lanczos<br />

polynomials, the algorithm <strong>of</strong> MRTR method is given as<br />

follows:<br />

Algorithm 1 (MRTR method). Let x0 be an initial guess,<br />

and put r0 = b – Ax0. Set y0 = – r0 and 0 = (y0, y0).<br />

For k = 0, 1, 2, …, repeat the following steps until the<br />

condition ||rk||2/||b||2 < MR holds:<br />

(<br />

Ark<br />

, rk<br />

) / ( Ark<br />

, Ark<br />

)<br />

( k 0)<br />

<br />

k ( Ark<br />

, r )<br />

k<br />

k<br />

,<br />

( k 1)<br />

<br />

<br />

k ( Ark<br />

, Ark<br />

) ( yk<br />

, Ark<br />

)( Ark<br />

, yk<br />

)<br />

several linear systems derived from the edge-based finite<br />

element method in the magnetic field analysis.<br />

Comparisons have been made with other well-known<br />

preconditioned CG methods.<br />

0<br />

<br />

( yk<br />

, Ark<br />

)( Ark<br />

, r )<br />

k<br />

k<br />

<br />

<br />

k ( Ark<br />

, Ark<br />

) ( yk<br />

, Ark<br />

)( Ark<br />

, yk<br />

)<br />

k 1<br />

k ( Ark , rk<br />

),<br />

( k 0)<br />

,<br />

( k 1)<br />

II. PRECONDITIONED MRTR METHOD<br />

A. MRTR method<br />

The symmetric sparse linear system can be defined as<br />

follows:<br />

k 1<br />

pk rk<br />

k<br />

pk<br />

1,<br />

k<br />

xk 1<br />

xk<br />

k pk<br />

,<br />

yk 1<br />

<br />

k yk<br />

k Ark<br />

,


k 1<br />

rk<br />

yk<br />

1,<br />

where (u, v) denotes the inner product <strong>of</strong> vectors u and v.<br />

The above algorithm is mathematically equivalent to the<br />

conjugate residual (CR) method [5].<br />

TABLE I shows the computational cost <strong>of</strong> MRTR<br />

method, along with a comparison with the CG method.<br />

Au, (u, v), (u + v), and u denote a matrix-vector<br />

product, the inner product, the sum <strong>of</strong> vectors, and a<br />

scalar-vector product, respectively. The computational<br />

cost <strong>of</strong> MRTR method is nearly identical to that <strong>of</strong> the<br />

CG method owing to the same number <strong>of</strong> computations<br />

for the matrix-vector product.<br />

TABLE I<br />

COMPUTATIONAL COST OF LINEAR SOLVERS<br />

linear solver Au (u, v) u + v u<br />

CG 1 2 3 3<br />

MRTR 1 4 4 4<br />

B. Preconditioning<br />

MRTR method can be combined with split<br />

preconditioning techniques as long as the preconditioner<br />

M, which can be written in the form M = CC T (with C : a<br />

lower triangular matrix), is used. The preconditioned<br />

T<br />

matrix Aˆ<br />

1<br />

<br />

C AC retains the symmetry <strong>of</strong> A. Here,<br />

we utilize M and C derived using shifted IC factorization<br />

[6], DIC [1], and SGS preconditioning [7].<br />

IC preconditioner<br />

IC factorization is performed as follows:<br />

ˆ ˆ ˆ T<br />

, ˆ ˆ 1/<br />

2<br />

M LDL<br />

C LD<br />

,<br />

(5)<br />

i1<br />

<br />

2<br />

<br />

<br />

aii<br />

<br />

li<br />

k d k k ( i j),<br />

k 1<br />

lij j1<br />

(6)<br />

aij<br />

<br />

<br />

li<br />

kl<br />

j kd<br />

k k ( i j),<br />

k 1<br />

dii 1 / lii<br />

,<br />

(7)<br />

where lij and dii are components <strong>of</strong> Lˆ and Dˆ and is<br />

the shifted parameter. is determined by performing the<br />

following steps: 1. Set = 1.05. 2. Perform IC<br />

factorization. 3. If all diagonal components become<br />

positive, shifted IC factorization is stopped. Otherwise,<br />

return to step 1, add 0.05 to , and iterate steps 1-3.<br />

If (5) is used for forward and backward substitutions,<br />

there is a possibility <strong>of</strong> cache miss in backward<br />

substitution. Hence, M is modified as follows:<br />

ˆ ˆ ˆ 1<br />

( ) ( ˆ ˆ T<br />

M LD<br />

D LD)<br />

.<br />

(8)<br />

Therefore, the process <strong>of</strong> forward and backward<br />

substitutions to compute the unknown vector u can be<br />

described as follows:<br />

ˆ ˆ ˆ 1 ( ) ( ˆ ˆ T<br />

L D D LD)<br />

u v,<br />

(9)<br />

where v is a known vector and the diagonal components<br />

<strong>of</strong> LDˆ ˆ become 1.0. Forward and backward substitutions<br />

is performed by following a two-step procedure<br />

consisting <strong>of</strong><br />

ˆ ˆ<br />

ˆ 1<br />

( ) ,<br />

( ˆ ˆ T<br />

LD<br />

y v y D LD)<br />

u,<br />

(10)<br />

- 114 - 15th IGTE Symposium 2012<br />

ˆ ˆ T<br />

( LD)<br />

u Dˆ<br />

y.<br />

(11)<br />

Consequently, the computational cost can be reduced by<br />

using the forward substitution (10) and backward<br />

T<br />

substitution (11) instead <strong>of</strong> the expression M Lˆ<br />

Dˆ<br />

Lˆ<br />

.<br />

DIC preconditioner<br />

The large sparse matrix A can be split into three terms<br />

as follows:<br />

T<br />

A L I L ,<br />

(12)<br />

where L is the strictly lower triangular part <strong>of</strong> A and I is a<br />

unit matrix. The diagonal matrix Dˆ obtained using<br />

shifted IC factorization (see (6) and (7)) is utilized. Thus,<br />

M can be defined as follows:<br />

ˆ ˆ 1<br />

ˆ T<br />

( ) ( ) , ( ˆ ) ˆ 1/<br />

2<br />

M L D D L D C L D D . (13)<br />

Similar to the case <strong>of</strong> the IC preconditioner, the<br />

procedure for forward and backward substitution should<br />

be attentively schemed. The forward and backward<br />

substitution for the DIC preconditioner is designed to<br />

make the diagonal component 1.0:<br />

ˆ 1 ( ) ˆ ( ˆ 1<br />

T<br />

L D I D LD<br />

I)<br />

u v,<br />

(14)<br />

ˆ 1<br />

( ) , ˆ ( ˆ 1<br />

T<br />

LD I y v y D LD<br />

I ) u,<br />

(15)<br />

( LDˆ<br />

1<br />

T<br />

I ) u Dˆ<br />

1<br />

y.<br />

(16)<br />

SGS preconditioner<br />

Using (12), M for the SGS preconditioner can be<br />

defined as follows:<br />

T<br />

M ( L I ) ( L I ) , C L I.<br />

(17)<br />

Then, the algorithm <strong>of</strong> the preconditioned CG method is<br />

as follows:<br />

Algorithm 2 (Preconditioned CG method). Let x0 be M –1<br />

b, and put r0 = b – Ax0. Set p0 = M –1 r0 and u0 = p0.<br />

For k = 0, 1, 2, …, repeat the following steps until the<br />

condition ||rk||2/||b||2 < CG holds:<br />

Apk<br />

,<br />

( rk , uk<br />

) / ( pk<br />

, ),<br />

x x <br />

p ,<br />

k <br />

k 1<br />

k k k<br />

rk rk<br />

<br />

k,<br />

1<br />

uk<br />

M r<br />

,<br />

1<br />

1 k 1<br />

k ( rk 1,<br />

uk<br />

1)<br />

/ ( rk<br />

, uk<br />

),<br />

pk 1<br />

uk<br />

1<br />

k pk<br />

.<br />

The algorithm <strong>of</strong> preconditioned MRTR method [8] is as<br />

follows:<br />

Algorithm 3 (Preconditioned MRTR method). Let x0 be<br />

M –1 b, and put r0 = b – Ax0. Set u0 = M –1 r0, y0 = – r0, and<br />

z0 = M –1 y0.<br />

For k = 0, 1, 2, …, repeat the following steps until the<br />

condition ||rk||2/||b||2 < MR holds:<br />

1<br />

AM rk<br />

Auk<br />

,<br />

w<br />

1<br />

1<br />

1<br />

M AM rk<br />

M<br />

,<br />

(<br />

w,<br />

rk<br />

) / ( ,<br />

w)<br />

<br />

<br />

k ( w,<br />

r )<br />

k<br />

k<br />

<br />

<br />

k ( ,<br />

w)<br />

( yk<br />

, w)(<br />

w,<br />

y<br />

k<br />

)<br />

( k 0)<br />

,<br />

( k 1)


0<br />

<br />

( yk<br />

, w)(<br />

w,<br />

r )<br />

k<br />

k<br />

<br />

<br />

k ( ,<br />

w)<br />

( yk<br />

, w)(<br />

w,<br />

y<br />

( w,<br />

r ),<br />

k 1<br />

k k<br />

<br />

p <br />

k 1<br />

k uk<br />

k pk<br />

1<br />

k<br />

x x p<br />

k 1<br />

k k k<br />

yk 1<br />

<br />

k yk<br />

k,<br />

rk<br />

1<br />

rk<br />

yk<br />

1,<br />

zk 1<br />

<br />

k zk<br />

k w,<br />

u u z<br />

k 1<br />

k k 1.<br />

,<br />

,<br />

k<br />

)<br />

( k 0)<br />

,<br />

( k 1)<br />

C. Eisenstat’s technique<br />

Here, Eisenstat’s approach, in which the preconditioned<br />

matrix and vectors are mainly utilized, is applied to the<br />

preconditioned linear solvers in order to reduce the<br />

computational cost for the matrix-vector product.<br />

First, we apply Eisenstat’s technique to the DIC<br />

preconditioner using the expression ˆ 1<br />

( ) ˆ 1/<br />

2<br />

C LD<br />

I D ,<br />

and the preconditioned matrix-vector product Apˆ ˆ<br />

k can be<br />

transformed into<br />

Aˆ<br />

pˆ<br />

k<br />

ˆ 1/<br />

2 ˆ 1<br />

1<br />

T<br />

D ( LD<br />

I)<br />

( L I L )<br />

ˆ 1<br />

T<br />

ˆ 1/<br />

2<br />

( LD<br />

I)<br />

D pˆ<br />

k<br />

ˆ 1/<br />

2<br />

( ˆ 1<br />

1<br />

) {( ˆ 1<br />

D LD<br />

I LD<br />

I)<br />

Dˆ<br />

( 2Dˆ<br />

I)<br />

ˆ ( ˆ 1<br />

T<br />

) }( ˆ 1<br />

T<br />

) ˆ 1/<br />

2<br />

D LD<br />

I LD<br />

I D pˆ<br />

k<br />

ˆ 1/<br />

2<br />

( ˆ 1<br />

T<br />

) ˆ 1/<br />

2 ˆ ˆ 1/<br />

2<br />

( ˆ 1<br />

1<br />

D LD<br />

I D pk<br />

D LD<br />

I)<br />

{ ˆ 1/<br />

2 ˆ ( 2 ˆ )( ˆ 1<br />

T<br />

) ˆ 1/<br />

2<br />

D p<br />

ˆ<br />

k D I LD<br />

I D pk<br />

}<br />

ˆ 1/<br />

2 T<br />

1<br />

ˆ { ˆ 1/<br />

2 ˆ ( 2 ˆ T<br />

D C p<br />

) ˆ<br />

k C D pk<br />

D I C pk<br />

} .<br />

(18)<br />

It is shown that Apˆ ˆ<br />

k can be replaced by one backward<br />

substitution<br />

T<br />

C pˆ k and one forward substitution<br />

1<br />

ˆ 1/<br />

2<br />

{ ˆ ( 2 ˆ T<br />

C D p ) ˆ<br />

k D I C pk<br />

} . On the other hand, the<br />

formulation <strong>of</strong> Apˆ ˆ<br />

k with the SGS preconditioner is as<br />

follows:<br />

Aˆ<br />

pˆ<br />

k<br />

1<br />

T<br />

T<br />

( L I)<br />

( L I L ) ( L I)<br />

pˆ<br />

k<br />

1<br />

T<br />

T<br />

( L I)<br />

{( L I)<br />

I ( L I)<br />

} ( L I)<br />

pˆ<br />

k<br />

(19)<br />

T<br />

1<br />

T<br />

( L<br />

I)<br />

pˆ<br />

( ) { ˆ ( ) ˆ<br />

k L I pk<br />

L I pk<br />

}<br />

T<br />

C pˆ<br />

1<br />

C ( pˆ<br />

T<br />

C pˆ<br />

) .<br />

k<br />

k<br />

The applicable scope <strong>of</strong> Eisenstat’s technique is restricted<br />

to the preconditioned matrix, in which the lower<br />

triangular part <strong>of</strong> the original equation is used as it is. The<br />

preconditioned CG method supported by Eisenstat’s<br />

technique is as follows:<br />

Algorithm 4 (Preconditioned CG method supported by<br />

Eisenstat’s technique). Set<br />

<br />

( LDˆ<br />

C <br />

L I<br />

1<br />

I)<br />

Dˆ<br />

1/<br />

2<br />

k<br />

( DIC)<br />

.<br />

( SGS)<br />

ˆ0 0<br />

Let x0 be M –1 1<br />

b, and put r0 = b – Ax0. Set r C r ,<br />

pˆ rˆ<br />

.<br />

0<br />

0<br />

- 115 - 15th IGTE Symposium 2012<br />

For k = 0, 1, 2, …, repeat the following steps until the<br />

condition ||rk||2/||b||2 < CG holds:<br />

T<br />

u C pˆ<br />

k ,<br />

ˆ 1/<br />

2 1<br />

{ ˆ 1<br />

ˆ ˆ<br />

<br />

<br />

D u C D<br />

A pk<br />

1<br />

u C ( pˆ<br />

k u)<br />

( ˆ , ˆ ) / ( ˆ , ˆ ˆ<br />

k rk<br />

rk<br />

pk<br />

Apk<br />

),<br />

xk 1<br />

xk<br />

<br />

ku,<br />

r rˆ<br />

<br />

Ap ˆ ˆ ,<br />

ˆk 1<br />

k k k<br />

r ˆ k 1 C rk<br />

1,<br />

( ˆ , ˆ ) / ( ˆ , ˆ<br />

k rk 1<br />

rk<br />

1<br />

rk<br />

rk<br />

pˆ ˆ ˆ<br />

k 1<br />

rk<br />

1<br />

k pk<br />

.<br />

/ 2<br />

),<br />

pˆ<br />

( 2Dˆ<br />

I)<br />

u}<br />

k<br />

( DIC)<br />

,<br />

( SGS)<br />

Preconditioned MRTR method supported by Eisenstat’s<br />

technique is as follows:<br />

Algorithm 5 (Preconditioned MRTR method supported<br />

by Eisenstat’s technique). Set<br />

<br />

( LDˆ<br />

C <br />

L I<br />

1<br />

I)<br />

Dˆ<br />

1/<br />

2<br />

( DIC)<br />

.<br />

( SGS)<br />

Let x0 be M –1 1<br />

b and put r0 = b – Ax0. Set rˆ<br />

0 C r0<br />

,<br />

yˆ ˆ 0 r0<br />

.<br />

For k = 0, 1, 2, …, repeat the following steps until the<br />

condition ||rk||2/||b||2 < MR holds:<br />

T<br />

u C rˆ<br />

k ,<br />

ˆ 1/<br />

2 1 ˆ 1/<br />

2<br />

{ ˆ ( 2 ˆ<br />

ˆ<br />

) }<br />

ˆ<br />

<br />

<br />

D u C D rk<br />

D I u<br />

Ark<br />

1<br />

u C ( rˆ<br />

k u)<br />

(<br />

Aˆ<br />

rˆ<br />

ˆ ˆ ˆ ˆ ˆ<br />

k , rk<br />

) / ( Ark<br />

, Ark<br />

)<br />

<br />

<br />

( ˆ<br />

k ˆ , ˆ<br />

k Ark<br />

rk<br />

)<br />

<br />

( ˆ ˆ , ˆ ˆ ) ( ˆ , ˆ ˆ )( ˆ<br />

<br />

A A A Aˆ<br />

, ˆ<br />

k rk<br />

rk<br />

yk<br />

rk<br />

rk<br />

yk<br />

)<br />

0<br />

<br />

<br />

ˆ ˆ<br />

( yˆ<br />

ˆ ˆ ˆ<br />

k , Ark<br />

)( Ark<br />

, r )<br />

k<br />

k<br />

( ˆ ˆ , ˆ ˆ ) ( ˆ , ˆ ˆ )( ˆ<br />

<br />

A A A Aˆ<br />

, ˆ<br />

k rk<br />

rk<br />

yk<br />

rk<br />

rk<br />

y<br />

( ˆ ˆ , ˆ<br />

k 1<br />

k Ark rk<br />

),<br />

k 1<br />

pk u k<br />

pk<br />

1,<br />

<br />

x x p<br />

k 1<br />

k k k<br />

y <br />

yˆ<br />

Ar ˆ ˆ ,<br />

ˆ k 1<br />

k k k k<br />

rˆ<br />

ˆ ˆ<br />

k 1<br />

rk<br />

yk<br />

1,<br />

r ˆ k 1 C rk<br />

1.<br />

k<br />

,<br />

k<br />

)<br />

( DIC)<br />

,<br />

( SGS)<br />

( k 0)<br />

,<br />

( k 1)<br />

( k 0)<br />

( k 1)<br />

In the linear solver using Eisenstat’s technique, the lower<br />

triangular matrix-vector product C r ˆk 1<br />

is computed to<br />

evaluate the residual rk+1.<br />

D. Computational cost <strong>of</strong> preconditioned linear solvers<br />

TABLE II shows the computational cost <strong>of</strong><br />

preconditioned linear solvers. Au, Lu, L -1 u, and L -T u<br />

denote the matrix-vector product, the lower triangular<br />

matrix-vector product, forward substitution, and<br />

backward substitution, respectively. The abbreviations<br />

EDIC and ESGS represent the DIC and SGS<br />

,


preconditioners using Eisenstat’s technique, respectively.<br />

The computational cost <strong>of</strong> the preconditioned linear<br />

solvers using Eisenstat’s technique (EDIC and ESGS) is<br />

lower than that <strong>of</strong> the other preconditioned solvers by<br />

10 %. The reason why the computational cost does not<br />

reduce significantly when Eisenstat’s technique is used is<br />

the additional computation <strong>of</strong> the lower triangular matrixvector<br />

product Cr ˆk 1<br />

, whose cost is approximately equal<br />

to that <strong>of</strong> forward or backward substitution.<br />

TABLE II<br />

COMPUTATIONAL COST OF PRECONDITIONED LINEAR SOLVERS<br />

linear<br />

solver precond. Au Lu L-1u L -T u<br />

app. costs per one ite.<br />

(Au + Lu + L -1 u + L -T u)<br />

IC 1 0 1 1 1.0<br />

DIC 1 0 1 1 1.0<br />

CG EDIC 0 1 1 1 0.9<br />

SGS 1 0 1 1 1.0<br />

ESGS 0 1 1 1 0.9<br />

MRTR<br />

IC 1 0 1 1 1.0<br />

DIC 1 0 1 1 1.0<br />

EDIC 0 1 1 1 0.9<br />

SGS 1 0 1 1 1.0<br />

ESGS 0 1 1 1 0.9<br />

approximate computational costs:<br />

Au = 0.4, Lu = 0.3, L -1 u = 0.3, L -T u = 0.3<br />

III. ANALYSIS MODEL<br />

Figure 1 shows finite element meshes <strong>of</strong> model<br />

problems used for performing a magnetic field analysis.<br />

The unknown numbers in all meshes are determined by<br />

the absolute edge number based on the nodal number.<br />

Figure 1 (a) shows a box shield model [9] in which the<br />

magnetic shielding part is composed <strong>of</strong> four-layer finite<br />

elements in the thickness direction; the shielding<br />

thickness is 1 mm. Magnetostatic and eddy current<br />

analyses are carried out by considering the magnetic<br />

nonlinearity <strong>of</strong> SS400.<br />

Figures 1 (b) and (c) show the permanent-magnet-type<br />

MRI model [10]. For this model, magnetostatic field<br />

analysis is performed by considering the magnetic<br />

nonlinearity <strong>of</strong> the pole piece, yoke, and props. The 2ndorder<br />

hexahedral elements are <strong>of</strong> the Serendipity type.<br />

Finally, Figure 1 (d) shows the IPM motor (D-model)<br />

proposed by the IEEJ committee. A strongly coupled<br />

analysis is performed between the magnetic field and<br />

AC-driven three-phase circuit. The stator and overhung<br />

rotor are considered to be magnetically nonlinear, and the<br />

conductivity <strong>of</strong> the magnet is set to be 6.944 × 10 5 S/m.<br />

The number <strong>of</strong> revolutions per minute is set to 1500, and<br />

the pitch <strong>of</strong> the mechanical angle is 1°. The total number<br />

<strong>of</strong> time steps is set to 360.<br />

- 116 - 15th IGTE Symposium 2012<br />

TABLE III lists the analyzed conditions. The Newton-<br />

Raphson (NR) method, along with the line search<br />

technique based on functional minimization (0, 1.0) [11],<br />

is used as the nonlinear analysis method. GEAR’s<br />

implicit scheme [12], [13] <strong>of</strong> 2nd order is used for the<br />

discretization <strong>of</strong> the time domain based on the A-<br />

formulation.<br />

IV. NUMERICAL RESULTS<br />

A. Verification <strong>of</strong> computational accuracy<br />

The computational accuracy <strong>of</strong> preconditioned MRTR<br />

method is verified for the box shield model. Figure 2<br />

shows the analysis results. The magnetic flux density Bz<br />

in the z-direction on z-axis is shown in Figure 2 (a). The<br />

characteristic <strong>of</strong> the ICCG method coincide with that <strong>of</strong><br />

ESGS-MRTR. The relative error for two characteristics is<br />

less than 10 -4 % at points A and B. Similarly, the relative<br />

error for eddy current loss PJe as shown in Figure 2 (b) is<br />

less than 10 -4 % at these points. It is to be noted that other<br />

preconditioned linear solvers have similar characteristics.<br />

20<br />

240<br />

y<br />

75<br />

20d<br />

y<br />

(unit:mm) magnet: Br = 1.2 T<br />

z<br />

y z<br />

yoke: SS400<br />

z<br />

16<br />

100<br />

(unit: mm)<br />

coil:2 kAT<br />

magnetic shielding<br />

(SS400)<br />

x<br />

(a) Box shield model<br />

prop:<br />

SS400<br />

x gradientcoil<br />

polepiece:<br />

SS400<br />

13<br />

(unit:mm) magnet: Br = 1.2 T<br />

z<br />

240 20<br />

y<br />

75<br />

yoke: SS400<br />

13<br />

prop: SS400<br />

x<br />

gradientcoil<br />

polepiece<br />

: SS400<br />

(b) MRI (1st order tetra.) (c) MRI (2nd order hexa.)<br />

(unit : mm)<br />

45<br />

32.5<br />

TABLE III<br />

ANALYZED CONDITIONS<br />

x<br />

v<br />

w<br />

u<br />

v<br />

w<br />

u<br />

rotor core<br />

(50A350)<br />

stator core<br />

(50A350)<br />

30<br />

shaft<br />

(S45C)<br />

(d) IPM motor<br />

Figure 1: Finite element meshes.<br />

analysis model formul. discret. no. <strong>of</strong> nodes no. <strong>of</strong> elements DoF nonlinear circuit field<br />

box shield<br />

A<br />

A <br />

1st-hexa 72,900 67,980<br />

magnet<br />

enlarged view<br />

CG, MR || B<br />

|| 2<br />

197,472 static 10-3 10-2 <br />

206,427 time domain 10-3 10-2 <br />

MRI<br />

A 1st-tetra 279,090 49,813 323,965 static 10-3 10-3 A 2nd-hexa 93,879 87,120 1,014,600 static 10-3 10-3 IPM motor 1st-hexa 381,197 352,980 1,030,156 time domain 10-3 10-2 <br />

<br />

A


B z [T]<br />

ESGS-MRTR<br />

0.025<br />

0.020<br />

0.015<br />

0.010<br />

0.005<br />

ICCG<br />

point A<br />

point B<br />

0 0.05 0.10 0.15 0.20<br />

z [m]<br />

2.0<br />

1.5<br />

1.0<br />

P Je [W] ESGS-MRTR<br />

0.5<br />

0 0.01 0.02 0.03 0.04<br />

t [s]<br />

point B<br />

point A<br />

ICCG<br />

(a) (b)<br />

Figure 2: Some characteristics <strong>of</strong> the box shield model:<br />

(a) the Bz distribution in the z-axis direction and (b) the<br />

distribution <strong>of</strong> eddy current loss.<br />

log 10(||r (k) || 2 / ||b|| 2)<br />

log 10(||r (k) || 2 / ||b|| 2)<br />

1<br />

0 DIC-CG<br />

-1<br />

-2<br />

EDIC-CG<br />

DIC-MRTR<br />

EDIC-MRTR<br />

SGS-CG<br />

-3<br />

ICCG<br />

ESGS-CG<br />

-4<br />

IC-MRTR<br />

-5<br />

SGS-MRTR<br />

-6 ESGS-MRTR<br />

-7<br />

0 200 400 600<br />

iteration number k<br />

800 1000<br />

1<br />

0<br />

-1<br />

-2<br />

-3<br />

(a)<br />

DIC-CG<br />

EDIC-CG<br />

DIC-MRTR<br />

EDIC-MRTR<br />

SGS-CG<br />

ESGS-CG<br />

-4 IC-MRTR<br />

-5<br />

SGS-MRTR<br />

ESGS-MRTR<br />

-6<br />

ICCG<br />

-7<br />

0 150 300 450 600<br />

iteration number k<br />

750 900<br />

(b)<br />

Figure 3: Convergence characteristics <strong>of</strong> preconditioned<br />

linear solvers for the box shield model. (a) Magnetostatic<br />

field analysis and (b) eddy current analysis.<br />

B. Convergence characteristics and elapsed time<br />

Figure 3 shows the convergence characteristics <strong>of</strong> the<br />

box shield model, obtained by the magnetostatic and<br />

eddy current analyses. The characteristics are normalized<br />

by the initial norm <strong>of</strong> the residual in the 1st NR iteration.<br />

In MRTR method, the monotonic decrease in the residual<br />

has been mathematically proved; nevertheless, there are<br />

some noise spikes in the characteristics in the case <strong>of</strong><br />

preconditioned MRTR method. The generation <strong>of</strong> noise<br />

is likely to be caused by changes in the NR iteration.<br />

Noise generation is also observed for the preconditioned<br />

CG method. The characteristics <strong>of</strong> preconditioned MRTR<br />

method are superior to those <strong>of</strong> the preconditioned CG<br />

method because the monotonic decrease in the residual is<br />

mathematically guaranteed in the former method. The<br />

characteristics <strong>of</strong> the ESGS and EDIC preconditioners<br />

- 117 - 15th IGTE Symposium 2012<br />

linear<br />

solver<br />

CG<br />

MRTR<br />

linear<br />

solver<br />

CG<br />

MRTR<br />

TABLE IV<br />

ANALYSIS RESULTS FOR BOX SHIELD MODEL<br />

(a) MAGNETOSTATIC FIELD ANALYSIS<br />

precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />

IC 544 (1.00) 5 1.07 8.2 (1.00)<br />

DIC 979 (1.80) 5 1.06 13.8 (1.68)<br />

EDIC 979 (1.80) 5 1.07 13.0 (1.59)<br />

SGS 653 (1.20) 5 0.03 8.3 (1.01)<br />

ESGS 653 (1.20) 5 0.03 7.9 (0.96)<br />

IC 448 (0.82) 5 1.01 7.4 (0.90)<br />

DIC 812 (1.49) 5 1.09 12.4 (1.51)<br />

EDIC 812 (1.49) 5 1.06 11.5 (1.40)<br />

SGS 552 (1.01) 5 0.03 7.7 (0.94)<br />

ESGS 552 (1.01) 5 0.03 7.0 (0.85)<br />

(b) EDDY CURRENT ANALYSIS IN TIME DOMAIN (1ST TIME STEP)<br />

precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />

IC 503 (1.00) 4 1.07 8.6 (1.00)<br />

DIC 844 (1.68) 4 1.05 13.6 (1.58)<br />

EDIC 844 (1.68) 4 1.06 12.8 (1.49)<br />

SGS 595 (1.18) 4 0.03 8.7 (1.01)<br />

ESGS 595 (1.18) 4 0.03 8.2 (0.95)<br />

IC 393 (0.78) 4 1.00 7.4 (0.86)<br />

DIC 694 (1.38) 4 1.07 12.1 (1.41)<br />

EDIC 694 (1.38) 4 1.03 11.2 (1.30)<br />

SGS 470 (0.93) 4 0.03 7.4 (0.86)<br />

ESGS 470 (0.93) 4 0.03 6.8 (0.79)<br />

are consistent with those <strong>of</strong> SGS and DIC, respectively,<br />

and the characteristics <strong>of</strong> the DIC preconditioner are<br />

inferior to those <strong>of</strong> other preconditioned solvers. The IC-<br />

MRTR characteristics are the best among all<br />

preconditioned solvers. However, the elapsed time <strong>of</strong><br />

ESGS-MRTR is the shortest among all solvers, as can be<br />

seen in TABLE IV. The reason for this is the reduction in<br />

the computational cost when Eisenstat’s technique is<br />

used. All results are obtained by using a PC (CPU: Intel<br />

Core i7 2600K/4.2 GHz; memory: 16 GB). Following all<br />

problems are solved with the same hardware.<br />

Figure 4 shows the convergence characteristics <strong>of</strong> MRI<br />

models. The convergence characteristics <strong>of</strong><br />

preconditioned MRTR are superior to those <strong>of</strong> the<br />

preconditioned CG method. The DIC preconditioner is<br />

not very effective in improving the convergence<br />

characteristics. While the IC preconditioner is successful<br />

in the case <strong>of</strong> a tetrahedron, the SGS preconditioner is the<br />

most effective for a 2nd-order hexahedron. The<br />

effectiveness <strong>of</strong> the preconditioner depends on the target<br />

problem. TABLE V shows the analysis results for the<br />

MRI model. The elapsed time <strong>of</strong> ESGS-MRTR is the<br />

shortest among all preconditioned solvers.<br />

TABLE VI shows the analysis results for an IPM<br />

motor. The number <strong>of</strong> NR iterations is different for all<br />

solvers owing to the slight discrepancy in the converged<br />

solution in every time step. The elapsed time <strong>of</strong> ESGS-<br />

MRTR is the shortest among all linear solvers.<br />

V. CONCLUSION<br />

This paper shows the suitability <strong>of</strong> preconditioned<br />

MRTR method for solving an algebraic equation derived<br />

from the edge-based finite element method in a magnetic<br />

field. There is a possibility <strong>of</strong> reducing the elapsed time<br />

in the case <strong>of</strong> MRTR method by using the symmetric


Gauss-Seidel preconditioner supported by Eisenstat’s<br />

technique.<br />

log 10(||r (k) || 2 / ||b|| 2)<br />

log 10(||r (k) || 2 / ||b|| 2)<br />

1<br />

0<br />

-1<br />

DIC-CG<br />

EDIC-CG DIC-MRTR<br />

EDIC-MRTR<br />

-2<br />

ICCG<br />

-3<br />

SGS-CG<br />

-4<br />

-5<br />

IC-MRTR<br />

ESGS-CG<br />

-6 SGS-MRTR<br />

-7<br />

-8<br />

ESGS-MRTR<br />

-9<br />

0 200 400 600 800<br />

iteration number k<br />

1000<br />

1<br />

0<br />

-1<br />

-2<br />

-3<br />

(a)<br />

ICCG<br />

SGS-CG<br />

ESGS-CG<br />

DIC-MRTR<br />

EDIC-MRTR<br />

DIC-CG<br />

-4<br />

EDIC-CG<br />

-5<br />

IC-MRTR<br />

-6 SGS-MRTR<br />

-7 ESGS-MRTR<br />

-8<br />

0 900 1800 2700 3600<br />

iteration number k<br />

4500<br />

(b)<br />

Figure 4: Convergence characteristics <strong>of</strong> preconditioned<br />

linear solvers for the MRI model. (a) Tetrahedron and (b)<br />

2nd-order hexahedron.<br />

linear<br />

solver<br />

CG<br />

MRTR<br />

linear<br />

solver<br />

CG<br />

MRTR<br />

TABLE V<br />

ANALYSIS RESULTS FOR THE MRI MODEL<br />

(a) 1ST ORDER TETRAHEDRON<br />

precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />

IC 572 (1.00) 7 0.71 11.5 (1.00)<br />

DIC 958 (1.67) 7 0.73 18.6 (1.62)<br />

EDIC 958 (1.67) 7 0.72 17.6 (1.53)<br />

SGS 649 (1.14) 7 0.03 11.9 (1.03)<br />

ESGS 649 (1.14) 7 0.03 11.3 (0.98)<br />

IC 481 (0.84) 7 0.73 10.8 (0.94)<br />

DIC 777 (1.36) 7 0.72 16.7 (1.45)<br />

EDIC 777 (1.36) 7 0.74 15.3 (1.33)<br />

SGS 542 (0.95) 7 0.03 11.0 (0.96)<br />

ESGS 542 (0.95) 7 0.03 10.1 (0.88)<br />

(b) 2ND ORDER HEXAHEDRON<br />

precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />

IC 3,795 (1.00) 8 25.6 570.7 (1.00)<br />

DIC 4,305 (1.13) 8 25.2 643.5 (1.13)<br />

EDIC 4,219 (1.11) 8 25.3 585.4 (1.03)<br />

SGS 2,590 (0.68) 8 0.50 370.1 (0.65)<br />

ESGS 2,590 (0.68) 8 0.50 342.8 (0.60)<br />

IC 2,785 (0.73) 8 23.9 443.3 (0.78)<br />

DIC 3,357 (0.88) 8 25.7 530.9 (0.93)<br />

EDIC 3,357 (0.88) 8 23.3 470.3 (0.82)<br />

SGS 2,112 (0.56) 8 0.50 316.1 (0.55)<br />

ESGS 2,112 (0.56) 8 0.50 287.6 (0.50)<br />

- 118 - 15th IGTE Symposium 2012<br />

linear<br />

solver<br />

CG<br />

MRTR<br />

TABLE VI<br />

ANALYSIS RESULTS FOR THE IPM MOTOR<br />

precond. total linear ite. total NR ite. time for precond. [h] elapsed time [h]<br />

IC 1,666,584 (1.00) 4,070 (0.80) 4.84 39.1 (1.00)<br />

DIC 1,799,885 (1.08) 5,065 (1.00) 4.81 43.6 (1.12)<br />

EDIC 1,779,282 (1.07) 4,999 (0.99) 4.82 40.9 (1.05)<br />

SGS 1,400,867 (0.84) 4,996 (0.99) 0.03 29.7 (0.76)<br />

ESGS 1,406,236 (0.84) 4,993 (0.99) 0.03 28.6 (0.73)<br />

IC 931,375 (0.56) 4,450 (0.88) 5.24 26.6 (0.68)<br />

DIC 1,088,578 (0.65) 4,716 (0.93) 4.95 29.7 (0.76)<br />

EDIC 1,067,788 (0.64) 5,242 (1.03) 4.96 27.6 (0.71)<br />

SGS 866,637 (0.52) 5,178 (1.02) 0.04 19.6 (0.50)<br />

ESGS 864,545 (0.52) 5,162 (1.02) 0.04 18.2 (0.47)<br />

ACKNOWLEDGMENT<br />

The authors would like to thank Dr. K. Abe and Dr. Y.<br />

Takahashi for their advice and helpful comments. This<br />

work was supported by a Japan Society for the Promotion<br />

<strong>of</strong> Science (JSPS) Grant-in-Aid for Young Scientists (B)<br />

(Grant Number: 23760252).<br />

REFERENCES<br />

[1] J. A. Meijerink and H. A. van der Vorst, “An iterative solution<br />

method for linear systems <strong>of</strong> which the coefficient matrix is a<br />

symmetric M-matrix,” Mathematics <strong>of</strong> Computation, Vol. 31, No.<br />

137, pp. 148-162, Jan. 1977.<br />

[2] K. Abe, S.-L. Zhang, and T. Mitsui, “MRTR method: an iterative<br />

method based on the three-term recurrence formula <strong>of</strong> CG-type for<br />

nonsymmetric matrix,” The Japan Society for Industrial and<br />

Applied Mathematics, Vol. 7, No. 1, pp. 37-50, Mar. 1997. (in<br />

Japanese)<br />

[3] K. Abe and S.-L. Zhang, “A variant algorithm <strong>of</strong> the Orthomin(m)<br />

method for solving linear systems,” Appl. Math. Comput., Vol. 206,<br />

No. 1, pp. 42-49, Dec. 2008.<br />

[4] S. C. Eisenstat, “Efficient implementation <strong>of</strong> a class <strong>of</strong><br />

preconditioned conjugate gradient methods,” SIAM J. Sci. Stat.<br />

Comput., Vol. 2, No. 1, pp. 1-4, Mar. 1981.<br />

[5] S. C. Eisenstat, H. C. Elman, and M. H. Schultz, “Variational<br />

iterative methods for nonsymmetric systems <strong>of</strong> linear equations,”<br />

SIAM J. Numer. Anal., Vol. 20, No. 2, pp. 345-357, Apr. 1983.<br />

[6] K. Fujiwara, T. Nakata, and H. Fusayasu, “Acceleration <strong>of</strong><br />

convergence characteristic <strong>of</strong> the ICCG method,” IEEE Trans.<br />

Magn., Vol. 29, No. 2, pp. 1958-1961, Mar. 1993.<br />

[7] O. Axelsson, “A generalized SSOR method,” BIT Numerical<br />

Mathematics, Vol. 12, No. 4, pp. 443-467, Jul. 1972.<br />

[8] A. Shiode, S. Fujino, and K. Abe, “Preconditioning for symmetric<br />

positive definite matrices <strong>of</strong> MRTR method,” Trans. JSCES, No.<br />

20060007, pp. 231-237, Feb. 2006. (in Japanese)<br />

[9] A. Kameari, “Improvement <strong>of</strong> ICCG convergence for thin<br />

elements in magnetic field analyses using the finite-element<br />

method,” IEEE Trans. Magn., Vol. 44, No. 6, pp. 1178-1181, Jun.<br />

2008.<br />

[10] K. Miyata, K. Ohashi, A. Muraoka, and N. Takahashi, “3-D<br />

magnetic field analysis <strong>of</strong> permanent-magnet type <strong>of</strong> MRI taking<br />

account <strong>of</strong> minor loop,” IEEE Trans. Magn., Vol. 42, No. 4, pp.<br />

1451-1454, Apr. 2006.<br />

[11] Y. Okamoto, K. Fujiwara, and R. Himeno, “Exact minimization <strong>of</strong><br />

energy functional for NR method with line-search technique,” IEEE<br />

Trans. Magn., Vol. 45, No. 3, pp. 1288-1291, Mar. 2009.<br />

[12] C. W. GEAR, Numerical Initial Value Problems in Ordinary<br />

Differential Equations. Englewood Cliffs, NJ: Prentice-Hall, Inc.,<br />

1971.<br />

[13] Y. Okamoto, K. Fujiwara, and Y. Ishihara, “Effectiveness <strong>of</strong><br />

higher order time integration in time-domain finite-element<br />

analysis,” IEEE Trans. Magn., Vol. 46, No. 8, pp. 3321-3324, Aug.<br />

2010.


- 119 - 15th IGTE Symposium 2012<br />

High Frequency Mixing Rule Based Effective<br />

Medium Theory <strong>of</strong> Metamaterials<br />

Zsolt Szabó<br />

Department <strong>of</strong> Broadband Infocommunications and Electromagnetic Theory,<br />

Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics, Egry József 18, 1111 Budapest, Hungary,<br />

E-mail: szabo@evt.bme.hu<br />

Abstract— The electromagnetic response <strong>of</strong> metamaterials is governed by the collective behavior <strong>of</strong> engineered electric and<br />

magnetic dipoles. Therefore metamaterials may be replaced by hypothetical composites <strong>of</strong> spherical particles embedded in a<br />

host material. The effective electric permittivity and magnetic permeability <strong>of</strong> such systems can be computed with high<br />

frequency extension <strong>of</strong> the Maxwell-Garnett mixing rule. The validity <strong>of</strong> this assumption is discussed and as a benchmark the<br />

effective electromagnetic material parameters <strong>of</strong> a deep subwavelength spherical composite are calculated in three different<br />

ways: with the Maxwell-Garnett mixing rule, high frequency mixing rule and directly extracted from transmission reflection<br />

data. The developed theory is applied to find the parameters <strong>of</strong> a composite with similar magnetic response as a metamaterial<br />

built <strong>of</strong> split ring resonator.<br />

Index Terms—metamaterials, effective medium theory, Maxwell-Garnett mixing, Mie theory.<br />

I. INTRODUCTION<br />

Recently metamaterials are in focus <strong>of</strong> very intensive<br />

research and due to their unique properties are promising<br />

over the full electromagnetic spectrum [1-3]. The research<br />

<strong>of</strong> metamaterials has started with the goal <strong>of</strong> producing<br />

materials with negative refractive index i.e. simultaneous<br />

negative electric permittivity and magnetic permeability<br />

for imaging applications below the diffraction limit [4].<br />

However, the ultimate goal <strong>of</strong> the metamaterial research<br />

is to fabricate materials with arbitrarily configurable<br />

electric and magnetic properties. With an advance in<br />

micro- and nano-manufacturing techniques there are<br />

possibilities to produce subwavelength structures that can<br />

support symmetric and anti-symmetric modes. The<br />

associated current flow produces electric and magnetic<br />

dipole moments. A metamaterial with a customized<br />

optical response can be built as a superposition <strong>of</strong> such<br />

nano-elements. A very common design <strong>of</strong> an artificial<br />

material with tailored negative permittivity is the wire<br />

medium [5]. The most common designs to produce<br />

artificial magnetism are the variations <strong>of</strong> the split ring<br />

resonators [6] or pairs <strong>of</strong> nanorods [7]. The superposition<br />

<strong>of</strong> subwavelength structures with negative electric<br />

permittivity and magnetic permeability can lead to<br />

negative refractive index [8] even at optical frequencies<br />

[9]. However the losses and the finite size <strong>of</strong> the unit cell<br />

results in a cut<strong>of</strong>f frequency, limiting the applicability <strong>of</strong><br />

metamaterials. In addition toward optical frequencies it is<br />

increasingly challenging to fabricate the meta-structures,<br />

especially the negative magnetic response.<br />

The design <strong>of</strong> devices with metamaterials <strong>of</strong>ten<br />

requires the application <strong>of</strong> the effective medium theory.<br />

However robust effective metamaterial parameter<br />

extraction and homogenization are unsolved theoretical<br />

challenges <strong>of</strong> the metamaterial research. In spite <strong>of</strong><br />

considerable progress, researchers are still debating the<br />

fundamental issues and question the validity <strong>of</strong> the<br />

effective medium concept, which is considered by many<br />

as the Achilles-heel <strong>of</strong> this research field.<br />

In this paper it is argued that metamaterials can be<br />

homogenized when their electromagnetic response is<br />

governed by the excitation <strong>of</strong> electric and magnetic<br />

dipoles. The electromagnetic response <strong>of</strong> spherical<br />

particles can be replaced with static dipoles when the<br />

sphere is very small compared to the optical wavelength<br />

<strong>of</strong> the incident electromagnetic wave and by radiating<br />

dipoles, when the size is larger. The analytical formulas<br />

<strong>of</strong> the Mie theory explain precisely the scattering<br />

mechanism. Metamaterials may be equivalent to a<br />

properly chosen hypothetical composite <strong>of</strong> spherical<br />

particles embedded in a host material. Therefore well<br />

developed effective medium theories <strong>of</strong> composite<br />

materials can be applied to metamaterials. The validity <strong>of</strong><br />

this assumption is discussed and as a benchmark the<br />

effective electromagnetic material parameters <strong>of</strong> a deep<br />

subwavelength composite <strong>of</strong> spherical particles are<br />

calculated in three different ways: with the Maxwell-<br />

Garnett mixing rule, high frequency mixing rule and<br />

extracted directly from transmission reflection data. The<br />

developed theory is applied to find the parameters <strong>of</strong> a<br />

composite with similar magnetic response as a<br />

metamaterial built up <strong>of</strong> split ring resonator.<br />

II. EFFECTIVE MEDIUM THEORIES OF METAMATERIALS<br />

Several effective medium theories <strong>of</strong> metamaterials<br />

have been developed. In Fig. 1 two models <strong>of</strong><br />

metamaterial homogenization are presented. The effective<br />

metamaterial parameters can be extracted by replacing the<br />

electromagnetic response <strong>of</strong> the metamaterials with the<br />

electromagnetic response <strong>of</strong> a homogeneous isotropic slab<br />

is it is shown in Fig. 1.b. The model <strong>of</strong> Fig. 1.c replaces<br />

the metamaterial with the hypothetical composite <strong>of</strong><br />

spherical particles embedded in a host material. In both<br />

cases the electromagnetic properties can be determined in<br />

such a way that the metamaterial slab and the slab with<br />

the homogenized material parameters have the same<br />

reflection S 11 and transmission S 21 parameters.<br />

When the metamaterial is replaced with homogeneous<br />

slab, from the Fresnel relations the effective metamaterial<br />

parameters can be expressed. However the extracted wave<br />

impedance is exact only in the quasi static limit [10] and


the unique extraction <strong>of</strong> the refractive index is<br />

cumbersome due to the branching problem <strong>of</strong> the<br />

refractive index; that is the calculation <strong>of</strong> the refractive<br />

index involves the evaluation <strong>of</strong> a complex logarithm that<br />

is a multi-valued function. To remove this ambiguity, the<br />

Kramers–Kronig relation can be applied to estimate the<br />

refractive index from the extinction coefficient [11]. The<br />

physically realistic exact values <strong>of</strong> the refractive index are<br />

determined by selecting those branches <strong>of</strong> the logarithmic<br />

function which are closest to those predicted by the<br />

Kramers–Kronig relation. Finally from the wave<br />

impedance and from the refractive index the electric<br />

permittivity and the magnetic permeability can be<br />

calculated.<br />

(a)<br />

- 120 - 15th IGTE Symposium 2012<br />

x = ε μ ωr<br />

c , where ω is the angular frequency <strong>of</strong><br />

h h<br />

r r 0<br />

the incident radiation and c 0 is the speed <strong>of</strong> light in<br />

vacuum, provides the guideline for the validity <strong>of</strong> the<br />

Maxwell Garnett mixing rule, with the necessary<br />

condition x 1.<br />

However the limits <strong>of</strong> the Mixing-<br />

Garnett mixing rule can be extended. The Mie theory<br />

explains precisely the scattering mechanism <strong>of</strong> standalone<br />

spherical particles <strong>of</strong> any size and <strong>of</strong>fer analytic solution<br />

in form <strong>of</strong> infinite series [14]. When the magnetic<br />

permeability <strong>of</strong> the host material and <strong>of</strong> the spherical<br />

particle is equal, the Mie coefficients are<br />

mΨn( mx) Ψ′ n( x) −Ψn( x) Ψ′<br />

n(<br />

mx)<br />

an<br />

=<br />

,<br />

mΨ mx ξ′ x −ξ x Ψ′<br />

mx<br />

b<br />

n( ) n( ) n( ) n(<br />

)<br />

( mx) ′ ( x) m ( x) ′ ( mx)<br />

( mx) ξ′ ( x) mξ ( x) ′ ( mx)<br />

Ψ Ψ − Ψ Ψ<br />

=<br />

, (4)<br />

n n n n<br />

n<br />

Ψn n − n Ψn<br />

where m =<br />

i i<br />

ε r μr h h<br />

ε r μr<br />

is the contrast <strong>of</strong> the<br />

refractive index and n Ψ and ξ n are the Riccati-Bessel<br />

functions. The radiating electric and magnetic dipole<br />

polarizabilities correspond to the first terms <strong>of</strong> the<br />

expansion and can be expressed with the Mie scattering<br />

coefficients as<br />

3<br />

3<br />

3r<br />

3r<br />

α e = i a1,<br />

α 3 m = i b1.<br />

3<br />

2x<br />

2x<br />

(5)<br />

(b) (c)<br />

Figure 1: Homogenization models <strong>of</strong> metamaterials<br />

Substituting (5) in the Clausius-Mossotti relation leads to<br />

the expressions <strong>of</strong> the effective electric permittivity and<br />

with a similar argument to the expression <strong>of</strong> the effective<br />

magnetic permeability [15, 16]<br />

The Maxwell-Garnett mixing rule [10, 12, 13] can<br />

provide the effective electric permittivity <strong>of</strong> dilute, two<br />

component mixtures and it is derived with the assumption<br />

that the spherical inclusions can be replaced by static<br />

electric dipoles with polarizability<br />

3<br />

eff h x + 3iζa1(<br />

mx)<br />

εr = εr<br />

3<br />

x − 3 iζa1( mx<br />

2 )<br />

3<br />

eff h x + 3iζb1(<br />

mx)<br />

μr = μr<br />

3<br />

x − 3 iζb1( mx<br />

2 )<br />

,<br />

. (6)<br />

i h<br />

εr − εr<br />

3<br />

αe<br />

= r , i h<br />

εr + 2εr<br />

(1)<br />

where 1<br />

h<br />

where ε r is the electric permittivity <strong>of</strong> the host material,<br />

i<br />

ε r is the electric permittivity and r is the radius <strong>of</strong> the<br />

spherical inclusions. The connection between the<br />

eff<br />

polarizability and the effective electric permittivity ε r is<br />

given by the Clausius-Mossotti relation [13]<br />

eff h<br />

εr − εr ζ<br />

= α<br />

eff h 3 e , (2)<br />

εr + 2εr<br />

r<br />

where ζ is the filling factor <strong>of</strong> the spherical inclusion.<br />

When (1) is substituted in the Clausius-Mossotti relation<br />

it results in the Maxwell-Garnett mixing formula<br />

eff h i h<br />

εr −εr εr −εr<br />

= ζ . (3)<br />

eff h i h<br />

εr + 2εr εr + 2εr<br />

In this relation, the size <strong>of</strong> the spherical inclusions is not<br />

appearing in a direct way; the filling factor ζ is the only<br />

geometry factor in the Maxwell-Garnett formula. The<br />

static dipole approximation is valid only for spheres,<br />

which are very small compared to the optical wavelength<br />

<strong>of</strong> the incident electromagnetic wave. The size parameter<br />

a and b 1 are the first terms <strong>of</strong> the Mie scattering<br />

coefficients and i = − 1 is the imaginary unit. The<br />

evaluation <strong>of</strong> a 1 and b1 is trivial, because in (5) for<br />

n = 1 , the Riccati-Bessel functions and the derivatives<br />

can be expressed with simple expression <strong>of</strong> trigonometric<br />

functions as<br />

sin ρ<br />

Ψ 1 ( ρ) = − cos ρ ,<br />

ρ<br />

1 cosρ<br />

Ψ ′ 1 ( ρ) = sin ρ1−<br />

2 +<br />

,<br />

ρ ρ<br />

cos ρ <br />

ξ1( ρ) =Ψ1( ρ) − i + sin ρ<br />

ρ ,<br />

1 <br />

ξ′ 1( ρ) =Ψ ′ 1( ρ) + i Ψ 1( ρ) + cos ρ 2 <br />

ρ .<br />

When the size <strong>of</strong> the spherical inclusions is not small<br />

enough to be replaced with static dipoles, but it is small<br />

enough to disregard all higher order modes <strong>of</strong> (4) then the<br />

high frequency mixing formulas (6) are applicable. Note<br />

that the resonance based magnetic metamaterials are<br />

working under similar conditions [1]. Metamaterials has


finite unit cell sizes, and especially magnetic<br />

metamaterials has unit cells, which are not deep<br />

subwavelength. The strength <strong>of</strong> the resonance decreases<br />

with the size <strong>of</strong> the unit cell and the resonance is not<br />

strong enough to produce negative permeability for<br />

structures with deep sub-wavelength elements.<br />

Metamaterials with larger unit cell can support higher<br />

order modes at frequencies, which are just slightly<br />

different than the frequency region where the double<br />

negative behavior occurs. Special care must be taken<br />

when metamaterial parameters are extracted directly from<br />

transmission reflection data, and it is not sufficient to<br />

enforce the continuity <strong>of</strong> the refractive index, because we<br />

may extract erroneous effective metamaterial parameters<br />

for frequency regions where they do not even exist. The<br />

high frequency mixing, which is based on the Mie theory<br />

provides estimate for the limits <strong>of</strong> the homogenization.<br />

III. EFFECTIVE MATERIAL PARAMETERS OF COMPOSITE<br />

WITH SPHERICAL METALLIC INCLUSIONS<br />

In this section the effective parameters <strong>of</strong> the<br />

composite material with the unit cell illustrated in Fig. 2.a<br />

are calculated. This composite serves as benchmark to<br />

compare the effective material parameters calculated with<br />

the Maxwell Garnett mixing rule, the high frequency<br />

mixing rule and directly extracted from transmission<br />

reflection data. The geometry and the composition are<br />

selected such that the size parameter <strong>of</strong> the spheres at<br />

optical frequencies satisfies the condition x 1.<br />

The<br />

length <strong>of</strong> the cubic unit cell is 15 nm and the radius <strong>of</strong> the<br />

sphere is 3 nm. The spherical inclusions are made <strong>of</strong> Ag<br />

and are embedded in SiO2 host and the calculations take<br />

into account the frequency dispersion <strong>of</strong> the materials<br />

parameters. Fig. 2.b presents the electric permittivity <strong>of</strong><br />

the Ag inclusions, and Fig. 2.c plots the electric<br />

permittivity <strong>of</strong> the SiO2 host [17, 18]. The composite is<br />

considered infinitely large in the x and y directions (see<br />

Fig. 2.a) and only-one-unit-cell thick in the z direction.<br />

The Maxwell-Garnett type mixing rules do not require<br />

cubic unit cells; the requirement is that the inclusions are<br />

separated. For periodically arranged spherical inclusions,<br />

when the filling factor is high, the Maxwell-Garnett<br />

mixing rule has to be modified [12]. On the other hand<br />

disorder and inaccuracy <strong>of</strong> shapes destroys the collective<br />

effects and extends the limits <strong>of</strong> the theory.<br />

The aim <strong>of</strong> the calculation is to determine the effective<br />

parameters <strong>of</strong> this composite in the frequency range from<br />

0.4 to 1 PHz. The calculations <strong>of</strong> the transmission<br />

reflection data (S-parameters) <strong>of</strong> this paper are performed<br />

with the frequency-domain solver <strong>of</strong> the commercial<br />

s<strong>of</strong>tware CST Microwave Studio [19]. Due to periodicity,<br />

one unit cell with perfect electric conducting and perfect<br />

magnetic conducting boundary conditions in the x and the<br />

y directions is sufficient to calculate the S-parameters<br />

below the frequencies where diffraction occurs. In the z<br />

direction additional air regions are added to the<br />

computational space by positioning waveguide ports at<br />

one-unit-cell distance from the surface <strong>of</strong> the composite.<br />

The fundamental mode <strong>of</strong> the waveguide ports is excited<br />

to launch a plane wave, which is propagating along the z<br />

direction; at the same time the waveguide ports act as<br />

- 121 - 15th IGTE Symposium 2012<br />

absorbing boundary condition and permits the automatic<br />

calculation <strong>of</strong> the S-parameters [19]. The online algorithm<br />

[20] is applied to extract the electromagnetic parameters<br />

from the S parameters. To get a good estimate for the<br />

Kramers–Kronig integral, the simulations cover the 0.25–<br />

1.25 PHz frequency interval. When this frequency<br />

interval is even larger, the accuracy <strong>of</strong> the Kramers–<br />

Kronig approximation does not change noticeably in the<br />

frequency range <strong>of</strong> interest.<br />

(a)<br />

(b)<br />

(c)<br />

Figure 2: The geometry <strong>of</strong> the composite material is<br />

shown in (a), the electric permittivity <strong>of</strong> the spherical<br />

inclusions made <strong>of</strong> Ag is presented in (b) and the electric<br />

permittivity <strong>of</strong> the SiO2 host materials is plotted in (c).<br />

Fig. 3.a and 3.b presents the magnitude and phase <strong>of</strong><br />

2 2<br />

the S-parameters. The absorption A = 1− S11<br />

− S21<br />

in<br />

function <strong>of</strong> frequency is plotted as well, showing a<br />

resonant peek at f 1 = 0.7192 PHz.<br />

The effective electric permittivity <strong>of</strong> the composite<br />

material, which is calculated in three different ways, with<br />

the high frequency mixing rule, with the Maxwell-Garnett<br />

mixing and extracted from the S-parameters, are<br />

presented in Fig. 4. The magnetic permeability is obtained<br />

from the high frequency mixing rule and it is extracted<br />

from the S-parameters as well, and it has values close to<br />

one over the frequency range <strong>of</strong> interest. Comparing the<br />

real and imaginary parts <strong>of</strong> the electric permittivity<br />

obtained with the three different methods, a very good<br />

agreement can be observed. The peak in the imaginary<br />

part <strong>of</strong> the electric permittivity corresponds to the<br />

absorption peek <strong>of</strong> Fig. 3.a, which reveals that it is<br />

electric resonance. The electric permittivity has Lorentz<br />

shape and can be successfully fitted with a single<br />

oscillator model


( − )<br />

2<br />

εrs εr∞ω0 εr ( ω) = εr∞+<br />

. (7)<br />

2 2<br />

ω0+ iδω−ω<br />

where the static electric permittivity ε rs = 2.379 , the<br />

electric permittivity at very high frequencies ε r∞<br />

= 2.23 ,<br />

the resonant frequency ω0= 2π ⋅ 0.7228 rad/fs and<br />

damping constant δ = 0.33 1/fs.<br />

(a)<br />

(b)<br />

Figure 3: The S-parameters <strong>of</strong> the one-unit-cell thick<br />

composite material. In (a), the magnitude, and in (b) the<br />

phase <strong>of</strong> the S-parameters is plotted. Note the absorption<br />

peek at f 1 = 0.7192 PHz.<br />

Figure 4: The effective electric permittivity <strong>of</strong> the<br />

composite material calculated with the high frequency<br />

mixing rule, Maxwell-Garnett mixing and extracted from<br />

the S-parameters.<br />

The electric permittivity at the frequency <strong>of</strong> the<br />

absorption peek is ε = 2.5319 + 2.0465i<br />

and the<br />

r<br />

- 122 - 15th IGTE Symposium 2012<br />

corresponding optical wavelength is<br />

λ opt = c0 ( n f1)<br />

= 245.04 nm, which is much larger than<br />

any characteristic dimension <strong>of</strong> the composite (the size <strong>of</strong><br />

the unit cell is 15 nm), showing that the resonant behavior<br />

is related to the composition rather than structuring.<br />

IV. EQUIVALENT COMPOSITES OF METAMATERIALS<br />

DESIGNED WITH THE HIGH FREQUENCY MIXING RULE<br />

In this section the equivalent composite <strong>of</strong> a magnetic<br />

metamaterial is determined. The geometry <strong>of</strong> the<br />

metamaterial is the well studied split ring resonator [1, 2,<br />

3, 8] as it is shown in Fig. 5. The dimensions and the<br />

material parameters are the same as in [8]. The size <strong>of</strong> the<br />

cubic unit cell is 5 mm, the split ring resonators are made<br />

<strong>of</strong> copper, the outer length <strong>of</strong> the exterior split ring<br />

resonators is 3 mm, the width <strong>of</strong> both split rings is<br />

0.25 mm, the thickness is 0.02 mm, the size <strong>of</strong> the gaps<br />

and the distance between the split ring resonators is<br />

0.5 mm. The substrate is made <strong>of</strong> dielectric with<br />

ε r = 3.84 and the thickness <strong>of</strong> the substrate is 0.25 mm.<br />

The metamaterial is periodic in the direction<br />

perpendicular to the propagation <strong>of</strong> the electromagnetic<br />

wave (z direction), the electric field is polarized in y<br />

direction, which means that the magnetic field is<br />

perpendicular to the plane <strong>of</strong> the split ring resonators. The<br />

metamaterial <strong>of</strong> [8] was designed to experimentally<br />

demonstrate the negative refraction. The role <strong>of</strong> the splitring<br />

resonators is to provide the negative magnetic<br />

response, while additional copper wires placed on the<br />

back side <strong>of</strong> the substrate are responsible for producing<br />

the negative electric permittivity, leading to a negative<br />

refractive index at a frequency <strong>of</strong> 10 GHz. In our<br />

numerical simulations the metamaterial is only-one-unitcell<br />

thick.<br />

Figure 5: The unit cell <strong>of</strong> the magnetic metamaterial slab<br />

is composed <strong>of</strong> metallic split-ring resonators.<br />

The reflection, transmission and absorption spectrum<br />

<strong>of</strong> the double negative metamaterial [8] is presented in<br />

Fig. 8.a, while in Fig. 8.b the electromagnetic response <strong>of</strong><br />

the split ring resonators is shown. The simulations reveal<br />

that the position <strong>of</strong> the resonant peek at 10 GHz is not<br />

changed by removing the wires; nevertheless the shape <strong>of</strong><br />

the transmission and reflection curves is greatly affected.


The effective parameters <strong>of</strong> the magnetic metamaterial<br />

built <strong>of</strong> split ring resonators are extracted from the Sparameters<br />

with [20] and are shown in Fig. 7.a. The<br />

magnetic permeability has Lorentz shape with negative<br />

values and it is similar to the magnetic permeability <strong>of</strong><br />

[8]. The effective electric permittivity has a shape <strong>of</strong> antiresonance,<br />

which may be an artifact caused by the<br />

replacement <strong>of</strong> the anisotropic metamaterial structure with<br />

the homogenized model <strong>of</strong> isotropic slab.<br />

(a)<br />

(b)<br />

Figure 6: In (a) the reflection, transmission and<br />

absorption spectrum <strong>of</strong> the double negative metamaterial<br />

is presented, while in (b) the electromagnetic response <strong>of</strong><br />

the split ring resonators is shown.<br />

Spectral fitting is carried out to find the parameters <strong>of</strong><br />

the composite, which is magnetically equivalent to the<br />

metamaterial, built <strong>of</strong> split ring resonators. The<br />

parameters <strong>of</strong> the high frequency mixing rule, the radius<br />

and the electric permittivity <strong>of</strong> the spherical inclusions,<br />

the filling factor and the electric permittivity <strong>of</strong> the host<br />

material are determined by minimizing the mean square<br />

error,<br />

2 2<br />

N HF TR HF TR<br />

Re( μri ) Re( μri ) Im ( μri) Im ( μ <br />

− − ri ) <br />

+ <br />

TR TR <br />

i= 1 Re( μri ) Im(<br />

μri<br />

) <br />

<br />

Ω=<br />

<br />

2N<br />

where N is the number <strong>of</strong> data points in the spectra, Re()<br />

and Im() return the real and imaginary parts <strong>of</strong> the<br />

magnetic permeability<br />

μ extracted from the S-<br />

TR<br />

r<br />

parameters or HF<br />

μ r calculated with the high frequency<br />

- 123 - 15th IGTE Symposium 2012<br />

mixing rule. The minimization is performed with the<br />

differential evolution algorithm [21]. The minimization<br />

provides r = 2.31 mm for the radius <strong>of</strong> the spherical<br />

inclusions, the electric permittivity <strong>of</strong> the inclusions is<br />

i<br />

ε r = 37.67 , the filling factor is ζ = 0.13 and the electric<br />

h<br />

permittivity <strong>of</strong> the host material is ε r = 1.<br />

Note that the<br />

results <strong>of</strong> this optimization are implementable; several<br />

materials exist at microwave frequencies with even higher<br />

electric permittivity and are available as powder or<br />

suspension [7].<br />

(a)<br />

(b)<br />

Figure 7: In (a) the effective magnetic permeability and<br />

electric permittivity <strong>of</strong> the metamaterial made <strong>of</strong> split-ring<br />

resonators extracted from S-parameters is shown. In (b)<br />

the material parameters <strong>of</strong> the equivalent composite are<br />

presented.<br />

The real and imaginary parts <strong>of</strong> the magnetic<br />

permeability and the electric permittivity <strong>of</strong> the equivalent<br />

composite are plotted in Fig. 7. b. As it can be seen there<br />

is a good agreement between the effective magnetic<br />

permeability <strong>of</strong> the metamaterial and the permeability <strong>of</strong><br />

the composite. Comparing the real part <strong>of</strong> the electric<br />

permittivities it can be observed that they are comparable<br />

TR<br />

at low frequencies, for example at 5 GHz ε = 1.57 and<br />

ε = 1.43 , even though there is no optimization goal<br />

HF<br />

r<br />

formulated for permittivity in the mean square error <strong>of</strong> the<br />

minimization procedure. On the other hand there is no<br />

anti-resonant behavior in the electric permittivity <strong>of</strong> the<br />

composite in the frequency region <strong>of</strong> the magnetic<br />

resonance. The magnetic resonance <strong>of</strong> the composite is<br />

followed by electric resonance, which appears at the<br />

r


upper end <strong>of</strong> the investigated frequency region. To move<br />

the electric resonance outside <strong>of</strong> this frequency region,<br />

the bounds <strong>of</strong> the optimization parameters were changed<br />

and several minimization runs were performed. As a<br />

result it can be observed that the model does not provide<br />

enough freedom to maintain the strength and the position<br />

<strong>of</strong> the magnetic resonance and at the same time to change<br />

the position <strong>of</strong> the electric resonance to higher<br />

frequencies. The extension <strong>of</strong> the high frequency model to<br />

ellipsoidal particles may solve this issue.<br />

In Fig. 8 the magnitudes <strong>of</strong> the S-parameters for the<br />

metamaterial built <strong>of</strong> split ring resonator and those for the<br />

equivalent composite are presented. The difference<br />

between the curves is due to the difference in electric<br />

permittivities. The correspondence may be improved by<br />

considering frequency dependent material parameters.<br />

Figure 8: Comparison between the magnitudes <strong>of</strong> the<br />

S-parameters <strong>of</strong> the metamaterial built <strong>of</strong> split ring<br />

resonator and the S-parameters <strong>of</strong> the equivalent<br />

composite.<br />

V. CONCLUSIONS<br />

High frequency mixing rule, which is based on the<br />

Clausius-Mossotti relation and the first terms <strong>of</strong> the Mie<br />

expansion corresponding to radiating dipoles has been<br />

applied to characterize composites and metamaterials. To<br />

validate the model the effective electric permittivity and<br />

magnetic permeability <strong>of</strong> a deep subwavelength<br />

composite were calculated and it was shown that similar<br />

results are produced by the high frequency mixing rule,<br />

the Maxwell-Garnett mixing rule or extracted directly<br />

from the S-parameters.<br />

The developed high frequency model can open<br />

alternative ways to engineer required electromagnetic<br />

properties. It was shown that equivalent composite, which<br />

has similar effective magnetic permeability, can be<br />

assigned to the magnetic metamaterial built <strong>of</strong> split ring<br />

resonators.<br />

VI. ACKNOWLEDGEMENT<br />

This work has been supported by the János Bolyai<br />

Research Fellowship <strong>of</strong> the Hungarian Academy <strong>of</strong><br />

Sciences and OTKA 105996.<br />

- 124 - 15th IGTE Symposium 2012<br />

[1]<br />

REFERENCES<br />

L. Solymár and E. Shamonina, Waves in Metamaterials, Oxford,<br />

<strong>University</strong> Press, 2009.<br />

[2] Marqués R., Martín F., Sorolla M., Metamaterials with<br />

NegativeParameters. John Willey and Sons, 2008.<br />

[3] N. Engheta, R. W. Ziolkowski, Metamaterials Physics and<br />

Engineering Applications, John Willey and Sons, 2006.<br />

[4] J. B. Pendry, Negative Refraction Makes a Perfect Lens, Physical<br />

Review Letters, vol. 85, no. 18, pp. 3966–3969, 2000.<br />

[5] J.B. Pendry, A.J. Holden, D.J. Robbins, and W.J. Stewart,<br />

Magnetism from Conductors and Enhanced Non-Linear<br />

[6]<br />

Phenomena, IEEE Transactions on Microwave Theory and<br />

Techniques, vol. 47, 2075, 1999.<br />

D. R. Smith, W. Padilla, D. Vier, S. Nemat-Nasser and S. Schultz,<br />

Composite Medium with Simultaneously Negative Permeability<br />

and Permittivity, Phys. Rev. Lett., vol. 84, p. 4184, 2000.<br />

[7] V. M. Shalaev, Optical negative-index metamaterials, Nature<br />

Photonics, vol. 1, pp. 41-48, 2006.<br />

[8] R. A. Shelby, D. R. Smith, S. Schultzm, Experimental<br />

Verification <strong>of</strong> a Negative Index <strong>of</strong> Refraction Science, vol 292,<br />

pp. 77-79, 2001.<br />

[9] G. Dolling, M. Wegener, C. Soukoulis and M. S. Linden,<br />

Negative-index metamaterial at 780 nm wavelength, Opt. Let.,<br />

vol. 32, no. 1, pp. 53-55, 2007.<br />

[10] A. F. de Baas (editor), Nanostructured Metamaterials, European<br />

Comission, 2010.<br />

[11] Zs. Szabó, G.-H. Park, R. Hedge, and E.-P. Li, “A unique<br />

extraction <strong>of</strong> metamaterial parameters based on Kramers-Kronig<br />

relationship,” IEEE Trans. Microwave Theory Tech., vol. 58, no.<br />

10, pp. 2646-2653, 2010.<br />

[12] A. Sihvola, Electromagnetic Mixing Formulas and Applications,<br />

The Institution <strong>of</strong> Electrical Engineers, London, United Kingdom,<br />

1999.<br />

[13] D. E. Aspnes, Local-field effects and effective-medium theory: A<br />

microscopic perspective, Am. J. Phys., vol. 50, no. 8, pp. 704-709,<br />

1982.<br />

[14] C. F. Bohren, D. R. Huffman, Absorption and Scattering <strong>of</strong> Light<br />

by Small Particles, Wiley-VCH, 2004.<br />

[15] R. Ruppin, Evaluation <strong>of</strong> extended Maxwell-Garnett theories,<br />

Optics Communications, vol 182, pp. 273–279, 2000.<br />

[16] C. A. Grimes, D. M. Grimes, Permeability and permittivity<br />

spectra <strong>of</strong> granular materials, Phys. Rev. B, vol. 43, pp. 10780–<br />

10788, 1991.<br />

[17] E.D. Palik and G.K. Ghosh, Editors, Handbook <strong>of</strong> Optical<br />

Constants <strong>of</strong> Solids, Academic Press, New York, 1997.<br />

[18] [Online] http://www.sspectra.com/sopra.html<br />

[19] [Online] www.cst.com<br />

[20] [Online] http://effmetamatparam.sourceforge.net/<br />

[21] K. V. Price, R. M. Storn, J. A. Lampinen, Differential Evolution,<br />

A Practical Approach to Global Optimization, Springer, 2005.


- 125 - 15th IGTE Symposium 2012<br />

Enhancement <strong>of</strong> Maximum Starting Torque and<br />

Efficiency in Permanent Magnet Synchronous Motors<br />

Jawad Faiz, Vahid Ghorbanian and Bashir Mahdi Ebrahimi<br />

Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer<br />

Engineering, College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran 1439957131, Iran<br />

(e-mail: jfaiz@ut.ac.ir)<br />

Abstract— This paper presents a new algorithm for enhancement <strong>of</strong> maximum starting torque and steady-state efficiency in<br />

permanent magnet (PM) motors. This algorithm includes two strategies which are used to raise starting torque and decrease<br />

losses in PM motors. Therefore, transient and steady-state operations <strong>of</strong> the PM motor are improved. It is essential to model<br />

the core in the efficiency estimation <strong>of</strong> losses <strong>of</strong> the PM motor. Appropriate control coefficients based on the introduced<br />

algorithm are set for two aforementioned goals. Simulation results are presented the competency <strong>of</strong> the proposed algorithm.<br />

Index Terms— PM motor, control strategy, starting torque, losses minimization, efficiency.<br />

I. INTRODUCTION<br />

Application <strong>of</strong> permanent magnet (PM) motors is<br />

increasing due to the advanced technology in PM<br />

manufacturing, and its high power density, improved<br />

power factor and high efficiency. Some applications such<br />

as electric and hybrid vehicles with huge start-stop<br />

actions need high starting torque to quickly accelerate the<br />

vehicles. Enhancement <strong>of</strong> the starting torque involves the<br />

increase <strong>of</strong> the stator windings currents leading to<br />

windings temperature rise. Therefore, the stator current<br />

must be limited. Since the performance improvement <strong>of</strong><br />

these motors has considerable effects upon the electrical<br />

power consumption over long time applications, the<br />

instantaneous optimal control and appropriate operating<br />

point over different loads and speeds should be<br />

considered.<br />

Nowadays, a wide range <strong>of</strong> the motor speeds and<br />

torques are achieved by application <strong>of</strong> vector control<br />

methods in the motors [1]-[4]. These control methods<br />

provide stability and precise required speed and accurate<br />

response because <strong>of</strong> the feedback in the motor which can<br />

control the flux and torque independently [5]. The<br />

controllable quantities are id and iq currents. By<br />

controlling current id, the motor flux and consequently<br />

speed is adjusted and by controlling current iq, the steadystate<br />

output torque is regulated [6]. In [7], [8], a method<br />

has been introduced to control the torque <strong>of</strong> the PM<br />

machine by limiting the windings currents. This machine<br />

has been used as a generator <strong>of</strong> a wind turbine. When the<br />

wind speed is higher than that <strong>of</strong> the rated speed, torque<br />

rises and the windings insulation may fail. In some<br />

methods [7], there is no need to have a mechanical sensor.<br />

Since magnetic saturation has considerable impact on the<br />

motor behavior at high currents; the saturation effects are<br />

approximately modeled in this paper. Meanwhile, strategy<br />

<strong>of</strong> maximum torque control is normally applied to the<br />

motor in the non-starting case [3].<br />

Different strategies have been so far applied to improve<br />

the efficiency <strong>of</strong> the PM motor. In [9], efficiency <strong>of</strong> the<br />

motor has been improved through introducing a new teeth<br />

and slot structure. In [7], [12], a method based on the dq<br />

model <strong>of</strong> the motor has been proposed in which the core<br />

losses have been taken into account by a resistance in the<br />

equivalent circuit <strong>of</strong> the motor. This technique is called<br />

the “loss model control”, in which copper and core losses<br />

are evaluated as an analytical function <strong>of</strong> equivalent<br />

circuit parameters and id and iq currents <strong>of</strong> the motor. By<br />

obtaining the optimal currents, the motor losses can be<br />

minimized and efficiency maximized. In interior PM<br />

(IPM) motors Lq and Ld are unequal and it is difficult to<br />

obtain the optimal operating point at different speeds and<br />

torques analytically. To overcome this problem, normally<br />

id and iq are expressed as functions <strong>of</strong> the motor speed and<br />

its coefficients are stored in a lookup table. The objection<br />

<strong>of</strong> this method is that by increasing the motor speed and<br />

torque ranges, the tables will be larger and it must be<br />

updated by change <strong>of</strong> the motor parameters. In [1], [5],<br />

[10], motor efficiency has been improved by id =0<br />

method. In the PM motor model, motor reluctance torque<br />

appears as coefficient <strong>of</strong> id and by putting id =0, the<br />

reluctance torque as an opposed torque is eliminated and<br />

consequently its output power increases for a fixed speed.<br />

Applying id=0 to IPM motors needs a high power<br />

inverter. Therefore, this method is normally applied to<br />

surface-mounted PM (SPM) motor [10]. The unity power<br />

factor method can develop less maximum torque<br />

compared to other methods [10]. So, it is not suitable in<br />

the high torque applications. Flux-linkage control method<br />

presents a better performance in IPM.<br />

On contrary to the loss model control which depends<br />

on the motor model accuracy, methods presented in [2],<br />

[3], [13] is called search control method which is<br />

independent <strong>of</strong> the motor and drive parameters. In this<br />

method, attempt has been made to reduce the input power<br />

and this is normally done through the control <strong>of</strong> voltage<br />

or dc link current <strong>of</strong> the inverter. Application <strong>of</strong> this<br />

control method may produce undesirable oscillations in<br />

the torque and speed <strong>of</strong> the motor which leads to<br />

instability [14]. In this method the use <strong>of</strong> a frequency<br />

stabilizer is necessary.<br />

Previous papers have not taken into account both high<br />

starting torque and steady-state efficiency improvement in<br />

PM motor. This paper investigates the control strategy <strong>of</strong><br />

the maximum torque and efficiency enhancement in the<br />

steady-state operation <strong>of</strong> the motor. By application <strong>of</strong> this<br />

method, torque raises up to 4 pu and current up to 2 pu.


Distinction <strong>of</strong> this paper and [10] is the design <strong>of</strong><br />

intelligent system for applying the limitation on id and iq,<br />

So, at any instant sensitivity <strong>of</strong> torque against each current<br />

components is measured and a component that has less<br />

effect in <strong>of</strong> the torque development is limited. To improve<br />

the steady-state efficiency <strong>of</strong> the motor, loss model<br />

control algorithm <strong>of</strong> [5] is used. In section II the motor<br />

model is introduced. In section III control strategy is<br />

described. Section IV and V present the simulation<br />

method and results respectively. Finally section VI<br />

concludes the paper.<br />

II. MODEL OF MOTOR<br />

The proposed control strategies in this paper are based<br />

on the analytical equations <strong>of</strong> the motor model. Since<br />

enhancement <strong>of</strong> the maximum starting torque and steadystate<br />

efficiency <strong>of</strong> the motor are carried out through the<br />

control <strong>of</strong> id and iq current vectors, the Park’s model<br />

converts three-phase abc equations <strong>of</strong> the motor into twophase<br />

dq equations in which id and iq currents <strong>of</strong> the<br />

motor are available. Figure 1 shows the IPM motor<br />

model where Ra is the stator resistance, Ld is the d-axis<br />

inductance and Lq is the q-axis inductance.<br />

Figure 1: Two-axes model <strong>of</strong> IPM motor<br />

These two inductances are not equal in the IPM motors<br />

and they develop a considerable reluctance torque. Rc is<br />

the iron losses equivalent resistance. The iron losses<br />

consist <strong>of</strong> the hysteresis and eddy current losses, and are<br />

modeled by equivalent resistance Rc, which depends on<br />

the temperature and frequency. To simplify the<br />

computations, this resistance is evaluated at the rated<br />

conditions. In [5], leakage and magnetizing inductances<br />

are separated and inserted in different branches. Since Rc<br />

is very larger than that <strong>of</strong> the other impedances, the<br />

current <strong>of</strong> the losses branch is small and the current <strong>of</strong> the<br />

left hand side and right hand side have no considerable<br />

difference. Therefore, sum <strong>of</strong> leakage and magnetizing<br />

inductances are used in the model.<br />

The governing equations <strong>of</strong> the motor model are as<br />

follows:<br />

diod<br />

vd<br />

Raid<br />

<br />

Lqioq<br />

Ld<br />

(1)<br />

dt<br />

- 126 - 15th IGTE Symposium 2012<br />

dioq<br />

vq<br />

Raiq<br />

Ldiod<br />

a Lq<br />

dt<br />

(2)<br />

icd id<br />

iod<br />

(3)<br />

icq iq<br />

ioq<br />

(4)<br />

diod<br />

( Lqioq<br />

Ld<br />

)<br />

i<br />

dt<br />

cd <br />

Rc<br />

(5)<br />

dioq<br />

( (<br />

Ldiod<br />

<br />

a ) Lq<br />

)<br />

i<br />

dt<br />

cq <br />

RC<br />

The developed electromagnetic torque <strong>of</strong> the motor is:<br />

(6)<br />

3P<br />

Te ( )[ aLq<br />

( Ld<br />

Lq<br />

) iodioq<br />

]<br />

2<br />

The dynamic equation <strong>of</strong> the motor is as follows:<br />

(7)<br />

dr<br />

Te<br />

Tm<br />

c sign(<br />

r ) Fr<br />

J<br />

dt<br />

(8)<br />

III. CONTROL STRATEGY<br />

A. Enhancement <strong>of</strong> Maximum Torque<br />

Eqn. (7) indicates that the motor torque depends on id<br />

and iq components <strong>of</strong> currents. At the starting, stator<br />

current does not so much depend on the load, and<br />

normally is 2 to 2.5 times the rated value. The starting<br />

torque <strong>of</strong> the motor can be up to 2.5 times the rated<br />

torque and generally there is no need to reduce it.<br />

However, in some applications such as ABS brake <strong>of</strong> cars<br />

the motor must have very short declaration time (about<br />

fractional <strong>of</strong> ms), and the starting torque, about 2.5 times<br />

the rated torque, cannot response quickly in the no control<br />

mode. Therefore, it is necessary to apply the high starting<br />

torque in the case <strong>of</strong> no control case over longer time in<br />

order to provide an appropriate declaration time. In the<br />

previous studies some control methods have been<br />

proposed to increase the starting torque; however a<br />

limited starting current has not been considered. Here a<br />

novel technique is introduced that optimally limits the<br />

stator current under vector control and also enhanced the<br />

maximum starting torque. By applying this method, the<br />

starting torque rises up to 4 times the rated torque. The<br />

basis <strong>of</strong> this method is the use <strong>of</strong> id and iq components <strong>of</strong><br />

current. Since the equivalent resistance <strong>of</strong> the iron losses<br />

is almost infinite:<br />

i i<br />

q<br />

oq<br />

id iod<br />

(10)<br />

Suppose the stator current is<br />

starting period, iq is as follows:<br />

is constant during the<br />

2 2 2 2 2 2<br />

(11)<br />

i i i i i i<br />

q<br />

d<br />

s<br />

q<br />

s<br />

d<br />

Combining (7) and (11) leads to:<br />

3P<br />

2 2<br />

(12)<br />

Te ( ) is<br />

id<br />

[ a ( Ld<br />

Lq<br />

) id<br />

]<br />

2<br />

where is can be taken as 2 to 2.5 times the rated current.<br />

In practice, the back-emf increases by acceleration <strong>of</strong> the<br />

motor and this decreases the stator current. However, to<br />

simplify the equations it was taken to be constant. The<br />

(9)


optimal id is obtained by putting the derivative <strong>of</strong> the<br />

maximum torque versus id equal to zero:<br />

2<br />

dTe<br />

a<br />

a 2 is<br />

0 id<br />

<br />

( ) <br />

2<br />

2<br />

did<br />

4(<br />

Ld<br />

Lq<br />

) 4(<br />

Ld<br />

Lq<br />

) 2 (13)<br />

For positive id, the reluctance torque increases and the<br />

total torque <strong>of</strong> the motor reduces. So, only the negative<br />

sign is acceptable.<br />

2<br />

a<br />

a 2 is<br />

id <br />

( ) <br />

2<br />

2<br />

4(<br />

Ld<br />

Lq<br />

) 4(<br />

Ld<br />

Lq<br />

) 2<br />

(14)<br />

By applying id from (14) to the reference point id,<br />

torque rises. But the stator current becomes larger than<br />

the permissible current by applying the obtained<br />

components <strong>of</strong> the current. Therefore, the torque<br />

sensitivity versus the current components is calculated<br />

and the current that has less influence the torque<br />

development is limited:<br />

T Te<br />

3P<br />

e<br />

(15)<br />

Si | i cte ( )( Ld<br />

Lq<br />

) iq<br />

d<br />

q <br />

id<br />

2<br />

T Te<br />

3P<br />

e Si | i cte ( )( a ( Ld<br />

Lq<br />

) iq<br />

)<br />

q<br />

d <br />

(16)<br />

iq<br />

2<br />

By applying this limit, the stator current does not rise<br />

further than 2 times the rated current.<br />

B. Improvement <strong>of</strong> Steady-state Efficiency<br />

The major factor in the efficiency reduction <strong>of</strong> the<br />

motor is the increase <strong>of</strong> the copper and iron losses. In the<br />

vector controlled motor supplied by an inverter, high<br />

order harmonics generates additional losses. In spite <strong>of</strong><br />

this, the major part <strong>of</strong> the losses allocated to the<br />

fundamental harmonic. Copper losses directly and iron<br />

losses indirectly is proportional with the motor current.<br />

The copper and iron losses arising from the fundamental<br />

harmonic is optimized by vector control <strong>of</strong> the stator<br />

currents. The high order harmonic losses are<br />

uncontrollable. The basis <strong>of</strong> the losses control is the<br />

estimation <strong>of</strong> the motor losses using the presented model<br />

and its optimization versus the motor currents. Since<br />

efficiency is defined in steady-state, the time derivatives<br />

<strong>of</strong> the dynamic equations are set equal to zero and losses<br />

are calculated as follows:<br />

Lqioq<br />

2 <br />

(<br />

iod<br />

) <br />

3R<br />

2 2 3R<br />

Rc<br />

<br />

Wcu<br />

( iod<br />

, ioq<br />

, ) ( )( id<br />

iq<br />

) ( ) <br />

<br />

2<br />

2 ioq<br />

(<br />

a Ldiod<br />

) 2<br />

(<br />

)<br />

<br />

<br />

<br />

Rc<br />

<br />

(17)<br />

2<br />

2<br />

3R<br />

<br />

( ) <br />

c 2 2 3<br />

Lqioq<br />

W fe(<br />

iod<br />

, ioq,<br />

) ( )( icd<br />

icq)<br />

( ) <br />

<br />

2<br />

2R<br />

2 (18)<br />

c ( a Ldiod<br />

) <br />

where Wcu and Wfe are the copper losses and iron losses<br />

respectively. The motor losses are function <strong>of</strong> iod, ioq and<br />

. In these equations, the influence <strong>of</strong> temperature rise<br />

and higher harmonics on the resistances and magnetic<br />

saturation upon the inductances have been ignored and<br />

taken to be constant. However, the losses increase<br />

nonlinearly due to the high harmonics and considerable<br />

rise <strong>of</strong> the magnetizing current because <strong>of</strong> the saturation.<br />

The magnetic saturation occurs normally at starting, and<br />

at the steady-state mode the motor operates at the knee <strong>of</strong><br />

the magnetization characteristic; therefore, neglecting the<br />

- 127 - 15th IGTE Symposium 2012<br />

saturation is acceptable. By combining (7), (17), (18), the<br />

following equation is obtained:<br />

Wc W fe(<br />

iod<br />

, Te<br />

, ) Wcu<br />

( iod<br />

, Te<br />

, )<br />

Wc<br />

( iod<br />

, Te<br />

, )<br />

(19)<br />

As indicated in (19), the total losses <strong>of</strong> the motor depend<br />

on function <strong>of</strong> the operating point and iod current. At the<br />

operating point with fixed speed and torque, the optimal<br />

iod is obtained for losses reduction using the analytical<br />

derivative <strong>of</strong> (19).However, in the IPM motor, Ld and Lq<br />

are not identical, therefore the equations are complicated<br />

and use <strong>of</strong> analytical derivative is difficult. Sometimes,<br />

the currents <strong>of</strong> the motor are expressed as a polynomial<br />

versus each other where its polynomial coefficients<br />

depending on the speed <strong>of</strong> the motor. The coefficients <strong>of</strong><br />

the polynomials versus the motor operating point are<br />

stored in a look-up table. However, this method is quick<br />

but interpolation over different operating points leads to a<br />

highly approximated method. Meanwhile, the tables over<br />

wide range <strong>of</strong> speed and torque become large, and these<br />

tables must be updated by change <strong>of</strong> the motor type.<br />

Flow-chart reported in [5] has optimized the motor losses<br />

without using analytical derivative and also lookup table.<br />

The presented algorithm is an iterative one and it<br />

normally converges to an appropriate solution after 14<br />

iterations. In the present paper, the section related to the<br />

response <strong>of</strong> the final conditional expression reported in<br />

[5] is modified and therefore the optimal response point is<br />

achieved with lower number <strong>of</strong> iterations at any operating<br />

point. This algorithm increases the developed<br />

electromagnetic torque <strong>of</strong> the motor at a constant <strong>of</strong><br />

operating point and consequently efficiency <strong>of</strong> the motor<br />

improves. Since iq is the torque component <strong>of</strong> the current,<br />

its value depends largely on the load <strong>of</strong> the motor and its<br />

large change will lose the stable operating point.<br />

Therefore, the reluctance torque value is controllable<br />

by change <strong>of</strong> id. In the traditional methods such as id=0,<br />

the reluctance torque is almost zero and demagnetization<br />

effect <strong>of</strong> the stator current diminishes. The idea used in<br />

the new method is to make negative id which makes the<br />

reluctance torque positive and improves the efficiency<br />

Figure 2: Loss minimization algorithm


Figure 3: Motor and control system<br />

<strong>of</strong> the motor at fixed speed. idmax is generally taken to be a<br />

small positive value and idmin a large negative value. d<br />

defines the step variations <strong>of</strong> id and x the mean value <strong>of</strong> id<br />

in every step. To achieve an appropriate response the step<br />

number depends on the value <strong>of</strong> id which is fixed at an<br />

optimal value. The simulation results show the<br />

improvement <strong>of</strong> the motor efficiency by applying the<br />

presented control method compared to id =0 method.<br />

IV. SIMULATION METHOD<br />

The above-mentioned control strategies are applied to a<br />

PM motor under vector control. The output <strong>of</strong> these<br />

control methods provides the reference values <strong>of</strong> the drive<br />

current. Since the motor supply under vector control is<br />

PWM type, the high order odd harmonics are injected to<br />

the motor. Amplitude <strong>of</strong> these harmonics varies with<br />

changing the operating point. Figure. 3 shows the<br />

complete system <strong>of</strong> the motor and drive. The LMA block<br />

is for efficiency improvement strategy in steady-state and<br />

T/A block is for the maximum starting torque<br />

enhancement. Limitation block limits the motor currents.<br />

The reference current values are transformed into the twophase<br />

reference voltages by Vqd_ref. Then the motor<br />

reference voltages are formed by transforming the twophase<br />

to three-phase voltages and applying to the PWM<br />

block. The maximum starting torque algorithm during<br />

transient and efficiency improvement algorithm during<br />

the steady-state periods are applied to the motor. In the<br />

motor model, a low-pass filter is used for eliminating<br />

high-order harmonics from the control process.<br />

Specifications <strong>of</strong> the simulated motor have been<br />

summarized in Table I.<br />

V. SIMULATION RESULTS<br />

The results have been obtained by simulation <strong>of</strong> the<br />

motor under control using Simulink. First, the results <strong>of</strong><br />

applying T/A algorithm during the transient mode to the<br />

motor is considered. By using motor parameters and<br />

- 128 - 15th IGTE Symposium 2012<br />

TABLE I<br />

NAMEPLATES AND PARAMETERS OF IPM<br />

MOTOR<br />

Number <strong>of</strong> poles 6<br />

Rated Torque (Nm) 1.8<br />

Rated rms current (A) 3.6<br />

Rated speed (rpm) 4000<br />

Stator winding resistance Ra ( 2.21<br />

Core loss equivalent resistance Rc ( 840<br />

Direct axis inductance (mH) 9.77<br />

Quadrature axis inductance (mH) 14.94<br />

Permanent magnet flux a (Wb) 0.0844<br />

Mechanical losses (Nm) 0.04<br />

considering constant maximum current <strong>of</strong> the motor, id<br />

component <strong>of</strong> the stator current is calculated by T/A and<br />

applied to the input reference <strong>of</strong> the drive. Since a high<br />

torque is necessary at the starting, at the first instant the<br />

current id increases in the negative direction considerably<br />

(Figure. 4a). Negative current id leads to the positive<br />

reluctance torque and increases the total torque <strong>of</strong> the<br />

motor. By applying T/A, value <strong>of</strong> iq also increases and the<br />

motor current rises over permissible limit.<br />

Therefore, currents are limited. Figure. 4b exhibits the<br />

variations <strong>of</strong> the normalized electromagnetic torque <strong>of</strong> the<br />

motor (based on the rated values) in which the torque<br />

raises up to 3.8 pu due to T/A applications. This torque is<br />

very larger than the case in which the motor is able to<br />

develop with no control strategy. In order to study the<br />

performance <strong>of</strong> the stator current limiter system, threephase<br />

currents <strong>of</strong> the motor are shown in figure. 5 which<br />

indicates that the phase current <strong>of</strong> the motor raises up to<br />

1.8 pu. This current rises up to 2.5 pu when current<br />

limiter is not used. The reason for a constant torque<br />

during the transient mode is that the tolerable peak<br />

current by the stator winding is assumed constant. If this<br />

current as a function <strong>of</strong> the motor emf is applied to the<br />

model, the motor torque will decrease by time. After<br />

completion <strong>of</strong> the transient period, the control algorithm


iabc(pu)<br />

Speed(rpm)<br />

id(A)<br />

torque(pu)<br />

-5<br />

0 0.01 0.02 0.03 0.04 0.05<br />

time(s)<br />

2<br />

1<br />

0<br />

-1<br />

0<br />

-1<br />

-2<br />

-3<br />

-4<br />

4<br />

3.5<br />

3<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0 0.01 0.02 0.03 0.04 0.05<br />

time(s)<br />

-2<br />

0 0.01 0.02 0.03 0.04 0.05<br />

time(s)<br />

5000<br />

4000<br />

3000<br />

2000<br />

1000<br />

(a)<br />

(b)<br />

Figure 4: (a) id and (b) motor torque at starting period<br />

is converted into LMA. Change <strong>of</strong> the control strategy is<br />

also visible in the three-phase currents <strong>of</strong> the motor which<br />

creates some irregularities in the currents. In addition to<br />

limiting the current, reducing overshoot<br />

Figure 5: Three-phase currents <strong>of</strong> controlled motor<br />

with T/A control<br />

Without T/A control<br />

0<br />

0 0.01 0.02 0.03 0.04<br />

time(s)<br />

Figure 6: Time variations <strong>of</strong> motor speed<br />

and shortening settling time <strong>of</strong> the motor speed are other<br />

advantages <strong>of</strong> applying T/A. Figure. 6 compares the<br />

motor speed variations without T/A application and<br />

- 129 - 15th IGTE Symposium 2012<br />

controlling cases. Settling time <strong>of</strong> the motor from 0 to the<br />

rated speed decreases from 0.025 in the no control case to<br />

0.01 in the application <strong>of</strong> T/A and without overshoot. In<br />

fact, by applying the maximum torque control, the motor<br />

becomes more stable. It is noted that the torque jump due<br />

to switching from T/A to LMA is not present in the speed<br />

signal. The reasons are the high inertia and long<br />

mechanical time constant <strong>of</strong> the motor compared to its<br />

electrical time constant. The LMA attempts to find the<br />

optimal id for the efficiency improvement. Normally, id is<br />

chosen negative values by applying this algorithm. The<br />

influence <strong>of</strong> the negative id is the enhancement <strong>of</strong> the<br />

torque and decrease <strong>of</strong> losses in the motor. Simulation <strong>of</strong><br />

PM motor under LMA control over a wide range <strong>of</strong> the<br />

speed and torque has been carried out and the effect <strong>of</strong><br />

this algorithm on the motor variables and system<br />

efficiency has been investigated. Meanwhile, the outputs<br />

<strong>of</strong> LMA with id=0 are compared and advantage <strong>of</strong> this<br />

method over conventional methods is given.<br />

Figure. 7 shows the variations <strong>of</strong> the copper and iron<br />

losses <strong>of</strong> the motor versus speed and torque. By raising<br />

the speed at the rated load, the iron losses increase and in<br />

this case the losses reduction algorithm shows its<br />

dominant effects. Also at fixed speed and high loads,<br />

reduction <strong>of</strong> the total iron and copper losses is<br />

considerable. Meanwhile, by increasing the negative<br />

value <strong>of</strong> id, demagnetization effect <strong>of</strong> PM decreases and<br />

for a fixed output power, supply voltage reduces. This<br />

means the efficiency improvement. Difference between<br />

the motor losses in two cases id=0 and LMA causes the<br />

motor efficiency change.<br />

Figure. 8 shows the efficiency versus speed and torque<br />

<strong>of</strong> the motor. Efficiency <strong>of</strong> the motor has been compared<br />

in two id=0 and LMA cases. According to figure. 8a,<br />

efficiency <strong>of</strong> the motor under LMA control over different<br />

speeds, shows a relative increase by id=0 method. By<br />

increasing the speed <strong>of</strong> the motor, the rate <strong>of</strong> efficiency<br />

improvement also rises. It means that whatever the motor<br />

approaches more to the rated operating point, its<br />

efficiency improves. Figure. 8b shows the efficiency<br />

versus load torque at the rated speed, and it emphasizes<br />

the efficiency improvement <strong>of</strong> the LMA in the motor<br />

compared to that <strong>of</strong> the conventional methods. The<br />

impact <strong>of</strong> this method is higher for higher loads. As<br />

shown in figure. 8a, there is no much difference between<br />

LMA and id=0 at low load levels. So, efficiency will not<br />

be considerably changed. This is not true over the low<br />

speeds. Generally, electrical machines operate in the knee<br />

<strong>of</strong> the magnetization characteristic where they have peak<br />

power density; therefore they have the maximum<br />

efficiency at the rated operating point. By applying LMA<br />

at the rated operating point, a 3% rise <strong>of</strong> the efficiency<br />

occurs. In the references, LMA algorithm has been<br />

applied to the PM motor experimentally. The difference<br />

between the simulation and experimental results is due to<br />

the approximations included in the simulation. The most<br />

important factor is ignoring the magnetic saturation.


Total loss(W)<br />

82<br />

80<br />

78<br />

76<br />

74<br />

72<br />

70<br />

LMA<br />

id=0<br />

68<br />

500 1000 1500 2000 2500 3000 3500 4000<br />

Speed(rpm)<br />

Total losses(W)<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

LMA<br />

id=0<br />

0<br />

0 0.5 1 1.5 2<br />

torque(Nm)<br />

Comparision <strong>of</strong> the motor efficiencies at rated load (1.8 N.m)<br />

- 130 - versus the angular 15th speed in the case IGTE <strong>of</strong> LMA and id=0 controls. Symposium 2012<br />

Efficiencies [%]<br />

Efficiencies [%]<br />

100<br />

90<br />

80<br />

70<br />

60<br />

50<br />

40<br />

80<br />

60<br />

40<br />

20<br />

0<br />

0 1000 2000 3000 4000<br />

Angular speed [rpm]<br />

(a) (a)<br />

VI. CONCLUSION<br />

Two algorithms for enhancing the maximum starting<br />

torque and steady-state efficiency <strong>of</strong> a PM motor were<br />

investigated. In both methods, stator current components<br />

have been used, transient and steady-state performance <strong>of</strong><br />

a PMSM have been improved by closed-loop control<br />

strategy and application <strong>of</strong> the two algorithms. These<br />

algorithms are independent <strong>of</strong> the PM motor type. The<br />

simulation results shown that by applying two algorithms,<br />

performance <strong>of</strong> the motor is considerably improved<br />

compared to that <strong>of</strong> the conventional control methods. By<br />

applying the stator current limiting method during the<br />

starting period, the stator winding insulation is prevented<br />

against the damage due to high current. The LMA method<br />

improves the steady-state efficiency <strong>of</strong> the motor up to<br />

3% at the rated load and T/A method causes the increase<br />

<strong>of</strong> the starting torque up to 4 times <strong>of</strong> the rated torque.<br />

Therefore, by applying the proposed control methods in<br />

addition to providing a high starting torques without the<br />

risk <strong>of</strong> the short circuit <strong>of</strong> the windings; the extra losses<br />

arising from the imprecise control <strong>of</strong> electrical motors can<br />

be prevented.<br />

AKNOWLEGEMENT<br />

We sincerely thank the Iran’s National Elites<br />

Foundation (INEF) for financial support <strong>of</strong> the project.<br />

VII. REFRENCES<br />

[1] S.Morimoto, Y.Tong, Y.Takeda, and T. Hirasa, “Loss minimization<br />

control <strong>of</strong> permanent magnet synchronous motor drives,”IEEE<br />

Transactions on Industrial Electronics, vol. 41, no. 5, pp. 511-517,<br />

Oct 1994.<br />

[2] C.Mademlis, L.Xypteras, and N.Margaris, “Loss minimization in<br />

surface permanent magnet synchronous motor drives",IEEE<br />

Transactions on Industrial Electronics,vol. 47, no. 1, pp. 115-122,<br />

Feb 2000.<br />

Comparision <strong>of</strong> the motor efficiencies at rated speed (4000)<br />

versus the load torque in the case <strong>of</strong> LMA and <strong>of</strong> id=0 controls<br />

id=0<br />

LMA<br />

id=0<br />

LMA<br />

30<br />

0 0.5 1 1.5 2<br />

Torque(N.m)<br />

(b) (b)<br />

Figure 7: Losses <strong>of</strong> motor versus (a) speed and (b) torque<br />

Figure 8: Efficiency <strong>of</strong> the motor versus (a)speed (b)<br />

torque<br />

[3] Sadegh Vaez,M.A.Rahman, "Adaptive Loss Minimization Control <strong>of</strong><br />

Inverter Fed IPM Motor Drives".IEEE Power Electronics Specialists<br />

Conference, pp. 861-868, vo. 2, 1997.<br />

[4] T.M.Jahns, G.B.Kliman, T.W.Neumann, “Interior permnanent<br />

magnet synchronous motor for adjustable speed drives,” IEEE<br />

Transactions Industry Applications, vol. 22, no. 4, pp. 738-747,<br />

July/August 1986<br />

[5] C.Cavallaro, A.O.Tommaso, R.Miceli, and A.Raciti, “Efficiency<br />

enhansment <strong>of</strong> permanent magnet synchronous motor drives by<br />

online loss minimization approaches,”IEEE Transactions on<br />

Industry Applications, vol. 52, no. 4, pp. 1153-1160, August 2005.<br />

[6] J.S.Yim, S.K.Sul, B.H.Bae, N.R.Patel, and S.Hiti, “Modified current<br />

control schemes for high-performance permanent-magnet ac drives<br />

with low sampling to operating frequency ratio,” IEEE Transactions<br />

Industry Applications, vol. 45, no. 2, pp. 763-771, March/April<br />

2009.<br />

[7] S.Morimoto, H.Nakayama, and M.Sanada, “Sensorless output<br />

maximization control for variable-speed wind generation system<br />

using IPMSG,”IEEE Transactions on Industry Applications, vol.<br />

41, no. 1, pp. 60-67, Jan/Feb 2005.<br />

[8] T.Nakamura, S.Morimoto, m.sanada, and Y.Takada, “Optimum<br />

control <strong>of</strong> IPMSG for wind generation system,” IEEE Power<br />

Conversion Conference, Osaka, pp. 1435-1440, 2002.<br />

[9] C.Chris, G.R.Slemon, and R.Bonert, “Minimization <strong>of</strong> iron loss<br />

<strong>of</strong> permanent magnet synchronous machines,”IEEE Transactions<br />

on Energy Conversion, vol. 20, no. 1, pp. 121- 127, March 2005.<br />

[10] S.Morimoto, Y.Takeda, and T.Hirasa, “Current phase control<br />

methods for permanent magnet synchronous motors,”IEEE<br />

Transactions on Power Electronics, vol. 5, no. 2, pp. 133, April<br />

1990.<br />

[11] S.Morimoto, Y.Takeda, T.Hirasa, and K.Taniguchi, “Expansion <strong>of</strong><br />

operating limits for permanent magnet motor by current vector<br />

control considering inverter capacity,”IEEE Transactions on<br />

Industry Applications, vol. 26, no. 5, pp. 866-871, Sep/Oct 1990.<br />

[12] K.Yamazaki, “Torque and efficiency calculation <strong>of</strong> an interior<br />

permanent magent motor considering harmonic iron losses <strong>of</strong> both<br />

the stator and rotor,” IEEE Transactions Magnetics,vol. 39, no. 3,<br />

pp. 1460-1463, May 2003.<br />

[13] R.S.Colby, and D.W.Novotny, “An efficiency-optimizing<br />

permanent magnet synchronous motor drive,”IEEE Transaction on<br />

Industry Applications, vol. 24, no. 3, pp. 462-469,May/June 1988.<br />

[14] A.Kusko, and D.Galler, “Control means for minimization <strong>of</strong> losses,<br />

in AC and DC motor drives,”IEEE Transactions on Industry<br />

Applications, vol. 19, no. 4, pp. 561-570 ,July/August 1983.


- 131 - 15th IGTE Symposium 2012<br />

Core Losses Estimation Techniques in Electrical<br />

Machines with Different Supplies – A Review<br />

Jawad Faiz, A.M. Takbash and B. M. Ebrahimi<br />

Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering, College <strong>of</strong><br />

Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran, Email: jfaiz@ut.ac.ir<br />

Abstract—In this paper, different methods for core losses estimation in ferromagnetic materials with non-sinusoidal supply<br />

are studied. At this end, the origin <strong>of</strong> the core losses in the aforementioned materials is addressed. Since magnetization<br />

excitation is the most effective factor upon the core losses, different core losses estimation with six general types <strong>of</strong> excitations<br />

are considered and features <strong>of</strong> these methods, their advantages and disadvantages are investigated.<br />

Index Terms—Core Loss, Finite Element Method, Hysteresis Loop, Steinmetz Equation.<br />

I. INTRODUCTION<br />

The major role <strong>of</strong> ferromagnetic materials in electrical<br />

machines leads to wide research towards a better<br />

realization <strong>of</strong> these materials and their characteristics.<br />

One <strong>of</strong> the most important features in these materials is<br />

their losses. Core losses are generated due to magnetic<br />

flux residual and eddy current. From physics point <strong>of</strong><br />

view, these two factors have an identical origin; they are<br />

movement <strong>of</strong> magnetic domains walls as well as internal<br />

movement <strong>of</strong> the magnetic domains. The magnetic<br />

residual flux is a well-known phenomenon. When an<br />

external magnetic field is applied to a ferromagnetic<br />

material, magnetic dipoles try to align with this external<br />

field. Even after removing the external magnetic field,<br />

some magnetic domains preserve their alignments and<br />

such case the material is magnetized. By changing the<br />

magnetic field a small amount <strong>of</strong> energy is stored in the<br />

material due to existing residual flux. The level <strong>of</strong> this<br />

stored energy depends on the material type. If a conductor<br />

is imposed on a varying magnetic field or moved<br />

appropriately in the magnetic field, eddy current is<br />

induced in the conductor. Any variation in magnetic field<br />

that causes the movement <strong>of</strong> the magnetic domains walls<br />

is a factor inducing eddy current. Eddy current generates<br />

heat and electromagnetic forces. For dc excitation, there<br />

are residual and eddy current losses as well. The reason<br />

for the residual losses is the internal movement <strong>of</strong> the<br />

magnetic domains which itself generates microscopic<br />

currents [1]. Based on these two factors, the core losses<br />

have been classified into two classes [2]. Some categorize<br />

the core losses into three classes [3], in which the third<br />

class is stray losses due to the external factors such as<br />

external magnetic fields. Two factors influence the core<br />

losses. The first factor depends on the magnetic material<br />

alloy, for instance rising Si content in SiFe magnetic alloy<br />

reduces the eddy current losses. The second factor is the<br />

external factor. Magnetic properties <strong>of</strong> magnetic materials<br />

are affected by cutting, pressing and welding processes.<br />

For instance, cutting or welding process <strong>of</strong> magnetic<br />

sheets can increase the residual losses <strong>of</strong> the sheets.<br />

Pressing the sheets causes the increase <strong>of</strong> the eddy<br />

currents. One <strong>of</strong> methods for reduction <strong>of</strong> the eddy<br />

current is using thin sheets. However, this leads to higher<br />

residual losses [4], [5]. Another external factor affecting<br />

the core losses is the magnetizing excitation. In the case<br />

<strong>of</strong> sinusoidal excitation, the well-known classic Steinmetz<br />

equations can be used for core losses estimation:<br />

P k f B k f B<br />

(1)<br />

x 2 2<br />

ir h s m e s m.<br />

where Pir is the core losses, fs is the supply frequency, Bm<br />

is the magnetic flux density magnitude and kh, ke, xare the<br />

Steinmetz factors. Such equations leads to error in the<br />

case <strong>of</strong> non-sinusoidal excitation and new equations must<br />

be introduced. Application <strong>of</strong> different drives with<br />

various switching patterns and also internal faults in<br />

electrical machines leads to non-sinusoidal excitation.<br />

Wide application <strong>of</strong> inverter-fed electrical machines and<br />

different faults such as rotor broken bars and eccentricity<br />

are important research topics in recent years. Therefore,<br />

core losses estimation in the magnetic core over such noncommon<br />

conditions is important. The trend <strong>of</strong> core losses<br />

estimation can be classified as shown in Figure 1. In this<br />

classification six general methods has been introduced<br />

which will be discussed in this paper.<br />

II. FINITE-ELEMENT-BASED METHODS<br />

Finite element methods (FEM) are time consuming and<br />

high precision techniques that take into account the<br />

geometry and physics <strong>of</strong> the machine. There are three<br />

steps in modeling electrical machines. They include<br />

geometrical modeling <strong>of</strong> motor considering physical<br />

characteristics, modeling motor supply considering<br />

electrical characteristics and finally modeling the motor<br />

load taking into account mechanical features. FEMs<br />

provide magnetic field distributions within induction<br />

motor based on its geometrical and magnetic parameters.<br />

Other quantities such as air gap flux density can be also<br />

estimated using the magnetic field distribution. In FEM,<br />

the coupling between electrical and magnetic fields and<br />

motor rotation can be taken into account. Losses in<br />

different parts <strong>of</strong> the machine can be estimated having the<br />

magnetic field distribution in various sections <strong>of</strong> the<br />

motor. A new method has been introduced in [6] for core<br />

losses estimation in a no-load motor with direct-fed and<br />

PWM-fed motor using 2D time stepping FEM in which<br />

the simulation results have been compared to the


Steinmetz<br />

Eq.<br />

Hysteresis<br />

Model<br />

FEM<br />

- 132 - 15th IGTE Symposium 2012<br />

Iron Loss<br />

Calculation<br />

Methods<br />

Equivalent<br />

Circuit<br />

Physical Eq.<br />

Figure 1: Different methods for core losses estimation in non-sinusoidal excitation<br />

experimental results. To include stray losses and<br />

rotational residual losses, a modification factor has been<br />

considered in the losses calculation process. This<br />

modification factor depends on the peak magnetic flux<br />

density and its distortion. In [7], a new model for<br />

laminated core <strong>of</strong> induction motor has been presented<br />

based on 3D FEM. This modeling method is based on the<br />

reduced magnetic potential equations and used to estimate<br />

the core losses with non-sinusoidal excitation caused by<br />

PWM application. The results are more precise than that<br />

<strong>of</strong> the 2D FEM. In [8], the authors emphasize the need for<br />

precise data in core losses estimation from magnetic<br />

fields, therefore 2D FE model is applied; impact <strong>of</strong> nonsinusoidal<br />

supply and its harmonics on the rotor magnetic<br />

field has been investigated and the well-known integral<br />

equations have been then used to estimate the core losses.<br />

Core losses estimation in rotating electrical machines is<br />

more complicated than that <strong>of</strong> the static machine because<br />

<strong>of</strong> more complicated structure and rotating magnetic<br />

fields [9]. So, FEM modeling has been modified in order<br />

to include the stray core losses due to magnetic field<br />

rotating vector and its harmonics. At this end, two types<br />

<strong>of</strong> PM motors with two different structures have been<br />

modeled using a new method and the simulation results<br />

have been compared to the experimental results. In [10],<br />

impact <strong>of</strong> the stator slot shapes upon the core losses has<br />

been discussed and three different structures <strong>of</strong> induction<br />

motor for minimizing the core losses have been<br />

investigated. In this case, core losses are estimated using<br />

magnetic flux density and field intensity and integration<br />

<strong>of</strong> their product. Core losses distribution over core crosssection<br />

and impact <strong>of</strong> different stator slot shapes has been<br />

considered using FEM. In [11], a model based on eddy<br />

current analysis in the magnetic sheets is presented which<br />

is capable to estimate the stray losses for computation <strong>of</strong><br />

high frequency core losses and residual losses in core<br />

sheets <strong>of</strong> electrical machines. Advantages <strong>of</strong> this method<br />

is taking into account the magnetic field distribution<br />

along sheets thickness using one-dimensional non-linear<br />

FEM over non-linear 2D elements and also impact <strong>of</strong><br />

frequency and magnetic flux density on the quantities<br />

related to the core losses. In [12], a model has been<br />

introduced to study the impact <strong>of</strong> PWM supply on the<br />

induction motor core losses. Triple losses <strong>of</strong> the core have<br />

been presented by a combined model using FEM. The<br />

results have been compared to the traditional modeling<br />

and experimental results. This combined model consists<br />

<strong>of</strong> two static and dynamic models in which the impact <strong>of</strong><br />

Control<br />

Strategy<br />

the residual minor loops has been also included. In<br />

addition, the losses distribution over the motor and<br />

separation <strong>of</strong> different components <strong>of</strong> the core losses has<br />

been pointed out. In [13], induction motor performance<br />

has been analyzed using 2D FEM. This is one <strong>of</strong> the few<br />

works in which the impact <strong>of</strong> internal fault <strong>of</strong> induction<br />

motor such as rotor broken bars and eccentricity and also<br />

application <strong>of</strong> the PWM drive upon core and Ohmic<br />

losses have been considered and shown that the rotor<br />

broken bar causes the increase <strong>of</strong> the losses around the<br />

damaged bar; in addition PWM supply also increases the<br />

core losses density. In [14], Permanent magnet (PM)<br />

motor under three static, dynamic and mixed<br />

eccentricities faults have been analyzed using FEM.<br />

Finally, impact <strong>of</strong> these faults on core and Ohmic losses<br />

has been investigated. Figure 2 shows the impact <strong>of</strong><br />

different eccentricities on the eddy current and residual<br />

losses.<br />

III. HYSTERESIS LOOP MODEL BASED METHODS<br />

Precise mathematical model <strong>of</strong> hysteresis loop in<br />

magnetic material could be useful in accurate estimation<br />

<strong>of</strong> the core losses. First, classical hysteresis models such<br />

as [15] have been used to calculate the losses. Recently,<br />

improved hysteresis model such as loss surface model<br />

(LSM) and energy-based hysteresis vector-model have<br />

been employed to estimate the losses which are briefly<br />

described below. The LSM is a numerical and dynamic<br />

model for core losses evaluation which has been applied<br />

to thick magnetic laminations in [16]. This method is<br />

based on the definition <strong>of</strong> the magnetic field as a surface<br />

function <strong>of</strong> magnetic flux density and its rate as follows:<br />

dB dB<br />

S H( B, ) Hstat ( B) Hdyn ( B,<br />

). (2)<br />

dt dt<br />

In fact, this method is combination <strong>of</strong> a static and<br />

dynamic model and is capable to model the static and<br />

dynamic behaviors <strong>of</strong> the hysteresis loop. The static<br />

behavior is modeled using different hysteresis curves and<br />

dynamic behavior using six parameters which depend on<br />

the magnetic flux density and time variation <strong>of</strong> its<br />

derivative. Vector magnetic hysteresis model has been<br />

used in [17] to estimate the core losses. In this method,<br />

magnetic field intensity has been evaluated using the<br />

vertical components <strong>of</strong> the magnetic flux density and then<br />

core losses have been calculated by integration <strong>of</strong> the<br />

product <strong>of</strong> the magnetic flux density and field intensity.


- 133 - 15th IGTE Symposium 2012<br />

Figure 2: (a) Hysteresis losses and (b) eddy current losses for different degrees and types <strong>of</strong> eccentricity [14]<br />

Application <strong>of</strong> two Preisach and Jill-Atherton models<br />

have been compared in three different magnetic materials<br />

in order to obtain an optimal method for precise modeling<br />

<strong>of</strong> the magnetic cores using FEM with reasonable<br />

computation time [18]. In mathematical models the full<br />

hysteresis loop and a series <strong>of</strong> the magnetic parameters <strong>of</strong><br />

the proposed material must be available which<br />

complicated its application [19]<br />

IV. STEINMETZ EQUATIONS-BASED METHODS<br />

Purpose <strong>of</strong> the improved Steinmetz equations is to<br />

estimate core losses for non-sinusoidal magnetic flux<br />

density analytically. This modification is done using<br />

different methods. Improved Steinmetz equations have<br />

been employed to estimate the core losses in switched<br />

reluctance motor (SRM) [20], [21]. First SRM behavior is<br />

analyzed using FEM and then improved core losses<br />

equations for SRM including the impact <strong>of</strong> the minor<br />

hysteresis loops beside <strong>of</strong> current harmonics effect are<br />

considered.<br />

ab. B 1 dB<br />

max<br />

2<br />

Pc kcfChfBmax C ( ) .<br />

2 e avg (3)<br />

2<br />

dt<br />

where Kcf is the modification factor that takes into<br />

account the impact <strong>of</strong> the minor hysteresis loops within<br />

the major loops. Another idea for improving the<br />

Steinmetz classical equations is obtaining an equivalent<br />

frequency for proposed non-sinusoidal signal which have<br />

been used for magnetic sheets [22] and transformer [23]:<br />

2<br />

2<br />

T dB<br />

feq <br />

( ) dt.<br />

2 2<br />

( B 0<br />

max Bmin ) (4)<br />

dt<br />

where Bmax and Bmin are the maximum and minimum <strong>of</strong><br />

the magnetic flux density. In the other words, remagnetizing<br />

frequency is substituted by an equivalent<br />

frequency versus magnetic flux density variations. In<br />

addition the impacts <strong>of</strong> dc upon the core losses have been<br />

considered in [22] and the modified Steinmetz equations<br />

(MSE) have been introduced. Another method <strong>of</strong><br />

modifying Steinmetz equations is introducing the<br />

coefficients in order to take into account the nonsinusoidal<br />

magnetic excitation waveform. In this case, the<br />

distorted magnetic flux density waveform is used to<br />

determine these coefficients. For instance, in [24] the<br />

Steinmetz equations have been changed as such that the<br />

hysteresis losses versus the mean value <strong>of</strong> the rectified<br />

waveform <strong>of</strong> the magnetic flux density and eddy current<br />

losses versus the rms value <strong>of</strong> this waveform have been<br />

expressed. Consequently, these coefficients are estimated<br />

when the magnetic flux density waveforms in two directfed<br />

and PWM-fed are known and the core losses in<br />

abnormal operation are expressed versus the core losses<br />

in the normal operation. In [25], the traditional Steinmetz<br />

equations and their modification have been used to<br />

estimate the magnetic sheets losses under PWM-fed; such<br />

that the impact <strong>of</strong> frequency and magnetic flux density<br />

variations upon the Steinmetz equation have been<br />

included in the coefficients for different materials and<br />

also impact <strong>of</strong> the magnetic flux density waveform<br />

variations due to drive on the core losses. In order to<br />

consider the frequency and magnetic flux density on the<br />

Steinmetz coefficients, these coefficients are considered<br />

as 3 rd order equations versus magnetic flux density.<br />

V. PHYSICS-DEFINED LOSSES BASED METHODS<br />

A long time ago, there was a procedure for core losses<br />

estimation based on the physics definitions <strong>of</strong> various<br />

core losses. For instance in [26], a modeling method was<br />

introduced for magnetic domains in material and then<br />

eddy current, its cause and losses were discussed.<br />

Advantage <strong>of</strong> this model is its capability to use over wide<br />

range <strong>of</strong> the flux density up to the saturation level and<br />

wide frequency band. In [27], forming the eddy current in<br />

the magnetic sheets and external magnetic fields effect<br />

has been considered. Meanwhile, impacts <strong>of</strong> the internal<br />

magnetic fields (adjacent magnetic domains walls) on the<br />

eddy current have been included. In [28], core losses as<br />

non-linear function <strong>of</strong> frequency are expressed as three<br />

types <strong>of</strong> hysteresis, classic and stray losses. The classic<br />

and stray losses are as follows:<br />

( class)<br />

2 2 2 2<br />

P d<br />

I max<br />

fm/6.<br />

(5)<br />

( exc) 2L<br />

( class)<br />

P (1.63 ) P .<br />

(6)<br />

d<br />

where is the conductivity, d is the magnetic sheet<br />

thickness and L is the magnetic domain dimensions. Core<br />

losses have been categorized into two types: 1: Core<br />

losses constant against frequency (hysteresis losses), and<br />

2: core losses depending on frequency (eddy current<br />

losses and abnormal stray losses) [29]. Each category has<br />

been expressed by equations versus frequency and<br />

magnetic flux density. This method is not a precise<br />

method, the main reason is the classification <strong>of</strong> the losses<br />

versus dependency and independency on the<br />

frequency. In addition, this method has appropriate results


Figure 4: various losses in the mains-fed and inverter-fed<br />

induction motor [31]<br />

over particular amplitude <strong>of</strong> the magnetic flux density due<br />

to the simplification <strong>of</strong> the method. In [30], [31], core<br />

losses have been divided into hysteresis losses, eddy<br />

current losses, and stray eddy current losses. Classic eddy<br />

current losses and stray eddy current losses are as follows:<br />

2<br />

d 1 T dB<br />

Wc( ) dt.<br />

f 12m<br />

T (7)<br />

0 dt<br />

v<br />

1 1 T dB<br />

W GV<br />

S dt.<br />

(8)<br />

c<br />

0<br />

fm 0<br />

v T dt<br />

where mv is the magnetic material density. In the stray<br />

eddy currents losses S is the magnetic sheet cross-section,<br />

G is the dimensions factor and V0 is a parameter that<br />

determines the local fields distribution. Then an<br />

equivalent frequency is defined to estimate the losses<br />

based on physics definition taking into account the<br />

harmonics within the core magnetic flux density. Figure 4<br />

shows various losses in the mains-fed and inverter-fed<br />

induction motor. Also in [32], integrally defined <strong>of</strong> triple<br />

subdivision <strong>of</strong> core losses has been used to estimate the<br />

core losses. In this case, losses equations have been<br />

presented versus mmf for different emf (sinusoidal,<br />

triangular and rectangular) waveforms considering<br />

relationship between emf and magnetic flux density and<br />

piecewise-linearized modeling <strong>of</strong> emf waveform. The<br />

relevant equations due to different losses have been given<br />

for each waveform. In addition, impact <strong>of</strong> different<br />

parameters such as duty cycle upon various core losses<br />

has been investigated. In [33], the influence <strong>of</strong> minor<br />

hysteresis loops within the magnetic sheet hysteresis loop<br />

upon core losses estimation has been investigated and<br />

their complicated time variations versus peak magnetic<br />

flux density and magnetic polarization vector for<br />

estimation <strong>of</strong> the triple core losses have been presented.<br />

Figure 5 shows that how these minor hysteresis loops are<br />

generated. Variations in the magnetic polarization<br />

envelop causes minor hysteresis loops within the major<br />

hysteresis loop. Dividing the core losses into three<br />

different losses and integral definitions versus magnetic<br />

flux density, its derivative lead to complicated<br />

computations. An important point in these methods is<br />

their dependency on the coefficients which depend on the<br />

physical and chemical characteristics <strong>of</strong> the proposed<br />

material and its molecular structure; however, they are not<br />

1.5<br />

- 134 - 15th IGTE Symposium 2012<br />

Figure 5: Generating minor hysteresis loops within major<br />

hysteresis loop [33].<br />

Figure 6: dq equivalent circuit considering core losses [36].<br />

available and their computations need some tests,<br />

therefore application <strong>of</strong> these methods is difficult. In<br />

[34], eddy current losses have been evaluated by<br />

application <strong>of</strong> different double-magnetic-excitation on the<br />

magnetic sheets using Maxwell equations. These<br />

computations have been carried out on a steel sheet and<br />

can be extended to the whole electrical machine. In<br />

addition, computations results have large difference with<br />

the test results and no justification has been given. Also<br />

application <strong>of</strong> the complicated Maxwell equations is an<br />

important problem particularly in the selection <strong>of</strong> the<br />

numerical solution method for solution <strong>of</strong> the equations.<br />

In [35], superposition method has been applied to<br />

estimate the eddy current losses in the PWM-fed motor<br />

and transformer. Since different losses do not vary versus<br />

magnetic flux density, application <strong>of</strong> the superposition is<br />

not correct.<br />

VI. EQUIVALENT CIRCUIT-BASED METHODS<br />

Equivalent circuit <strong>of</strong> induction motor has been<br />

frequently used to analyze the motor behavior and find its<br />

core losses. As shown in Figure 6, at this end a simplified<br />

dq model <strong>of</strong> ac motors can be used which does not lead to<br />

accurate results [36]. In addition, in dq model harmonics<br />

are ignored and this decreases the precision <strong>of</strong> the<br />

method. In [37], an equivalent circuit with a<br />

harmonic supply has been introduced to estimate core<br />

losses. In this case, dq model has been used and<br />

parameters <strong>of</strong> the equivalent circuit are calculated using<br />

FEM. In [38], impact <strong>of</strong> unbalanced supplied induction<br />

motor on the motor efficiency has been presented. The<br />

proposed equivalent circuit <strong>of</strong> induction motor is not<br />

accurate, so it has been modified by layered magnetic<br />

core. To do so, a resistance with leakage inductances has<br />

been connected in parallel. The major point in the


Method<br />

Steinmetz Eqn.<br />

MSE<br />

Hysteresis model<br />

Physical Eqn.<br />

FEM<br />

Equivalent Circuit<br />

Control Strategy<br />

application <strong>of</strong> the equivalent circuits for core losses<br />

estimation is its strong dependency on the parameters<br />

which may vary due to the operating conditions.<br />

VII. CONTROL STRATEGY-BASED METHODS<br />

Since efficiency <strong>of</strong> electrical machine is an important<br />

factor beside its life, one <strong>of</strong> the major quantities aiming to<br />

reduce is the core losses <strong>of</strong> drive-fed motor. In [39], [40],<br />

strategies for squirrel-cage induction motor and PM<br />

synchronous motor control have been introduced to<br />

decrease the losses. In these strategies, all equations for<br />

minimizing the losses depend on the motor parameters<br />

and determination <strong>of</strong> these parameters is themselves<br />

critical. In [41], impact <strong>of</strong> the core losses <strong>of</strong> induction<br />

motor on the stator-flux oriented control has been studied<br />

and a control strategy taking into account the induction<br />

motor losses has been introduced. In this method, core<br />

losses have been modeled by a resistance in parallel with<br />

the magnetizing inductance. In [42], a vector controlbased<br />

strategy for PMSM has been presented to maximize<br />

the motor efficiency. At this end, a model has been<br />

suggested for losses. In this method, d-component <strong>of</strong> the<br />

stator current is determined to maximize the IPM motor<br />

efficiency. In this strategy, losses are divided into Ohmic,<br />

core, mechanical and harmonic losses and they are then<br />

calculated.<br />

This test-based method can be extended to different<br />

types <strong>of</strong> PM and reluctance motors. However, they<br />

depend strongly on the motor parameters while these<br />

parameters are not in turn constant and vary by changing<br />

the operating point <strong>of</strong> the motor. Table I summarizes the<br />

core losses estimation for different supplies. In this table,<br />

some factors such as complexity <strong>of</strong> the methods, need for<br />

magnetic material parameters, response to the nonsinusoidal<br />

excitation waveforms and precision <strong>of</strong> the<br />

methods have been compared. As seen in Table 1, some<br />

accurate methods are complicated and need huge data<br />

from the magnetic material. Some methods are simple<br />

with low accurate responses. So, a proper method must be<br />

selected based-on the application.<br />

VIII. CONCLUSION<br />

Different methods were proposed for core losses<br />

estimation in magnetic materials for different excitations.<br />

These methods were classified and studied. Some factors<br />

such as complexity <strong>of</strong> methods for magnetic material<br />

parameters estimation, response to non-sinusoidal<br />

- 135 - 15th IGTE Symposium 2012<br />

TABLE I<br />

COMPARISON OF DIFFERENT CORE LOSSES ESTIMATION METHODS<br />

Complex waveform Complexity Material knowledge<br />

-<br />

Low<br />

Low<br />

+<br />

Low<br />

Low<br />

+<br />

High<br />

High<br />

+<br />

High<br />

High<br />

+<br />

High<br />

Medium<br />

-<br />

Medium<br />

Low<br />

+<br />

Medium<br />

Low<br />

Accuracy<br />

Low<br />

Medium<br />

Good<br />

Good<br />

Good<br />

Low<br />

Medium<br />

excitation waveforms and precision for each method were<br />

investigated and summarized. The methods such as FEM,<br />

hysteresis model, physical equation lead to accurate<br />

results but they are complicated methods and need a huge<br />

data <strong>of</strong> the magnetic material. MSE method is simple and<br />

no need huge data <strong>of</strong> the magnetic material with average<br />

accuracy. Control strategies are not used in direct core<br />

losses estimation. Equivalent electrical circuits <strong>of</strong> motor<br />

parameters also depend on the operating point <strong>of</strong> the<br />

motor and this is considered a difficulty <strong>of</strong> these methods.<br />

ACKNOWLEDGMENT<br />

The authors would like to thank Iran’s National Elites<br />

Foundation (INEF) for financial support <strong>of</strong> the project.<br />

[1] C. D. Graham,<br />

REFERENCES<br />

"Physical origin <strong>of</strong> losses in conducting<br />

[2]<br />

ferromagnetic materials," Journal <strong>of</strong> Applied Physics, vol. 53, no.<br />

11, pp. 8276-8280, Nov. 1982.<br />

M. S. Lancarotte, Jr. Aderbalde, and A. Penteado, "Estimation <strong>of</strong><br />

core losses under sinusoidal or non-sinusoidal induction by<br />

analysis <strong>of</strong> magnetization rate,” IEEE Transactions on Energy<br />

Conversion, vol.16, no. 2, pp. 174-179, June 2001.<br />

[3] M. Amar, and F. Protat, "A simple method for the estimation <strong>of</strong><br />

power losses in silicon iron sheets under alternating pulse voltage<br />

excitation," IEEE Transactions on Magnetics, vol. 30, no. 2, pp.<br />

942-944, March 1994.<br />

[4] W. Arshad, T. Ryckebusch, F. Magnussen, H. Lendenmann, B.<br />

Eriksson, J. Soulard, and B. Malmros, "Incorporating lamination<br />

processing and component manufacturing in electrical machine<br />

design tools," 42nd IAS Annual Meeting <strong>of</strong> Industry Applications<br />

Society, pp. 94-102, 2007.<br />

[5] H. Skarrie, "Design <strong>of</strong> powder core inductors," Licentiate Thesis,<br />

Lund <strong>University</strong>, Sweden, 2001.<br />

[6] Z. Gmyrek, A. Boglietti, and A. Cavagnino, "Estimation <strong>of</strong> iron<br />

losses in induction motors: calculation method, results, and<br />

analysis," IEEE Transactions on Industrial Electronics, vol. 57,<br />

no. 1, pp. 161-171, January 2010.<br />

[7] K. B. Tatis, A. G. Kladas, and J. A.Tegopoulos, "Harmonic iron<br />

loss determination in laminated iron cores by using a particular 3-<br />

D finite-element model," IEEE Transactions on Magnetics, vol.<br />

40, no. 2, pp. 860- 863, March 2004.<br />

[8] A. M. Knight, and Y. Zhan, "Identification <strong>of</strong> flux density<br />

harmonics and resulting iron losses in induction machines with<br />

nonsinusoidal supplies," IEEE Transactions on Magnetics, vol.<br />

44, no. 6, pp.1562-1565, June 2008.<br />

[9] L. Ma, M. Sanada, S. Morimoto, and Y. Takeda, "Iron loss<br />

prediction considering the rotational field and flux density<br />

harmonics in IPMSM and SynRM," IEE <strong>Proceedings</strong> - Electric<br />

Power Applications, vol. 150, no. 6, pp. 747- 751, November<br />

2003.<br />

[10] M. Enokizono, H. Shimoji, and T. Horibe, "Effect <strong>of</strong> stator<br />

construction <strong>of</strong> three-phase induction motors on core loss," IEEE<br />

Transactions on Magnetics, vol. 39, no. 3, pp. 1484- 1487, May<br />

2003.<br />

[11] K. Yamazaki, and N. Fukushima, "Experimental validation <strong>of</strong><br />

iron loss model for rotating machines based on direct eddy current


analysis <strong>of</strong> electrical steel sheets," IEEE International Conference<br />

on Electric Machines and Drives. IEMDC '09, pp. 851-857, 3-6<br />

May 2009.<br />

[12] E. Dlala, and A. Arkkio, "A General model for investigating the<br />

effects <strong>of</strong> the frequency converter on the magnetic iron losses <strong>of</strong> a<br />

squirrel-cage induction motor," IEEE Transactions on Magnetics<br />

, vol. 45, no. 9, pp. 3303-3315, Sept. 2009.<br />

[13] J. F. Bangura, and N. A. Demerdash, "Effects <strong>of</strong> broken bars/endring<br />

connectors and air gap eccentricities on Ohmic and core<br />

losses <strong>of</strong> induction motors in ASDs using a coupled finite<br />

element-state space method," IEEE Transactions on Energy<br />

Conversion, vol. 15, no. 1, pp. 40-47, Mar. 2000.<br />

[14] B. M. Ebrahimi, and J. Faiz, "Diagnosis and performance<br />

analysis <strong>of</strong> three phase permanent magnet synchronous motors<br />

with static, dynamic and mixed eccentricity," Electric Power<br />

Applications, IET, vol. 4, no. 1, pp. 53-66, January 2010.<br />

[15] D. Jiles, and D. Atherton, "Theory <strong>of</strong> ferro-magnetic hysteresis,"<br />

Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 61, no. 1-2,<br />

pp. 48-60, Sep.1986.<br />

[16] T. Chevalier, A. Kedous Lebouc, B. Cornut, and C. Cester, "A<br />

new dynamic hysteresis model for electrical steel sheet," Physica<br />

B: Condensed Matter, vol. 275, no. 1-3, pp. 197-201, 2000.<br />

[17] M. Enokizono, and T. Horibe, "Loss evaluation <strong>of</strong> induction<br />

motor by using E&S2 model magnetic hysteresis," IEEE<br />

Transactions on Magnetics, vol. 38, no. 5, pp. 2379- 2381.<br />

September 2002.<br />

[18] A. Benabou, S. Clenet, and F. Piriou, "Comparison <strong>of</strong> Preisach<br />

and Jiles–Atherton models to take into account hysteresis<br />

phenomenon for finite element analysis," Journal <strong>of</strong> Magnetism<br />

and Magnetic Materials, no. 261, pp. 139–160, 2003.<br />

[19] A. Krings, and J. Soulard, "Over view and comparison <strong>of</strong> iron<br />

loss models for electrical machines," Ecological Vehicle and<br />

Renewable Energy (EVER), Monaco, 25-28 March 2010.<br />

[20] B. Ganji, J. Faiz, K. Kasper, C. E. Carstensen, and R. W.<br />

DeDoncker, "Core loss model based on finite-element method for<br />

switched reluctance motors," IEE <strong>Proceedings</strong> - Electric Power<br />

Applications, vol. 4, no. 7, pp. 569–577, 2010.<br />

[21] J. Faiz, B. Ganji, B. R. W. De Doncker, and J. O. Fiedler,<br />

"Electromagnetic modeling <strong>of</strong> switched reluctance motor using<br />

finite element method," 32nd Annual Conference on IEEE<br />

Industrial Electronics, IECON, pp. 1557-1562, 6-10 Nov. 2006.<br />

[22] J. Reinert, A. Brockmeyer, and R. W. De Doncker, "Calculation<br />

<strong>of</strong> losses in ferro- and ferromagnetic materials based on the<br />

modified Steinmetz equation," IEEE Transactions on Industry<br />

Applications, vol. 37, no. 4, pp. 1055-1061, Jul./Aug. 2001.<br />

[23] M. Albach, T. Durbaum, and A. Brockmeyer, "Calculating core<br />

losses in transformers for arbitrary magnetizing currents: a<br />

comparison <strong>of</strong> different approaches," 27th Annual IEEE Power<br />

Electronics Specialists Conference, PESC '97, vol. 2, pp. 1463-<br />

1468, 23-27 Jun 1996.<br />

[24] A. Boglietti, A. Cavagnino, M. Lazzari, and M. Pastorelli,<br />

"Predicting iron losses in s<strong>of</strong>t magnetic materials with arbitrary<br />

voltage supply: an engineering approach," IEEE Transactions on<br />

Magnetics, vol. 39, no. 2, pp. 981- 989, Mar. 2003.<br />

[25] D. Ionel, M. Popescu, C. Cossar, M. I. McGilp, A. Boglietti,<br />

and A. Cavagnino, "A general model <strong>of</strong> the laminated steel losses<br />

in electric motors with PWM voltage supply," IEEE Industry<br />

Applications Society Annual Meeting. IAS '08., pp. 1-7, 5-9 Oct.<br />

2008.<br />

[26] R. H. Pry, and C. P. Bean, "Calculation <strong>of</strong> the energy loss in<br />

magnetic sheet materials using a domain model," Journal <strong>of</strong><br />

Applied Physics, vol. 29, no. 3, pp. 532-533, Mar. 1958.<br />

- 136 - 15th IGTE Symposium 2012<br />

[27] G. Bertotti, "Physical interpretation <strong>of</strong> eddy current losses in<br />

ferromagnetic materials. I. Theoretical considerations," Journal <strong>of</strong><br />

Applied Physics, vol. 57, no. 6, pp. 2110-2117, March 1985.<br />

[28] G. Bertotti, "General properties <strong>of</strong> power losses in s<strong>of</strong>t<br />

ferromagnetic materials," IEEE Transactions on Magnetics, vol.<br />

24, no. 1, pp. 621-630, Jan. 1988.<br />

[29] R. Newbury, "Prediction <strong>of</strong> loss in silicon steel from distorted<br />

waveforms," IEEE Transactions on Magnetics, vol. 14, no. 4, pp.<br />

263- 268, Jul. 1978.<br />

[30] M. Amar and R. Kaczmarek, "A general formula for prediction<br />

<strong>of</strong> iron losses under non-sinusoidal voltage waveform," IEEE<br />

Transactions on Magnetics, vol. 31, no. 5, pp. 2504-2509, Sep.<br />

1995.<br />

[31] T. C. Green, C. A. Hernandez-Aramburo, and A. C. Smith,<br />

"Losses in grid and inverter supplied induction machine drives,"<br />

IEE <strong>Proceedings</strong> - Electric Power Applications, vol. 150, no. 6,<br />

pp. 712- 724, 7 Nov. 2003,<br />

[32] W. A. Roshen, "A practical, accurate and very general core loss<br />

model for no sinusoidal waveforms," IEEE Transactions on<br />

Power Electronics, vol. 22, no. 1, pp. 30-40, Jan. 2007.<br />

[33] E. Barbisio, F. Fiorillo, and C. Ragusa, "Predicting loss in<br />

magnetic steels under arbitrary induction waveform with minor<br />

hysteresis loops," IEEE Transactions on Magnetics, vol. 40, no.<br />

4, pp. 1810- 1819, July 2004.<br />

[34] J. Sagarduy, A. J. Moses, and F. J. Anayi, "Eddy current losses in<br />

electrical steels subjected to matrix and classical PWM excitation<br />

waveforms," IEEE Transactions on Magnetics, vol. 42, no. 10,<br />

pp. 2818-2820, Oct. 2006.<br />

[35] Ruifang Liu, C. C. Mi, and D. W. Gao, "Modeling <strong>of</strong> eddycurrent<br />

loss <strong>of</strong> electrical machines and transformers operated by<br />

pulse width-modulated inverters," IEEE Transactions on<br />

Magnetics, vol. 44, no. 8, pp. 2021-2028, Aug. 2008.<br />

[36] M. Popescu, D. G. Dorrell, and D. M. Ionel, "A study <strong>of</strong> the<br />

engineering calculations for iron losses in 3-phase AC motor<br />

models," IEEE 33rd Annual Conference <strong>of</strong> the Industrial<br />

Electronics Society. IECON 2007., pp. 169-174, 5-8 Nov. 2007.<br />

[37] K. Yamazaki, "Torque and efficiency calculation <strong>of</strong> an interior<br />

permanent magnet motor considering harmonic iron losses <strong>of</strong> both<br />

the stator and rotor," IEEE Transactions on Magnetics, vol. 39,<br />

no. 3, pp. 1460- 1463, May 2003.<br />

[38] A. Vamvakari, A. Kandianis, A. Kladas, S. Manias, and J.<br />

Tegopoulos, "Analysis <strong>of</strong> supply voltage distortion effects on<br />

induction motor operation," IEEE Transactions on Energy<br />

Conversion, vol. 16, no. 3, pp. 209-213, Sep. 2001.<br />

[39] S. Lim, and K. Nam, "Loss-minimizing control scheme for<br />

induction motors," IEE <strong>Proceedings</strong> -Electric Power<br />

Applications, vol. 151, no. 4, pp. 385- 397, 7 July 2004.<br />

[40] J. Lee, K. Nam, S. Choi, and S. Kwon, "Loss-minimizing control<br />

<strong>of</strong> PMSM with the use <strong>of</strong> polynomial approximations," IEEE<br />

Transactions on Power Electronics, vol. 24, no. 4, pp. 1071-<br />

1082, April 2009.<br />

[41] S. D. Wee, M. H. Shin, and D. S. Hyun, "Stator-flux-oriented<br />

control <strong>of</strong> induction motor considering iron loss," IEEE<br />

Transactions on Industrial Electronics, vol. 48, no. 3, pp. 602-<br />

608, Jun. 2001.<br />

[42] C. Mademlis, I. Kioskeridis, and N. Margaris, "Optimal<br />

efficiency control strategy for interior permanent-magnet<br />

synchronous motor drives," IEEE Transactions on Energy<br />

Conversion, vol. 19, no. 4, pp. 715- 723, Dec. 2004.


- 137 - 15th IGTE Symposium 2012<br />

Fast Computation <strong>of</strong> Inductances and Capacitances <strong>of</strong><br />

High Voltage Power Transformer Windings<br />

*Župan, Tomislav, *Štih, Željko and *Trkulja, Bojan<br />

*Faculty <strong>of</strong> Electrical Engineering and Computing, <strong>University</strong> <strong>of</strong> Zagreb, Unska 3, 10000 Zagreb, Croatia<br />

E-mail: tomislav.zupan@fer.hr<br />

Abstract— Inductances and capacitances in analysis <strong>of</strong> fast transients in high voltage transformers are usually calculated on<br />

the basis <strong>of</strong> simple analytical approximations for applications in lumped circuit models. This paper presents the application<br />

<strong>of</strong> the boundary element method to calculation <strong>of</strong> capacitances and inductances <strong>of</strong> transformer windings with coils <strong>of</strong><br />

rectangular cross section. Two-dimensional axially symmetric mathematical model <strong>of</strong> electric and magnetic field in<br />

transformer, which is based on integral equations, is introduced. The accuracy and applicability <strong>of</strong> the proposed approach is<br />

illustrated by an example.<br />

Index Terms—boundary element method, capacitance calculation, inductance calculation, rectangular cross section coils.<br />

I. INTRODUCTION<br />

Power transformers are one <strong>of</strong> the most important<br />

segments <strong>of</strong> electric power systems. Due to their long<br />

term running requirements they are faced with numerous<br />

overvoltage strikes during their lifetime. Therefore it is <strong>of</strong><br />

great importance to know how the power transformer<br />

windings will react on such excitations.<br />

Since most <strong>of</strong> the overvoltage strikes are characterized<br />

by high frequency impulse signals (due to lightning<br />

discharges, short circuits, etc.), the capacitances between<br />

windings, which are negligible during normal operational<br />

frequency <strong>of</strong> power grid, become significant. Thus it is<br />

meaningful to have, alongside the strict calculation <strong>of</strong><br />

inductances, a method for precise calculation <strong>of</strong><br />

capacitances <strong>of</strong> high voltage power transformer windings.<br />

Distribution <strong>of</strong> voltage along windings <strong>of</strong> transformers<br />

due to fast excitations (switching, lightning, and testing)<br />

is one <strong>of</strong> the most important inputs to insulation design.<br />

Lumped circuit model <strong>of</strong> winding is most frequently<br />

applied in calculation <strong>of</strong> such distribution [1]-[3]. Parts <strong>of</strong><br />

windings are represented by equivalent capacitances,<br />

inductances and resistances. Depending on the way <strong>of</strong> the<br />

connection <strong>of</strong> the conductors, the equivalent diagram is<br />

made for each <strong>of</strong> the turns or, for the sake <strong>of</strong> faster<br />

calculation, turns are grouped.<br />

There are numerous papers dealing with the presented<br />

problem by using the numerical approach based on the<br />

finite element method (FEM) [1], [4]. In this paper we<br />

introduce the application <strong>of</strong> the boundary element method<br />

(BEM) to calculation <strong>of</strong> capacitances and inductances <strong>of</strong><br />

windings <strong>of</strong> transformers with coils <strong>of</strong> rectangular cross<br />

section. Main advantage <strong>of</strong> this approach is ease <strong>of</strong> use,<br />

because the discretization is done only at material<br />

interfaces and boundaries, thus effectively reducing the<br />

order <strong>of</strong> the mathematical model by one dimension.<br />

II. COMPUTATION OF CAPACITANCES<br />

Typical arrangement inside the high voltage power<br />

transformer can be seen in Fig. 1 (simplified). Windings<br />

usually consist <strong>of</strong> low voltage, high voltage and<br />

regulation voltage parts. Conductors are typically <strong>of</strong><br />

rectangular cross section insulated by paper insulation<br />

and immersed in oil, grouped in discs with radial and<br />

axial canals for heat transfer purposes.<br />

z<br />

r<br />

LV<br />

HV RV<br />

core<br />

Fig. 1. Windings in high voltage power transformers<br />

Usual approach to calculation <strong>of</strong> capacitances is based<br />

on simple analytical parallel-plate approximations [2]:<br />

2 2<br />

Rout Rin<br />

CParallel<br />

0<br />

,<br />

(1)<br />

drc<br />

<br />

<br />

<br />

o p<br />

where o and p are relative permittivities <strong>of</strong> oil and<br />

paper insulation, respectively, drc height <strong>of</strong> the radial<br />

canal between conductors in z axis direction, width <strong>of</strong><br />

insulation between conductors, in R and Rout conductor's<br />

inner and outer radius, respectively,<br />

cylindrical approximations [5]:<br />

or analytical<br />

20<br />

phc CSerial<br />

<br />

,<br />

Rav<br />

<br />

ln 2 <br />

R <br />

av <br />

2 <br />

(2)<br />

oil<br />

paper<br />

insulation<br />

conductor


where c h is conductor's height, and Rav average radius<br />

between two conductors.<br />

These analytical methods result in significant errors and<br />

consequently in unsatisfactory accuracy <strong>of</strong> computation<br />

<strong>of</strong> voltage distribution along the winding.<br />

Electromagnetic field in power transformers is<br />

transverse and the capacitances may be computed on the<br />

basis <strong>of</strong> electrostatic analysis. The electric field in a space<br />

composed <strong>of</strong> conducting regions at known potentials and<br />

regions filled with different dielectrics can be solved by a<br />

pair <strong>of</strong> coupled integral equations [6]. The potential (A)<br />

at any point A on the surface <strong>of</strong> conductor is related to the<br />

charge density (B) on total surface <strong>of</strong> all boundaries<br />

(conductor–dielectric and dielectric–dielectric) by the<br />

equation:<br />

A B G B, A dS 0,<br />

(3)<br />

<br />

<br />

S<br />

where:<br />

1<br />

GB, A<br />

,<br />

(4)<br />

4 d AB<br />

and dAB is the distance between points A and B. The<br />

charge density (D) at any point D on the dielectric–<br />

dielectric boundary is related to the charge density (B)<br />

on total surface <strong>of</strong> all boundaries by the equation:<br />

o <br />

<br />

i<br />

D2 BDNB, DndS 0.<br />

<br />

(5)<br />

o i S<br />

Here, n is the unit vector normal to the surface at the<br />

point D, o is the permittivity <strong>of</strong> the region in the<br />

direction <strong>of</strong> the normal unit vector on the surface at the<br />

point D and i is the permittivity <strong>of</strong> the region in the<br />

opposite direction.<br />

Geometry <strong>of</strong> the windings can be approximated with<br />

axially symmetry, and the problem becomes twodimensional.<br />

Therefore, we may integrate surface<br />

integrals in (3) and (5) with respect to circumferential<br />

direction and reduce them to line integrals. Kernels <strong>of</strong><br />

integral equations are:<br />

1 r<br />

Prr ( , ) kKk 2<br />

r<br />

<br />

DN( r, r) DR( r, r) ar DZ( r, r) az<br />

<br />

<br />

<br />

1<br />

DR( r, r)<br />

<br />

4 r<br />

3<br />

r k<br />

<br />

2 r1k KkEk rr <br />

EkKk 2<br />

<br />

k 2r<br />

<br />

(6)<br />

1<br />

DZ( r, r) <br />

4r 3<br />

r zz k<br />

E k r 2 2r 1<br />

k<br />

2<br />

k <br />

4rr<br />

.<br />

2 2<br />

rr zz <br />

Here, k is the modulus <strong>of</strong> the elliptic integrals <strong>of</strong> the<br />

first and the second kind K(k) and E(k). The vector<br />

<br />

r ra r za<br />

z defines the position <strong>of</strong> the source point<br />

<br />

(B), and the vector r rar zaz<br />

defines the position <strong>of</strong><br />

the point <strong>of</strong> interest (A, D).<br />

- 138 - 15th IGTE Symposium 2012<br />

<br />

Terms Prr ( , ) , DR( r, r)<br />

and DZ( r, r)<br />

represent the<br />

kernels for electric potential and radial and axial<br />

components <strong>of</strong> electrical induction vector <strong>of</strong> uniformly<br />

charged ring with negligible cross section, respectively<br />

[7]. Because the conductors inside power transformers<br />

have rectangular cross section, their surface can be<br />

divided, using BEM approach, into either thin cylinders<br />

or thin discs. As can be seen on Fig.2, both <strong>of</strong> the cases,<br />

for the sake <strong>of</strong> generality, can be represented with<br />

truncated cone.<br />

truncated<br />

cone<br />

disc<br />

cylinder<br />

Fig. 2. General representation <strong>of</strong> the conductor segment<br />

division<br />

<br />

The final expressions for Prr ( , b, re)<br />

, after integrating<br />

(6) over l, are:<br />

1 2<br />

l r() t K( k )<br />

Prr ( , b, re) <br />

dt<br />

q<br />

0<br />

2 2<br />

e b e b<br />

<br />

l z z r r<br />

rt () ( re rb) trb zt () ( ze zb) tzb, where r(t) and z(t) represent parametric notation <strong>of</strong> the<br />

general point on segment l, l is the length <strong>of</strong> the segment<br />

and q and k are:<br />

2<br />

<br />

2 2<br />

q r rt () zzt () 2<br />

rrt ()<br />

4 rr( t)<br />

k .<br />

q<br />

<br />

<br />

From the equation E ,0, <br />

r z<br />

<br />

<br />

obtain the kernels <strong>of</strong> electrical induction vector:<br />

1<br />

l<br />

DR( r, rb, re) r( t)<br />

<br />

<br />

0<br />

2 2<br />

2() rtKk ()(1 k) Ek ()2() rt krrt () <br />

<br />

<br />

2 2 3 2<br />

k (1 k ) q<br />

<br />

1<br />

l rt () zzt () Ek<br />

( )<br />

DZ( r, rb, re) <br />

dt.<br />

2 3 2<br />

(1 k ) q<br />

0<br />

z<br />

z<br />

ze<br />

zb<br />

The system <strong>of</strong> integral equations (3) and (5) is solved<br />

by BEM. Boundaries and interfaces between two<br />

dielectrics are divided into finite segments and the<br />

l<br />

rb re r<br />

dt<br />

r<br />

(7)<br />

(8)<br />

we<br />

(9)


unknown distribution <strong>of</strong> surface charge density on the ith<br />

segment is approximated as linear combination <strong>of</strong><br />

predefined basis functions:<br />

N<br />

<br />

( r) <br />

t ( r).<br />

(10)<br />

i in in<br />

i1<br />

The simplest approach is to use basis functions which<br />

are constant (N=1) on the finite segment. The application<br />

<strong>of</strong> (10) to (3) and (5) results in:<br />

N<br />

<br />

( r) P( r, r, r) dC ; r C<br />

<br />

0 i p k i 0<br />

i1 Ci<br />

N<br />

o i<br />

<br />

i( r) 2 i<br />

DN( r, rp, rk) dCi; o <br />

i i1 Ci<br />

<br />

r Ci.<br />

(11)<br />

Here, C0 is boundary at known potential 0 and Ci is<br />

interface between two dielectrics. A linear equation<br />

system for unknown coefficients i is derived by<br />

enforcing an exact solution at midpoints <strong>of</strong> each finite<br />

segment (point-matching [8]). The integrals in (11) are<br />

Ci,<br />

the integrals become singular. In such a case their vicinity<br />

is treated separately and this contribution is calculated<br />

analytically (logarithmic singularities).<br />

After the determination <strong>of</strong> the surface charge<br />

distribution<br />

calculated by:<br />

on conductors, the capacitances are<br />

Qij<br />

Cij ; i j.<br />

<br />

(12)<br />

i j<br />

Here, i and j are potentials <strong>of</strong> i-th and j-th conductor,<br />

respectively, and Qij is total charge on the j-th conductor<br />

influenced by the charge on the i-th conductor:<br />

j<br />

<br />

<br />

Q dS S<br />

ij j j kj kj<br />

S<br />

k 1<br />

j<br />

N<br />

,<br />

- 139 - 15th IGTE Symposium 2012<br />

(13)<br />

where kj is surface charge density on k-th segment <strong>of</strong> jth<br />

conductor, Nj is the number <strong>of</strong> finite segments on j-th<br />

conductor and Skj is the surface <strong>of</strong> the k-th segment <strong>of</strong> jth<br />

conductor.<br />

We calculate the capacitances by setting the potential <strong>of</strong><br />

the i-th conductor to 1V and the potential <strong>of</strong> all other<br />

conductors to zero. Then, we obtain the total charge on<br />

conductors and use (12) to calculate the capacitances.<br />

III. CAPACITANCE CALCULATION -EXAMPLE<br />

The following procedure has been tested on two<br />

examples, first one showing the calculation <strong>of</strong> turn-toturn<br />

capacitances <strong>of</strong> high voltage winding in a power<br />

transformer and the second one showing a more<br />

"macroscopic" approach, where the conductors in one<br />

row <strong>of</strong> high voltage winding are grouped into disc and<br />

then the disc-by-disc capacitances are observed.<br />

The turn-to-turn capacitances were calculated in three<br />

ways:<br />

BEM approach. Total number <strong>of</strong> unknown<br />

coefficients <strong>of</strong> surface charge distribution was<br />

896.<br />

FEM approach using Ans<strong>of</strong>t Maxwell ®<br />

package.<br />

Total number <strong>of</strong> elements was 36180, and total<br />

number <strong>of</strong> nodes was 2103, which results in 0.1%<br />

error in calculation <strong>of</strong> energy.<br />

Analytical approach based on cylindrical<br />

approximation shown in (2) for calculation <strong>of</strong><br />

serial capacitance between radially neighboring<br />

conductors or parallel-plate approximation shown<br />

in (1) for calculation <strong>of</strong> parallel capacitance<br />

between axially neighboring conductors.<br />

Following proposed BEM approach, graphical<br />

depiction <strong>of</strong> one example where the middle conductor's<br />

potential is set to 1V, illustrating the distribution <strong>of</strong> the<br />

surface charge density on the observed conductor and the<br />

influenced surface charge densities on neighboring<br />

conductors can be seen on Fig. 3.<br />

Fig. 3. BEM solution for calculation <strong>of</strong> turn-to-turn<br />

capacitances<br />

The same example was solved using Ans<strong>of</strong>t Maxwell ®<br />

package, as can be seen on Fig. 4.<br />

Fig. 4. FEM solution for calculation <strong>of</strong> turn-to-turn<br />

capacitances (Ans<strong>of</strong>t Maxwell ® )<br />

Comparison <strong>of</strong> the results for turn-to-turn capacitance<br />

calculation <strong>of</strong> various approaches is given in Table 1. CiS<br />

and CiP represent the serial and parallel capacitance<br />

between two innermost conductors, CS and CP between


two middlemost conductors, and CoS and CoP between<br />

two outermost conductors.<br />

TABLE I<br />

TURN-TO-TURN CAPACITANCE RESULTS<br />

turn-to- CiS CiP CS CP CoS CoP<br />

turn [nF] [nF] [nF] [nF] [nF] [nF]<br />

® Maxwell 1.70 0.09 1.81 0.03 1.93 0.10<br />

Analytical 1.56 0.04 1.66 0.04 1.76 0.05<br />

BEM 1.68 0.09 1.77 0.03 1.89 0.10<br />

The disc-to-disc example results solved by BEM can be<br />

seen in Fig. 5 and the same example solved using Ans<strong>of</strong>t<br />

Maxwell ® package can be seen in Fig. 6.<br />

Fig. 5. BEM solution for calculation <strong>of</strong> disc-to-disc<br />

capacitances<br />

Fig. 6. FEM solution for calculation <strong>of</strong> disc-to-disc<br />

capacitances (Ans<strong>of</strong>t Maxwell ® )<br />

Comparison <strong>of</strong> the results for disc-to-disc capacitance<br />

calculation is given in Table 2.<br />

TABLE II<br />

DISC-TO-DISC CAPACITANCE RESULTS<br />

disc-to- Cbottom Cmid1 Cmid2 Ctop<br />

disc [nF] [nF] [nF] [nF]<br />

®<br />

Maxwell 2.83 2.88 2.90 2.83<br />

Analytical 2.36 2.36 2.36 2.36<br />

BEM 2.81 2.73 2.79 2.75<br />

The cumulative results show significant errors in<br />

analytical approach and justify the necessity <strong>of</strong><br />

application <strong>of</strong> numerical approaches. Even in the case <strong>of</strong><br />

very coarse discretization <strong>of</strong> the BEM approach, the<br />

- 140 - 15th IGTE Symposium 2012<br />

results differ by only a few percents from the results<br />

obtained by FEM.<br />

IV. COMPUTATION OF INDUCTANCES<br />

Using the analogy introduced in capacitance<br />

computation, inductances can be computed on the basis<br />

<strong>of</strong> magnetostatic analysis. The magnetic field in space<br />

composed <strong>of</strong> conducting regions with known currents and<br />

regions filled with different magnetic materials can be<br />

solved by a pair <strong>of</strong> coupled integral equations. Due to the<br />

linearity <strong>of</strong> the computation, by imposing the constant<br />

magnetic permeability, the magnetic vector potential and<br />

magnetic field strength can be written as:<br />

<br />

Ar ( ) AM( r) AS( r)<br />

(14)<br />

Hr ( ) HM( r) HS(<br />

r),<br />

where A <br />

is total magnetic vector potential, AM <br />

is<br />

magnetic vector potential caused by surface<br />

magnetization current density KM <br />

and S A is magnetic<br />

vector potential caused by imposed current density S J .<br />

The same subscripts and definitions are valid for<br />

magnetic field strength.<br />

<br />

Magnetic vector potential AA ( ) at any point A on the<br />

surface that restricts the model is related to the surface<br />

<br />

magnetization current density K( B)<br />

on total surface <strong>of</strong><br />

all boundaries by the equation:<br />

<br />

<br />

AA ( ) KBGBAdS ( ) ( , ) A(<br />

A),<br />

(15)<br />

<br />

S<br />

where GBA ( , ) is written in equation (4). The surface<br />

<br />

magnetization current density K( D)<br />

at any point D on<br />

the boundary <strong>of</strong> two different magnetic materials is<br />

related to the surface magnetization current density<br />

<br />

K( B)<br />

on total surface <strong>of</strong> all the boundaries by the<br />

equation:<br />

o <br />

i<br />

K( D) 2 K( B) HT( B, D) d S<br />

o i<br />

S<br />

(16)<br />

o <br />

i <br />

2 HS( D) n.<br />

<br />

o i<br />

Here, n is the unit vector normal to the surface at the<br />

point D, o is the permeability <strong>of</strong> the region in the<br />

direction <strong>of</strong> the normal unit vector on the surface at the<br />

point D and i is the permeability <strong>of</strong> the region in the<br />

opposite direction.<br />

Assuming the same simplification as in the capacitance<br />

calculation, geometry <strong>of</strong> the windings can be<br />

approximated with axially symmetry, and the problem<br />

becomes two-dimensional so we may integrate surface<br />

integrals in (15) and (16) with respect to circumferential<br />

direction and reduce them to line integrals. Kernels <strong>of</strong><br />

integral equations are:<br />

2<br />

1 r 1 k <br />

<br />

Grr ( , ) 1 Kk<br />

( ) Ek<br />

( ) <br />

rk <br />

2 <br />

<br />

<br />

(17)<br />

<br />

HT ( r, r) HR( r, r) a HZ( r, r) a n,<br />

<br />

S<br />

r z


where Grr ( , ) is the kernel for magnetic vector potential<br />

<br />

<br />

and HR( r, r)<br />

and HZ( r, r)<br />

are kernels for radial and<br />

axial components <strong>of</strong> magnetic field strength:<br />

k zz HR( r, r) K(<br />

k)<br />

<br />

4<br />

rr<br />

r<br />

2 2<br />

2<br />

r r zz <br />

<br />

Ek ( ) <br />

2 2<br />

rr zz <br />

<br />

k<br />

HZ( r, r) K<br />

( k)<br />

<br />

(18)<br />

4<br />

rr<br />

2 2<br />

2<br />

r r zz <br />

<br />

Ek ( ) <br />

2 2<br />

rr zz <br />

<br />

2 4rr<br />

k <br />

.<br />

2 2<br />

rr zz Here, k is the modulus <strong>of</strong> the elliptic integrals <strong>of</strong> the<br />

first and the second kind K(k) and E(k). The vector<br />

<br />

r ra r za<br />

z defines the position <strong>of</strong> the source point<br />

<br />

(B), and the vector r rar zaz<br />

defines the position <strong>of</strong><br />

the point <strong>of</strong> interest (A, D).<br />

The system <strong>of</strong> integral equations (15) and (16) is solved<br />

by BEM using the same technique mentioned in<br />

capacitance calculation section above, dividing the<br />

interfaces between different magnetic materials into finite<br />

segments and approximating the unknown distribution <strong>of</strong><br />

surface magnetization current density with linear<br />

combination <strong>of</strong> predefined basis functions:<br />

N<br />

<br />

Ki( r) Kint in(<br />

r).<br />

(19)<br />

i1<br />

Again, using the simplest adequate approach, the basis<br />

functions are constant on the finite segment (N=1). The<br />

application <strong>of</strong> (19) to (15) and (16) results in:<br />

N<br />

<br />

Ar ( ) K Gr ( , r) dC A( r); rC <br />

K r<br />

<br />

0 i i S<br />

0<br />

i1 Ci<br />

<br />

<br />

<br />

<br />

<br />

HrS ( r) arHzS ( r) azn; r Ci<br />

N<br />

i( ) o i<br />

<br />

<br />

Ki HT( r, r ) dCi<br />

2 o <br />

i i1 Ci<br />

- 141 - 15th IGTE Symposium 2012<br />

(20)<br />

Here, C0 is boundary at known magnetic vector<br />

potential and Ci is interface between two magnetic<br />

materials. Using the point-matching technique, a linear<br />

equation system for unknown coefficients Ki is derived.<br />

The example <strong>of</strong> the distribution <strong>of</strong> surface magnetization<br />

current density on the core <strong>of</strong> the transformer is shown in<br />

Fig. 7.<br />

As can be seen through inspecting equations (15) and<br />

(16), it is still necessary to determine the magnetic vector<br />

<br />

potential contribution <strong>of</strong> imposed current density AS( r)<br />

and their radial and axial components <strong>of</strong> magnetic field<br />

strength HrS ( r) and HzS ( r) .<br />

The calculation for magnetic vector potential and<br />

magnetic field strength <strong>of</strong> circular conductor <strong>of</strong><br />

rectangular cross section have been done in [9] and are<br />

presented here for the completeness <strong>of</strong> proposed method.<br />

Fig. 7. Distribution <strong>of</strong> surface magnetization current density on<br />

transformer core boundaries<br />

Using Fig. 8. for clarity, the equations are:<br />

T1( R1, R2, r, zZ1) T1( R1, R2, r, Z2 z)<br />

<br />

;<br />

z Z1<br />

<br />

TA(<br />

R1, R2, r) T1( R1, R2, r, zZ1) AS<br />

<br />

T1(<br />

R1, R2, r, Z2 z); Z1 z Z2<br />

<br />

<br />

T1( R1, R2, r, Z2 z) T1( R1, R2, r, zZ1) <br />

;<br />

z Z2<br />

(21)<br />

T2( R1, R2, r, zZ1) T2( R1, R2, r, Z2 z)<br />

<br />

;<br />

z Z1<br />

<br />

TB(<br />

R1, R2, r) T2( R1, R2, r, zZ1) H zS <br />

T2(<br />

R1, R2, r, Z2 z); Z1 z Z2<br />

<br />

<br />

T2( R1, R2, r, Z2 z) T2( R1, R2, r, zZ1) <br />

;<br />

z Z2<br />

(22)<br />

H T ( R , R , r, Z z) T ( R , R , r, zZ ).<br />

rS<br />

3 1 2 2 3 1 2 1<br />

R1 R2<br />

r<br />

Fig. 8. Circular conductor with rectangular cross section<br />

The subfunctions TA, TB, T1, T2 and T3 are:<br />

<br />

0Ira<br />

R2 r T1( R1, R2, r, a)<br />

ln<br />

<br />

4 <br />

<br />

<br />

R1r 2 2 <br />

R2 r a<br />

<br />

2 2 <br />

<br />

R1r a<br />

<br />

<br />

3 <br />

2<br />

0IrR2 sin d 2 X( R , r, a, ) aX( R , r, a,<br />

)<br />

<br />

0<br />

z<br />

Z2<br />

Z1<br />

<br />

2 2<br />

I


3 <br />

2<br />

0 1<br />

sin<br />

IrR d <br />

2 <br />

<br />

X( R , r, a, ) a X( R , r, a,<br />

)<br />

<br />

<br />

0<br />

1 1<br />

<br />

0Ia<br />

co s X( R2, r, a, ) 2<br />

<br />

0<br />

X( R1, r, a, ) d<br />

<br />

0Ir rsin arctan<br />

2 <br />

a<br />

0<br />

2<br />

<br />

R2 <br />

<br />

X( R2, r, a, ) 2<br />

R <br />

1<br />

<br />

X( R1, r, a,<br />

)<br />

<br />

<br />

2 <br />

0Ir<br />

a sincossin <br />

<br />

<br />

<br />

cossind <br />

<br />

4 R rcos <br />

X( R , r,<br />

a,<br />

)<br />

0<br />

2 2<br />

2<br />

2 <br />

0Ir<br />

a<br />

1<br />

R <br />

1 d X( R2, r, a, ) 4 <br />

R <br />

1<br />

<br />

X( R<br />

0 1,<br />

r, a,<br />

)<br />

<br />

sin cos sin<br />

,<br />

cos ( , , , ) d<br />

<br />

<br />

<br />

R r X R r a <br />

(23)<br />

1 1<br />

I Ia<br />

T2( R1, R2, r, a) R2 R1 <br />

2 2<br />

<br />

<br />

<br />

<br />

<br />

2<br />

ln<br />

R r <br />

<br />

<br />

R1r 2 2<br />

R2 r a<br />

<br />

2 2 <br />

R1r a<br />

<br />

<br />

Iar<br />

2<br />

<br />

sin <br />

<br />

1 R<br />

0<br />

2 rco s X( R2, r, a, ) <br />

R2<br />

<br />

<br />

X( R2, r, a,<br />

)<br />

<br />

<br />

d<br />

<br />

<br />

sin R1<br />

<br />

1<br />

d<br />

<br />

<br />

<br />

<br />

R<br />

0<br />

1rco s X( R1, r, a, ) X( R1, r, a,<br />

)<br />

<br />

<br />

<br />

2<br />

Ir rsin R2<br />

arctan<br />

2<br />

<br />

a<br />

<br />

<br />

<br />

X(<br />

R<br />

0<br />

2,<br />

r, a,<br />

)<br />

2<br />

R <br />

1<br />

sin d, X( R , r, a,<br />

)<br />

<br />

(24)<br />

1<br />

2 2 2<br />

X( R, r, a, ) R r a 2Rrcos (25)<br />

2 2<br />

Ir R2 r R2 r a<br />

T3( R1, R2, r, a)<br />

ln<br />

<br />

4 2 2 <br />

R1 r R1 r a <br />

<br />

<br />

<br />

<br />

2<br />

I Ir<br />

co s X( R2, r, a, ) X( R1, r, a, ) d<br />

2 <br />

4<br />

0<br />

<br />

sincossin R2<br />

<br />

1 d<br />

R<br />

0<br />

2 rco s X( R2, r, a, ) X( R2, r, a,<br />

)<br />

<br />

<br />

<br />

<br />

<br />

0<br />

<br />

<br />

<br />

<br />

sincos sin<br />

R1<br />

<br />

1d R1rcos X( R1, r, a,<br />

)<br />

X( R1, r, a,<br />

)<br />

<br />

<br />

<br />

<br />

I R2 R1 ; r R1<br />

<br />

T ( R , R , r) I<br />

R r ; R r R<br />

B<br />

1 2 2 1 2<br />

<br />

<br />

0; r R2.<br />

(26)<br />

- 142 - 15th IGTE Symposium 2012<br />

<br />

<br />

0Ir<br />

R2 R1; r R<br />

<br />

1<br />

2<br />

<br />

3 3<br />

0Ir<br />

r R <br />

1<br />

TA( R1, R2, r) R2 r ; R<br />

2 1 r R2<br />

2 <br />

<br />

3r<br />

<br />

<br />

3 3<br />

0I<br />

R2 R <br />

1<br />

<br />

; r R2<br />

2 <br />

<br />

3r<br />

<br />

Finally, the inductance calculation can be done using<br />

the equation for the stored magnetic energy:<br />

1 2 1 <br />

LI <br />

2 2<br />

JS AdV V<br />

(27)<br />

1 <br />

L J ( ) ( ) .<br />

2 S AM r AS r dV<br />

I <br />

V<br />

As can be seen from equation (27), the inductance <strong>of</strong> ith<br />

conductor can be separated into two parts:<br />

Li LiM LiS,<br />

(28)<br />

where LiM is the contribution <strong>of</strong> magnetizing currents<br />

and LiS is the contribution <strong>of</strong> imposed currents<br />

(inductance <strong>of</strong> conductor in free space).<br />

The self and mutual inductance calculations <strong>of</strong> circular<br />

conductors with rectangular cross section have been done<br />

in a couple <strong>of</strong> papers [10]-[13]. Technically, the<br />

equations for L and M are the same with the difference in<br />

the limits <strong>of</strong> integration. The calculation <strong>of</strong> the selfinductance<br />

can be observed as the special case <strong>of</strong> the<br />

mutual-inductance equation.<br />

With the assumption <strong>of</strong> uniform distribution <strong>of</strong> current<br />

on conductor's cross section, the total energy stored in the<br />

magnetic field <strong>of</strong> the conductor is:<br />

2 2<br />

<br />

Z2 Z4 R2 R4<br />

0JJ<br />

1 2<br />

W cos<br />

r<br />

2<br />

<br />

<br />

0zZ1 ZZ3 rR1 RR3 RdRdrdZdzd<br />

<br />

r R 2Rrcos zZ 2<br />

.<br />

(29)<br />

Using the expressions:<br />

1<br />

I<br />

W MI1I2; J (30)<br />

2<br />

S<br />

the equation for the mutual inductance <strong>of</strong> circular<br />

conductor with rectangular cross section is:<br />

0<br />

M <br />

Q,<br />

(31)<br />

( R2 R1)( Z2 Z1)( R4 R3)( Z4 Z3)<br />

where Q represents the above written quintuple integral<br />

in (29).<br />

Using the equivalences:<br />

Z3 Z1; Z4 Z2; R3 R1; R4 R2; I1 I2,<br />

(32)<br />

after analytically solving the four integrals for r, R, z and<br />

Z, according to [13], the final expression for the selfinductance<br />

<strong>of</strong> circular conductor <strong>of</strong> rectangular cross<br />

section in free space is:<br />

2 <br />

0N<br />

L (<br />

2 2<br />

2, 2, , ) ( 1, 1,<br />

, )<br />

2 1 <br />

Q R R H Q R R H <br />

R R H 0<br />

QR ( , R, H, ) QR<br />

( , R, H, ) d<br />

1 2 2 1


4<br />

<br />

2 <br />

h cos QrRh (, , , )<br />

<br />

<br />

30sin <br />

2 hcos<br />

bh arctan<br />

sin<br />

2 2 2<br />

2 2<br />

h cosrRsin3h r R cos<br />

<br />

2<br />

hsin bh 20<br />

<br />

4 2<br />

Rhsincos<br />

hrRcos <br />

arctan <br />

2 2<br />

Rsin bh <br />

<br />

4 2<br />

rhsincos<br />

hRrcos <br />

arctan <br />

2 2<br />

rsin bh <br />

<br />

2<br />

bh <br />

cos<br />

2 4 4 2 2<br />

3cos<br />

R r Rrcosr R <br />

15<br />

2<br />

2 2 <br />

2r R <br />

<br />

2<br />

bh <br />

<br />

bhcos ln <br />

<br />

<br />

2<br />

bh h<br />

<br />

2<br />

bh h<br />

<br />

2<br />

2 2 2 4 4<br />

r R 2cos R r<br />

<br />

8<br />

<br />

5 2 2<br />

R sin cos ln rRcos <br />

5<br />

b<br />

5 2 2<br />

r sin cos ln Rrcos b<br />

5<br />

3 2 2 2 2<br />

R cos 5h 3R sin <br />

ln rRcos <br />

15<br />

(33)<br />

2<br />

bh <br />

3 2 2 2 2<br />

r cos 5h 3r sin <br />

ln Rrcos <br />

15<br />

<br />

2<br />

bh ,<br />

<br />

<br />

2 2<br />

where N is the number <strong>of</strong> turns, b r R 2rRcos, and H Z2 Z1.<br />

The integral over in equation (33)<br />

cannot be written in closed form so it has to be solved<br />

numerically, solving the singularities in 0 and<br />

by using the l'Hôpital's rule.<br />

The above presented method for determining the<br />

inductance matrix <strong>of</strong> the power transformer windings has<br />

been tested for the various conductor positions and<br />

different magnetic permeabilities <strong>of</strong> the transformer core.<br />

Comparison showed that the difference between the<br />

pr<strong>of</strong>essional FEM tools (Ans<strong>of</strong>t Maxwell ®<br />

s<strong>of</strong>tware<br />

package) is way beyond 1%, which proves the accuracy<br />

and usefulness <strong>of</strong> the presented method.<br />

V. CONCLUSION<br />

In this paper we present the method for fast and precise<br />

computation <strong>of</strong> capacitances and inductances <strong>of</strong> high<br />

power transformer windings with coils <strong>of</strong> rectangular<br />

cross section based on the boundary element method.<br />

Geometry <strong>of</strong> the windings is axially symmetric, and the<br />

model may be reduced to two-dimensional axially<br />

symmetric problem. Capacitances are computed from<br />

static electric field solution. Surface charge distribution is<br />

determined by BEM solution <strong>of</strong> a pair <strong>of</strong> coupled integral<br />

equations for static electric fields. Inductances are<br />

computed from static magnetic field solution. Surface<br />

magnetization current distribution is determined by BEM<br />

- 143 - 15th IGTE Symposium 2012<br />

solution <strong>of</strong> a pair <strong>of</strong> coupled integral equations for static<br />

magnetic fields.<br />

Boundaries and interfaces are divided into line and arc<br />

finite segments, and the unknown distribution <strong>of</strong> surface<br />

charge or current density is approximated by piecewise<br />

constant functions. System <strong>of</strong> equations for unknown<br />

coefficients <strong>of</strong> distribution is obtained by “pointmatching”.<br />

The testing shows that even very coarse discretization<br />

results in satisfactory accuracy <strong>of</strong> the computation and<br />

therefore proves the applicability <strong>of</strong> the presented<br />

method.<br />

REFERENCES<br />

[1] E. Bjerkan and H. K. Høidalen, "High frequency FEM-based<br />

power transformer modeling: investigation <strong>of</strong> internal stresses due<br />

to network-initiated overvoltages", International Conference on<br />

Power Systems Transients (IPST'05), Montreal, Canada, June<br />

2005.<br />

[2] Y. Shibuya, T. Matsumoto and T. Teranishi, "Modelling and<br />

analysis <strong>of</strong> transformer winding at high frequencies", International<br />

Conference on Power Systems Transients (IPST'05), Montreal,<br />

Canada, June 2005.<br />

[3] K. Pedersen, M. E. Lunow, J. Holboell and M. Henriksen,<br />

"Detailed high frequency models <strong>of</strong> various winding types in<br />

power transformers", International Conference on Power Systems<br />

Transients (IPST'05), Montreal, Canada, June 2005.<br />

[4] G. Liang, H. Sun, X. Zhang, X. Cui, “Modeling <strong>of</strong> Transformer<br />

Windings Under Very Fast Transient Overvoltages” IEEE<br />

Transactions on Electromagnetic Compatibility, Vol. 48, No 4,<br />

November 2006.<br />

[5] M. Popov, L. van der Sluis, R. P. P. Smeets and J. L. Roldan,<br />

"Analysis <strong>of</strong> very fast transients in layer-type transformer<br />

windings", IEEE Transactions on Power Delivery, Vol. 22, No. 1,<br />

pp. 238-247, January 2007.<br />

[6] Ž. Štih, “High Voltage Insulating System Design by Application<br />

<strong>of</strong> Electrode and Insulator Contour Optimization”, IEEE<br />

Transactions on Electrical Insulation, Vol. EI-21, No.4, August<br />

1986.<br />

[7] P. Zhu, "Field distribution <strong>of</strong> a uniformly charged circular arc",<br />

Journal <strong>of</strong> Electrostatics, Vol. 63, pp. 1035-1047, March 2005.<br />

[8] Z. Haznadar, Ž. Štih, "Electromagnetic Fields, Waves and<br />

Numerical Methods", IOS Press, Amsterdam 2000.<br />

[9] J. T. Conway, "Trigonometric integrals for the magnetic field <strong>of</strong><br />

the coil <strong>of</strong> rectangular cross section", IEEE Transactions on<br />

Magnetics, Vol. 42, No. 5, pp. 1538-1548, May 2006.<br />

[10] S. I. Babic and C. Akyel, "New analytic-numerical solutions for<br />

the mutual inductance <strong>of</strong> two coaxial circular coils with<br />

rectangular cross section in air", IEEE Transactions on Magnetics,<br />

Vol. 42, No. 6, pp. 1661-1669, June 2006.<br />

[11] J. T. Conway, "Inductance calculations for circular coils <strong>of</strong><br />

rectangular cross section and parallel axes using Bessel and Struve<br />

functions", IEEE Transactions on Magnetics, Vol. 46, No. 1, pp.<br />

75-81, January 2010.<br />

[12] D. Yu, K. S. Han, "Self-Inductance <strong>of</strong> Air-Core Circular Coils<br />

with Rectangular Cross Section", IEEE Transactions on<br />

Magnetics, Vol. 23, No. 6, pp. 3916-3921, November 1987.<br />

[13] I. Doležel, "Self-inductance <strong>of</strong> an air cylindrical coil", Acta<br />

, Vol. 34, No. 4, pp. 443-473, 1989.


- 144 - 15th IGTE Symposium 2012<br />

Numerical and Experimental Investigations <strong>of</strong><br />

the Structural Characteristics <strong>of</strong> Stator Core<br />

Stacks<br />

Mathias Mair ∗ , Bernhard Weilharter †‡ , Siegfried Rainer § , Katrin Ellermann ∗ and Oszkár Bíró †§<br />

∗ Institute for Mechanics, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />

† Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical<br />

Machines, Austria<br />

‡ Institute for Electric Drives and Machines, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />

§ Institute for Fundamentals and Theory in Electrical Engineering, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />

E-mail: mair@tugraz.at<br />

Abstract—The response characteristics <strong>of</strong> two stator core stacks are investigated by experimental modal analysis.<br />

Furthermore, the modal parameters like the eigenfrequencies and eigenvectors are calculated from a numerical modal<br />

analysis. Afterwards, their frequency response functions are computed and compared with the measured frequency response<br />

functions. In order to achieve a set <strong>of</strong> material parameters, the computed response characteristics are adjusted to match<br />

the measured response characteristics.<br />

Index Terms—experimental modal analysis, homogeneous material model, numerical modal analysis, stator core stack<br />

I. INTRODUCTION<br />

The development <strong>of</strong> electrical machines requires an accurate<br />

dynamical analysis in order to reduce side effects<br />

<strong>of</strong> vibration, e.g. noise and material damage. Since the<br />

electrical machine consists <strong>of</strong> many complex and heterogeneous<br />

parts, like the stator core stack, the mechanical<br />

modeling <strong>of</strong> an electrical machine is a complicated task.<br />

Especially for the noise computation <strong>of</strong> electrical machines,<br />

the structural behavior <strong>of</strong> the stator core stack is<br />

<strong>of</strong> interest. It is mainly influenced by forces caused by<br />

the magnetic field in the air gap acting on the stator teeth<br />

[1]–[4]. An analytical method proposed by [5] allows<br />

for the computation <strong>of</strong> the stator vibration with a twodimensional<br />

ring model. However, the disadvantage <strong>of</strong><br />

this method is that it is not possible to consider the<br />

response characteristics along the axial direction. Other<br />

approaches, considering the stator core stack as a thin<br />

cylinder, have been investigated in [6]–[11].<br />

With numerical methods, e.g. the finite element method<br />

(FEM), the three dimensional behavior <strong>of</strong> the stator can<br />

be determined [12], [13]. The main problem is to set up<br />

an appropriate material model for the numerical analysis,<br />

which considers the heterogeneous composition <strong>of</strong> the<br />

stator core stack consisting <strong>of</strong> laminated sheets coated<br />

with resin [14].<br />

Experimental investigations <strong>of</strong> models consisting <strong>of</strong><br />

laminated iron sheets and a comparison with numerical<br />

simulations using three-dimensional homogeneous models<br />

has been presented in [15]. Another investigation <strong>of</strong> a<br />

stator core stack has been done by [16] in order to acquire<br />

isotropic material parameters. With this approach, it is<br />

possible to calculate modes with pure radial displacement<br />

adequately. A similar investigation has been presented by<br />

[17] with the difference that the used FEM-model has<br />

been computed with a material model <strong>of</strong> transversally<br />

isotropic elasticity. However, the measurement points<br />

have not been uniformly distributed on the outer ring<br />

surface. As a consequence, the measurement result <strong>of</strong> the<br />

response characteristics <strong>of</strong> the stator core stack is limited.<br />

In this paper, the three dimensional structural vibration<br />

behavior <strong>of</strong> stator core stacks is investigated by using<br />

the finite element method in conjunction with a modal<br />

analysis. The influence <strong>of</strong> the lamination along the axial<br />

direction will be considered by using a homogeneous material<br />

model with transversally isotropic elasticity. For the<br />

identification <strong>of</strong> the corresponding material parameters,<br />

two finite element models <strong>of</strong> stator core stacks have been<br />

set up, one with teeth and one without teeth. A modal<br />

analysis is carried out to determine the eigenfrequencies<br />

and eigenforms (modes) <strong>of</strong> the finite element models.<br />

Thereafter the frequency response characteristics <strong>of</strong> the<br />

two structures are computed with a reduced order model<br />

by applying a modal reduction.<br />

Acceleration measurements for which the structures<br />

have been excited with an electrodynamic shaker in a<br />

frequency range <strong>of</strong> 0 − 6000 Hz have been performed<br />

at 60 points on the stator core stacks. An experimental<br />

modal analysis then provides the measured response<br />

characteristics and eigenfrequencies and eigenforms [18].<br />

Finally, the results <strong>of</strong> the numerical investigation are<br />

compared with the results from the experimental modal<br />

analysis. The material parameters are adjusted step by<br />

step until the response characteristics <strong>of</strong> the numerical<br />

model approximate the measured one sufficiently. This<br />

way a set <strong>of</strong> material parameters for homogeneous material<br />

models for the stator core stacks is obtained, which<br />

describes the structural behavior adequately.


II. EXPERIMENTAL MODAL ANALYSIS (EMA)<br />

An experimental modal analysis allows for the determination<br />

<strong>of</strong> the response characteristics <strong>of</strong> the stator core<br />

stacks excited by a shaker. For this, acceleration sensors<br />

are recording the vibration at distinct measurement points<br />

on the stator core stacks. Then, the characteristic response<br />

behavior can be derived, transforming the resulting time<br />

signals into the frequency domain.<br />

A. Test stand for experimental modal analysis<br />

In order to measure the vibration on the stator core<br />

stacks, a test stand as shown in Fig. 1 has been built.<br />

Ropes composed <strong>of</strong> an elastic material are connected<br />

to a portal frame and suspend the stator core stack,<br />

additionally the table is mounted on air bearings. This<br />

reduces the influence <strong>of</strong> the adjacent structure to a minimum.<br />

In order to excite the structure, an electromagnetic<br />

shaker is mounted on the table. The connection between<br />

structure and shaker is realized by a push rod and a<br />

force sensor affixed to the test structure with a twocomponent<br />

adhesive. This way, the measurement setup<br />

can be built up and disassembled easily without machine<br />

tools. For the measurement, the shaker is controlled by<br />

a measurement system which also records the signals <strong>of</strong><br />

the acceleration sensors.<br />

Fig. 1: Test stand<br />

B. Measurements procedure<br />

The measurement points are located on the outer<br />

surface <strong>of</strong> the structure at sixty defined positions, see<br />

Fig. 2. The excitation is applied to point no. 1 for all<br />

measurements. At this point, the structure is excited by<br />

the shaker with a so-called periodic chirp signal in a<br />

frequency range from 2 Hz to 6400 Hz. The applied signal<br />

is repeated ten times with the entire frequency range<br />

passed through in each sequence within a short time<br />

period <strong>of</strong> 2.5 s.<br />

In the course <strong>of</strong> the measurement procedure, the frequency<br />

response function (FRF) in reference to the excitation<br />

point is determined for each measurement point.<br />

Finally, the arithmetically averaged acceleration and force<br />

signals <strong>of</strong> these ten measurement runs are used to derive<br />

the FRFs corresponding each <strong>of</strong> the measurement points.<br />

- 145 - 15th IGTE Symposium 2012<br />

55<br />

43<br />

31<br />

19<br />

7<br />

58<br />

46<br />

34<br />

22<br />

10<br />

49<br />

37<br />

25<br />

13<br />

Fig. 2: Defined measurement points<br />

C. Identification <strong>of</strong> modal parameters<br />

After all FRFs are measured, the next step is to<br />

identify the modal parameters. This is done by the so<br />

called PolyMAX frequency-domain method [19], which<br />

is a curve fitting technique. In a first step a least-squares<br />

method fits the polynomials<br />

p<br />

<br />

p<br />

−1 V0(Ω) =<br />

(1)<br />

z<br />

i=0<br />

i βi z<br />

i=0<br />

i αi<br />

to the measured FRFs. Thereby, Ω denotes the excitation<br />

frequency and V0(Ω) is called the frequency response<br />

matrix. βi are the coefficient numerator matrices, αi are<br />

the coefficient denominator matrices, zi are the complex<br />

basis vectors in the discrete frequency domain and p is<br />

the order <strong>of</strong> the polynomials.<br />

With the known polynomial functions, the eigenvalues<br />

λi and the modal participation vectors li can be calculated.<br />

To determine the eigenvectors ri as well as the<br />

lower and upper residual matrices LR and UR, a further<br />

least square method, based on the pole residual model<br />

q<br />

<br />

ril<br />

V(Ω) =<br />

i=0<br />

T i<br />

λi − jΩ + rilH <br />

i<br />

+<br />

λi − jΩ<br />

LR<br />

+UR , (2)<br />

Ω2 is applied. The dashed symbols mark the conjugate<br />

complex variables. The determined frequency response<br />

matrix V(Ω)<br />

⎡<br />

⎤<br />

f11 f12 ··· f1m<br />

⎢<br />

.<br />

⎢<br />

.<br />

⎥<br />

f21 f22<br />

V(Ω) = ⎢<br />

. ⎥<br />

⎢ .<br />

⎣<br />

. ⎥<br />

(3)<br />

.<br />

.. ⎦<br />

fn1 ··· fnm<br />

is filled with the frequency response functions fkl between<br />

the excitation point l and the measuring point k.<br />

The size <strong>of</strong> V(Ω) is therewith defined by m excitation<br />

points times n measuring points.<br />

The identified modal parameters and frequency response<br />

functions fkl allow a convenient description <strong>of</strong><br />

the measured response characteristics for the later comparison<br />

with numerical results.<br />

D. Measurement results<br />

In order to determine material parameters for a specific<br />

numerical model, the mode-shapes determined with the<br />

1


EMA, corresponding to the estimated eigenvalues, must<br />

be identified. To distinguish the different mode-shapes,<br />

a numbering system is established. The numbering comprises<br />

three numbers and refers to a cylindrical coordinate<br />

system. The first digit describes the number <strong>of</strong> maxima<br />

<strong>of</strong> the mode in radial direction along the azimuthal<br />

coordinate axis, see Fig. 3. The second digit represents<br />

the number <strong>of</strong> zero crossings <strong>of</strong> the mode in radial<br />

direction along the z-axis. The third digit is a counter to<br />

differentiate modes with the same deformation properties<br />

regarding the first two digits.<br />

z<br />

2<br />

y<br />

Fig. 3: Example for a mode (3,2,0)<br />

1) Results <strong>of</strong> the stator core stack without teeth:<br />

The sum <strong>of</strong> all measured as well as the sum <strong>of</strong> all<br />

approximated frequency response functions fkl, the latter<br />

calculated by the identified modal parameters, are depicted<br />

in Fig. 4, with the different mode patterns indicated<br />

by the introduced numbering system.<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

(2,1,0)<br />

(2,0,0)<br />

(2,2,3)<br />

(2,2,2)<br />

(2,2,1)<br />

(2,2,0)<br />

(3,2,2)<br />

(3,2,1)<br />

(3,2,0)<br />

(3,1,0)<br />

(3,0,0)<br />

(3,1,1)<br />

(3,2,3)<br />

500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000<br />

(4,0,0)<br />

(4,0,1)<br />

(3,3,0)<br />

(3,4,0)<br />

(3,3,1)<br />

(3,4,1)<br />

(3,3,2)<br />

(3,4,2) (4,2,0)<br />

(3,3,3) (3,4,3) (4,2,1)<br />

(3,3,4)<br />

(4,2,2)<br />

(3,3,5)<br />

(4,4,0)<br />

(4,4,1)<br />

3<br />

Sum <strong>of</strong> all approximated FRF‘s, using modal parameters<br />

Sum <strong>of</strong> all measured FRF‘s<br />

frequency [Hz]<br />

x<br />

(5,0,0)<br />

(5,4,0)<br />

(5,4,1)<br />

(5,4,2)<br />

(5,4,3)<br />

Fig. 4: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />

stator core stack without teeth<br />

As Fig. 4 shows, with the increase <strong>of</strong> the excitation<br />

frequency the mean <strong>of</strong> the magnitude decreases. This<br />

corresponds to mass dominated dynamical behavior. [20,<br />

p.294]<br />

- 146 - 15th IGTE Symposium 2012<br />

The modes (2,0,0), (2,1,0), (3,0,0), (3,1,0), (4,0,0),<br />

(4,0,1) and (5,0,0) have the most distinctive magnitude in<br />

the investigated frequency domain. Some <strong>of</strong> these modes,<br />

(2,0,0), (3,0,0), (4,0,0) and (5,0,0) could be calculated by<br />

analytical two-dimensional methods, see [7], [5] or [10].<br />

Other modes like (2,2,0) or (3,1,1) with non-uniform<br />

radial displacements along the z-axis can only be treated<br />

by three-dimensional approaches.<br />

Table I summarizes the measurement results for the<br />

stator core stack without teeth and lists modes with their<br />

appropriate eigenfrequencies and damping ratios.<br />

TABLE I: Modes, eigenfrequency and damping ratio <strong>of</strong><br />

stator core stack without teeth<br />

mode num. eigenfrequency damping ratio<br />

1/(2, 0, 0) 769,22 Hz 0,411387 %<br />

2/(2, 1, 0) 795,03 Hz 0,959535 %<br />

3/(2, 2, 0) 1282,95 Hz 1,197700 %<br />

4/(2, 2, 1) 1353,39 Hz 1,086200 %<br />

5/(3, 2, 0) 1728,14 Hz 1,442430 %<br />

6/(3, 1, 0) 2092,07 Hz 0,218612 %<br />

7/(3, 0, 0) 2109,96 Hz 0,093046 %<br />

8/(3, 1, 1) 2192,46 Hz 1,122050 %<br />

9/(3, 2, 0) 2278,98 Hz 0,691529 %<br />

10/(3, 3, 0) 2509,73 Hz 0,897709 %<br />

11/(3, 3, 1) 2569,24 Hz 1,057550 %<br />

12/(3, 3, 2) 2631,76 Hz 1,108720 %<br />

13/(4, 0, 0) 3860,04 Hz 0,043218 %<br />

14/(4, 0, 1) 3871,37 Hz 0,050271 %<br />

15/(3, 4, 0) 3947,18 Hz 0,496143 %<br />

16/(3, 4, 1) 4037,01 Hz 1,110130 %<br />

17/(4, 2, 0) 4414,03 Hz 0,606060 %<br />

18/(4, 2, 1) 4482,12 Hz 0,590154 %<br />

19/(4, 4, 0) 4756,58 Hz 0,348687 %<br />

20/(4, 4, 1) 4773,74 Hz 0,360285 %<br />

21/(5, 4, 0) 4940,08 Hz 0,930774 %<br />

22/(5, 0, 0) 5927,25 Hz 0,127039 %<br />

2) Results <strong>of</strong> the stator core stack with teeth: Similarly<br />

the results given in Fig. 4, the sum <strong>of</strong> all measured and<br />

the sum <strong>of</strong> all approximated frequency response function<br />

fkl for the stator core stack with teeth, are plotted in Fig.<br />

5.<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

1e-8<br />

(2,0,1)<br />

(2,0,0)<br />

(2,1,0)<br />

(2,1,1)<br />

(2,2,1)<br />

(2,2,0)<br />

(3,0,0)<br />

(3,1,0)<br />

(0,0,1)<br />

(0,0,2)<br />

(3,2,0)<br />

(4,0,0)<br />

(4,3,2)<br />

(4,3,1)<br />

(4,3,0)<br />

Sum <strong>of</strong> all approximated FRF‘s, using modal parameters<br />

Sum <strong>of</strong> all measured FRF‘s<br />

(4,0,1)<br />

500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000<br />

frequency [Hz]<br />

(4,1,0)<br />

(4,3,7)<br />

(4,3,6)<br />

(4,3,3)<br />

(4,3,4)<br />

(4,3,5)<br />

(5,1,0)<br />

(5,0,0) (6,1,0)<br />

(6,0,0)<br />

(0,1,0)<br />

(5,2,0)<br />

(5,4,0)<br />

(5,2,1)<br />

(5,4,1)<br />

(5,4,2)<br />

(5,4,3)<br />

Fig. 5: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />

stator core stack with teeth


Comparing the FRFs depicted in Fig. 4 and Fig. 5,<br />

the number <strong>of</strong> distinct modes in Fig. 5 is greater. Due<br />

to the higher mass, the corresponding eigenfrequencies<br />

<strong>of</strong> the stator core stack with teeth are lower than those<br />

<strong>of</strong> the stator core stack without teeth. Furthermore, the<br />

frequency spacing <strong>of</strong> distinct modes, e.g. (3,0,0) and<br />

(3,1,0) increases.<br />

It is impossible to identify modes which have higher<br />

eigenfrequencies than 5400 Hz, because the used measurement<br />

grid is too coarse to detect the appropriate<br />

eigenforms. In this frequency range, one would expect<br />

to see modes comprising seven maxima or more with a<br />

radial displacement along the azimuthal axis. This is not<br />

possible to identify with only twelve measurement points<br />

in the azimuthal direction.<br />

Table II lists measurement results for the stator core<br />

stack with teeth.<br />

TABLE II: Modes, eigenfrequency and damping ratio <strong>of</strong><br />

stator core stack with teeth<br />

mode num. eigenfrequency damping ratio<br />

1/(2, 0, 0) 661.27 Hz 0.054903 %<br />

2/(2, 0, 1) 667.84 Hz 0.055824 %<br />

3/(2, 1, 0) 720.08 Hz 0.337501 %<br />

4/(2, 1, 1) 727.85 Hz 0.332884 %<br />

5/(2, 2, 0) 1365.85 Hz 1.005810 %<br />

6/(2, 2, 1) 1416.91 Hz 0.939553 %<br />

7/(3, 0, 0) 1767.43 Hz 0.047843 %<br />

8/(3, 1, 0) 1851.83 Hz 0.205125 %<br />

9/(3, 2, 0) 2313.17 Hz 0.640636 %<br />

10/(0, 0, 1) 2372.92 Hz 0.654953 %<br />

11/(0, 0, 2) 2376.10 Hz 0.608697 %<br />

12/(4, 3, 0) 2755.43 Hz 0.382039 %<br />

13/(4, 0, 0) 3107.52 Hz 0.097675 %<br />

14/(4, 0, 1) 3116.28 Hz 0.152128 %<br />

15/(4, 1, 0) 3190.75 Hz 0.180018 %<br />

16/(4, 3, 3) 3288.28 Hz 0.743811 %<br />

17/(0, 1, 0) 3955.81 Hz 0.306280 %<br />

18/(5, 2, 0) 4074.92 Hz 0.188232 %<br />

19/(5, 0, 0) 4423.63 Hz 0.046342 %<br />

20/(5, 1, 0) 4484.19 Hz 0.173786 %<br />

21/(5, 4, 0) 4655.18 Hz 0.661100 %<br />

22/(5, 4, 1) 4745.87 Hz 0.237239 %<br />

23/(6, 0, 0) 5314.37 Hz 0.031471 %<br />

24/(6, 1, 0) 5356.55 Hz 0.039125 %<br />

III. NUMERICAL MODAL ANALYSIS<br />

As a means for the numerical simulation <strong>of</strong> the dynamical<br />

behavior <strong>of</strong> the stator core stacks, the finite element<br />

method is used. Therefore, an adequate simulation model<br />

has to be set up.<br />

For the numerical model based on the FEM - model,<br />

some simplifications are made:<br />

• the contacts between the laminations are neglected<br />

• the grain orientation <strong>of</strong> the cold rolled silicon-ironalloy<br />

is neglected<br />

• a linear and homogeneous material model is assumed<br />

• the FEM model is assumed to be linear<br />

- 147 - 15th IGTE Symposium 2012<br />

Using an adequate FEM-model and performing a numerical<br />

modal analysis, the modal parameters (eigenfrequencies,<br />

eigenforms, and damping coefficients) which<br />

can be directly related to the modal parameters from the<br />

measurements are obtained.<br />

A. Material model<br />

By considering the above limitations, a material model<br />

with transversally isotropic elasticity is implemented.<br />

z<br />

Fig. 6: Coordinate system <strong>of</strong> the stator core stack<br />

The used transversally isotropic elasticity <strong>of</strong> the material<br />

model corresponds to the coordinate system, depicted<br />

in Fig. 6. The flexibility matrix S for a material model<br />

with a transversally isotropic elasticity is given by<br />

⎡<br />

⎢<br />

S = ⎢<br />

⎣<br />

1<br />

Ex<br />

νxy<br />

− Ex<br />

1<br />

Ex<br />

νzy<br />

− Ex<br />

νxz<br />

− Ez<br />

νyz<br />

− Ez<br />

1<br />

Ez<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

0<br />

2(1+νxy)<br />

Ex<br />

0<br />

0<br />

0<br />

1<br />

Gxz<br />

0<br />

0<br />

0<br />

1<br />

Gxz<br />

− νyx<br />

Ex<br />

− νzx<br />

Ex<br />

x<br />

y<br />

⎤<br />

⎥ , (4)<br />

⎥<br />

⎦<br />

where E is Young’s modulus, G is the shear modulus and<br />

ν is Poisson’s ratio. The material model and therefore<br />

the flexibility matrix S <strong>of</strong> the transversally isotropic<br />

elasticity has rotationally symmetrical properties. Due to<br />

the symmetric properties <strong>of</strong> S, the following conditions<br />

are valid:<br />

νxy = νyx<br />

νxz<br />

Ez<br />

= νzx<br />

Ex<br />

B. Modal description<br />

Based on the finite element method, the equations <strong>of</strong><br />

motion <strong>of</strong> a structural damped system can be formulated<br />

as the linear system <strong>of</strong> differential equations<br />

(5)<br />

(6)<br />

M ¨ û + D ˙ û + Kû = f . (7)<br />

Thereby, M is the mass matrix, K is the stiffness matrix,<br />

f is the excitation force and D is the proportional


damping matrix defined by the Rayleigh damping model<br />

[12, p.950ff]<br />

D = α M + β K . (8)<br />

as a linear combination <strong>of</strong> the mass matrix M and the<br />

stiffness matrix K with the scalar coefficients α and β.<br />

Solving the eigenvalue problem <strong>of</strong> the undamped<br />

system leads to the eigenvalues λi and eigenvectors<br />

ri. The modal matrix R consisting <strong>of</strong> mass-normalized<br />

eigenvectors defines the transformation<br />

û = Rz (9)<br />

<strong>of</strong> the displacement in the global state space û to the<br />

displacement in the modal state space z.<br />

Using (9) and multiplying (7) with the transformed<br />

modal matrix R from the left, the proportional damped<br />

equations <strong>of</strong> motion are transformed into a noninteracting<br />

system <strong>of</strong> the dimension q<br />

˜M ¨z + ˜ D ˙z + ˜ Kz = ˜ f . (10)<br />

Since, the eigenvectors are mass-normalized, the transformation<br />

<strong>of</strong> the mass matrix M into the modal state<br />

space leads to the identity matrix I<br />

˜M = R T MR= I . (11)<br />

The transformed stiffness matrix becomes<br />

˜K = R T KR= Λ = diag ω 2 i , (12)<br />

where ωi is the undamped angular eigenfrequency <strong>of</strong> the<br />

i−th mode. The damping matrix yields a diagonal matrix<br />

expressed as<br />

˜D = R T DR= diag [2ζiωi] , (13)<br />

Thereby ζi denotes the modal damping ratio <strong>of</strong> the i−th<br />

mode [20, p.63ff].<br />

This approach yields simultaneously a modal reduction<br />

<strong>of</strong> the equation system. This reduction has the advantage<br />

that, without it, the computing effort increases rapidly.<br />

The disadvantage is that the disregarded modes create<br />

an error in the response characteristics. However, the<br />

error resulting from the material model with transversally<br />

isotropic elasticity is expected to be much higher than this<br />

error and thus the latter is neglected.<br />

Assuming a harmonic excitation, the excitation force<br />

can be expressed as<br />

f = ˇ f e jΩt<br />

(14)<br />

in the frequency domain. Here, ˇ f is the amplitude<br />

<strong>of</strong> the excitation force and Ω describes the excitation<br />

frequency. This leads to a harmonic ansatz for the modal<br />

displacement:<br />

z = ˇz e jΩt<br />

(15)<br />

where ˇz denotes the amplitude <strong>of</strong> the displacement in the<br />

modal state space. Hence, (10) becomes<br />

−Ω 2 I ˇz + jΩ ˜ Dˇz + ˜ Kˇz = ˇ f . (16)<br />

- 148 - 15th IGTE Symposium 2012<br />

Finally, the backward transformation in the global state<br />

space leads to<br />

ǔ = R<br />

<br />

−Ω 2 I + jΩ ˜ D + ˜ −1 K<br />

R T ˇ f = ˜V ˇ f . (17)<br />

Therewith, the numerically estimated frequency response<br />

matrix ˜V is<br />

<br />

˜V (Ω) = R −Ω 2 I + jΩ ˜ D + ˜ −1 K R T<br />

(18)<br />

Using (11), (13) and (12), the entries <strong>of</strong> ˜V (Ω) can be<br />

expressed by the frequency response functions ˜ fkl<br />

˜fkl(Ω) =<br />

q<br />

i=1<br />

ir ∗ k ir ∗ l<br />

−Ω 2 + ω 2 i + jΩ 2ζi ωi<br />

, (19)<br />

which can be related directly to the corresponding frequency<br />

response functions fkl(Ω) estimated by the measurement.<br />

Thereby ir∗ k denotes the k-th entry <strong>of</strong> the i-th<br />

mass-normalized eigenvector ri.<br />

IV. INFLUENCE OF MATERIAL PARAMETERS ON<br />

TRANSMISSION BEHAVIOR<br />

In order to estimate the influence <strong>of</strong> each material<br />

parameter, simulations as explained in section III-B are<br />

carried out for the stator model without teeth. The<br />

material parameters, except for the density, are varied<br />

in a distinct range and their influence on the structural<br />

behavior is investigated.<br />

The density is determined by measurements. With a<br />

mass <strong>of</strong> 149.8kg and a volume from 0.019905 m 3 , a<br />

density <strong>of</strong> 7525 kg/m 3 results. This value is used for all<br />

calculations <strong>of</strong> this study.<br />

The material parameters <strong>of</strong> interest for the influence<br />

on the dynamical behavior are the Young’s moduli Ex<br />

and Ez, the shear module Gxz and the Poisson ratios<br />

νxy and νxz. The initial set <strong>of</strong> material parameters is<br />

shown in Table III. Based on this set, each parameter is<br />

TABLE III: Initial dataset <strong>of</strong> material parameters for a<br />

variation <strong>of</strong> each parameter<br />

material parameters values<br />

Ex<br />

190 · 109 N/m 2<br />

Ez<br />

25 · 109 N/m 2<br />

Gxz<br />

10 · 109 N/m 2<br />

νxy<br />

0.3<br />

νxz<br />

0.3<br />

ϱ 7525 kg/m 3<br />

varied to a lower and a higher value. The influence <strong>of</strong><br />

each parameter on the frequency response functions is<br />

shown below. The results <strong>of</strong> these investigations are the<br />

basis on which the adjustment <strong>of</strong> the simulated response<br />

characteristics on the measured response characteristics<br />

is done.


A. Variation <strong>of</strong> the Poisson ration ν<br />

In order to analyse the influence <strong>of</strong> the Poisson ratios<br />

on the response characteristics, the values are varied from<br />

0.2 to 0.4 . The Poisson ratios νxy and νxz are set equal<br />

for the calculations. Fig. 7 shows the sum <strong>of</strong> all frequency<br />

response functions for the varied Poisson ratios.<br />

magnitude [ m N ]<br />

-4<br />

10 v = 0.2<br />

v = 0.3<br />

v = 0.4<br />

10 -5<br />

10 -6<br />

10 -7<br />

500 1000 1500 2000 2500 3000<br />

frequency [Hz]<br />

Fig. 7: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />

stator core stack without teeth for varied Poisson ratio<br />

This comparison evidences that the influence <strong>of</strong> the<br />

Poisson ratios on the response characteristics is insignificantly<br />

small. Therefore, for νxy and νxz a value <strong>of</strong> 0.3<br />

is chosen.<br />

B. Variation <strong>of</strong> the elastic modulus Ex<br />

In Fig. 8, the sum <strong>of</strong> the calculated frequency response<br />

functions for the variation Ex is depicted. It can be ob-<br />

magnitude [ m N ]<br />

10 -4<br />

10 -5<br />

10 -6<br />

10 -7<br />

1<br />

2<br />

4<br />

5<br />

11 11a<br />

500 1000 1500<br />

frequency [Hz]<br />

2000 2500 3000<br />

7<br />

8<br />

9 2<br />

E x = 170· 10 N/m<br />

9 2<br />

E x = 190· 10 N/m<br />

9 2<br />

E x = 210· 10 N/m<br />

Fig. 8: Sum <strong>of</strong> all frequency response functions <strong>of</strong><br />

the stator core stack without teeth for varied Young’s<br />

modulus Ex<br />

served that some eigenfrequencies, for example for mode<br />

4, 5 or 11, are not influenced by the Young’s modulus Ex.<br />

Other modes, such as 7, 8, or 11a, are heavily affected by<br />

it. A small variation <strong>of</strong> the Young’s modulus Ex yields<br />

a large shift <strong>of</strong> distinct eigenfrequencies. If the value <strong>of</strong><br />

Ex decreases, the eigenfrequencies corresponding to the<br />

modes 7, 8, or 11a are declining and vice versa. Also<br />

- 149 - 15th IGTE Symposium 2012<br />

the eigenfrequencies <strong>of</strong> the corresponding modes 1 and<br />

2 depend on the Young’s modulus Ex but not as much<br />

as the previous ones.<br />

C. Variation <strong>of</strong> the elastic modulus Ez<br />

The variation <strong>of</strong> the material parameter Ez, depicted<br />

in Fig. 9, leads to a different dynamical behavior than<br />

the variation <strong>of</strong> Ex. The eigenfrequencies corresponding<br />

to the modes 1, 2 and 7 are not influenced by a variation<br />

magnitude [ m N ]<br />

10 -4<br />

10 -5<br />

10 -6<br />

10 -7<br />

2<br />

1<br />

3<br />

4<br />

500 1000 1500<br />

frequency [Hz]<br />

2000 2500 3000<br />

7<br />

9 2<br />

E z = 20· 10 N/m<br />

9 2<br />

E z = 25· 10 N/m<br />

9 2<br />

E = 30· 10 N/m<br />

Fig. 9: Sum <strong>of</strong> all frequency response functions <strong>of</strong><br />

the stator core stack without teeth for varied Young’s<br />

modulus Ez<br />

<strong>of</strong> the Young’s modulus Ez. When increasing the value<br />

<strong>of</strong> Ez, all other eigenfrequencies are shifted upwards<br />

in the considered frequency range and when decreasing<br />

Ez, these eigenfrequencies are shifted downwards. It is<br />

interesting to note, that the eigenfrequencies which are<br />

independent <strong>of</strong> the Young’s modulus Ez (mode 1, 2, 7),<br />

depend on the Young’s modulus Ex.<br />

D. Variation <strong>of</strong> the shear modulus Gxz<br />

Finally, theresults for the variation <strong>of</strong> the shear modulus<br />

Gxz is shown in Fig. 10. It can be seen that the<br />

magnitude [ m N ]<br />

10 -4<br />

10 -5<br />

10 -6<br />

10 -7<br />

1<br />

2<br />

500 1000 1500<br />

frequency [Hz]<br />

2000 2500 3000<br />

6<br />

9<br />

z<br />

10<br />

9 2<br />

G xz = 8· 10 N/m<br />

9 2<br />

G xz = 10· 10 N/m<br />

9 2<br />

G = 12· 10 N/m<br />

Fig. 10: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />

stator core stack without teeth by varied shear modulus<br />

Gxz<br />

5<br />

xz<br />

11<br />

12


eigenfrequencies corresponding to the modes 1 and 2 are<br />

independent <strong>of</strong> the variation <strong>of</strong> Gxz. Other modes, like<br />

5, 6, 9, 11 or 12, are heavily affected by the variation<br />

<strong>of</strong> the shear modulus. These eigenfrequencies are shifted<br />

downwards by decreasing and upwards by increasing the<br />

value <strong>of</strong> Gxz.<br />

Summing up, the influence <strong>of</strong> the material properties<br />

on the eigenfrequencies is strongly influenced by the<br />

corresponding eigenmode occurring at these frequencies.<br />

Depending on the characteristics (radial, azimuthal or axial<br />

bending) <strong>of</strong> the mode, Ex, Ez and Gxz have different<br />

influences. This background is important to know for the<br />

latter adjustment <strong>of</strong> the response characteristics.<br />

V. ADJUSTMENT OF MATERIAL PARAMETERS<br />

The dynamical behavior <strong>of</strong> the numerical model<br />

strongly depends on the chosen material parameters.<br />

An iterative process optimizes the eigenfrequencies and<br />

eigenvectors based on the known influence <strong>of</strong> the material<br />

parameters as discussed in section IV. Material<br />

parameters can be found by an adequate adjustment <strong>of</strong><br />

the measured and simulated response characteristics.<br />

A. Stator core stack without teeth<br />

First, a dataset <strong>of</strong> material parameters is chosen which<br />

describes the behavior <strong>of</strong> isotropic elasticity. Thereafter,<br />

the material parameters <strong>of</strong> the transversally isotropic<br />

elasticity are identified for the stator core stack without<br />

teeth.<br />

1) Comparison <strong>of</strong> measured and calculated frequency<br />

response functions by using the isotropic dataset: The<br />

results from the numerical simulation using isotropic<br />

material parameters (Table IV) are compared with the<br />

results from the experimental investigation, see Fig. 11.<br />

It can be seen that there is no analogy between the simu-<br />

TABLE IV: Initial dataset <strong>of</strong> material parameters <strong>of</strong><br />

isotropic elasticity<br />

material parameters values<br />

Ex<br />

210 · 109 N/m 2<br />

Ez<br />

210 · 109 N/m 2<br />

Gxz<br />

Ex<br />

2(1+ν)<br />

ν 0.3<br />

ϱ 7525 kg/m 3<br />

lated and measured response characteristics. Only the first<br />

eigenfrequency <strong>of</strong> the measured and simulated FRFs are<br />

similar. Furthermore, in the investigated frequency range<br />

less eigenfrequencies arise for the simulation model. This<br />

comparison shows that a material model with isotropic<br />

elasticity is clearly unsuitable.<br />

2) Adjustment <strong>of</strong> the measured and simulated frequency<br />

response function by using transversal isotropic<br />

elasticity: In a next step, a material model with transversally<br />

isotropic elasticity is used and the needed material<br />

parameters are adjusted. Table V lists these material<br />

parameters used as an initial dataset for the simulation.<br />

- 150 - 15th IGTE Symposium 2012<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

10 -8<br />

Sum <strong>of</strong> simulated FRF‘s in radial direction<br />

Sum <strong>of</strong> measured FRF‘s in radial direction<br />

500 1000 1500 2000 2500 3000<br />

frequency [Hz]<br />

Fig. 11: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and<br />

calculated FRFs with isotropic material model <strong>of</strong> the<br />

stator core stack without teeth<br />

Thereby, the Young’s modulus Ez and the shear modulus<br />

Gxz are chosen considerably lower with 40·10 9 N/m 2 and<br />

15 · 10 9 N/m 2 . This shifts the eigenfrequencies downward<br />

in the considered frequency range as explained in section<br />

IV.<br />

Looking at Fig. 12, it can be seen that in the regarded<br />

frequency range the measured and simulated eigenfrequencies<br />

<strong>of</strong> the modes (2,0,0) and (3,0,0) are close<br />

together.<br />

TABLE V: Dataset <strong>of</strong> material parameters <strong>of</strong> transversal<br />

isotropic elasticity<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

material parameters values<br />

Ex<br />

210 · 109 N/m 2<br />

Ez<br />

40 · 109 N/m 2<br />

Gxz<br />

15 · 109 N/m 2<br />

νxy<br />

0.3<br />

νxz<br />

0.3<br />

ϱ 7525 kg/m 3<br />

mode (2,0,0)<br />

Sum <strong>of</strong> simulated FRF‘s in radial direction<br />

Sum <strong>of</strong> measured FRF‘s in radial direction<br />

mode (3,0,0)<br />

10<br />

500 1000 1500 2000 2500 3000<br />

-8<br />

frequency [Hz]<br />

Fig. 12: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and calculated<br />

FRFs with transversally isotropic material model<br />

<strong>of</strong> the stator core stack without teeth<br />

Now the modes are adjusted by decreasing the elastic


modulus Ex. The value is optimized manually step-bystep<br />

until an adequate match is attained. For the Young’s<br />

modulus Ex, a value could be found which aligns the<br />

measured and simulated modes (2,0,0) and (3,0,0). The<br />

next step in the optimization is to adjust the mode<br />

(3,1,0). Therefore the shear modulus Gxz is reduced.<br />

Finally, with the Young’s modulus Ez, the other modes<br />

between (2,0,0) and (3,0,0) can be influenced. A stepwise<br />

reduction <strong>of</strong> this value yields an adequate correlation <strong>of</strong><br />

the other modes.<br />

The resulting material parameters <strong>of</strong> this optimization<br />

<strong>of</strong> the stator core stack without teeth are listed in Table<br />

VI.<br />

TABLE VI: Resulting dataset <strong>of</strong> material parameters<br />

for the stator core stack without teeth <strong>of</strong> transversally<br />

isotropic elasticity<br />

material parameters values<br />

Ex<br />

191, 8 · 109 N/m 2<br />

Ez<br />

24, 7 · 109 N/m 2<br />

Gxz<br />

11 · 109 N/m 2<br />

νxy<br />

0, 3<br />

νxz<br />

0, 3<br />

ϱ 7525 kg/m 3<br />

Fig. 13 depicts the response characteristics <strong>of</strong> the<br />

simulation results, using optimized material parameters<br />

and the measurement results for the stator core stack<br />

without teeth. A good approximation for the simulated<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

mode (2,0,0)<br />

Mode mode (2,0,0) (2,1,0)<br />

Mode mode (2,2,0)<br />

mode (2,2,1)<br />

mode (2,2,3)<br />

mode (3,1,0)<br />

mode (3,2,1)<br />

Sum <strong>of</strong> simulated FRF‘s in radial direction<br />

Sum <strong>of</strong> measured FRF‘s in radial direction<br />

mode (3,0,0)<br />

mode (3,2,3)<br />

mode (3,3,0)<br />

mode (3,3,4)<br />

10<br />

500 1000 1500 2000 2500 3000<br />

-8<br />

frequency [Hz]<br />

Fig. 13: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and<br />

calculated FRFs <strong>of</strong> the stator core stack without teeth,<br />

by using a material model with transversally isotropic<br />

elasticity and optimized parameters<br />

response characteristics can be observed. Hence, the<br />

identified material parameters for a linear and homogeneous<br />

material model can represent a similar dynamical<br />

behavior as the real structure <strong>of</strong> the stator core without<br />

teeth in a frequency range from 0Hzto 3000 Hz.<br />

The coincident eigenfrequencies and their corresponding<br />

modes are listed in Table VII.<br />

- 151 - 15th IGTE Symposium 2012<br />

TABLE VII: Coincident measured and simulated eigenfrequencies<br />

and modes <strong>of</strong> the stator core stack without<br />

teeth, resulting from adjustment<br />

modes measured eigenfreq. simulated eigenfreq.<br />

(2, 0, 0) 769.22 Hz 749.11 Hz<br />

(2, 1, 0) 795.03 Hz 757.16 Hz<br />

(2, 2, 0) 1280.95 Hz 1278.38 Hz<br />

(2, 2, 1) 1353.39 Hz 1362.88 Hz<br />

(2, 2, 3) 1606.08 Hz 1607.62 Hz<br />

(3, 2, 1) 1881.54 Hz 1869.87 Hz<br />

(3, 1, 0) 2092.07 Hz 2095.87 Hz<br />

(3, 0, 0) 2109.96 Hz 2097.67 Hz<br />

(3, 2, 3) 2278.98 Hz 2313.00 Hz<br />

(3, 3, 0) 2509.72 Hz 2512.20 Hz<br />

(3, 3, 4) 2826.32 Hz 2830.84 Hz<br />

B. Stator core stack with teeth<br />

As an initial dataset for the stator core stack with teeth,<br />

the resulting material parameters <strong>of</strong> the stator core stack<br />

without teeth have been chosen.<br />

For the investigation <strong>of</strong> the stator core stack with<br />

teeth the density has to be determined. With a weight <strong>of</strong><br />

196.4kgand a volume <strong>of</strong> 2.61335·10−2 m3 the density <strong>of</strong><br />

the stator core stack with teeth results in 7515.3 kg/m 3 .<br />

This is 0.14 % less than the density <strong>of</strong> the stator core<br />

stack without teeth. Hence, all simulations for the stator<br />

core stack with teeth use the newly found density.<br />

1) Comparison <strong>of</strong> measured and calculated frequency<br />

response functions by using the initial dataset: The identified<br />

material parameters are validated by a comparison<br />

<strong>of</strong> the measured and the calculated response characteristics<br />

<strong>of</strong> the stator core stack with teeth, see Fig. 14. The<br />

comparison shows that the match <strong>of</strong> the measured data<br />

with the simulation results using the material parameters<br />

in Table VI for the stator core stack without teeth is not<br />

satisfactory. Only the modes (2,0,0) and (3,0,0) have a<br />

smaller deviation than the other modes. Therefore, the<br />

material parameters <strong>of</strong> the stator core stack with teeth<br />

are determined again.<br />

magnitude [ m N ]<br />

10 -5<br />

mode (2,0,0)<br />

10 -6<br />

10 -7<br />

10 -8<br />

mode (2,1,0)<br />

mode (3,0,0) mode (3,1,0)<br />

mode (2,2,x)<br />

Sum <strong>of</strong> simulated FRF‘s in radial direction<br />

Sum <strong>of</strong> measured FRF‘s in radial direction<br />

mode (3,2,x)<br />

mode (3,3,x)<br />

500 1000 1500<br />

frequency [Hz]<br />

2000 2500 3000<br />

Fig. 14: Validation <strong>of</strong> the identified material parameters<br />

by comparing results <strong>of</strong> a the simulation and measurement<br />

<strong>of</strong> the stator core stack with teeth


2) Adjustment <strong>of</strong> measured and simulated frequency<br />

response function by using transversal isotropic elasticity:<br />

The procedure <strong>of</strong> the adjustment <strong>of</strong> the material<br />

parameters is the same as for the stator core stack without<br />

teeth. For the initial dataset, the material parameters<br />

identified in section V-A are used. The optimization<br />

yields a set <strong>of</strong> material parameters listed in Table VIII.<br />

TABLE VIII: Dataset <strong>of</strong> material parameters <strong>of</strong> transversally<br />

isotropic elasticity with optimized Ez<br />

material parameters values<br />

Ex<br />

199.8 · 109 N/m 2<br />

Ez<br />

20.1 · 109 N/m 2<br />

Gxz<br />

9.9 · 109 N/m 2<br />

νxy<br />

0.3<br />

νxz<br />

0.3<br />

ϱ 7515.3 kg/m 3<br />

Fig. 15 shows that eigenfrequencies exist in the simulation<br />

which do not occur in the measurement. Furthermore<br />

the linear and homogeneous model does not adjust<br />

the modes (2,1,0), (3,1,0) and (4,1,0) to the measured<br />

eigenfrequencies <strong>of</strong> the equivalent simulated modes. The<br />

measured distance in the frequency range <strong>of</strong> about 100 Hz<br />

between the modes (2,0,0) and (2,1,0) or (3,0,0) and<br />

(3,1,0) etc. could not be represented.<br />

magnitude [ m N ]<br />

10 -5<br />

10 -6<br />

10 -7<br />

10 -8<br />

mode (2,0,0)<br />

mode (2,1,0)<br />

mode (3,0,0)<br />

mode (2,3,0)<br />

mode (2,2,x)<br />

mode (2,2,0)<br />

mode (3,1,0)<br />

Sum <strong>of</strong> simulated FRF‘s in radial direction<br />

Sum <strong>of</strong> measured FRF‘s in radial direction<br />

mode (3,2,x)<br />

mode (3,3,0)<br />

mode (4,3,2)<br />

mode (4,3,1)<br />

mode (4,0,0)<br />

500 1000 1500 2000<br />

frequency [Hz]<br />

2500 3000<br />

Fig. 15: Comparison <strong>of</strong> measured and simulated frequency<br />

response functions <strong>of</strong> the stator core stack with<br />

teeth in a frequency domain from 500 Hz to 3300 Hz<br />

Table IX lists modes and their corresponding measured<br />

and simulated eigenfrequencies which could be approximately<br />

adjusted in a frequency range from 500 Hz to<br />

3300 Hz.<br />

VI. SUMMARY AND CONCLUSION<br />

For the investigation <strong>of</strong> electrical machines, the dynamical<br />

behavior <strong>of</strong> the stator core is <strong>of</strong> high interest. The<br />

mechanical characterization is difficult since the structure<br />

<strong>of</strong> the stator core is inhomogeneous. In this paper, an<br />

approach has been presented which yields a linear and<br />

homogeneous description <strong>of</strong> a stator core stack.<br />

For the investigation <strong>of</strong> the dynamical behavior, two<br />

stator core stacks, one without stator teeth and the other<br />

- 152 - 15th IGTE Symposium 2012<br />

TABLE IX: Coincident measured and simulated eigenfrequencies<br />

and modes <strong>of</strong> the stator core stack with teeth,<br />

resulting from adjustment<br />

modes measured eigenfreq. simulated eigenfreq.<br />

(2, 0, 0) 661.27 Hz 662.79 Hz<br />

(2, 1, 0) 720.08 Hz 611.87 Hz<br />

(2, 2, 0) 1365.85 Hz 1358.87 Hz<br />

(3, 0, 0) 1767.43 Hz 1777.50 Hz<br />

(3, 1, 0) 1854.83 Hz 1782.64 Hz<br />

(4, 3, 1) 2819.37 Hz 2955.25 Hz<br />

(4, 3, 2) 2962.95 Hz 2955.25 Hz<br />

(4, 0, 0) 3107.52 Hz 3134.49 Hz<br />

with stator teeth, have been chosen. The experimental<br />

modal analysis have been carried out on both stator core<br />

stacks. The measurement results have been used for the<br />

adjustment <strong>of</strong> the simulation data.<br />

The numerical modal analysis has been applied in<br />

conjunction with the finite element method. For that,<br />

adequate models had to be chosen. The inhomogeneous<br />

structure has been represented by a linear and homogeneous<br />

FEM model and the lamination <strong>of</strong> the stator cores<br />

has been considered by a transversally isotropic material<br />

model.<br />

A study <strong>of</strong> the influence <strong>of</strong> the transversally isotropic<br />

material parameters has been carried out. Thereby, each<br />

material parameter has been varied and the resulting<br />

FRFs have been compared. It could be identified,<br />

which material parameter influences which mode. This<br />

knowledge is the basis <strong>of</strong> the adjustment <strong>of</strong> the simulated<br />

response characteristics <strong>of</strong> the stator cores.<br />

Before adjusting the response characteristics, a comparison<br />

<strong>of</strong> the simulated results using a material model<br />

<strong>of</strong> isotropic elasticity with the measurement results has<br />

been carried out for the stator core stack without teeth.<br />

It could be shown that a material model with isotropic<br />

elasticity is not appropriate.<br />

The stepwise adjustment <strong>of</strong> the simulated FRFs to<br />

the measured have shown a good match <strong>of</strong> the response<br />

characteristic <strong>of</strong> the stator core stack without teeth up to<br />

a frequency <strong>of</strong> 3kHz. However, some measured modes<br />

could not be identified with the used linear and homogeneous<br />

numerical model. The approximated material<br />

parameters have then been validated by a comparison <strong>of</strong><br />

the measured and simulated dynamical behavior <strong>of</strong> the<br />

stator core stack with teeth. This validation shows that<br />

the response characteristic <strong>of</strong> the measured and simulated<br />

results did not coincide except for some simulated<br />

eigenfrequencies.<br />

In order to improve the numerical model <strong>of</strong> the stator<br />

core stack with teeth, a further optimization <strong>of</strong> the material<br />

parameters has been carried out. The match <strong>of</strong> the<br />

measured and simulated response characteristics is not<br />

as good as for the stator core stack without teeth. In the<br />

investigated frequency range <strong>of</strong> the measured data some<br />

eigenfrequencies and eigenmodes could not be identified<br />

in the simulation.<br />

However, a working method has been introduced,


which describes the three dimensional dynamical behaviour<br />

<strong>of</strong> a stator core stack by using a linear and<br />

homogeneous numerical model.<br />

REFERENCES<br />

[1] B. Weilharter, O. Biro, H. Lang, G. Ofner, and S. Rainer, “Validation<br />

<strong>of</strong> a comprehensive analytic noise computation method for<br />

induction machines,” Industrial Electronics, IEEE Transactions<br />

on, vol. 59, no. 5, pp. 2248 –2257, may 2012.<br />

[2] J. Li, X. Song, D. Choi, and Y. Cho, “Research on reduction<br />

<strong>of</strong> vibration and acoustic noise in switched reluctance motors,”<br />

in Advanced Electromechanical Motion Systems Electric Drives<br />

Joint Symposium, 2009. ELECTROMOTION 2009. 8th International<br />

Symposium on, july 2009, pp. 1 –6.<br />

[3] S. Rainer, O. Biro, B. Weilharter, and A. Stermecki, “Weak coupling<br />

between electromagnetic and structural models for electrical<br />

machines,” Magnetics, IEEE Transactions on, vol. 46, no. 8, pp.<br />

2807 –2810, aug. 2010.<br />

[4] B. Weilharter, O. Bi? andro? and, S. Rainer, and A. Stermecki,<br />

“Computation <strong>of</strong> rotating force waves in skewed induction machines<br />

using multi-slice models,” Magnetics, IEEE Transactions<br />

on, vol. 47, no. 5, pp. 1046 –1049, may 2011.<br />

[5] H. Jordan, Geräuscharme Elektromotoren. Essen: Verlag W.<br />

Girardet, 1951.<br />

[6] R. N. Arnold and G. B. Warburton, “Flexural vibrations <strong>of</strong> the<br />

walls <strong>of</strong> thin cylindrical shells having freely supported ends,” <strong>Proceedings</strong><br />

<strong>of</strong> the Royal Society <strong>of</strong> London. Series A, Mathematical<br />

and Physical Sciences, vol. 197, no. 1049, pp. 238–256, Jun. 1949.<br />

[7] H. Frohne, “Über die primären bestimmungsgrößen er<br />

lautstärke bei asynchronmotoren,” Ph.D. dissertation, Technische<br />

Hochschule Hannover, 1959.<br />

[8] E. C. Naumann and J. L. Sewall, “An experimental and analytical<br />

vibration study <strong>of</strong> thin cylindrical shells with and without longitudinal<br />

stiffeners,” NASA Langley Research Center, Tech. Rep.<br />

NASA-TN-D-4705, Sep 1968.<br />

[9] Y. Stavsky and R. Loewy, “On vibrations <strong>of</strong> heterogenous orthotropic<br />

cylindrical shells,” Journal <strong>of</strong> Sound and Vibration,<br />

vol.18,no.3,pp.i–i,1971.<br />

[10] P. L. Tímár, A. Fazecas, J. Kiss, A. Miklós, and S. J. Yang,<br />

Noise and Vibration <strong>of</strong> Electrical Machines. New York: Elsevier<br />

Science Publishing Company, Inc., 1989.<br />

[11] C. Wang and J. C. S. Lai, “Prediction <strong>of</strong> natural frequencies <strong>of</strong><br />

finite length circular cylindrical shells,” Applied Acoustics, vol. 59,<br />

no. 4, pp. 385 – 400, 2000.<br />

[12] K.-J. Bathe, Finite-Elemente-Methoden, 2nd ed. Berlin Heidelberg:<br />

Springer, 2002, aus dem Englichen übersetzt von Peter<br />

Zimmermann.<br />

[13] R. Gasch and K. Knothe, Strukturdynamik: Band 1: Diskrete<br />

Systeme. Springer, 1987.<br />

[14] W. Yixuan and W. Ying, “Research on dynamic characteristics<br />

<strong>of</strong> stator core <strong>of</strong> large turbo-generator,” in Power and Energy<br />

Engineering Conference (APPEEC), 2010 Asia-Pacific, march<br />

2010, pp. 1 –4.<br />

[15] H. Wang and K. Williams, “Effects <strong>of</strong> laminations on the vibrational<br />

behaviour <strong>of</strong> electrical machine stators,” Journal <strong>of</strong> Sound<br />

and Vibration, vol. 202, no. 5, pp. 703–715, May 1997.<br />

[16] M. van der Giet and K. Hameyer, “Identification <strong>of</strong> homogenized<br />

equivalent materials for the modal analysis <strong>of</strong> composite structures<br />

in electrical machines,” in the 9th International Conference on<br />

Vibrations in Rotating Machinery, VIRM 2008, Exceter, UK, 2008,<br />

pp. 437–448.<br />

[17] J. Roivainen, “Unit-wave response-based modeling <strong>of</strong> electromechanical<br />

noise and vibration <strong>of</strong> electromechanical machines,”<br />

Ph.D. dissertation, Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Helsinki,<br />

2009.<br />

[18] S. Verma and A. Balan, “Experimental investigations on the<br />

stators <strong>of</strong> electrical machines in relation to vibration and noise<br />

problems,” Electric Power Applications, IEE <strong>Proceedings</strong> -, vol.<br />

145, no. 5, pp. 455 –461, sep. 1998.<br />

[19] B. Peeters, H. Van der Auweraer, P. Guillaume, and J. Leuridan,<br />

“The polymax frequency-domain method: a new standard for<br />

modal parameter estimation?” Shock and Vibration, vol. 11, no. 3,<br />

pp. 395–409, Jan. 2004.<br />

- 153 - 15th IGTE Symposium 2012<br />

[20] D. J. Ewins, Modal Testing: Theory, Practice and Applications,<br />

2nd ed. Letchworth, Hertfordshire, UK: Research Studies Press,<br />

2000.


- 154 - 15th IGTE Symposium 2012<br />

Proper Location <strong>of</strong> the Regulating Coil in Transformers<br />

from Short-Circuit Forces Point <strong>of</strong> View<br />

*, O. Sonmez, * B. Duzgun, * G. Komurgoz<br />

* Istanbul Technical <strong>University</strong> Electrical and Electronics Faculty, 80626 Istanbul, Turkey<br />

Abstract—A transformer has complicated network <strong>of</strong> internal forces acting on and stressing the conductors, support and<br />

insulation structures. These forces are fundamental to the interaction <strong>of</strong> current-carrying conductors within magnetic fields<br />

involving an alternating-current source. Location <strong>of</strong> the regulating coil in transformer determines electrodynamic forces<br />

effect on the operational behavior <strong>of</strong> the transformer. This paper presents design principles <strong>of</strong> the regulating coil in<br />

transformers and shows the electrodynamics forces and their deformation results by using finite element method.<br />

Index Terms—Electrodynamic Forces, Deformation Analysis, FEA, Regulating Coil.<br />

I. INTRODUCTION<br />

The transformer is a very critical and costly important<br />

component in power generation and transmission systems<br />

as regarding reliable and performance. The capacity <strong>of</strong><br />

transformers is increasing with the rapid development. As<br />

the voltage level is higher, the time needed to design a<br />

transformer is <strong>of</strong> great importance. One <strong>of</strong> the important<br />

problems in the design <strong>of</strong> transformers is radial and axial<br />

forces, being proportional to the square <strong>of</strong> the short<br />

circuit current. By the interaction <strong>of</strong> leakage field and<br />

the short circuit current, which makes the windings be<br />

published or pulled, the huge short circuit force is<br />

generated in the windings (large power). The leakage<br />

flux not only causes the additional losses and forces, but<br />

also creates heating to the internal components. Short<br />

circuit current is 8 to 10 times the rated current in larger<br />

transformers and 20 to 25 times in smaller units. Forces<br />

arising during short-circuit may be as high as ten<br />

thousand to million N. By the effect <strong>of</strong> so large forces<br />

and thermal expansion <strong>of</strong> wires, the insulation <strong>of</strong><br />

transformer windings can be distorted, even collapsed,<br />

short circuit error occurs or damage to the clamping<br />

structures. Furthermore, the location <strong>of</strong> the tapings has<br />

the predominant effect on the axial forces since it<br />

controls the residual ampere turn. Failure <strong>of</strong> transformers<br />

due to short circuits is major concern for power utilities<br />

and manufactures. These hazards can be avoided by<br />

proper design <strong>of</strong> windings structure against thermal and<br />

mechanical strains to prevent permanent deformations<br />

and movement <strong>of</strong> windings if forces can be calculated<br />

correctly.<br />

In the past, many technical papers have been published<br />

which give equations for calculation the electromagnetic<br />

forces acting on the windings in transformers [1-9].<br />

Electromagnetic force computations methods have been<br />

proposed in the literature mainly based on static and<br />

transient formulations [10]. Classical methods can be<br />

used to compute the short circuit forces in windings [11].<br />

In these methods, it is used simplified configurations<br />

with some assumptions. Furthermore these methods are<br />

simple, fast and easy, but not accurate and not suitable<br />

for predicting the performance <strong>of</strong> special types <strong>of</strong><br />

Sönmez Transformer Company, 41410 Kocaeli, Turkey<br />

e-mail: komurgoz@itu.edu.tr<br />

transformers, especially the axial length <strong>of</strong> windings is<br />

not equal [12]. It is, however, obvious that by using<br />

modern computerized methods, sophisticated methods, it<br />

is possible to calculate forces acting on the elements <strong>of</strong><br />

winding, the effect <strong>of</strong> any arrangements <strong>of</strong> parts and<br />

asymmetries. If magnetic field is calculated accurately, it<br />

is possible to define electromagnetic forces in the<br />

detailed transformer model by using numerical methods,<br />

Finite Element Methods (FEM), Finite Difference<br />

Methods (FDM) and Boundary elements (BEM) etc. In<br />

recent years, a significant development <strong>of</strong> FEM s<strong>of</strong>tware<br />

has enabled the force calculation to be accomplished<br />

easily in where the winding and tapping arrangement is<br />

complex.<br />

This paper concentrates on the use <strong>of</strong> FEM to models.<br />

This method provides a comprehensive view <strong>of</strong> the<br />

overall transformer mechanic and electromagnetic<br />

behavior under normal and disturbance conditions. The<br />

effect <strong>of</strong> tap winding configurations is also analyzed. The<br />

results obtained from FEM <strong>of</strong> transformers using<br />

MAXWELL® and ANSYS® are validated by the<br />

mathematical models.<br />

II. FORCES ACTING ON THE TRANSFORMER<br />

When the electromagnetic force becomes greater than the<br />

strength <strong>of</strong> the windings, the windings will fail. The types<br />

<strong>of</strong> failure,Electromagnetic forces, acting on transformer<br />

can be classified as “radial forces” which develop in the x<br />

direction and “axial forces” develop in the y direction.<br />

For the calculation <strong>of</strong> these forces, both analytical and<br />

numerical methods are presented such as residual<br />

ampere-turn method, Robin’s solution, Smythe’s<br />

solution, calculation using Fourier series, two<br />

dimensional method <strong>of</strong> images, FEM, image method with<br />

discrete conductors etc. [13].<br />

Axial forces creates slipping or breakdown <strong>of</strong> windings as<br />

a whole standing-up <strong>of</strong> part <strong>of</strong> windings, tilting and<br />

deformation <strong>of</strong> coils. Radial forces creats buckling<br />

phenomena <strong>of</strong> inner windings, excessive elongation <strong>of</strong><br />

outer windings.<br />

A. Axial Forces<br />

One <strong>of</strong> the elementary and simplest methods, residual


ampere-turn method, gives closer approximations and<br />

reliable results for the calculation <strong>of</strong> axial forces.<br />

Concentric windings are separated into two groups and<br />

each group has balanced ampere-turns. The radial<br />

ampere-turns produce radial flux which causes axial<br />

force in the windings as it seen in Figure 1. This<br />

assumption allows calculation <strong>of</strong> the axial forces.<br />

Figure 1: Axial and radial forces in concentric axially nonsymmetrical<br />

windings [13].<br />

The algebraic sum <strong>of</strong> the ampere-turns <strong>of</strong> low voltage<br />

and high voltage windings at any point and at end <strong>of</strong> the<br />

windings gives the radial ampere-turns at that point in the<br />

winding. A curve is plotted for every points called<br />

residual or unbalanced ampere-turn diagram which the<br />

method derives its name [12]. It is clear that windings<br />

without axial displacement and windings have the same<br />

length have no residual ampere-turns or forces between<br />

windings. However, there are some internal compressive<br />

forces and forces on the end coils, although there is no<br />

axial thrust between windings.<br />

Figure 2 gives the methodology for the determining<br />

distribution <strong>of</strong> radial ampere-turns. ‘a’ is the length<br />

tapped out at the end <strong>of</strong> the outer windings. Summation<br />

<strong>of</strong> I and II shown in Figure 2(b) are both balanced<br />

ampere-turn groups. If these groups are superimposed,<br />

they produce the given ampere-turn arrangement. The<br />

triangle as shown in Figure 2(c) presents the diagram <strong>of</strong><br />

the radial-ampere turns. This diagram plotted against<br />

distance along the winding. a(NImax) is the maximum<br />

value, where (NImax) represents the ampere-turns <strong>of</strong> either<br />

the low voltage or high voltage winding.<br />

Figure 2: Determination <strong>of</strong> residual ampere-turns [12].<br />

- 155 - 15th IGTE Symposium 2012<br />

Tapings location on the winding has a great effect on the<br />

axial forces since it controls the residual ampere-turn<br />

diagram.<br />

B. Radial Forces<br />

The radial forces develop due to interaction <strong>of</strong> coil<br />

currents with the axial component <strong>of</strong> its own magnetic<br />

flux. In a transformer with concentric windings, radial<br />

forces considered insignificant because, the radial<br />

strength <strong>of</strong> the winding is high. Most problems occur<br />

because <strong>of</strong> axial forces and axial movement results more<br />

damage to the winding and insulation than radial<br />

movements.<br />

The inner coil is subjected a pressure tends to collapse<br />

to the core. At the same time, the outer coil is under a<br />

pressure to extend the diameter <strong>of</strong> the coil which<br />

produces a stress as shown in Figure 1. Preferable choice<br />

in a transformer is circular coils, because they are the<br />

strongest shape to withstand the radial pressure<br />

mechanically [14].<br />

III. CALCULATION OF ELECTROMAGNETIC FORCES<br />

A. Short-Circuit Current<br />

Short-circuit currents on the windings have a<br />

significant effect on calculation <strong>of</strong> electromagnetic<br />

forces. Generally, the short-circuit current is calculated<br />

for different situations by considering [15];<br />

Tapping arrangement<br />

Fault position<br />

Short-circuit power combination (network and<br />

transformer)<br />

Short-circuit type (e.g. three phase symmetrical)<br />

To see the effects <strong>of</strong> the short-circuit current on power<br />

transformers, the simplest fault scenario, three phase<br />

short-circuit scenario is investigated. Symmetrical shortcircuit<br />

current can be calculated according IEC 60076-5<br />

as [16];<br />

I <br />

U<br />

Z Z <br />

<br />

3 t s<br />

9 And the amplitude<br />

is;<br />

Imax <strong>of</strong> the first peak <strong>of</strong> the current<br />

I Ik 2 10<br />

max<br />

<br />

The factor k is the initial <strong>of</strong>fset <strong>of</strong> the current and<br />

2 stands for the peak to r.m.s. value <strong>of</strong> sinusoidal wave.<br />

This k 2 factor depends on the X/R ratio and the<br />

values <strong>of</strong> k are shown in standards IEC 60076-5 [16].<br />

This current is based on the following expression for<br />

the peak factor;<br />

R/ X<br />

2 <br />

k 2 1<br />

<br />

e<br />

<br />

sin<br />

2 11


Y1 [kA]<br />

25.00<br />

12.50<br />

0.00<br />

-12.50<br />

Curve Inf o<br />

InputCurrent(Winding_LV_A)<br />

Setup1 : Transient<br />

InputCurrent(Winding_LV_B)<br />

Setup1 : Transient<br />

InputCurrent(Winding_LV_C)<br />

Setup1 : Transient<br />

Name X Y<br />

Phase C_sc 131.5000 21.6972<br />

Phase A_sc 118.5000 21.6972<br />

Phase B_sc 105.0000 21.7270<br />

Input Current LV Model2D_coils ANSOFT<br />

Phase B_sc<br />

Phase A_sc<br />

Phase C_sc<br />

-25.00<br />

75.00 87.50 100.00<br />

Time [ms]<br />

112.50 125.00 135.00<br />

Figure 3: Input currents <strong>of</strong> low voltage windings.<br />

The given short-circuit has two components as steady<br />

state and exponentially unidirectional component. In<br />

Figure 3, applied steady-state and short-circuit currents<br />

on the windings <strong>of</strong> the power transformer in Maxwell<br />

s<strong>of</strong>tware is shown. The exponentially unidirectional<br />

component is ignored to make calculations simpler.<br />

B. Electromagnetic Forces<br />

Transient analysis allows calculating electromagnetic<br />

forces for every time step by calculating the leakage flux<br />

and full field in winding region. Fully coupled dynamic<br />

physics solution is;<br />

A<br />

AJs V Hc vA t<br />

The differential equation and the boundary conditions<br />

<strong>of</strong> transient axial symmetric electromagnetic field can be<br />

expressed in the cylindrical coordinate as;<br />

- 156 - 15th IGTE Symposium 2012<br />

12 rA rA rA <br />

<br />

<br />

: v' Z Z <br />

v' r r <br />

Js <br />

<br />

'<br />

t<br />

13<br />

S1: rA rA0<br />

14 rA <br />

S2: v' Ht<br />

n<br />

15 For 2D analysis, the radial and axial components <strong>of</strong> the<br />

magnetic flux density can be expressed as;<br />

A<br />

Br<br />

<br />

z<br />

B<br />

0<br />

16 1 rA<br />

Bz<br />

<br />

r r<br />

17 When the magnetic flux density is decomposed into its<br />

radial and axial components;<br />

<br />

F J ˆ B rˆB zˆ d F rˆF zˆ<br />

18<br />

<br />

<br />

<br />

<br />

r z r z<br />

In brief, the force on the power transformer is<br />

expressed by the Lorentz force as<br />

<br />

dF idlB And the radial force <strong>of</strong> unit length<br />

F B I dl<br />

x y<br />

max<br />

The axial force <strong>of</strong> unit length<br />

F B I dl<br />

y x<br />

max<br />

19 20 21 IV. RESULTS &DISCUSSIONS<br />

A. Model<br />

Electrical machines require an accurate mathematical<br />

model for system simulation and performance evaluation.<br />

Detailed knowledge <strong>of</strong> the flux distribution <strong>of</strong> a<br />

transformer plays a very important role in a safe<br />

estimation <strong>of</strong> the forces <strong>of</strong> the transformer. Complex<br />

computer programs are required to obtain a reasonable<br />

representation <strong>of</strong> the field in different parts <strong>of</strong> the<br />

windings. Using the above models for determinate forces,<br />

a numerical application (FEM) has been implemented for<br />

a 25 MVA power transformer. 3-D model <strong>of</strong> the general<br />

structure is shown in Figure 4. To reduce computing time<br />

and avoid excessive use <strong>of</strong> ram, the insulating materials<br />

and supporting structure are neglected, besides analyses<br />

were done in 2-D structure.<br />

Figure 4: 3-D model <strong>of</strong> analyzed power transformer.<br />

The characteristics <strong>of</strong> the studied transformer are<br />

presented in Table I and geometry details <strong>of</strong> the analyzed<br />

transformer are shown in Figure 5.<br />

TABLE I<br />

TRANSFORMER DATA<br />

Rated Power 25 [MVA]<br />

Rated Frequency 50 [Hz]<br />

Rated Voltages 120 / 11 [kV]<br />

Rated Currents 120 / 1310 [A]<br />

Turns Ratio 1000 / 159<br />

Connection Yd11<br />

Tap setting ± 15 %<br />

Transformer short circuit voltage (%) 9<br />

Figure 5: Geometry details <strong>of</strong> analyzed transformer.


Figure 6: 2-D model <strong>of</strong> analyzed transformer tapped at upper side.<br />

B. Electromagnetic Results<br />

Transformers require an accurate mathematical model<br />

for system simulation and performance evaluation. In this<br />

study, magnetic analysis <strong>of</strong> the designed machines has<br />

been investigated using Maxwell 2D program and total<br />

deformations have been investigated using ANSYS®<br />

program (Figure 6). The simulations were completed<br />

using the following steps;<br />

1) Geometric model creation,<br />

2) The appointment <strong>of</strong> the materials that make up<br />

the structure <strong>of</strong> the machine,<br />

3) Boundary conditions and mesh process,<br />

4) The appointment <strong>of</strong> currents in windings,<br />

5) Analyze,<br />

6) Examination <strong>of</strong> the results.<br />

In Figure 7 and 8 leakage flux distributions are shown<br />

for +15% tapping position and -15% tapping position. As<br />

the leakage flux increases, electromagnetic forces are<br />

occurring rapidly.<br />

Figure 7: Leakage flux distribution at +15% tapping position <strong>of</strong> HV<br />

windings.<br />

Figure 8: Leakage flux distribution at -15% tapping position <strong>of</strong> HV<br />

windings.<br />

- 157 - 15th IGTE Symposium 2012<br />

The graphs <strong>of</strong> distribution <strong>of</strong> radial magnetic flux<br />

density along the transformer window are shown in<br />

Figure 9 and 10 for +15% tapping, -15% tapping at upper<br />

part <strong>of</strong> HV windings, respectively.<br />

Mag_B [tesla]<br />

1.50<br />

1.25<br />

1.00<br />

0.75<br />

0.50<br />

0.25<br />

Axial Flux Density Distribution Model2D_coils ANSOFT<br />

0.00<br />

0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75<br />

Distance [meter]<br />

Curve Inf o<br />

Mag_B<br />

Setup1 : Transient<br />

Time='115000000ns'<br />

Figure 9: Axial flux distribution at +15% tapping position <strong>of</strong> HV<br />

windings.<br />

Figure 9 shows axial flux distribution with respect to<br />

height <strong>of</strong> the winding for at +15% tapping position <strong>of</strong> HV<br />

windings. To determine the axial forces, it is necessary to<br />

find the radial flux produced by the radial ampere-turns.<br />

As seen from figure, axial flux density is approximately<br />

constant along the winding due to symmetrical windings<br />

(with fully balanced ampere-turns)<br />

Mag_B [tesla]<br />

3.50<br />

3.00<br />

2.50<br />

2.00<br />

1.50<br />

1.00<br />

0.50<br />

Axial Flux Density Distribution Model2D_coils ANSOFT<br />

0.00<br />

0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75<br />

Distance [meter]<br />

Figure 10: Axial flux distribution at -15% tapping position <strong>of</strong> HV<br />

windings.<br />

Curve Inf o<br />

Mag_B<br />

Setup1 : Transient<br />

Time='115000000ns'<br />

If there is an asymmetry in the winding heights due to the<br />

tap position or for some other reasons such as failure,<br />

flux distribution changes as shown in Figure 10. Flux<br />

density distribution makes maximum in one place along<br />

the height <strong>of</strong> the winding.<br />

The electromagnetic forces in the winding <strong>of</strong> the<br />

power transformer are calculated with the leakage flux<br />

and transient currents. The radial and axial forces <strong>of</strong> each<br />

conductor coil in the HV windings are given in Figure<br />

11-14. Figure 11 and 12 shows radial and axial forces at<br />

+15% tapping position <strong>of</strong> HV windings.<br />

Radial Forces<br />

x 105<br />

3.5<br />

3<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

Radial Forces on the HV Coils for +15% tapping at 118.5 ms<br />

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />

Coil Numbers<br />

Figure 11: Radial Forces at +15% tapping position <strong>of</strong> HV windings.


Axial Forces<br />

x 104<br />

8<br />

6<br />

4<br />

2<br />

0<br />

-2<br />

-4<br />

-6<br />

-8<br />

Axial Forces on the HV Coils for +15% tapping at 118.5 ms<br />

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />

Coil Numbers<br />

Figure 12: Axial Forces at +15% tapping position <strong>of</strong> HV windings.<br />

Axial and radial forces in windings when the windings<br />

are axially non-symmetrical are calculated as given in<br />

Figure 13 and 14.<br />

Due to the symmetry <strong>of</strong> winding and regular<br />

distribution <strong>of</strong> flux, forces values are smaller than<br />

asymetrical winding arrangement. If there is an<br />

asymmetry in the winding heights due to the tap position<br />

(or for some other reasons), the ampere-turn unbalance<br />

increases and gives rise to forces, and result <strong>of</strong> this,<br />

tending to break the winding.<br />

Radial Forces<br />

x 105<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

Radial Forces on the HV Coils for -15% tapping at 118.5 ms<br />

1 2 3 4 5 6 7 8 9 10 11 12<br />

Coil Numbers<br />

Figure 13: Radial Forces at %-+15 tapping position <strong>of</strong> HV windings.<br />

Axial Forces<br />

x 105<br />

1<br />

0<br />

-1<br />

-2<br />

-3<br />

-4<br />

-5<br />

-6<br />

-7<br />

-8<br />

-9<br />

Axial Forces on the HV Coils for -15% tapping at 118.5 ms<br />

1 2 3 4 5 6 7 8 9 10 11 12<br />

Coil Numbers<br />

Figure 14: Axial Forces at -15% tapping position <strong>of</strong> HV windings.<br />

The total body force density and total deformation are<br />

determined by using ANSYS program and shown in<br />

Figure 15-18. Figure 15 and 17 shows the effect <strong>of</strong> forces<br />

on winding at +15% tapping position <strong>of</strong> HV windings.<br />

- 158 - 15th IGTE Symposium 2012<br />

Figure 15: Total body force density at -15% tapping position <strong>of</strong> HV<br />

windings (115 ms)<br />

Figure 16: Total body force density at -15% tapping position <strong>of</strong> HV<br />

windings (115 ms)<br />

Deformations in windings when the windings are<br />

axially non-symmetrical are obtained as given in Figure<br />

16 and18. Total deformation depends on the tap position<br />

and at +15% tapping position <strong>of</strong> HV windings, they are<br />

bigger than which at -15% tapping position <strong>of</strong> HV<br />

windings. The location <strong>of</strong> forces shifts to the upper side<br />

<strong>of</strong> the winding.<br />

Figure 17: Total deformations at +15% tapping position <strong>of</strong> HV windings<br />

(115 ms).<br />

V. CONCLUSION<br />

In this paper, leakage magnetic field and electrodynamic<br />

force <strong>of</strong> the 25 MVA power transformers were analyzed<br />

under short circuit conditon <strong>of</strong> the low voltage windings<br />

<strong>of</strong> the transformer by using ANSYS® and MAXWELL®


Figure 18 Total deformations at -15% tapping position <strong>of</strong> HV windings<br />

(115 ms).<br />

based on the FEM.Two different conditions when the<br />

power transformer is under mximum tap are analyzed.<br />

The location <strong>of</strong> the regulating coil is changed.<br />

Afterwards, deformation result is showed by using<br />

calculated force values. Undesirable stresses values can<br />

be prevented on the transformers by making appropriate<br />

coil arrangements. The insertation <strong>of</strong> tap sections in the<br />

windings, which produces asymetries between LV and<br />

HV windings, tends to cause an inrease <strong>of</strong> radial and<br />

axial forces annd then damages in transformers. The<br />

method <strong>of</strong> calculation <strong>of</strong>feres a reference to the design <strong>of</strong><br />

transformer.<br />

LIST OF PRINCIPLES SYMBOLS<br />

a fractional difference in winding heights<br />

A magnetic vector potential<br />

Br, B , Bz components <strong>of</strong> the flux density <strong>of</strong> (in Tesla)<br />

dl<br />

<br />

F<br />

unit length <strong>of</strong> wire<br />

force<br />

h axial height <strong>of</strong> the winding<br />

Hc coercive magnetic field strength <strong>of</strong> the PM<br />

Ht tangential component <strong>of</strong> magnetic intensity<br />

Imax maximum current<br />

Js current source density<br />

J - directional short-circuit current density<br />

ˆr , ˆ and ˆz unit vectors in cylindrical coordinate<br />

S1 parallel boundary condition<br />

S2 vertical boundary condition<br />

U rated voltage<br />

v velocity<br />

V electric scalar potential<br />

Zs short-circuit impedance <strong>of</strong> the system<br />

Zt short-circuit impedance <strong>of</strong> the transformer<br />

phase angle<br />

conductivity<br />

studied domain<br />

v ' reluctivity<br />

' conductance<br />

REFERENCES<br />

[1] K. Karsai, D. Kerenyi, and L. Kiss, "Large Power Transformers,"<br />

Elsevier Science, December 1987.<br />

[2] L. Jiao, "The calculation <strong>of</strong> ampere force on electric power<br />

transformer under the short circuit situation," Electrical Machines<br />

- 159 - 15th IGTE Symposium 2012<br />

And Systems, 2008. ICEMS 2008. International Conference On,<br />

17-20 October 2008.<br />

[3] V. Behjat, A. Vahedi, A. Setayeshmehr, and et all. "Identification<br />

<strong>of</strong> the most sensitive frequency response measurement technique<br />

for diagnosis <strong>of</strong> interturn faults in power transformers,"<br />

Measurement Science And <strong>Technology</strong>, volume 21, No. 7, 2010.<br />

[4] L. Jiao, B. Bai, and H. Li, "The calculation <strong>of</strong> ampere force on<br />

electric power transformer under the short circuit situation,"<br />

International conference on electrical machines and systems, pp.<br />

4423-4426, October 2008.<br />

[5] A. C. de Azevedo, I. Rezende, and A. C. Delaiba, "Investigation <strong>of</strong><br />

transformer electromagnetic forces caused by external faults using<br />

FEM," Transmission & distribution conference and exposition:<br />

Latin America, August 2006.<br />

[6] K. Kurita, T. Kuriyama, K. Hiraishi, and et all. "Mechanical<br />

Strength <strong>of</strong> Transformer Windings under Short-Circuit<br />

Conditions," IEEE Transactions on Power Apparatus and Systems,<br />

volume 88, No. 3, March 1969.<br />

[7] K. Hiraishi, Y. Hori and S. Shida, "Mechanical Strength Of<br />

Transformer Windings under Short-Circuit Conditions," IEEE<br />

Transactions on Power Apparatus and Systems, pages 2381-2390,<br />

October 1971.<br />

[8] R. E. Ayres, G. O. Usry, M R Patel, and et all. "Dynamic<br />

Measurement during Short Circuit Testing <strong>of</strong> Transformers, Part<br />

2- Test Results And Analysis," IEEE Transactions on Power<br />

Apparatus and Systems, volume PAS-94, pages 198-206,<br />

March/April 1975.<br />

[9] M. Heathcote and D.P. Franklin, "The J&P Transformer Book"<br />

Reed Educational and Pr<strong>of</strong>essional Publishing Ltd, England, 12th<br />

edition, 1998.<br />

[10] G.B. , Kumbhar, and S.V. Kulkarni, "Analysis <strong>of</strong> short-circuit<br />

performance <strong>of</strong> split-winding transformer using coupled fieldcircuit<br />

approach," Power Delivery, IEEE Transactions On, Issue:<br />

2, pages 936-943, April 2007.<br />

[11] M. Waters, "The Measurement and Calculation <strong>of</strong> Axial<br />

Electromagnetic Forces in Concentric Transformer Windings,"<br />

<strong>Proceedings</strong> <strong>of</strong> the IEE - Part II: Power Engineering, volume 101,<br />

pages 35-46, February 1954.<br />

[12] M. F. Beavers and C. M. Adams, "The Calculations and<br />

Measurement <strong>of</strong> Axial Electromagnetic Forces on Concentric<br />

Coils in Transformers," Power Apparatus and Systems, Part III.<br />

Transactions <strong>of</strong> the American Institute <strong>of</strong> Electrical Engineers,<br />

volume 78, pages 467-477, August 1959.<br />

[13] M. S. A. Minhas, "Dynamic Behaviour <strong>of</strong> Transformer Winding<br />

under Short-Circuits," Ph.D. Thesis, <strong>University</strong> <strong>of</strong> the<br />

Witwatersrand, Johannesburg, November 2007.<br />

[14] M. G. Say, "The Performance and Design <strong>of</strong> Alternating Current<br />

Machines," Sir Issak Pitman & Sons Ltd, London, 3rd edition,<br />

1958.<br />

[15] N. Mahomed, "Electromagnetic Forces in Transformers under<br />

Short-Circuit Conditions," Energize Online, pp. 36-40, March<br />

2011.<br />

[16] IEC Standard 60076-5: Power Transformers-Part 5: "Ability to<br />

withstand short circuit”, 2006.


- 160 - 15th IGTE Symposium 2012<br />

Robust Design <strong>of</strong> IPM motors using<br />

Co-Evolutionary Algorithms<br />

*Min Li, † André S. Ruela, † Frederico G. Guimarães, † Jaime A. Ramírez and *David A. Lowther<br />

*McGill <strong>University</strong>, 3480 <strong>University</strong>, H3A 2K6 Montreal, Canada<br />

† Federal <strong>University</strong> <strong>of</strong> Minas Gerais, Belo Horizonte, MG 31270-010, Brazil<br />

E-mail: *david.lowther@mcgill.ca, † jramirez@ufmg.br<br />

Abstract—A robust design formulation is developed considering the minimization <strong>of</strong> the torque ripples <strong>of</strong> an interior<br />

permanent magnet (IPM) machine in the presence <strong>of</strong> uncertainties in the values <strong>of</strong> the design variables. This optimization<br />

problem is first solved using the worst vertex prediction and a deterministic search. In addition, a competitive co-evolutionary<br />

algorithm is applied to the minimax optimization problem to find a robust solution, in which one population evolves values for<br />

the design variables and the other one evolves values in the uncertainty set. Through a worst case analysis, the result from the<br />

co-evolutionary algorithm is proven to have a more robust performance than that <strong>of</strong> the non-robust optimization if<br />

manufacturing tolerance is taken into account. The computation time <strong>of</strong> the co-evolutionary algorithm may be largely<br />

reduced through the use <strong>of</strong> parallel computing environments.<br />

Index Terms—Evolutionary algorithms, IPM motor, Minimax optimization, Robust design.<br />

I. INTRODUCTION<br />

In recent years, interior permanent magnet (IPM)<br />

motors have become popular for many applications that<br />

require variable speed and torque. As an alternative to the<br />

traditional induction motor, IPM motors have the<br />

advantages <strong>of</strong> higher efficiencies and lower noise. The<br />

design <strong>of</strong> an IPM motor is a complicated task that<br />

involves the consideration <strong>of</strong> many different aspects,<br />

such as the size and the weight <strong>of</strong> the machine, the<br />

desired output torque, the cost <strong>of</strong> the permanent magnet,<br />

etc. In this paper, we focus on the robust design <strong>of</strong> IPM<br />

machines for which the objective is the reduction <strong>of</strong><br />

vibrations and noise <strong>of</strong> the device caused by errors in<br />

manufacturing, in order to improve the quality and to<br />

extend the life <strong>of</strong> product.<br />

The idea <strong>of</strong> robust design was introduced to electrical<br />

machine design over two decades ago. Robustness is<br />

<strong>of</strong>ten defined in terms <strong>of</strong> the performance <strong>of</strong> the device<br />

being less sensitive to manufacturing errors and<br />

variations <strong>of</strong> the operation conditions. Dr. Taguchi, with<br />

his statistical based methods, is considered as one <strong>of</strong> the<br />

pioneers <strong>of</strong> engineering robust design as he developed<br />

the foundations <strong>of</strong> robust design to meet the challenges <strong>of</strong><br />

producing high-quality products. A Taguchi-based<br />

optimization method has been applied to the design <strong>of</strong><br />

brushless DC motors in [1], where the signal-to-noise<br />

ratio was used to estimate the robustness <strong>of</strong> the product.<br />

A robust shape optimization was applied by Yoon to the<br />

design <strong>of</strong> electromagnetic devices in [2], where the mean<br />

and the standard deviation <strong>of</strong> the performance were<br />

treated as multi-objectives for the design problem. This<br />

paper also employed a sensitivity based approach to<br />

compute the approximation <strong>of</strong> the standard deviation and<br />

took feasibility robustness into account. Another useful<br />

formulation <strong>of</strong> robust design is to apply the worst case<br />

analysis and to optimize the worst performance <strong>of</strong> the<br />

objective function in the presence <strong>of</strong> uncertainties [3] [4].<br />

The robustness measure is integrated into the<br />

optimization process by using a robust target function<br />

defined on the uncertainty set <strong>of</strong> the design variables; and<br />

the vertices <strong>of</strong> the uncertainty set were used to predict the<br />

worst value <strong>of</strong> the objective function. Several different<br />

robust design formulations were reviewed and discussed<br />

in [5] and the authors proposed that the standard<br />

deviation can be approximated using the difference<br />

between the worst performance and the nominal<br />

performance and the computation cost could be largely<br />

reduced. In the recent development <strong>of</strong> sensitivity based<br />

robust design optimization, the authors defined a gradient<br />

index (GI) using the sensitivity <strong>of</strong> the performance<br />

function with respect to some critical uncertainty<br />

variables [6]. This simple and efficient algorithm was<br />

illustrated with an example <strong>of</strong> MEMS devices where<br />

robustness is crucial for high yield rate but information<br />

on uncertainties is hard to obtain. This gradient index<br />

based robust design method was also tested with the<br />

TEAM workshop problem 22 in [7]. The worst case<br />

analysis and the robust target function have also been<br />

applied to topological design problems [8], where a<br />

robust topological gradient (TG) was used to evaluate the<br />

robustness for a certain topological design.<br />

In addition to deterministic optimization approaches,<br />

the worst case analysis based robust design problems (i.e.<br />

minimax optimization) can also be solved using genetic<br />

algorithms and evolutionary algorithms [9] [10]. In<br />

particular, co-evolutionary algorithms have been used to<br />

solve constrained optimization problems formulated as<br />

the minimax optimization problem [11, 12]. In this case,<br />

one population evolves solutions for the problem while<br />

the second one evolves the terms for the Lagrange<br />

penalty function. Co-evolutionary methods have been<br />

applied to robust design as well [13, 14]. In [13] a coevolutionary<br />

algorithm is used to design a robust<br />

nonlinear control under uncertainties. In [14], the authors<br />

have also reviewed a few different formulations <strong>of</strong><br />

competitive co-evolutionary genetic algorithms.<br />

In this paper, a robust design problem is defined for<br />

minimizing the torque ripples <strong>of</strong> an IPM motor while<br />

considering the uncertainties in the design variables. Two<br />

different approaches based on the worst case scenario are<br />

being considered. The first one employs a deterministic<br />

search and uses the computed sensitivity information to


predict the worst performance <strong>of</strong> the design. The second<br />

algorithm is based on a competitive co-evolutionary<br />

strategy. It introduces a competition between two<br />

populations, one evolves values for the design variables<br />

and the other one evolves values for the uncertainty<br />

variables. Details <strong>of</strong> the two approaches are presented in<br />

the second section <strong>of</strong> the paper and the results from the<br />

robust design are tested against a non-robust design in<br />

section III. In the last section, the performance and the<br />

limitations <strong>of</strong> the two robust design approaches are<br />

discussed.<br />

II. ROBUST TOPOLOGY OPTIMIZATION<br />

A. Robust Design formulation<br />

A practical way to treat the robust design problem is to<br />

use the worst case scenario. A robust objective function<br />

can be defined as:<br />

min max f ( ) , (1)<br />

x U<br />

( x)<br />

where f(x) is the nonrobust objective function and U(x) is<br />

an uncertainty set containing all the possible variations <strong>of</strong><br />

the design variable x,<br />

n<br />

U ( x)<br />

{ R : ( 1<br />

i<br />

) xi<br />

( 1<br />

i<br />

) xi}<br />

. (2)<br />

Then the worst performance <strong>of</strong> the objective function f<br />

can be approximated using the value <strong>of</strong> f evaluated at one<br />

<strong>of</strong> the vertices <strong>of</strong> U, i.e. the worst vertex.<br />

max<br />

U<br />

( x)<br />

f ( ) f ( x<br />

pred<br />

Nevertheless, in a constrained optimization problem, if<br />

a nominal optimal solution x* is located close to the<br />

boundary <strong>of</strong> the feasible region, which may happen in<br />

some cases, some perturbed solutions, due to the<br />

variations <strong>of</strong> the design variables, will no longer be<br />

feasible. In such a situation, the robust solution must be<br />

placed away from the boundary <strong>of</strong> the feasible region to<br />

make sure that the entire uncertainty set <strong>of</strong> x* stays in the<br />

feasible region. To ensure feasibility robustness, a robust<br />

constraint function is defined as:<br />

max<br />

U<br />

( x )<br />

)<br />

- 161 - 15th IGTE Symposium 2012<br />

(3)<br />

g ( ) 0 , (4)<br />

where gi(x) are the original constraints for the problem.<br />

Finally, a robust design formulation using a robust<br />

objective function and a robust constraint function is<br />

given as:<br />

min<br />

x<br />

max<br />

U<br />

( x )<br />

L<br />

i<br />

f ( )<br />

max g i ( ) 0 , (5)<br />

s.<br />

t.<br />

U<br />

( x )<br />

X x X<br />

U<br />

B. Topological gradient<br />

Applications <strong>of</strong> topological shape optimization to the<br />

design <strong>of</strong> electromagnetic devices are relatively new [15]<br />

[16]. Unlike the classical shape optimization for which<br />

only the size and boundary <strong>of</strong> the design object is<br />

allowed to vary, in a topological shape design process,<br />

the topology <strong>of</strong> the domain can change as well, for<br />

instance, by drilling an air hole in the domain or filling<br />

this hole with a different material than the rest <strong>of</strong> the<br />

domain.<br />

The topological gradient (TG) is defined as the<br />

derivative <strong>of</strong> an objective function with respect to an<br />

infinitely small hole Q as:<br />

obj<br />

( \ Q(<br />

x,<br />

r))<br />

obj<br />

TG ( x)<br />

lim<br />

. (6)<br />

r<br />

0 ( )<br />

where is an arbitrary objective function, x is the center<br />

<strong>of</strong> the hole Q, r is the radius <strong>of</strong> Q, \Q(x,r) is the new<br />

topology after the small hole is present and () is the<br />

volume change <strong>of</strong> the domain , which is the volume <strong>of</strong><br />

Q but with a negative sign. Thus a positive value <strong>of</strong><br />

TG(x) means a negative change <strong>of</strong> the values <strong>of</strong> the<br />

objective function after the small hole Q is created.<br />

Hence the topological gradient can provide information<br />

on whether a topology change (creating a small hole in<br />

the system) will result in a decrease <strong>of</strong> the objective<br />

function.<br />

Now we can define a robust objective function based<br />

on the worst performance <strong>of</strong> a non-robust function J due<br />

to the perturbation <strong>of</strong> the design variable x as:<br />

f<br />

w<br />

J ( ) , (7)<br />

max<br />

U<br />

( x )<br />

and U(x) is the uncertainty set similar to (2), while in a<br />

topological design using TG, the design variables are the<br />

three dimensional coordinates, x, <strong>of</strong> the center <strong>of</strong> the<br />

potential topology change. Hence n = 3 and the vector <br />

= {1 2 3} represents the largest variation to the<br />

nominal value <strong>of</strong> x <strong>of</strong> the three dimensional coordinates.<br />

This robust objective function fw can be easily estimated<br />

using the worst vertex <strong>of</strong> the rectangular uncertainty set<br />

U.<br />

If a topology change is taking place in the design<br />

domain , the scalar objective function J can be<br />

approximated near the point using a first order local<br />

expansion, as:<br />

2<br />

J ( ) (<br />

\ Q(<br />

, r))<br />

(<br />

)<br />

TG(<br />

) <br />

( r)<br />

o(<br />

r ) .(8)<br />

Since () and (r) are both constants with respect to<br />

(note that (r) is the volume <strong>of</strong> Q with a negative sign),<br />

J() has the largest value where TG() is the smallest.<br />

Therefore, the worst performance <strong>of</strong> J due to the<br />

perturbation <strong>of</strong> the design variables x is determined by<br />

the point in the uncertainty set, where TG() has the<br />

smallest value. Hence we can obtain a robust topological<br />

gradient as:


TG<br />

R<br />

( x)<br />

min TG ( ) . (9)<br />

U<br />

( x )<br />

Figure 1 is used to illustrate the robustness <strong>of</strong> a<br />

topology. There exist two areas for a potential<br />

topological change in the design domain . However, the<br />

first area, which has the highest TG value, is close to a<br />

large area which has the lowest TG value, i.e. the TG<br />

value drops drastically in the neighborhood <strong>of</strong> the first<br />

area. Therefore the second area, which has the second<br />

largest TG values, is superior to the first one for a<br />

topological change in terms <strong>of</strong> the topological robustness.<br />

This is, in fact, equivalent to the robust topological<br />

design using second-order sensitivity analysis.<br />

Figure 1. Robustness <strong>of</strong> topology<br />

C. Worst vertex prediction using sensitivity<br />

In robust topology optimization, first, we use the<br />

robust TG to determine the topology change in the<br />

problem domain. After we “drilled” a hole in the system,<br />

the boundary <strong>of</strong> the hole is parameterized and is<br />

optimized using a shape optimizer. Thus a new<br />

uncertainty set is defined for the new design variables,<br />

which are the coordinates <strong>of</strong> the controlling points on the<br />

boundary <strong>of</strong> the hole. However, the robust objective<br />

function remains the same through the entire design<br />

process.<br />

In order to find the worst vertex, we can use the<br />

information <strong>of</strong> the gradient computed at the point x. The<br />

following figure gives an example <strong>of</strong> the worst vertex<br />

prediction in R 2 , where the opposite direction <strong>of</strong> the<br />

gradient <strong>of</strong> the objective function points to the worst<br />

vertex <strong>of</strong> the uncertainty set.<br />

- 162 - 15th IGTE Symposium 2012<br />

Figure 2. Worst vertex prediction using gradient<br />

D. Algorithm<br />

Finally, an algorithm for robust topology optimization<br />

based on topological shape optimization is described as<br />

follows:<br />

1. Set the iteration number k =0.<br />

2. Calculate the robust topological gradient TGR at the<br />

center <strong>of</strong> each element.<br />

3. Define the new domain k where the topology<br />

changes take place by removing the material in the<br />

elements where TGR is greater than zero.<br />

4. Apply standard shape optimization method with a<br />

robust objective function to determine the shape <strong>of</strong> the<br />

boundary.<br />

5. Check convergence and exit if the optimality<br />

condition is satisfied.<br />

6. Set k=k+1 and go to 2.<br />

Note that uncertainties related to both topology and<br />

shape are being handled throughout the entire design<br />

process.<br />

III. COMPETITIVE CO-EVOLUTIONARY ALGORITHMS<br />

Co-evolutionary algorithms are suitable for solving<br />

minimax optimization problems. The robust design<br />

formulation based on the worst case analysis defined in<br />

(1) can be generalized as<br />

min max f ( x,<br />

u)<br />

, (10)<br />

xX uU<br />

where f(·,·) is an objective or fitness function, x is a<br />

vector <strong>of</strong> the design variables and u is a vector <strong>of</strong> the<br />

uncertainty variables. Equation (1) is a special case <strong>of</strong><br />

(10) where = x + u. This formulation presents a<br />

competitive relationship between the two players, where<br />

the leader selects a value in X and the follower chooses a<br />

value in U in correspondence.<br />

In a worst case analysis based robust design, the<br />

system seeks for the best design under its worst case<br />

scenario. Thus it is possible to decompose this design<br />

process into two tasks, to find the design with the best<br />

performance and to find the worst performance <strong>of</strong> a


design subject to small variations <strong>of</strong> the design<br />

parameters. Similarly, in a competitive co-evolutionary<br />

algorithm, one population competes with the other<br />

leading to an “arms race”. The first population (denoted<br />

as population A in the rest <strong>of</strong> the paper) represents<br />

candidate solutions in the design space, which are<br />

evolving to minimize the objective; while the second<br />

population (population B) represents disturbances in U,<br />

an uncertainty set applied to the design variables in order<br />

to maximize the objective function. In other words, the<br />

former population provides a solution and the second<br />

population tries to attack the solution in the worst case<br />

scenario. Through the evolutions <strong>of</strong> the two populations,<br />

a more robust solution, i.e. with the best possible worst<br />

case performance, can be found after several generations.<br />

Therefore, this competitive model will guide the<br />

evolution towards robust solutions.<br />

A. Fitness computation<br />

In co-evolutionary algorithms, the two populations, A<br />

and B, evolve independently, but the fitness evaluations<br />

<strong>of</strong> the populations are related to each other. The fitness <strong>of</strong><br />

an individual in one population is evaluated against<br />

values <strong>of</strong> the individuals from the other population. For<br />

instance, the fitness <strong>of</strong> an individual in population A is<br />

defined as,<br />

F( x)<br />

f ( x,<br />

u*)<br />

, (11)<br />

where u* is the current best solution from population B.<br />

The goal <strong>of</strong> evolution <strong>of</strong> population A is to minimize the<br />

fitness function F(x).<br />

The fitness <strong>of</strong> an individual in population B is assigned<br />

against the value <strong>of</strong> the current best individual x* in<br />

population A. Therefore the fitness function for<br />

population B is defined as,<br />

G( u)<br />

f ( x*,<br />

u)<br />

. (12)<br />

The goal <strong>of</strong> evolution <strong>of</strong> population B is to maximize the<br />

fitness function G(u).<br />

B. Alternating co-evolutionary GA<br />

One typical co-evolutionary approach that can be<br />

applied to the continuous minimax problem is called the<br />

alternating co-evolutionary GA (ACGA). Figure 3 shows<br />

a diagram <strong>of</strong> the ACGA. The two populations (A and B)<br />

are initialized randomly. The finesses <strong>of</strong> the individuals<br />

in one population are evaluated against the other<br />

population using the functions defined in (11) and (12).<br />

For instance, after the initialization <strong>of</strong> population A (i.e. a<br />

set <strong>of</strong> random values is assigned to the design variables x<br />

between the lower bound and the upper bound), the<br />

algorithm fixes the values <strong>of</strong> the uncertainty variables u<br />

and evolves population A for several generations to<br />

minimize the fitness function F. Then the algorithm<br />

switches to the evolution <strong>of</strong> population B, while the<br />

values <strong>of</strong> the design variables archived for the best fitness<br />

are kept and the values <strong>of</strong> the uncertainty variables are<br />

- 163 - 15th IGTE Symposium 2012<br />

updated towards the maximization <strong>of</strong> the fitness function<br />

G. This alternating process repeats until the stopping<br />

criterion is met, e.g. the maximum number <strong>of</strong><br />

generations.<br />

Figure 3 Alternating Co-Evolutionary GA<br />

In the algorithm implemented in this paper, both<br />

populations have a total <strong>of</strong> = 100 individuals each. At<br />

the beginning <strong>of</strong> the execution, these individuals are<br />

randomly generated, respecting the bounds. The<br />

candidate solutions are represented by a one-dimensional<br />

array <strong>of</strong> real values. An individual has three genes<br />

representing the values <strong>of</strong> the design variables or the<br />

uncertainty variables. Individuals are selected for<br />

reproduction by means <strong>of</strong> a binary tournament where two<br />

individuals are randomly selected and their fitness values<br />

are compared, and that individual with the best fitness is<br />

selected for reproduction.<br />

The crossover operator used in the algorithm is a<br />

combination <strong>of</strong> an extrapolation method with a one-point<br />

crossover method [17]. Each pair <strong>of</strong> the selected<br />

individuals undergoes crossover with a recombination<br />

rate r = 1.0, and produces two <strong>of</strong>fspring. The operator<br />

performs a blend crossover <strong>of</strong> the gene at the crossing<br />

point, with a random factor within the interval [0, 1].<br />

After crossover, a mutation operator is applied to the<br />

<strong>of</strong>fspring, with a mutation rate m = 0.2. The mutation<br />

operator is very simple. If a gene is under mutation, the<br />

algorithm randomly generates a new real value within the<br />

bounds. The genetic algorithm implemented is<br />

generational, i.e. all <strong>of</strong>fspring replace their parents in the<br />

next generation.<br />

All the <strong>of</strong>fspring are then evaluated and the best<br />

individual is stored and is used as a population<br />

representative and passed as argument for the opponent’s<br />

evaluation, as described in equations (11) and (12).


The algorithm runs for a maximum <strong>of</strong> 100 generations<br />

and returns the best stored pair (x, u).<br />

C. Parallel co-evolutionary GA<br />

The parallel co-evolutionary GA (PCGA), shown in<br />

figure 4, is very similar to the ACGA, except that the two<br />

competitive populations evolve simultaneously. As a<br />

parallel model, this can be implemented easily for a<br />

parallel computing environment and the computational<br />

time will be reduced to half in theory.<br />

Figure 4 Parallel Co-Evolutionary GA<br />

Several other methods <strong>of</strong> co-evolutionary algorithms<br />

using different schemes <strong>of</strong> the fitness assignment can be<br />

seen in [18]–[20]. In this paper, the alternating coevolutionary<br />

GA is used to solve the robust design<br />

problem.<br />

IV. RESULTS<br />

Figure 5 A simulation model <strong>of</strong> an IPM motor.<br />

- 164 - 15th IGTE Symposium 2012<br />

A. Numerical model<br />

Figure 5 shows a quarter <strong>of</strong> a 3-phase 4-pole IPM<br />

machine. The quarter <strong>of</strong> the rotor core has one slot in the<br />

center and the rest <strong>of</strong> the core is made <strong>of</strong> steel. A<br />

permanent magnet bar made <strong>of</strong> NdFeB magnet is inserted<br />

in the center <strong>of</strong> the slot. The goal <strong>of</strong> the design is to find<br />

the optimal shape <strong>of</strong> the permanent magnet bar and the<br />

flux barriers which minimize the torque ripples <strong>of</strong> this<br />

motor, while maintaining an adequate average torque.<br />

The objective function can be defined, without<br />

considering the manufacturing uncertainties, as,<br />

Ti<br />

Tavg<br />

minmax<br />

F ( )<br />

x u<br />

i Tavg<br />

. (13)<br />

s.<br />

t.<br />

minT<br />

0.<br />

4Nm<br />

u<br />

avg<br />

The design variables chosen for the optimization are the<br />

length <strong>of</strong> the permanent magnet, L, the width <strong>of</strong> the<br />

permanent magnet, h and the distance from the<br />

permanent magnet to the surface <strong>of</strong> the rotor, d.<br />

This numerical model is solved using a 2-D nonlinear<br />

finite element solver (MagNet [21]). At each iteration <strong>of</strong><br />

the optimization, torques are evaluated at different<br />

positions <strong>of</strong> the rotor. The rotor mesh is regenerated after<br />

a new geometry <strong>of</strong> the rotor is archived during the<br />

optimization process.<br />

B. Results from RTO<br />

The robust topological optimization method is applied<br />

to a rotor core filled only with iron [22]. The topological<br />

gradient is evaluated in the design region in order to find<br />

potential topological changes which reduce the value <strong>of</strong><br />

the cost function. The permanent magnet and air<br />

materials are created in the region according to the TG<br />

values, as shown in figure 6.<br />

Figure 6 Topology <strong>of</strong> the rotor generated by RTO<br />

This shows a rough topology with one permanent<br />

magnet block and two air flux barriers <strong>of</strong> the design,<br />

which serves as the starting point <strong>of</strong> the shape<br />

optimization process. The system then optimizes the<br />

shape <strong>of</strong> the boundaries between different materials in<br />

order to achieve more accurate values <strong>of</strong> the geometries.<br />

The value <strong>of</strong> the design variables <strong>of</strong> the robust optimal is:<br />

H = 1.616 mm, L = 19.879 mm and d = 12.243 mm.


C. Results from ACGA<br />

It is not practical to combine the topology optimization<br />

with the alternating co-evolutionary GA due to the huge<br />

computational cost. Thus the ACGA is applied to the<br />

model shown in figure 5 to find the robust optimal values<br />

<strong>of</strong> the design variables. The manufacturing tolerances <strong>of</strong><br />

the design variables are considered as the uncertainties <strong>of</strong><br />

the problem. The optimal results, from the robust<br />

formulation, are given in table I.<br />

TABLE I<br />

VALUES OF DESIGN VARIABLES OF THE NOMINAL AND ROBUST OPTIMA<br />

Design<br />

variables<br />

Unit Nominal<br />

optimal<br />

Robust<br />

Optimal<br />

(by RTO)<br />

Robust<br />

Optimal<br />

(by CGGA)<br />

H mm 1.588 1.616 1.453<br />

L mm 18.33 19.879 19.345<br />

D mm 13.426 12.243 12.695<br />

Nominal<br />

Performance<br />

Nm 0.2358 0.2583 0.2547<br />

Worst<br />

Performance<br />

Nm 0.2709 0.2791 0.3039<br />

Feasibility<br />

robustness<br />

No Yes Yes<br />

Table 1 shows the values <strong>of</strong> the non-robust optimal and<br />

the robust optimal. The worst cases <strong>of</strong> the performances<br />

are evaluated. The uncertainty is set to be 5% <strong>of</strong> the<br />

design variables.<br />

V. CONCLUSION<br />

This paper discusses robust design issues and<br />

formulations for IPM design problems. Two different<br />

methods have been applied, and they can both find robust<br />

solutions for the problem.<br />

The robust topology optimization method employs a<br />

deterministic search based on the topological gradient<br />

and the shape sensitivity. The algorithm requires two<br />

FEM solutions per evaluation <strong>of</strong> the robust objective<br />

function (one FEM solution for the nominal cost function<br />

value and sensitivity calculation, and one for the worst<br />

performance). In the deterministic search, the maximum<br />

number <strong>of</strong> objective function evaluation is set to 200 and<br />

the total time <strong>of</strong> execution is around a few hours. Thus<br />

the method is very efficient and fast to converge.<br />

However, the robust objective function defined in [1] is<br />

not necessarily partially differentiable and this may pose<br />

some difficulties to the optimization. Also, convexity <strong>of</strong><br />

the objective function is not guaranteed, thus the worst<br />

performance point may be found inside the uncertainty<br />

set instead <strong>of</strong> on the corner. The worst performance<br />

prediction is only an approximation.<br />

On the other hand, the co-evolutionary GA does not<br />

rely on the sensitivity information. The algorithm<br />

maintains a population <strong>of</strong> the uncertainty variables and<br />

seeks for the exact worst performance point in the<br />

uncertainty set. The algorithm maintains two populations<br />

with 100 individuals for each population. The maximum<br />

number <strong>of</strong> generations <strong>of</strong> evolution is set to 100. The coevolutionary<br />

GA requires a total number <strong>of</strong> 20000 FEM<br />

- 165 - 15th IGTE Symposium 2012<br />

solutions and the total execution time for the algorithm is<br />

around 5 days. Although a huge computation time is<br />

required for the co-evolutionary GA, this algorithm is<br />

parallelizable, thus the time may be reduced by choosing<br />

an appropriate scheme <strong>of</strong> parallel computing. Also,<br />

depending on the nature <strong>of</strong> the optimization problems,<br />

some modifications can be applied to the GA to reduce<br />

the number <strong>of</strong> the function evaluations.<br />

[1]<br />

REFERENCES<br />

H. T.Wang, Z. J. Liu, S. X. Chen, and J. P.Yang, “Application <strong>of</strong><br />

Taguchi method to robust design <strong>of</strong> BLDC motor performance,”<br />

IEEE Trans.Magn., vol. 35, pp. 3700–3702, Sept. 1999.<br />

[2] Y. Sang-Baeck, et al., "Robust shape optimization <strong>of</strong><br />

[3]<br />

electromechanical devices," Magnetics, IEEE Transactions on,<br />

vol. 35, pp. 1710-1713, 1999.<br />

C. M. Piergiorgio Alotto, Werner Renhart, Andreas Weber, Gerald<br />

Steiner, "Robust target functions in electromagnetic design,"<br />

COMPEL: The International Journal for Computation and<br />

Mathematics in Electrical and Electronic Engineering, vol. 22, pp.<br />

549 - 560, 2003.<br />

[4] G. Steiner, et al., "Managing uncertainties in electromagnetic<br />

design problems with robust optimization," Magnetics, IEEE<br />

Transactions on, vol. 40, pp. 1094-1099, 2004<br />

[5] F. G. Guimaraes, et al., "Multiobjective approaches for robust<br />

electromagnetic design," Magnetics, IEEE Transactions on, vol.<br />

42, pp. 1207-1210, 2006.<br />

[6] J. S. Han and B. M. Kwak, "Robust optimization using a gradient<br />

index: MEMS applications," Structural and Multidisciplinary<br />

Optimization, vol. 27, pp. 469-478, 2004.<br />

[7] K. Nam-Kyung, et al., "Robust Optimization Utilizing the Second-<br />

Order Design Sensitivity Information," Magnetics, IEEE<br />

[8]<br />

Transactions on, vol. 46, pp. 3117-3120, 2010.<br />

Min Li, David A. Lowther, "A robust objective function for<br />

topology optimization", COMPEL: The International Journal for<br />

Computation and Mathematics in Electrical and Electronic<br />

Engineering, Vol. 30 Iss: 6, pp.1829 – 1841, 2011<br />

[9] G. Spagnuolo, "Worst case tolerance design <strong>of</strong> magnetic devices<br />

by evolutionary algorithms," Magnetics, IEEE Transactions on,<br />

vol. 39, pp. 2170-2178, 2003.<br />

[10] M. Ci<strong>of</strong>fi, et al., "Stochastic handling <strong>of</strong> tolerances in robust<br />

magnets design," Magnetics, IEEE Transactions on, vol. 40, pp.<br />

1252-1255, 2004.<br />

[11] H. J. C. Barbosa, A coevolutionary genetic algorithm for<br />

constrained optimization. <strong>Proceedings</strong> <strong>of</strong> the 1999 Congress on<br />

Evolutionary Computation, CEC 99. vol. 3, 1999.<br />

[12] J. Kim, Co-evolutionary computation for constrained min-max<br />

problems and its applications for pursuit-evasion games.<br />

<strong>Proceedings</strong> <strong>of</strong> the IEEE Congress on Evolutionary Computation,<br />

CEC 2001, vol. 2, pp. 1205-1212, 2001.<br />

[13] J. M. Claverie, Robust nonlinear control design using competitive<br />

coevolution, <strong>Proceedings</strong> <strong>of</strong> the IEEE Congress on Evolutionary<br />

Computation, CEC 2000, vol. 1, pp. 403-409, 2000.<br />

[14] A. M. Cramer, et al., "Evolutionary Algorithms for Minimax<br />

Problems in Robust Design," Evolutionary Computation, IEEE<br />

Transactions on, vol. 13, pp. 444-453, 2009.<br />

[15] K. Dong-Hun, et al., "Smooth Boundary Topology Optimization<br />

for Electrostatic Problems Through the Combination <strong>of</strong> Shape and<br />

Topological Design Sensitivities," Magnetics, IEEE Transactions<br />

on, vol. 44, pp. 1002-1005, 2008.<br />

[16] D. H. Kim, et al., "The Implications <strong>of</strong> the Use <strong>of</strong> Composite<br />

Materials in Electromagnetic Device Topology and Shape<br />

Optimization," Magnetics, IEEE Transactions on, vol. 45, pp.<br />

1154-1157, 2009<br />

[17] Haupt, Randy L. Practical genetic algorithms / Randy L. Haupt,<br />

Sue Ellen Haupt.—2nd ed. p. cm. Red. ed. <strong>of</strong>: Practical genetic<br />

algorithms. c1998. “A Wiley-Interscience publication.” ISBN 0-<br />

471-45565-2.<br />

[18] Y. Shi and R. A. Krohling, “Co-evolutionary particle swarm<br />

optimization to solve min-max problems,” in Proc. 2002 Cong.<br />

Evol. Comput., vol. 2, pp. 1682–1687<br />

[19] M. T. Jensen, “A new look at solving minimax problems with<br />

coevolution,” in Applied Optimization, Vol. 86, Metaheuristics:


Computer Decision-Making,M.G. C. Resende and J. Pinho de<br />

Sousa, Eds. Boston, MA: Kluwer, 2004, pp. 369–384.<br />

[20] J. Hur, H. Lee, and M.-J. Tahk, “Parameter robust control design<br />

using bimatrix co-evolution algorithms,” Eng. Optim., vol. 35, no.<br />

4, pp. 417–426, Aug. 2003.<br />

[21] MagNet user’s manual 2012, http://www.infolytica.ca<br />

[22] M. Li, and D. A. Lowther, “Robust Topology Optimization <strong>of</strong> an<br />

IPM Motor using Topological Analysis,” proceeding <strong>of</strong><br />

CompuMag2011, 2011<br />

- 166 - 15th IGTE Symposium 2012


IGTE Symposium, TU <strong>Graz</strong> 2012<br />

- 167 - 15th IGTE Symposium 2012<br />

Free-form Optimization for Magnetic Design<br />

Z. Andjelić 1 , S. Sadović 2<br />

1 ABB Corporate Research, Baden, Switzerland;<br />

2 Sadovic Consulting, Paris, France<br />

E-mail: zoran.andjelic@ch.abb.com<br />

Abstract— The paper presents an approach for free-form optimization <strong>of</strong> the magnetic problems. The approach is based on<br />

the novel simple sensitivity analysis and does not require the calculation <strong>of</strong> the adjoint problem. The solution engine in the<br />

background is IEM. The developed approach is illustrated on some typical benchmark problems.<br />

Index Terms— Free-form optimization, IEM, Sensitivity analysis<br />

Also, in free-form optimization the meshing <strong>of</strong> the<br />

I. INTRODUCTION<br />

analysed objects using mesh generator is performed<br />

When speaking about free-form optimization <strong>of</strong> industrial<br />

only in the first iteration. For all further iterations the<br />

problems we distinguish two different approaches: direct<br />

mesh is updated directly in the Analysis module.<br />

and indirect approach. In direct approach we try to As we use the non-gradient approach, the calculation<br />

minimize the maximal field quantities laying directly on time is much faster than with the gradient approach,<br />

the interface between different media by changing the requiring the costly calculation <strong>of</strong> the gradients.<br />

form <strong>of</strong> those interfaces in the normal direction. Typical As mentioned above we distinguish between the direct<br />

applications are optimization <strong>of</strong> the structural problems and indirect approaches for free-form optimization. One<br />

[1], or dielectric design <strong>of</strong> electrical apparatus [2], [3], <strong>of</strong> the additional key differences between direct and<br />

[4]. In indirect approach we are searching for the indirect approach is that for the optimization problems<br />

prescribed distribution <strong>of</strong> the objective function in the following the direct approach it is not necessary to<br />

space <strong>of</strong> interest by changing the shape <strong>of</strong> the structures calculate any sensitivity function [2], [3]. In the current<br />

outside <strong>of</strong> such space <strong>of</strong> interest. In this paper we discuss contribution we shall focus us on the indirect approach<br />

in more details the second approach, illustrated by some illustrated by some applications in magnetic design. It is<br />

typical benchmark problems.<br />

important to note that the proposed approach is<br />

independent <strong>of</strong> the application class and can be used for<br />

optimization <strong>of</strong> not only magnetic but also dielectric,<br />

acoustic or similar class <strong>of</strong> problems. It also has a generic<br />

character and can be used having FEM or other numerical<br />

method as the numerical engine in the background. In the<br />

present contribution we use IEM (Integral Equation<br />

Method) for the solution <strong>of</strong> the magnetostatic field<br />

problems.<br />

II. FREE-FORM OPTIMIZATION<br />

For automatic shape optimization we follow a nonparametric,<br />

non-gradient approach, which in<br />

combination with IEM (Integral Equation Method)<br />

enables fast and robust optimization <strong>of</strong> the real-world 3D<br />

problems [5]. The main benefits <strong>of</strong> such an approach<br />

comparing to the standard parametric, gradient-based<br />

approaches are:<br />

The applied procedure usually leads to the global<br />

optimum contrary to the parametric optimization<br />

approach where the optimum can be searched only<br />

within the “parametric space” defined by the design<br />

parameters (radii, distances, etc.).<br />

Due to the fact that we don’t need as input any design<br />

parameter it is not necessary to “communicate” with<br />

the CAD system during the optimization iterative<br />

procedure. As shown in Figure 1 the iterative<br />

framework in free-form optimization requires<br />

communication only between Analysis and<br />

Optimization module, whereby by parametric<br />

optimization in each iteration a new set <strong>of</strong> parameters<br />

has to be generated in CAD tool, meshed in mesh<br />

generator and then passed to Analysis module for<br />

further processing.<br />

Figure 1: Free-form vs. parametric optimization framework<br />

III. IEM FORMULATION<br />

The analysis <strong>of</strong> the non-linear problems in<br />

magnetostatic by IEM is performed using the improved<br />

procedure described initially in [6], and more detailed<br />

elaborated recently in [7]. The magnetic field in any space<br />

point can be found as:<br />

J M<br />

H H H<br />

J<br />

where H is a field component produced by the excitation<br />

M<br />

current in free space and H is a field produced by the<br />

magnetic charges. The first field component can be easily<br />

calculated by Bio-Savarot law. For the calculation <strong>of</strong> the<br />

second one we use the formula:<br />

M 1 1<br />

H J 1dSJ N<br />

2dVN<br />

(2)<br />

4 J K 1 dS <br />

4<br />

N 2 2dV<br />

N (2)<br />

4 4<br />

<br />

K<br />

S<br />

J<br />

VN<br />

where J and N are the fictitious surface and volume<br />

magnetic charges, and 1 K and K 2 are the kernels <strong>of</strong> the<br />

3<br />

type r / r . The surface charges are obtained by solving<br />

second Fredholm integral equation:<br />

(1)


IGTE Symposium, TU <strong>Graz</strong> 2012<br />

1 1<br />

2 (3)<br />

<br />

(3)<br />

2 <br />

1<br />

<br />

I<br />

J<br />

J GdS 1 I 2 I I N NGdV 2 2d<br />

N<br />

I 2 2 2<br />

s V N<br />

JGdS 1 2H<br />

n<br />

Here 1 G and 2<br />

12 <br />

<br />

1 2<br />

G are the kernels <strong>of</strong> the type<br />

3<br />

rn/ r and<br />

where the 1 and 2 are the relative<br />

permeabilities <strong>of</strong> the surrounding media and magnetic<br />

materials. When solving the linear problems the last term<br />

on the right-hand side <strong>of</strong> equation (3) is equal to zero.<br />

Here is important to stress some <strong>of</strong> the main features<br />

<strong>of</strong> IEM when solving the non-linear magnetostatic<br />

problem. In spite <strong>of</strong> the fact that it is necessary to mesh<br />

the volume <strong>of</strong> the non-linear magnetic parts, the number<br />

<strong>of</strong> unknowns for the non-linear problem is same as the<br />

number <strong>of</strong> unknowns for the linear one. This is due to the<br />

fact that the non-linear contribution - second term on the<br />

right-hand side <strong>of</strong> (3) - appears just as the correction term<br />

and is calculated throughout the iteration procedure from<br />

the previous iteration. Also, as the material parameter<br />

appears only in the diagonal term, the matrix<br />

calculation is performed only during the first iteration. In<br />

other iterations only the diagonal term is changed together<br />

with the RHS term taking into account the contributions<br />

due to volume charges.<br />

IV. INDIRECT APPROACH<br />

Contrary to the direct approach where it is not necessary<br />

to calculate any sensitivity functions, in indirect approach<br />

this function has to be established. To establish such<br />

function we use the analogy to the sensitivity analysis<br />

typically used in the signal-processing (SP) problems. In<br />

SP the objective <strong>of</strong> controller design is to keep the error<br />

between the controlled output and the external input as<br />

small as possible. In signal processing the sensitivity<br />

function S(s) is typically calculated as:<br />

Es ()<br />

Ss () ; Es () Rs () Ys<br />

() (4)<br />

Rs () ds ()<br />

where E(s) is feedback error, R(s) and d(s) are the<br />

external input and disturbance.<br />

To calculate the sensitivity for our optimization tasks we<br />

use the analogy to the above SP scheme. Here we take as<br />

example the quantities from the magnetic problem,<br />

Figure 2.<br />

Figure 2: Sensitivity calculation scheme for optimization<br />

tasks<br />

In magnetic problems the magnetic field in the space<br />

point <strong>of</strong> interest can be calculated using BEM [5] as:<br />

- 168 - 15th IGTE Symposium 2012<br />

1<br />

H(<br />

j) ( i) K( i, j) d<br />

(5)<br />

4 <br />

<br />

Sensitivity <strong>of</strong> changing the field H in the space <strong>of</strong> interest<br />

with the changes <strong>of</strong> the geometry <strong>of</strong> the magnetized body<br />

can then be obtained as:<br />

H H<br />

S <br />

H H<br />

G C<br />

G C<br />

max<br />

In the above case the external input H G is a given i.e.<br />

prescribed (desired) field distribution in the space <strong>of</strong><br />

interest, H C is a calculated field in the same space. The<br />

C<br />

disturbance H is calculated as:<br />

max<br />

C C<br />

max max<br />

(6)<br />

H ( i) max[ H ( i, j), j1, N ]; (7)<br />

The displacement vector D <strong>of</strong> the moving <strong>of</strong> the mesh<br />

nodes can then be calculated as:<br />

D Sn More information on the calculation <strong>of</strong> the sensitivity<br />

function for free-form optimization tasks can be found in<br />

[8].<br />

For illustration the above procedure has been used to<br />

optimize the Die mold problem, Example 1 and Field<br />

homogenization problem, Example 2.<br />

V. EXAMPLE 1: DIE MOLD OPTIMIZATION<br />

This is a TEAM benchmark problem No. 25 used up to<br />

now for the benchmarking <strong>of</strong> the codes dealing with 2D<br />

parametric optimization [9], Figure 3.<br />

Figure 3: TEAM benchmark problem No. 25<br />

Here we use the same model as a 3D problem adding the<br />

extrusion in y-direction <strong>of</strong> 200 mm, Figure 4. The<br />

objective is to obtain the homogeneous radial field<br />

distribution within the cavity shown in Figure 3. One <strong>of</strong><br />

the die molds is keept fix (inner cylinder) and the other<br />

one is in our approach subjected to the free-optimization<br />

process in order to get the radial field distribution in the<br />

j<br />

8


IGTE Symposium, TU <strong>Graz</strong> 2012<br />

cavity. 3D model is shown in Figure 4 and the detailed<br />

2D view in Figure 5.<br />

Figure 4: 3D model <strong>of</strong> the Team problem No. 25<br />

Figure 5: Details <strong>of</strong> the Team problem No. 25<br />

Applying module for free-form optimization governed by<br />

the above given approach for sensitivity calculation we<br />

have after 24 iterations obtained the optimal form <strong>of</strong> the<br />

magnetic poles, Figure 6 (in red).<br />

Figure 6: Outer magnetic mold before and after optimization<br />

Such new form <strong>of</strong> magnetic poles has provided a desired<br />

radial field distribution in the cavity. Figure 7 shows the<br />

form <strong>of</strong> the magnetic poles befor otpimization, after 10 th<br />

iteration and as the optimal form after 24 iterations. The<br />

field vectors illustrate the changes <strong>of</strong> the field during the<br />

optimization process. Only at the end <strong>of</strong> the die molds<br />

some deviations are observed caused by the end-region<br />

field disturbances.<br />

- 169 - 15th IGTE Symposium 2012<br />

Figure 7: Magnetic field homogenization during the<br />

optimization process.<br />

VI. EXAMPLE 2: AIR GAP FIELD HOMOGENIZATION<br />

In this example the objective function is to achieve the<br />

homogeny field distribution over the prescribed space <strong>of</strong><br />

interest lying in the air gap between the magnetic poles,<br />

Figure 8.<br />

Figure 8: Model <strong>of</strong> the magnetic structure<br />

The core is made <strong>of</strong> the material with 1500<br />

, and is<br />

excited by the current-carrying coil with I=12240A.<br />

Before doing any optimization the magnetic field<br />

distribution over the space <strong>of</strong> interest is shown in Figure 9<br />

and Figure 11, a.). The field over the space <strong>of</strong> interest<br />

varies from 35737 A/m to 57555 A/m. As the<br />

optimization objective we define here the desired value <strong>of</strong><br />

the homogeneous field over the space <strong>of</strong> interest<br />

(50x50mm) as H d =50000 A/m. After applying the<br />

optimization modus governed by the above sensitivity<br />

calculation, we have obtained after 37 iterations the<br />

optimal form <strong>of</strong> the magnetic pole shoes that deliver the<br />

desired field distribution within the error less than 10%,<br />

Figure 10.


IGTE Symposium, TU <strong>Graz</strong> 2012<br />

Figure 9: Magnetic field distribution over the space <strong>of</strong><br />

interest before any optimization<br />

Figure 10: Magnetic field distribution over the space <strong>of</strong><br />

interest after 37 iterations. The magnetic poles have changed<br />

the form in order to provide prescribed homogeneous field <strong>of</strong><br />

50000A/m.<br />

Figure 11 shows in more details the field distribution over<br />

the space <strong>of</strong> interest before (a) and after optimization (b).<br />

Figure 11: Detailed view on the field distribution over the<br />

space <strong>of</strong> interest before (a) and after (b) optimization<br />

The field variation for optimal design (with threshold<br />

error <strong>of</strong> 10%) is between Hmin = 46489A/m and<br />

Hmax=55027 A/m.<br />

VII. CONCLUSION<br />

The paper elaborates the procedure for free-form<br />

optimization <strong>of</strong> magnetic problems. The procedure is<br />

- 170 - 15th IGTE Symposium 2012<br />

based on the novel approach for the simple sensitivity<br />

calculation. The proposed approach does not require<br />

calculation <strong>of</strong> the adjoint problem and has a generic<br />

character with respect to both the classes <strong>of</strong> the<br />

application (magnetic, dielectric, acoustics...) and the<br />

numerical methods used within the simulation engine<br />

(BEM, FEM).<br />

REFERENCES<br />

[1] R. Meske: “Non-parametric gradient-less shape optimization in<br />

solid mechanics”, Shaker Verlag,2007, ISBN 978-3-8322-6373-7<br />

[2] Z. Andjelic, S. Sadovic: “Reduction <strong>of</strong> breakdown appearance by<br />

automatic geometry optimization”, IEEE Conf. on El. Insulation<br />

and Dielectric Phenomena, Vancouver BC, Canada, 2007<br />

[3] Z. Andjelic, D. Pusch, T. Schoenemann, S. Sadovic: “Multi-load<br />

optimization in electrical engineering design, Part 1: Simulation,<br />

EngOpt 2008- Int. Conf. on Engineering Optimization, Rio de<br />

Janeiro, Brazil, 01-05. June 2008<br />

[4] Z. Andjelic, S. Sadovic, Jean-Claude Mauroux: “Preventing<br />

breakdown by direct optimization approach”, IEEE Int. Power<br />

Modulator and High Voltage Conf, San Diego, CA-June 3-7,<br />

2012<br />

[5] Z. Andjelic at al: “BEM-based simulations in engineering<br />

design”, In Boundary Element Analysis, Mathematical Aspects<br />

and Applications, Springer Verlag 2007, ISBN: 3-540-47465-X<br />

[6] B. Krstajic, Z. Andjelic, S. Milojkovic, S. Babic, S. Salon:<br />

“Nonlinear 3D magnetostatic field calculation by the integral<br />

equation method with surface and volume magnetic charges”,<br />

IEEE Tran. on Mag., vol.28, No.2, March 199<br />

[7] Z. Andjelic, G. Of, O. Steinbach, P. Urthaler: “Fast BEM for<br />

industrial applications in magnetostatic”, in Lecture Nodes in<br />

Applied and Computational Mechanics, Springer-Verlag, Vol. 63,<br />

2012<br />

[8] Z. Andjelic: “Simple sensitivity approach for optimization tasks in<br />

electrical engineering”, OIPE Workshop, Gent, Belgium, 2012<br />

[9] N. Takahashi, M. Natsumeda, M. Otoshi and K. Muramatsu:<br />

“Examination <strong>of</strong> optimal design method using die press model<br />

(problem 25)”, COMPEL 17 5/6, 1982


- 171 - 15th IGTE Symposium 2012<br />

Optimization for ECT treatment planning<br />

1 P. Di Barba, 3 L.G. Campana, 2 F. Dughiero, 3 C.R. Rossi, 2 E. Sieni<br />

1 Department <strong>of</strong> Industrial and Information Engineering, Pavia <strong>University</strong>, via Ferrata 1, 27100 Pavia (Italy)<br />

2 Department <strong>of</strong> Industrial Engineering, Padova <strong>University</strong>, via Gradenigo 6/A, 35131 Padova (Italy)<br />

3 Melanoma and Sarcoma Unit, Istituto Oncologico Veneto (IOV),Via Gattamelata 64, 35128 Padova (Italy)<br />

E-mail: paolo.dibarba@unipv.it,{fabrizio.dughiero, carlor.rossi, elisabetta.sieni}@unipd.it, luca.campana@ioveneto.it<br />

Abstract—Treatment planning <strong>of</strong> Electrochemotherapy (ECT) is designed by means <strong>of</strong> a genetic multi-objective optimization<br />

method: the needle position maximizing the electric field in the treated volume is searched for. NSGA algorithm is coupled<br />

with penalty function technique in order to identify the constrained Pareto front to select the best compromise solutions and<br />

discard the unfeasible ones.<br />

Index Terms—Electrochemotherapy, conduction field, Finite Element, Pareto front, NSGA.<br />

I. INTRODUCTION<br />

ECT uses pulses <strong>of</strong> electric field in order to improve the<br />

delivery <strong>of</strong> chemotherapeutic drugs into cancer cells [1]-<br />

[2]. A suitable electric field intensity is able to induce cell<br />

membrane permeabilization that improves the<br />

chemotherapy drug delivery. However, a high electric<br />

field intensity in healthy tissues, and in some critical<br />

regions like e.g. large vessels, is to be prevented. A<br />

conduction electric field is applied to tumor tissues by<br />

means <strong>of</strong> needle electrodes suitably positioned in the<br />

target volume. In order to improve the therapy success,<br />

the positioning <strong>of</strong> electrodes is considered an<br />

optimization problem. The research group in Ljubljana<br />

<strong>University</strong> has proposed some solutions to optimal<br />

electrode positioning in deep-seated tumor like in [3-6].<br />

In this paper a multiobjective optimization method, based<br />

on a modified NSGA-II algorithm, that includes<br />

constraints and penalty functions in order to prevent<br />

unfeasible solutions, is proposed for the optimal<br />

positioning <strong>of</strong> needles in the tumor mass [7-10]. The<br />

optimization problem is solved using a 2D model <strong>of</strong><br />

steady conduction field.<br />

II. CLINICAL ECT<br />

ECT is a medical therapy based on cell electroporation<br />

for patients with cutaneous and subcutaneous tumor<br />

nodules on the basis <strong>of</strong> the synergistic association <strong>of</strong><br />

locally applied brief electrical currents (reversible<br />

electroporation) and low permeant anticancer agents [11-<br />

13]. Electroporation is a local electric treatment that uses<br />

a physical behavior <strong>of</strong> cells when a pulsed electric field is<br />

applied in order to open some pores on the cell<br />

membrane. Those opening can be used as channels as a<br />

delivery system to enhance the penetration <strong>of</strong> drugs,<br />

genes, or molecular probes into cancer cells. This is an<br />

applied electrical fields with suitable intensity that<br />

increase cell membrane permeability [14-17]. Figure 1<br />

shows the most important phase <strong>of</strong> chemotherapy drug<br />

administration using ECT technique: In the phase I <strong>of</strong> the<br />

treatment the clinician injects the drug (e.g. bleomicine),<br />

then during phase II he applies the electric pulse, and<br />

finally the drug penetrate the membrane cells.<br />

Since its development at the Institute Gustave Roussy,<br />

this technique has been quickly tested in the clinical<br />

setting and recently is entered in the clinical practice [11-<br />

12], [18-21]. At Melanoma and Sarcoma unit <strong>of</strong> the<br />

“Istituto Oncologico Veneto” (IOV) in Padova, Italy,<br />

clinical application <strong>of</strong> ECT using standard electrodes [23]<br />

has shown yet satisfying results [22]. Standard electrodes<br />

are a set <strong>of</strong> 7 needles with a length between 10 and 30<br />

mm on a rigid support, [23]. The ECT equipment<br />

manufacturer produces also a long needles machine that<br />

can be used to treat with ECT some deep-seated tumors<br />

like sarcoma [23-25]. In this case the clinician implants<br />

single 20 cm length electrodes on the tumor mass<br />

accordingly to medical image <strong>of</strong> the tumor and clinical<br />

practice. So, in the case <strong>of</strong> flexible long-needle<br />

equipment, it is <strong>of</strong> interest to improve the therapy success<br />

studying the electric field produced by some<br />

configurations <strong>of</strong> electrodes implanted on the tumor mass<br />

using optimization algorithms.<br />

ECT electrode<br />

E<br />

Skin surface<br />

Figure 1: Description <strong>of</strong> the ECT application.<br />

III. DIRECT PROBLEM: ELECTRIC FIELD ANALYSIS<br />

In general, the case study models three regions: the<br />

tumor, T, with an average radius <strong>of</strong> 3 cm, the<br />

surrounding healthy tissue, H, and a region close to the<br />

treated region that might be a critical one, C. Each<br />

region is attributed the relevant electric conductivity [3].<br />

The needle electrodes are represented as a set <strong>of</strong> nine<br />

points. In particular, the fixed main electrode is located in<br />

the center <strong>of</strong> the lesion whereas the other eight electrodes,<br />

the ones that can be moved, are around the central one.<br />

The ECT process forces a sequence <strong>of</strong> voltages in the<br />

range 1 to 3 kV for each electrode pair. The imposed<br />

voltages represent the boundary conditions <strong>of</strong> the field<br />

problem. Then, the electric field is computed by means <strong>of</strong><br />

the finite-element method (FEM) solving a steady<br />

conduction problem for each electrode pair: specifically,<br />

16 field analyses on the same grid are needed to compute<br />

the electric field for each needle configurations [26]. The


solved equation is:<br />

V<br />

0<br />

(1)<br />

imposing Neumann condition on electric scalar potential<br />

on the domain boundary:<br />

V<br />

n<br />

0<br />

And finally the electric potential has been fixed to a<br />

constant value, U, in two <strong>of</strong> the ne electrodes in the<br />

following way<br />

V U<br />

i<br />

0 V U<br />

(<br />

i,<br />

j)<br />

i j i 1,...<br />

n<br />

j<br />

e<br />

U<br />

i<br />

Then, given an electrode configuration, solving the direct<br />

problem implies to repeat the field analysis, i.e. solving<br />

(1), for all possible (i,j) pairs <strong>of</strong> electrodes.<br />

<br />

H<br />

C<br />

T<br />

Electrode<br />

U i<br />

Main electrode<br />

Figure 1: Geometry <strong>of</strong> the 2D conduction field.<br />

Given all the ne field analyses considering the mesh<br />

nodes <strong>of</strong> each problem region the highest value <strong>of</strong> the<br />

electric field is searched for each node <strong>of</strong> the problem<br />

domain and recorded in sets named Emax(i), one for each<br />

<strong>of</strong> examined region.<br />

IV. INVERSE PROBLEM: OPTIMAL ELECTRODE<br />

POSITIONING<br />

The therapy efficacy depends on the electric field<br />

intensity applied to the cells. In some practical cases the<br />

proximity to a prescribed therapeutic value <strong>of</strong> the<br />

temperature is searched for [27-28], whereas in our case<br />

the overcoming <strong>of</strong> a given electric field threshold is to be<br />

controlled. The ideal configuration <strong>of</strong> needle electrodes is<br />

the one that maximizes the sub-volume <strong>of</strong> the tumor<br />

region covered with an electric field intensity over the<br />

electropermeabilization threshold [3], ERE, and<br />

simultaneously minimizes the volume <strong>of</strong> healthy tissues or<br />

critical organs that have an electric field higher than ERE<br />

[29-30]. Accordingly, the following objective functions,<br />

to be minimized, have been defined:<br />

<br />

<br />

N E ( E E<br />

f1(<br />

E)<br />

100<br />

<br />

1<br />

N E,<br />

tot<br />

RE<br />

) <br />

<br />

<br />

<br />

that represents the complementary sub-volume <strong>of</strong> the<br />

tumor region for which the electric field is under ERE,<br />

evaluated as the number <strong>of</strong> nodes, NE, in which the<br />

U j<br />

(2)<br />

(3)<br />

(4)<br />

- 172 - 15th IGTE Symposium 2012<br />

electric field intensity is higher than ERE. NE,tot is the total<br />

number <strong>of</strong> nodes in which the electric field is evaluated in<br />

the tumor region. The design criterion considered<br />

evaluates the nodes <strong>of</strong> the healthy tissue region or the<br />

region <strong>of</strong> a critical organ (e.g. large vessel), in which the<br />

electric field exceeds a prescribed threshold ETH:<br />

g(<br />

E,<br />

E<br />

TH<br />

)<br />

N ( E E<br />

N<br />

E<br />

TH<br />

100<br />

(5)<br />

E,<br />

tot<br />

)<br />

Starting from (5) three objective functions have been<br />

generated. Namely:<br />

(a) g(<br />

E,<br />

E )<br />

f (6)<br />

2 IRE<br />

in which the threshold <strong>of</strong> electric field is fixed to the<br />

irreversible electroporation value, EIRE = 10 5 V/m, and is<br />

computed in the tumor region T;<br />

(b) g E,<br />

E )<br />

f (7)<br />

3 ( ETH 1<br />

in which the threshold <strong>of</strong> electric field is fixed to the<br />

reversible electroporation, ETH1 = 410 4 V/m, computed<br />

on the healthy tissue H. In this case it is desirable that<br />

the electric field does not exceed the threshold ETH1 in<br />

order to preserve healthy tissue; and finally:<br />

(c) g E,<br />

E )<br />

f (8)<br />

4 ( ETH 2<br />

In this case the electric field cannot exceed the ETH2 = 10 3<br />

V/m in the critical region C to prevent the damage <strong>of</strong><br />

critical organs. Generally this threshold is chosen lower<br />

than electroporation threshold in order to ensure an<br />

electric field lower the ERE.<br />

All the objective functions (4) and (6)-(8) are computed<br />

using the Emax(i) set <strong>of</strong> values in the corresponding<br />

region <strong>of</strong> the computation domain.<br />

A1<br />

A2 A2<br />

Figure 2: f1 and f3 optimization goal.<br />

Accordingly, a sequence <strong>of</strong> bi-objective optimization<br />

problems have been considered and solved: find the<br />

Pareto front minimizing the couple <strong>of</strong> functions (f1, fk)<br />

with k=2,3 and 4 subject to the solution <strong>of</strong> the direct<br />

problem (1) and a set <strong>of</strong> geometrical constraints on the<br />

electrode position. For instance the minimum distance<br />

between two electrodes must be greater than 10 mm.<br />

Constraints have been incorporated in the objectives<br />

functions by means <strong>of</strong> a penalty term as in [7]. For<br />

instance Figure 2 shows the f1 and f3 optimization goal: f1<br />

tends to maximize the area A1, whereas f3 tends to<br />

A1


minimize the area A2. Moreover, Figure 3 shows the<br />

penalty constraint effect: if the non-penalty algorithm is<br />

used, two electrodes can be at a distance lower than the<br />

prescribed minimum (10 mm) like the one marked with a<br />

circle in Figure 3 (a). In contrast, if the penalty algorithm<br />

is used, too near electrodes configuration are discarded<br />

and a possible configuration is like the one in Figure 3<br />

(b).<br />

(a) (b)<br />

Figure 3: Penalty algorithm effect.<br />

V. RESULTS<br />

Results <strong>of</strong> some optimized configurations are here<br />

presented.<br />

A. Case 1: penalty vs non-penalty<br />

The optimization problem considers the electric field in<br />

the tumor region T that must exceed the electroporation<br />

threshold ERE (f1) and the electric field in the healthy<br />

tissue region, T, that must be lower than the<br />

electroporation threshold ETH1=ERE (f3).<br />

In this case, results obtained using penalty algorithm are<br />

compared with results obtained using non-penalty<br />

algorithm. Figure 4 reports the two Pareto fronts obtained<br />

starting from the same initial population and using the<br />

two algorithm: the Pareto front is reshaped.<br />

Figure 4: Pareto Front for the case 1 using penalty and<br />

non-penalty algorithm.<br />

200103 E [V/m]<br />

15010 3<br />

10010 3<br />

5010 3<br />

0,00<br />

Not feasible<br />

Figure 5: Optimized electrodes configurations: Electric<br />

field in the examined region using (a) penaltyand (b) nonpenalty<br />

algorithm.<br />

- 173 - 15th IGTE Symposium 2012<br />

In Figure 5 the highest value <strong>of</strong> the electric field obtained<br />

at each domain point applying the whole sequence <strong>of</strong><br />

electrodes discharges during an ECT treatment is reported<br />

for the tumor region, Emax(T), and the healthy tissue,<br />

Emax(H). The corresponding electrodes position is also<br />

indicated by dots.<br />

A. Case 2: preventing irreversible electroporation<br />

The optimization problem considers the electric field in<br />

the tumor region T that must exceed the electroporation<br />

threshold ERE (f1) and must be lower than the irreversible<br />

threshold, EIRE (f2).<br />

Figure 6 reports the Pareto front obtained starting from an<br />

initial population and using the penalty algorithm.<br />

Figure 6: Pareto Front for the case 2 using penalty<br />

algorithm.<br />

In Figure 7 the highest value <strong>of</strong> the electric field obtained<br />

at each domain point applying the whole sequence <strong>of</strong><br />

electrodes discharges during an ECT treatment is reported<br />

for the tumor region, Emax(T). The corresponding<br />

electrodes position is also indicated by dots.<br />

200103 E [V/m]<br />

15010 3<br />

10010 3<br />

5010 3<br />

0,00<br />

Figure 7: Optimized electrodes configurations: Electric<br />

field in the examined region using penalty algorithm.<br />

In this case electrodes are positioned in the healthy tissue<br />

because the irreversible electroporation is avoided in<br />

order to prevent cells necrosis.<br />

A. Case 3: preserving critical organ (blood vessel)<br />

The optimization problem considers the electric field in<br />

the tumor region T that must exceed the electroporation<br />

threshold ERE (f1) and the electric field in the critical<br />

region, C, that must be lower than the threshold ETH1<br />

(f4).<br />

Figure 8 reports the Pareto front obtained starting from an<br />

initial population and using the penalty algorithm. In<br />

Figure 9 the highest value <strong>of</strong> the electric field obtained at<br />

each domain point applying the whole sequence <strong>of</strong><br />

electrodes discharges during an ECT treatment is reported


for the tumor region, Emax(T) and the critical region,<br />

Emax(C). The corresponding electrodes position is also<br />

marked by black and green dots.<br />

Figure 8: Pareto Front for the case 3 using penalty<br />

algorithm.<br />

200103 E [V/m]<br />

15010 3<br />

10010 3<br />

5010 3<br />

Figure 9: Optimized electrodes configurations: Electric<br />

field in the examined region using penalty algorithm.<br />

In this case the electrode are far from the critical region,<br />

whereas in Figure 5 are close and even inside the critical<br />

region. Then different objective functions allow to<br />

identify different electrodes configurations depending on<br />

the problem constraints.<br />

VI. CONCLUSIONS<br />

The NSGA-II algorithm modified including constraints<br />

and penalty function has been applied to design ECT<br />

electrodes positioning. Various objective function pairs<br />

have been implemented in order to compare results<br />

obtained considering different problem targets.<br />

VII. ACKNOWLEDGES<br />

This project has been developed in the frame <strong>of</strong> a Post-<br />

Doc granted by the Padova <strong>University</strong>, Italy.<br />

REFERENCES<br />

[1] G. Sersa, D. Miklavcic, M. Cemazar, Z. Rudolf, G. Pucihar, M.<br />

Snoj, “Electrochemotherapy in treatment <strong>of</strong> tumours”, European<br />

J. Surgical Oncology, 34 (2), 232–240, 2008.<br />

[2] S. Corovic, A. Zupanic, D. Miklavcic, “Numerical modeling and<br />

optimization <strong>of</strong> electric field distribution in subcutaneous tumor<br />

treated with electrochemotherapy using needle electrodes”, IEEE<br />

Trans. on Plasma Science, 36(4), 1665–1672, 2008.<br />

[3] B. Kos, A. Zupanic, T. Kotnik, M. Snoj, G. Sersa, D. Miklavcic,<br />

“Robustness <strong>of</strong> treatment planning for electrochemotherapy <strong>of</strong><br />

deep-seated tumors”, J. Membrane Biol., 236(1), 147–153, 2010.<br />

[4] D. Sel, A. M. Lebar, D. Miklavcic, “Feasibility <strong>of</strong> employing<br />

model-based optimization <strong>of</strong> pulse amplitude and electrode<br />

distance for effective tumor electropermeabilization”, Biomedical<br />

Engineering, IEEE Trans, 54(5), 773–781, 2007.<br />

[5] D. Miklavcic, M. Snoj, A. Zupanic, B. Kos, M. Cemazar, M.<br />

Kropivnik, M. Bracko, T. Pecnik, E. Gadzijev, G. Sersa,<br />

“Towards treatment planning and treatment <strong>of</strong> deep-seated solid<br />

tumors by electrochemotherapy”, BioMedical Eng. On Line, 9(1),<br />

0,00<br />

- 174 - 15th IGTE Symposium 2012<br />

10, 2010.<br />

[6] A. Županič, S. Čorović, D. Miklavčič, “Optimization <strong>of</strong> electrode<br />

position and electric pulse amplitude in electrochemotherapy”,<br />

Radiology and Oncology, vol. 42(2), 93–101, 2008.<br />

[7] P. Di Barba, L.G. Campana, F. Dughiero, C.R. Rossi, E. Sieni,<br />

“Optimal needle positioning for electrochemotherapy: a<br />

constrained multiobjective strategy”, to appear in Proc. CEFC<br />

2012.<br />

[8] K. Deb, A. Pratap, S. Agarwal, T. Meyarivan, “A fast and elitist<br />

multiobjective genetic algorithm: NSGA-II”, IEEE Trans.<br />

Evolutionary Computation, vol. 6(2), 182–197, 2002.<br />

[9] P. Di Barba, Multiobjective Shape Design in Electricity and<br />

Magnetism. Springer, 2010.<br />

[10] P. Di Barba, F. Dughiero, E. Sieni, “Field synthesis for the<br />

optimal treatment planning in Magnetic Fluid Hyperthermia”,<br />

Archives <strong>of</strong> Electrical Engineering, vol. 61(1), 57–67, 2012.<br />

[11] L.M. Mir, “Terapeutic perspectives <strong>of</strong> in vivo cell<br />

electropermeabilization”, Bioelecrtochemistry, 53, 1–10, 2000.<br />

[12] Belehradek M, Domenge C, Luboinski B, et al.<br />

“Electrochemotherapy, a new antitumor treatment. First clinical<br />

phase I–II trial”, Cancer, 72, 3694–700, 1993.<br />

[13] L.M. Mir, S. Orlowski. “Mechanisms <strong>of</strong> electrochemotherapy”.<br />

AdvDrug Del Rev. 35,107–18, 1999.<br />

[14] C. Chen, S.W. Smye, M.P. Robinson, et al. “Membrane<br />

electroporation theories: a review”, Med Biol Eng Comput. 44, 5–<br />

14, 2006.<br />

[15] S. Somiari, J. Glasspool-Malone, J.J. Drabick, et al. “Theory and<br />

in vivo application <strong>of</strong> electroporative gene delivery”, Mol Ther., 3,<br />

178–87, 2000.<br />

[16] R. Heller, M.J. Jaroszeski, A. Atkin, et al. “In vivo gene<br />

electroinjection and expression in rat liver”, FEBS Lett.,389, 225–<br />

8, 1996.<br />

[17] M. Cemazar, G. Sersa, “Electrotransfer <strong>of</strong> therapeutic molecules<br />

into tissues”, Curr Opin Mol Ther., 9, 554–62, 2007.<br />

[18] B.J. Mossop, R.C. Barr, W. Henshaw, et al. “Electric fields in<br />

tumors exposed to external voltage sources: implication for<br />

electric field mediated drug and gene delivery”, Ann Biochem<br />

Eng., 34, 1564–72, 2006.<br />

[19] A. Gothelf, L.M. Mir, J. Gehl, “Electrochemotherapy: results <strong>of</strong><br />

cancer treatment using enhanced delivery <strong>of</strong> bleomycin by<br />

electroporation”, Cancer Treat Rev., 29, 371–87, 2003.<br />

[20] G. Sersa, “The state-<strong>of</strong>-the-art <strong>of</strong> electrochemotherapy before the<br />

ESOPE study; advantages and clinical use”, EJC Suppl., 4, 52–9,<br />

2006.<br />

[21] M. Snoj, M. Cemazar, T. Srnovrsnik, S. P. Kosir, G. Sersa, “Limb<br />

sparing treatment <strong>of</strong> bleeding melanoma recurrence by<br />

electrochemotherapy”, Tumori, 95(3), 398–402, 2009.<br />

[22] L. Campana, S. Mocellin, M. Basso, O. Puccetti, G. De Salvo, V.<br />

Chiarion-Sileni, A. Vecchiato, L. Corti, C. Rossi, D. Nitti,<br />

“Bleomycin-Based Electrochemotherapy: Clinical Outcome from<br />

a Single Institution’s Experience with 52 Patients”, Annals <strong>of</strong><br />

Surgical Oncology, 16(1), 191–199, 2009.<br />

[23] Cliniporator, Igea: http://www.igea.it (last visited October 2012).<br />

[24] B. Kos, A. Zupanic, T. Kotnik, M. Snoj, G. Sersa, D. Miklavcic,<br />

“Robustness <strong>of</strong> treatment planning for electrochemotherapy <strong>of</strong><br />

deep-seated tumors”, J. Membrane Biol., 236(1), 147–153, 2010.<br />

[25] I. Edhemovic, E. M. Gadzijev, E. Brecelj, D. Miklavcic, B. Kos,<br />

A. Zupanic, B. Mali, T. Jarm, D. Pavliha, M. Marcan, G.<br />

Gasljevic, V. Gorjup, M. Music, T. P. Vavpotic, M. Cemazar, M.<br />

Snoj, G. Sersa, “Electrochemotherapy: a new technological<br />

approach in treatment <strong>of</strong> metastases in the liver”, Technol. Cancer<br />

Res. Treat., vol. 10(5), 475–485, 2011.<br />

[26] Cedrat: http://www.cedrat.com/ (last visited October 2012).<br />

[27] I. M. V. Caminiti, F. Ferraioli, A. Formisano, R. Martone,<br />

“Adaptive Ablation Treatment Based on Impedance Imaging”,<br />

IEEE Tran, Magn., 46(8), 3329–3332, 2010.<br />

[28] I. M. V. Caminiti, F. Ferraioli, A. Formisano, R. Martone, “Three<br />

dimensional optimal current patterns for radi<strong>of</strong>requency ablation<br />

treatments”, COMPEL, 31(3), 985–995, 2012.<br />

[29] P. Neittaanmaki, M. Rudnicki, A. Savini Inverse problems and<br />

optimal design in electricity and magnetism, Oxford Science<br />

Pub., 1996.<br />

[30] P. Di Barba, F. Dughiero, E. Sieni, “Synthesizing Distributions <strong>of</strong><br />

Magnetic Nanoparticles for Clinical Hyperthermia”, IEEE Trans.<br />

Magn., 48(2), 263–266, 2012.


- 175 - 15th IGTE Symposium 2012<br />

Investigation <strong>of</strong> the Electroporation Effect<br />

in a Single Cell<br />

Jaime A. Ramirez ∗ , William P.D. Figueiredo ∗ , Joao Francisco C. Vale ∗ , Isabela D. Metzker ∗ , Rafael G. Santos ∗ ,<br />

Matheus S. de Mattos ∗ Elizabeth R.S. Camargos ∗ , and David A. Lowther †<br />

∗ Federal <strong>University</strong> <strong>of</strong> Minas Gerais, Belo Horizonte, Brazil<br />

† McGill <strong>University</strong>, Montreal, Canada<br />

E-mail: jramirez@ufmg.br<br />

Abstract—This paper investigates the electroporation phenomenon in a single cell exposed to ultra short (μs) and high voltage (kV/m)<br />

electric pulses. The problem is addressed by two complementary approaches. First, numerical simulations based on an asymptotic<br />

approximation derived from the Smoluchowski theory are used to calculate the pore generation, growth and size evolution at the<br />

membrane <strong>of</strong> a spherical cell model, immersed in a suspension medium and consisting <strong>of</strong> cytoplasm and membrane. The numerical<br />

calculations are solved using the finite difference method. Second, an in vitro experiment with LLC-MK2 cells is carried out in which<br />

electroporation was monitored with molecules <strong>of</strong> propidium iodide. This part also comprehended the design and manufacturing <strong>of</strong> a<br />

portable electric pulse generator capable <strong>of</strong> providing rectangular pulses with amplitude <strong>of</strong> 1,000V and duration in the range <strong>of</strong> 1-μs<br />

to 100-μs. The pulse generator is composed <strong>of</strong> three modules: a high voltage dc source, a control module, and an energy storage and<br />

high voltage switching. The numerical simulations considered a 5-μm radius cell submitted to a 500kV/m rectangular electric pulse<br />

for 1-μs. The results indicate the formation <strong>of</strong> ∼3,500 pores at the cell membrane, most <strong>of</strong> them, ∼950, located at poles <strong>of</strong> the cell<br />

aligned to the applied electric pulse, with radii sizes varying from 0.5-nm to 13-nm. The in vitro experiment considered expositon<br />

<strong>of</strong> LLC-MK2 cells to pulses <strong>of</strong> 200V, 500V, and 700V, and 1-μs. Images from fluorescence microscopy exhibit the LLC-MK2 cells<br />

with intense red, a strong evidence <strong>of</strong> the electroporation.<br />

Index Terms—Electroporation, electric fields, finite difference method.<br />

I. INTRODUCTION<br />

Electroporation is the process <strong>of</strong> applying pulsed electric<br />

fields to biological cells to induce the formation <strong>of</strong> transient<br />

“pores” in the cell membrane. Depending on the magnitude<br />

and duration <strong>of</strong> the electric pulse, the membrane may recover<br />

to its original state (the pores reseal) -areversible process;<br />

otherwise, the cell dies - an irreversible process. This phenomenon<br />

was first reported by [1] and is well discussed in the<br />

contributions [2]- [4].<br />

Earlier studies have focused on relatively low external fields,<br />

i.e. less than a kilovolt per centimeter, applied over time periods<br />

ranging from several tens <strong>of</strong> microseconds to milliseconds.<br />

Recently, the use <strong>of</strong> high electric fields (∼ 100kV/cm), or<br />

higher, with pulse durations in the nanosecond range has been<br />

employed and opened a new area <strong>of</strong> research in bioelectrics<br />

[5]. A controlled electroporation process can, therefore, be<br />

used to deliver substances to the cell cytoplasm in a wide range<br />

<strong>of</strong> applications, including gene therapy, drug delivery, nonthermal<br />

inactivation <strong>of</strong> micro-organisms and cancer treatment,<br />

see for instance [4].<br />

From the practical point <strong>of</strong> view, controlling the electroporation<br />

process involves two complementary challenges. First,<br />

a comprehensive simulation analysis is required. The time<br />

dependent electric field, induced at the cell membrane by<br />

the external pulse, need to be obtained. It is this field that<br />

provides the dynamic driving force for the physical process.<br />

In addition, the dynamical evolution <strong>of</strong> the pores at the<br />

cell membrane under the influence <strong>of</strong> this field need to be<br />

adequately treated. Second, a detailed experimental laboratory<br />

test is necessary to confirm the simulation. This involves the<br />

building <strong>of</strong> an electric pulse generator in which the pulse<br />

width and electric field strength are controlled. Moreover,<br />

appropriate microscopy techniques are required to confirm the<br />

pore formation at the cell membrane.<br />

In terms <strong>of</strong> numerical simulations, most works that consider<br />

the dynamics aspects <strong>of</strong> the electroporation phenomenon are<br />

based on the Smoluchowski theory. Krassowska et al. [6]- [9]<br />

employ an asymptotic approximation <strong>of</strong> the Smoluchowski<br />

theory in a single cell model to determine the formation <strong>of</strong><br />

pores in a spherical cell submitted to electric pulses <strong>of</strong> a<br />

few kV/m in the millisecond range. Schoenbach et al. [10]-<br />

[13], [5], use the full equations <strong>of</strong> Smoluchowski theory to<br />

establish the nucleation <strong>of</strong> pores in spherical cell models<br />

exposed to high intensity (thousands <strong>of</strong> kV/m) and ultra short<br />

(nano seconds) electric pulses. The approaches used by both<br />

groups yield acceptable results that are used in a wide range<br />

<strong>of</strong> applications, which in the first case are confirmed indirectly<br />

and in the second case are verified experimentally; however<br />

there is the need to address the electroporation phenomenon<br />

for electric pulses in the micro second range.<br />

This work investigates the electroporation phenomenon in<br />

a single cell when submitted to electric pulses <strong>of</strong> magnitude<br />

in the order <strong>of</strong> 1kV/mm and duration <strong>of</strong> 1-μs. The<br />

phenomenon is addressed by two complementary approaches,<br />

numerical simulations and an in vitro experiment. The Material<br />

and Methods is divided into three subsection. First, the<br />

mathematical modeling <strong>of</strong> electroporation describes how the<br />

numerical simulations can be used to assess the dynamics<br />

<strong>of</strong> the pore formation process, i.e. pore generation, growth<br />

and size-evolution at the cell membrane. This is based on


an asymptotic approximation based on the Smoluchowski<br />

theory and is solved using the finite difference method. The<br />

formulation is capable <strong>of</strong> providing the voltage induced across<br />

the cell membrane and important features for the practical<br />

application <strong>of</strong> electroporation, i.e. the number <strong>of</strong> pores and the<br />

distribution <strong>of</strong> pore radii as a functions <strong>of</strong> time and position<br />

on the cell membrane. A detailed description on how the<br />

numerical calculations are made is also given. Second, the<br />

electric pulse generator subsection discusses the theory used<br />

to design and build the generator. The first module is a high<br />

voltage d.c. source, the second is a control module and the<br />

third is responsible for the energy storage and high voltage<br />

switching. The generator is capable <strong>of</strong> providing retangular<br />

pulses with amplitude <strong>of</strong> 1,000V and duration in the range<br />

<strong>of</strong> 1μs to 100μs, with resting intervals <strong>of</strong> 10μs between<br />

the pulses. Third, the cell culture subsection describes the<br />

procedures used to prepare the LLC-MK2 cells for the in<br />

vitro experiment with molecules <strong>of</strong> propidium iodide. Finally,<br />

the Results presents the numerical analysis in a spherical cell<br />

<strong>of</strong> 5-μm radius, exposed to an electric pulse <strong>of</strong> 500-kV/m,<br />

duration <strong>of</strong> 1-μs; and the in vitro exposition <strong>of</strong> LLC-MK2 cells<br />

to electric pulses <strong>of</strong> <strong>of</strong> 200-kV/m, 500-kV/m, and 700-kV/m,<br />

duration <strong>of</strong> 1-μs, and fluorescence microscopy analysis.<br />

II. MATERIAL AND METHODS<br />

A. Mathematical Modeling <strong>of</strong> Electroporation<br />

For the mathematical description <strong>of</strong> the electroporation at<br />

the cell membrane, let us consider the model consisting <strong>of</strong><br />

a cell in a suspension medium within the parallel plates <strong>of</strong><br />

a cuvette, as indicated in Fig.1. For convenience, the cell<br />

is considered spherical and composed only by a membrane<br />

and cytoplasm, which are characterized by a conductivity and<br />

permittivity (σm, εm) and (σc, εc), respectively. The outer<br />

region, or suspension medium, is also characterized by its<br />

conductiviy and permittivity (σo, εo).<br />

<br />

<br />

<br />

<br />

σ , ε <br />

<br />

Fig. 1. Cell model - not to scale<br />

σ , ε <br />

<br />

<br />

<br />

<br />

σ, ε <br />

- 176 - 15th IGTE Symposium 2012<br />

The dynamics <strong>of</strong> the electroporation process, i.e. behavior<br />

<strong>of</strong> pore generation, growth and size-evolution at the cell membrane,<br />

can be calculated using the continuum Smoluchowski<br />

theory [6], [11], with the following governing equation for the<br />

pore density distribution function n (r, t),<br />

<br />

∂n ∂<br />

+ D −<br />

∂t ∂r<br />

n∂E<br />

<br />

1 ∂n<br />

− = S(r); (1)<br />

∂r kT ∂r<br />

where S(r) is the source, or pore formation term; D is a pore<br />

diffusion constant; r is the pore radius; T is the temperature;<br />

kB is the Boltzmann constant; and E(r) is the energy. This<br />

expression can be simplified by an asymptotic approximation<br />

based on [6]- [9]. This is discussed next.<br />

1) The formation <strong>of</strong> pores: It is assumed that the pores<br />

are hydrophilic and, thus, able to conduct current, and created<br />

with an initial radius r∗ at a rate determined by,<br />

<br />

dn<br />

= αe(Vm/Vep)2 1 −<br />

dt n<br />

<br />

(2)<br />

neqVm<br />

where n(t, θ) is the pore density and neq is the equilibrium<br />

pore density for a given transmembrane voltage Vm,<br />

neq(Vm) =n0e q(Vm/Vep)2<br />

2) The evolution <strong>of</strong> pore radii: The rate <strong>of</strong> change <strong>of</strong> the<br />

pore radii is given by<br />

drj<br />

dt = U(rj,Vm,σeff ),j =1, 2, ..., K (4)<br />

where U is the advection velocity given by<br />

U(r, Vm,σeff )= D<br />

<br />

kT<br />

V 2 mFmax<br />

1+rh/(r + rt) +4β<br />

r∗<br />

r<br />

<br />

4 1<br />

r<br />

+ D<br />

kT [−2πγ +2πσeff r]<br />

The last term represents the effective tension <strong>of</strong> the membrane,<br />

σeff , which is a function <strong>of</strong> Ap, the combined area <strong>of</strong> all pores<br />

existing on the cell,<br />

σeff (Ap) =2σ ′ − 2σ′ − σ0<br />

(6)<br />

1 − Ap/A<br />

where σ0 is the tension <strong>of</strong> the membrane without pores, σ ′ is<br />

the energy per area <strong>of</strong> the hydrocarbon-water interface, Ap =<br />

k j=1 πr2 j<br />

(3)<br />

(5)<br />

, and A is the surface area <strong>of</strong> the cell. In this work<br />

it is assumed that changes <strong>of</strong> cell shape, area, and volume,<br />

can be ignored for microsecond pulses.<br />

3) The voltage in the cell membrane: The voltage in the<br />

cell membrane Vm can be calculated as the difference between<br />

Vi and Ve, i.e. the difference between the potential at the<br />

interfaces <strong>of</strong> the cell membrane, as indicated in the next Fig.2.<br />

Vm(t, θ) =Vi(t, R2,θ) − Ve(t, R1,θ) (7)<br />

The potential Vi at the inner interface (cytoplasm) and Ve<br />

at the outer interface (outer region) <strong>of</strong> the cell membrane can<br />

be obtained by two systems defined by Laplace’s equations,<br />

∇ 2 Vi =0 and ∇ 2 Ve =0 (8)


σ , ε <br />

<br />

σ , ε <br />

<br />

<br />

<br />

σ , ε <br />

Fig. 2. Cell membrane interface - not to scale<br />

The applied electric pulse is imposed as a boundary condition<br />

on the outer region Ve,<br />

Ve(t, r, θ) =−Ercos(θ) as r →∞ (9)<br />

where r is the distance from the center <strong>of</strong> the cell and θ<br />

is the polar angle measured with respect to the direction <strong>of</strong><br />

the applied pulse E. In terms <strong>of</strong> numerical calculation, it is<br />

sufficient to set r ≥ 3R1, i.e. the outer region at least three<br />

times greater than the cell radius. This will be discussed in<br />

detail in the numerical calculation subsection.<br />

The other boundary conditions can be defined for t


TABLE I<br />

PARAMETERS FOR THE CELL NUMERICAL MODEL [9]<br />

Parameter Value<br />

σc(Sm −1 ) 3.0 × 10 −1<br />

σo(Sm −1 ) 3.0 × 10 −1<br />

σm(Sm −1 ) 3.0 × 10 −7<br />

εo(AsV −1 ) 6.4 × 10 −10<br />

εc(AsV −1 ) 6.4 × 10 −10<br />

εm(AsV −1 ) 4.4 × 10 −11<br />

R1(m) 5.0000 × 10 −6<br />

R2(m) 4.9995 × 10 −6<br />

CM (Fm −1 ) 8.8 × 10 −3<br />

n0(m −2 ) 1.5 × 10 9<br />

α(m −2 s −1 ) 1 × 10 9<br />

Vep(V ) 0.258<br />

r∗(m) 0.51 × 10 −9<br />

rm(m) 0.8 × 10 −9<br />

rh(m) 0.97 × 10 −9<br />

rt(m) 0.31 × 10 −9<br />

T (K) 310<br />

q 2.4606<br />

D(m 2 s −1 ) 5 × 10 −14<br />

γ(Jm −1 ) 1.8 × 10 −11<br />

β(J) 1.4 × 10 −19<br />

Fmax(NV −2 ) 0.7 × 10 −9<br />

σ ′ (Jm −2 ) 2 × 10 −2<br />

σ0(Jm −2 ) 1 × 10 −6<br />

Vrest(V ) −0.08<br />

It is capable <strong>of</strong> providing rectangular pulses with amplitude<br />

<strong>of</strong> 1,000V and duration in the range <strong>of</strong> 1μs to100μs with<br />

resting intervals <strong>of</strong> 10μs between the pulses. The modules are<br />

discussed in detail next.<br />

Fig. 4. Pulse generator modules.<br />

1) The high voltage d.c. source: The high voltage source<br />

module is based on [17] and uses a six capacitor doubler<br />

stage, as indicated in Fig.5. It is capable <strong>of</strong> converting a.c.<br />

voltage into d.c. voltage up to 2kV. It uses MUR1560 diodes<br />

and 330μF capacitors. A 1:1 transformer is used to provide<br />

insulation between the electrical grid and the module; in<br />

addition, a variable autotransformer is used to control the a.c.<br />

input and the d.c. output.<br />

2) The control module: The control circuit uses a<br />

PIC18F4550 microcontroller that operates at 12 Mips. The<br />

control driver is based in a push-pull amplifier and composed<br />

<strong>of</strong> a pair <strong>of</strong> Mosfets models ZVN2106a and ZVP2106a, both<br />

- 178 - 15th IGTE Symposium 2012<br />

Fig. 5. High voltage dc source.<br />

capable <strong>of</strong> working with voltages up to 60V and currents<br />

up to 500mA, as indicated in Fig.6. The control driver is<br />

responsible for adjusting the control signal produced by the<br />

microcontroller to the ratings required to fast charge the<br />

capacitances <strong>of</strong> the switching circuit, which can be achieved<br />

by increasing the output voltage and the maximum current.<br />

Fig. 6. Driver circuit.<br />

3) High voltage storage and switching: The high voltage<br />

switching is based on a IGBT (IRGPS60B120KD) that operates<br />

with voltages up to 1,2kV and current pulses up to 240A.<br />

It has a 45ns rise time, 58ns fall time and 400ns turn <strong>of</strong>f<br />

delay that provides pulses with amplitude and width in the<br />

range required in this work. The IGBT is connected in series<br />

to a 73μF capacitor and the load, as indicated in Fig.7. The<br />

capacitor charges through a 1kΩ resistor until it reaches the<br />

same voltage <strong>of</strong> the high voltage source. When the IGBT is<br />

activated, it inverts the voltage on the capacitor and a negative<br />

voltage, that equals the one that charged the capacitor, appears<br />

on the load.<br />

Fig. 7. Switching circuit.


4) The pulse generator: The pulse generator built is shown<br />

in Fig.8. For safety reasons, the high voltage source and the<br />

high voltage storage and switching circuit are inside an acrylic<br />

box; the control module is outside the box.<br />

Fig. 8. Pulse generator.<br />

C. In vitro experiment<br />

1) Cell culture procedures: LLC-MK2 (monkey kidney<br />

epithelial cell line) were maintained in DMEM (Dulbecco’s<br />

modified Eagle’s medium - Invitrogen) with 10% fetal bovine<br />

serum (FBS, Invitrogen), 1% penicillin-streptomycin (Invitrogen)<br />

and 1% glutamine (Sigma-Aldrich) at 37C under 5%<br />

CO2 atmosphere. Cell culture was kept at an optimal density<br />

through weekly passages. Briefly, LLC-MK2 cells were seeded<br />

until 70-90% confluent cell monolayer. To perform subculture,<br />

the cell culture medium was removed and the cells were<br />

rinsed twice with PBS -/-. Trypsin was used to remove<br />

adherent cells (0,5ml/25cm2 surface area). The cells were then<br />

resuspended in 4,5ml <strong>of</strong> fresh serum-containing medium for<br />

trypsin inactivation. Cell viability was assessed directly by<br />

Trypan Blue staining. Cultures were split 1:10 and placed in<br />

a new flask with DMEM 10%.<br />

2) Exposure to electroporation: LLC-MK2 cells were collected<br />

from culture media and suspended in HBS (Hepes<br />

Buffered Saline) at a concentration <strong>of</strong> 2, 6 × 106 cells/ml<br />

in rectangular electroporation cuvettes with 1-mm electrode<br />

separation. All the manipulations were done in sterile condition<br />

in a vertical laminar flow cabinet Veco. Electroporation<br />

was monitored with molecules <strong>of</strong> propidium iodide, 1,5-nm<br />

× 2-nm, (25 μg/ml, Sigma-Aldrich), a fluorochrome that is<br />

excluded from cells with intact membrane.<br />

3) Fluorescence microscopy: Aliquots <strong>of</strong> control and<br />

pulsed cells were placed into glass slides and observed in<br />

Axioplan 2 Zeiss fluorescence microscope (UV emission 630<br />

nm).<br />

III. RESULTS AND DISCUSSION<br />

A. Numerical simulations<br />

The simulations considered a 5-μm radius cell submitted to<br />

a 500kV/m rectangular electric pulse for 1-μs. The results for<br />

the total number <strong>of</strong> pores, number <strong>of</strong> pores at the polarized<br />

poles and the maximum radii <strong>of</strong> the pores are indicated at<br />

- 179 - 15th IGTE Symposium 2012<br />

Figs.9-11. It can be seen from Figs.9-11 that the pore nucleation<br />

starts at approximately 0.8μs, when Vm is approximately<br />

1.25V, an reaches ∼3,500 pores at the cell membrane, most <strong>of</strong><br />

them, ∼950, located at poles <strong>of</strong> the cell aligned to the applied<br />

electric pulse (θ =0 ◦ and θ = 180 ◦ ), with radii sizes varying<br />

from 0.5-nm to 13-nm. After the initial stage, the number <strong>of</strong><br />

pores increases until around 1-μs, when the pulse ends. In the<br />

final stage, the radii <strong>of</strong> the pores decrease very fast but the<br />

number <strong>of</strong> pores stays stable for a longer period. This is in<br />

agreement with the literature, large pores tend to decay faster<br />

but the complete resealing <strong>of</strong> all pores take a longer period.<br />

Fig. 9. Total number <strong>of</strong> pores.<br />

Fig. 10. Number <strong>of</strong> pores in the 1st sector (θ =0 ◦ ) and in the 180th sector<br />

(θ = 180 ◦ ).<br />

B. Experiments<br />

A series <strong>of</strong> in vitro experiments were carried out with the<br />

LLC-MK2 cells following the methodology described in the<br />

previous section. After exposure to 1-μs pulse, the LLC-MK2<br />

cells exhibited intense red fluorescence mostly in the cell<br />

nuclei where propidium iodide binds to double-stranded DNA,<br />

as illustrated in Fig.13. This is a strong evidence that pores<br />

with radii <strong>of</strong> at least 2-nm (the size <strong>of</strong> the propidium iodide<br />

molecule) were created at the cell membrane. The numerical


Fig. 11. Maximum radii evolution.<br />

calculation predicts the creation <strong>of</strong> most pores at poles <strong>of</strong><br />

the cell (θ=180 and θ=0) with radii sizes varying from 0.5nm<br />

to 13-nm; the fluoresce microscopy can only partially<br />

confirm this. Further tests with other microscopy techniques<br />

are required to confirm the region in the cell membrane where<br />

the pores are created, their sizes and how long they last.<br />

Fig. 12. Pulse <strong>of</strong> 500V × 1μs measured at the cuvette.<br />

Fig. 13. Propidium iodide influx into LLC-MK2 cell exposed to pulses <strong>of</strong><br />

1-μs, 200V (a), 500V (b) and 700V (c).<br />

IV. CONCLUSIONS<br />

This paper considered the study <strong>of</strong> the electroporation<br />

phenomenon in a single cell, using numerical simulations and<br />

- 180 - 15th IGTE Symposium 2012<br />

an in vitro experiment. The numerical simulations considered<br />

a5-μm radius cell submitted to a 500kV/m rectangular electric<br />

pulse for 1-μs, which was addressed using an asymptotic<br />

approximation based on the Smoluchowski theory and the<br />

finite difference method. The results indicate the formation<br />

<strong>of</strong> ∼3,500 pores at the cell membrane, most <strong>of</strong> them, ∼950,<br />

located at poles <strong>of</strong> the cell aligned to the applied electric pulse,<br />

with radii sizes varying from 0.5-nm to 13-nm. The in vitro<br />

experiment considered expositon <strong>of</strong> LLC-MK2 cells to electric<br />

pulses <strong>of</strong> 200kV/m, 500kV/m, and 700kV/m, and 1-μs. Images<br />

from fluorescence microscopy confirm the electroporation at<br />

the LLC-MK2 cells. The methodology employed is adequate<br />

to investigate the electroporation phenomenon in simple cells<br />

exposed to electric pulses <strong>of</strong> kV/m and in the μs range.<br />

V. ACKNOWLEDGMENT<br />

This work was supported by CNPq, Brazil, under<br />

Grants 306910/2006-3, 482185/2010-4, 507810/2010-4,<br />

504978/2010-1; by FAPEMIG, Brazil, under Grants Pronex:<br />

TEC 01075/09 and TEC-PPM-489/10; by CAPES, Brazil and<br />

DFAIT, Canada.<br />

REFERENCES<br />

[1] R. Stampfli, “Reversible electrical breakdown <strong>of</strong> the excitable membrane<br />

<strong>of</strong> a Ranvier node,” An. Acad. Brasil. Ciens., vol.30, pp.57-63, 1958.<br />

[2] T.Y. Tsong, “Electroporation <strong>of</strong> cell membrane,” Biophy. J., vol.60,<br />

pp.297-306, 1991.<br />

[3] J.C. Weaver and Y.A. Chizmadzhev, “Theory <strong>of</strong> electroporation: a review,”<br />

Bioelectroch. Bioenergetics, vol.41, pp.135-160, 1996.<br />

[4] T. Kotnik, P. Kramar, G. Pucihar, D. Miklavcic, M. Tarek, “Cell<br />

membrane electroporation. Part 1: The phenomenon,” IEEE Elec. Ins.<br />

Magazine, vol.28, pp.14-23, 2012.<br />

[5] R.P. Joshi and K.H. Schoenbach, “Bioelectric effects <strong>of</strong> intense ultrashort<br />

pulses,” Critic. Rev. in Biom. Eng., vol.38, pp.255-304, 2010.<br />

[6] J.C. Neu and W. Krassowska, “Asymptotic model <strong>of</strong> electroporation,”<br />

Phys. Rev. E, 59:3471-3482, 1999.<br />

[7] K.A. DeBruin and W. Krassowska, “Modeling electroporation in a single<br />

cell. I: Effects <strong>of</strong> field strength and rest potential,” Biophy. J., vol.77,<br />

pp.1213-1224, 1999.<br />

[8] K.C. Smith, J.C. Neu, W. Krassowska, “Model <strong>of</strong> creation and evolution<br />

<strong>of</strong> stable electropores for DNA delivery,” Biophy. J., vol.86, pp.2813-<br />

2826, 2004.<br />

[9] W. Krassowska and P.D. Filev, “Modeling electroporation in a single cell,”<br />

Biophy. J., vol.92, pp.404-417, 2007.<br />

[10] R.P. Joshi and K.H. Schoenbach, “Electroporation dynamics in biological<br />

cells subjected to ultrafast electrical pulses: a numerical simulation<br />

study,” Phys. Review E, vol.62, pp.1025-1033, 2000.<br />

[11] R.P. Joshi, Q. Hu, K.H. Schoenbach, “Dynamical modeling <strong>of</strong> cellular<br />

response to short duration, high intensity electric fields,” IEEE Trans. on<br />

Dielec. Elec. Ins., vol.10, pp.778-787, 2003.<br />

[12] K.H. Schoenbach, R.P. Joshi, J.F. Kolb, N. Chen, M. Stacey, P.F.<br />

Blackmore, E.S. Buescher, S.J. Beebe, “Ultrashort electrical pulses open a<br />

new gateway into biological cells,” <strong>Proceedings</strong> <strong>of</strong> IEEE, vol.92, pp.1122-<br />

1137, 2004.<br />

[13] Q. Hu, R.P. Joshi, K.H. Schoenbach, “Simulations <strong>of</strong> nanopore formation<br />

and phosphatidylserine externalization in lipid membrane subjected to a<br />

high intensity, ultrashort electric pulse,” Phys. Review E, vol.72, 031902,<br />

2005.<br />

[14] J. Mankowski and M. Kristiansen, “A review <strong>of</strong> short pulse generator<br />

technology,” IEEE Trans. on Plasma Science, vol.28, pp.102-108, 2000.<br />

[15] A. Chaney and R. Sundararajan, “Simple mosfet-based high-voltage<br />

nanosecond pulse circuit,” IEEE Trans. on Plasma Science, vol.32,<br />

pp.1919-1924, 2004.<br />

[16] J. R. Grenier and M. Kazerani, “Mosfet-based pulse power supply for<br />

bacterial transformation,” IEEE Trans. on Industry Application, vol.44,<br />

pp.25-31, 2008.<br />

[17] E. Kuffel, W.S. Zaengl, J. Kuffel. High Voltage Engineering Fundamentals.<br />

Second Edition, Butterworth Heinemann, Oxford, UK, 2000.


- 181 - 15th IGTE Symposium 2012<br />

Anisotropic Model for the Numerical<br />

Computation <strong>of</strong> Magnetostriction in<br />

Grain-Oriented Electrical Steel Sheets<br />

M. Kaltenbacher∗ ,A.Volk † , and M. Ertl ‡<br />

∗Institute <strong>of</strong> Mechanics and Mechatronics, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />

† Department <strong>of</strong> Sensor <strong>Technology</strong>, <strong>University</strong> <strong>of</strong> Erlangen-Nuremberg, Germany<br />

‡ Siemens Energy Sector, Nuremberg, Germany<br />

E-mail: manfred.kaltenbacher@tuwien.ac.at<br />

Abstract—We present a recently developed physical model for magnetostriction in transformer cores and its efficient<br />

numerical computation by applying the Finite Element (FE) method. Thereby, we fully take the anisotropic behavior <strong>of</strong> the<br />

material into account, both in the computation <strong>of</strong> the nonlinear electromagnetic field as well as the induced magnetostrictive<br />

strains. Numerical computations demonstrate the importance <strong>of</strong> modeling the anisotropy <strong>of</strong> grain oriented electrical steel<br />

sheets as used in electric transformers. Both the magnetic field along the joint regions, and furthermore the mechanical<br />

vibrations especially in thickness direction differ strongly as compared to computations with an isotropic material model.<br />

Index Terms—magnetostriction, finite element method, anisotropic material behavior, nonlinearity<br />

I. INTRODUCTION<br />

Magnetostrictive materials are widely used for actuator<br />

and sensor applications. However, <strong>of</strong>ten the magnetostrictive<br />

behavior <strong>of</strong> these alloys is an undesirable effect, as<br />

e.g. in electric machines and transformers, where it is one<br />

<strong>of</strong> the main sources for noise generation. Unfortunately,<br />

these materials exhibit nonlinear behavior for the magnetic<br />

properties as well as the mechanical characteristics<br />

leading to the well-known magnetic hysteresis loop and<br />

the magnetostrictive hysteresis loop (so-called butterfly<br />

curve), respectively (see, e.g., [1], [2], [3]). A quite<br />

important aspect – especially for grain-oriented electrical<br />

steel as used in transformers – is the anisotropic material<br />

behavior both concerning the magnetic properties as well<br />

as the induced mechanical strains [4].<br />

The modeling <strong>of</strong> magnetostrictive effects is a topic <strong>of</strong><br />

intensive research. Among the huge amount <strong>of</strong> publications<br />

one can find three main approaches. The first one,<br />

which is widely used, is based on introducing a magnetostrictive<br />

strain tensor, where the entries depend on the<br />

magnetic induction (see, e.g., [5], [1]). Thereby, these<br />

additional strains result in mechanical forces modeled as<br />

a right hand side term in the partial differential equation<br />

(PDE) for mechanics. In a second approach, a free<br />

energy as a tensor function depending on the mechanical<br />

strain and magnetic induction is used (see, e.g., [6], [7]).<br />

Thereby, a fully coupled constitutive relation between<br />

mechanical and magnetic quantities is achieved. The last<br />

approach is based on a thermodynamic consistent model,<br />

where the mechanical strain and magnetic induction is<br />

decomposed in a reversible and an irreversible part [2],<br />

[3]. Furthermore, the full constitutive model is based on a<br />

free energy function. Whereas in [2] the irreversible part<br />

is modeled by a switching criterion using inner variables,<br />

[3] uses hysteresis operators. Common to all models<br />

is the current restriction to isotropic and / or uniaxial<br />

behavior.<br />

Our goal is the precise investigation <strong>of</strong> the magnetic<br />

field and resulting mechanical vibrations caused by magnetostriction<br />

along the joint regions <strong>of</strong> electric transformers.<br />

Therefore, we cannot apply any homogenization<br />

technique and fully resolve each individual steel sheet.<br />

This is clearly not possible for a whole transformer<br />

core, and so we restrict our investigation to some few<br />

steel sheets. To reduce the complexity, we choose an<br />

ansatz, in which we neglect the reaction <strong>of</strong> the mechanical<br />

stresses and strains on the magnetic properties and<br />

therefore decouple the computation <strong>of</strong> the magnetic and<br />

mechanical field. By help <strong>of</strong> an Epstein frame and a SST<br />

(Single Sheet Tester), we measure the magnetic as well<br />

as the mechanical hysteresis curves <strong>of</strong> the grain-oriented<br />

electrical steel sheets with different orientations (w.r.t the<br />

rolling direction). From these curves we then extract for<br />

each orientation the corresponding commutation curve,<br />

so that the hysteretic behavior is simplified to a nonlinear<br />

one. This approach is then applied to a stack <strong>of</strong> six<br />

electrical steel sheets with a 90 o joint region, excited<br />

by two current loaded coils. We compare this anisotropic<br />

model to an isotopic one, where the nonlinear magnetic<br />

and mechanical material parameter are just used from the<br />

rolling direction.<br />

The rest <strong>of</strong> the paper is organized as follows. In Sec.<br />

II we describe our physical model and its in-cooperation<br />

into the magnetic and mechanical PDE as well as their Finite<br />

Element (FE) formulation. The measurement setups,<br />

which provide us the nonlinear curves, are discussed in<br />

Sec. III. In Sec. IV the numerical results are presented,<br />

demonstrating the importance <strong>of</strong> taking anisotropy for<br />

grain-oriented electrical steel sheets as used in transformers<br />

into account. Finally, Sec. V summarizes our<br />

achievements.


II. PHYSICAL MODELING AND FE DISCRETIZATION<br />

Magnetostrictive materials are characterized by the<br />

magnetic hysteresis between the magnetic induction B<br />

and magnetic field intensity H as well as the mechanical<br />

hysteresis between the mechanical strain S and magnetic<br />

induction B (see Fig. 1).<br />

Fig. 1. Magnetic and mechanical hysteresis (butterfly curve).<br />

According to a thermodynamically consistent model,<br />

we decompose the physical quantities magnetic induction<br />

and mechanical strain into a reversible and an irreversible<br />

part1 (indicated by the superscripts r and i, respectively)<br />

S = S r + S i , B = B r + B i . (1)<br />

To allow for the history <strong>of</strong> the driving magnetic field<br />

intensity, the irreversible magnetic induction Bi is set to<br />

be equal to the magnetization M, which is modeled, e.g.,<br />

by a Preisach hysteresis operator [3]<br />

B i = M = H[H] eM . (2)<br />

The irreversible strain can be, e.g., expressed by the<br />

following polynomial ansatz [3]<br />

i<br />

S = 3<br />

2 (β1 ·H[H]+β2 · (H[H]) 2 + ···<br />

+βn · (H[H]) n <br />

) eM ⊗ e t M − 1<br />

3 [I]<br />

<br />

, (3)<br />

while the parameters β1, ···,βn need to be fitted to<br />

measurement data and [I] denotes the identity tensor.<br />

Now, magnetostriction is a property <strong>of</strong> ferromagnetic<br />

materials and can be described as a coupling between the<br />

mechanical and the magnetic field. This relation is described<br />

by the well-known magnetostrictive constitutive<br />

equations modeling the linear coupling <strong>of</strong> the magnetic<br />

and the mechanical deformation [2]<br />

σ = [c H ]S r − [e] t H (4)<br />

B r = [e]S r +[μ S ]H . (5)<br />

In (4), (5) σ denotes the Cauchy stress tensor in Voigt notation,<br />

[cH ] the tensor <strong>of</strong> mechanical moduli (at constant<br />

magnetic field intensity), [e] the piezomagnetic coupling<br />

tensor and [μS ] the tensor <strong>of</strong> magnetic permeability (at<br />

constant mechanical strain).<br />

By using these constitutive relations, we have presented<br />

in [3] a formulation based on the magnetic<br />

1 With [S] we denote the tensor <strong>of</strong> mechanical strain and with S<br />

the algebraic vector containing the three normal and three shear strains<br />

according to Voigt notation.<br />

- 182 - 15th IGTE Symposium 2012<br />

scalar potential, and in [8] have even extended it for<br />

the magnetic vector potential to also take eddy current<br />

effects into account. However, both models are currently<br />

restricted to scalar hysteresis operators, and do not take<br />

into account the anisotropic behavior, which is a crucial<br />

point for grain-oriented electrical steel used in transformer<br />

cores [4]. Furthermore, both models are quite<br />

expensive concerning computational time. Therefore, we<br />

have developed a dedicated physical model for grainoriented<br />

electrical steel. In doing so, we first assume that<br />

the entries <strong>of</strong> the piezomagnetic coupling tensor [e] are<br />

small, and we are allowed to neglect this coupling in (4),<br />

(5). Next the anisotropic and nonlinear magnetic behavior<br />

<strong>of</strong> the steel sheets is modeled by its vector relation<br />

between the magnetic induction B and field intensity H<br />

B = B (H) =Bϕ(H)eB ; eB = B<br />

. (6)<br />

B<br />

Here, we compute the unit vector eB and evaluate the<br />

magnetic commutation curve Bϕ for which the orientation<br />

fits best with eB. Therefore, the defining partial<br />

differential equation (PDE) for the magnetic field reads<br />

as<br />

γ ∂A<br />

∂t −∇×ν(Bϕ)∇×A = Ji (7)<br />

with A the magnetic vector potential, Ji the impressed<br />

current density, ν the magnetic reluctivity depending on<br />

Bϕ (see (6)) and γ the electric conductivity.<br />

The PDE for mechanics is given by<br />

ρ ∂2u ∂t2 −Btσ =0 (8)<br />

with u the mechanical displacement, ρ the density, and<br />

B = ∇s the differential operator. As in (3), we assume the<br />

conservation <strong>of</strong> volume for the irreversible strain. However,<br />

we now model instead <strong>of</strong> the hysteretic behavior<br />

a nonlinear, anisotropic behavior, and denote it by [Sm ]<br />

(magnetostrictive induced strain tensor), which computes<br />

as follows<br />

[S m ]= 3<br />

2<br />

<br />

eB × e t B − 1<br />

3 I<br />

<br />

S m ϕ (B) . (9)<br />

Here, we compute the direction <strong>of</strong> B and evaluate the<br />

magnetostrictive commutation curve S m ϕ (B) for which<br />

the orientation fits best with eB. Now, we can express<br />

the reversible mechanical strain S r by the difference <strong>of</strong><br />

the total strain S = Bu and the irreversible (magnetostrictive)<br />

strain S i = S m via<br />

S r = Bu − S m . (10)<br />

This relation in combination with (4) by neglecting [e]<br />

results for (8) into<br />

ρ ∂2 u<br />

∂t 2 −Bt [c H ]Bu = −B t [c H ]S m . (11)<br />

The Finite Element (FE) formulation <strong>of</strong> (7) and (11)<br />

is straight forward. For (7) we use edge finite elements<br />

and solve the arising algebraic system <strong>of</strong> equations by<br />

an efficient Newton scheme utilizing a two level solver


[9]. For (11) we apply nodal finite elements (for details,<br />

see e.g., [10]).<br />

Summarizing, the developed magnetostrictive model<br />

has the following features:<br />

• Decoupling <strong>of</strong> magnetic and mechanical PDEs; so<br />

both PDEs can be solved separately with optimal<br />

conditions.<br />

• Anisotropy and eddy currents are taken into account<br />

• No hysteresis considered; instead it uses commutation<br />

curves computed from measured hysteresis<br />

curves.<br />

• Change <strong>of</strong> magnetic properties due to the mechanical<br />

field within a working point is neglected<br />

(working point can be determined by pre-stressing<br />

<strong>of</strong> measured samples).<br />

III. MEASUREMENT SETUPS<br />

First <strong>of</strong> all, to obtain reliable measurement data for the<br />

magnetic behavior, we have constructed an Epstein frame<br />

according to IEC 60404-2 (see Fig. 2). The 25 cm Epstein<br />

Steel sheets (overlap at the corners)<br />

Coil to compensate flux in air<br />

Excitation and<br />

measurement<br />

coils<br />

Fig. 2. 25 cm Epstein frame for measuring the magnetic properties<br />

<strong>of</strong> grain-oriented electrical steel sheets.<br />

apparatus consists <strong>of</strong> 4 coils with primary windings,<br />

secondary windings, a compensation coil and the material<br />

sample as core. The sheets are stratified in stripes. The<br />

measurement setup represents in this way a transducer,<br />

whose characteristics are specified. The primary outer<br />

windings are used to magnetize the material and the<br />

secondary inner windings are needed for magnetic flux<br />

density determination over the induced voltage. We have<br />

performed measurements for steel sheets, which have<br />

been cut out at different angles according to the rolling<br />

direction. Thereby, for each stack <strong>of</strong> steel sheets, we<br />

have measured the outer and all inner hysteresis loops, as<br />

demonstrated in Fig. 3. Out <strong>of</strong> all the hysteresis loops,<br />

we compute for each angle a commutation curve (see<br />

Fig. 4), which we then use in our numerical computation<br />

for the magnetic field.<br />

To measure the mechanical hysteresis <strong>of</strong> the electrical<br />

steel sheets a second measurement setup was constructed<br />

on the basis <strong>of</strong> a Single Sheet Tester (SST) as displayed<br />

in Fig. 5. This extended setup also captures<br />

- 183 - 15th IGTE Symposium 2012<br />

B (T)<br />

H (kA/m)<br />

Angle 0 o<br />

Angle 15 o<br />

Angle 30 o<br />

Angle 45 o<br />

Angle 60 o<br />

Angle 75 o<br />

Angle 90 o<br />

Fig. 3. Magnetic hysteresis curves for grain-oriented electrical steel<br />

sheets being cut out at different angles according to the rolling direction<br />

(0 o corresponds to the rolling direction).<br />

B ( (T) )<br />

H (kA/m)<br />

Angle 0 o<br />

Angle 15 o<br />

Angle 30 o<br />

Angle 45 o<br />

Angle 60 o<br />

Angle 75 o<br />

Angle 90 o<br />

Fig. 4. Nonlinear BH curves for different angles (0 o corresponds to<br />

the rolling direction).<br />

the magnetic induction as well as the magnetic field<br />

intensity. However we did not use the SST to determine<br />

magnetic hysteresis since it has to be calibrated to a<br />

certified setup to obtain reliable measurement results. To<br />

capture the mechanical hysteresis the SST was extended<br />

by a lifting mechanism to unload the sample sheet to<br />

ensure its stress-less vibration. The mechanical vibration<br />

due to magnetic excitation <strong>of</strong> the SST is measured by<br />

Ferrite core<br />

Excitation n and<br />

Steel sheet measurement ment coil<br />

(material under test)<br />

Laser-<br />

vibrometer<br />

Fig. 5. Single sheet tester as used to obtain the mechanical hysteresis<br />

(principle and manufactured setup).


a laser vibrometer, which compared to strain gauges<br />

provides high accuracy without electromagnetic crosssensitivity<br />

and is contact-free. The measurement <strong>of</strong> the<br />

mechanical strain as a function <strong>of</strong> the magnetic field<br />

results in the magnetostrictive hysteresis loop (so-called<br />

butterfly curve). Additionally the extended SST permits<br />

pre-stressing <strong>of</strong> the steel sheets in order to capture the<br />

reaction on the magnetic properties which corresponds to<br />

a working point that is used in the simulation. To consider<br />

anisotropy, again a series <strong>of</strong> measurement is performed<br />

with different electrical steel sheets which have been cut<br />

out with varying cutting angles with respect to the grain<br />

orientation <strong>of</strong> the steel. As for the magnetic hysteresis, we<br />

also convert the mechanical hysteresis in a single commutation<br />

curve, which leads to angle-dependent nonlinear<br />

magnetostriction curves, as displayed in Fig. 6.<br />

S (μm/m)<br />

Angle 0 o<br />

Angle 15 o<br />

Angle 30 o<br />

Angle 45 o<br />

Angle 60 o<br />

Angle 75 o<br />

Angle 90 o<br />

B (T)<br />

Fig. 6. Nonlinear SB curves for different angles (0 o corresponds to<br />

the rolling direction).<br />

IV. NUMERICAL RESULTS<br />

For the numerical investigation, we choose a setup <strong>of</strong><br />

six stacked electrical steel sheets with a 90 degree joint<br />

and an excitation coil along each yoke as displayed in<br />

Fig. 7. We model just a quarter symmetry by applying<br />

Steel sheets<br />

Excitation coils<br />

Zoomed and scaled<br />

in thickness direction<br />

Fig. 7. Computational model: quarter symmetry is considered.<br />

appropriate boundary conditions at the symmetry planes.<br />

Our main goal is to study the difference between an<br />

isotropic and anisotropic magnetostrictive computation.<br />

- 184 - 15th IGTE Symposium 2012<br />

Thereby, we choose for the isotropic computation the<br />

measured material curves along the rolling direction<br />

(angle <strong>of</strong> zero degree), whereas for the anisotropic computation<br />

we use all measured material curves (see Fig. 4<br />

and 6).<br />

In a first step, we compute the magnetic field and<br />

compare the flux lines at the joints. Figure 8 displays the<br />

flux lines for the isotropic and Fig. 9 for the anisotropic<br />

case at the time step <strong>of</strong> maximal magnetic induction<br />

(about 1.7 T). We display the flux lines just for the two<br />

Fig. 8. Magnetic flux lines for the two upper steel sheets in case<br />

<strong>of</strong> isotropic computation. For better visualization we have scaled the<br />

thickness direction by a factor <strong>of</strong> ten.<br />

last layers and zoom into the joint region. Comparing<br />

the results, one can clearly see the difference. For the<br />

Fig. 9. Magnetic flux lines for the two upper steel sheets in case<br />

<strong>of</strong> anisotropic computation. For better visualization we have scaled the<br />

thickness direction by a factor <strong>of</strong> ten.<br />

isotropic case, the amplitude <strong>of</strong> the magnetic induction<br />

immediately drops to a low one at the beginning <strong>of</strong> the<br />

joint due to the increased effective cross section when<br />

turning the flux direction in the rectangular joint region.<br />

Since the magnetic material properties are homogeneous<br />

and independent <strong>of</strong> direction, the change <strong>of</strong> the magnetic<br />

flux direction itself is continuously across the joint region.<br />

The transition <strong>of</strong> the magnetic flux between the<br />

two vertical stacked steel sheets is mainly limited when<br />

entering and leaving the joint region. Accordingly the<br />

magnetic flux density reaches its full value just at the<br />

end <strong>of</strong> the joint when entering the opposite yoke.<br />

In the anisotropic case, the guiding effect <strong>of</strong> the<br />

preferred magnetic direction in the grain orientation <strong>of</strong><br />

the electrical sheet keeps the amplitude and direction <strong>of</strong><br />

the magnetic flux for some distance in the joint region.


In the area <strong>of</strong> the central diagonal <strong>of</strong> the joint region (at<br />

45 o ), the reduced magnetic permeability perpendicular<br />

to the grain orientation forces the magnetic flux to a<br />

vertical transition into neighbouring steel sheets. This<br />

x-Displacement (nm)<br />

y-Displacement (nm)<br />

10<br />

0<br />

-25<br />

20<br />

0<br />

-50<br />

0 20 40 60<br />

Time (ms)<br />

0 20 40 60<br />

Time (ms)<br />

z-Displacement (nm)<br />

2.5<br />

0<br />

-2.5<br />

Evaluation point<br />

0 20 40 60<br />

Time (ms)<br />

Isotropic case<br />

Anisotropic case<br />

Fig. 10. Mechanical displacement at an observation point along the<br />

yoke.<br />

behavior <strong>of</strong> the magnetic field has a strong impact on the<br />

mechanical vibrations. In a second step we use the computed<br />

magnetic induction and calculate the mechanical<br />

deformation according to the additional magnetostrictive<br />

strain. In Figs. 10 and 11 we display all three components<br />

<strong>of</strong> the mechanical displacement over time at<br />

two different observation points. In general, we observe<br />

that the displacement in plane direction (x− and y−<br />

displacement) show almost no difference. However, the<br />

displacement in thickness direction (z− displacement) is<br />

quite different both concerning amplitude and frequency<br />

content. Especially at the joint region the amplitude <strong>of</strong> the<br />

mechanical vibration is a factor <strong>of</strong> about 1000 larger in<br />

the anisotropic case as in the isotropic case. Furthermore,<br />

we can state that the computation for the isotropic<br />

material model exhibits mainly the 100 Hz component<br />

(current excitation is at 50 Hz). In the anisotropic case<br />

higher harmonics are predominant. The related frequency<br />

spectrum in vibration and noise is typical what can be<br />

measured at real transformers.<br />

V. CONCLUSION AND OUTLOOK<br />

We have presented a magnetostrictive constitutive<br />

model which fully takes the anisotropy <strong>of</strong> grain-oriented<br />

electrical steel sheets as used in electrical transformers<br />

into account. The model itself is simplified in this<br />

sense that the magnetic as well as mechanical hysteretic<br />

behavior is reduced to a nonlinear one by computing<br />

commutation curves out <strong>of</strong> the corresponding hysteresis<br />

measurements. Furthermore, we neglect the impact <strong>of</strong><br />

the mechanical field on the magnetic properties within a<br />

working point, which can be determined by pre-stressing<br />

the measured sample sheets. However, the model needs<br />

measurements provided by an Epstein frame and a SST<br />

- 185 - 15th IGTE Symposium 2012<br />

x-Displacement (nm)<br />

y- Displacement (nm)<br />

0<br />

-45<br />

0<br />

0 20 40 60<br />

Time (ms)<br />

-45<br />

0 20 40 60<br />

Time (ms)<br />

z - Displacement (nm)<br />

8<br />

0<br />

-6<br />

Evaluation point<br />

Scaled by 1000<br />

0 20 40 60<br />

Time (ms)<br />

Isotropic case<br />

Anisotropic case<br />

Fig. 11. Mechanical displacement at an observation point at the joint<br />

region.<br />

(Single Sheet Tester). The computations show strong differences<br />

both in the magnetic field as well as mechanical<br />

vibrations when comparing this anisotropic model to an<br />

isotropic one, which just uses measured curves in rolling<br />

direction <strong>of</strong> the steel sheets.<br />

Currently we are working on an experimental validation<br />

setup, where we can study different joint techniques,<br />

especially step-lap joints.<br />

REFERENCES<br />

[1] L. Vandevelde, J. A. Melkebeek. Modeling <strong>of</strong> Magnetoelastic<br />

Material. IEEE Trans. on Magnetics, 38(2), 2002.<br />

[2] K.Linnemann, S. Klinkel, W. Wagner. A constitutive model for<br />

magnetostrictive and piezoelectric materials. International Journal<br />

<strong>of</strong> Solids and Structures 46, 2009.<br />

[3] M. Kaltenbacher, M. Meiler, M. Ertl. Physical modeling and<br />

numerical computation <strong>of</strong> magnetostriction. Compel, 28(4), 2009.<br />

[4] B. Weiser, H. Pfützner, J. Anger. Relevance <strong>of</strong> Magnetostriction<br />

and Forces for the Generation <strong>of</strong> Audible Noise <strong>of</strong> Transformer<br />

Cores IEEE Trans. on Magnetics, 36(5), 2000.<br />

[5] K. Delaere, W. Heylen, K. Hameyer, R. Belmans. Local Magnetostriction<br />

Forces for Finite Element Analysis. IEEE Trans. on<br />

Magnetics, 36(5), 2000.<br />

[6] A. Dorfmann and R. W. Ogden. Magneto-elastic modeling <strong>of</strong><br />

elastomers. Eur. J. Mechanics and Solids, 22, 2003.<br />

[7] K. Fonteyn, A. Belahcen, R. Kouhia, P. Rasilo, A. Arkkio. FEM<br />

for Directly Coupled Magneto-Mechanical Phenomena in Electrical<br />

Machines. IEEE Trans. on Magnetics, 46(8), 2010.<br />

[8] A. Volk, M. Kaltenbacher, A. Hauck, M. Ertl, R. Lerch. Finite Element<br />

Scheme based on Magnetic Vector Potential and Mechanical<br />

Displacement for Modeling Magnetostriction. <strong>Proceedings</strong> <strong>of</strong> the<br />

8th International Conference on Computation in Electromagnetics<br />

CEM, 2011.<br />

[9] A. Hauck, M. Ertl, J. Schöberl, M. Kaltenbacher. Accurate Simulation<br />

<strong>of</strong> Transformer Step-Lap Joints using Anisotropic Higher<br />

Order FEM. 15 th IGTE Symposium, <strong>Graz</strong>, Austria, 2012.<br />

[10] M. Kaltenbacher. Numerical Simulation <strong>of</strong> Mechatronic Sensors<br />

and Actuators. Springer, 2nd edition, 2007.


- 186 - 15th IGTE Symposium 2012<br />

Analytic Approximation Solution for the<br />

Schwarz-Christ<strong>of</strong>fel Parameter Problem<br />

Norbert Eidenberger∗ and Bernhard G. Zagar∗ ∗Institute for Measurement <strong>Technology</strong>, Altenberger Strasse 69, A-4040 Linz, Austria<br />

E-mail: norbert.eidenberger@jku.at<br />

Abstract—We present a novel analytic approximation method for the Schwarz-Christ<strong>of</strong>fel parameter problem based on<br />

linearization. The modeling requirements for successful linearization are discussed. The linearization introduces a mapping<br />

error which can be virtually eliminated by applying an optimization method. Thus, the proposed method yields conformal<br />

mapping functions which can provide solutions for inverse problems and are suited for sensitivity analyses.<br />

Index Terms—Schwarz-Christ<strong>of</strong>fel parameter problem, Schwarz-Christ<strong>of</strong>fel transform, conformal mapping, potential<br />

problem.<br />

I. INTRODUCTION<br />

Conformal mapping methods provide useful tools for<br />

the analysis <strong>of</strong> many physical phenomena. In particular,<br />

these methods can be utilized to solve two dimensional<br />

potential problems which appear e. g. in electromagnetics,<br />

fluid dynamics, or heat transfer [1], [2]. The<br />

general idea consists in transforming a potential problem<br />

bounded by a complicated geometry to a simpler one,<br />

for which the solution can be computed more easily.<br />

The transformation is performed by a conformal mapping<br />

function. Subsequently, this function also transforms the<br />

solution <strong>of</strong> the simpler problem to the complicated one<br />

which provides the solution <strong>of</strong> the original problem.<br />

Some recent examples for the application <strong>of</strong> conformal<br />

mapping methods are [3], [4], and [5].<br />

For the purpose <strong>of</strong> conformal mapping the coordinates<br />

<strong>of</strong> two dimensional problems are interpreted as the real<br />

and imaginary parts <strong>of</strong> complex numbers. Thus, the<br />

mapping functions represent complex valued functions<br />

in complex variables [6]. Different methods are available<br />

for the construction <strong>of</strong> suitable mapping functions [2].<br />

One <strong>of</strong>ten utilized method, the Schwarz-Christ<strong>of</strong>fel transform<br />

(SCT), constructs mapping functions for polygonal<br />

geometries. Many relevant technical problems involve<br />

polygon shaped boundaries therefore the SCT plays an<br />

important role in many applications.<br />

In order to employ the SCT, its problem dependant<br />

parameters need to be computed. It is not possible to<br />

compute the parameters for polygons with more than<br />

three corners analytically, although a unique solution for<br />

the SCT parameters exists. This constitutes the so-called<br />

SCT parameter problem [7]. Nowadays numerical methods<br />

are routinely employed to solve the SCT parameter<br />

problem [8]. However, numerical methods yield solutions<br />

which are disconnected from the original problem geometry.<br />

This prevents further analysis with respect to the<br />

geometric parameters <strong>of</strong> the original problem.<br />

In this paper we propose a solution method for the SCT<br />

parameter problem based on a series expansion <strong>of</strong> the<br />

SCT base function (1). The method yields an approximate<br />

analytic solution for the SCT parameters containing the<br />

geometry parameters. Due to the approximation error<br />

the resulting mapping function produces mapping errors.<br />

We show that the mapping errors can be eliminated by<br />

prewarping the geometric parameters appropriately. This<br />

is achieved through an optimization method which minimizes<br />

the mapping error. The advantage <strong>of</strong> the proposed<br />

method over the standard numerical solution consists in<br />

the presence <strong>of</strong> the geometry parameters in the mapping<br />

function which permits further analysis <strong>of</strong> the problem,<br />

e. g. sensitivity analyses or solving inverse problems.<br />

This paper consists <strong>of</strong> three main parts. The first part<br />

gives a short introduction to the SCT together with<br />

the corresponding parameter problem. The second part<br />

describes the approximation method for the solution <strong>of</strong><br />

the SCT parameter problem. The third part presents the<br />

minimization method which eliminates the mapping error.<br />

Finally, the conclusion sums up the properties <strong>of</strong> the<br />

proposed method and highlights its potential advantages<br />

and applications.<br />

II. THE SCHWARZ-CHRISTOFFEL TRANSFORM<br />

The SCT represents a widely utilized method for<br />

constructing conformal mapping functions. The SCT base<br />

equation,<br />

<br />

n<br />

z = f(w) =A (w − wi) αi π −1<br />

dw + B, (1)<br />

i=1<br />

maps the upper half <strong>of</strong> the image (w-) plane to the<br />

inside <strong>of</strong> a polygon in the object (z-) plane [9] which<br />

is illustrated in Fig. 1. It contains several unknown<br />

parameters. Parameter A represents a scaling and rotation<br />

factor, parameter B represents a translation, and the<br />

parameters wi represent the corner coordinates in the<br />

w-plane with the known parameters αi representing the<br />

corresponding interior angles.<br />

The unknown parameters A, B and wi are computed<br />

by comparing the polygon corners coordinates in both<br />

planes via the relation z = f(w). The resulting number <strong>of</strong><br />

equations equals the number <strong>of</strong> unknowns, which means<br />

that a unique solution exists. However, for polygons with<br />

more than three corners the integral in (1) yields special


- 187 - 15th IGTE Symposium 2012<br />

Fig. 1. Example setup for the Schwarz-Christ<strong>of</strong>fel transform showing the geometric parameters <strong>of</strong> a rectangle in the z-plane and its image in the<br />

w-plane.<br />

functions for which no inverse functions exist. This<br />

prohibits the analytic computation <strong>of</strong> the SCT parameters<br />

even though it is known that a unique solution exists. This<br />

constitutes the SC parameter problem [7].<br />

Nowadays, the parameter problem is usually solved<br />

numerically. A thorough discussion <strong>of</strong> numerical solution<br />

methods for the SC parameter problem is presented in<br />

[8]. The authors <strong>of</strong> [8] also provide a Matlab toolbox<br />

[10] which permits an easy application <strong>of</strong> the SCT.<br />

For quadrilaterals such as rectangles the SCT produces<br />

mapping functions which consist <strong>of</strong> elliptic functions.<br />

In these cases the elliptic modulus can be utilized to<br />

efficiently compute the parameters numerically [7]. An<br />

application example for this approach is presented in<br />

[11].<br />

III. ANALYTIC APPROXIMATION OF THE PARAMETER<br />

PROBLEM<br />

The disadvantage <strong>of</strong> purely numerical solutions <strong>of</strong> the<br />

SC parameter problem consists in the missing relation to<br />

the original geometry. This prevents subsequent analyses<br />

<strong>of</strong> problems with respect to their geometry. Preserving<br />

this relation requires an analytic approach which is<br />

developed below.<br />

The proposed method consists <strong>of</strong> several steps. The<br />

first step consists in constructing the SCT for the problem<br />

at hand. The evaluation <strong>of</strong> the integral in (1) then yields<br />

a mapping function for which the SC parameters need to<br />

be computed. In order to compute them analytically, the<br />

mapping function is linearized. Then the SC parameter<br />

problem is solved for the linearized mapping function.<br />

The results are inserted back into the original nonlinear<br />

mapping function which then contains geometry parameters<br />

<strong>of</strong> the original problem.<br />

Unfortunately, the procedure is not quite that straight<br />

forward. Several problems which may occur during linearization<br />

need to be addressed, before the method can<br />

be applied successfully.<br />

A. Method Development<br />

An analysis <strong>of</strong> (1) shows that the SCT lends itself to<br />

Taylor series expansion. Equation (1) can be rewritten as<br />

<br />

z = A g(w)dw + B (2)<br />

where<br />

g(w) =<br />

n<br />

i=1<br />

(w − wi) α i<br />

π −1<br />

which represents the transformation core <strong>of</strong> the SCT<br />

defining the general polygon shape. Equation (2) indicates,<br />

that beginning with the first order term, the Taylor<br />

series consists only <strong>of</strong> the transformation core g(w) and<br />

its derivatives. Because g(w) represents a product, this<br />

means that the series expansion consists mainly <strong>of</strong> simple<br />

functions.<br />

Expanding (2) as a Taylor series yields<br />

(3)<br />

z = f(w0)+Ag(w0)(w − w0)+O(w 2 ) (4)<br />

where O(w2 ) represents the higher order terms <strong>of</strong> the<br />

series. In order to be able to solve the parameter problem,<br />

the SC parameters must not appear within non-invertible<br />

functions. The non-invertible special functions contained<br />

in f(w) disappear in the first and higher order terms<br />

<strong>of</strong> the series. However, there remains a special function<br />

within the zeroth order term<br />

<br />

<br />

f(w0) =A g(w)dw<br />

+ B. (5)<br />

w=w0<br />

The special function vanishes if the result <strong>of</strong> the integral<br />

at w0 equals zero. Parameter B represents a translation<br />

<strong>of</strong> the mapped polygon in the z-plane, thus it can be set<br />

to B =0without loss <strong>of</strong> generality. This corresponds<br />

to mapping the origins <strong>of</strong> the w- and z-plane onto each<br />

other, so the Taylor series expansion is centered at w0 =<br />

0 and f(w0) =0. If a translation <strong>of</strong> the mapping result is<br />

truly required, this can be achieved by a simple additional<br />

mapping function.<br />

Truncating (4) after the first order term and incorporating<br />

the above considerations yields<br />

z = Ag(w0)(w − w0) (6)<br />

which represents a mapping function linearized with<br />

respect to the image coordinates w. Note, that (6) usually<br />

will not be linear with respect the geometric parameters.<br />

The condition developed in the previous step requires<br />

that the origins <strong>of</strong> the planes are mapped onto each<br />

other. This can only be guaranteed if the coordinates <strong>of</strong><br />

a point are known in both planes. This is the case for the<br />

polygon corners which are present in the transformation<br />

core g(w).


- 188 - 15th IGTE Symposium 2012<br />

Fig. 2. Example for a symmetric polygon which features a point besides the corners for which its coordinates are known in both planes.<br />

The transformation core consists <strong>of</strong> a product <strong>of</strong> terms<br />

which contain the coordinates <strong>of</strong> the polygon corners<br />

in the w-plane. However, for a corner at w0 = 0 the<br />

transformation core evaluates either to zero or infinity,<br />

depending on the angle αi in the exponent corresponding<br />

to the corner.<br />

g(0) = 0 if α>π (7)<br />

g(0) = ∞ if α


Fig. 3. Illustration <strong>of</strong> the mapping error introduced into the mapping<br />

function by the linearization.<br />

In this case the mapping error can be defined as the<br />

distances between the ideal and the actual mapped corner<br />

positions in the z-plane<br />

i=1<br />

Δz = f<br />

<br />

w, <br />

d − z, (13)<br />

where the components <strong>of</strong> w represent the coordinates<br />

wi <strong>of</strong> the polygon corners in the w-plane, and z and<br />

Δz consist <strong>of</strong> the corresponding ideal polygon corner<br />

positions and mapping errors in the z-plane.<br />

Equation (13) is not suited as an objective function<br />

for minimizing the mapping error because the sign <strong>of</strong><br />

the mapping error may change. In order to avoid this,<br />

the sum <strong>of</strong> the square <strong>of</strong> the real and imaginary part <strong>of</strong><br />

the mapping errors is utilized as the objective function<br />

<br />

n<br />

<br />

min Re(Δzi) 2 +Im(Δzi) 2<br />

, (14)<br />

where n represents the number <strong>of</strong> polygon corners.<br />

Equation (14) can be reformulated as<br />

min Δz T Δz ∗ , (15)<br />

where Δz ∗ represents the vector containing the complex<br />

conjugate mapping error. The objective function in (15)<br />

forms a convex function with a global minimum at 0<br />

which is known to exist. In addition, (15) consists <strong>of</strong> a<br />

sum <strong>of</strong> squares. For this type <strong>of</strong> functions exist specialized<br />

optimization algorithms [12], [13], which ensures<br />

that a solution can be computed easily.<br />

V. CONCLUSION<br />

In this paper we have presented a method for computing<br />

an analytic approximation solution <strong>of</strong> the SC<br />

parameter problem. The proposed method takes advantage<br />

<strong>of</strong> the structure <strong>of</strong> the SCT base equation and<br />

shows that a linearization <strong>of</strong> the problem is possible<br />

under certain conditions. These conditions are defined<br />

and their consequences regarding the modeling process<br />

are discussed.<br />

The resulting conformal mapping function contains<br />

approximation errors. We presented a formulation <strong>of</strong> the<br />

corresponding mapping error which permits its minimization<br />

to arbitrarily small dimensions by varying the<br />

geometric parameters <strong>of</strong> the original polygon. Thus, the<br />

proposed method yields a conformal mapping function<br />

which produces the desired map.<br />

- 189 - 15th IGTE Symposium 2012<br />

The advantage <strong>of</strong> the proposed analytic over the conventional<br />

numeric approximation consists in the presence<br />

<strong>of</strong> the geometry parameters in the mapping function.<br />

Together with the minimization procedure this permits<br />

• the solution <strong>of</strong> inverse problems e.g. in capacitive<br />

sensing applications,<br />

• the sensitivity analysis <strong>of</strong> sensor setups with respect<br />

to their geometry [14].<br />

ACKNOWLEDGMENT<br />

The authors gratefully acknowledge the partial financial<br />

support for the work presented in this paper by the<br />

Austrian Research Promotion Agency and the Austrian<br />

COMET program supporting the Austrian Center <strong>of</strong><br />

Competence in Mechatronics (ACCM).<br />

REFERENCES<br />

[1] P. M. Morse and H. Feshbach, Methods <strong>of</strong> theoretical physics.<br />

McGraw-Hill, 1953, vol. 1.<br />

[2] R. Schinzinger and A. Laura, Conformal Mapping: Methods and<br />

Applications. Elsevier, 1991.<br />

[3] A. Verh<strong>of</strong>f, “Generalized poisson integral formula applied to<br />

potential flow solutions for free and confined jets with secondary<br />

flow,” Computers & Fluids, vol. 54, pp. 18–38, 2012.<br />

[4] A. J. Davidson and N. J. Mottram, “Conformal mapping techniques<br />

for the modelling <strong>of</strong> liquid crystal devices,” European<br />

Journal <strong>of</strong> Applied Mathematics, vol. 23, no. 01, pp. 99–119,<br />

2012.<br />

[5] M. Schwarz, T. Holtij, A. Kloes, and B. Iñíguez, “Analytical<br />

compact modeling framework for the 2D electrostatics in lightly<br />

doped double-gate mosfets,” Solid-State Electronics, vol. 69, pp.<br />

72–84, 2012.<br />

[6] P. Henrici, Applied and Computational Complex Analysis. John<br />

Wiley & Sons, 1974, vol. 1.<br />

[7] ——, Applied and Computational Complex Analysis. John Wiley<br />

& Sons, 1986, vol. 3.<br />

[8] T. A. Driscoll and L. N. Trefethen, Schwarz–Christ<strong>of</strong>fel Mapping,<br />

P. Ciarlet, A. Iserles, R. Kohn, and M. Wright, Eds. Cambridge<br />

<strong>University</strong> Press, 2002.<br />

[9] V. I. Smirnov, Lehrbuch der höheren Mathematik, 14th ed. Harri<br />

Deutsch, 1995.<br />

[10] T. Driscoll. (2012, Sep.) Schwarz-Christ<strong>of</strong>fel toolbox<br />

for MATLAB. <strong>University</strong> <strong>of</strong> Delaware, Department<br />

<strong>of</strong> Mathematical Sciences. [Online]. Available:<br />

http://www.math.udel.edu/ driscoll/s<strong>of</strong>tware/SC/<br />

[11] R. Igreja and C. Dias, “Extension to the analytical model <strong>of</strong> the<br />

interdigital electrodes capacitance for a multi-layered structure,”<br />

Sensors and Actuators A: Physical, vol. 172, no. 2, pp. 392–399,<br />

2011.<br />

[12] R. Fletcher, Practical methods <strong>of</strong> optimization, 2nd ed. Wiley,<br />

2000.<br />

[13] P. E. Gill, W. Murray, and M. H. Wright, Practical optimization.<br />

Academic Press, 1981.<br />

[14] N. Eidenberger and B. G. Zagar, “Sensitivity <strong>of</strong> capacitance<br />

sensors for quality control in blade production,” in ICST 2011,<br />

Dec. 2011.


- 190 - 15th IGTE Symposium 2012<br />

Additional Eddy Current Losses in Induction<br />

Machines Due to Interlaminar Short Circuits<br />

P. Handgruber∗ , A. Stermecki∗ ,O.Bíró∗ , and G. Ofner †<br />

∗Institute for Fundamentals and Theory in Electrical Engineering (IGTE),<br />

Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />

Inffeldgasse 18/I, A-8010 <strong>Graz</strong>, Austria<br />

† ELIN Motoren GmbH, Elin-Motoren-Straße 1, A-8160 Preding/Weiz, Austria<br />

E-mail: paul.handgruber@tugraz.at<br />

Abstract—A novel three-dimensional eddy current model to account for the additional losses caused by interlaminar short<br />

circuits is presented and applied to the loss estimation <strong>of</strong> an induction machine. The method is based on a single sheet model<br />

with appropriate boundary conditions on the interlaminar contact surface avoiding cumbersome full three-dimensional<br />

models with multiple short circuited laminations. The results <strong>of</strong> the single sheet model without short circuits are compared<br />

to measured no-load iron loss data. The interlaminar model is validated by means <strong>of</strong> full models comprising several<br />

interconnected sheets and used for the quantification <strong>of</strong> extra losses caused by conductive joints and shearing burrs. It has<br />

been found that particularly burrs occurring on the tooth edges lead to a significant loss increase.<br />

Index Terms—AC-machines, eddy currents, electric machines, electromagnetic modeling, finite element methods, magnetic<br />

losses<br />

I. INTRODUCTION<br />

Active iron parts in electrical machines are commonly<br />

built <strong>of</strong> thin steel laminations. In an ideally assembled<br />

core, the individual laminates are isolated from each<br />

other in order to minimize the effects <strong>of</strong> eddy currents.<br />

During the cutting process, mechanical deformations lead<br />

to microscopic shearing burrs on the cutting edges. This<br />

edge burrs can break down the insulation resulting in<br />

conductive connections between the stacked sheets. If<br />

the burr-induced short circuits cover several laminations,<br />

high currents begin to circulate leading to a significant<br />

loss increase and hence to local overheatings [1].<br />

The core stacks are frequently held together by conductive<br />

joints, such as bolts, welds or clamping bars.<br />

In most cases, these fixations as well as the shaft and<br />

parts <strong>of</strong> the frame are mounted uninsulated on the core,<br />

short-circuiting a large number <strong>of</strong> laminations. Further<br />

interlaminar contacts are induced by small insulation<br />

faults on the lamination surfaces inside the core middle.<br />

However, the probability <strong>of</strong> such inner short circuits is<br />

very low and stochastic [2]. Therefore, their effects are<br />

not considered in this work.<br />

Up to now, the complex problem <strong>of</strong> interlaminar short<br />

circuits has been chiefly treated by statistical and/or empirical<br />

methods accompanied by vast measurement series<br />

[3], [4]. Attempts to quantify the resulting additional<br />

losses are mostly based on analytical approaches. For<br />

instance, in [5], [6] and [7] artificial burrs have been applied<br />

in a controlled manner to a distribution transformer.<br />

Comparisons <strong>of</strong> the performed measurements to an analytical<br />

eddy current model showed only poor correlation<br />

due to the simplifications made. In [8], [9] and [10] the<br />

losses have been evaluated using a resistance network<br />

analogy. The circuit parameters have been derived from<br />

small-scale models based on simple geometries.<br />

In order to enable studies on more complicated structures<br />

like an electrical machine, a novel method based<br />

on a three-dimensional (3-D) finite element analysis<br />

is proposed in this work. The method is capable to<br />

compute the true paths <strong>of</strong> the eddy currents and values<br />

<strong>of</strong> the ensuing losses. A full 3-D model with multiple<br />

joined laminations is avoided by introducing appropriate<br />

boundary conditions on the interlaminar contact surface<br />

<strong>of</strong> a single sheet model.<br />

This paper is organized as follows: In section II a<br />

novel 3-D eddy current model considering the effects <strong>of</strong><br />

interlaminar short circuits is introduced. In section III<br />

the method presented is applied to the loss estimation<br />

<strong>of</strong> a megawatt rated slip-ring induction machine. First,<br />

the single sheet model without short circuits is compared<br />

to no-load iron loss measurements. The validation <strong>of</strong><br />

the interlaminar contact model is performed using full<br />

models with several short circuited laminations. Finally,<br />

the effects <strong>of</strong> conductive joints and shearing burrs are<br />

subjected to an in-depth analysis.<br />

II. 3-D EDDY CURRENT MODEL<br />

The proposed method can be subdivided into two steps:<br />

first, a transient two-dimensional (2-D) field analysis is<br />

carried out for the whole machine. In a second step, a<br />

transient nonlinear 3-D eddy current problem is solved<br />

separately for an individual stator and rotor sheet.<br />

The 3-D model is excited by time-dependent boundary<br />

conditions obtained from the 2-D field analysis. This<br />

allows separate treatment <strong>of</strong> the stator and rotor sheets<br />

and avoids cumbersome and computationally expensive


transient 3-D finite element simulations including rotor<br />

motion [11]. Nevertheless, all loss relevant effects, like<br />

high-order field harmonics and the rotor movement are<br />

included in the analysis.<br />

Only a single sheet is considered in the 3-D model,<br />

the effects <strong>of</strong> interlaminar short circuits are taken into<br />

account by boundary conditions on the contact surface.<br />

This approach is valid ins<strong>of</strong>ar as, for multiple interconnected<br />

laminates, the electromagnetic quantities in the<br />

sheets within the core become periodic.<br />

A nodal based A,V -A formulation is employed for the<br />

analysis <strong>of</strong> the 2-D problem. The 2-D iron regions are assumed<br />

to be non-conductive. The massive rotor windings<br />

are modeled as eddy current domains but not the stator<br />

conductors. The starting transients <strong>of</strong> the machine are<br />

bypassed by using the initial steady state solution from<br />

a time harmonic analysis. The 3-D eddy current problem<br />

is solved solely in the conductive laminations using the<br />

A,V formulation based on isoparametric hexahedral edge<br />

elements with quadratic shape functions [12]. Introducing<br />

the magnetic vector potential A, in the whole problem<br />

domain Ω and the electric scalar potential V , in the eddy<br />

current region Ωc as<br />

B = ∇×A,<br />

E = − ∂ ∂<br />

A −<br />

∂t ∂t ∇V<br />

⎫<br />

⎬<br />

⎭ in Ωc, (1)<br />

B = ∇×A in Ω − Ωc<br />

(2)<br />

where B is the magnetic flux density, E is the electric<br />

field intensity and t is time, the Maxwell equations under<br />

quasistatic approximation can be written as<br />

∇×ν∇×A + σ ∂ ∂<br />

A + σ ∇V = 0,<br />

∂t ∂t<br />

∇· σ ∂ ∂<br />

A + σ<br />

∂t ∂t ∇V<br />

⎫<br />

⎪⎬<br />

in Ωc, (3)<br />

=0 ⎪⎭<br />

∇×ν∇×A = J0 in Ω − Ωc. (4)<br />

Herein ν denotes the reluctivity and σ the conductivity <strong>of</strong><br />

the sheet material, J0 represents the given current density<br />

<strong>of</strong> the stranded coils in the 2-D model.<br />

Fig. 1 shows the specifications <strong>of</strong> the boundary conditions<br />

for the 3-D model <strong>of</strong> a stator sheet. The boundary<br />

conditions are prescribed anew for every time step. On<br />

the boundaries along the lamination thickness, the model<br />

is excited by the normal component <strong>of</strong> B derived form<br />

the 2-D field analysis. The prescription <strong>of</strong> the normal<br />

component <strong>of</strong> B is equivalent to specifying the tangential<br />

component <strong>of</strong> A. This component is obtained from the<br />

2-D field vector potential A2-D and is assumed to be<br />

constant along the thickness. The boundaries along the<br />

sheet thickness are hereinafter called axial boundaries.<br />

Since flux in the axial direction is neglected, the tangential<br />

component <strong>of</strong> A is set to zero on the boundaries<br />

parallel to the laminations.<br />

Setting the electric scalar potential free on the boundaries<br />

<strong>of</strong> Ωc denoted by Γi ensures the satisfaction <strong>of</strong> the<br />

- 191 - 15th IGTE Symposium 2012<br />

Fig. 1. Specification <strong>of</strong> the boundary conditions for a 3-D stator sheet<br />

model. The sheet thickness is increased for better visibility. Symmetry<br />

boundary conditions on the bottom surface allow to consider only half<br />

<strong>of</strong> the thickness.<br />

natural boundary condition <strong>of</strong> vanishing normal component<br />

<strong>of</strong> the current density J here. At the interlaminar<br />

contact surface Γc, the current flow is assumed to be<br />

normal to the face, resulting in a constant and unknown<br />

electric scalar potential. Furthermore, it has to be guaranteed<br />

that the exchanged current through the contact spots<br />

between the sheets is zero by introducing the surface<br />

integral relation<br />

<br />

Γc<br />

σ<br />

<br />

∂ ∂<br />

A +<br />

∂t ∂t ∇V<br />

<br />

· n dΓ = 0 (5)<br />

as an additional equation into the finite element equation<br />

system [13]. The symbol n stands for the outer normal<br />

vector.<br />

For interlaminar contacts occurring on the sheet boundaries<br />

like shearing burrs, the effects <strong>of</strong> the high circulating<br />

eddy currents on the given magnetic boundary field<br />

cannot be neglected. In such cases, the magnetic field<br />

density is not prescribed directly on the axial sheet<br />

boundaries, but on an additional outer finite element layer<br />

surrounding the lamination. This layer is modeled as nonconductive<br />

extending the 3-D problem to an A,V -A one.<br />

Thus, the required independence <strong>of</strong> the prescribed field<br />

from the eddy currents is ensured again.<br />

III. APPLICATION AND RESULTS<br />

The method presented has been applied to the loss<br />

estimation <strong>of</strong> a megawatt rated, 50 Hz, 690 V, deltaconnected<br />

three-phase, four pole slip-ring induction machine<br />

fed by sinusoidal voltage. In the case <strong>of</strong> a healthy<br />

machine without short circuits present in the core, the<br />

computed total iron losses are compared to no-load<br />

measurements. After validating the proposed interlaminar<br />

contact model against full models with many short<br />

circuited sheets, the additional losses due to conductive<br />

joints and shearing burrs will be quantified.


- 192 - 15th IGTE Symposium 2012<br />

(a) (b)<br />

Fig. 2. Eddy current loss density distribution in a stator (a) and rotor (b) sheet at rated no-load current and a specific time instant.<br />

A. No-load iron losses<br />

According to the statistical loss theory, the total iron<br />

losses are composed <strong>of</strong> eddy current, hysteresis and<br />

excess losses [14]. The computed 3-D eddy current<br />

loss distribution for a healthy stator and rotor sheet<br />

is presented in Fig. 2. The losses are quite uniformly<br />

distributed in the stator sheet and mainly attributable to<br />

the 50 Hz fundamental frequency component. The rotor<br />

losses are concentrated in the vicinity <strong>of</strong> the air-gap<br />

and primarily evoked by the first stator slot harmonic at<br />

1800 Hz. In order to cover all relevant harmonic effects,<br />

the time step size Δt was fixed to Δt = T/500 for the stator<br />

and Δt = T/1000 for the rotor sheet simulation; T is the<br />

time period <strong>of</strong> the fundamental frequency. The total eddy<br />

current losses are obtained by integrating the product <strong>of</strong><br />

E and J over the sheet volume, averaged over time and<br />

multiplied by the number <strong>of</strong> laminations. The proportion<br />

<strong>of</strong> the losses in the rotor core is remarkable and nearly<br />

as high as in the stator (see Fig. 3(a)).<br />

The measured and computed no-load iron losses as<br />

a function <strong>of</strong> the supply current are compared in Fig.<br />

3(b). During the measurements, the rotor windings were<br />

(a) (b)<br />

Fig. 3. Simulated no-load eddy current losses (a) and separated total<br />

iron losses (b) as a function <strong>of</strong> the supply current as well as measured<br />

losses.<br />

kept open in order to avoid additional rotor currents<br />

and hence further losses. The test machine was driven<br />

by a second machine at synchronous speed. Thereby,<br />

the friction losses are covered by the driving machine.<br />

The measured losses on the stator terminals <strong>of</strong> the test<br />

machine correspond to the iron losses and stator winding<br />

losses. For the computation, the hysteresis and excess<br />

losses are obtained by a method developed in [15] based<br />

on the evaluation <strong>of</strong> the shape <strong>of</strong> dynamic magnetization<br />

curves. The hysteresis losses are calculated by a static<br />

vector Preisach model [16], [17], the excess losses using<br />

the statistical loss theory [14]. The good agreement<br />

between the measurement and simulation results confirms<br />

that the methods used are able to cope with the complex<br />

electromagnetic phenomena arising in an induction machine,<br />

i. e. rotating flux and high-order field harmonics.<br />

The stator eddy current losses for no-load can even be<br />

evaluated correctly using time harmonic analyses, since<br />

they are almost exclusively caused by the fundamental<br />

field component. The methodology <strong>of</strong> the proposed twostep<br />

approach can be adopted in a straightforward way<br />

to time harmonic problems. For rated no-load current<br />

(I0=288 A), the losses obtained from a transient analysis<br />

are 901.1 W, the time harmonic analysis yields 903.7 W.<br />

When load is getting applied, the field harmonics increase<br />

strongly and transient analyses are required [18]. In<br />

order to shorten the computation time, the following<br />

investigations on the effects <strong>of</strong> interlaminar short circuits<br />

have been performed for the stator sheet at rated noload<br />

current using time harmonic calculations. However,<br />

all relevant factors influencing the behavior underlying<br />

an interlaminar short circuit are still incorporated in the<br />

computation. The use <strong>of</strong> time harmonic analyses constitutes<br />

no restriction <strong>of</strong> the introduced approach which is<br />

easily expandable for transient simulations.<br />

B. Validation <strong>of</strong> the interlaminar model<br />

The proposed single sheet model combined with<br />

boundary conditions considering the interlaminar interaction<br />

has been validated using two different examples:<br />

a simple conductive ring and a stator sheet sector <strong>of</strong> the<br />

induction machine investigated.<br />

1) Conductive ring: As shown in Fig. 4(a), the 2-D<br />

model <strong>of</strong> the ring is excited by a conductor arranged<br />

symmetrically in the bore separated from the core by<br />

an air-gap. The 3-D model (see Fig. 4(b)) consists <strong>of</strong><br />

ten and a half stacked lamination quarters short circuited<br />

by two conductive joints through the entire core stack.<br />

The joint material is the same as those <strong>of</strong> the laminates.<br />

The isolation between the sheets is modeled as a nonconductive<br />

A-region with a relative permeability <strong>of</strong> one<br />

and a thickness <strong>of</strong> one hundredth <strong>of</strong> those <strong>of</strong> the laminations.<br />

On the curved boundaries, the 3-D problem is


(a) (b)<br />

Fig. 4. Validation geometry <strong>of</strong> the conductive ring: 2-D (a) and 3-D<br />

(b) model.<br />

excited by boundary conditions derived form the 2-D<br />

field solution. Periodic boundary conditions have been<br />

applied on the left and right boundaries. On the top<br />

surface, the normal component <strong>of</strong> B and that <strong>of</strong> J are set<br />

to zero; on the bottom surface, the problem is mirrored<br />

using symmetric boundary conditions. The 2-D problem<br />

is treated with the time harmonic A formulation, the<br />

3-D one with the A,V -A formulation. Linear material<br />

properties are assumed for the sake <strong>of</strong> simplicity.<br />

Figs. 5(a) and 5(b) show the current and flux density<br />

distributions computed by the full model. The electromagnetic<br />

quantities in the center sheets repeat periodically<br />

suggesting the applicability <strong>of</strong> the reduced<br />

approach. Consequently, the currents circulating through<br />

the contact spots affect the original 2-D field distribution<br />

(see Figs. 5(c) and 5(d)). The undermost half lamination<br />

serves as a validation reference for the reduced model<br />

- 193 - 15th IGTE Symposium 2012<br />

with a single sheet. Fig. 6 compares the obtained field<br />

solutions. The good agreement is also verified in terms<br />

<strong>of</strong> losses wich are 10.93 mW for the full model and<br />

11.37 mW for the reduced one.<br />

2) Stator sheet sector: Since 3-D simulations with<br />

multiple short circuited stator laminations are hardly<br />

feasible, only a small sheet sector is considered in the<br />

full model. Fig. 7(a) shows the used 2-D model <strong>of</strong><br />

the previously examined induction machine, Fig. 7(b)<br />

the 3-D model comprising hundred and a half laminations<br />

interconnected by shearing burrs over two teeth<br />

throughout the stack. A continuous burr width <strong>of</strong> 100 μm<br />

(a) (b)<br />

Fig. 7. Validation geometry <strong>of</strong> the stator sheet sector: 2-D (a) and<br />

3-D (b) model.<br />

has been selected requiring a rather fine mesh near the<br />

burred region. Results for different burr widths will be<br />

discussed in section III-C2. The burr material properties<br />

are the same as that <strong>of</strong> the sheets. The isolation thickness<br />

is specified as one fiftieth <strong>of</strong> the lamination thickness<br />

(a) (b)<br />

(c) (d)<br />

Fig. 5. Current density (a) and flux density (b) at a specific time instant for the full model as well as the corresponding vector plots (c,d) for the<br />

framed sectors.


- 194 - 15th IGTE Symposium 2012<br />

(a) (b)<br />

(c) (d)<br />

Fig. 6. Current density and flux density distribution computed for the bottom lamination <strong>of</strong> the full model (a,b) and the reduced one (c,d).<br />

(d=0.5 mm). The 2-D field solution is prescribed on an<br />

outer non-conductive layer enclosing the teeth and on the<br />

stator back. The normal component <strong>of</strong> both B and J is<br />

set to zero on the top surface and on the boundaries intersecting<br />

the yoke. Again, the problem region is mirrored<br />

at the bottom surface and solved in the frequency domain<br />

using the A,V -A formulation. Linear media are used for<br />

validation purposes, nonlinearity is considered in the next<br />

sections by means <strong>of</strong> well established methods.<br />

The eddy current distribution for rated no-load current<br />

calculated by the full model is shown in Fig. 8. In<br />

the burr region, high currents are closing over the teeth<br />

(see also Fig. 12). The induced currents increase with<br />

Fig. 8. Current density distribution at rated no-load current and a<br />

specific time instant for the full model.<br />

the number <strong>of</strong> short circuited laminations and approach<br />

asymptotically a final value. Accordingly, the reduced<br />

model presents a worst case scenario for a suitably<br />

large number <strong>of</strong> interconnected sheets. The required sheet<br />

number depends on various factors, such as size and<br />

position <strong>of</strong> the short circuits, sheet dimensions or material<br />

properties. The undermost lamination <strong>of</strong> the full model<br />

and the reduced method again give similar current density<br />

distributions as shown in Fig. 9. The resulting losses<br />

for the full model are 392.0 mW, the reduced approach<br />

predicts 363.9 mW.<br />

C. Effects <strong>of</strong> interlaminar short circuits<br />

Two examples <strong>of</strong> interlaminar short circuits will be<br />

addressed in more detail. The first one involves conductive<br />

joints represented by clamping bars. Second, the<br />

impact <strong>of</strong> shearing burrs will be discussed by means <strong>of</strong><br />

parametric studies. All <strong>of</strong> the following simulations have<br />

been carried out for rated no-load current in the frequency<br />

domain using the proposed interlaminar model. Nonlinearity<br />

is taken into account by an effective reluctivity<br />

approach [19].<br />

1) Conductive joints: The iron stacks <strong>of</strong> mid-power<br />

range machines are <strong>of</strong>ten axially fixed by clamping bars<br />

installed after pressing the core. These clamping plates<br />

are attached uninsulated on the core back leading to<br />

a conductive connection among the sheets. Different<br />

numbers <strong>of</strong> bars installed along the stator back circumference<br />

have been investigated using the method presented.<br />

Fig. 10 demonstrates the computed current and flux<br />

density distribution when four clamping bars per pole<br />

pitch are considered in the simulation. On the one hand,<br />

the induced currents in axial direction cause additional<br />

losses. On the other hand, they oppose the original field<br />

distribution increasing the magnetic flux density and<br />

hence the losses near the clamping areas. A detailed view<br />

<strong>of</strong> the electromagnetic quantities near a contact area is<br />

shown in Fig. 11. The eddy current losses in the stator<br />

core increase by 3.93 % for one installed connection bar<br />

per pole. For four bars they rise by 13.65 %.


- 195 - 15th IGTE Symposium 2012<br />

(a) (b)<br />

Fig. 9. Current density distribution for the bottom lamination <strong>of</strong> the full (a) and reduced (b) model.<br />

(a) (b)<br />

Fig. 10. Maximal flux density (a) and loss density (b) distribution for four clamping bars per pole installed.<br />

(a) (b)<br />

(c) (d)<br />

Fig. 11. Current density (a) and flux density (b) in a contact area at a specific time instant as well as the corresponding vector plots in the bar<br />

section (c,d).<br />

2) Shearing burrs: The quantification <strong>of</strong> the extra<br />

losses caused by edge burrs is carried out performing<br />

simulations for different numbers <strong>of</strong> teeth afflicted by<br />

burrs and various burr widths. Contrary to the former<br />

validation example, a complete stator sheet quarter has<br />

been considered in the computations. Fig. 12 shows the<br />

eddy current paths in the burr layer for different numbers<br />

<strong>of</strong> teeth burred. The high magnetizing flux in the tooth<br />

area induces interlaminar currents enclosing the teeth and<br />

resulting in a steep rise <strong>of</strong> the losses. The trends in Fig.<br />

13 indicate a cubic dependence <strong>of</strong> the additional losses on<br />

the number <strong>of</strong> burred teeth. The application <strong>of</strong> burrs on<br />

the stator back led to no significant loss increase owing<br />

to the small flux density near the outermost rim.


- 196 - 15th IGTE Symposium 2012<br />

(a) (b)<br />

(c) (d)<br />

Fig. 12. Current density vector plots for one (a), two (b) three (c) and five (d) burred teeth at a specific time instant and a burr width <strong>of</strong> 100 μm.<br />

The burr region only is considered in the plot.<br />

Eddy Current Losses per Sheet Pcl in W<br />

10<br />

9<br />

8<br />

7<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

bburr =10μm<br />

bburr =20μm<br />

bburr =40μm<br />

bburr =60μm<br />

bburr =80μm<br />

bburr = 100 μm<br />

0<br />

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5<br />

Number <strong>of</strong> Teeth Burred<br />

Fig. 13. Losses as a function <strong>of</strong> the number <strong>of</strong> burred teeth for different<br />

burr widths.<br />

The width <strong>of</strong> the burr layer has been varied in a<br />

practically relevant range from 10 to 100 μm. Even<br />

broader interlaminar contacts can be present in laser cut<br />

sheets due to the heat induced insulation burn-<strong>of</strong>f. As<br />

shown in Fig. 14, the losses rise linearly with the burr<br />

width. The loss behavior for different no-load supply<br />

currents can be seen in Fig. 15. The losses increase<br />

quadratically for lower values, whereas at higher currents,<br />

saturation effects occur limiting the maximal attainable<br />

flux densities and thus losses.<br />

IV. DISCUSSION AND CONCLUSIONS<br />

The method presented enables to compute the true 3-<br />

D eddy current distribution underlying an interlaminar<br />

Eddy Current Losses per Sheet Pcl in W<br />

10<br />

9<br />

8<br />

7<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

1 tooth burred<br />

2 teeth burred<br />

3 teeth burred<br />

4 teeth burred<br />

5 teeth burred<br />

0<br />

10 20 30 40 50 60 70 80 90 100<br />

Burr Width bburr in μm<br />

Fig. 14. Losses as a function <strong>of</strong> the burr width for different numbers<br />

<strong>of</strong> burred teeth.<br />

short circuit allowing a quantitative assessment <strong>of</strong> the<br />

arising additional losses. In order to avoid full models<br />

comprising multiple short circuited laminations, only a<br />

single sheet is considered. The interlaminar interaction is<br />

taken into account by boundary conditions on the contact<br />

surface using a generalized A,V -A formulation.<br />

First, the method has been applied to the no-load<br />

iron loss estimation <strong>of</strong> a slip-ring induction machine.<br />

Therefore, simulations for a healthy machine with no<br />

short circuits present in the core have been carried out.<br />

The transient 3-D eddy current problem has been excited<br />

by boundary conditions derived from a classical 2-D field<br />

analysis and solved separately for a stator and rotor sheet.<br />

The hysteresis and excess losses required for comparisons


Eddy Current Losses per Sheet Pcl in W<br />

2<br />

1.5<br />

1<br />

0.5<br />

bburr =10μm<br />

bburr =20μm<br />

bburr =40μm<br />

bburr =60μm<br />

bburr =80μm<br />

bburr = 100 μm<br />

0<br />

50 100 150 200 250 300 350 400<br />

No-Load Current I0 in A<br />

Fig. 15. Losses as a function <strong>of</strong> the supply current for two burred<br />

teeth and different burr widths.<br />

to no-load iron loss measurements have been evaluated<br />

by a static Preisach vector model and the statistical<br />

loss theory, respectively. Good agreement was obtained<br />

between measured and simulated results.<br />

The effects <strong>of</strong> interlaminar short circuits have been<br />

studied for the stator sheet at rated no-load using timeharmonic<br />

computations, since it was found that the occurring<br />

eddy current losses are almost completely caused<br />

by the fundamental field component. Comparisons <strong>of</strong> the<br />

interlaminar contact model against full models with many<br />

laminations joined confirmed the validity <strong>of</strong> the reduced<br />

method as long as the electromagnetic quantities in the<br />

short-circuited sheets stay periodic. For a low number <strong>of</strong><br />

interconnected laminations, the full model yields lower<br />

losses than the reduced one. Consequently, the reduced<br />

model constitutes a worst-case approximation.<br />

Applications <strong>of</strong> the proposed approach revealed a<br />

significant loss increase for conductive paths introduced<br />

by shearing burrs on the tooth edges. It should be noted<br />

that the burrs studied will not be present to such an extent<br />

in a healthy machine, but the trends and findings will<br />

still apply, indicating the strong necessity to minimize<br />

bur-induced short circuits as far as practicable.<br />

The magnetic and electric properties have been assumed<br />

to be unaffected by the manufacturing process. In<br />

[20], [21] and [22] it was shown that especially the magnetic<br />

permeability near the edges can vary considerably<br />

due to the mechanical stress applied during punching.<br />

The incorporation <strong>of</strong> these effects as well as combinations<br />

<strong>of</strong> the developed method with measurement-based<br />

statistics have to be carried out in future work.<br />

V. ACKNOWLEDGMENT<br />

This work has been supported by the Christian<br />

Doppler Research Association (CDG) and by the ELIN<br />

Motoren GmbH.<br />

REFERENCES<br />

[1] P. Beckley, Electrical Steels for Rotating Machines. The Institution<br />

<strong>of</strong> Engineering and <strong>Technology</strong>, 2002.<br />

- 197 - 15th IGTE Symposium 2012<br />

[2] M. C. Marion-Pera, A. Kedous-Lebouc, T. Waeckerle, and B. Comut,<br />

“Characterization <strong>of</strong> SiFe Sheet Insulation,” IEEE Transactions<br />

on Magnetics, vol. 31, no. 4, pp. 2408–2415, 1995.<br />

[3] A. C. Beiler and P. L. Schmidt, “Interlaminar Eddy Current Losses<br />

in Laminated Cores,” Transactions <strong>of</strong> the American Institute <strong>of</strong><br />

Electrical Engineers, vol. 66, pp. 872–78, 1947.<br />

[4] C. A. Schulz, S. Duchesne, D. Roger, and J.-N. Vincent, “Capacitive<br />

short circuit detection in transformer core laminations,”<br />

Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 320, pp.<br />

e911–e914, 2008.<br />

[5] A. J. Moses and M. Aimoniotis, “Effects <strong>of</strong> Artificial Edge Burrs<br />

on the Properties <strong>of</strong> a Model Transformer Core,” Physica Scripta,<br />

vol. 39, pp. 391–393, 1989.<br />

[6] R. Mazurek, P. Marketos, A. Moses, and J.-N. Vincent, “Effect<br />

<strong>of</strong> Artificial Burrs on the Total Power Loss <strong>of</strong> a Three-Phase<br />

Transformer Core,” IEEE Transactions on Magnetics, vol. 46,<br />

no. 2, pp. 638–641, 2010.<br />

[7] R. Mazurek, H. Hamzehbahmani, A. Moses, P. I. Anderson, F. J.<br />

Anayi, and B. Thierry, “Effect <strong>of</strong> Artificial Burrs on Local Power<br />

Loss in a Three-Phase Transformer Core,” IEEE Transactions on<br />

Magnetics, vol. 48, no. 4, pp. 1653–1656, 2012.<br />

[8] D. A. Jones and W. S. Leung, “A theoretical and analogue<br />

approach to stray eddy-current loss in laminated magnetic cores,”<br />

<strong>Proceedings</strong> <strong>of</strong> the IEE - Part C: Monographs, vol. 108, no. 14,<br />

pp. 509–519, 1961.<br />

[9] C. A. Schulz, D. Roger, S. Duchesne, and J.-N. Vincent, “Experimental<br />

Characterization <strong>of</strong> Interlamination Shorts in Transformer<br />

Cores,” IEEE Transactions on Magnetics, vol. 46, no. 2, pp. 614–<br />

617, 2010.<br />

[10] J.-P. Bielawski, S. Duchesne, D. Roger, C. Demian, and T. Belgrand,<br />

“Contribution to the Study <strong>of</strong> Losses Generated by Interlaminar<br />

Short-Circuits,” IEEE Transactions on Magnetics, vol. 48,<br />

no. 4, pp. 1397–1400, 2012.<br />

[11] K. Yamazaki and N. Fukushima, “Iron-Loss Modeling for Rotating<br />

Machines: Comparison Between Bertotti’s Three-Term Expression<br />

and 3-D Eddy-Current Analysis,” IEEE Transactions on<br />

Magnetics, vol. 46, no. 8, pp. 3121–3124, 2010.<br />

[12] O. Bíró, “Edge element formulations <strong>of</strong> eddy current problems,”<br />

Computer Methods in Applied Mechanics and Engineering, vol.<br />

169, no. 3-4, pp. 391–405, 1999.<br />

[13] I. Bakhsh, O. Bíró, and K. Preis, “Skin effect problems with<br />

prescribed current condition,” in <strong>Proceedings</strong> <strong>of</strong> the 14 th Int. IGTE<br />

Symp. on Numerical Field Calculation in Electrical Engineering,<br />

2010.<br />

[14] G. Bertotti, Hysteresis in Magnetism. Academic Press, 1998.<br />

[15] E. Dlala, “A Simplified Iron Loss Model for Laminated Magnetic<br />

Cores,” IEEE Transactions on Magnetics, vol. 44, no. 11, pp.<br />

3169–3172, 2008.<br />

[16] E. Dlala, J. Saitz, and A. Arkkio, “Inverted and Forward Preisach<br />

Models for Numerical Analysis <strong>of</strong> Electromagnetic Field Problems,”<br />

IEEE Transactions on Magnetics, vol. 42, no. 8, pp. 1963–<br />

1973, 2006.<br />

[17] E. Dlala, “Efficient Algorithms for the Inclusion <strong>of</strong> the Preisach<br />

Hysteresis Model in Nonlinear Finite-Element Methods,” IEEE<br />

Transactions on Magnetics, vol. 47, no. 2, pp. 395–408, 2011.<br />

[18] E. Dlala, O. Bottauscio, M. Chiampi, M. Zucca, A. Belahcen, and<br />

A. Arrkio, “Numerical Investigation <strong>of</strong> the Effects <strong>of</strong> Loading and<br />

Slot Harmonics on the Core Losses <strong>of</strong> Induction Machines,” IEEE<br />

Transactions on Magnetics, vol. 48, no. 2, pp. 1063–1066, 2012.<br />

[19] G. Paoli and O. Bíró, “Time harmonic eddy currents in nonlinear<br />

media,” COMPEL: The International Journal for Computation<br />

and Mathematics in Electrical and Electronic Engineering,<br />

vol. 17, no. 5/6, pp. 567–575, 1998.<br />

[20] F. Ossart, E. Hug, C. Hubert, Olivier Buvat, and R. Billardon,<br />

“Effect <strong>of</strong> punching on electrical steels: Experimental and numerical<br />

coupled analysis,” IEEE Transactions on Magnetics, vol. 36,<br />

no. 5, pp. 3137–3140, 2000.<br />

[21] A. Schoppa, J. Schneider, and J.-O. Roth, “Influence <strong>of</strong> the cutting<br />

process on the magnetic properties <strong>of</strong> non-oriented electrical<br />

steels,” Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 215-<br />

216, pp. 100–102, 2000.<br />

[22] K. Fujisaki, R. Hirayama, T. Kawachi, S. Satou, C. Kaidou,<br />

M. Yabumoto, and T. Kubota, “Motor Core Iron Loss Analysis<br />

Evaluating Shrink Fitting and Stamping by Finite-Element<br />

Method,” IEEE Transactions on Magnetics, vol. 43, no. 5, pp.<br />

1950–1954, 2007.


- 198 - 15th IGTE Symposium 2012<br />

Evaluating the influence <strong>of</strong> manufacturing<br />

tolerances in permanent magnet synchronous<br />

machines<br />

I. Coenen, T. Herold, C. Piantsop Mbo’o, and K. Hameyer<br />

Institute <strong>of</strong> Electrical Machines, RWTH Aachen <strong>University</strong>, Schinkelstrasse 4, D-52056 Aachen, Germany<br />

E-mail: isabel.coenen@iem.rwth-aachen.de<br />

Abstract—Manufacturing tolerances can result in an unwanted behavior <strong>of</strong> electrical machines. Undesired parasitic effects<br />

such as torque ripples may be increased. A quality control <strong>of</strong> machines subsequent to manufacturing is therefore required<br />

in order to test whether the machines comply with its specifications. This is useful to ensure a high reliability <strong>of</strong> the<br />

manufactured machines. This paper describes the consideration <strong>of</strong> rotor tolerances due to non-ideal manufacturing processes.<br />

The idea is to estimate the influence <strong>of</strong> the manufacturing tolerances for realization <strong>of</strong> a reliable quality control. To study<br />

various fault scenarios numerical field simulations are employed which are parameterized by measurements.<br />

Index Terms—electrical machines, Finite Element Analysis (FEA), manufacturing tolerances, quality control.<br />

I. INTRODUCTION<br />

The reliability <strong>of</strong> electrical drives [1] is an important<br />

aspect to ensure a high availability. In industrial applications,<br />

permanent magnet excited synchronous machines<br />

(PMSM) are widely employed as they <strong>of</strong>fer advantages<br />

in efficiency and power density. However, especially the<br />

rotor <strong>of</strong> PMSMs is susceptible to tolerances caused during<br />

mass production. Variations from the ideal machine<br />

influence its operational behavior [2]. Therefore, it is<br />

important to verify the machine’s quality prior to its<br />

installation.<br />

Reliable and widely used diagnostic methods are vibration<br />

and current monitoring [3]. In this study, electrical<br />

quantities are focused because this <strong>of</strong>fers the advantage<br />

that no additional sensors need to be installed [4].<br />

A. Proposed monitoring setup<br />

The most <strong>of</strong>ten proposed end-<strong>of</strong>-line test is back-EMF<br />

monitoring [5], [6]. Fig. 1 shows a possible setup for its<br />

realization. Here, the motor under test is driven under<br />

open-circuit conditions. For attenuation <strong>of</strong> the drive’s<br />

influence, a flywheel is employed between drive and<br />

motor under test.<br />

<br />

<br />

<br />

<br />

<br />

<br />

Fig. 1. Back-EMF monitoring setup.<br />

<br />

<br />

This approach presents a non invasive monitoring<br />

method being benefical for diagnosis. However, such a<br />

setup is very expensive. It is cost-expensive because a<br />

drive is required and a certain device is needed to damp<br />

possible influences <strong>of</strong> the drive. Above all, it is timeexpensive<br />

due to the fact that the motor under test is<br />

mechanically coupled to the drive. This is not an efficient<br />

solution when a large number <strong>of</strong> machines needs to be<br />

tested during mass production.<br />

In this study, an additional approach is investigated<br />

where the current is being monitored. The corresponding<br />

setup is shown in Fig. 2. Here, a start-up <strong>of</strong> the motor<br />

up to a certain speed is performed in such a way that the<br />

current is measured at various speeds. The benefit <strong>of</strong> this<br />

method is its time- and cost-saving setup. When compared<br />

to the back-EMF setup, less hardware is needed.<br />

No mechanical coupling to a drive is required, simply<br />

the motor is connected to the inverter.<br />

However, for evaluating the current, it needs to be considered<br />

that the current is a controlled quantity. Impacts<br />

caused by the control system or the inverter supply might<br />

lead to misinterpretation <strong>of</strong> the results.<br />

<br />

<br />

<br />

<br />

Fig. 2. Current monitoring setup.<br />

<br />

<br />

In the following, the back-EMF and current characteristic<br />

<strong>of</strong> a PMSM is determined. In order to study various<br />

fault scenarios, numerical field calculation is employed<br />

considering tolerance affected rotor components. The aim<br />

is to evaluate the influence <strong>of</strong> such tolerances to be able to<br />

determine distinguishing characteristics. This information


can be helpful to develop an appropriate end-<strong>of</strong>-line test<br />

and to reveal which <strong>of</strong> the proposed setups is most<br />

qualified.<br />

II. MOTOR UNDER STUDY<br />

The machine studied within this work is a three-phase<br />

permanent magnet synchronous machine with tooth-coil<br />

winding system. It presents six stator slots and four pole<br />

pairs p. The eight magnets <strong>of</strong> the rotor are arranged in a<br />

spoke configuration.<br />

III. INFLUENCE OF ROTOR TOLERANCES<br />

During the manufacturing process material dependant<br />

failures, geometrical or shape deviations may occur. Such<br />

tolerances influence the machine’s behavior. For instance,<br />

increased torque ripples are caused [7].<br />

The considered tolerances within this paper concern<br />

the magnet’s material and its dimensions. The magnetization<br />

faults are illustrated Fig. 3. Possible deviations<br />

affect the magnitude <strong>of</strong> the remanence flux density BR<br />

and the angle β <strong>of</strong> the magnetization direction. Further<br />

Fig. 3. Magnetization faults.<br />

BR<br />

examples <strong>of</strong> rotor tolerances, not considered within this<br />

study, would be a displacement <strong>of</strong> the magnet and rotor<br />

eccentricity.<br />

A. Theoretical analysis<br />

Within this study, the influence <strong>of</strong> rotor tolerances onto<br />

electrical signals is focused. According to [5], for nonideal<br />

rotor components new harmonic orders nrf are<br />

expected to appear in the back-EMF spectrum which are<br />

a function <strong>of</strong> the pole pair number p:<br />

nrf =1± k<br />

with k ∈ N. (1)<br />

p<br />

In the following, this relation shall be approved and<br />

specialized for the certain machine investigated.<br />

The back-EMF Vi is the induced voltage at no load<br />

condition (open circuit). For a coil with w numbers <strong>of</strong><br />

turns Vi is defined as follows:<br />

Vi = −w dφ d<br />

= −w<br />

dt dt (<br />

<br />

Bd A). (2)<br />

Applied to a machine’s winding, it means that the<br />

back-EMF in one coil <strong>of</strong> the winding is determined by the<br />

air gap flux density B. Therefore the back-EMF presents<br />

the same harmonic orders which appear in the spectrum<br />

<strong>of</strong> the flux density. The latter will be considered for an<br />

order analysis.<br />

β<br />

- 199 - 15th IGTE Symposium 2012<br />

The magnetic flux density at the air gap <strong>of</strong> the machine<br />

is a rotating wave which is a function <strong>of</strong> relative position<br />

at the air gap α and time t [8]. It is given as the product<br />

<strong>of</strong> the magnetomotive force Θ (MMF) and the air gap<br />

permeance Λ:<br />

B(α, t) =Θ(α, t) · Λ(α, t). (3)<br />

The functions <strong>of</strong> permeance, magnetomotive force and<br />

flux density can generally be represented by a series <strong>of</strong><br />

space and time harmonics [9]:<br />

Λ(α, t) = <br />

Λyl,zl · cos(yl · α − zl · t), (4)<br />

yl,zl<br />

Θ(α, t) = <br />

yt,zt<br />

B(α, t) = <br />

yb,zb<br />

Θyt,zt · cos(yt · α − zt · t), (5)<br />

Byb,zb · cos(yb · α − zb · t). (6)<br />

At this, ω1 is the supplying angular frequency. For<br />

reasons <strong>of</strong> illustration the phase angle is neglected.<br />

For derivation <strong>of</strong> the new harmonic orders caused by<br />

non-ideal rotor components only the rotor fundamental<br />

component <strong>of</strong> the MMF is considered, meaning yt = p<br />

and zt = ω1. Furthermore, a constant air gap width is<br />

considered, meaning yl =0and zl =0. For the faultless<br />

case, this implies:<br />

Bp(α, t) =Bp,ω1 · cos(p · α − ω1 · t). (7)<br />

The above mentioned rotor tolerances lead to an<br />

asymmetrical distribution <strong>of</strong> the air gap field. With the<br />

described approach, this means a modulation <strong>of</strong> the<br />

MMF caused by the rotor magnets. New space and time<br />

harmonics k with k ∈ N appear resulting in the following<br />

expression for the flux density considering non-ideal rotor<br />

components:<br />

Brf(α, t) =<br />

<br />

Bk · cos((p ± k) · α − (ω1 ± kωm) · t).<br />

k<br />

Here, ωm is the rotational speed with ωm = ω1<br />

p . Hence,<br />

Brf can be expressed as follows:<br />

Brf(α, t) =<br />

<br />

Bk · cos((p ± k) · α − (1 ± k<br />

) · ω1t).<br />

p<br />

k<br />

This expression indicates the new harmonic orders appearing<br />

in the spectrum <strong>of</strong> the flux density and equally<br />

in the back-EMF spectrum due to deviations at the machine’s<br />

rotor as predicted in expression (1). However, it<br />

is only valid considering one single coil [10]. Derivating<br />

the harmonics for one phase, the coil configuration needs<br />

to be considered. One phase <strong>of</strong> the investigated machine<br />

contains two coils which are displaced by 180 ◦ .<br />

According to (9) the back-EMF <strong>of</strong> the first coil in<br />

one phase assuming faulty rotor components can be<br />

(8)<br />

(9)


determined as follows:<br />

Virf1(α, t) =<br />

<br />

Vik · cos((p ± k) · 0 ◦ − (1 ± k<br />

) · ω1t).<br />

p<br />

k<br />

(10)<br />

Similarly, the back-EMF <strong>of</strong> the second coil in the same<br />

phase is:<br />

Virf2(α, t) =<br />

<br />

Vik · cos((p ± k) · 180 ◦ − (1 ± k (11)<br />

) · ω1t).<br />

p<br />

k<br />

The resulting back-EMF Virfph for one phase can be<br />

determined by the superposition <strong>of</strong> the two coils:<br />

Virfph = Virf1 + Virf2 =<br />

<br />

Vikcos((1 ±<br />

k<br />

k<br />

p )ω1t) · [1 + cos((p ± k) · 180 ◦ )].<br />

(12)<br />

For odd numbers (p ± k), (12) is equal to zero. With<br />

p =4it is obvious that only even numbers <strong>of</strong> k appear<br />

in the back-EMF spectrum.<br />

Finally, (1) can be specialized for the analyzed machine,<br />

indicating the new harmonic orders appearing in<br />

the spectrum <strong>of</strong> back-EMF in case <strong>of</strong> non-ideal rotor<br />

components:<br />

n ′ rf =1± 2k′<br />

p with k′ ∈ N. (13)<br />

For the faultless case where the air gap field is symmetrical,<br />

the appearing orders are determined by the winding<br />

arrangement [5]. Considering a three phase winding,<br />

these harmonic orders are:<br />

n =6m ± 1 with m ∈ N. (14)<br />

The mentioned new harmonic orders caused by faults<br />

appear in addition to (14).<br />

For the current, the harmonic orders can be derived<br />

analogously. Ampere’s law reveals the general relation<br />

between electrical current I and magnetic flux density<br />

B:<br />

<br />

μ0 · I = Bds. (15)<br />

In practice, the current may additionally be affected by<br />

the control system and by the inverter supply. These<br />

impacts need to be considered in order to avoid wrong<br />

interpretation <strong>of</strong> the measured signals.<br />

IV. METHODOLOGY<br />

To determine the influence <strong>of</strong> the rotor tolerances<br />

onto the back-EMF and current characteristic, numerical<br />

field simulations are used. Reliable analysis requires a<br />

sufficiently large number <strong>of</strong> experiments which means<br />

less effort performing with simulation instead <strong>of</strong> measurements.<br />

In addition, the interpretation <strong>of</strong> measurement<br />

results is difficult within this context, as for a certain<br />

prototype the real existing deviations are unknown. The<br />

intentional construction <strong>of</strong> tolerances is very difficult to<br />

realize.<br />

S<br />

- 200 - 15th IGTE Symposium 2012<br />

Back−EMF [p.u.]<br />

A. Finite Element Analysis<br />

To calculate the back-EMF, a two-dimensional timestepping<br />

Finite Element Analysis (FEA) is applied. Noload<br />

operation at a speed <strong>of</strong> 3000 rpm is assumed and the<br />

voltage is calculated by use <strong>of</strong> the time derivative <strong>of</strong> the<br />

magnetic flux, as in equation (2).<br />

For analysis, a discrete Fourier transform (DFT) is performed<br />

which yields the spectrum <strong>of</strong> back-EMF as shown<br />

in Fig. 4. For the ideal faultless case with symmetrical<br />

air gap field, harmonic orders appear according to (14).<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Back−EMF [p.u.]<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Fig. 4. Back-EMF spectrum assuming faultless case.<br />

1) Parameterization by statistical measurements: For<br />

parameterization <strong>of</strong> the FEA model, a statistical verification<br />

is performed. The back-EMF is measured for ten<br />

prototypes <strong>of</strong> the machine. The resulting first order is<br />

evaluated in form <strong>of</strong> a histogram shown in Fig. 5. Based<br />

on this results, the magnets material properties (BR)<br />

within the model are adjusted in order that simulated<br />

value <strong>of</strong> first order and mean value <strong>of</strong> measured first<br />

order agree.<br />

Absolute Frequency<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

0.99 1 1.01<br />

First order back−EMF [p.u.]<br />

Fig. 5. Measured Back-EMF histogram.<br />

B. Extended d-q model<br />

A d-q model is a common way to describe the PMSM’s<br />

dynamical behavior considering the control system. Here,<br />

the application <strong>of</strong> such a model is studied to calculate<br />

the current. Since the common d-q model only takes the<br />

fundamental wave into account, it is not able to consider<br />

non-ideal behavior such as local defects studied within<br />

this work. Hence, an extended d-q model [11] is applied


to calculate the current. Here, FEA is used to extract<br />

additional elements to extend the common d-q equations.<br />

In the following, a start-up <strong>of</strong> the machine from zero to<br />

3000 rpm is simulated and the stator current is analyzed<br />

by use <strong>of</strong> a short-time Fourier transform (STFT). This<br />

yields the spectrum including the frequency distribution<br />

over time <strong>of</strong> the non-stationary current signal. Fig. 6<br />

shows the result for the faultless case. The value <strong>of</strong><br />

current is represented by a color range, where light colors<br />

mean a high and dark colors a low value. It can be seen<br />

that the harmonic orders are the same as for the back-<br />

EMF.<br />

Fig. 6. Current spectrum assuming faultless case.<br />

Here, sine-wave excitation is assumed. With the presented<br />

model it is also possible to simulate inverter<br />

operation. However, modeling the inverter leads to a<br />

computationally expensive model. Fig. 7 shows the result<br />

for the faultless case assuming inverter supply. Due to<br />

the high intensity <strong>of</strong> computation it is illustrated with<br />

lower resolution. Besides the main harmonic orders some<br />

new orders appear. However, these do not interfere with<br />

the orders which are expected to appear correspending to<br />

(13) due to tolerances. Therefore, inverter supply is not<br />

considered within this study because <strong>of</strong> the computational<br />

costs.<br />

Fig. 7. Current spectrum assuming faultless case and inverter supply.<br />

V. SIMULATION RESULTS<br />

To develop a reliable end-<strong>of</strong>-line check, the most<br />

common and important fault modes should be captured.<br />

In the following, different approaches are applied to<br />

- 201 - 15th IGTE Symposium 2012<br />

Back−EMF [p.u.]<br />

simulate various fault scenarios. The choice <strong>of</strong> the corresponding<br />

approach depends on the particular fault, the<br />

prior knowledge <strong>of</strong> the fault and the available data.<br />

A. Worst-case analysis<br />

For PMSMs cogging torque is an undesired effect as it<br />

leads to rotational oscillations <strong>of</strong> the drive train. Cogging<br />

torque is strongly influence by deviations caused by<br />

the manufacturing process [2],[7]. However, measuring<br />

cogging torque is very time- and cost-expensive [12] and<br />

therefore no appropiate method for an end-<strong>of</strong>-line control.<br />

In the following it shall be studied how back-EMF and<br />

current are influenced at a faulty machine presenting a<br />

high value <strong>of</strong> cogging torque due to magnetization faults.<br />

Considering a deviation in the magnitude <strong>of</strong> the magnets’<br />

remanence flux density BR, the amount <strong>of</strong> variation<br />

<strong>of</strong> cogging torque is depending on which and how<br />

many permanent magnets are affected. In [13] Design<strong>of</strong>-Experiments<br />

is applied to find out the worst-case<br />

configuration <strong>of</strong> magnetization faults concerning cogging<br />

torque. Applied to the studied machine, the configuration<br />

shown in Fig. 8 presents the highest value <strong>of</strong> peak-to-peak<br />

cogging torque. Thereby, the filled magnets represent the<br />

ones that are defective. Considering a deviation at BR<br />

<strong>of</strong> -10%, the value <strong>of</strong> cogging torque is about seventimes<br />

higher compared to the reference value <strong>of</strong> the ideal<br />

machine.<br />

Fig. 8. Worst-case configuration <strong>of</strong> magnetization faults.<br />

Fig. 9 and Fig. 10 show the results for back-EMF and<br />

current in case <strong>of</strong> the described worst-case magnetization<br />

fault.<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

Faulty case<br />

Faultless case<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Back−EMF [p.u.]<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

Faulty case<br />

Faultless case<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Fig. 9. Back-EMF spectrum assuming worst-case magnetization fault.<br />

The spectra show new harmonic orders, especially<br />

n ′ rf =0.5 and n′ rf =2.5 are apparent according to (1).<br />

Compared to the faultless case the first harmonic order<br />

is reduced.


Back−EMF [p.u.]<br />

Fig. 10. Current spectrum assuming worst-case magnetization fault.<br />

B. Sample cases<br />

For the studied machine the height <strong>of</strong> the magnet can<br />

vary between 97% and 100% <strong>of</strong> its desired value and<br />

the width can vary by ± 1.5%. The dimensions <strong>of</strong> some<br />

sample magnets have been measured which are used<br />

as input data for this approach. None <strong>of</strong> the measured<br />

dimensions exceed the allowed tolerance range. Five<br />

cases are created where every magnet is subjected to<br />

the given tolerances. Fig. 11 and Fig. 12 exemplarily<br />

show the back-EMF and current spectrum <strong>of</strong> one faulty<br />

case. The first harmonic order is reduced compared to<br />

the faultless case and the new ordinal numbers appear<br />

corresponding to (13).<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

Faulty case<br />

Faultless case<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Back−EMF [p.u.]<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

Faulty case<br />

Faultless case<br />

0<br />

0 2 4 6 8<br />

Harmonic order<br />

Fig. 11. Back-EMF spectrum assuming faulty magnet dimensions.<br />

Fig. 12. Current spectrum assuming faulty magnet dimensions.<br />

When compared to the results from the worst-case<br />

magnetization fault studied in V-A, one can see that the<br />

- 202 - 15th IGTE Symposium 2012<br />

influence <strong>of</strong> deviations at the magnets’ dimensions within<br />

the allowed tolerance range is not significant. The spectra<br />

show the same specific charateristics but less amounts.<br />

For all five studied cases the simulated spectra do not<br />

differ considerably from the faultless case.<br />

C. Stochastic analysis<br />

In [14] the influence <strong>of</strong> varying qualities <strong>of</strong> the permanent<br />

magnet has been investigated applying a stochastic<br />

analysis. This is applied here to compare the influences<br />

<strong>of</strong> deviations in the magnetization magnitude and magnetization<br />

direction. Overall, 60 failure configurations are<br />

studied. For 20 cases the remanence flux density BR<br />

is assumed to be Gaussian distributed with a standard<br />

deviation <strong>of</strong> 3σ equal to 10% <strong>of</strong> the nominal value.<br />

For 20 other cases the magnetization direction is also<br />

assumed to be normally distributed with a standard<br />

deviation <strong>of</strong> 5 ◦ . The other 20 cases present both kind<br />

<strong>of</strong> deviations. Applying FEA, cogging torque and back-<br />

EMF are calculated and evaluated employing histograms.<br />

The distribution <strong>of</strong> the first harmonic order <strong>of</strong> the back-<br />

EMF is shown in Fig. 13. It shows the range in which<br />

the back-EMF is influenced because <strong>of</strong> the different<br />

magnetization deviations.<br />

Absolute Frequency<br />

Absolute Frequency<br />

Magnetization magnitude<br />

7<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

0.98 1 1.02 1.04<br />

First order back−EMF [p.u.]<br />

7<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

Magnitude and direction<br />

0<br />

0.98 1 1.02 1.04<br />

First order back−EMF [p.u.]<br />

Absolute Frequency<br />

20<br />

15<br />

10<br />

5<br />

Magnetization direction<br />

0<br />

0.98 1 1.02 1.04<br />

First order back−EMF [p.u.]<br />

Fig. 13. Back-EMF histogram considering magnetization faults.<br />

It can be seen that the influence <strong>of</strong> the deviations<br />

concerning the magnetization direction is very small. The<br />

magnitude fault is prevailing. This is to be expected for<br />

the studied machine as it presents interior magnets.<br />

The same analysis is performed for the peak-to-peak<br />

cogging torque, which is presented in Fig. 14. Again<br />

it can be concluded that the influence <strong>of</strong> magnetization<br />

direction is small when compared to the deviation in<br />

magnitude as the corresponding distribution shows.


Absolute Frequency<br />

Absolute Frequency<br />

8<br />

6<br />

4<br />

2<br />

Magnetization magnitude<br />

0<br />

0 2 4 6<br />

Peak−to−peak cogging torque [p.u.]<br />

8<br />

6<br />

4<br />

2<br />

Magnitude and direction<br />

Absolute Frequency<br />

0<br />

0 2 4 6<br />

Peak−to−peak cogging torque [p.u.]<br />

20<br />

15<br />

10<br />

5<br />

Magnetization direction<br />

0<br />

0 2 4 6<br />

Peak−to−peak cogging torque [p.u.]<br />

Fig. 14. Cogging torque histogram considering magnetization faults.<br />

Generally, magnetization faults influence cogging<br />

torque and back-EMF characteristic simultaneously [15].<br />

For both quantities new harmonic orders arise depending<br />

on the caused asymmetry in the air gap field. Based<br />

on the presented studies, it becomes apparant that the<br />

influence <strong>of</strong> magnetization tolerances on cogging torque<br />

is more significant as on back-EMF. This means, that<br />

on the one hand the machine’s behavior is strongly<br />

influenced in general. But on the other hand, as a cogging<br />

torque test is excluded for an end-<strong>of</strong>-line concept, it<br />

implies high functional requirement <strong>of</strong> the measurement<br />

devices to detect faults by use <strong>of</strong> back-EMF analysis.<br />

This shows the importance <strong>of</strong> an influence analysis such<br />

as presented in this study. It is required to study the<br />

impact <strong>of</strong> tolerances to be able to separate it towards<br />

measurement inaccuracy.<br />

VI. CONCLUSION<br />

In this work, the influence <strong>of</strong> non-ideal manufactured<br />

rotor components <strong>of</strong> a PMSM on its back-EMF and<br />

current characteristic is studied. It has been shown that<br />

electrical quantities are applicable to realize tolerance<br />

diagnosis. Especially the stator current approves to be<br />

a promising approach due to its time- and cost-saving<br />

setup. The new harmonic orders caused by rotor faults<br />

are derived within a theoretical analysis and confirmed<br />

by the simulation results.<br />

An end-<strong>of</strong>-line check could be realized in such a<br />

way that all machines presenting a certain level in these<br />

specific characteristics are rejected. With the presented<br />

methods, the range <strong>of</strong> these distinguishing characteristics<br />

can be evaluated to detect the corresponding levels for<br />

rejection. At this, measurement accuracy should be taken<br />

into account.<br />

- 203 - 15th IGTE Symposium 2012<br />

Comparing the different rotor tolerances, all present<br />

the same characteristics but with different amounts depending<br />

on the fault’s intensity and arrangement. As a<br />

feedback for manufacturing, a differentiation <strong>of</strong> various<br />

tolerances would be gainful but can not be achieved with<br />

the presented analysis. However, the focus <strong>of</strong> the end<strong>of</strong>-line<br />

check is to verify the machines’ quality which<br />

can be realized by the suggested approach. The results<br />

<strong>of</strong> this study evince to be valueable for application <strong>of</strong> an<br />

accurate quality control for PMSMs finally improving its<br />

reliability.<br />

REFERENCES<br />

[1] S. Nandi, H.A. Toliyat, and X. Li, ”Condition Monitoring and Fault<br />

Diagnosis <strong>of</strong> Electrical Motors - A Review,” IEEE Transactions on<br />

Energy Conversion, vol. 20, no. 4, pp. 710-729, December 2005.<br />

[2] L. Gasparin, A. Cernigoj, S. Markic, and R. Fiser, ”Additional<br />

Cogging Torque Components in Permanent-Magnet Motors Due<br />

to Manufacturing Imperfections,” IEEE Transactions on Magnetics,<br />

vol. 45, no. 3, pp. 1210-1213, March 2009.<br />

[3] P.J. Tavner, ”Review <strong>of</strong> condition monitoring <strong>of</strong> rotating electrical<br />

machines,” IET Electric Power Applications, vol. 2, no. 4, pp.<br />

215247, 2008.<br />

[4] W. le Roux, R. G. Harley, and T. G. Habetler, ”Detecting Rotor<br />

Faults in Low Power Permanent Magnet Synchronous Machines,”<br />

IEEE Transactions on Power Electronics, vol. 22, no. 1, pp. 322-<br />

328, January 2007.<br />

[5] D. Casadei, F. Filippetti, C. Rossi, A. Stefani, and D.J. Ewins,<br />

”Magnets faults characterization for Permanent Magnet Synchronous<br />

Motors,” IEEE International Symposium on Diagnostics<br />

for Electric Machines, Power Electronics and Drives, pp. 1-6, 2009.<br />

[6] A. Flach, F. Drager, M. Ayeb, and L. Brabetz, ”A New Approach to<br />

Diagnostics for Permanent-Magnet Motors in Automotive Powertrain<br />

Systems,” IEEE International Symposium on Diagnostics for<br />

Electrical Machines, Power Electronics and Drives, pp. 234-239,<br />

September 2011.<br />

[7] G. Heins, T. Brown, and M. Thiele, ”Statistical Analysis <strong>of</strong> the<br />

Effect <strong>of</strong> Magnet Placement on Cogging Torque in Fractional Pitch<br />

Permanent Magnet Motors ,” IEEE Transactions on Magnetics, vol.<br />

47, no. 8, pp. 2142-2148, August 2011.<br />

[8] J.R. Cameron, W.T. Thomson, and A.B. Dow, ”Vibration and current<br />

monitoring for detecting airgap eccentricity in large induction<br />

motors,” IEE <strong>Proceedings</strong> B Electric Power Applications, vol. 133,<br />

no. 3, pp. 155 - 163, May 1986.<br />

[9] B.M. Ebrahimi, J. Faiz, and M.J. Roshtkhari, ”Static-, Dynamicand<br />

Mixed-Eccentricity Fault Diagnoses in Permanent-Magnet<br />

Synchronous Motors,” IEEE Transactions on Industrial Electronics,<br />

vol. 56, no. 11, pp. 4727-4739, November 2009.<br />

[10] J. Urresty, J. Riba Ruiz, and L. Romeral, ”A Back-emf Based<br />

Method to Detect Magnet Failures in PMSMs ,” IEEE Transactions<br />

on Magnetics, July 2012.<br />

[11] T. Herold, D. Franck, E. Lange, and K. Hameyer, ”Extension<br />

<strong>of</strong> a D-Q Model <strong>of</strong> a Permanent Magnet Excited Synchronous<br />

Machine by Including Saturation, Cross-Coupling and Slotting<br />

Effects,” International Electric Machines and Drives Conference<br />

(IEMDC), pp. 1379-1383, 2011.<br />

[12] C. Schlensok, D. van Riesen, B. Schmülling, M. Schöning, and<br />

K. Hameyer, ”Cogging Torque Analysis on Permanent Magnet<br />

Machines by Simulation and Measurement,” tm - Technisches<br />

Messen, vol. 74, no. 7-8, pp. 393-401, August 2007.<br />

[13] I. Coenen, M. van der Giet, and K. Hameyer, ”Manufacturing Tolerances:<br />

Estimation and Prediction <strong>of</strong> Cogging Torque Influenced<br />

by Magnetization Faults,” IEEE Transactions on Magnetics, vol.<br />

48, no. 5, pp. 1932-1936, May 2012.<br />

[14] I. Coenen, M. Herranz Gracia, and K. Hameyer, ”Influence and<br />

evaluation <strong>of</strong> non-ideal manufacturing process on the cogging<br />

torque <strong>of</strong> a permanent magnet excited synchronous machine,”<br />

COMPEL, vol. 30, no. 3, pp. 876-884, 2011.<br />

[15] K. Kim, S. Lim, D. Koo, and J. Lee, ”The Shape Design <strong>of</strong><br />

Permanent Magnet for Permanent Magnet Synchronous Motor<br />

Considering Partial Demagnetization,” IEEE Transactions on Magnetics,<br />

vol. 42, no. 10, October 2006.


- 204 - 15th IGTE Symposium 2012<br />

<br />

<br />

Hai Van Jorks, Erion Gjonaj and Thomas Weiland<br />

TU Darmstadt, Institute <strong>of</strong> Computational Electromagnetics, Schloßgartenstraße 8, 64289 Darmstadt, Germany<br />

Abstract— High frequency eddy currents are investigated and the Common Mode Input Impedance <strong>of</strong> a PWM controlled<br />

induction motor is calculated from finite element simulations. In order to determine machine parameters accurately, two<br />

modelling approaches are compared. The first is a two-dimensional simulation approach where iron core lamination effects<br />

are included by means <strong>of</strong> an equivalent material approximation. The second approach consists in fully three-dimensional<br />

analysis taking into account explicitly the eddy currents induced in the laminations. It is shown that homogenised equivalent<br />

material models may lead to large errors in the calculation <strong>of</strong> machine inductances, especially at high frequencies. However,<br />

the Common Mode Input Impedance, which is the final parameter <strong>of</strong> interest, seems to be less affected by the lamination<br />

modelling.<br />

Index Terms—eddy currents, finite element, lamination, PWM<br />

I. INTRODUCTION<br />

In modern drive systems fast switching inverters are<br />

the source <strong>of</strong> high frequency common mode voltages at<br />

the motor terminals. Due to stray capacitances between<br />

windings and grounded iron parts <strong>of</strong> the machine, a<br />

common mode current is excited and for its part may<br />

cause circulating bearing currents which may damage the<br />

bearing [1]. The phenomena can be described by<br />

equivalent circuit representation which employs the<br />

frequency dependent Common Mode Input Impedance<br />

being the ratio <strong>of</strong> common mode voltage and current<br />

<br />

Common Mode Input Impedance can be computed<br />

using <br />

<br />

<br />

Figure 1: Lumped parameter model <strong>of</strong> a two conductor system.<br />

Parameters can be gathered in the impedance and capacitance matrix<br />

Firstly, the stray capacitances are extracted from<br />

electrostatic and winding impedances from<br />

magnetoquasistatic simulations. Ohmic conductivity <strong>of</strong><br />

winding insulation is negligible.<br />

Secondly, the winding scheme is taken into account to<br />

match the corresponding voltages and currents at the<br />

front and rear end <strong>of</strong> the machine. At this point, endwinding<br />

inductances can be included in the model, but<br />

require distinct modelling approaches and are, therefore,<br />

neglected in the present analysis.<br />

Considering the laminated middle part <strong>of</strong> the motor, it<br />

is common to use two-dimensional (2D) models <strong>of</strong> the<br />

motor cross-section to compute the self and coupling<br />

impedances. In earlier literature it is proposed to apply<br />

the finite element (FE) method within a single stator slot<br />

while the magnetic field was assumed to be zero outside<br />

the slot perimeter [2]. However, as shown in a recent<br />

investigation [3], strong inductive coupling at high<br />

frequencies (several to ) may occur even<br />

between distant slots. The effect is caused by the core<br />

lamination, which, despite the small skin depth (7),<br />

promotes the spreading <strong>of</strong> high frequency magnetic<br />

fluxes over the iron sheets’ surfaces. In [4] the lamination<br />

was approximated by an additional impedance.<br />

Moreover, it is possible to include the lamination already<br />

in the FE model. Therefore, we consider the entire motor<br />

cross section in the magnetoquasistatic simulations.<br />

In the case <strong>of</strong> 2D analysis, modelling the laminated<br />

core in a plane requires the application <strong>of</strong><br />

homogenization techniques. We investigated the accuracy<br />

<strong>of</strong> the widely used formulation (2) for a broad frequency<br />

range . The reference solution was<br />

obtained from fully three-dimensional (3D) simulations.<br />

Since resolving the small skin depth in motor laminations<br />

in a 3D mesh is computationally very costly, general<br />

purpose simulation s<strong>of</strong>tware cannot be utilized. On that<br />

account, we developed a specialised 3D FE simulation<br />

tool, which takes advantage <strong>of</strong> the periodicity <strong>of</strong> the<br />

lamination stack, but otherwise does not use any<br />

approximation on motor geometry or on the material<br />

properties <strong>of</strong> the laminated core.<br />

II. 2D LAMINATION MODELLING<br />

A well-known homogenization model for laminated<br />

cores [5] utilizes a frequency dependent equivalent<br />

permeability for the iron core given by,<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

where 0r is the permeability <strong>of</strong> iron, 2b the thickness <strong>of</strong><br />

the plate and the skin depth at a given frequency. The<br />

magnetic field problem for the homogenized core is<br />

reduced to a planar problem. While this approach allows<br />

for efficient 2D FE modelling, accuracy at higher<br />

frequencies may not be sufficient. Figure 2 shows the Bfield<br />

plot <strong>of</strong> the motor model obtained from simulations


with “FEMM” [6], a 2D open source s<strong>of</strong>tware which<br />

employs the approach (2). In order to obtain self and<br />

mutual impedances <strong>of</strong> the multi-conductor system, only<br />

one conductor was excited by a current. The spreading <strong>of</strong><br />

the flux over the lamination as well as across the periodic<br />

boundary <strong>of</strong> the computational model can be observed.<br />

The impedance matrix was extracted and will be<br />

compared to the 3D reference in Section IV.B.<br />

Figure 2: 2D model <strong>of</strong> cross-sectional motor geometry (60° section)<br />

with magnitude <strong>of</strong> magnetic flux density at 1 MHz<br />

III. FULLY 3D FE ANALYSIS<br />

A. 3D FE formulation<br />

Maxwell’s equations in frequency domain expressed<br />

by a magnetic vector potential yield<br />

<br />

<br />

<br />

where is the permeability, the conductivity, is a<br />

voltage gradient used for excitation. Two types <strong>of</strong><br />

boundary conditions on are applied. Firstly,<br />

<br />

where is the unit normal vector and is the<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

triangular prims are chosen, because they allow for an<br />

efficient discretisation <strong>of</strong> the very thin iron sheet (Fig. 3).<br />

Applying Galerkin’s method to (3) the matrix equation <br />

<br />

is generated, where the complex matrix combines the<br />

discrete operators corresponding to the left hand side <strong>of</strong><br />

(3), vector holds the degrees <strong>of</strong> freedom (DOFs) <strong>of</strong><br />

the vector potentials and vector the exciting currents.<br />

In the case <strong>of</strong> voltage excitation <strong>of</strong> individual conductors<br />

the relation<br />

<br />

- 205 - 15th IGTE Symposium 2012<br />

can be employed [7], where <br />

, is the<br />

<br />

angular frequency and is the coupling matrix for the<br />

vector <strong>of</strong> the wire voltages . The<br />

important case where a single conductor n is exited can<br />

be obtained by setting in (6) and for all<br />

other conductors (see also Section IV.A).<br />

B. Reduction <strong>of</strong> the problem size<br />

A high frequency 3D FE model <strong>of</strong> the complete motor<br />

geometry is still far beyond today’s computing capacities.<br />

But even if we consider just a slice <strong>of</strong> the motor<br />

with the thickness <strong>of</strong> half a lamination sheet ,<br />

an appropriate 3D discretisation will lead to several<br />

million mesh cells. In order to model eddy currents at<br />

frequencies up to the discretisation has to<br />

resolve the small skin depth in the high-permeability iron<br />

<br />

<br />

<br />

In the analysis, the following parameters were used:<br />

,<br />

,<br />

where is is the electrical conductivity and the<br />

permeability <strong>of</strong> iron, respectively.<br />

Parallelization <strong>of</strong> our 3D FEM code is a key feature,<br />

nevertheless, further reduction <strong>of</strong> the problem size is<br />

particularly important. In the case <strong>of</strong> common mode<br />

excitation <strong>of</strong> a 3-phase 4-pole induction machine, the<br />

field pattern in the motor cross section is periodic with<br />

respect to a 60° rotation around the longitudinal axis <strong>of</strong><br />

the motor. This reduces the computational domain to a<br />

60° section while periodic boundary conditions are<br />

applied to the cut planes (Fig. 3).<br />

Figure 3: Simulation mesh and magnitude <strong>of</strong> the magnetic flux density<br />

at 1 MHz for the 3D motor model. For better visibility, the model is<br />

scaled by a factor 100 in the transversal direction.<br />

IV. IMPEDANCE MATRIX CALCULATION<br />

A. Extraction procedure<br />

The standard procedure to extract the impedances <strong>of</strong><br />

the cross-sectional conductors from FE analysis, is to<br />

excite the -th conductor with a current <strong>of</strong> and set all<br />

the other conductors to . After running the simulation,<br />

the induced voltages in all conductors have to be<br />

computed from the magnetic vector potential solution.


This procedure has to be repeated for all <br />

conductors. A section <strong>of</strong> the analyzed <br />

induction motor holds 120 stator and 8 rotor conductors.<br />

If a general purpose FEM s<strong>of</strong>tware is used, this leads to a<br />

large computational overhead, which makes the method<br />

inconvenient. However, the extraction procedure can be<br />

condensed into a single simulation cycle. In this way,<br />

common steps like loading <strong>of</strong> the mesh, setup <strong>of</strong> the curlcurl<br />

matrix and its LU decomposition have to be<br />

performed only once (see Fig. 4). Referring to the 3D<br />

simulation <strong>of</strong> the motor cross section, a speedup factor <strong>of</strong><br />

could be obtained. As was shown in [7],<br />

impedance matrix can be computed from<br />

<br />

Still, in order to avoid the explicit inversion <strong>of</strong> the sparse<br />

matrix , the equation system (5) has to be solved <br />

times, with being the number <strong>of</strong> conductors. A detailed<br />

overview <strong>of</strong> the implemented algorithm is depicted in<br />

Fig. 4.<br />

Figure 4: Flowchart <strong>of</strong> computational algorithm<br />

B. Simulation results<br />

The same motor geometry is used in the 2D (Fig. 2)<br />

and the 3D (Fig. 3) case. The simulation time <strong>of</strong> a 3D<br />

model with DOFs on a cluster with 60 nodes<br />

was for a sweep <strong>of</strong> 7 frequency points. Figure 5<br />

shows the magnitude <strong>of</strong> self-impedance <strong>of</strong> one stator<br />

conductor extracted from 2D and 3D simulations. The 2D<br />

solution which employs the lamination model (equivalent<br />

- 206 - 15th IGTE Symposium 2012<br />

permeability) shows major discrepancies from the 3D<br />

reference case. For additional validation <strong>of</strong> the simulation<br />

models, the same geometry was tested without laminated<br />

materials, i.e. the iron core forms a massive block. Thus,<br />

2D analysis does not require the lamination formulae and<br />

therefore should give the same results as 3D analysis. The<br />

corresponding impedance is shown in Fig. 6. 2D and 3D<br />

impedances can be found in very good agreement.<br />

Z / Ω<br />

10 0<br />

10 -7<br />

9%<br />

10 1<br />

Figure 5: Laminated iron with . Comparison <strong>of</strong> conductor<br />

impedance extracted from field simulations and relative error between<br />

2D and 3D results.<br />

Z / Ω<br />

10 -1<br />

10 -2<br />

10 -3<br />

10 -4<br />

10 -5<br />

10 -6<br />

10 -3<br />

10 -4<br />

10 -5<br />

10 -6<br />

10 0<br />

10 -7<br />

0.00%<br />

53%<br />

0.02%<br />

10 1<br />

Figure 6: Massive iron with . Comparison <strong>of</strong> conductor<br />

impedance extracted from field simulations and relative error between<br />

2D and 3D results.<br />

V. COMMON MODE INPUT IMPEDANCE<br />

A. Assembling <strong>of</strong> transmission line model<br />

The state <strong>of</strong> a multi-conductor line in the frequency<br />

domain is described by telegrapher’s equations. Taking<br />

into account the lumped parameter approximation, three<br />

matrix equations are obtained<br />

<br />

Z 1,1 laminated μ r,Fe = 1000<br />

94%<br />

10 2<br />

10 2<br />

97%<br />

10 3<br />

f / Hz<br />

10 3<br />

f / Hz<br />

<br />

<br />

<br />

<br />

where , , and are the voltage and<br />

current vectors at the front (z=0) and rear end (z=l) <strong>of</strong> the<br />

95%<br />

10 4<br />

Z 1,1 massive iron μ Fe = 1000<br />

0.13%<br />

1.82%<br />

6.43%<br />

10 4<br />

88%<br />

10 5<br />

2D<br />

3D<br />

9.58%<br />

10 5<br />

2D<br />

3D<br />

80%<br />

6.04%<br />

10 6<br />

10 6


motor, respectively, and and <br />

<br />

<br />

are the lumped element matrices in the pi-equivalent<br />

circuit (Fig. 1). The vector holds the currents through<br />

the impedances and can be eliminated by inserting one<br />

equation into the others. In the next step, the winding<br />

scheme is taken into account which further reduces the<br />

degrees <strong>of</strong> freedom in the system (9a-c). The proceeding<br />

can be found in [3]. It is important to set the current at the<br />

star point <strong>of</strong> the machine to zero, according to the<br />

common mode measuring setup, where the star point is<br />

not grounded. When solving the final equation system, a<br />

given voltage at the motor terminals will yield a certain<br />

input current. The ratio <strong>of</strong> the two quantities is the<br />

Common Mode Input Impedance . B. 240 kW induction motor<br />

The 2D and 3D plots (Fig. 7) show very good<br />

agreement, despite the deviations in the impedance<br />

matrices (Fig. 5). This is understandable since the stray<br />

capacitances are dominant in the common mode circuit <strong>of</strong><br />

the machine. However, impedances are important to<br />

predict resonance points. The frequency <strong>of</strong> the first<br />

resonance in the 2D curve is shifted by 6 kHz while its<br />

magnitude differs by 4%.<br />

Z / Ω<br />

10 3<br />

10 2<br />

10 1<br />

10 3<br />

first resonance<br />

| Z com | 2D vs. 3D<br />

10 4<br />

f / Hz<br />

Figure 7: Comparison <strong>of</strong> the Common Mode Impedance between 2D<br />

and 3D results.<br />

VI. CONCLUSION<br />

We developed a specialized 3D FE simulation code<br />

which is able to efficiently extract the impedance matrix<br />

<strong>of</strong> a multi-conductor setup, e.g. a motor cross section.<br />

Since the laminated iron is modelled with its actual<br />

geometry and material properties, 3D results are more<br />

accurate than those from 2D simulations with<br />

homogenized material approach. Finally, capacitance<br />

matrix, impedance matrix and the winding scheme are<br />

combined to obtain the frequency-dependent Common<br />

Mode Input Impedance <strong>of</strong> the machine. It can be<br />

observed that the solution is dominated by the<br />

10 5<br />

2D<br />

3D<br />

10 6<br />

- 207 - 15th IGTE Symposium 2012<br />

capacitances and, therefore, less sensitive to inaccuracies<br />

in the impedance matrix. In future work the nonlinear<br />

properties <strong>of</strong> the iron will be considered in the<br />

simulations and the end regions <strong>of</strong> the motor will be<br />

included in the Transmission Line Model.<br />

Acknowledgement This work is founded by the<br />

Deutsche Forschungsgemeinschaft (DFG) under the<br />

collaborative research group grant FOR 575.<br />

[1]<br />

REFERENCES<br />

S. Chen, T.A. Lipo, D. Fitzgerald, “Source <strong>of</strong> induction motor<br />

bearing currents caused by PWM inverters”, IEEE Trans. on<br />

Energy Conv., Vol. 11, Iss. 1, 1996, pp. 25–32.<br />

[2] I. Boldea, S. A. Nasar, “The induction machine handbook”, CRC<br />

Press, 2002.<br />

[3] H. Jorks, E. Gjonaj, T. Weiland, O. Magdun, “Three-dimensional<br />

simulations <strong>of</strong> an induction motor including eddy current effects in<br />

core laminations”, IET Science, Measurement & <strong>Technology</strong>, Vol.<br />

6, Iss. 5, Sep. 2012, pp. 344 – 349.<br />

[4] P. Maeki-Ontto, J. Luomi, “Induction motor model for the analysis<br />

<strong>of</strong> capacitive and induced shaft voltages”, Proc. <strong>of</strong> IEMDC '05,<br />

May 2005, pp. 1653-1660.<br />

[5] R.L. Stoll, “The analysis <strong>of</strong> eddy currents”, Oxford <strong>University</strong><br />

Press, 1974.<br />

[6] FEMM by David Meeker, version 4.2, available at<br />

[7]<br />

www.femm.info.<br />

A. Bossavit, "Two dual formulations <strong>of</strong> the 3-D eddy-current<br />

problem", COMPEL, Vol. 4, Iss. 2, 1985, pp. 103 – 116.<br />

[8] H. De Gersem, O. Henze, T. Weiland, A. Binder, “Simulation <strong>of</strong><br />

wave propagation effects in machine windings”, COMPEL, Vol.<br />

29, Iss. 1, 2010, pp. 23 – 38.


- 208 - 15th IGTE Symposium 2012<br />

Computation <strong>of</strong> end-winding inductances <strong>of</strong> rotating<br />

electrical machinery through three-dimensional<br />

magnetostatic integral FEM formulation<br />

F. Calvano 1 , G. Dal Mut 2 , F. Ferraioli 2 , A. Formisano 3 , F. Marignetti 4 ,<br />

R. Martone 3 , G. Rubinacci 1 , A. Tamburrino 4 and S. Ventre 4<br />

1 Dip. di Ingegneria Elettrica, Università di Napoli Federico II, Via Claudio 25, I-80124, Naples, Italy<br />

2 Ansaldo Energia, Via N. Lorenzi 8, I-16152, Genova, Italy<br />

3 Dip. di Ing. Industriale e dell’Informazione, Seconda Università di Napoli, Via Roma 29, I-81031, Aversa (CE), Italy<br />

4 Dip. di Ing. Elettrica e dell’Informazione, Univ. di Cassino e del Lazio Merid., Via Di Biasio 43, I-03043, Cassino,<br />

Italy<br />

Abstract—An effective numerical technique to calculate end-winding inductances <strong>of</strong> rotating electrical machinery is presented.<br />

The algorithm is based on a 3D integral formulation; it allows to take into account non linearities, relative speed between<br />

stator and rotor and is well suited for the treatment <strong>of</strong> regions with large air volumes. Numerical implementation concerning<br />

the analysis <strong>of</strong> a large synchronous generator highlights the advantages <strong>of</strong> the proposed method. The aim <strong>of</strong> the paper is to<br />

assess, by means <strong>of</strong> an accurate 3D model, the correction factor to be applied to the inductances computed through 2D models<br />

to take into account the effects due to end windings.<br />

Index Terms— End windings, Integral FEM approach, Inductances, Flux density numerical computation, Synchronous<br />

generators.<br />

I. INTRODUCTION<br />

Although the computation <strong>of</strong> the inductances <strong>of</strong><br />

rotating electrical machinery is a key issue both in the<br />

design stage and in performance assessment, its accurate<br />

calculation by Finite Elements approaches is still an open<br />

problem.<br />

The inductances can be split into two contributions:<br />

main and leakage inductances.<br />

The end-winding effect affects both the contributions:<br />

main and, especially, leakage inductances. End-winding<br />

inductances are at the base <strong>of</strong> both the steady-state<br />

operation and the dynamical behavior <strong>of</strong> electrical<br />

machinery [1-3]. The main inductances are related to the<br />

flux linkages while the leakage inductances can be<br />

divided into slot inductances, tooth tip inductances,<br />

skewing inductances and zigzag leakage inductances. In<br />

large machines, end windings contribute significantly to<br />

the values <strong>of</strong> both main and more significantly leakage<br />

inductances [4].<br />

The contributions to the inductances due to the active<br />

length <strong>of</strong> the conductor can easily be computed either<br />

numerically, by standard 2D FEM analyses, or<br />

analytically by means <strong>of</strong> infinite length models. On the<br />

contrary, the end winding contribution can only be<br />

computed from the actual 3D magnetic field distribution.<br />

Most analytical approaches to the magnetic field<br />

computation [5, 6], including the most recent ones [7], are<br />

based on the solution <strong>of</strong> the Biot-Savart law through<br />

equivalent current sheet representations, using<br />

axisymmetric hypothesis [8] or the theory <strong>of</strong> images [9].<br />

Both three dimensional and two-dimensional<br />

techniques based on FEM can be used as an alternative,<br />

but they also rely on rough approximations to reduce the<br />

number <strong>of</strong> nodes [10,11]. End regions fields and fluxes<br />

can be very complex to be computed, especially for large<br />

power machines, where end regions may occupy up to<br />

one third <strong>of</strong> the total machine length.<br />

The aim <strong>of</strong> the paper is to propose a method based on<br />

the use <strong>of</strong> an accurate 3D FEM simulation to improve the<br />

accuracy <strong>of</strong> the standard 2D model achieved via a<br />

commercial s<strong>of</strong>tware.<br />

The 3D FEM technique here proposed takes advantage<br />

from an integral formulation implemented in an noncommercial<br />

code. Such an approach has been previously<br />

applied to compute end winding forces [12,13].<br />

In this paper the analysis <strong>of</strong> a large synchronous<br />

generator is considered as an example. Field simulations<br />

in different working conditions are used to assess the<br />

influence <strong>of</strong> end effects on flux linkages.<br />

The main advantages <strong>of</strong> the proposed approach are<br />

manifold: (1) a reduced number <strong>of</strong> elements is required to<br />

model the end regions; because the integral formulations<br />

do not require the discretization <strong>of</strong> the air region but,<br />

rather, <strong>of</strong> the material regions only (conducting and/or<br />

magnetic materials); (2) neighboring elements do not<br />

need to share nodes allowing for more freedom in<br />

meshing complex geometries; (3) the formulation<br />

provides an inherent capability to include air-spaced<br />

moving parts, as no interface mesh is needed [13].<br />

This approach is therefore particularly effective to<br />

model generator end regions because it can also take into


account rotor motion and magnetic nonlinearities.<br />

The paper is organized as follows: Section II presents<br />

the basis <strong>of</strong> the integral formulation and its numerical<br />

implementation. Flux expressions coming from integral<br />

formulation are also discussed. Then, the 3D correction<br />

with respect to 2D fluxes and inductances is introduced in<br />

Section III.<br />

Such formulation is used in Section IV to look for the<br />

3D flux density in the end regions <strong>of</strong> a large synchronous<br />

generator. The 3D correction to the 2D quantities are then<br />

performed according to the proposed formulation.<br />

Section IV contributes in particular to extend the<br />

knowledge <strong>of</strong> rotating electrical machinery by providing a<br />

powerful numerical tool to compute lumped parameters in<br />

the analytical model <strong>of</strong> synchronous generators with a<br />

precision superior to that achieved by classical 2D finite<br />

element models. As matter <strong>of</strong> fact, with the proposed<br />

model, the contribution to terminal quantities such as the<br />

reactances coming from the end winding region can be<br />

accurately taken into account. Terminal quantities are<br />

finally compared with experimental measurements.<br />

II. INTEGRAL FORMULATION AND ITS NUMERICAL<br />

IMPLEMENTATION<br />

It is well known that for a synchronous machine<br />

operating at steady state for the computation <strong>of</strong> the<br />

inductances it is sufficient to refer to a nonlinear<br />

magnetostatic model [16, 17]. The currents in the stator<br />

and in the rotor coils are supposed to be assigned for any<br />

position <strong>of</strong> the rotor in order to focus the attention on the<br />

magnetostatic problem formulation. However for assigned<br />

voltage, active and reactive power field currents and<br />

inductances can be computed by solving a sequence <strong>of</strong><br />

non linear mangetostatic problems [17]. In any case it is<br />

possible to neglect the effects <strong>of</strong> the eddy currents in the<br />

massive conductive parts <strong>of</strong> the device.<br />

The numerical model is based on an integral<br />

formulation <strong>of</strong> the nonlinear magnetostatic problem in<br />

terms <strong>of</strong> the unknown magnetization M. The solution is<br />

obtained by means <strong>of</strong> a Picard-Banach iteration whose<br />

convergence can be theoretically proved when the<br />

magnetic constitutive equation is uniformly monotonic<br />

and Lipschitzian [14, 15].<br />

In particular, by using the Biot-Savart law, the<br />

magnetic induction can be expressed in terms <strong>of</strong> its<br />

sources as:<br />

( ) = <br />

( ) =<br />

ˆ S ( ) +<br />

0 ( )<br />

μ0<br />

(<br />

( r−r') ) 3<br />

Br Mr B r<br />

μ Mr − ∇⋅ 'Mr' 4π Vf<br />

r−r' dV '+<br />

μ<br />

( r−r') + ( ') ⋅ ˆ ( ') dS ', for∈V<br />

3<br />

f<br />

4 Mr nr<br />

r<br />

r−r' 0<br />

π ∂Vf<br />

- 209 - 15th IGTE Symposium 2012<br />

(3.1)<br />

where BS is the magnetic induction produced in the free<br />

space by the stator and rotor currents, Vf is the region<br />

filled by the magnetic materials, ∂Vf is its boundary and<br />

ˆn is the outward unit normal on ∂Vf.<br />

The nonlinear constitutive relationship in Vf can be<br />

expressed by introducing the local operator as<br />

M(r)=[B(r)] in Vf<br />

(3.2)<br />

Therefore M is the solution <strong>of</strong> the following nonlinear<br />

problem:<br />

M(r)=[M] in Vf<br />

(3.3)<br />

As shown in [14], the operator is a contraction if is<br />

uniformly monotonic and Lipschitzian. Therefore, the<br />

solution exists, is unique and can be found by the fixed<br />

point iteration.<br />

From the numerical point <strong>of</strong> view, the magnetization<br />

can be expressed in terms <strong>of</strong> piece-wise constant vector<br />

shape functions in each elementary volume arising the<br />

after discretization <strong>of</strong> Vf, such as<br />

M<br />

() r = M jP<br />

j () r in V f<br />

j<br />

(3.4)<br />

where the Pj's are discontinuous shape functions obtained<br />

multiplying the scalar pulse functions pk's (pk = 1 in the kth<br />

element and it is zero elsewhere) by the (three) unit<br />

vectors along the coordinate axes.<br />

The numerical model is obtained by applying the<br />

Galerkin method to (3.3), rewritten as -1 [M]=[M] in<br />

Vf:<br />

-1<br />

Pi [ M] Pi [<br />

M]<br />

⋅ dV = ⋅ dV, ∀i<br />

Vf Vf<br />

The fixed point iteration is therefore rewritten as:<br />

<br />

V f<br />

k+<br />

1<br />

k<br />

[ M ] dV Pi<br />

⋅ [ M ]<br />

-1<br />

Pi ⋅ <br />

<br />

V f<br />

V f<br />

<br />

P ⋅ P dV<br />

i<br />

i<br />

=<br />

<br />

V f<br />

P ⋅ P dV<br />

i<br />

i<br />

dV<br />

, ∀i<br />

(3.5)<br />

(3.6)<br />

where the subscript k indicates the approximation <strong>of</strong> M<br />

and B at the k-th iteration. Note that being the term<br />

the volume <strong>of</strong> the i-th element, the r.h.s <strong>of</strong><br />

P ⋅ P dV<br />

<br />

V f<br />

i j<br />

(3.6) is the average <strong>of</strong> the magnetic induction B k =[M k ]<br />

in the i-th elementary volume at the iteration k. Being the<br />

magnetization piece-wise constant, eq. (3.6) can be solved<br />

for M k+1 in each element, by applying the constitutive<br />

relation to the average magnetic induction in the same<br />

element.<br />

Therefore, after discretization, (3.6) gives rise to the<br />

following fixed point iteration [14, 15]:<br />

( )<br />

k −1<br />

k<br />

B = D EM + W<br />

(3.7)


where:<br />

( )<br />

k+ 1<br />

k<br />

M = G B<br />

(3.8)<br />

( ) ⋅ ( ) ( ) ⋅ ( )<br />

μ ˆ ˆ ' '<br />

0 nr Pi r nr Pj r <br />

Eij = μ0Dij−<br />

<br />

dSdS'<br />

4π <br />

r−r' ∂Vi ∂Vj<br />

<br />

Dij<br />

= Pi⋅PjdV (3.9)<br />

Vf<br />

<br />

Wi<br />

= Pi⋅BSdV Vf<br />

Vi is the volume <strong>of</strong> the i-th element, M k the column<br />

vector <strong>of</strong> the coefficients in (3.4) at the k-th iteration, B k<br />

the column vector made by the average magnetic<br />

induction in the elements and G is the global relationship<br />

corresponding to after the discretization process.<br />

The flux Φn linked with the n-th circuit <strong>of</strong> volume τ and<br />

produced by the currents flowing in a set <strong>of</strong> coils is<br />

defined as:<br />

<br />

Φ n = Ar () ⋅Jn()<br />

r dτ<br />

τ n<br />

(3.10)<br />

where A is the magnetic vector potential associated to all<br />

the sources and Jn is the current density associated to the<br />

unit current impressed in the n-th circuit.<br />

This definition is consistent with the definition <strong>of</strong> the<br />

magnetic energy in the linear case and with the definition<br />

<strong>of</strong> the flux linked with a circuit <strong>of</strong> infinitely small crosssection.<br />

As a matter <strong>of</strong> fact, in this case it results:<br />

Jn() r<br />

Φ n = Ar () ⋅ dτ<br />

=<br />

I<br />

τ n<br />

n<br />

In<br />

1<br />

= () ˆ <br />

Ar ⋅ tSd<br />

n =<br />

S<br />

n<br />

n I<br />

γ<br />

n<br />

= Ar () ⋅tˆd<br />

<br />

γ n<br />

(3.11)<br />

being γn the closed curve defining the axis <strong>of</strong> the circuit,<br />

ˆt the unit tangent vector and In the unit current flowing<br />

in the n-th filamentary circuit.<br />

The magnetic vector potential appearing in (3.11) can<br />

be calculated from both free and (magnetic) polarization<br />

currents after (3.3) is solved with suitable boundary<br />

conditions by applying the Biot-Savart law for the<br />

magnetic vector potential:<br />

( ) = ˆ ( )<br />

Ar r<br />

μ<br />

<br />

( ', t)<br />

× ( − ')<br />

Mr r r<br />

0 AS+ dV', for<br />

r∈V<br />

3<br />

f<br />

4π V r−r' f<br />

(3.12)<br />

In (3.12) A has been written as the sum <strong>of</strong> the<br />

contribution <strong>of</strong> the free and magnetizing currents. All the<br />

procedure is quite time consuming when high number <strong>of</strong><br />

unknowns are treated; however a very effective<br />

computational tool has been recently proposed [18] based<br />

on a suitable use <strong>of</strong> high performance computing<br />

- 210 - 15th IGTE Symposium 2012<br />

architecture.<br />

III. THE 3D CORRECTION TO THE 2D SOLUTION<br />

In the case here treated the 2D solution provides an<br />

accurate solution in a large part <strong>of</strong> the domain <strong>of</strong> interest.<br />

As a consequence, the 3D analysis can be limited just to<br />

the region where it is really required.<br />

The classical theory [19], <strong>of</strong> the electrical machinery<br />

suggests to split fluxes and inductances in a 2D and 3D<br />

contributions, each corresponding to one <strong>of</strong> the two<br />

geometrical parts <strong>of</strong> the system geometry.<br />

Unfortunately such a separation is rather questionable<br />

and ambiguous. Then here a quite different approach is<br />

suggested: the actual 3D magnetic flux linked with a close<br />

line, is split in (a) the 2D part Φn (2D) (evaluated by 2D<br />

flux per the unit length, in the axial symmetrical region,<br />

multiplied for the length) and (b) the complement ΔΦn (3D)<br />

defined as the 3D correction requested for the case at<br />

hand. Similar consideration can be applied to other<br />

parameter including the main or flux leakage coefficients.<br />

IV. NUMERICAL EXAMPLE<br />

Among the rotating electrical machinery, large turbogenerators<br />

are endowed with quite long end windings<br />

which contribute poorly to produce linkage flux but,<br />

unfortunately, to produce significant leakage flux [20].<br />

Such a contribution is generally evaluated by simplifying<br />

considerably the complex geometry and, in addition, by<br />

neglecting the non linearity’s [7].<br />

The integral approach presented in the Section II is a<br />

powerful tool able provide a deeper analysis <strong>of</strong> the<br />

machinery and to overcome both limitations. This kind <strong>of</strong><br />

information is particularly relevant in the design process<br />

<strong>of</strong> the turbine-generator where the flux leakage has a<br />

preeminent significance.<br />

In the following a numerical example is presented in<br />

order to assess the consistency <strong>of</strong> the 3D corrections to be<br />

considered for different operating conditions.<br />

The turbine generator simulated has a rated apparent<br />

power in the range 300-350 MVA depending on the room<br />

temperature affecting the cooling system. It has two poles<br />

and the nominal frequency is 50 Hz.<br />

Some details <strong>of</strong> the finite element mesh <strong>of</strong> the iron<br />

regions denoted (Vf ) as well as <strong>of</strong> the field and armature<br />

coils, are reported in Fig.1.<br />

The 3D FEM model is characterized by a number <strong>of</strong><br />

45593 nodes (24201 in the iron region Vf including stator<br />

and rotor iron as well as the enclosure) and 17774<br />

elements (12478 in Vf ). It is worth noticing that the a non<br />

conformal mesh <strong>of</strong> the iron regions Vf has been adopted in<br />

order to exploit some additional geometrical degrees <strong>of</strong><br />

freedom in the sub-regions where a particular refinement<br />

<strong>of</strong> the mesh is necessary.


Armature coil<br />

Field coil<br />

Stator<br />

iron<br />

Rotor<br />

iron<br />

Enclosure<br />

Figure 1: Section <strong>of</strong> the Finite element Mesh used in the<br />

computations.<br />

Boundary conditions. In principle the complete<br />

geometry <strong>of</strong> the machine should be treated. However just<br />

a part has been considered while the effect <strong>of</strong> the<br />

remaining part has been forced by suitable boundary<br />

conditions in a cutting plane in the region where the 3D<br />

solution actually meets the 2D approximation.<br />

Material characterization. The magnetic iron properties<br />

has been represented in the (3.2) form by substituting the<br />

BH curve in B=μ0(H+M).<br />

In the following the no load operation mode is<br />

considered: the rotor is assumed to rotate at the nominal<br />

angular speed, a DC current is applied to the field coil<br />

and an open circuit is imposed to the armature terminals.<br />

Notice that, for a synchronous machine such an<br />

operative condition can be examined by means <strong>of</strong> just a<br />

single magneto-static problem. As a matter <strong>of</strong> fact, such<br />

solution provides as many samples <strong>of</strong> the time evolution<br />

<strong>of</strong> the terminal voltage as the number <strong>of</strong> the stator slot if<br />

the stator winding is a double layer one [16].<br />

Of course the non-linear iron effects affects the main<br />

flux and, consequently, the output voltages. Therefore the<br />

magnetic analysis has been repeated for several rotor<br />

currents, including, 100, 500 and 1000A, respectively.<br />

In particular the field current 500 A corresponds to the<br />

rated voltage at the generator terminals. In the following,<br />

the 500 A case is discussed in details while the other two<br />

currents are considered just to evaluate the saturation<br />

effect on ΔΦn (3D) .<br />

The three-dimensional distribution <strong>of</strong> the magnetic<br />

vector potential on the field and armature coils are<br />

sketched in figs. 2, 3 and the flux density in figs. 4, 5,<br />

assuming the boundary condition imposed on the<br />

symmetry plane and an excitation current <strong>of</strong> 500 A.<br />

Flux waveforms as well as their amplitude spectrum is<br />

then calculated according to (3.11); the results are<br />

reported in fig. 6. In order to look for higher harmonics, a<br />

spectrum analysis <strong>of</strong> the principal flux has been<br />

performed (fig. 7).<br />

- 211 - 15th IGTE Symposium 2012<br />

Figure. 2: amplitude <strong>of</strong> the magnetic vector potential<br />

[Tm] in the armature coil.<br />

Figure. 3: amplitude <strong>of</strong> the magnetic vector potential<br />

[Tm] in the field coil.<br />

Figure. 4: amplitude <strong>of</strong> the magnetic induction [T] in the<br />

armature coil.<br />

Figure. 5: amplitude <strong>of</strong> the magnetic induction [T] in the<br />

field coil.


As mentioned before, the 2D field coincides with the<br />

3D field in the neighborhood <strong>of</strong> the symmetry plane. The<br />

analysis <strong>of</strong> the vector potential and flux density<br />

components <strong>of</strong> the 3D model can be compared to the 2D<br />

solution, to evaluate the validity <strong>of</strong> the 2D approximation.<br />

The 2D solution matches the 3D one in a large part <strong>of</strong> the<br />

active length, with a good approximation (in the order <strong>of</strong><br />

90%).<br />

The comparison <strong>of</strong> magnetic flux, inductance<br />

coefficient per unit length <strong>of</strong> both solutions provides the<br />

desired correction factors. The flux waveforms as well as<br />

the amplitude spectrum from the 2D solution are reported<br />

in figs. 6, 7 and the results are compared with those from<br />

3D solution.<br />

Figure. 6: Flux linkage [Wb] waveforms <strong>of</strong> a single a<br />

stator coil<br />

From the comparison <strong>of</strong> the main fluxes it follows the<br />

2D solution underestimates the flux as well as the no load<br />

voltage <strong>of</strong> ΔΦn (3D) =2%.<br />

Unfortunately, due to its complexity, both the accuracy<br />

and the resolution <strong>of</strong> the 3D solution could be<br />

unsatisfactory for a number <strong>of</strong> applications. Of course, the<br />

2D analysis is able to provide more accurate and robust<br />

solution in the plane. Therefore, in order to assess the<br />

quality <strong>of</strong> the 2D solution given by the 3D analysis, a 2D<br />

analysis <strong>of</strong> linkage fluxes has been carried out by using<br />

the commercial s<strong>of</strong>tware package Ansys (Release 13) and<br />

the voltage computed from the principal flux has been<br />

compared with both the 3D evaluations and the<br />

experimental measurements.<br />

The finite element mesh used in this case is shown in<br />

fig. 8 and consists <strong>of</strong> 125549 nodes and 5754 second<br />

order triangular elements.<br />

The no load voltage ha been computed and compared<br />

to the 3D calculations. The actual end-winding effect,<br />

neglected by the 2D solution, is highlighted in fig. 9<br />

where the air gap radial magnetic induction as a function<br />

<strong>of</strong> the angle θ and <strong>of</strong> the axial position Z is plotted. In<br />

addition, to further assess the analysis the 2D no load<br />

voltage has been also compared with a set <strong>of</strong> experimental<br />

measurements.<br />

The discrepancy is rather vanishing (below 1%). Of<br />

- 212 - 15th IGTE Symposium 2012<br />

course such a result comes by an equilibrium <strong>of</strong> two<br />

conflicting effects: the first is lack <strong>of</strong> the end winding<br />

contribution and the second the error introduced by<br />

neglecting the 3D effects in the 2D evaluations.<br />

Figure. 7: Flux linkage [Wb] amplitude spectrum.<br />

Fig. 8: 2D finite element mesh used in the 2D<br />

calculations.<br />

Figure. 9: 3D rendering <strong>of</strong> the radial magnetic flux at the<br />

air gap.


Similar results can be achieved with different currents<br />

(discrepancy in the order 2-3% <strong>of</strong> the actual flux linkage<br />

with 100 A or 1000 A) showing that, in no load operation<br />

the effect <strong>of</strong> iron non linearity is quite limited.<br />

The same procedure is applied to evaluate the leakage<br />

flux and its contribution coming from 3D effects. The<br />

results show that the discrepancy is much more relevant.<br />

In the order <strong>of</strong> 10, 15, 20 % for an exciting current <strong>of</strong><br />

100, 500 and 1000 A, respectively.<br />

V. CONCLUSION<br />

The computation <strong>of</strong> end winding inductances <strong>of</strong> large<br />

turbo generators requires proper mathematical tools to be<br />

performed. This paper proposes an integral FEM<br />

formulation to compute the 3D vector potential and flux<br />

density distribution in the end regions. The approach does<br />

not requires the meshing <strong>of</strong> the free space than allowing a<br />

significant reduction <strong>of</strong> computer burden.<br />

A large synchronous generator with power in the range<br />

300-350 MVA has been analyzed as a case study. Both<br />

axial and radial components <strong>of</strong> the flux density generated<br />

by stator coils have been computed.<br />

The flux linkages for all coils and the no load voltages<br />

have been computed.<br />

In order to assess the influence <strong>of</strong> the end regions,<br />

different stack lengths have been simulated for the same<br />

end windings length. The results achieved include the<br />

definition <strong>of</strong> 3D correction to be applied to 2D<br />

simulation and, in addition, the variation <strong>of</strong> the correction<br />

as a function <strong>of</strong> the exciting currents has been evaluated.<br />

The comparison between terminal quantities coming from<br />

widely overspread 2D models and experimental<br />

measurements have been reviewed by considering 3D<br />

effects evaluated by using the proposed model.<br />

REFERENCES<br />

[1] B. Hosninger, “Theory <strong>of</strong> end-winding leakage reactance”, Power<br />

Apparatus and Systems, Part III, vol. 78, pp. 417-426, Aug. 1959.<br />

[2] W. M. Arshad, H. Lendenmann, Y. Liu, J.-O. Lamell and H.<br />

Persson, “Finding end winding inductances <strong>of</strong> MVA machines”<br />

Proc. Fortieth IAS Meeting, vol. 4, pp. 2309-2314, 2005.<br />

[3] M.F. Hsieh, Y.C. Hsu, D.G. Dorrell, and K.H. Hu, "Investigation<br />

on end winding inductance in motor stator windings", IEEE<br />

Trans. on Magn., vol. 43, pp. 2513–2515, June 2007.<br />

[4] J. Pyrhönen, T. Jokinen and V. Hrabovcová, Design <strong>of</strong> Rotating<br />

Electrical Machines, John Wiley and Sons, Ltd, 2008.<br />

[5] J. A. Tegopoulos, "End component <strong>of</strong> armature leakage reactance<br />

<strong>of</strong> turbine generators", IEEE Trans. on PAS, vol. 83, pp. 632-637,<br />

June 1964.<br />

[6] J. A. Tegopoulos, “Current sheets equivalent to the end-winding<br />

currents <strong>of</strong> turbine generator stator and rotor,” AIEE Trans. Pt. III,<br />

vol. PAS-81, pp. 695–700, February 1963.<br />

[7] V.S Lazarns, A.G Kladas, A.G Mamalis, and J.A. Tegopoulos,<br />

"Analysis <strong>of</strong> end zone magnetic field in generators and shield<br />

optimization for force reduction on end windings", IEEE Trans.<br />

on Mag., vol. 45, pp.1470–1473, March 2009.<br />

- 213 - 15th IGTE Symposium 2012<br />

[8] D. J. Scott, S. J. Salon, and G. L. Kusik, “Electromagnetic forces<br />

on the armature end windings <strong>of</strong> large turbine generators I—<br />

Steady state conditions”, IEEE Trans. PAS., vol. PAS-100, pp.<br />

4597–4603, Nov. 1981.<br />

[9] Q. Li and F. Wang, “Application <strong>of</strong> image method to calculate 3-<br />

D magnetic field and parameters <strong>of</strong> SC alternator”, IEEE Trans.<br />

on Mag., vol. 25, pp. 1850–1853, Feb. 1989.<br />

[10] D. Ban, D. Zarko, and I. Mandic, “Turbo-generator end winding<br />

leakage inductance calculation using a 3D analytical approach<br />

based on the solution <strong>of</strong> Neumann integrals”, IEEE Trans. on En.<br />

Conv., vol. 20, pp. 98–105, March 2005.<br />

[11] A.T. Brahimi, A. Foggia, and G. Meunier, "End winding<br />

reactance computation results using a 3D finite element program"<br />

IEEE Trans. on Mag., vol. 29, pp. 1411-1414, March 1993.<br />

[12] R. Albanese, F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano,<br />

F. Marignetti, R. Martone, G. Rubinacci, A. Tamburrino and S.<br />

Ventre, "Coupled three dimensional numerical calculation <strong>of</strong><br />

forces and stresses on the end windings <strong>of</strong> large turbo generators<br />

via Integral Formulation", IEEE Trans. on Mag., vol. 48, pp. 875<br />

- 878, Feb. 2012.<br />

[13] F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano, F. Marignetti,<br />

R. Martone, G. Rubinacci, A. Tamburrino and S. Ventre, “A<br />

novel technique based on integral formulation to treat the motion<br />

in the analysis <strong>of</strong> electric machinery”, International Journal <strong>of</strong><br />

Applied Mathematics and Mechanics, in press.<br />

[14] R. Albanese, F. I. Hantila, and G. Rubinacci, “A nonlinear eddy<br />

current integral formulation in terms <strong>of</strong> a two-components current<br />

density vector potential”, IEEE Trans. Mag. 32, pp. 784-787,<br />

March 1996.<br />

[15] R. Albanese, and G. Rubinacci, “Finite elements methods for the<br />

solution <strong>of</strong> 3D eddy current problems”, Advances in Imaging and<br />

Electron Physics, vol. 102, pp. 1-86, 1998.<br />

[16] N. Bianchi, Electrical machine analysis using finite elements,<br />

Taylor and Francys, pp. 141-162, 2005.<br />

[17] M.V.K. Chari, S.H. Minnich, S.C. Tandon, Z.J. Csendes, J.<br />

Berkery, “Load characteristics <strong>of</strong> synchronous generator by the<br />

finite element method”, IEEE Trans. on PAS, vol. 100, pp.1-13,<br />

January 1981.<br />

[18] R. Albanese, F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano,<br />

F. Marignetti, R. Martone, G. Rubinacci, A. Tamburrino and S.<br />

Ventre, “Electromechanical analysis <strong>of</strong> end windings in turbo<br />

generators”. COMPEL, vol. 30, pp. 1885-1898, 2011.<br />

[19] J. Pyrhonen, T. Jokinen, V. Hrabokova, Design <strong>of</strong> Rotating<br />

Electrical Machines, John Wiley and sons, Ltd, 2008, pp.246-249<br />

[20] M.V. Deshpande, Design and Testing <strong>of</strong> Electrical Machines,<br />

Phi learning Pvt. Ltd., 2010.


- 214 - 15th IGTE Symposium 2012<br />

Magnetomechanical Coupled FE Simulations <strong>of</strong><br />

Rotating Electrical Machines<br />

*A. Belahcen, *K. Fonteyn, † R. Kouhia, *P. Rasilo, and *A. Arkkio<br />

*Aalto <strong>University</strong>, Dept. <strong>of</strong> Electrical Engineering, POBox 13000, FIN-00076 Aalto, Finland<br />

† Tampere <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Dept. <strong>of</strong> Mechanics and Design, P.O BOX 589, 33101 Tampere, Finland<br />

E-mail: anouar.belahcen@aalto.fi<br />

Abstract— Regardless <strong>of</strong> the relatively large amount <strong>of</strong> published models <strong>of</strong> magnetostriction, only few <strong>of</strong> them have been<br />

applied to describe this phenomenon in electrical steel and even less have been incorporated in the FE simulation <strong>of</strong> electrical<br />

machines. In this paper we review the models <strong>of</strong> magnetostriction and magnetomechanical coupling in electrical steel and their<br />

incorporation into the FE analysis <strong>of</strong> rotating electrical machines. We also discuss the advantages and disadvantages <strong>of</strong> the<br />

different models and present an energy-based coupled magnetomechanical set <strong>of</strong> constitutive equations that describe both the<br />

magnetostriction and the magnetic nonlinearity and its dependency on stresses in the electrical steel. We further present the<br />

implementation <strong>of</strong> these equations into in-house 2D FE s<strong>of</strong>tware for the simulations <strong>of</strong> electrical machines. The simulations<br />

carried out show that the energy based model describes well the vibrations <strong>of</strong> electrical machines due to magnetostriction and<br />

reluctance forces. A discussion on how the model should be improved to account for hysteresis is also presented.<br />

Index Terms—coupled models, electrical machines, finite elements, magnetostriction.<br />

current density and the geometry <strong>of</strong> the windings are<br />

known. However, some issues related to the skin-effect<br />

and the eddy-currents make this computation rather<br />

complex in some special cases as explained by Islam et al.<br />

[1]. In iron, and due to its magnetic domain structure and<br />

its finite electric conductivity, the flow <strong>of</strong> a time-varying<br />

flux produces hysteresis and eddy-current losses (in some<br />

approaches also excess losses). The computation <strong>of</strong> these<br />

so-called iron losses is still one <strong>of</strong> the most active<br />

research fields in the simulation <strong>of</strong> electrical machines.<br />

Finally, the motion <strong>of</strong> the rotating parts <strong>of</strong> the machine<br />

and the friction in the bearings <strong>of</strong> the machine as well as<br />

the one between the moving parts and the air produces<br />

mechanical losses that can be computed through complex<br />

CFD models in case <strong>of</strong> high speed machines or<br />

approximated by semi-empirical equations.<br />

The knowledge <strong>of</strong> the above loss components is<br />

valuable information for the designers <strong>of</strong> electrical<br />

machines as they allow them to optimize the structure <strong>of</strong><br />

the machine with regards to the cooling and the<br />

mechanical strength as well as the use <strong>of</strong> magnetic and<br />

other materials.<br />

Besides the structural and cooling optimization <strong>of</strong> the<br />

machine, the designers are bind by environmental aspects<br />

such as the level <strong>of</strong> acoustic noise and the estimation <strong>of</strong><br />

the lifecycle <strong>of</strong> the machine. The acoustic noise is<br />

produced by the vibrations <strong>of</strong> the structure <strong>of</strong> the machine<br />

under the effect <strong>of</strong> different forces and other excitations<br />

and by the airflow in different channels.<br />

I. INTRODUCTION<br />

Owing to the high demand on energy efficient and<br />

environmental friendly apparatuses, the designers <strong>of</strong><br />

electrical machines, among others, are more and more<br />

interested in accurate computational methods for the<br />

analysis <strong>of</strong> their design. The energy conversion in<br />

electrical machines occurs within three different but<br />

tightly coupled subsystems, i.e., the electrical system, the<br />

magnetic system and the mechanical system as shown in<br />

Fig. 1. The electric system consists <strong>of</strong> the windings <strong>of</strong> the<br />

machine that are connected to the supply in the case <strong>of</strong> a<br />

motor or to the load in case <strong>of</strong> generator operation. The<br />

electric supply/load is nowadays typically a voltage<br />

source frequency converter, the current <strong>of</strong> which is<br />

controlled according to the operation point <strong>of</strong> the<br />

machine. Such current depends on the load <strong>of</strong> the<br />

machine and consists <strong>of</strong> a torque producing component as<br />

well as a component necessary for the magnetization <strong>of</strong><br />

the iron core and another compensating for the core<br />

losses. The magnetic system consists <strong>of</strong> the iron core <strong>of</strong><br />

the machine as well as the airgap and other construction<br />

parts in which a magnetic flux is produced by the coils’<br />

currents according to the Ampere’s law <strong>of</strong> induction.<br />

These fluxes and their interaction with the magnetic<br />

materials produce forces and torque that are transferred to<br />

the mechanical system consisting <strong>of</strong> the rotor, the shaft<br />

and the bearings <strong>of</strong> the machine as well as a possible<br />

cooling fan mounted on the shaft <strong>of</strong> the machine.<br />

The electric, magnetic and mechanical systems,<br />

although represented as separate subsystems, are tightly<br />

coupled to each other and their operation quantities<br />

cannot be solved separately. Indeed the current drawn by<br />

the machine, the torque it produces and the magnetic flux<br />

density in the airgap and the iron core are usually solved<br />

simultaneously especially if the machine is voltage fed.<br />

The operation <strong>of</strong> the above subsystems is known to be<br />

dissipative as there are energy losses related to each<br />

subsystem. The Joule losses resulting from the currents<br />

flowing in the windings are usually easy to compute if the<br />

Electrical power<br />

Lorentz forces<br />

Electrical<br />

System (windings)<br />

Electrical<br />

Losses<br />

Magnetic forces and<br />

Magnetostriction<br />

Coupling field<br />

(iron and air)<br />

Air flow and Friction<br />

Vibrations Wearing and Noise<br />

Magnetic<br />

Losses<br />

Mechanical system<br />

(bearings and fan)<br />

Mechanical<br />

Losses<br />

Mechanical power<br />

Fig. 1. Illustration <strong>of</strong> the energy conversion process with the<br />

related electric, magnetic and mechanical subsystems and<br />

related losses, forces and vibrations and their origins.


The interaction between the flux and the currents<br />

produces Lorentz forces acting on the windings, while the<br />

flow <strong>of</strong> the magnetic flux in the iron core and the airgap<br />

gives rise to magnetic forces and strains in the core.<br />

The vibrations produced by the interaction between the<br />

magnetic forces, the deformations and the structure result<br />

in acoustic noise and mechanical wearing <strong>of</strong> different<br />

parts <strong>of</strong> the machine such as the winding insulation and<br />

the bearings. The knowledge <strong>of</strong> these parasitic effects at<br />

the design stage will help in reducing the vibrations and<br />

noise <strong>of</strong> the machine as well as estimating the lifecycle <strong>of</strong><br />

the machine and optimizing the structure for longer life<br />

too. An illustration <strong>of</strong> the different losses and vibration<br />

phenomena occurring at different subsystems <strong>of</strong> the<br />

energy conversion is given in Fig. 1.<br />

The strains and stresses in the iron core <strong>of</strong> an electrical<br />

machine are produced by different sources and have<br />

strong degrading effect on the quality <strong>of</strong> the iron, thus<br />

reducing the efficiency <strong>of</strong> the energy conversion process<br />

and increasing the amount <strong>of</strong> iron needed for a given<br />

power <strong>of</strong> the machine. The effect <strong>of</strong> the mechanical stress<br />

on the power losses in electrical steel have been known<br />

for quite long time. Already in the 70’s <strong>of</strong> the last century<br />

Moses [2], [3], among others, showed through magnetic<br />

measurements that the mechanical stress affects the<br />

losses, the magnetization, and the magnetostriction <strong>of</strong><br />

electrical steel. By magnetostriction it was meant the<br />

relative change in the length <strong>of</strong> a specimen <strong>of</strong> magnetic<br />

material when it is subjected to a magnetic field.<br />

For a better understanding let us first clarify what is<br />

magnetostriction. In 1842 W. P. Joule discovered what is<br />

today called Joule magnetostriction, which is a volume<br />

conserving deformation <strong>of</strong> magnetic material caused by<br />

its magnetization. Such a deformation results in an<br />

elongation <strong>of</strong> the material in the direction <strong>of</strong><br />

magnetization and a shrink in the orthogonal directions<br />

for positive magnetostrictive materials and vice versa for<br />

negative magnetostrictive materials. Later on, it was<br />

observed that at high values <strong>of</strong> the magnetization the<br />

deformation is no more volume conserving and this<br />

phenomena was called volume magnetostriction. Both<br />

types <strong>of</strong> magnetostriction are called forced<br />

magnetostriction in a sense that they are caused by the<br />

magnetization that forces the magnetic domain walls to<br />

move and the domains to rotate thus producing the<br />

mechanical deformation. On the other hand, when the<br />

magnetic material is cooled down from a high<br />

temperature, it undergoes a strong isotropic change in its<br />

dimensions as it goes though the Curie temperature. Such<br />

a change in volume was explained by the formation <strong>of</strong><br />

magnetic domains and the orientation <strong>of</strong> elementary<br />

magnetic moments within the domains. Several other<br />

experimental works have been conducted on magnetic<br />

materials and separate magnetomechanical phenomena<br />

obtained separate names according to their respective<br />

discoverers. The skew magnetostriction ,e.g., resulting<br />

from a helical magnetization and producing a bending <strong>of</strong><br />

some electrically conducting magnetic material has been<br />

called Wiedemann-effect and the inverse effect <strong>of</strong><br />

- 215 - 15th IGTE Symposium 2012<br />

magnetostriction which results in a change <strong>of</strong> the<br />

magnetization properties <strong>of</strong> magnetic materials under the<br />

action <strong>of</strong> applied mechanical stress was called Vilari<br />

effect. Also the apparent change in the Young modulus <strong>of</strong><br />

magnetic material, which is due to the intrinsic<br />

rearrangement <strong>of</strong> the magnetic domains and thus the<br />

intrinsic elongation following this rearrangement, has<br />

been called Delta-E effect. A comprehensive description<br />

<strong>of</strong> these phenomena can be found, e.g., from [4]. The<br />

intrinsic forms <strong>of</strong> magnetostriction are nowadays referred<br />

to as spontaneous magnetostriction. They are <strong>of</strong> great<br />

interest for metallurgist but are not <strong>of</strong> importance for the<br />

simulation <strong>of</strong> electrical machines under operation as they<br />

do not occur anymore at this stage. Fig. 2 shows an<br />

illustration <strong>of</strong> the difference between spontaneous and<br />

forced magnetostriction.<br />

Cooling through<br />

Curie temperature<br />

H<br />

Positive Joule<br />

magnetostriction<br />

H=0<br />

Applying external<br />

magnetic field<br />

Spontaneous<br />

Negative Joule<br />

magnetostriction<br />

H<br />

Forced magnetostriction<br />

magnetostriction<br />

Fig. 2. Illustration <strong>of</strong> spontaneous and forced magnetostriction.<br />

The mechanical stress or strain acting on the magnetic<br />

material can originate from different phenomena some <strong>of</strong><br />

them produce static stresses and other dynamic stresses.<br />

The shrink fitting <strong>of</strong> the stator into the frame <strong>of</strong> the<br />

machine produces static compressive stresses <strong>of</strong> the order<br />

<strong>of</strong> 10 MPa as shown by Fujisaki et al. [5]. These stresses<br />

can be evaluated by means <strong>of</strong> numerical simulations or<br />

through analytical approximations, but due to the<br />

manufacturing tolerances they may have excessive local<br />

values. On the other hand the so called reluctance forces<br />

occurring between the stator and the rotor <strong>of</strong> the machine<br />

and which are time and space dependent, produce<br />

dynamic tensile and compressive stresses at different<br />

locations <strong>of</strong> the machine. These stresses are at the origin<br />

<strong>of</strong> the so called magnetic noise, i.e., the acoustic noise<br />

due to magnetically excited vibrations. The level <strong>of</strong> these<br />

stresses depends on the magnetic flux density in the air<br />

gape <strong>of</strong> the machine and can be <strong>of</strong> the order <strong>of</strong> 200 MPa.<br />

Further, the rotating and alternating magnetization <strong>of</strong> the<br />

iron core produces dynamic magnetostrictive strains the<br />

level <strong>of</strong> which depends on the state <strong>of</strong> stress in the core.<br />

At last but not least, the punching <strong>of</strong> the magnetic<br />

material in the manufacturing process produces residual<br />

stresses and plastic strains at some regions <strong>of</strong> the<br />

magnetic material. The level <strong>of</strong> these stresses and strains<br />

depend the manufacturing process and the quality <strong>of</strong><br />

punching tools.<br />

All these stresses and strains will affect both the<br />

magnetization characteristics and the energy losses <strong>of</strong> the<br />

magnetic material as well as the vibrations <strong>of</strong> the core and<br />

thus the acoustic noise and the wearing <strong>of</strong> the materials<br />

and parts <strong>of</strong> the machine.<br />

The modeling <strong>of</strong> magnetostriction started In the 50’s <strong>of</strong><br />

the last century as related to the so called giant<br />

magnetostrictive materials and their applications in<br />

ultrasound generators and receptors. The effect <strong>of</strong>


magnetostriction on the noise <strong>of</strong> power transformer was<br />

also investigated through experimental studies starting in<br />

the 70’s by Moses [6] and later by Weiser and Pfutzner<br />

[7] but the modeling <strong>of</strong> such phenomena in electrical<br />

machines started only at the beginning <strong>of</strong> the last two<br />

decades as it was noted that the prediction <strong>of</strong> the losses<br />

and vibrations <strong>of</strong> the machine requires adequate models<br />

able to account for the magnetomechanical coupling as<br />

well as the other electromagnetic couplings.<br />

In this paper we will concentrate on the modeling <strong>of</strong><br />

magnetostriction and the related magnetomechaical<br />

effects and their incorporation into the simulation <strong>of</strong><br />

electrical machines as well as the effect <strong>of</strong> these<br />

phenomena on the computation <strong>of</strong> iron losses. The<br />

coupled electromagnetic simulation methodology for<br />

electrical machines, which made it possible to develop the<br />

magnetomechanical models is now quite established as<br />

reported by Arkkio [8] and Salon et al. [9] among others<br />

and will not be handled here.<br />

In section II we will explore the force-based models <strong>of</strong><br />

magnetostriction and in section III the stress-strain based<br />

models. In section IV we will introduce the energy based<br />

model and in section V we will discuss the incorporation<br />

<strong>of</strong> the different models into 2D finite element simulation<br />

<strong>of</strong> electrical machines. In Section VI we will discuss the<br />

impact <strong>of</strong> these models on the computation <strong>of</strong> iron losses<br />

and sketch the necessity for future developments in view<br />

<strong>of</strong> accurate simulations and computation <strong>of</strong> losses and<br />

vibrations.<br />

II. FORCE-BASED MODELS OF MAGNETOSTRICTION<br />

The main idea behind the force-based models <strong>of</strong><br />

magnetostriction is that the magnetostrictive deformation<br />

<strong>of</strong> a sample <strong>of</strong> magnetic material under homogeneous<br />

magnetization can be produced by a distributed set <strong>of</strong><br />

forces acting on the boundaries <strong>of</strong> the sample as<br />

explained by Delaere et al. [10] and sketched in Fig. 3 for<br />

an arbitrary element.<br />

Such a representation <strong>of</strong> the magnetostriction emanates<br />

from earlier work <strong>of</strong> Besbes et al. [11], where the<br />

magnetostrictive forces were directly derived from the<br />

principle <strong>of</strong> virtual work and by accounting for the<br />

variation <strong>of</strong> the permeability with the mechanical stress. It<br />

should be mentioned that in [11], the magnetostriction has<br />

not been well described as sever assumptions such as<br />

linear magnetization and its linear dependency on the<br />

stress have been made. The local application <strong>of</strong> the<br />

principle <strong>of</strong> virtual work for the computation <strong>of</strong> magnetic<br />

forces itself was developed by Bossavite [12] after its<br />

introduction at a global level by Coulomb [13].<br />

The development <strong>of</strong> such force-based models and their<br />

coupling with the electromagnetic simulation <strong>of</strong> electrical<br />

machines was also reported by Mohamed et al. [14],<br />

Vandevelde et al. [15] and Belahcen [16]. Vandevelde<br />

and Belahcen used a stress approach to compute the<br />

magnetostrictive forces, whereas Delayer used a strain<br />

approach. In all these works, the other magnetic forces<br />

were computed either according to the principle <strong>of</strong> virtual<br />

- 216 - 15th IGTE Symposium 2012<br />

work and introduced into the FE simulation as<br />

generalized nodal forces or through the Maxwell stress<br />

tensor. The two methodologies have been earlier<br />

demonstrated by Kameari [17] to be equivalent. Lately,<br />

many authors applied the concept <strong>of</strong> magnetostrictive<br />

forces in the FE simulation <strong>of</strong> electrical machines.<br />

The equation for the computation <strong>of</strong> the generalized<br />

nodal magnetic forces is given bellow<br />

B<br />

T 1 <br />

F A A d dSˆ<br />

J e<br />

Sˆ<br />

e<br />

H<br />

B J<br />

(1)<br />

<br />

e U U<br />

0<br />

<br />

where B and H are the magnetic flux distribution and<br />

field strength respectively. J is the Jacobian matrix for the<br />

transformation from the reference finite element to actual<br />

one and ˆ Se stands for the reference element. U is the<br />

vector <strong>of</strong> nodal displacement. The integration with respect<br />

to the magnetic flux density in (1) as well as the<br />

differentiation with respect to the displacements is carried<br />

out analytically. For this purpose and for FE computation,<br />

a cubic spline representation <strong>of</strong> the HB-curve <strong>of</strong> the iron<br />

sheets is used. The different approaches for the<br />

computation <strong>of</strong> nodal magnetostrictive forces can be<br />

found in [10], [14], [15], and [16]. Fig. 4 shows the<br />

magnetic and magnetostrictive force computed for the<br />

stator core <strong>of</strong> a synchronous machine [18]. Similar forces<br />

for the induction machines have been reported in [19].<br />

Fig. 3. Sketch <strong>of</strong> the computation <strong>of</strong> magnetostrictive force from<br />

magnetostrictive deformation after Delaere [10].<br />

Fig. 4. Generalized magnetic forces (left) and equivalent<br />

magnetostrictive forces computed in the stator core <strong>of</strong> a<br />

synchronous machine. The forces have been normalized.<br />

The structural deformation can be computed either in a<br />

coupled or uncoupled methodology. In the coupled<br />

approach the mechanical and magnetic problems are<br />

solved simultaneously and the forces are updated at<br />

iteration level. In the uncoupled approach the magnetic<br />

problem is first solved and the forces computed as postprocessing<br />

quantities from the magnetic problem then<br />

introduced in the mechanical problem as loads. The<br />

results <strong>of</strong> the mechanical problem are the nodal<br />

displacements from which the deformation as well as the<br />

strains and stresses can be computed.<br />

Although the concept <strong>of</strong> magnetostrictive forces<br />

describes quit well the deformation <strong>of</strong> the magnetic


material it has a major drawback that consist <strong>of</strong> resulting<br />

in erroneous stress state in the material. Indeed, the<br />

magnetostrictive forces result into tensile stress if the<br />

boundaries <strong>of</strong> the element are free to move (refer to Fig.<br />

3). However, the actual state <strong>of</strong> stress in this case should<br />

be a zero stress in the element. If the boundaries <strong>of</strong> the<br />

element are fixed, the magnetostrictive forces will result<br />

into a zero stress, while the actual stress is a compressive<br />

one, the magnitude <strong>of</strong> which depends on the material<br />

properties and the level <strong>of</strong> magnetization. This erroneous<br />

behavior due to equivalent magnetostriction forces is<br />

illustrated in Fig. 5, where the magnetostrictive<br />

elongation is solved correctly but the stress is wrong.<br />

From the above discussion it is clear that the concept <strong>of</strong><br />

magnetostrictive force is not able to describe the stress<br />

dependency <strong>of</strong> the magnetic properties <strong>of</strong> the material and<br />

thus <strong>of</strong> the magnetostriction itself when it has to be<br />

computed under different boundary conditions dictated by<br />

the geometry and the topology <strong>of</strong> the electrical machine.<br />

III. STRAIN-BASED MODELS OF MAGNETOSTRICTION<br />

In the strain-based models <strong>of</strong> magnetostriction there is<br />

no need for the calculation <strong>of</strong> equivalent forces. The<br />

magnetostrictive strains are incorporated in the structural<br />

analysis in a similar way to the thermal dilatation <strong>of</strong><br />

metals. Here also, the magnetostriction can be modeled in<br />

a decoupled or coupled approach. In the decoupled<br />

approach [20] the magnetostrictive strains are computed<br />

from per element magnetic flux densities and the<br />

measured single valued flux densty-elongation<br />

relationship. These strains are then incorporated in a<br />

structural finite element model <strong>of</strong> the electrical machine<br />

to produce the deformation. If the effect <strong>of</strong> stress is to be<br />

accounted for in the computation <strong>of</strong> the magnetostriction,<br />

a coupled model should be used. Such a model emanates<br />

from the coupled magnetomechanical constitutive<br />

equations <strong>of</strong> the material [21] as explained in the next<br />

Section.<br />

Equivalent forces:<br />

Equivalent forces:<br />

= 0 ; = 0 = ms ; = -ms<br />

Iron<br />

Actual behavior:<br />

Actual behavior:<br />

= 0 ; = ms = ms ; = 0<br />

Fig. 5. Illustration <strong>of</strong> the magnetostriction and the state <strong>of</strong><br />

stress in the sample under the effect <strong>of</strong> magnetostrictive forces.<br />

Left clamped sample and right free sample. In both case the<br />

elongation is correct but the stress is wrong.<br />

IV. THE ENERGY-BASED CONSTITUTIVE EQUATIONS<br />

The energy-based, coupled constitutive equations <strong>of</strong><br />

the electrical steel are derived from an appropriate<br />

representation <strong>of</strong> the Helmholtz free energy [21], which<br />

itself is based on previous empirical observations made<br />

from the measurement <strong>of</strong> magnetostriction under different<br />

stresses and flux densities [22]. A summary <strong>of</strong> the results<br />

<strong>of</strong> these measurements all together with the model<br />

prediction are shown in Fig. 6. The Helmholtz free energy<br />

in a sample is written as:<br />

Iron<br />

- 217 - 15th IGTE Symposium 2012<br />

1<br />

I<br />

<br />

2<br />

2<br />

1<br />

4 1<br />

g ( I ) I <br />

i 1 4 1 1<br />

<br />

2 <br />

2 4 5 5 6 6<br />

2<br />

<br />

i0 i 1 <br />

<br />

B <br />

2 2<br />

ref <br />

2GI<br />

I I I<br />

<br />

where the invariants I .. I are:<br />

1 6<br />

1 2 1 3<br />

I tr( )<br />

, I tr( ) , I tr( )<br />

1 2<br />

3<br />

2<br />

3<br />

(3)<br />

I <br />

4 B B, 2<br />

I BB, I BB 5<br />

6<br />

(4)<br />

is the first Lamé parameter,G the shear modulus <strong>of</strong> the<br />

material, the mass density and<br />

3 3 1<br />

g exp( I ) 0 0 1 0 5<br />

4 4 3<br />

(5)<br />

3( i 1) 4( i 1) <br />

g exp <br />

I<br />

<br />

; i i<br />

1<br />

4 <br />

<br />

3 <br />

i 1..4(6)<br />

; i 0..6 are model parameters and B is a reference<br />

i ref<br />

flux density. The magnetomecjanically coupled<br />

constitutive equations are derived from (2) as<br />

6<br />

6<br />

<br />

Ii<br />

<br />

Ii<br />

and M <br />

i1, i3I i<br />

i1, i3IB i<br />

(7)<br />

where and are the stress and strain tensors and M the<br />

magnetization. An extensive derivation <strong>of</strong> the model and<br />

its equations is given in [23]. The model results in an<br />

explicitly coupled formulation for the stress tensor and<br />

the magnetic field strength vector in terms <strong>of</strong> the magnetic<br />

flux density and the mechanical strain tensor as<br />

( B,<br />

) I + (<br />

I ) 2G<br />

1 1 4<br />

<br />

1 1 <br />

<br />

( ) <br />

<br />

2 ( ) <br />

0 <br />

B B BB <br />

B B BB 4<br />

2 <br />

<br />

B B( BB) <br />

5<br />

2<br />

2<br />

<br />

<br />

<br />

B B B B<br />

<br />

6 2<br />

2 ( ) +( )<br />

<br />

B B B B B B<br />

<br />

(8)<br />

1<br />

5 <br />

2<br />

HB ( , ) B2 B B B <br />

0 4 5<br />

2 2<br />

(9)<br />

Magnetostriction (m/m)<br />

(2)<br />

In (8) and (9) 1 and are used to shorten the notation:<br />

4<br />

x 10-6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

-1<br />

1 g<br />

<br />

<br />

I and<br />

4<br />

i i1<br />

1 4<br />

2<br />

i0<br />

I1<br />

3.9 MPa<br />

0.0 MPa<br />

-1.7 MPa<br />

-6.1 MPa<br />

-2<br />

-2 -1.5 -1 -0.5 0 0.5 1 1.5 2<br />

Flux density (T)<br />

<br />

1<br />

g<br />

(10)<br />

4<br />

i i<br />

<br />

I<br />

4 4<br />

2i0i1 Fig. 6. Measured magnetostrictive strains at different flux<br />

densities and applied mechanical stresses (left) and comparison<br />

with model prediction (right). Positive stresses are tensile and<br />

negative ones are compressive.<br />

V. INCORPORATION INTO FE SIMULATIONS<br />

The starting point for the implementation <strong>of</strong> the<br />

magnetomechanically coupled equation in FE analysis <strong>of</strong><br />

electrical machines is an in-house 2D s<strong>of</strong>tware package.<br />

The field equations have been previously coupled with


the electrical circuit equations <strong>of</strong> the machine winding,<br />

which makes it possible to feed the model from a voltage<br />

source and also resolve the induced voltages and currents<br />

in other parts <strong>of</strong> the machine, e.g., cage winding or solid<br />

rotor, through time stepping analysis [8].<br />

The coupled constitutive equations (8) and (9) were<br />

first linearized to the first order and then the weak form <strong>of</strong><br />

Galerkin method was applied to (8) and the principle <strong>of</strong><br />

virtual work applied to (9). This resulted in:<br />

<br />

H ( w) <br />

d H<br />

( w) <br />

<br />

B d<br />

<br />

B <br />

<br />

<br />

(11)<br />

( w) H d wH ds 0<br />

0 0<br />

<br />

<br />

T ˆ <br />

<br />

<br />

<br />

d <br />

<br />

T <br />

<br />

<br />

ˆ<br />

<br />

<br />

d<br />

<br />

<br />

ˆ d uˆ ( f f ) d<br />

B<br />

<br />

T<br />

0<br />

B<br />

<br />

<br />

<br />

T<br />

<br />

T<br />

uˆf ds 0<br />

surf<br />

<br />

mech inert<br />

- 218 - 15th IGTE Symposium 2012<br />

(12)<br />

where w is a test or weight function and the quantities<br />

with hat are virtual ones.u is the mechanical displacement<br />

vector and f , f , f are respectively mechanical,<br />

mech inert surf<br />

inertia body forces and surface forces. In the<br />

implementation the shape functions <strong>of</strong> the finite element<br />

approximation are used as weight function.<br />

Equations (11) and (12) are then spatially descitized<br />

using standard finite element procedure and inserted in<br />

the in-house code, thus replacing the nonlinear model <strong>of</strong><br />

iron cores. The insertion <strong>of</strong> these equations in the code<br />

does not affected the electromagnetic coupling as this<br />

latter one takes place in the windings and conducting<br />

regions only whereas the magnetomechanical coupling<br />

takes place in non conduction iron. Special attention<br />

however has to be given to the region formed by the<br />

airgap or more properly the interface between any noniron<br />

region and the iron core. This is because the Maxwell<br />

stress tensor makes sense only if it is computed from both<br />

regions at any interface. Thus when assembling the<br />

system matrix, the contribution to the nodal values <strong>of</strong> the<br />

Maxwell stress are computed from both iron and non-iron<br />

element with common interface with the iron core.<br />

In the case <strong>of</strong> force based model <strong>of</strong> magnetostriction,<br />

the approach is quite similar to the one presented above,<br />

except that the equivalent magnetostrictive forces as well<br />

as the other magnetic forces computed with (1) have been<br />

inserted in the model as external mechanical forces. Such<br />

implementation was already reported in [19].<br />

The implemented formulation and s<strong>of</strong>tware were first<br />

tested on a simple model consisting <strong>of</strong> an iron disc<br />

excited through Direchlet boundary condition on its outer<br />

edge. The magnetic vector potential on this edge was set<br />

to time dependent values as to create a rotating field<br />

uniformly distributed on the surface <strong>of</strong> the test sample.<br />

The boundaries <strong>of</strong> the sample were free to move and only<br />

the center <strong>of</strong> the disc was fixed in both x- and y-direction.<br />

Fig. 7. Shows the original and deformed mesh used in the<br />

model when the flux density was 1.5 T either along the xaxis<br />

or at an angle <strong>of</strong> 45 deg. to it. The results show that<br />

thanks to the tensor representation and formulation the<br />

effect <strong>of</strong> the shear stress and strain are correctly<br />

computed. The model was also applied to a induction<br />

machine-like device without the airgap in view <strong>of</strong><br />

minimizing all the other magnetic forces. The results from<br />

this verification are reported in [23] and show good<br />

agreement between the measured and computed<br />

displacements. The model was applied to the computation<br />

<strong>of</strong> the deformation and vibrations <strong>of</strong> two induction<br />

machines, the parameters <strong>of</strong> which are given in Table I.<br />

The extensive results from the simulations <strong>of</strong> the two<br />

machines as well as a comprehensive analysis <strong>of</strong> these<br />

results are reported in [24]. Here we present a comparison<br />

between the computed displacements <strong>of</strong> nodes on the<br />

tooth <strong>of</strong> the machines when the magnetostriction only is<br />

accounted for and when the so called reluctance forces<br />

(Maxwell stress) are also accounted for. Although, the<br />

vibrations depend on the machine construction and could<br />

not be generalized, this result gives the reader an estimate<br />

<strong>of</strong> the effect <strong>of</strong> magnetostriction on the vibrations <strong>of</strong><br />

rotating electrical machines. Fig. 8 shows this comparison<br />

for both machines. Due to the differences in the number<br />

<strong>of</strong> pole pairs the vibration behaviors are different too.<br />

y-coordinate (m)<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

-0.05<br />

-0.1<br />

-0.1 -0.05 0 0.05 0.1 0.15<br />

x-coordinate (m)<br />

y-coordinate (m)<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

-0.05<br />

-0.1<br />

-0.1 -0.05 0 0.05 0.1 0.15<br />

x-coordinate (m)<br />

Fig. 7. Test model, original and deformed mesh computed with<br />

the developed method with a flux density <strong>of</strong> 1.5 T along the xaxis<br />

(left) and at 45 deg (right).<br />

Table I. Parameters <strong>of</strong> the simulated induction machines<br />

Parameter Machine I Machine II<br />

Rated voltage 380 V 380 V<br />

Slip 2 % 3.2 %<br />

Rated current 60 A 27 A<br />

Rated power 30 kW 15 kW<br />

Number <strong>of</strong> pole pairs 1 2<br />

Outer diameter <strong>of</strong> the stator core 323 mm 235 mm<br />

Inner diameter <strong>of</strong> the stator core<br />

190.2 mm 145 mm<br />

Number <strong>of</strong> stator slots<br />

36<br />

36<br />

Outer diameter <strong>of</strong> the rotor core 188.37 mm 144.1 mm<br />

Number <strong>of</strong> rotor slots 28 34<br />

Fig. 8.Comparison between computed displacements <strong>of</strong> a<br />

node on the stator tooth in machine 1 (left) and machine 2<br />

(right). Subscripts r and stand for the radial and tangential<br />

directions. 1 is the case with only magnetostriction and 2 when<br />

both magnetostriction and reluctance forces are considered.


VI. FUTURE TRENDS AND DEVELOPMENTS<br />

The developed coupled magnetomechanical model<br />

although bidirectional in magnetic and mechanics is<br />

single valued and thus does not take dissipation into<br />

account. On the other hand, existing hysteresis models<br />

either for magnetic [25] or mechanics [26] are based on<br />

the Preisach approach, which is mathematically rigorous<br />

but does not give clear insight into the energetic balance<br />

between the two subsystems. The only energy-based<br />

hysteresis model that describes both magnetism and<br />

mechanics is the one developed by Jiles [27] but its<br />

application in electric steel and further to the simulation<br />

<strong>of</strong> electrical machines has not been reported yet. This<br />

might be due to the sharp saturation <strong>of</strong> the magnetization<br />

curves <strong>of</strong> electrical steel, which cause convergence<br />

problems but also to the fact that the original model traits<br />

the compressive and tensile stresses in a symmetric way,<br />

which is not adequate for the electric steel where the<br />

compressive stresses have much pronounced effect on the<br />

magnetic and magnetostrictive properties <strong>of</strong> the material.<br />

The dynamic behavior <strong>of</strong> magnetostriction also needs to<br />

be addressed [28] as well as the effect <strong>of</strong> anisotropy [20].<br />

These shortcuts in the modeling and simulation <strong>of</strong><br />

electrical machines still need to be addressed in view <strong>of</strong><br />

better estimation <strong>of</strong> the vibrations <strong>of</strong> electrical machines<br />

and iron losses, which are known to depend on the state<br />

<strong>of</strong> stress in the material. The presented model already<br />

estimates the effect <strong>of</strong> magnetostriction on the state <strong>of</strong><br />

stress but other causes <strong>of</strong> stress have to be added too.<br />

The models to be developed and used need both<br />

characterization <strong>of</strong> the material under different flux<br />

densities and mechanical stress and verification<br />

procedures to assess the validity <strong>of</strong> the models. The work<br />

presented in [29] is a good start for the characterization<br />

work. We have also developed characterization<br />

methodologies and analyzed their accuracy [30]; this<br />

work is still continuing. The verification work needs still<br />

some development.<br />

REFERENCES<br />

[1] M. J. Islam, A. Arkkio, “Effects <strong>of</strong> pulse-width-modulated supply<br />

voltage on eddy currents in the form-wound stator winding <strong>of</strong> a cage<br />

induction motor,” IET Electric Power Applications, vol. 3, no. 1, pp.<br />

50-58, January 2009.<br />

[2] A. Moses, P. Phillips, “Some effects <strong>of</strong> stress in Goss-oriented siliconiron,”<br />

IEEE Trans. Magn. , vol. 14, no. 5, pp. 353-355, Sep 1978.<br />

[3] A. Moses, “Effects <strong>of</strong> applied stress on the magnetic properties <strong>of</strong><br />

high permeability silicon-iron,” IEEE Trans. Magn., vol. 15, no. 6, pp.<br />

1575-1579, Nov 1979.<br />

[4] Du Trémolet de Lacheisserie, E., 1993. Magnetostriction–Theory<br />

and Applications <strong>of</strong> Magnetoelasticity. CRC Press Inc. 432 pages.<br />

[5] K. Fujisaki, R. Hirayama, T. Kawachi, S. Satou, C. Kaidou, M.<br />

Yabumoto, T. Kubota, “Motor core iron loss analysis evaluating<br />

shrink fitting and stamping by finite-element method,” IEEE Trans.<br />

Magn., vol. 43, no. 5, pp.1950-1954, May 2007<br />

[6] A. Moses, “Measurement <strong>of</strong> magnetostriction and vibration with<br />

regard to transformer noise,” IEEE Trans. Magn., vol. 10, no. 2, pp.<br />

154-156, Jun 1974<br />

[7] B. Weiser, H. Pfutzner, J. Anger, “Relevance <strong>of</strong> magnetostriction and<br />

forces for the generation <strong>of</strong> audible noise <strong>of</strong> transformer cores,” IEEE<br />

Trans. Magn., vol. 36, no. 5, pp.3759-3777, Sep 2000<br />

[8] A. Arkkio, Analysis <strong>of</strong> induction motors based on the numerical<br />

solution <strong>of</strong> the magnetic field and circuit equations, Doctoral<br />

dissertation, 1987, Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Espoo, Finland<br />

- 219 - 15th IGTE Symposium 2012<br />

[9] S. J. Salon, R. Palma, C. C. Hwang, “Dynamic modeling <strong>of</strong> an<br />

induction motor connected to an adjustable speed drive,” IEEE Trans.<br />

Magn., vol. 25, no. 4, pp. 3061-3063, Jul 1989.<br />

[10] K. Delaere, W. Heylen, R. Belmans, K. Hameyer, “Comparison <strong>of</strong><br />

induction machine stator vibration spectra induced by reluctance<br />

forces and magnetostriction,” IEEE Trans. Magn., vol. 38, no. 2, pp.<br />

969-972, Mar 2002<br />

[11] M. Besbes, Z. Ren, A. Razek, “Finite element analysis <strong>of</strong> magnetomechanical<br />

coupled phenomena in magnetostrictive materials,” IEEE<br />

Trans. Magn., vol. 32, no. 3, pp. 1058-1061, May 1996<br />

[12] A. Bossavit, “Edge-element computation <strong>of</strong> the force field in<br />

deformable bodies,” IEEE Trans. Magn., vol. 28, no. 2, pp. 1263-<br />

1266, Mar 1992.<br />

[13] J. L. Coulomb, “A methodology for the determination <strong>of</strong> global<br />

electromechanical quantities from a finite element analysis and its<br />

application to the evaluation <strong>of</strong> magnetic forces, torques and<br />

stiffness,” IEEE Trans. Magn., vol. 19. 6, pp. 2514-19, 1983.<br />

[14] O.A. Mohammed, T. Calvert, R. McConnell, “Coupled<br />

magnetoelastic finite element formulation including anisotropic<br />

reluctivity tensor and magnetostriction effects for machinery<br />

applications,” IEEE Trans. Magn., vol. 37, no. 5, pp. 3388-3392, Sep<br />

2001.<br />

[15] L. Vandevelde, J.A.A. Melkebeek, “Magnetic forces and<br />

magnetostriction in electrical machines and transformer cores,” IEEE<br />

Trans. Magn., vol. 39, no. 3, pp. 1618- 1621, May 2003.<br />

[16] A. Belahcen, “Vibrations <strong>of</strong> rotating electrical machines due to<br />

magnetomechanical coupling and magnetostriction,” IEEE Trans.<br />

Magn., vol. 42, no. 4, pp. 971-974, Apr. 2006.<br />

[17] A. Kameari, “Local calculation <strong>of</strong> forces in 3D FEM with edge<br />

elements,” International Journal <strong>of</strong> applied Electromagnetics in<br />

Materials, vol. 3, pp. 231-240, 1993.<br />

[18] A. Belahcen, “Magnetoelastic coupling in rotating electrical<br />

machines,” IEEE Trans. Magn., vol. 41, no. 5, pp. 1624-1627, May<br />

2005.<br />

[19] A. Belahcen, Magnetoelasticity, magnetic forces and<br />

magnetostriction in electrical machines. Doctoral dissertation, 2004,<br />

Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Espoo, Finland.<br />

[20] S. Somkun, A. J. Moses, P. I. Anderson, P. Klimczyk,<br />

“Magnetostriction anisotropy and rotational magnetostriction <strong>of</strong> a<br />

nonoriented electrical steel,” IEEE Trans. Magn., vol. 46, no. 2, pp.<br />

302-305, Feb. 2010.<br />

[21] A. Belahcen, K. Fonteyn, A. Hannukainen and R. Kouhia, “On<br />

numerical modeling <strong>of</strong> coupled magnetoelastic problem”, Nordic<br />

Seminar on Computational Mechanics NSCM-21, pp. 203-206, Oct.<br />

16-17.2008, Trondheim, Norway.<br />

[22] A. Belahcen and M. El Amri, “Measurement <strong>of</strong> stress-dependent<br />

magnetisation and magnetostriction <strong>of</strong> electrical steel sheets,”<br />

International Conference on Electrical Machines ICEM, Sep. 5-<br />

8.2004, Cracow, Poland.<br />

[23] K. A. Fonteyn, Energy-based magneto-mechanical model for<br />

electrical steel sheets, Doctoral dissertation, 2010, Aalto <strong>University</strong>,<br />

Finland<br />

[24] K. A. Fonteyn, A. Belahcen, P. Rasilo, R. Kouhia, A. Arkkio,<br />

“Contribution <strong>of</strong> Maxwell stress in air on the deformations <strong>of</strong><br />

induction machines,” Journal <strong>of</strong> Electrical Engineering & <strong>Technology</strong>,<br />

vol. 7, no. 3, pp. 336-341, 2012.<br />

[25] E. Dlala, A. Belahcen, K. Fonteyn, M. Belkasim, “Improving loss<br />

properties <strong>of</strong> the mayergoyz vector hysteresis model,” IEEE Trans.<br />

Magn., vol. 46, no. 3, pp. 918-924, March 2010.<br />

[26] A. Bergqvist, On magnetic hysteresis modelling, Doctoral<br />

dissertation, 1994, Royal Institute <strong>of</strong> <strong>Technology</strong>, Stockholm,<br />

Sweden<br />

[27] D. C. Jiles, D. L. Atherton, “Theory <strong>of</strong> ferromagnetic hysteresis<br />

(invited),” Journal <strong>of</strong> Applied Physics, vol. 55, no. 6, pp. 2115-2120,<br />

Mar 1984.<br />

[28] P. Rasilo, A. Belahcen, “Iron losses, magnetoelasticity and<br />

magnetostriction in ferromagnetic steel laminations,” IEEE<br />

Conference on Electromagnetic Field Computation CEFC, 11-<br />

14.11.2012, Oita, Japan.<br />

[29] Y. Kai, Y. Tsuchida, T. Todaka, M. Enokizono, “Influence <strong>of</strong> stress<br />

on vector magnetic property under rotating magnetic flux conditions,”<br />

IEEE Trans. Magn., vol. 48, no. 4, pp. 1421-1424, April 2012.<br />

[30] A. Belahcen, P. Rasilo, K. Fonteyn, R. Kouhia and A. Arkkio,<br />

“Modeling the stress effect on the measurement <strong>of</strong> magnetostriction in<br />

electrical sheets under rotational magnetization,” IEEE Conference on<br />

Electromagnetic Field Computation CEFC, 11-14.11.2012, Oita,<br />

Japan.


- 220 - 15th IGTE Symposium 2012<br />

Magnetic Saturation Effect on Modeling Squirrel-cage<br />

Induction Motors with Stator Inter-turn Fault<br />

*Jawad Faiz, † Mansour Ojaghi and † Mahdi Sabouri<br />

*Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering,<br />

College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran, E-mail: jfaiz@ut.ac.ir<br />

† Department <strong>of</strong> Electrical Engineering, <strong>University</strong> <strong>of</strong> Zanjan, Zanjan, Iran<br />

Abstract— Coupled circuits model (CCM) <strong>of</strong> squirrel-cage induction motors is the most detailed and complete analytical<br />

model for analyzing the performance <strong>of</strong> the faulty induction motors. This paper extends the CCM to a saturable model<br />

including variable degrees <strong>of</strong> the saturation effects using an appropriate air gap function and novel techniques for locating<br />

the angular position <strong>of</strong> the air gap flux density and estimating the saturation factor. Comparing simulated and experimental<br />

magnetization characteristics shows the accuracy <strong>of</strong> the new saturable model. Using saturable and non-saturable models,<br />

various simulations are carried out on faulty induction motors, and then, by comparing the results, the impacts <strong>of</strong> the<br />

saturation on the performance <strong>of</strong> the faulty motor are presented.<br />

Index Terms—Induction motors, Inter-turn fault, Magnetic saturation, Coupled circuits model.<br />

I. INTRODUCTION<br />

Implementing a proper condition monitoring system is<br />

essential to prevent squirrel-cage induction motors<br />

(SCIMs) from catastrophic failure. The stator inter-turn<br />

fault is a relatively frequent fault and if not diagnosed on<br />

time, causes major breakdown <strong>of</strong> the SCIM. SCIM<br />

performance under the inter-turn fault can be analyzed<br />

using magnetically coupled circuits model (CCM) [1],<br />

[2]. Such analysis helps to realize the faulty SCIMs<br />

performance, to extract proper indexes for the various<br />

faults and to develop effective fault diagnosis and<br />

condition monitoring techniques for SCIMs. To do so,<br />

CCM must be as exact as possible; however, ignoring the<br />

magnetic saturation may keep it away from the required<br />

exactness [3].<br />

For economical utilization <strong>of</strong> the magnetic material,<br />

electrical machines operating regions have to be extended<br />

above the knee <strong>of</strong> the magnetization characteristic, which<br />

forces the machine into the saturation region. Many<br />

attempts have been so far made to include saturation<br />

effects in SCIM models including CCM [4]–[9]. An<br />

extension to CCM <strong>of</strong> healthy SCIM, which includes<br />

variable degrees <strong>of</strong> saturation, has been reported in [9].<br />

The proposed saturable CCM (SCCM) needs to track the<br />

air gap rotating flux density, and this has been done by a<br />

rather simple technique in [9]. However, distortion <strong>of</strong> the<br />

air gap flux density distribution due to the fault causes<br />

the application <strong>of</strong> that technique erroneous. Thus, the<br />

existing SCCM is not viable to analyze the faulty SCIMs.<br />

In this paper, the rotor meshes are used as the air gap<br />

flux samplers. The flux-linkages <strong>of</strong> the rotor meshes,<br />

calculated at each step, are used to estimate the air gap<br />

flux density distribution. Then, Fourier series analysis is<br />

used to determine the space harmonics <strong>of</strong> the air gap flux<br />

density, including fundamental harmonic amplitude (B1)<br />

and its phase angle (1). B1 is used to determine<br />

saturation factor (Ksat) properly and 1 is utilized to track<br />

the air gap flux density. Therefore, a modified version <strong>of</strong><br />

the SCCM is obtained, whose accuracy is proved by<br />

comparing the magnetization characteristics determined<br />

through the simulation and experiments. Then, using<br />

both the modified SCCM and the normal CCM, the<br />

performance <strong>of</strong> SCIM with some inter-turn faults are<br />

simulated. By comparing the results, impacts <strong>of</strong> magnetic<br />

saturation on the faulty SCIMs performance and their<br />

fault indexes will be clear. Comparisons are also made<br />

with experimental results, which confirm the accuracy <strong>of</strong><br />

the proposed model.<br />

II. SCCM OF SCIM WITH INTER-TURN FAULT<br />

Any loop on the rotor <strong>of</strong> SCIM consisting <strong>of</strong> any two<br />

adjacent rotor bars is considered as a circuit mesh. The<br />

shorted turns in the stator are also considered as an<br />

independent circuit (phase 'd'). Applying KVL to the<br />

rotor meshes, stator phases and stator shorted turns and<br />

adding the related torque and mechanical equations,<br />

CCM dynamic equations are attained. These equations<br />

with complete details are presented in [2]. Self/mutual<br />

inductances <strong>of</strong> the various circuits and their derivatives<br />

versus the rotor position are the most important<br />

parameters <strong>of</strong> CCM equations. These inductances are<br />

calculated using the following equation [10]:<br />

2<br />

1<br />

Lxy or<br />

l<br />

g n N d<br />

0 x y<br />

(1)<br />

where x and y can be any phase <strong>of</strong> the stator (a, b, c or d)<br />

or any mesh <strong>of</strong> the rotor (1 to R), μ0 is the air magnetic<br />

permeability, r is the air gap mean radius, l is the stack<br />

length, g -1 is the inverse air gap function, nx is the x<br />

phase (mesh) turn function and is the angle in the stator<br />

stationary reference. In the case <strong>of</strong> the uniform air gap<br />

machine, Ny is the winding function <strong>of</strong> the y circuit, but<br />

in the case <strong>of</strong> the non-uniform air gap machine, it is the<br />

modified winding function <strong>of</strong> the y circuit.<br />

Saturation <strong>of</strong> the magnetic material causes its<br />

reluctance to be increased against the machine's flux.<br />

Similar increase <strong>of</strong> the reluctance can be achieved by a<br />

proportional increase in the air gap length along the main<br />

flux path [4]. Anywhere within the core material,<br />

reluctance increase caused by the saturation depends on<br />

the exact value <strong>of</strong> the flux density, but independent <strong>of</strong> the<br />

flux direction. Thus, it is expected that the fictitious air<br />

gap length (gf) fluctuates a complete cycle every half


cycle <strong>of</strong> the air gap flux density distribution around the<br />

air gap. Assuming a sinusoidal form for this fluctuation,<br />

the following satisfies the mentioned requirements [9]:<br />

g f g[<br />

1<br />

cos( 2P(<br />

f ))] (2)<br />

where f is the angular position <strong>of</strong> a zero crossing point<br />

<strong>of</strong> the air gap flux density in the stator reference, P is the<br />

pole pairs, g' is the mean value <strong>of</strong> gf and is the peak<br />

value <strong>of</strong> its fluctuation. g' and are determined as<br />

follows:<br />

3K<br />

sat<br />

g <br />

ge<br />

(3)<br />

K 2<br />

2( Ksat<br />

1)<br />

<br />

(4)<br />

3Ksat<br />

where ge is the effective air gap length <strong>of</strong> the unsaturated<br />

machine, which is related to the mechanical air gap<br />

length (g0) by the Carter's coefficient (ge=kcg0). Ksat is the<br />

saturation factor which is defined as the ratio <strong>of</strong> the<br />

fundamental components <strong>of</strong> the air gap voltage for the<br />

saturated and unsaturated conditions [4].<br />

Replacing the inverse <strong>of</strong> (2) into (1), using modied<br />

winding function theory, assuming the turn functions to<br />

have only step variation in the center <strong>of</strong> the slots and<br />

determining the indefinite integrals, exact analytic<br />

equations are obtained for the various inductances. Then,<br />

differentiating the equations versus the rotor position<br />

(r), exact analytic equations also are obtained for the<br />

derivatives <strong>of</strong> the inductances. The equations are<br />

functions <strong>of</strong> Ksat and f, thus by using them; variable<br />

degree <strong>of</strong> saturation effect enters to the model <strong>of</strong> the<br />

SCIM. The equations are included in the appendix.<br />

For the proposed SCIM (see Section V), Figure 1<br />

shows the turn functions <strong>of</strong> the phase 'a' winding before<br />

and after inter-turn fault. The winding has 4 concentric<br />

coils each with 90 turns in healthy condition. In the faulty<br />

case 14 turns from the outer coils are short-circuited.<br />

Figure 1 also shows the turn function <strong>of</strong> the shorted turns<br />

(phase 'd'). As an example, Figure 2 shows the variations<br />

<strong>of</strong> the self inductance <strong>of</strong> the phase 'd' (Ldd) by the<br />

variations <strong>of</strong> Ksat and f, which has been calculated using<br />

the proposed analytical equations. As seen, by increasing<br />

Ksat and Ldd more variations occur around its decreasing<br />

mean value. The increase <strong>of</strong> g' and by increasing Ksat<br />

are respectively the reasons for the decreasing mean<br />

value and increasing uctuations amplitude <strong>of</strong> the<br />

inductance. A complete rotation <strong>of</strong> f causes two<br />

complete cycles <strong>of</strong> variation for Ldd, which is due to the<br />

two poles <strong>of</strong> the SCIM.<br />

sat<br />

- 221 - 15th IGTE Symposium 2012<br />

Figure 1: Turn function <strong>of</strong> phase 'a' winding: a) before and b) after<br />

inter-turn fault with 14 shorted turns in the outer coil and c) turn<br />

function <strong>of</strong> phase 'd' after the fault occurrence<br />

Figure 2: Phase 'd' self inductance (Ldd) variation versus Ksat and f<br />

III. DETERMINING K SAT AND F IN FAULTY SCIM DURING<br />

SIMULATION STEPS<br />

Generally, the air gap flux density distribution is<br />

disturbed in the faulty SCIMs and this leads to an error in<br />

the use <strong>of</strong> the technique for determining f [9]. In<br />

addition, Ksat was determined using the air gap voltage,<br />

which depends on the rotation speed <strong>of</strong> the air gap flux as<br />

well as its amplitude, while the saturation degree depends<br />

only on the flux density amplitude. This bring difficulty<br />

when applying the model in variable-speed drives<br />

systems, as the air gap voltage before saturation is no<br />

more a constant, but varies by the reference speed<br />

variation.<br />

When simulating SCIM using the CCM or SCCM,<br />

flux-linkages <strong>of</strong> all the rotor meshes are evaluated within<br />

any simulation step. Since any rotor mesh consists <strong>of</strong><br />

only one turn, these flux-linkages are the total fluxes<br />

passing through the meshes. Considering the short air<br />

gap length and ignoring the small flux leakages, the air<br />

gap flux density next to any rotor mesh i can be estimated<br />

as:<br />

Bai i<br />

A<br />

(5)<br />

where i is the flux-linkage <strong>of</strong> mesh i and A is the area<br />

above the mesh in the air gap:<br />

A 2rl<br />

R<br />

(6)<br />

where R is the rotor bars number. Therefore, the air gap<br />

flux density distribution is estimated within any<br />

simulation step. Then, using Fourier series analysis, the<br />

space harmonics <strong>of</strong> the air gap flux density is determined<br />

as follows:<br />

1 2<br />

R 1<br />

i1<br />

Bsn<br />

B(<br />

) sin( nP<br />

) d<br />

Bai<br />

sin( )<br />

0<br />

nP<br />

d<br />

<br />

<br />

i<br />

(7)<br />

B<br />

cn<br />

1<br />

<br />

np<br />

<br />

<br />

R<br />

<br />

i1<br />

B [cos( nP<br />

) cos( nP<br />

) ]<br />

ai<br />

i1<br />

R<br />

1<br />

Bai[sin(<br />

nPi<br />

1)<br />

sin( nPi<br />

)]<br />

np<br />

i1<br />

where Bsn and Bcn are the sine and cosine components <strong>of</strong><br />

the nth space harmonic <strong>of</strong> the estimated air gap flux<br />

density respectively, is the angle in the rotor reference<br />

and i the angle <strong>of</strong> center <strong>of</strong> the rotor bar i. Then, the<br />

phase angle <strong>of</strong> the space harmonics (in electrical<br />

i1<br />

R<br />

1 2<br />

1<br />

i1<br />

B(<br />

)<br />

cos( nP)<br />

d<br />

Bai<br />

0<br />

<br />

i<br />

i 1<br />

i<br />

cos( nP)<br />

d<br />

(8)


adians) and their amplitudes can be calculated as follows<br />

respectively:<br />

tan ( )<br />

1 Bsn<br />

n <br />

(9)<br />

B<br />

cn<br />

2<br />

Bsn<br />

2<br />

Bn Bcn<br />

<br />

(10)<br />

Having the phase angle <strong>of</strong> the fundamental harmonic<br />

(1), f could be estimated by:<br />

1<br />

<br />

f r<br />

<br />

(11)<br />

P 2P<br />

Also using the amplitude <strong>of</strong> the fundamental harmonic,<br />

Ksat obtained by:<br />

1 0 B B Ksat (12)<br />

where B0 is related to the flux density <strong>of</strong> knee (Bkp) <strong>of</strong> the<br />

core material within the teeth.<br />

These modifications in f and Ksat estimations make<br />

the SCCM applicable to the simulation <strong>of</strong> mains-/driveconnected<br />

SCIMs under the inter-turn fault. However,<br />

knee point is not a distinct point on the magnetization<br />

characteristic and there is not precise analytic method to<br />

determine the Carter's coefficient (kc). Therefore, Genetic<br />

Algorithm (GA) is used for the optimal estimation <strong>of</strong> the<br />

required B0 and kc in the next section.<br />

IV. ESTIMATION OF B0 AND K C<br />

GA is a heuristic searching method for the optimal<br />

solution based on mechanics <strong>of</strong> natural selection and<br />

natural genetics. It evolves into new generations <strong>of</strong><br />

individuals by using knowledge from the previous<br />

generations and generally includes three fundamental<br />

genetic operations <strong>of</strong> reproduction, crossover and<br />

mutation. The searching process is independent <strong>of</strong> the<br />

form <strong>of</strong> the objective function, and will not be trapped in<br />

the rapid descending direction introduced by the local<br />

optimum solutions. The solution <strong>of</strong> a complex problem<br />

can be started with weak initial estimations and then be<br />

corrected in evolutionary process <strong>of</strong> fitness. Figure 3<br />

shows a flowchart <strong>of</strong> the applied GA. More details about<br />

GA can be found in [11].<br />

To use GA for estimating B0 and kc in the proposed<br />

SCIM, a proper objective (fitness) function must be<br />

defined. Such fitness function may be achieved by<br />

closely fitting the magnetization characteristic (i.e. the<br />

no-load voltage versus no-load current curve) <strong>of</strong> the<br />

SCIM obtained from SCCM to that obtained by<br />

experiments. To do so, the no-load stator RMS line<br />

currents are measured in the laboratory with n different<br />

stator line voltages up to<br />

- 222 - 15th IGTE Symposium 2012<br />

Reproduction<br />

No<br />

Figure 3: Flowchart <strong>of</strong> the applied Genetic Algorithm<br />

Figure 4: Convergence rate <strong>of</strong> the algorithm<br />

the nominal voltage (I e ai, I e bi, I e ci, for i=1,2,…,n). Then,<br />

for any distinct values <strong>of</strong> B0 and kc, corresponding line<br />

currents with the same stator voltages are obtained by<br />

simulation (I s ai, I s bi, I s ci, for i=1,2,…,n). Now, the fitness<br />

function is defined as follows:<br />

n<br />

<br />

i1<br />

e<br />

ai<br />

s 2<br />

ai<br />

e<br />

bi<br />

Start<br />

Select variables<br />

(Solution space)<br />

Construct initial<br />

population randomly<br />

Calculate the fitness for<br />

each population<br />

Apply crossover<br />

Apply mutation<br />

Ending<br />

condition<br />

reached?<br />

Select the best population<br />

End<br />

Yes<br />

s 2<br />

bi<br />

Fit.<br />

(( I I ) ( I I ) ( I I ) ) (13)<br />

e<br />

ci<br />

s 2<br />

ci<br />

The lower the fitness, the better will be the estimation <strong>of</strong><br />

B0 and kc. With a population size <strong>of</strong> 15, GA converges to<br />

the required solution after about 130 iterations. Figure 4<br />

shows the convergence rate <strong>of</strong> the algorithm. The<br />

optimum values for B0 and kc are 0.5007 and 1.2058<br />

respectively. Figure 5 compares the<br />

magnetization characteristics obtained from the<br />

experiment and SCCM with optimal B0 and kc. Good<br />

agreement between the simulated and experimental<br />

results is evident.<br />

V. EXPERIMENTAL TEST RIG<br />

A test rig consisting <strong>of</strong> a 750 W, 380 V, 50 Hz, 2-pole,<br />

Y-connected SCIM was set up in the laboratory. Three-


Figure 5: Magnetization Characteristic obtained from simulation ()<br />

and experiments (---).<br />

Figure 6: Photograph <strong>of</strong> the test rig.<br />

phase windings <strong>of</strong> the stator <strong>of</strong> the SCIM removed and<br />

replaced by similar windings with various taps taken out<br />

from different turns <strong>of</strong> the phase ‘a’ winding. Inter-turn<br />

fault with variable number <strong>of</strong> shorted turns is produced in<br />

the SCIM by connecting any two <strong>of</strong> the taps. The motor<br />

is mechanically coupled with a magnetic powder brake to<br />

produce adjustable mechanical load. A digital scopemeter<br />

is used for sampling the line currents <strong>of</strong> the SCIM<br />

[12]. Two independent current or voltage signals can be<br />

sampled and recorded with 5000 samples per second.<br />

Figure 6 shows a photograph <strong>of</strong> the test rig.<br />

VI. SIMULATION AND ANALYSIS<br />

The proposed SCIM is simulated using the<br />

developed SCCM under various loads, supply and fault<br />

conditions and corresponding tests are performed on the<br />

real SCIM in the laboratory. The results are presented,<br />

compared and analyzed in this section. Figure 7 shows<br />

the variation <strong>of</strong> f with time in an interval between - to <br />

obtained during a simulation by the method introduced in<br />

the Section III. Constant slop <strong>of</strong> the variation <strong>of</strong> f is due<br />

to the constant speed <strong>of</strong> the air gap rotating magnetic<br />

field (the synchronous speed).<br />

Figure 8 shows the normalized spectra <strong>of</strong> the stator<br />

line current in the faulty SCIM with 21 shorted turns<br />

under no load. As seen, all the even/odd harmonics are<br />

present in the experimental and SCCM result but not in<br />

the CCM result. The amplitude <strong>of</strong> the 3 rd harmonic <strong>of</strong> the<br />

stator line current (150 Hz) might be considered as an<br />

index for the inter-turn fault [13], [14]. As seen, this<br />

amplitude in the SCCM result is very closer to the<br />

experimental result than that in the CCM result.<br />

The stator negative-sequence current component at the<br />

fundamental frequency is one <strong>of</strong> the old indexes<br />

introduced to diagnose the inter-turn fault [15], [16]. In<br />

the healthy symmetrical SCIM with the balanced threephase<br />

supply, the negative-sequence current is zero.<br />

- 223 - 15th IGTE Symposium 2012<br />

Figure 7: Simulated time variations <strong>of</strong> f<br />

Figure 8: Normalized spectra <strong>of</strong> stator line current under no load with<br />

21 shorted turns obtained by: a) experiment, b) SCCM and c) CCM<br />

However, the inter-turn fault quickly increases this<br />

current component. To determine the negative sequence<br />

current at the required frequency, the related line current<br />

phasors <strong>of</strong> the stator are obtained first by using the<br />

sampled currents and the Fourier algorithm which is<br />

conventional in the field <strong>of</strong> the digital protection [17].<br />

Then, using the line current phasors, the negative<br />

sequence current phasor is determined [15]. Knowing the<br />

amplitude, phase angle and frequency <strong>of</strong> the negative<br />

sequence current, its waveform can also be sketched. For<br />

the proposed SCIM with 21 shorted turns under full load,<br />

the negative-sequence currents obtained through the<br />

simulation and experiments have been shown in Figure 9.<br />

As seen, the saturation effect, introduced by the SCCM,<br />

increases the amplitude <strong>of</strong> the current in order to<br />

approach the experimental results<br />

Simulations and experiments on the proposed SCIM<br />

with 14 and 21 shorted turns were repeated under<br />

various load levels. Table I compares the attained<br />

amplitudes <strong>of</strong> the stator negative sequence current at the<br />

fundamental frequency. As seen, the negative sequence<br />

current increases by increasing the fault degree, while the<br />

load level change has negligible impact on the current.<br />

Also, the SCCM results follow the experimental results<br />

more<br />

closely than the CCM results.<br />

However, any negative sequence component in the<br />

TABLE. I


Current<br />

(mA)<br />

- 224 - 15th IGTE Symposium 2012<br />

AMPLITUDE OF STATOR NEGATIVE SEQUENCE CURRENT UNDER VARIOUS LOAD LEVELS<br />

Faulty SCIM with 14 short-circuited turns Faulty SCIM with 21 short-circuited turns<br />

No<br />

load<br />

20%<br />

rated<br />

load<br />

40%<br />

rated<br />

load<br />

60%<br />

rated<br />

load<br />

80%<br />

rated<br />

load<br />

Full<br />

load<br />

No<br />

load<br />

20%<br />

rated<br />

load<br />

40%<br />

rated<br />

load<br />

60%<br />

rated<br />

load<br />

80%<br />

rated<br />

load<br />

Experimental 470 478 481 482 489 494 698 690 690 712 726 730<br />

SCCM 467 490 480 456 462 487 693 700 668 701 702 702<br />

CCM 413 457 438 411 422 430 619 623 609 647 648 650<br />

Figure 9: Stator negative sequence current obtained by a) experiment, b)<br />

SCCM and c) CCM in faulty SCIM with 21 shorted turns under fullload.<br />

mains voltage, which is permissible to some small extent<br />

in real mains, produces negative sequence current in the<br />

stator <strong>of</strong> the healthy SCIM. Simulation result indicates<br />

that with 2% negative sequence in the stator voltage <strong>of</strong><br />

the healthy SCIM, the negative sequence current changes<br />

by 0.54% from no-load to full-load, by 1.86% from nonsaturable<br />

(CCM) to saturable motor (SCCM) and by<br />

6.65% from healthy to single-turn shorted condition (the<br />

weakest fault), while changing the negative sequence<br />

voltage from 2% to 5% changes the negative sequence<br />

current by 148.8% in the healthy SCIM. Therefore, the<br />

negative sequence current as an inter-turn fault index is<br />

highly sensitive to the voltage imbalance level, which is<br />

not pleasing as shown in Figure 10.<br />

The negative sequence apparent impedance <strong>of</strong> the<br />

SCIM is also used as an index to diagnose the stator<br />

inter-turn faults [18], [19]. This impedance is the ratio <strong>of</strong><br />

the stator negative sequence voltage phasor to its<br />

negative sequence current phasor. Figure 11 shows the<br />

similar results with Figure 10 for this index which is<br />

obtained using simulation. As seen, this index has very<br />

smaller sensitivity to the voltage imbalance and the<br />

magnetic saturation is the most effective lateral factor<br />

affecting this index.<br />

VII. CONCLUSION<br />

The flux-linkages <strong>of</strong> the rotor meshes, calculated in<br />

every simulation step <strong>of</strong> the CCM for the SCIM was used<br />

to estimate the air gap flux density distribution. Then,<br />

space harmonic components <strong>of</strong> the air gap flux density<br />

were determined using Fourier series analysis. The phase<br />

angle <strong>of</strong> the space fundamental harmonic was utilized to<br />

locate the air gap flux density during simulation <strong>of</strong> the<br />

faulty SCIMs. Also, the amplitude <strong>of</strong> this fundamental<br />

harmonic is applicable to evaluate the saturation factor<br />

more reasonably. Therefore, a saturable CCM was<br />

Full<br />

load<br />

Figure 10: Sensitivity <strong>of</strong> stator negative sequence current to the weakest<br />

fault (single-turn), saturation, voltage imbalance and load level,<br />

evaluated by simulations.<br />

Figure 11: Sensitivity <strong>of</strong> the negative sequence apparent impedance <strong>of</strong><br />

the SCIM to weakest fault (single-turn), saturation, voltage imbalance<br />

and load level, evaluated by simulations.<br />

developed which is capable to analyze faulty SCIMs.<br />

Comparing the simulation results with the corresponding<br />

experimental results indicates that the saturable model is<br />

more precise than the non-saturable model. Further study<br />

showed that the magnetic saturation affects the inter-turn<br />

fault indices more than the load level and the stator<br />

voltage imbalance.<br />

APPENDIX<br />

The indefinite integral <strong>of</strong> the inverse air gap function<br />

(gf -1 ) is determined first as follows:<br />

f ( ,<br />

, K<br />

f<br />

sat<br />

<br />

1<br />

) g f ( ,<br />

f , K sat ) d<br />

1<br />

cos( 2P(<br />

)) <br />

1<br />

<br />

cos <br />

f <br />

<br />

<br />

<br />

2 <br />

2 P g 1 <br />

1 cos( 2P(<br />

f ))<br />

<br />

(A1)<br />

Then, the analytical equation for the inductances <strong>of</strong> the<br />

`stator windings is obtained as:<br />

L<br />

x y<br />

m<br />

o<br />

r l<br />

nx<br />

( ti )[ n y ( ti ) f s ( f , K sat )]<br />

(A2)<br />

i1<br />

[ f ( <br />

i1<br />

, , K<br />

f<br />

sat<br />

) f ( , , K<br />

where x and y accounts for the stator phases, m is the<br />

number <strong>of</strong> the stator slots, i is the angle <strong>of</strong> center <strong>of</strong> the<br />

stator slot i, ti is the angle <strong>of</strong> center <strong>of</strong> the stator tooth<br />

after the stator slot i, and fs is:<br />

2<br />

g<br />

1<br />

m<br />

fs ( f , Ksat<br />

) ny<br />

( ti)<br />

[ f ( i1,<br />

f , Ksat<br />

) f ( i,<br />

f , K<br />

2<br />

i1<br />

i<br />

f<br />

sat<br />

)]<br />

sat<br />

)]<br />

(A3)


Equation (A2) is independent <strong>of</strong> r and its derivative<br />

versus r is zero for all x and y. For the rotor meshes the<br />

inductance equation is:<br />

Luv o r l [ C fr<br />

( f , K sat )] [ f ( x1,<br />

f , K sat ) f ( x,<br />

f , K sat )] (A4)<br />

now u and v accounts for the rotor meshes 1 to R, f is f<br />

in the rotor reference (f = f - r), u is the angle <strong>of</strong> center<br />

<strong>of</strong> the rotor bar number u in the rotor reference, C=1 for<br />

u = v and C=0 otherwise and fr is:<br />

2<br />

g<br />

1<br />

<br />

fr ( f , K sat ) [ f ( y1,<br />

f , K sat ) f ( y , f , K<br />

2<br />

sat<br />

)]<br />

- 225 - 15th IGTE Symposium 2012<br />

(A5)<br />

Equations (A4)-(A5) depend on r because <strong>of</strong> f.<br />

Considering the relationship between these two variables<br />

yields Luv/r= -Luv/f and thus:<br />

Luv<br />

or<br />

l[<br />

C fr<br />

( f , K<br />

<br />

r<br />

where:<br />

f<br />

r ( f , K<br />

o<br />

rl<br />

<br />

f<br />

sat<br />

f<br />

( u1,<br />

f , Ksat)<br />

f<br />

( u,<br />

f , Ksat)<br />

)] [<br />

<br />

]<br />

<br />

<br />

sat<br />

<br />

)]<br />

[ f ( , , K ) f ( , , K<br />

u1<br />

f<br />

f<br />

sat<br />

u<br />

f<br />

f<br />

sat<br />

)]<br />

(A6)<br />

f r ( f , Ksat<br />

) g<br />

<br />

<br />

f<br />

2<br />

1<br />

<br />

2<br />

f<br />

( y 1,<br />

f , Ksat<br />

) f<br />

( y , f , Ksat<br />

)<br />

[<br />

<br />

]<br />

<br />

f<br />

<br />

f<br />

(A7)<br />

f ( i , f , Ksat<br />

)<br />

1<br />

<br />

; i x1,<br />

x , y1,<br />

y<br />

<br />

f g[<br />

1<br />

cos( 2P(<br />

i <br />

f ))]<br />

(A8)<br />

For the mutual inductances between the rotor meshes and<br />

stator phases the following equation is obtained:<br />

k2<br />

1<br />

i1<br />

<br />

i<br />

Lmn<br />

o<br />

r l<br />

[ nn<br />

( ) f s ( f , K sat )]<br />

ik<br />

2<br />

1<br />

(A9)<br />

[ f ( , , K ) f ( , , K )]<br />

i<br />

f<br />

sat<br />

i1<br />

where x and y account for the rotor meshes and stator<br />

phases respectively, k1-1 and k2+1 are the angles <strong>of</strong> the<br />

two bars <strong>of</strong> mesh x in the stator reference, k1 to k2 are the<br />

stator slots between k1-1 and k2+1 and e.g. k1 is the angle<br />

<strong>of</strong> the stator slot k1. Now k1-1 and k2+1 are responsible to<br />

the dependency <strong>of</strong> inductances on r. The derivatives <strong>of</strong><br />

the two mentioned parameters versus r are equal to 1,<br />

while the derivatives <strong>of</strong> the other parameters in (A9)<br />

versus r are zero. Using these facts and some<br />

differentiation rules leads to:<br />

Lx<br />

y<br />

<br />

r<br />

<br />

o r l<br />

k1<br />

<br />

k11<br />

[ n y ( ) f s ( f , K sat )]<br />

2<br />

g[<br />

1<br />

cos( 2P(<br />

k11<br />

<br />

f ))]<br />

<br />

o<br />

r l<br />

k 21<br />

<br />

k 2<br />

[ n y ( ) f s ( f , K sat )]<br />

2<br />

g[<br />

1<br />

cos( 2P(<br />

<br />

))]<br />

k 21<br />

REFERENCES<br />

f<br />

f<br />

sat<br />

(A10)<br />

[1] X. Luo, Y. Liao, H. A. Toliyat, A. El-Antably and T. A. Lipo,<br />

“Multiple coupled circuit modeling <strong>of</strong> induction machines,”<br />

IEEE Trans. Ind. Applications, vol. 31, pp. 311 - 318,<br />

March/April 1995.<br />

[2] A. Raie, and V. Rashtchi, "Using a genetic algorithm for<br />

detection and magnitude determination <strong>of</strong> turn faults in an<br />

induction motor", Springer-Verlag, vol. 84, pp. 275–279, August<br />

2002.<br />

[3] S. Nandi, “Detection <strong>of</strong> stator faults in induction machines using<br />

residual saturation harmonics,” IEEE Trans. Ind. Applications,<br />

vol. 42, no. 5, pp. 1201 - 1208, 2006.<br />

[4] J. C. Moreira and T.A. Lipo, “Modeling <strong>of</strong> saturated ac machines<br />

including air gap flux harmonic components,” IEEE Trans. Ind.<br />

Applications, vol. 28, pp. 343 - 349, March/April 1992.<br />

[5] D. Bispo, L. M. Neto, J. T. Resende and D. A. Andrade, “A new<br />

strategy for induction machine modeling taking into account the<br />

magnetic saturation ,” IEEE Trans. Ind. Applications, vol. 37, no.<br />

6, pp. 1710 - 1719, Nov./Dec. 2001.<br />

[6] T. Tuovinen, M. Hinkkanen, and J. Luomi, “Modeling <strong>of</strong><br />

saturation due to main and leakage fux interaction in induction<br />

machines,” IEEE Trans. Ind. Applications, vol. 46, no. 3, pp.<br />

937 - 945, 2010.<br />

[7] Tu Xiaoping, L.-A. Dessaint, R. Champagne, and K. Al-Haddad,<br />

“Transient modeling <strong>of</strong> squirrel-cage induction machine<br />

considering air-gap flux saturation harmonics,” IEEE Trans. Ind.<br />

Electronics, vol. 55, no. 7, pp. 2798 - 2809, 2008.<br />

[8] S. Nandi, “A detailed model <strong>of</strong> induction machines with<br />

saturation extendable for fault analysis,” IEEE Trans. Ind.<br />

Applications, vol. 40, pp. 1302 - 1309, September/October 2004.<br />

[9] M. Ojaghi and J. Faiz, “Extension to multiple coupled circuit<br />

modeling <strong>of</strong> induction machines to include variable degrees <strong>of</strong><br />

saturation effects,” IEEE Trans. Magn., vol. 44, no. 11, pp.<br />

4053-4056, Nov. 2008.<br />

[10] J. Faiz, and I. Tabatabaei, "Extension <strong>of</strong> winding function theory<br />

for nonuniform air gap in electric machinery," IEEE Trans. Magn.,<br />

vol. 38, pp. 3654-3657, November 2002.<br />

[11] Z. Michalewicz, Genetic Algorithms & Data Structures, Evalution<br />

Programs, Springer-Verlag, 1992.<br />

[12] Fluke 196c/199C Scope-Meter User's Manual, Fluke<br />

Corporation, Oct. 2001, Netherlands.<br />

[13] G. Joksimovic, J. Penman, “The detection <strong>of</strong> interturn short<br />

circuits in the stator windings <strong>of</strong> operating motors,” IEEE Trans<br />

Ind. Electronics, vol. 47, no.5, pp.1078–1084, Oct. 2000.<br />

[14] J.H. Jung, J.J. Lee, and B.H. Kwon, “Online diagnosis <strong>of</strong><br />

induction motors using MCSA,” IEEE Trans. Ind. Electronics,<br />

vol. 53, no. 6, pp. 1842–1852, Dec. 2006.<br />

[15] A.Bellini, F.Filippetti, C.Tassoni, G.A.Capolino, “Advances in<br />

Diagnostic Techniques for Induction Machines,” IEEE Trans.<br />

Industrial Electronics, vol.55, no12, pp. 4109-4126, Dec. 2008.<br />

[16] Wu Qing, and S. Nandi, “Fast single-turn sensitive stator interturn<br />

fault detection <strong>of</strong> induction machines based on positive- and<br />

negative-sequence third harmonic components <strong>of</strong> line currents,”<br />

IEEE Trans. Ind. Applications, vol. 46, pp. 974 - 983, 2010.<br />

[17] A.T.Johns and S.K. Salman, Digital Protection for Power<br />

Systems. IEE Power series 15, London, UK 1995.<br />

[18] J.L. Kohler, J. Sottile, and F.C. Trutt, “Condition monitoring <strong>of</strong><br />

stator windings in induction motors: I. Experimental<br />

investigation on effective negative-sequence impedance<br />

detector,” IEEE Trans. Ind. Application, vol. 38, pp. 1447–1453,<br />

2002.<br />

[19] L. Sang Bin, R.M. Tallam, and T.G. Habetler, “A robust, on-line<br />

turn-fault detection technique for induction machines based on<br />

monitoring the sequence component impedance matrix,” IEEE<br />

Trans. Power Electronics, vol. 18, pp. 865–872, 2003.


- 226 - 15th IGTE Symposium 2012<br />

Accurate Magnetostatic Simulation <strong>of</strong> Step-Lap<br />

Joints in Transformer Cores Using Anisotropic<br />

Higher Order FEM<br />

A. Hauck∗ , M. Ertl † ,J.Schöberl ‡ and M. Kaltenbacher §<br />

∗ SIMetris GmbH, Erlangen, Germany † SIEMENS Transformers, Nuremberg, Germany<br />

‡ Institute for Analysis and Scientific Computing, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />

§ Institute <strong>of</strong> Mechanics and Mechatronics, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />

E-mail: andreas.hauck@simetris.de<br />

Abstract—We present a simulation scheme for the accurate simulation <strong>of</strong> thin magnetic structures, specifically the nonlinear<br />

magnetic flux distribution in a core step-lap joint with interest in the local saturation near the air gaps. Due to the high<br />

aspect ratio <strong>of</strong> the model, we utilize hierarchical higher order finite elements, where the polynomial degree is spatially<br />

adapted to resolve the flux distribution within the steel sheets. The deterioration <strong>of</strong> convergence in the iterative conjugate<br />

gradient (CG) solver is handled by an anisotropic Schwarz-type block preconditioner, grouping the unknowns depending<br />

on the aspect ratio <strong>of</strong> the elements. The resulting Newton scheme can optionally be accelerated by a 2-step solution strategy,<br />

where a start value is computed on a coarse subspace <strong>of</strong> lowest order in analogy to a full multigrid scheme.<br />

Index Terms—Step-Lap Joints, Higher Order Finite Elements, Block Preconditioner, Nonlinear Solver<br />

I. INTRODUCTION<br />

Precise knowledge about the accurate flux distribution<br />

in transformer cores is both important for reducing the<br />

magnetic losses, as well as for localizing sources for<br />

forces (magnetostriction, interlaminar forces). In recent<br />

years significant reduction <strong>of</strong> both effects was achieved<br />

using the multi-step-lap technique (see Fig. 1 and 2),<br />

where the overlap region <strong>of</strong> transformer sheets is shifted<br />

in several steps [1]. In order to optimize the layout<br />

further, a detailed simulation <strong>of</strong> the fluxes, including the<br />

nonlinear B-H curve <strong>of</strong> the core has to be performed.<br />

Accurate simulation <strong>of</strong> such a problem poses some diffi-<br />

Fig. 1. Sketch <strong>of</strong> transformer core<br />

with step-lap corners.<br />

A<br />

A<br />

View A – A<br />

Fig. 2. Magnetic flux concentration<br />

(45 ◦ -view).<br />

culties, as the thickness <strong>of</strong> the steel sheets is typically in<br />

the range <strong>of</strong> 200 - 300 μm, while the length can be a few<br />

meters and the number <strong>of</strong> vertically stacked sheets add up<br />

to some thousand layers. In addition, the flux variation is<br />

very high in the vicinity <strong>of</strong> the air gaps in the corner, but<br />

rather smooth in some distance away from it, making it<br />

difficult to resolve accurately. Furthermore, the nonlinear<br />

permeability <strong>of</strong> the grain-oriented electrical steel sheets<br />

has to be taken into account and modeled correctly.<br />

Within this paper we solve the nonlinear magnetostatic<br />

problem using finite elements <strong>of</strong> higher order, together<br />

with an iterative preconditioned conjugate gradient (CG)<br />

method. By exploiting the special structure <strong>of</strong> the Finite<br />

Element (FE) basis [2], we can build an effective<br />

preconditioner for handling elements with high aspect<br />

ratios, while at the same time the spatial accuracy can be<br />

adapted to the discretization <strong>of</strong> the model. The nonlinear<br />

Newton scheme can be further accelerated by calculating<br />

a good initial start value on a coarse sub-space. Finally,<br />

the applicability <strong>of</strong> the method is demonstrated for the<br />

aforementioned step-lap core model.<br />

II. NONLINEAR MAGNETOSTATIC FORMULATION<br />

The nonlinear magnetostatic problem can be written in<br />

terms <strong>of</strong> the magnetic vector potential A as<br />

∇×ν(|∇ × A|)(∇×A) =J + ∇×νB0<br />

B = ∇×A , (1)<br />

where ν(|∇ × A|) is the nonlinear reluctivity (e.g. <strong>of</strong><br />

steel), J the impressed current density (e.g. <strong>of</strong> a coil)<br />

and B0 an additional prescribed flux density. As the<br />

impressed current density J and the curl <strong>of</strong> the prescribed<br />

flux density B0 are assumed to be divergence free, a<br />

unique solution can either be guaranteed by enforcing<br />

the Coulomb gauge<br />

∇·A =0 (2)<br />

explicitly (leading to a mixed formulation) or by adding<br />

a small regularization term αA to (1), where α ≈ 10 −6 ν<br />

[2]. We will modify this strategy in Section III.<br />

For physical reasons, the vector potential A is only<br />

continuous in the tangential part, which requires the use<br />

<strong>of</strong> H(curl)-conforming vectorial elements, which will be<br />

introduced in Sec. III.


The nonlinear problem is solved using a Newton<br />

formulation<br />

F ′ (Ak)[ΔA] =−F(Ak) (3)<br />

Ak+1 = Ak + ηΔA . (4)<br />

Improvement <strong>of</strong> convergence is achieved by computing<br />

a line search parameter η ∈ ]0, 1], which is determined<br />

by minimizing the residual <strong>of</strong> (3). The magnetic B-H<br />

commutation curve is extracted from measured hysteresis<br />

curves, reducing the problem to a simplified nonlinear<br />

one, which is given in terms <strong>of</strong> measured (Bi,Hi)-value<br />

pairs. By applying a C 1 -spline approximation, a smooth<br />

monotone approximation is calculated, from which the<br />

reluctivity ν and its derivative ν ′ (entering the Fréchet<br />

derivative F ′ ) can be derived (see Fig. 3). For details we<br />

refer to [3].<br />

Fig. 3. Approximation <strong>of</strong> measured B-H-data by C 1 -splines.<br />

III. HIGHER ORDER FINITE ELEMENTS<br />

The two key requirements for our choice <strong>of</strong> higher<br />

order H(curl)-conforming FE shape functions are the<br />

ability to choose the polynomial degree p independently<br />

in each local direction (ξ,η,ζ), as well as the availability<br />

<strong>of</strong> efficient iterative solution techniques (i.e. an efficient<br />

preconditioner). The first requirement leads to the use<br />

<strong>of</strong> hierarchical shape functions, where we we utilize the<br />

hierarchical shape functions <strong>of</strong> [2], which can be written<br />

as<br />

N(T )= <br />

N 0 E ⊕ <br />

N ∇ E ⊕ <br />

NF ⊕<br />

E<br />

E<br />

F<br />

N ∇ F ⊕ NI ⊕ N ∇ I<br />

The shape functions N are composed <strong>of</strong> unknowns<br />

defined on edges, faces and in the interior (subscripts<br />

E, F and I), see Fig. 4 for a hexahedral element.<br />

The lowest order Nédélec functions N0 E , which have a<br />

pζ pη pξ E 5<br />

E 12<br />

F 2<br />

E 4<br />

E 8<br />

E 1<br />

ζ<br />

η<br />

ξ<br />

F 1<br />

F 6<br />

E 9<br />

E 11<br />

E 3<br />

E 6<br />

E 10<br />

E 2<br />

F4 E7 Fig. 4. Degrees <strong>of</strong> freedom for the hexahedral element.<br />

- 227 - 15th IGTE Symposium 2012<br />

(5)<br />

constant tangential component along one edge (p = 0),<br />

are explicitly included. In addition, higher order gradient<br />

components on edges N∇ E , faces N∇F and in the interior<br />

N∇ I are represented separately. This key feature - also<br />

known as the local exact sequence property - is equivalent<br />

to fulfilling the so-called De-Rham complex<br />

R id<br />

−→ H 1 (Ω) grad<br />

−→ H(curl,Ω)<br />

curl<br />

−→ H(div,Ω) div<br />

0<br />

−→ L2(Ω) −→ {0} (6)<br />

already on the finite element level, i.e. the gradients,<br />

forming the null-space <strong>of</strong> the curl-operator, can be completely<br />

omitted for each type <strong>of</strong> unknowns (edge, face,<br />

interior) separately if only the flux density B is <strong>of</strong><br />

interest. In [2], this is denoted as the reduced basis and<br />

can be used to gauge the problem in the following way:<br />

• For the lowest order Nédélec functions N0 E , we add<br />

the regularization term α as described in Sec. I.<br />

• For the higher order terms, we simply skip the<br />

gradient functions N∇ E , N∇F and N∇I .<br />

Another unique advantage <strong>of</strong> this basis according to [9]<br />

is that shape functions <strong>of</strong> arbitrary order are available<br />

for all types <strong>of</strong> elements in 2-D and 3-D, utilizing any<br />

kind <strong>of</strong> hierarchical 1-D shape functions, e.g. Legendre<br />

or Gegenbauer.<br />

A. Anisotropic Adapted Polynomial Degree<br />

In general the magnetic flux density B is defined as<br />

⎛<br />

B = ⎝ Bx<br />

⎞<br />

⎛<br />

⎞<br />

∂Az ∂Ay<br />

∂y − ∂z<br />

By ⎠ ⎜ ∂Ax ⎟<br />

= ∇×A = ⎝<br />

⎠ . (7)<br />

Bz<br />

∂z<br />

∂Ay<br />

∂x<br />

∂Az<br />

− ∂x<br />

∂Ax<br />

− ∂y<br />

In case <strong>of</strong> thin structures, the in-plane components<br />

(Bx,By) are dominant (see Fig. 5). Additionally, the<br />

variation <strong>of</strong> the in-plane components <strong>of</strong> A in z-direction<br />

( ∂Ax ∂Ay<br />

, ) in (7) is already resolved accurately by the<br />

∂z<br />

∂z<br />

FE-discretization in thickness direction <strong>of</strong> the single sheet<br />

, i.e.<br />

layers. Thus the dominant terms left are ∂Az<br />

∂y<br />

A (z)<br />

z<br />

z<br />

y<br />

x<br />

z<br />

B (x)<br />

y<br />

A (x)<br />

z<br />

A (x)<br />

z<br />

x<br />

and ∂Az<br />

∂x<br />

Fig. 5. Flux / potential distribution in face on (x, z)-plane.<br />

the magnetic vector potential A should be approximated<br />

quite accurately in the in-plane direction.<br />

Therefore, it is advantageous to reflect his behavior in<br />

the anisotropic polynomial degree as<br />

pη,pξ >pζ , (8)<br />

assuming that the global z and the local ζ direction<br />

coincide. The increase <strong>of</strong> the polynomial degree only<br />

affects the face NF and inner NI degrees <strong>of</strong> freedoms,


as we skip higher order gradient functions N ∇ E and N∇ F<br />

due to gauging. This leads to practical order templates<br />

like paniso =(2, 2, 1) or paniso =(3, 3, 1). Although it<br />

seems that the lowest order anisotropic template should<br />

be paniso =(1, 1, 0), this does not lead to more accurate<br />

results, as only faces in (x, y)-direction get additional<br />

unknowns, which do not contribute to an improved<br />

resolution <strong>of</strong> Az.<br />

As the permeability in air μ0 is typically several<br />

orders <strong>of</strong> magnitude smaller compared to the one in the<br />

ferromagnetic core, the flux is mostly concentrated in<br />

the core. This allows us the choose a small isotropic<br />

polynomial degree <strong>of</strong> pair = 0 for the approximation.<br />

The last step is especially effective, if structured grids<br />

are utilized or if the air domain is significantly large.<br />

B. Single Step Iterative Solution Scheme<br />

The resulting system <strong>of</strong> equations after FE discretization<br />

can be written as<br />

with<br />

⎛<br />

⎝<br />

KN0N0 KN0F KN0I<br />

KF N0 KFF KFI<br />

KIN0 KIF KII<br />

K(A)A = f , (9)<br />

⎞ ⎛<br />

⎠ ⎝<br />

AN0<br />

AF<br />

AI<br />

⎞ ⎛<br />

⎠ = ⎝<br />

fN0<br />

fF<br />

fI<br />

- 228 - 15th IGTE Symposium 2012<br />

⎞<br />

⎠ .<br />

(10)<br />

The interior unknowns AI can be eliminated by static<br />

condensation as<br />

AI = K −1<br />

II (fI − KIN0 AN0 − KIFAF ) , (11)<br />

where K −1<br />

II can be inverted on the element level. Substituting<br />

this result back in (10) results in the reduced<br />

system<br />

<br />

ˆKN0N0 ˆKN0F<br />

ˆfN0<br />

AN0 = , (12)<br />

ˆKF<br />

ˆKFF AF ˆfF<br />

N0<br />

with the modified matrices and RHS vectors as<br />

−1<br />

= KN0N0 − KN0I(KII )KIN0<br />

ˆKN0F = KN0F − KN0I(K −1<br />

II )KIF<br />

ˆKN0N0<br />

ˆKF N0 = KF N0 − KIF(K −1<br />

II )KN0I<br />

(13)<br />

(14)<br />

(15)<br />

ˆKFF = KFF − KIF(K −1<br />

II )KFI (16)<br />

ˆfN0 = fN0 − KN0I(K −1<br />

II )fI (17)<br />

ˆfF = fF − KFI(K −1<br />

II )fI . (18)<br />

The two main effects <strong>of</strong> the static condensation are:<br />

• The number <strong>of</strong> unknowns is reduced significantly,<br />

as with increasing polynomial degree p only face<br />

and interior unknowns are added.<br />

• The condition number κ <strong>of</strong> the reduced system (12)<br />

is much smaller compared to the one <strong>of</strong> the full<br />

system (10), causing less iterations <strong>of</strong> the CG solver.<br />

In order to solve the reduced system (12), we apply a<br />

Preconditioned Conjugate Gradient (PCG) method. It was<br />

shown in [2], that a α-robust preconditioner C−1 can be<br />

simply defined by a block Jacobian preconditioner (i.e.<br />

an additive Schwarz method, ASM), defined by<br />

<br />

C =<br />

ˆKN0N0<br />

0<br />

0<br />

Kˆ B<br />

FF<br />

<br />

, (19)<br />

where the single blocks are formed as follows:<br />

• ˆ KN0N0 : The lowest order Nédélec functions can be<br />

either solved by a sparse direct solver [10] or by a<br />

suitable iterative method, respecting the Helmholtzdecomposition<br />

<strong>of</strong> the magnetic vector potential (see<br />

e.g. [11]).<br />

: For every face, all unknowns are grouped in<br />

• ˆ KB FF<br />

one block (superscript B). The application <strong>of</strong> the<br />

preconditioner basically is just the inversion <strong>of</strong> the<br />

single face blocks K −1<br />

FF .<br />

IV. ANISOTROPIC BLOCK PRECONDITIONER<br />

If the preconditioner (19) is applied for structures with<br />

a very high aspect ratio (AR), the convergence <strong>of</strong> the<br />

iterative solver deteriorates. This can be explained by an<br />

increase in the condition number κ = λmax/λmin, asthe<br />

entries in the stiffness matrix K scale with 1/h, where h<br />

is the mesh size, leading to strongly coupled entries for<br />

nearly parallel edges / faces and thus to nearly singular<br />

systems with high condition numbers [5].<br />

The idea proposed in [5] is based on a singularity<br />

decomposition technique, where new unknown variables<br />

are introduced and assigned to groups <strong>of</strong> parallel edges<br />

with small distance. However, this method introduces<br />

new matrix entries, as all edges in one group couple via<br />

the auxiliary variable. In addition, the method is only<br />

applicable to 1st order elements, as only edge degrees <strong>of</strong><br />

freedom are considered.<br />

An alternative approach is taken in [4], where a plane<br />

smoother for nodal and edge components <strong>of</strong> the A − φformulation,<br />

respecting the Helmholtz-decomposition, is<br />

applied within a geometric multigrid (MG) solver. Again,<br />

explicit knowledge <strong>of</strong> the anisotropic direction is needed<br />

a priori. In our approach, the idea <strong>of</strong> [4] is extended to<br />

η<br />

ζ<br />

ξ<br />

F3<br />

F2<br />

F1<br />

F8<br />

F7<br />

F6<br />

F5<br />

F4<br />

thin direction(s)<br />

Fig. 6. Thin structure with 2 distinct face groups {F1, F2, F3} and<br />

{F4, ..., F10}, where η is the long direction.<br />

the p-version <strong>of</strong> the FEM. As the lowest order edge contributions<br />

KN0N0 are already solved with a direct solver<br />

and the inner degrees <strong>of</strong> freedom KII get eliminated by<br />

static condensation, only the face contributions ˆ KFF are<br />

affected by the anisotropy. Thus we can modify the initial<br />

face blocks ˆ KB FF <strong>of</strong> the preconditioner matrix C (19) by<br />

grouping all unknowns <strong>of</strong> the faces perpendicular to the<br />

F10<br />

F9


thin direction in one diagonal block ˆ K Bai<br />

FF , if the aspect<br />

ratio <strong>of</strong> the element exceeds a user-defined threshold<br />

ARth. We can even generalize the idea, allowing for two<br />

anisotropic / thin directions within one element, e.g. in<br />

Fig. 6 the faces F4 to F10 couple strongly, as the size in<br />

both, ξ- and ζ-direction, is small compared to the extend<br />

in η-direction. This is especially useful in meshes with<br />

tensor-product structure.<br />

The modified preconditioner matrix is then defined as<br />

<br />

ˆKN0N0 0<br />

C =<br />

0 Kˆ Bai . (20)<br />

FF<br />

The procedure for computing ˆ K Bai<br />

FF without explicit<br />

knowledge <strong>of</strong> the thin direction(s) is sketched in Algorithm<br />

1. It collects strongly coupled (thin) faces in a graph<br />

and determines the blocks <strong>of</strong> ˆ K Bai<br />

FF by calculating the set<br />

<strong>of</strong> connected components <strong>of</strong> it. As the only information<br />

needed for the algorithm is the size <strong>of</strong> the elements in<br />

each direction, the procedure can be applied to general<br />

3-D elements (tetrahedron, wedge, pyramid).<br />

Algorithm 1: Definition <strong>of</strong> anisotropic face blocks.<br />

Input: elements e <strong>of</strong> mesh T<br />

Output: groups <strong>of</strong> thin faces Gi,i=1,...,nG<br />

Data: graph <strong>of</strong> connected anisotr. faces G=(V,E)<br />

foreach e ∈T do<br />

if ARmax(e) ≥ ARth then<br />

compute size <strong>of</strong> element w.r.t. local<br />

directions (hξ,hη,hζ)<br />

hmax = max(hξ,hη,hζ)<br />

foreach d ∈{ξ,η,ζ} do<br />

if hd/hmax ≥ ARth then<br />

get faces F1, F2 perpendicular to<br />

d-direction<br />

insert (F1,F2) in G<br />

nG = # <strong>of</strong> non-connected components <strong>of</strong> G<br />

for i =1to nG do<br />

Gi = connected faces in i-th component <strong>of</strong> G<br />

V. NONLINEAR TWO-STEP STRATEGY<br />

In order to accelerate the resulting Newton scheme,<br />

we utilize a 2-step strategy, motivated by full multigrid<br />

methods (see [6] and [8]). The idea is to compute a<br />

good start approximation for the Newton scheme on a<br />

coarse sub-space T H , which is formed in the p-version<br />

by the lowest order functions KN0N0. The fine space T h<br />

is spanned by the complete set <strong>of</strong> polynomials [7]. Thus<br />

the stiffness matrix K in (10) can be splitted into KH<br />

and Kh as follows<br />

KH = K00<br />

Kh = K =<br />

<br />

= KN0N0 (21)<br />

<br />

. (22)<br />

K00 K01<br />

K10 K11<br />

Due to the hierarchical basis functions, the interpolation<br />

operator Ih H is trivially defined as<br />

Ah =[I, 0] T AH = I h HAH . (23)<br />

- 229 - 15th IGTE Symposium 2012<br />

15 cm<br />

y<br />

z<br />

x<br />

30 cm<br />

0.96 mm<br />

1 layer<br />

air gaps<br />

(exploded view)<br />

Fig. 7. Model <strong>of</strong> step-lap core without air domain (scale factor 30 in<br />

thickness direction).<br />

This allows us to perform a 2-step solution strategy as<br />

follows:<br />

1) Solve a few Newton steps on the small coarse<br />

system KN0N0 (21), using a direct solver.<br />

2) Interpolate the coarse space solution AH to the fine<br />

space Ah according to (23).<br />

3) Proceed with the full system Kh (22) using the<br />

solution strategy <strong>of</strong> Section III-B.<br />

VI. APPLICATION: STEP-LAP CORE MODEL<br />

The applicability <strong>of</strong> the method is demonstrated for a<br />

typical 45◦-multi-step-lap joint region <strong>of</strong> a transformer<br />

core (see Fig. 7) with 4 layers <strong>of</strong> steel sheet, each<br />

0.24 mm in thickness, with a step-lap <strong>of</strong> 2 air gaps<br />

(width: 1 mm) in each sheet. The model is discretized<br />

by 2568 hexahedral elements and 3172 nodes. The used<br />

nonlinear B-H curve is the one depicted in Fig. 3. As<br />

excitation, we apply a prescribed flux density B0 =<br />

0.1 − 2.5 Tiny-direction.<br />

Fig. 8. Concentration <strong>of</strong> magnetic flux lines in corner (45 ◦ -view) for<br />

uniform p =0(top) and p =3(bottom) with B0 =1.0 T (scale<br />

factor 30 in thickness direction).<br />

A. Initial Results<br />

Initially, we choose an isotropic polynomial degree<br />

p =0,...,3 and compare the spatial resolution <strong>of</strong> the<br />

magnetic flux density near the air gaps. Here, the Newton<br />

algorithm takes between 3 and 9 iterations.<br />

The results <strong>of</strong> the simulation are visualized on a very<br />

fine postprocessing mesh. In Fig. 8 it is clearly visible,<br />

B 0


Fig. 9. Flux distribution for uniform p =0(top) and p =3(bottom)<br />

with B0 =1.0 T (scale factor 30 in thickness direction).<br />

that the continuation <strong>of</strong> the fluxlines across the air gaps<br />

is poorly approximated for p =0and that the curvature<br />

<strong>of</strong> the streamlines is unphysical between the air gaps. In<br />

contrast, the simulation using p =3resolves accurately<br />

the flux concentration above and below the air gaps<br />

(depicted in red). The same observation holds true for<br />

the absolute value <strong>of</strong> the magnetic flux density in Fig.<br />

9, where the flux concentration between the air gaps is<br />

smeared over a large area for p =0.<br />

From Fig. 10 we deduce, that the iterative method is<br />

by a factor 2 to 10 slower compared to the direct method<br />

for all excitation values B0. In contrast, the memory<br />

consumption is only about 50% compared to the direct<br />

one for higher polynomial degrees p, as seen in Table I<br />

(SC denotes the use <strong>of</strong> static condensation).<br />

Fig. 10. Simulation time for direct and iterative solution approach<br />

without anisotropic block preconditioner.<br />

TABLE I<br />

MEMORY REQUIREMENT AND DOFS FOR DIFFERENT POLYNOMIAL<br />

DEGREES (SC: STATIC CONDENSATION)<br />

Polynomial Degree piso<br />

0 1 2 3<br />

# Total DOFs 7824 43956 141840 332292<br />

# Inner DOFs - 12840 71904 208008<br />

Memory Usage (GB)<br />

Direct Solver 0.24 0.39 1.12 3.61<br />

Direct Solver (SC) 0.24 0.37 0.92 2.32<br />

Iterative 1-step (SC) 0.24 0.28 0.53 1.61<br />

- 230 - 15th IGTE Symposium 2012<br />

B. Use <strong>of</strong> Anisotropic Block Preconditioner<br />

The poor runtime performance <strong>of</strong> the iterative 1-step<br />

scheme in Fig. 10 can be explained by the extremely<br />

high aspect ratios up to 1:1000 in Fig. 11: All elements<br />

within the steel sheets have aspect ratios higher than<br />

1:400, leading to over 3000 CG iterations on average.<br />

Fig. 11. Aspect ratio <strong>of</strong> step-lap setup (not shown for elements in air).<br />

If we utilize the anisotropic preconditioner<br />

(20) for varying aspect ratio thresholds<br />

ARth = {1000, 500, 100, 50, 10} the iteration numbers<br />

and time for solving the linear equation system drops<br />

significantly (see Fig. 12). The results are compared for<br />

the iterative 1-step solver with B0 = 1.0 T. From an<br />

Fig. 12. Reduction <strong>of</strong> CG iterations (left) and solution time (right)<br />

for varying aspect ratio threshold ARth.<br />

initial CG iteration count <strong>of</strong> about 3000 (ARth = 1000)<br />

we achieve an average reduction to 100-150 iterations<br />

for ARth = 10, corresponding to a factor <strong>of</strong> 20-30,<br />

depending on the polynomial degree.<br />

The effect on the solution time is similar, where<br />

a reduction by a factor <strong>of</strong> 9 (p=3) to 25 (p=1) can<br />

be achieved, making it comparable in runtime to the<br />

direct solver. The increase in memory for storing larger<br />

diagonal blocks is very moderate, being in the range <strong>of</strong><br />

5-15% compared to the non-blocked version.<br />

The rate <strong>of</strong> reduction in iterations is not heavily<br />

depending <strong>of</strong> the polynomial degree, making the preconditioner<br />

a p-robust method for practical applications. For<br />

all the following results, we apply the preconditioner with<br />

a default threshold <strong>of</strong> ARth =10.


C. Application <strong>of</strong> Two-Step Approach<br />

By applying the 2-step strategy <strong>of</strong> Section V, an<br />

additional decrease in runtime can be observed (see Fig.<br />

13). We start here with 2 Newton iterations on the coarse<br />

Fig. 13. Runtime comparison <strong>of</strong> standard 1-step iterative solution<br />

approach and 2-step approach.<br />

space T H with p =0and use it as start value for the<br />

fine space T h . On average, this saves 1 to 2 Newton<br />

iterations on the fine space, resulting in a reduction <strong>of</strong><br />

runtime between 4% and 48%, which is in a similar range<br />

as reported in [8]. However, the effect diminishes with<br />

higher flux values for all polynomial degrees.<br />

D. Anisotropic Polynomial Degree<br />

Finally, we utilize the strategy as explained in Section<br />

III-A by reducing the polynomial degree anisotropically<br />

in thickness direction pζ


- 232 - 15th IGTE Symposium 2012<br />

Parameter Identification <strong>of</strong> a Finite Element<br />

Based Model <strong>of</strong> Wound Rotor Induction Machines<br />

*Martin Mohr, *Oszkár Bíró, *Andrej Stermecki and † Franz Diwoky<br />

* Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines at the<br />

Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />

† AVL List GmbH, Hans-List-Platz 1, A-8020 <strong>Graz</strong>, Austria<br />

E-mail: martin.mohr@TU<strong>Graz</strong>.at<br />

Abstract—This paper presents two efficient algorithms for the parameter identification in a finite element based circuit model<br />

<strong>of</strong> a wound rotor induction machine. This approach uses magneto-static finite element method simulations for building lookup<br />

tables and employs quint-cubic splines for the interpolation. A reduction <strong>of</strong> the finite element simulation cost is achieved by<br />

decreasing the number <strong>of</strong> nonlinear iterations using a special simulation order keeping the magneto-motive force in the<br />

machine for several sampling points constant. Furthermore, the quint-cubic spline parameter calculation method has been<br />

revised using a dimensional recursive evaluation approach allowing a fast parameter calculation with lower memory demand<br />

and <strong>of</strong>fering possibility <strong>of</strong> parallelization.<br />

Index Terms— Finite element methods, motor drives, numerical models.<br />

I. INTRODUCTION<br />

Finite element (FE) based models, in particular<br />

physical phase variable (PPV) models, use look-up tables<br />

(LUT) generated by magneto-static finite element method<br />

(FEM) simulations. During the following transient<br />

simulations, only the evaluation <strong>of</strong> these LUTs by an<br />

appropriate interpolation method is needed. Several<br />

applications <strong>of</strong> this approach to electrical machines have<br />

already been published [1]-[8].<br />

For permanent magnet machines, this approach is<br />

straightforward [1]-[5]. Typically three independent state<br />

variables are sufficient for prescribing the state <strong>of</strong> the<br />

machine. Therefore, no more than a few hundred or<br />

thousand FEM simulations are needed and a three<br />

variable interpolation method is sufficient.<br />

In contrast, wound rotor induction machines (WRIM)<br />

necessitate a higher number <strong>of</strong> state variables. This is a<br />

consequence <strong>of</strong> the second coil system on the rotor. The<br />

number <strong>of</strong> simulations Ntot increases exponentially with<br />

the number <strong>of</strong> state variables:<br />

Ntot Ni<br />

(1)<br />

states<br />

where Ni is the number <strong>of</strong> sampling points for the i th state<br />

variable. Furthermore, a higher dimensional interpolation<br />

method is necessary. Nevertheless, several<br />

implementations <strong>of</strong> FE-based WRIM models are found in<br />

the literature [6]-[8].<br />

This work refers to the PPV-model <strong>of</strong> a WRIM with<br />

stranded coils introduced in [8] and can be perceived as<br />

an addendum to it. Since the parameter identification has<br />

not been treated in [8], this paper presents the practical<br />

application <strong>of</strong> this model.<br />

The drawbacks <strong>of</strong> the PPV-WRIM model are the very<br />

high number <strong>of</strong> needed FEM-simulations and the<br />

demanding quint-cubic spline parameter calculation. Both<br />

are caused by the number <strong>of</strong> state variables <strong>of</strong> the model<br />

being five as shown in the next section.<br />

II. INTRODUCTION TO THE PPV-WRIM MODEL<br />

This model uses the electrical rotor position Rot and<br />

S S<br />

two transformed currents for each <strong>of</strong> the stator i, i and<br />

R R<br />

the rotor i , i as state variables. This reduction can be<br />

done under the assumption that the rotor and stator<br />

systems are in Y-connection with isolated star point.<br />

The current state variables used are the phase currents<br />

S S S<br />

R R R<br />

<strong>of</strong> the stator iA, iB, i C and rotor iA, iB, i C transformed<br />

into the rotor related reference system:<br />

S S S R R R<br />

iA iB iC 0, iA iB iC 0,<br />

S 1 0 S<br />

i <br />

<br />

<br />

cosRot sin <br />

Rot 2 i<br />

1 2 A<br />

<br />

<br />

S , S<br />

i<br />

<br />

sinRot cosRot 3 i<br />

(2)<br />

<br />

<br />

<br />

<br />

B <br />

3 3<br />

R 1 0 R<br />

i <br />

<br />

2 i<br />

1 2 A<br />

<br />

<br />

R R .<br />

i<br />

<br />

3 i<br />

<br />

<br />

<br />

B <br />

3 3<br />

The rotor position is derived from the mechanical<br />

model and is an input variable for the electrical machine<br />

model.<br />

The used LUTs <strong>of</strong> this model approach contain the<br />

S S S<br />

phase flux linkages <strong>of</strong> the stator A, B, C<br />

and <strong>of</strong> the<br />

R R R<br />

rotor A, B , C<br />

for the electrical system <strong>of</strong> equations as<br />

well as the machine torque T.<br />

All these quantities are parameterized by the five state<br />

variables:<br />

S S R R<br />

LUT f Rot , i, i, i , i<br />

.<br />

(3)<br />

In contrast to [6], the total machine torque T is also<br />

included in the LUT.<br />

The electrical system <strong>of</strong> equations uses the line to line<br />

quantities defined as<br />

S S S S S S S S S S S S<br />

vAB: vBvA, vBC : vCvB, iAB : iBiA, iBC : iCiB R R R R R R R R R R R R<br />

vAB: vBvA, vBC : vCvB, iAB : iBiA, iBC: iC iB<br />

(4)<br />

S S S<br />

with the phase voltages <strong>of</strong> the stator v , v , v and <strong>of</strong> the<br />

A B C


R R R<br />

rotor vA, vB, v C . This system can be rewritten as<br />

<br />

S S S S S <br />

AB AB AB AB <br />

Rot <br />

AB <br />

<br />

dt <br />

Rot iS iS iR i<br />

R<br />

S<br />

di<br />

<br />

S<br />

S S<br />

S S S S S<br />

<br />

v <br />

AB RCUi AB<br />

BC BC BC BC <br />

<br />

BC<br />

<br />

S <br />

dt<br />

<br />

S S<br />

v <br />

BC RCUi <br />

<br />

BC Rot iS iS iR i<br />

<br />

<br />

<br />

S <br />

R<br />

<br />

di <br />

<br />

R R R R<br />

v <br />

AB RCUi<br />

<br />

R R R R<br />

<br />

<br />

AB <br />

<br />

<br />

(5)<br />

AB AB AB AB AB<br />

R R R <br />

<br />

dt<br />

v BC RCUi R<br />

BC <br />

<br />

Rot<br />

iS iS iR i<br />

<br />

R<br />

di R R R R R <br />

<br />

BC BC BC BC BC dt R<br />

<br />

di<br />

Rot i S i S i R i <br />

<br />

R<br />

dt <br />

S<br />

with the stator phase resistance R CU and the rotor phase<br />

R<br />

resistance R CU .<br />

For the evaluation <strong>of</strong> the function values as well as the<br />

partial derivatives, a quint-cubic spline interpolation has<br />

been suggested in [8]. Furthermore, it has been shown<br />

that this model is equivalent to a FEM model in all but<br />

interpolation errors. However, the parameter<br />

identification process has not been discussed there.<br />

In this work, an improved simulation workflow for the<br />

needed FEM simulations is proposed and presented.<br />

Furthermore, a revised quint-cubic spline parameter<br />

calculation method is introduced in detail.<br />

III. IMPROVED FEM SIMULATION WORKFLOW<br />

The motivation for an improved simulation workflow is<br />

the circumstance that the number <strong>of</strong> magneto-static FEM<br />

simulations needed to achieve an accurate interpolation is<br />

higher than in usual applications. However, there are only<br />

slight changes in the input data <strong>of</strong> these simulations.<br />

Utilizing this circumstance, a reduction <strong>of</strong> the FEM<br />

simulation time can be achieved as shown below.<br />

A. Finite Element WRIM Model for Parameter<br />

Identification<br />

The FEM model <strong>of</strong> the WRIM under investigation is<br />

shown in Fig. 1. This three phase machine has three<br />

magnetic pole pairs. The rotor and stator coil systems are<br />

in Y-connection with isolated star points. Furthermore, all<br />

coils are modeled as stranded coils, skin effect is not<br />

considered. Due to the symmetry <strong>of</strong> the machine, only a<br />

third <strong>of</strong> the geometry is modeled, decreasing the number<br />

<strong>of</strong> finite elements and thus the simulation time. Periodic<br />

boundary conditions are used at the cutting planes. Rotor<br />

and stator are coupled with constraint equations, thus no<br />

conforming mesh is needed, and the rotation <strong>of</strong> the rotor<br />

is taken into account [9].<br />

The number <strong>of</strong> finite elements is 16128 and the<br />

number <strong>of</strong> DOFs is 48 025. A single magneto-static FEM<br />

simulation with ANSYS 12.1 needs approximately 21<br />

seconds on a computer with Intel Core2 Quad CPU<br />

(Q9400) and 8GB RAM. During this simulation, in<br />

average 20 nonlinear iterations are needed. Simulation<br />

time and number <strong>of</strong> iterations depend on the operating<br />

point.<br />

For a test example <strong>of</strong> nine sampling points for each<br />

state current and 15 different rotor positions, 98 415<br />

- 233 - 15th IGTE Symposium 2012<br />

FEM simulations have to be carried out. The overall<br />

calculation time is approximately 574 hours. In order to<br />

reduce the computation time, the approach described<br />

below and called method <strong>of</strong> constant magneto-motive<br />

force (MMF) is proposed.<br />

Figure 1: FEM model <strong>of</strong> wound rotor induction machine with rotor and<br />

stator coil system.<br />

B. Method <strong>of</strong> Constant Magneto-Motive Force<br />

A reduction <strong>of</strong> the FEM simulation time can be<br />

achieved in general by reducing<br />

o the number <strong>of</strong> finite elements,<br />

o the number <strong>of</strong> simulations or<br />

o the number <strong>of</strong> nonlinear iterations.<br />

A reduction <strong>of</strong> the number <strong>of</strong> elements decreases the<br />

model quality and a reduction <strong>of</strong> the total number <strong>of</strong><br />

simulations decreases the quality <strong>of</strong> the interpolation.<br />

However, a reduction <strong>of</strong> the number <strong>of</strong> nonlinear<br />

iterations reduces the simulation time without decreasing<br />

the model quality. Nevertheless, a direct manipulation <strong>of</strong><br />

this quantity is not possible because it depends on the<br />

nonlinear behavior <strong>of</strong> the material used in the model.<br />

However, the material properties <strong>of</strong> a prior simulation can<br />

be used as initial guess for the new simulation.<br />

This can be done for several magneto-static simulations<br />

although they are independent from each other under the<br />

assumption that the saturation state between these<br />

simulations is approximately the same. This can be<br />

accomplished if the magneto-motive force between<br />

these simulations does not change significantly.<br />

If the same reference system is used for the rotor and<br />

stator current transformation, then can be easily<br />

calculated component-by-component using<br />

S R<br />

Θα i i <br />

α α<br />

= =<br />

NS S + NR<br />

R<br />

, (6)<br />

Θβ iβ iβ <br />

with the number <strong>of</strong> windings on the stator side NS and on<br />

the rotor side NR. So, a constant MMF can be reached by


keeping the MMF components and constant as<br />

shown in Fig.2.<br />

R<br />

<br />

S<br />

<br />

<br />

S<br />

<br />

R<br />

<br />

<br />

<br />

Figure 2: Several parameter setups (different stator and rotor currents)<br />

with constant total magneto-motive force.<br />

This leads to a simple algorithm using a special<br />

simulation order for the different parameter setups.<br />

S<br />

i<br />

<br />

S<br />

i<br />

Rot<br />

R<br />

i<br />

R<br />

i<br />

Figure 3: a) Blockdiagram <strong>of</strong> algorithm with five independent loops for<br />

each state variable. b) Blockdiagram <strong>of</strong> constant MMF algorithm<br />

Figure 3a shows the straightforward approach without<br />

any optimization. All loops are independent, the loop<br />

order is arbitrarily and massive parallelization is possible.<br />

Figure 3b shows the improved constant MMF<br />

algorithm. Instead <strong>of</strong> five independent loops for each state<br />

variable, this algorithm consists <strong>of</strong> two coupled loops for<br />

each <strong>of</strong> the orthogonal - and -components and one<br />

independent loop for the rotor position. The outermost<br />

loops define the MMF-vector component-by-component,<br />

the innermost loops change the rotor and stator current<br />

ratio for the relevant components. By this coupling, the<br />

step sizes <strong>of</strong> the state variables are also coupled and<br />

cannot be chosen freely. Nevertheless, a high number <strong>of</strong><br />

simulations with constant MMF can be carried out.<br />

The variation <strong>of</strong> the rotor position is done between<br />

these loops. This is suggested because the saturation state<br />

between adjacent rotor positions does not change<br />

significantly. However, this loop can be easily made the<br />

outermost one, allowing a parallelization for each rotor<br />

position.<br />

<br />

<br />

Rot<br />

S R<br />

Ni Ni S R <br />

S R<br />

Ni Ni S R <br />

- 234 - 15th IGTE Symposium 2012<br />

A direct comparison between the two algorithms in<br />

Fig. 3 is shown in Fig. 4, highlighting the benefits <strong>of</strong> this<br />

simple algorithm.<br />

Figure 4: Comparison <strong>of</strong> the number <strong>of</strong> nonlinear iterations for 100<br />

magneto-static FEM simulations with and without constant MMF.<br />

The number <strong>of</strong> nonlinear iterations has been reduced<br />

approximately by a factor <strong>of</strong> four, the simulation time by<br />

a factor <strong>of</strong> three. This lower improvement for the<br />

simulation time can be explained by computational costs<br />

during the simulation setup and initialization. For the test<br />

example, the total simulation time has decreased to 190h<br />

from 574h, without any parallelization.<br />

IV. QUINT CUBIC SPLINE PARAMETER CALCULATION<br />

As shown in [8], a quint cubic spline interpolation<br />

method is well suited to be applied in the PPV-WRIM<br />

model, because it allows a continuous interpolation <strong>of</strong> the<br />

function value as well as the first partial derivatives <strong>of</strong> the<br />

function defined by the LUT. Nevertheless, the number <strong>of</strong><br />

parameters per segment is 1024 and this results in a high<br />

memory demand for the LUTs holding these spline<br />

parameters.<br />

For the test example, the memory demand per LUT is<br />

approximately 480MB. Thus, approximately 3.3GB are<br />

needed in total for the PPV-WRIM model. However, this<br />

is not a problem for state <strong>of</strong> the art computers and<br />

moreover it is not necessary to load all data into the<br />

RAM. But the calculation <strong>of</strong> the parameters is very<br />

demanding, since their number is as high as 62.9 millions<br />

(Table III).<br />

Nevertheless, under the assumption <strong>of</strong> a regular and<br />

orthogonal grid and with the continuity conditions for<br />

quint-cubic splines utilized, a fragmentation <strong>of</strong> this<br />

system <strong>of</strong> equations is possible, allowing a fast<br />

calculation <strong>of</strong> the spline parameters with less memory<br />

demand and a parallelization capability. In this section,<br />

this method is presented and explained in detail.<br />

A. Quint-cubic spline interpolation<br />

A quint-cubic spline interpolation is a piecewise third<br />

order polynomial interpolation in five dimensions as<br />

shown in (8). C 1 -continuity (continuity <strong>of</strong> the function<br />

value and its first order partial derivatives) at the segment<br />

boundaries can be forced by the right choice <strong>of</strong> continuity<br />

conditions. Within the segments all derivatives are<br />

continues due to the properties <strong>of</strong> polynomials.<br />

Each segment in this sense is a five dimensional (5D)


hyper-cuboid with 32 corners and 10 adjacent segments<br />

sharing a four dimensional (4D) hyper-cuboid. The<br />

number <strong>of</strong> such segments NSeg depends on the number <strong>of</strong><br />

sampling points <strong>of</strong> each coordinate variable x1, x2, x3, x4,<br />

x5 and can be calculated as<br />

NSeg Ni1. (7)<br />

ix , x , x , x , x <br />

1 2 3 4 5<br />

The quint-cubic spline is defined as<br />

x : x1, x2, x3, x4, x5,<br />

3 3 3 3 3<br />

i j k l m<br />

f xaijklmx 1x2x3x 4x5 ,<br />

(8)<br />

i0 j0 k0 l0 m0<br />

with the spline parameters aijklm <strong>of</strong> the segment fulfilling<br />

L U L U L U<br />

x1 x1 x1 , x2 x2 x2 , x3 x3 x3<br />

,<br />

(9)<br />

L U L U<br />

x4 x4 x4 and x5 x5 x5<br />

,<br />

L L L L L<br />

with the lower segment boundaries x1 , x2, x3, x4, x 5 , the<br />

U U U U U<br />

upper segment boundaries x1 , x2 , x3 , x4 , x 5 and the<br />

local coordinates x1, x2, x3, x4, x5<br />

defined by<br />

L L L<br />

x1 x1x1 , x2 x2x2, x3<br />

x3x3, (10)<br />

L L<br />

x4 x4x4 and x5<br />

x5x5. The first order partial derivative with respect to x1 can<br />

be interpolated with<br />

3 3 3 3 3<br />

f i1j k l m<br />

fx1 ia ijklmx1<br />

xxxx 2 3 4 5 (11)<br />

x1 i1 j0 k0 l0 m0<br />

and in a similar manner those with respect to the other<br />

variables.<br />

Linear extrapolation or a periodic behavior at the<br />

domain boundaries can be easily realized using<br />

appropriate additional constraints [11].<br />

B. Continuity conditions for quint-cubic splines<br />

The choice <strong>of</strong> the continuity conditions for the spline<br />

parameter determination is important for the continuity at<br />

the segment boundaries. C 1 -continuity even at the<br />

segment boundaries can be achieved for tri-cubic splines<br />

if the function value f, the first order partial derivatives<br />

fx1, fx2, fx3, and all higher order pure mixed derivatives<br />

fx1x2, fx1x3, fx2x3 and fx1x2x3 are continuous at the corners <strong>of</strong><br />

each cuboid-seqment. This prerequisite is proven in [10]<br />

for tri-cubic splines.<br />

However, this pro<strong>of</strong> can also be used for higher<br />

dimensional splines. This leads to the conclusion that for<br />

C 1 -continuity, the same prerequisites are necessary. In the<br />

case <strong>of</strong> quint-cubic splines, these are the function value,<br />

the five first order partial derivatives and all 26 higher<br />

order pure mixed derivatives. These are in sum 32<br />

equations per corner and in total 1024 constraints for<br />

1024 parameters per segment. It can easily be proven that<br />

these equations are linear independent.<br />

C. Continuity <strong>of</strong> f, fx1, fx2, fx3, fx4 and fx5 on the segment<br />

faces<br />

For the interpolation <strong>of</strong> the partial derivatives is<br />

necessary to show their continuity also on the segment<br />

faces. Therefore, let us assume a regular and orthogonal<br />

grid for the sampling variables and two adjacent segments<br />

- 235 - 15th IGTE Symposium 2012<br />

S 1 and S 2 that share a face with x1=const. Without loss <strong>of</strong><br />

U<br />

generality, the face defined by x 1 is used for S 1 and for<br />

S 2 L<br />

the face defined by x 1 is used. On both faces, the<br />

quint-cubic splines become the quad-cubic splines f S1 and<br />

f S2 below:<br />

3 3<br />

S1U L<br />

i<br />

j k l m<br />

ijklm 1 1 2 3 4 5<br />

jklm , , , 0 i0<br />

S1 quad-cubic spline parameter bjklm<br />

S2 <br />

3<br />

<br />

jklm , , , 0<br />

j k l m <br />

0jklm 2 3 4 5 with:<br />

S2<br />

jklm 0jklm<br />

3 3<br />

S1U L<br />

i1<br />

j k l m<br />

x1 ijklm 1 1 xxxx 2 3 4 5<br />

jklm , , , 0<br />

i1<br />

S 1<br />

quad-cubic spline<br />

parameter b<br />

jklm<br />

S2 x1 <br />

3<br />

<br />

jklm , , , 0<br />

j k l m<br />

<br />

1jklm 2 3 4 5<br />

S2<br />

with: <br />

jklm 1jklm<br />

f a x x x x x x<br />

f a x x x x b a<br />

f i a x x<br />

f a x x x x b a<br />

(12)<br />

For continuity <strong>of</strong> f, fx2, fx3, fx4 and fx5 at this face, it is<br />

sufficient that the quad-cubic splines f S1 and f S2 are equal.<br />

S1<br />

Therefore the all spline parameters b jklm <strong>of</strong> f S1 must be<br />

S 2<br />

equal to the corresponding parameter b jklm <strong>of</strong> f S2 .<br />

This equality can easily be shown, since both <strong>of</strong> these<br />

quad-cubic splines fulfill the same 16 constraints<br />

f, fx2, fx3, fx4, fx5, fx2x3,..., f x2x3x4x5 in all 16 involved<br />

nodes and these 256 equations for 256 unknowns are<br />

linearly independent. Therefore, the solution is unique<br />

and the two splines are the same.<br />

For continuity <strong>of</strong> fx1 at this face, the two additional<br />

S1<br />

S 2<br />

quad-cubic splines f x1<br />

and f x1<br />

representing the partial<br />

derivative with respect to x1 on this face, must be equal.<br />

This can be proven in a similar manner as for the other<br />

constraints fx1, fx1x2, fx1x3, fx1x4, fx1x5,..., f x1x2x3x4x5 per<br />

node.<br />

Furthermore, the same pro<strong>of</strong> can be used for the other<br />

faces. Thus, C 1 -continuity for the quint-cubic spline<br />

interpolation has been shown for a regular and orthogonal<br />

grid.<br />

D. Segmented parameter calculation<br />

The pro<strong>of</strong> <strong>of</strong> continuity shows that f, fx1, fx2, fx3, fx4, fx5,<br />

fx1x2, fx1x3,…, fx2x3x4x5 and fx1x2x3x4x5 must be continuous in<br />

each node. If all <strong>of</strong> those 32 conditions per node can be<br />

determined in advance, it is not necessary to assemble a<br />

single system <strong>of</strong> equations for all segments. Instead, each<br />

segment could be solved independently. This will lead to<br />

NSeg systems <strong>of</strong> equations with 1024 unknowns each and<br />

furthermore allows parallel evaluation.<br />

Under the assumption <strong>of</strong> a regular and orthogonal grid<br />

for the sampling data, only one parameter changes along<br />

each edge <strong>of</strong> the segments. Therefore, the quint-cubic<br />

spline becomes a normal cubic spline<br />

i<br />

f x aˆ x 1<br />

2, x3, x4, x i x<br />

5const<br />

i0<br />

3 3 3 3<br />

j k l m<br />

i <br />

ijklm 2 3 4 5<br />

j0 k0 l0 m0<br />

with aˆ a xxxx,<br />

3<br />

<br />

(13)


as shown for an edge with constant values for x2, x3, x4<br />

and x5. The resulting cubic splines along the straight lines<br />

with constant values for x2, x3, x4 and x5 could be<br />

evaluated independent <strong>of</strong> each other. This can also be<br />

done for all other straight lines where only one parameter<br />

changes. This leads to a number <strong>of</strong> NCSP simple cubic<br />

splines, where<br />

N N with M : x<br />

, x , x , x , x .<br />

(14)<br />

<br />

CSP j<br />

iM jM/ i<br />

1 2 3 4 5<br />

With these cubic splines, all first order partial<br />

derivatives in each node can be evaluated as for x1 below:<br />

3<br />

i 1<br />

f x i aˆx <br />

. (15)<br />

<br />

x1 1 i 1<br />

i1<br />

These derivatives can be used in a similar manner for<br />

determining all higher order mixed derivatives.<br />

This approach needs a high number <strong>of</strong> cubic spline<br />

determinations, but all these systems <strong>of</strong> equations have<br />

only three unknowns per segment and thus in total<br />

3Nx1 1<br />

unknowns for the cubic splines along the x1direction,<br />

for example. Furthermore, the system matrices<br />

for all splines in one direction are equal. This leads to a<br />

single system <strong>of</strong> equations with many right hand sides that<br />

can be solved very effectively.<br />

E. Dimensional recursive approach<br />

Obviously, not all 1024 parameters per segment are<br />

unknown. For example the constant parameter a00000 <strong>of</strong><br />

each segment is equal to the function value in the segment<br />

L L L L L<br />

reference node x1 , x2, x3, x4, x 5 where all local<br />

coordinates are zero<br />

L L L L L<br />

a00000 f x1, x2, x3, x4, x5<br />

(16)<br />

and all derivatives in the reference node can be directly<br />

identified as parameters <strong>of</strong> the quint-cubic spline<br />

L L L L L<br />

a10000 fx1x1, x2, x3, x4, x5,<br />

L L L L L<br />

a01000 fx2x1, x2, x3, x4, x5,<br />

(17)<br />

Nevertheless, most <strong>of</strong> the parameters per segment must<br />

still be calculated. However, the construction <strong>of</strong> the quintcubic<br />

spline by using a reference node can be easily<br />

combined with the continuity conditions. This leads to a<br />

dimensional recursive approach allowing a well<br />

structured determination <strong>of</strong> all parameters.<br />

There are five faces <strong>of</strong> the segment including the<br />

reference node. For each <strong>of</strong> these faces, one coordinate is<br />

constant zero, resulting in a quad-cubic spline, as shown<br />

for x1=0:<br />

3 3 3 3<br />

j k l m<br />

f 0, x2x3x4x5a 0 jklm x2x3x4x5 . (18)<br />

j0 k0 l0 m0<br />

All parameters <strong>of</strong> this quad-cubic spline are also<br />

parameters <strong>of</strong> the quint-cubic spline as already indicated<br />

by the parameter indices in (18). Furthermore, the first<br />

order partial derivative with respect to x1 for the face<br />

x1=0 leads to a quad-cubic spline<br />

3 3 3 3<br />

j k l m<br />

fx10, x 2x3x4x5a1jklmx 2x3x4x5 . (19)<br />

j0 k0 l0 m0<br />

- 236 - 15th IGTE Symposium 2012<br />

All <strong>of</strong> these parameters can be also identified as quintcubic<br />

spline parameters. Doing this also for the other four<br />

coordinates, leads to ten quad-cubic splines. Each <strong>of</strong> them<br />

is defined by the continuity conditions <strong>of</strong> the<br />

corresponding nodes. Finally, 992 quint-cubic parameters<br />

per segment can be determined by this method, only the<br />

32 parameters aijklm with i, j, k, l, m 2,3 have to be<br />

solved for separately. This is due to the segment diagonal<br />

U U U U U<br />

node x1 , x2 , x3 , x4 , x 5 ,<br />

since this node is not one <strong>of</strong><br />

the determined quad-cubic splines.<br />

For a quad-cubic spline, this approach can be used in<br />

the same manner, leading to eight tri-cubic splines and an<br />

additional system <strong>of</strong> equations <strong>of</strong> 16 unknowns for the<br />

diagonal node per segment. The further dimensional<br />

recursion is shown in Fig. 5.<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Figure 5: Dimensional recursive approach for quint-cubic spline<br />

parameter determination.<br />

It is pointed out that the function calls shown in Fig. 5<br />

are multi-function calls. This means that each call has to<br />

be multiplied with the number <strong>of</strong> sampling points for the<br />

corresponding dimension to get the real number <strong>of</strong> single<br />

function calls. As an example, the number <strong>of</strong> single quadcubic<br />

spline function calls NQ4 can be calculated as<br />

NQ4 2Nx1Nx2 Nx3 Nx4 Nx5,<br />

(20)<br />

i.e. two multi-function calls per dimension.<br />

Furthermore, there is also some redundancy within this<br />

recursion. It is not necessary to call 80 multi-function<br />

calls for tri-cubic splines as Fig. 5 hypothesizes. Some <strong>of</strong><br />

them are equal for different quad-cubic splines. This can<br />

easily be shown by the fact that each quad-cubic spline is<br />

defined by 256 parameters and each <strong>of</strong> them is also a<br />

parameter <strong>of</strong> the quint-cubic spline. However, the ten<br />

quad-cubic splines define only 992 quint-cubic<br />

parameters.<br />

TABLE I<br />

RECURSIVE MULTI-FUNCTION CALLS<br />

QuintQuadTriBi- Cubic<br />

cubiccubiccubiccubic 1 10 40 80 80<br />

Table 1 shows the actual number <strong>of</strong> multi-function


calls during this recursive approach. Table 2 shows an<br />

overview for different sampling rate setups and Table 3<br />

shows the corresponding calculation time and memory<br />

demand for an implementation in Fortran95. These<br />

calculations were carried out without parallelization on a<br />

computer with Intel Core2 Duo CPU (E8400) and 8GB<br />

RAM.<br />

TABLE II<br />

SETUP OVERVIEW FOR VARIOUS SAMPLING RATES<br />

Setup Nx1 Nx2 Nx3 Nx4 Nx5 NSample NSeg<br />

1 6 6 6 6 6 7776 3125<br />

2 8 8 8 8 8 32768 16807<br />

3 11 11 11 11 11 161051 100000<br />

4 12 12 12 12 12 248832 161051<br />

5 16 8 8 8 8 65536 36015<br />

6 16 9 9 9 9 104976 61440<br />

TABLE III<br />

SIMULATION TIME AND MEMORY DEMAND<br />

Setup Number <strong>of</strong> Calculation Memory<br />

Parameters time demand<br />

in Mio [s] [MByte]<br />

1 3.20 3,04 60,8<br />

2 17.21 15,61 131,3<br />

3 102.40 83,80 781,3<br />

4 1649.16 134,53 1258<br />

5 36.88 33,76 281,4<br />

6 62.91 54,39 480,0<br />

These setups illustrate the scaling <strong>of</strong> this approach. A<br />

comparison shows that the calculation time increases<br />

slightly slower than the number <strong>of</strong> segments does.<br />

The last setup corresponds to the test example <strong>of</strong><br />

section III-A. For this example, Nx1=16, although there<br />

are 15 different rotor positions. This can be explained by<br />

the periodicity <strong>of</strong> the rotor position, i.e. the first sampling<br />

point is identical with the last one.<br />

V. CONCLUSION<br />

The method <strong>of</strong> constant magneto-motive force<br />

presented in this paper reduces the number <strong>of</strong> nonlinear<br />

iterations and thus the over-all simulation time for the<br />

FEM simulations. This is achieved by just changing the<br />

order <strong>of</strong> parameter setups and thus makes this method<br />

simply usable in every commercial FEM simulation tool.<br />

The revised quint-cubic spline parameter calculation<br />

method introduced in this work allows a very fast and<br />

memory saving pre-calculation <strong>of</strong> the needed spline<br />

- 237 - 15th IGTE Symposium 2012<br />

parameters. This was achieved by a dimensional recursive<br />

approach that leads to a segmentation <strong>of</strong> the whole system<br />

<strong>of</strong> equations. The parameters can also be evaluated in<br />

parallel.<br />

These two improvements make the PPV-WRIM model<br />

approach introduced in [8] applicable to simulation tasks<br />

during the design stage <strong>of</strong> electrical drive chains.<br />

VI. ACKNOWLEDGEMENT<br />

This work has been supported by the Christian Doppler<br />

Research Association (CDG) and by the industrial partner<br />

AVL List GmbH.<br />

[1]<br />

REFERENCES<br />

Mohammed, O.A.; Liu, S.; Liu, Z.; , "Physical modeling <strong>of</strong><br />

electric machines for motor drive system simulation," Power<br />

Systems Conference and Exposition, 2004. IEEE PES , vol., no.,<br />

pp. 781-786 vol.2, 10-13 Oct. 2004<br />

[2] Liu, Z.; Mohammed, O.A.; Liu, S.; , "An improved physics-based<br />

phase variable model <strong>of</strong> PM synchronous machines obtained<br />

through field computation," Computation in Electromagnetics,<br />

2008. CEM 2008. 2008 IET 7th International Conference on ,<br />

vol., no., pp.166-167, 7-10 April 2008<br />

[3] Kallio, S.; Karttunen, J.; Andriollo, M.; Peltoniemi, P.;<br />

Silventoinen, P.; , "Finite element based phase-variable model in<br />

the analysis <strong>of</strong> double-star permanent magnet synchronous<br />

machines," Power Electronics, Electrical Drives, Automation and<br />

Motion (SPEEDAM), 2012 International Symposium on , vol.,<br />

no., pp.1462-1467, 20-22 June 2012 doi:<br />

[4]<br />

10.1109/SPEEDAM.2012.6264434<br />

Mohammed, O.A.; Liu, S.; Liu, Z.; Khan, A.A.; , "Improved<br />

physics-based permanent magnet synchronous machine model<br />

obtained from field computation," Electric Machines and Drives<br />

Conference, 2009. IEMDC '09. IEEE International , vol., no.,<br />

pp.1088-1093, 3-6 May 2009, doi:<br />

[5]<br />

10.1109/IEMDC.2009.5075339<br />

Mohr, M.; Bíró, O.; Stermecki, A.; Diwoky, F.; ,”An Improved<br />

Physical Phase Variable Model for Permanent Magnet Machines”,<br />

submitted and accepted at ICEM, Marseille, France, 2012<br />

[6] Sarikhani, A.; Mohammed, O.A.; , "Development <strong>of</strong> transient FEphysics-based<br />

model <strong>of</strong> induction for real time integrated drive<br />

simulations," Electric Machines & Drives Conference (IEMDC),<br />

2011 IEEE International , vol., no., pp.687-692, 15-18 May 2011<br />

[7] Sarikhani, A.; Mohammed, O. A.; , "Non-linear FE-based<br />

modeling <strong>of</strong> induction machine for integrataed drives,"<br />

[8]<br />

Computation in Electromagnetics (CEM 2011), IET 8th<br />

International Conference on , vol., no., pp.1-2, 11-14 April 2011<br />

Mohr, M.; Bíró, O.; Stermecki, A.; Diwoky, F.; ,” An Improved<br />

Physical Phase Variable Model for Wound Rotor Induction<br />

Machines”, submitted and accepted at CEFC, Oita, Japan, 2012<br />

[9] ANSYS® Academic Research, Release 12.1, Help System, “Low-<br />

Frequency Electromagnetic Analysis Guide”, ANSYS, Inc.<br />

[10] Lekien, F.; Marsden, J.; , “Tricubic interpolation in three<br />

dimensions,” Internat. J. Numer. Methods Engrg., 63 3 (2005),<br />

pp.455-471<br />

[11] Boor, C.D.; , “A practical guide to splines,” New York: Springer-<br />

Verlag, 1978, p39f., ISBN: 9780387903569


- 238 - 15th IGTE Symposium 2012<br />

Post Insulator Optimization Based on Dynamic<br />

Population Size<br />

Peter Kitak, Arnel Glotic, Igor Ticar<br />

<strong>University</strong> <strong>of</strong> Maribor, Faculty <strong>of</strong> Electrical Engineering and Computer Science, Smetanova 17, SI-2000 Maribor,<br />

Slovenia<br />

E-mail: peter.kitak@uni-mb.si<br />

Abstract—This paper suggests the use <strong>of</strong> dynamic population size throughout the optimization process which is applied on the<br />

numerical model <strong>of</strong> a medium voltage post insulator. The main objective <strong>of</strong> the dynamic population is reducing population<br />

size, to achieve faster convergence. Change <strong>of</strong> population size can be done in any iteration by proposed method. The multiobjective<br />

optimization process is based on the PSO algorithm, which is suitably modified in order to operate with the principle<br />

<strong>of</strong> the optimal Pareto front.<br />

Index Terms—Dynamic population size, Insulation elements, Multi-objective optimization, particle swarm optimization.<br />

reductions in the population size are presented in fifth<br />

section. The sixth section is a conclusion with<br />

fundamental summarizing thoughts.<br />

I. INTRODUCTION<br />

Principal function <strong>of</strong> medium voltage insulator is<br />

electrical insulation <strong>of</strong> the conductive parts from the<br />

earthed parts <strong>of</strong> the device, and the mechanical fixing <strong>of</strong><br />

equipment and conductors which have different electrical<br />

potential. This element <strong>of</strong>ten contains built-in capacitive<br />

voltage divider and thus is capable <strong>of</strong> performing voltage<br />

indication function.<br />

Particle swarm optimization method [1] is a very<br />

efficient algorithm and it is applied onto many<br />

engineering problems. In comparison to the original<br />

version, many modifications have been made to the<br />

algorithm, which has gained many improvements [2], [3].<br />

Also, the PSO algorithm is extended into a multiobjective<br />

particle swarm MOPSO [4], [5], on the basis <strong>of</strong> a nondominant<br />

solution sorting (Pareto concept) [6]. Although,<br />

the PSO algorithm does not contain genetic operators in<br />

its fundamentals, it has been proven that the introduction<br />

<strong>of</strong> the mutation is very useful [7]. The presented method<br />

enables the enhancement <strong>of</strong> the solution space. The more<br />

recent modifications <strong>of</strong> the algorithm introduce the<br />

dynamic population size throughout the optimization<br />

process [8]. A variable [9], [10] or fixed [11] change <strong>of</strong><br />

the population size, during the evolution, is possible. The<br />

method that includes variable changes <strong>of</strong> population size<br />

enables the change in any iteration. The method with<br />

fixed change <strong>of</strong> population is determined with a<br />

predefined step <strong>of</strong> iteration.<br />

Reduction <strong>of</strong> the population is desired, when the lasting<br />

time <strong>of</strong> the optimization needs to be shortened. However,<br />

the efficiency and robustness <strong>of</strong> the modified algorithm<br />

must not change.<br />

The remainder <strong>of</strong> this paper is organized as follows.<br />

Section two describes numerical model <strong>of</strong> medium<br />

voltage post insulator and also a multi-objective<br />

optimization model. Section three presents a classical<br />

Particle Swarm Optimization (PSO) algorithm and also an<br />

updated version <strong>of</strong> such algorithm with the Cauchy<br />

mutation operator. Section four describes a dynamic<br />

population PSO algorithm, where two procedures for<br />

population reduction are presented. Results <strong>of</strong> the<br />

classical optimization PSO algorithm and for all dynamic<br />

II. NUMERICAL MODEL AND OPTIMUM<br />

SIGNIFICATION<br />

Post insulator is used as a voltage indicator in medium<br />

voltage switchgear. Post insulators are installed in a<br />

switchgear device or in any other input element, where<br />

voltage is present. Fig. 1 shows parametrically written 3D<br />

numerical model <strong>of</strong> a post insulator. The metal fitting <strong>of</strong><br />

the insulator (upper connection) for fastening to the<br />

conductive part <strong>of</strong> the upper side, is elongated with a<br />

special electrode <strong>of</strong> the divider, which has the same<br />

potential as the conductive part. The metal fitting for<br />

insulator fastening to the earthed part (lower connection)<br />

is situated at the bottom <strong>of</strong> the insulator. An electrically<br />

separated cylindrical metal mesh is mounted around this<br />

metal connection, which is the other divider’s electrode.<br />

Modeling <strong>of</strong> the mentioned switchgear element<br />

demands design regarding the exact value <strong>of</strong> capacitance,<br />

which is calculated from electric energy. Modeling also<br />

requires the lowest possible magnitude <strong>of</strong> electric field<br />

inside insulation material, because increased values <strong>of</strong><br />

electric field lessen the life-time <strong>of</strong> these switchgear<br />

elements. Optimization provides electric field strength<br />

reduction on electrodes’ edges and on other critical points<br />

on its crossing between insulation and air. Switchgear<br />

stable performance requires that switchgear elements<br />

must be constructed to endure the highest voltages, such<br />

as lightning strike (125 kV).<br />

Requirements described above, which have an opposite<br />

tendency, are presented with two objective functions (fC,<br />

fE). Every objective function presents an individual<br />

electric characteristic with different quantities, therefore<br />

objective functions must be written relatively. Both<br />

objective functions are written with bell shaped fuzzy sets<br />

with the maximal value <strong>of</strong> 1. Function fE is presented in<br />

the Fig. 2a, whereas function fC is in Fig. 2b. The<br />

numerical calculation is based on FEM analysis, with<br />

solver <strong>of</strong> the EleFAnT program package [12].


upper electrode<br />

epoxy insulation<br />

cylindrical metal mesh<br />

lower electrode<br />

Figure 1: Post insulator illustrative model with<br />

optimization parameters.<br />

The entire model is parametrically written and<br />

described with eight parameters (Fig. 1). It is necessary to<br />

perform FEM calculation for each evaluation <strong>of</strong> the<br />

objective functions.<br />

f E<br />

a)<br />

f c<br />

1,2<br />

1<br />

0,8<br />

0,6<br />

0,4<br />

0,2<br />

0<br />

1 2 3 4 5<br />

1,2<br />

1<br />

0,8<br />

0,6<br />

0,4<br />

0,2<br />

p7<br />

p5<br />

p4<br />

p2<br />

E (MV/m)<br />

p8<br />

p6<br />

p3<br />

p1<br />

0<br />

54 59 64 69 74 79 84 89 94<br />

b)<br />

C (μF)<br />

Figure 2: Determination <strong>of</strong> objective function: a) fE, b) fC.<br />

PSO algorithm [1] has been used as a multi-objective<br />

optimization algorithm. Among other methods, the<br />

weighted sum method is used to solve the multiobjective<br />

optimization problem. Equation (1) describes how the<br />

functions fE and fC are merged into a unified objective<br />

function f<br />

f wEfE wCfC (1)<br />

where wE and wC are the weights <strong>of</strong> the individual<br />

quantities.<br />

The weighted sum method requires special attention<br />

when selecting the objective functions weights, which<br />

enable transformation to optimization with a composed<br />

single objective function. This method also requires a<br />

detailed knowledge <strong>of</strong> the applicative problem. Knowing<br />

the presented problem, according to (1), the authors have<br />

selected the following weights: wE = 0.6 and wC = 0.4.<br />

- 239 - 15th IGTE Symposium 2012<br />

III. PSO ALGORITHM AND CAUCHY MUTATION<br />

The PSO algorithm is placed in a population-based<br />

stochastic search technique, that imitates social behavior<br />

<strong>of</strong> the birds while they fly and does not contain genetic<br />

operators. Instead <strong>of</strong> the genetic operators, the population<br />

members are exposed to the cooperation between each<br />

other, at the same time, they compete with each other<br />

throughout generations.<br />

Each and every particle adjusts its flying ability to the<br />

leading particle - the best individual. Each particle <strong>of</strong> the<br />

population, which represents a possible solution to the<br />

problem, is treated as a point in the D-dimensional space.<br />

The i th particle is presented as xi ( xi1, xi2,..., xiD)<br />

. The<br />

best former position (position, which gives the best result<br />

in the previous iteration) <strong>of</strong> the each and every particle is<br />

stored and presented as pi ( pi1, pi2,..., piD)<br />

. The velocity<br />

<strong>of</strong> the i th particle is presented as vi ( vi1, vi2,...., viD)<br />

.<br />

The velocity changes vi and the new position xi <strong>of</strong> the<br />

i th particle changes in accordance with the (2) and (3):<br />

vi( t1) wvi() t c1rand() ( pi() t xi()) t <br />

(2)<br />

c2Rand() ( pg( t) xi(<br />

t))<br />

x( t1) x( t) v( t<br />

1)<br />

(3)<br />

i i i<br />

where t indicates the iteration, c1 and c2 are positive<br />

constants, rand() and Rand() are random functions <strong>of</strong> the<br />

dimension [0,1]. Index g represents the position <strong>of</strong> the<br />

best particle among other particles from the optimization<br />

process. Equation (2) is used for calculation <strong>of</strong> the new<br />

particle velocity on the basis <strong>of</strong> the previous particle<br />

velocity and the distance between its instantaneous<br />

distance and distance <strong>of</strong> the leading particle. Equation (3)<br />

represents a flight <strong>of</strong> the particle towards a new position.<br />

When the new population is entirely formed, the<br />

algorithm is being carried out until the interruption<br />

criterion is reached. Two approaches are used in this<br />

paper: a classical approach with the static population<br />

(number <strong>of</strong> the population members is always the same)<br />

and a dynamic approach, where the population changes<br />

the number <strong>of</strong> members throughout the optimization<br />

process. The quality <strong>of</strong> each particle is evaluated on the<br />

basis <strong>of</strong> the defined objective function.<br />

With the intention to prevent too fast convergence and<br />

consequentially to trap into a local minimum, the classical<br />

PSO algorithm has been upgraded with the Cauchy<br />

mutation [13]. Cauchy mutation operator that is used in<br />

the PSO is determined with a weighted vector.<br />

1 NP<br />

W v , (4)<br />

i ji<br />

NP j1<br />

where vji is velocity <strong>of</strong> vector j th particle in the<br />

population. The best particle is mutated from (2)<br />

according to the following equation:


p () i p () i W N( X , X ) , (5)<br />

'<br />

g g i<br />

min max<br />

where N is Cauchy distribution function over the<br />

interval (Xmin, Xmax).<br />

IV. DYNAMIC POPULATION SIZE IN THE PSO<br />

ALGORITHM<br />

In this paper the optimization procedure considers two<br />

approaches for the dynamic population size, employed in<br />

the multiobjective optimization problem. The first<br />

approach is based on the gradual reduction <strong>of</strong> the<br />

population size by half (subsection 4A). In the second<br />

approach, a dynamic reduction <strong>of</strong> population size is<br />

proposed, which is described in subsection 4B.<br />

A. Gradual reduction <strong>of</strong> the population size by half<br />

The original idea is presented by Brest [11] and<br />

proposes gradual reduction <strong>of</strong> the population size by half<br />

in each block <strong>of</strong> a predefined iteration number. This<br />

means that the reduction is not applied throughout all<br />

iterations. Fig. 3 shows the example where the population<br />

reduction has been carried out four times and the<br />

coefficient that defines the reduction is pmax = 4. In each<br />

reduction step, the population is reduced by a half in<br />

comparison to its former size.<br />

p=1<br />

p=2<br />

p=3 NP/4<br />

p=4 NP/8<br />

NP/2<br />

NP<br />

Figure 3: Schematic presentation <strong>of</strong> the population<br />

reduction.<br />

The stopping criterion for the optimization process is a<br />

predefined number <strong>of</strong> function evaluations maxnfeval. In<br />

relation to the population size reduction, there are two<br />

possibilities to determine the size <strong>of</strong> iteration blocks iterp.<br />

First possibility is an equal number <strong>of</strong> function<br />

evaluations throughout each single iteration block.<br />

Therefore the number <strong>of</strong> iterations is defined as:<br />

maxnfeval<br />

iterp<br />

<br />

pmax NP<br />

p<br />

The second possibility <strong>of</strong>fers a constant number <strong>of</strong><br />

iterations iterp, therefore the number <strong>of</strong> function<br />

evaluations for each reduction block is:<br />

(6)<br />

nfeval NPp iterp<br />

(7)<br />

For easier explanation, the Table I shows values for<br />

number <strong>of</strong> objective function evaluations (NP times iterp).<br />

These values are valid for individual population size<br />

blocks with a constant number <strong>of</strong> iterations iterp = 10 and<br />

the number <strong>of</strong> population reduction pmax = 4.<br />

- 240 - 15th IGTE Symposium 2012<br />

TABLE I<br />

RUN DATA, MAXNFEVAL=1200, PMAX=4<br />

p 1 2 3 4<br />

NP 56 28 14 7<br />

iterp 10 10 10 10<br />

NP x iterp 560 280 140 70<br />

Selection procedure is based on the idea from the<br />

selection mechanism in DE optimization algorithm [11].<br />

Individual from the first half <strong>of</strong> the population xi(t) and<br />

the suitable individual from the second half xNP/2+i(t) are<br />

compared based on the corresponding objective function<br />

values. Afterwards, the individual with a better objective<br />

function value takes the position i and therefore becomes<br />

the member <strong>of</strong> the new and reduced population. After the<br />

last step <strong>of</strong> selection is performed, the new and reduced<br />

population is obtained<br />

xNP/2<br />

i( t) if f( xNP/2i( t)) f( xi( t))<br />

xi( t1)<br />

<br />

xi(<br />

t)<br />

other<br />

B. Dynamic reduction <strong>of</strong> population size in individual<br />

iteration<br />

Alteration <strong>of</strong> population size through an optimization<br />

process is realized based on objective function<br />

evaluations, respectively according to (9):<br />

avr i best<br />

NP() t NPmax<br />

f( xmax<br />

)<br />

(8)<br />

f ( x ) f( x )<br />

, (9)<br />

where the favr(xi) is average objective function value <strong>of</strong><br />

the observed population. f(xbest) and f(xmax) are the<br />

objective function values <strong>of</strong> the best and worst particle<br />

ever found up to the observed iteration. NPmax is initial<br />

(max) population size.<br />

As the optimization algorithm approaches to optimal<br />

solution, the value <strong>of</strong> the objective function alters.<br />

Generally, the population’s average value <strong>of</strong> the objective<br />

function value is getting smaller along with iteration<br />

number. However, this does not hold true for all<br />

iterations, because the particles move also through the<br />

non-promising areas <strong>of</strong> the search space. Because <strong>of</strong> this,<br />

the population size is, according to (9), generally<br />

reducing its size; however there are also iterations, where<br />

the population size has been extended. Each population<br />

extension in individual iterations appear usually when the<br />

average objective function value is increased. The<br />

missing particles are obtained from the set <strong>of</strong> particles<br />

that have been discarded on account <strong>of</strong> population<br />

reduction in previous iterations. This improves the<br />

algorithms reliability.<br />

V. RESULTS<br />

Optimization processes with the PSO algorithm are<br />

performed under the following settings: c1=0.5, c2=1.2,<br />

w=0.8. Maximal number <strong>of</strong> iterations for all calculations<br />

is maxiter = 40.


The optimization results are shown in Table II, where<br />

first two examples are showing results obtained by<br />

standard PSO algorithm and different size <strong>of</strong> population,<br />

third one showing results <strong>of</strong> standard PSO algorithm<br />

upgraded with Cauchy mutation and last two are showing<br />

results obtained by using dynamical population size in<br />

standard PSO algorithm. This paper presented two<br />

different concepts <strong>of</strong> dynamical population size, gradual<br />

reduction and dynamic reduction proposed in section IV.<br />

TABLE II<br />

OPTIMIZATION RESULTS OBTAINED WITH PSO ALGORITHM<br />

description min f NP maxnfeval<br />

standard PSO 0.345 30 1200<br />

standard PSO 0.328 56 2240<br />

standard PSO + mutation 0. 326 56 2240<br />

dyn. populated PSO<br />

0. 326 56/28/14/7 1050<br />

(gradual reduction)<br />

dyn. populated PSO<br />

(proposed reduction)<br />

0. 326<br />

max 56<br />

min 10<br />

829<br />

Optimization process convergences for all mentioned<br />

examples in Table II are shown in Fig. 4.<br />

Objective function value<br />

0,6<br />

0,55<br />

0,5<br />

0,45<br />

0,4<br />

0,35<br />

standard PSO (NP=56)<br />

standard PSO (NP=30)<br />

standard PSO + mutation<br />

dyn. populated PSO (gradual reduction)<br />

dyn. populated PSO (proposed reduction)<br />

0,3<br />

0 10 20<br />

Iteration<br />

30 40<br />

Figure 4: Objective function values <strong>of</strong> PSO algorithm<br />

during the optimization process<br />

Algorithm with using small population size has not<br />

reached global solution, because <strong>of</strong> stuck in local<br />

optimum. Global solution can be reached by increasing<br />

population size which leads increased number <strong>of</strong> function<br />

evaluations and longer computation time. By using<br />

proposed dynamical population size algorithm achieved<br />

global minimum with decreased number <strong>of</strong> function<br />

evaluation and computation time.<br />

Size <strong>of</strong> population<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

Standard procedure with a static population size<br />

Dynamically populated PSO<br />

(proposed reduction)<br />

Dynamically populated PSO<br />

(gradual reduction)<br />

0<br />

0 10 20<br />

Iteration<br />

30 40<br />

Figure 5: Changing <strong>of</strong> population size during the<br />

optimization process.<br />

Changing population size along the optimization process<br />

is shown on Fig. 5 – for the gradual reduction, proposed<br />

reduction and fixed population size (static size).<br />

- 241 - 15th IGTE Symposium 2012<br />

VI. CONCLUSION<br />

Results show comparison <strong>of</strong> the optimization process for<br />

different population size reduction methods. Reduction <strong>of</strong><br />

the population is desirable, when computation time<br />

should be decreased and although efficiency and<br />

robustness <strong>of</strong> algorithm should not be changed.<br />

The important impact <strong>of</strong> proposed PSO algorithm with<br />

using dynamical population size can be seen in decreased<br />

number <strong>of</strong> function evaluation.<br />

In each iteration is tendency to decrease the population<br />

size. However, the population number can be also<br />

increased by adding the new members, which refreshes<br />

the population. Therefore, the algorithm’s ability to<br />

search the minima is increased. It is important, already at<br />

the beginning, to select the appropriate, respectively<br />

enough large population. Therefore, the global search <strong>of</strong><br />

environment is enabled. Smaller population size is<br />

sufficient just for local search solutions.<br />

[1]<br />

REFERENCES<br />

J. Kennedy and R. C. Eberhart, “Particle swarm optimization,”<br />

Proc. IEEE International Conference on Neural Networks, Vol.<br />

IV, Piscataway, NJ, pp. 1942-1948, 1995.<br />

[2] S.L. Ho, Y. Shiyou, Ni Guangzheng and H.C. Wong, “A particle<br />

swarm optimization method with enhanced global search ability<br />

for design optimizations <strong>of</strong> electromagnetic devices,” IEEE Trans.<br />

on Magn., vol. 42, no.4, pp. 1107-1110, 2006.<br />

[3] G. Toscano Pulido and C.A. Coello Coello, “Using clustering<br />

techniques to improve the performance <strong>of</strong> a particle swarm<br />

optimizer,” <strong>Proceedings</strong> <strong>of</strong> Genetic and Evolutionary<br />

[4]<br />

Computation Conference, Seattle, WA, pp. 225-237, 2004.<br />

L. dos Santos Coelho, H.V.H. Ayala and P. Alotto, “A<br />

Multiobjective Gaussian Particle Swarm Approach Applied to<br />

Electromagnetic Optimization,” IEEE Trans. on Magn., vol. 46,<br />

no.8, pp. 3289-3292, 2010.<br />

[5] L. dos Santos Coelho, L.Z. Barbosa and L. Lebensztajn,<br />

“Multiobjective Particle Swarm Approach for the Design <strong>of</strong> a<br />

Brushless DC Wheel Motor,” IEEE Trans. on Magn., vol. 46,<br />

no.8, pp. 2994-2997, 2010.<br />

[6] U. Baumgartner, C. Magele and W. Renhart, “Pareto optimality<br />

and particle swarm optimization,” IEEE Trans. on Magn., vol. 40,<br />

no.2, pp. 1172-1175, 2004.<br />

[7] L. Jize, S. Ping and L. Kejie, “A Modified Particle Swarm<br />

Optimization with Adaptive Selection Operator and Mutation<br />

Operator,” International Conference on Computer Science and<br />

S<strong>of</strong>tware Engineering CSSE 2008, Vol. 1, Wuhan, China, pp.<br />

1199-1202, 2008.<br />

[8] W. F. Leong and G. G. Yen "PSO-based multiobjective<br />

optimization with dynamic population size and adaptive local<br />

archives", IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38,<br />

no. 5, pp.1270 - 1293 , 2008.<br />

[9] M. Greeff and AP. Engelbrecht, “Dynamic multi-objective<br />

optimisation using PSO,” Studies in Computational Intelligence,<br />

(Series Ed: Kacprzyk, Janusz), pp. 105-123, 2010.<br />

[10] G. G. Yen and W. F. Leong "Dynamic multiple swarms in<br />

multiobjective particle swarm optimization", IEEE Trans. Syst.,<br />

Man, Cybern. A, Syst. Humans, vol. 39, p.890 - 911, 2009.<br />

[11] J. Brest and M. Sepesy Maucec, “Population Size Reduction for<br />

the Differential Evolution Algorithm,” Applied Intelligence, 29(3)<br />

pp. 228–247, 2008.<br />

[12] Program tools ELEFANT. <strong>Graz</strong>, Austria: Inst. Fundam. Theory<br />

Elect. Eng., Univ. Technol. <strong>Graz</strong>, 2000.<br />

[13] H. Wang, Y. Liu, S. Zeng, H. Li, and C. Li, "Opposition-based<br />

Particle Swarm Algorithm with Cauchy Mutation", Proc. <strong>of</strong> the<br />

2007 IEEE Congress on Evolutionary Compulation, 2007, pp.<br />

4750-4756.


- 242 - 15th IGTE Symposium 2012<br />

Simulation <strong>of</strong> the Absorbing Clamp Method for<br />

Optimizing the Shielding <strong>of</strong> Power Cables<br />

Szabolcs Gyimóthy∗ ,József Pávó∗ ,Péter Kis † , Tomoaki Toratani ‡ , Ryuichi Katsumi ‡ and Gábor Varga∗ ∗Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics, Egry J. u. 18, H-1111 Budapest, Hungary<br />

† Furukawa Electric Institute <strong>of</strong> <strong>Technology</strong>, Vasgolyó u. 2-4, H-1158 Budapest, Hungary<br />

‡ Furukawa Electric R&D Center for Automotive Systems & Devices, Hiratsuka, Japan<br />

E-mail: gyimothy@evt.bme.hu<br />

Abstract—An efficient numerical simulation tool based on FEM is proposed, by which the EMC shielding effect<br />

characteristics <strong>of</strong> power cables can be predicted in the 30–1000 MHz frequency range, as if it would be measured by the<br />

Absorbing Clamp Method. The proposed simulation method is based on decomposition: a 2D axisymmetric RF FE model<br />

is used for describing the whole measurement setup, while a 3D quasistatic FE model is used for the symmetry cell <strong>of</strong> the<br />

shielding layer in order to capture the effect <strong>of</strong> its fine geometric details. The two models are coupled via the concept <strong>of</strong><br />

the equivalent shielding layer obtained by homogenization. Comparison with real measurements show that the shielding<br />

characteristics can be reliably predicted this way, with some deviation in the low end <strong>of</strong> the frequency range though. This<br />

simulation tool can be applied in the design and optimization <strong>of</strong> braided cable shields to be used in automotive industry.<br />

Index Terms—EMC testing, cable shielding, homogenization, automotive industry<br />

I. INTRODUCTION<br />

The effective reduction <strong>of</strong> emitted radio frequency (RF)<br />

disturbances in electric vehicles –generated mainly by<br />

power semiconductors having high slew rates– becomes<br />

more and more important nowadays [1]. Partly for<br />

this reason, the network configuration <strong>of</strong> cars is being<br />

changed from unshielded single core multi wire harnesses<br />

into coaxial conductor layouts. Consequently, optimization<br />

<strong>of</strong> the shielding <strong>of</strong> such cables is a current topic.<br />

The absorbing clamp method (ACM) is a well known<br />

technique for measuring electromagnetic interference<br />

(EMI) generated by electric cables in the range 30–<br />

1000 MHz [2]. Nowadays it is commonly used in the<br />

automotive industry for testing electromagnetic compatibility<br />

(EMC) <strong>of</strong> braided shields and connectors <strong>of</strong><br />

wire harnesses [3]. The measurement set-up is shown in<br />

Fig. 1. The central element <strong>of</strong> the device is the “clamp”<br />

consisting <strong>of</strong> a set <strong>of</strong> split lossy ferrite rings (see Fig. 2)<br />

and a sensing loop [4]. The shielding effect (SE) to be<br />

measured is –in essence– the ratio <strong>of</strong> the output signal<br />

<strong>of</strong> the unshielded cable to that <strong>of</strong> the shielded.<br />

This method became de facto an industry standard in<br />

many areas. For instance –in addition to RF compliance<br />

testing <strong>of</strong> cables– it is already used for the quantitative<br />

Fig. 1. ACM device for measuring the efficiency <strong>of</strong> cable shielding.<br />

evaluation and comparison <strong>of</strong> the performance <strong>of</strong> shields<br />

through their measured SE characteristics. Therefore,<br />

although there are other practical parameters by which<br />

one may characterize and optimize a shielding, the design<br />

cycle would be much more efficient if the SE curve <strong>of</strong><br />

a shield prototype (as taken by ACM) could be directly<br />

predicted by numerical simulation.<br />

ACM is simple and relatively cheap, but it was not<br />

developed –in the late 60’s– with numerical modeling<br />

in mind. Although there are some theoretical studies on<br />

its operation [5], they are far from being applicable for<br />

quantitative prediction. Actually, the simulation <strong>of</strong> this<br />

measurement is quite challenging from the numerical<br />

point <strong>of</strong> view: a) the set-up consists <strong>of</strong> several components,<br />

among others ferrite; b) a wide frequency range is<br />

studied; c) the arrangement is “large” but has some very<br />

fine details.<br />

Several types <strong>of</strong> numerical models have been worked<br />

out to simulate this measurement, based on e.g. coupled<br />

transmission line system (TLS), finite element method<br />

(FEM) and method <strong>of</strong> moments (MoM), with little success<br />

though [6][7]. Surprisingly, none <strong>of</strong> them was able<br />

to catch even the main tendency <strong>of</strong> the characteristics.<br />

Anyone might conclude from the physical picture that<br />

the shielding effect gets better at higher frequencies,<br />

and actually the above mentioned computing models<br />

just confirmed this behavior. However, the real ACM<br />

Fig. 2. Absorbing clamp <strong>of</strong> type R&S R○ MDS-21 [4].


measurement <strong>of</strong> a typical braided cable shield usually<br />

results in a decaying SE characteristics as frequency<br />

increases (c.f. Fig. 13). Whether this behavior is intrinsic<br />

to the shielding, or rather just a “side-effect” <strong>of</strong> the<br />

measurement method, was a question.<br />

II. SUMMARY OF THE MODELING APPROACH<br />

It was realized that detailed 3D modeling <strong>of</strong> the measurements<br />

is not doable because <strong>of</strong> the enormous computational<br />

resources needed for correct analysis <strong>of</strong> the<br />

complicated arrangement.<br />

Our idea is to use decomposition and homogenization.<br />

Different models are used for the braided shield details<br />

and the overall set-up, respectively. The ACM measurement<br />

at higher frequencies tend to show little dependence<br />

on the larger environment (e.g. support, ground, walls,<br />

etc.). This suggests the use <strong>of</strong> a simplified axisymmetric<br />

2D finite element (FE) model <strong>of</strong> the arrangement, which<br />

can be analyzed efficiently.<br />

On the other hand, realistic cable shields do not<br />

exhibit such symmetry. To overcome this difficulty, we<br />

introduced the concept <strong>of</strong> equivalent homogeneous (bulk)<br />

shielding layer that may have frequency dependent complex<br />

conductivity, such that its shielding characteristics<br />

approximates that <strong>of</strong> the original braided shield. Also a<br />

method was developed by which the equivalent conductivity<br />

parameter can be identified. Although this latter<br />

requires a 3D FE model <strong>of</strong> the shield, the domain <strong>of</strong><br />

computation extends to only a single symmetry cell <strong>of</strong> the<br />

geometry. As a result, this 3D analysis is manageable in a<br />

relatively moderate computing environment, too. Putting<br />

the equivalent shield with its properly selected parameters<br />

into the 2D model <strong>of</strong> the experimental set-up provides the<br />

simulated output <strong>of</strong> the measurement.<br />

III. THE 2D AXISYMMETRIC RF FE MODEL<br />

By omitting the support and the surroundings <strong>of</strong> the<br />

measurement device and taking the two reflector plates as<br />

disc-shaped, we get an axially symmetric arrangement, <strong>of</strong><br />

which a 2D longitudinal section is enough to be considered.<br />

We used the RF Module <strong>of</strong> Comsol Multiphysics R○<br />

in the application mode “Electromagnetic Waves / TM<br />

Waves / Harmonic propagation” and with model space<br />

dimension “Axial symmetry (2D)” [8].<br />

The geometry <strong>of</strong> the 2D model can be seen in Fig. 3.<br />

Cable conductor and reflector plates are both modeled<br />

as perfect electric conductor (PEC) boundary conditions.<br />

The terminal impedance is modeled by means <strong>of</strong> a<br />

50 Ω coaxial cable section with non-reflecting boundary<br />

condition at its end. The arrangement is considered<br />

open in the radial direction, which is modeled by a<br />

cylindrical perfectly matching layer (PML). Finally, the<br />

excitation is realized as a 50 Ω coaxial cable, fed by an<br />

incident coaxial-mode TEM wave, which is prescribed as<br />

“port” boundary condition and characterized by the input<br />

voltage Uin.<br />

- 243 - 15th IGTE Symposium 2012<br />

Fig. 3. The 2D axisymmetric FE model built in Comsol RF.<br />

The output signal is the induced voltage Uout <strong>of</strong> an<br />

assumed loop encircling the first ferrite ring (the one<br />

which is closest to the feed). The transfer function is<br />

defined as the ratio <strong>of</strong> output voltage to input, and<br />

normally its gain k versus the frequency f is used,<br />

k(f) = 20 log10 |Uout/Uin| [dB]. (1)<br />

The shielding effect (SE) <strong>of</strong> a given shield is defined here<br />

as the ratio <strong>of</strong> the output voltage measured for the bare<br />

cable core (i.e. for the cable with its shielding removed)<br />

to that one measured for the shielded cable. Provided the<br />

input voltage is kept fixed, the SE characteristics (given<br />

in dB, as function <strong>of</strong> the frequency) can be expressed as<br />

follows,<br />

SE(f) =kun(f) − ksh(f), (2)<br />

where “un” and “sh” stand for the unshielded and<br />

shielded configurations, respectively.<br />

Figure 4 is a snapshot about the circumferential component<br />

<strong>of</strong> the magnetic field, Hϕ, taken in the unshielded<br />

case. It can be observed how two waves –one along the<br />

line with a longer wavelength and one along the series<br />

<strong>of</strong> ferrite rings with shorter– are coupled with each other.<br />

The damping effect <strong>of</strong> the ferrite is also observable on<br />

the plot. Notably, this FE model performs well even in<br />

the reconstruction <strong>of</strong> the stray field <strong>of</strong> bulk aluminum<br />

shielding layers, for which SE can be as high as 130 dB.<br />

A. Parameter Dependency <strong>of</strong> the 2D Model<br />

We have carefully investigated the sensitivity <strong>of</strong> the<br />

computed results on several model parameters –including<br />

some <strong>of</strong> the numerical implementation– in order to filter<br />

out all possible sources <strong>of</strong> inconsistency between the<br />

model and the real measurement. First comes a list <strong>of</strong><br />

those parameters which have little or no effect: value<br />

<strong>of</strong> the terminal impedance; the implemenation <strong>of</strong> open<br />

boundary (e.g. PML, absorbing boundary condition, etc.);<br />

fluctuation <strong>of</strong> the input voltage.


Fig. 4. Snapsot <strong>of</strong> the circumferential magnetic field.<br />

The latter needs some explanation. Since the device is<br />

not matched with the signal generator source, the voltage<br />

along the feeding cable becomes location dependent due<br />

to reflections, moreover this dependence is function <strong>of</strong> the<br />

frequency. In the FE model we consider a chunk <strong>of</strong> the<br />

feeding cable <strong>of</strong> fixed length, which implicitly defines the<br />

quantity Uin for the model. However, this may not be the<br />

same as its real (measured) counterpart. Fortunately, the<br />

computed SE curves do not show significant dependence<br />

on the “definition” <strong>of</strong> Uin, as simulations testified.<br />

Two factors that really affect the results are the permeability<br />

characteristics <strong>of</strong> the ferrite rings, as well as the<br />

assumption <strong>of</strong> axial symmetry. These are detailed in the<br />

next two subsections.<br />

B. Permeability <strong>of</strong> the Ferrite Rings<br />

The permeability <strong>of</strong> the ferrite material plays a dominant<br />

role in forming the SE characteristics predicted by<br />

the model, hence its accurate description is critical. Since<br />

permeability data for the given absorbing clamp were<br />

not available, we made measurements. The obtained frequency<br />

dependence <strong>of</strong> the complex relative permeability<br />

(real and imaginary parts) can be seen on the graph <strong>of</strong><br />

Fig. 5. Note that the data above 100 MHz are extrapolated<br />

values.<br />

C. Limitations <strong>of</strong> Assuming Axial Symmetry<br />

Considering the field plot in Fig. 4 one can conceive<br />

that the ferrite clamp acts a waveguide and –at higher<br />

- 244 - 15th IGTE Symposium 2012<br />

Relative permeability<br />

800<br />

700<br />

600<br />

500<br />

400<br />

300<br />

200<br />

100<br />

0<br />

10 7<br />

10 8<br />

Frequency (Hz)<br />

real part<br />

negative imaginary part<br />

Fig. 5. Complex relative permeability <strong>of</strong> the ferrite material.<br />

frequencies– can give rise to higher order modes. Although<br />

with the use <strong>of</strong> coaxial feeding the axisymmetric<br />

mode(s) are deemed predominant, non-axisymmetric<br />

modes can appear as well wherever symmetry is violated<br />

in the cross section <strong>of</strong> the device.<br />

For studying this behavior we carried out the mode<br />

analysis for the cross section <strong>of</strong> the device using FEM.<br />

In order to break axial symmetry, a large grounded<br />

conducting plate –being parallel with the axis <strong>of</strong> the cable<br />

under test– was added to the arrangement. In addition<br />

to this, the full 3D FE model <strong>of</strong> the arrangement was<br />

analyzed. Note that an unshielded cable was examined<br />

in both cases for the sake <strong>of</strong> simplicity.<br />

Figure 6 compares the transfer functions obtained from<br />

the full 3D and the 2D axisymmetric model, respectively.<br />

Although several possible propagating modes exist towards<br />

1GHz, there is only a little deviation between the<br />

curves, mostly limited to the lower frequencies. From<br />

this we concluded that the effect <strong>of</strong> higher order nonaxisymmetric<br />

modes is negligible, and that the axisymmetric<br />

model is acceptable within this frequency range.<br />

Fig. 6. Comparison <strong>of</strong> the transfer functions computed by the 2D and<br />

3D FE models, respectively, for the case <strong>of</strong> an unshielded cable.<br />

10 9


IV. HOMOGENIZATION AND EQUIVALENT SHIELDING<br />

The goal is to replace the shield having complex geometry<br />

with a homogeneous cylindrical shielding layer,<br />

which is called hereinafter the “equivalent shield”. The<br />

task is to determine the parameters <strong>of</strong> the latter so that<br />

its shielding effect be the same as if it was measured<br />

on the real cable shielding. Of course, this equivalence<br />

is allowed to satisfy approximately, and only on (and<br />

outside <strong>of</strong>) an observation surface, i.e. at and beyond a<br />

certain distance from the shield (Fig. 7).<br />

3D Observation surface 1D<br />

Real texture<br />

Wire<br />

Shield<br />

Isolation<br />

Fig. 7. Illustration <strong>of</strong> the concept <strong>of</strong> homogenization.<br />

Equivalent<br />

homogeneous material<br />

Figure 8 demonstrates how the roughness <strong>of</strong> the stray<br />

field distribution is smoothing as we increase the distance<br />

taken from the shield. (the radial component <strong>of</strong> the<br />

electric field, Er, is plotted along a line parallel with<br />

the cable, at 1GHz). This smoothing behavior justifies<br />

the use <strong>of</strong> the homogeneous shield as replacement in the<br />

2D model. Note that the shield tested here is not braided<br />

but leaky (c.f. Fig. 10), hence for braided shields one can<br />

expect stronger homogenization effect.<br />

There are several parameters that can be varied in order<br />

to find an equivalence, like for instance the inner and<br />

outer radii <strong>of</strong> the layer, or the specific electric conductivity<br />

and magnetic permeability <strong>of</strong> the material. In order to<br />

simplify and regularize the inverse problem we decided<br />

to keep the geometry fixed, and choose non-magnetic<br />

material. Hence only the complex valued conductivity,<br />

σ, <strong>of</strong> the layer remained to be determined. The proposed<br />

values for the inner and outer radius <strong>of</strong> the equivalent<br />

shield are r1 =10mmand r2 =11mm, respectively.<br />

The observation surface is at the radius robs =23.5mm<br />

which coincides with the inner radius <strong>of</strong> the ferrite rings.<br />

A. Evaluation <strong>of</strong> the Scalar Conductivity Parameter<br />

For those shielding configurations which show certain<br />

symmetry in the ϕ direction, like the one on the left<br />

hand side in Fig. 9, the azimuthal components <strong>of</strong> the<br />

shielding currents are symmetric too. As a consequence,<br />

the axial component <strong>of</strong> the magnetic fields caused by<br />

those currents are compensated, and thereby vanish.<br />

This allows us to investigate only the “axial electric –<br />

azimuthal magnetic” field mode, and hence to describe<br />

the conductivity σ by a complex scalar value.<br />

If the current in the wire is given and the parameter σ is<br />

known, the electric and magnetic fields can be determined<br />

from Maxwell’s equations. Since the problem with homogeneous<br />

shielding is essentially one dimensional (see<br />

- 245 - 15th IGTE Symposium 2012<br />

Fig. 8. Smoothness <strong>of</strong> the field at various distances from the shield.<br />

Fig. 7, right), the solution can be given analytically. We<br />

can describe the electric and magnetic fields, each, by<br />

one single component,<br />

E(r, ϕ, z, t) =Re Ez(r)e −jωt ez (3)<br />

H(r, ϕ, z, t) =Re {Hϕ(r)e −jωt eϕ (4)<br />

where time harmonic fields <strong>of</strong> angular frequency ω are<br />

assumed, and j denotes the imaginary unit. Using the<br />

quasi static approximation in the conducting region,<br />

one can easily derive the following Bessel’s differential<br />

equation from Maxwell’s equations:<br />

d2Ez 1 dEz<br />

+<br />

dr2 r dr − jωμ0σEz =0, r1


• Hϕ is specified at r0 (this is equivalent to prescribing<br />

the total current <strong>of</strong> the wire),<br />

• both Ez and Hϕ are continuous at r1 and r2,<br />

• the asymptotic behavior <strong>of</strong> the fields for r →∞is<br />

known.<br />

We omit further details <strong>of</strong> the solution as they can<br />

be found in several textbooks on electromagnetism [9].<br />

The closed formula expressing the magnetic field was<br />

implemented as a Matlab R○ function, where the input is<br />

the complex conductivity σ, and the output is the complex<br />

amplitude (phasor) <strong>of</strong> Hϕ at r = robs.<br />

We use the following procedure for the evaluation <strong>of</strong><br />

the equivalent conductivity <strong>of</strong> the homogeneous shielding<br />

layer (let us denote it with σeq in the following):<br />

1) We create the 3D FE model <strong>of</strong> the real shielding<br />

together with an exciting wire centered in its axis.<br />

We compute the magnetic field at some selected<br />

frequencies between 30 − 1000 MHz.<br />

2) We take samples <strong>of</strong> the azimuthal magnetic field on<br />

the observation surface, and calculate its average.<br />

This is what we call the “observed magnetic field”<br />

and denote with Hϕ,obs.<br />

3) In an optimization loop we attempt to find σeq<br />

for which Hϕ (a function <strong>of</strong> σ) and Hϕ,obs match<br />

the best (this is carried out for each frequency<br />

separately):<br />

σeq =argmin|Hϕ(σ)<br />

− Hϕ,obs| (8)<br />

σ<br />

Some remarks on the procedure. Notably, in step 1) it<br />

is enough to take one symmetry cell <strong>of</strong> the arrangement<br />

as Fig. 10 demonstrates. We used the AC/DC module<br />

<strong>of</strong> Comsol Multiphysics R○ for computing the fields with<br />

quasi static approximation [10]. In step 2) we also verify<br />

whether the z component magnetic field, Hz itself, as<br />

well as the fluctuation <strong>of</strong> Hϕ on the observation surface<br />

are really negligible. Finally, in step 3) we used the<br />

Matlab R○ function fminsearch for the purpose.<br />

B. Evaluation <strong>of</strong> the Conductivity Tensor<br />

For shielding configurations like the spiral-structure in<br />

Fig. 9 on the right, the above described method is not<br />

suitable because the magnetic field is no longer pure azimuthal.<br />

In this case we can still use the anisotropic conductivity<br />

tensor in the equivalent homogeneous shielding<br />

layer to imitate a similar phenomenon. The conductivity<br />

tensor is assumed to have the following form:<br />

⎡<br />

σ = ⎣ σrr<br />

⎤<br />

0 0<br />

⎦ (9)<br />

0 σϕϕ σϕz<br />

0 σzϕ σzz<br />

That is, the r − ϕ and r − z cross-effects are supposed to<br />

have negligible contribution to the observable magnetic<br />

field. Moreover, σϕz = σzϕ is expected.<br />

To demonstrate the treatment <strong>of</strong> anisotropy we derive<br />

the equations for the field components in the conductive<br />

region. First, the governing Maxwell’s equations are<br />

- 246 - 15th IGTE Symposium 2012<br />

Fig. 10. Symmetry cell <strong>of</strong> the shielded cable used for determining<br />

the σ parameter <strong>of</strong> the equivalent homogeneous shielding layer. This<br />

structure was inspired by leaky coaxial (LCX) cables.<br />

written in the quasi static approximation, in the time<br />

harmonic regime:<br />

∇×H = σE, ∇×E = −jωμ0H (10)<br />

Taking into account the obvious symmetries <strong>of</strong> the configuration<br />

with homogeneous shielding layer, i.e. ∂/∂z =<br />

0 and ∂/∂ϕ =0, the resolution <strong>of</strong> (10) written for the<br />

three cylindrical components is the following:<br />

r : 0 = σrrEr, 0=−jωμ0Hr (11)<br />

⎧<br />

⎪⎨ −<br />

ϕ :<br />

⎪⎩<br />

∂Hz<br />

∂r = σϕϕEϕ + σϕzEz<br />

− ∂Ez<br />

(12)<br />

= −jωμ0Hϕ<br />

⎧<br />

∂r<br />

⎪⎨<br />

1 ∂<br />

z :<br />

r ∂r<br />

⎪⎩<br />

(rHϕ) =σzϕEϕ + σzzEz<br />

1 ∂<br />

r ∂r (rEϕ)<br />

(13)<br />

=−jωμ0Hz<br />

It can be seen that the radial components vanish, and<br />

also that σrr does not play a role. However, there are no<br />

longer separable Hϕ − Ez and Hz − Eϕ modes, as in<br />

the isotropic case. The solution <strong>of</strong> the equations (11-13)<br />

is not easy, but as the domain is one dimensional, fast<br />

solution method may be established. The algorithm given<br />

in section IV-A should slightly be modified here:<br />

1) The same as in the isotropic case, but we have<br />

to carry out the FE analysis for two different<br />

excitations: one in which Hϕ is prescribed at the<br />

wire surface (r = r0), and one in which Hz is<br />

prescribed there (whatever would be the physical<br />

meaning <strong>of</strong> the latter). These two solutions are<br />

marked with ( ′ ) and ( ′′ ) in the following.<br />

2) We take samples <strong>of</strong> the axial and azimuthal<br />

magnetic field on the observation surface from<br />

both FEM solutions, and calculate their average.<br />

This way we obtain the observed magnetic fields,<br />

, H′′<br />

H ′ ϕ,obs<br />

ϕ,obs , H′ z,obs<br />

and H′′<br />

z,obs respectively.


3) We attempt to optimize the components <strong>of</strong> σ as<br />

above. The generalization <strong>of</strong> the scalar case is<br />

straightforward:<br />

σeq =argmin<br />

σ<br />

<br />

H ′<br />

ϕ − H ′ <br />

<br />

ϕ,obs + H ′′<br />

ϕ − H ′′<br />

+ H ′ z − H ′ <br />

<br />

z,obs + H ′′<br />

z − H ′′<br />

V. TEST RESULTS<br />

ϕ,obs<br />

z,obs<br />

<br />

+<br />

<br />

(14)<br />

For testing the method we chose a simple shield structure<br />

(Fig. 11) inspired by leaky coaxial (LCX) cables. The<br />

shield is is made <strong>of</strong> aluminum; the inner radius <strong>of</strong> the<br />

tube is 10 mm; the wall thickness is 1mm. The shield<br />

has circular holes <strong>of</strong> 5mmradius in a regular distribution;<br />

there are 4 holes along the circumference.<br />

Fig. 11. Leaky aluminum shield used for testing the method.<br />

Since the geometry is symmetric with respect to the<br />

azimuthal (ϕ) direction, we are allowed to use the equivalent<br />

scalar conductivity (c.f. section IV-A). By solving<br />

(8) we obtained the σeq curves presented in Fig. 12.<br />

Note that these curves are not the only suitable ones,<br />

because the solution <strong>of</strong> (8) is not unique. However, we<br />

experienced that quite different σeq curves resulted in<br />

the same SE characteristics at the end, so they can be<br />

considered equally good in this respect.<br />

Figure 13 shows the SE curve computed by building<br />

the σeq characteristics <strong>of</strong> Fig. 12 into the 2D axisymmetric<br />

FE model. For comparison, the figure also shows the<br />

curve obtained by real measurement. Obviously, the SE<br />

Fig. 12. The computed equivalent complex conductivity, σeq.<br />

- 247 - 15th IGTE Symposium 2012<br />

Shielding Effect (dB)<br />

60<br />

55<br />

50<br />

45<br />

40<br />

35<br />

30<br />

25<br />

20<br />

15<br />

10 7<br />

10 8<br />

Frequency (Hz)<br />

measured<br />

simulated<br />

Fig. 13. Comparison <strong>of</strong> measured and simulated shielding effects.<br />

characteristics has been reliably predicted by the model,<br />

with some deviation in the low end <strong>of</strong> the frequency<br />

range.<br />

VI. CONCLUSIONS<br />

A numerical simulation method has been elaborated that<br />

can be used to predict the shielding effect <strong>of</strong> shields with<br />

various patterns. Using this tool the designer can predict<br />

the usability <strong>of</strong> a given shield construction. This simulation<br />

method has been thoroughly verified by theoretical<br />

considerations, numerical experiments, and also by some<br />

experimental data. In the authors’ opinion, the error <strong>of</strong><br />

prediction at low frequency might be due to either the<br />

inexact knowledge <strong>of</strong> the ferrite permeability characteristics<br />

or the insufficiency <strong>of</strong> the 2D axisymmetric model.<br />

REFERENCES<br />

[1] M. Reuter, S. Tenbohlen, W. Köhler, and A. Ludwig, “Impedance<br />

analysis <strong>of</strong> automotive high voltage networks for EMC measurements,”<br />

in 10th Int. Symposium on Electromagnetic Compatibility<br />

(EMC Europe), York (UK), 26-30 Sept 2011.<br />

[2] A. Tsaliovich, Electromagnetic Shielding Handbook for Wired and<br />

Wireless EMC Applications, ser. Kluwer international series in<br />

engineering and computer science. Kluwer Academic, 1999. [Online].<br />

Available: http://books.google.hu/books?id=4vl0S6fZo-IC<br />

[3] S. Miyazaki, S. Kihira, and T. Nozaki, “New shielding construction<br />

<strong>of</strong> high-voltage wiring harnesses for Toyota Prius – winning<br />

<strong>of</strong> Toyota Superior Award for cost reduction,” Sumitomo Electric<br />

Industries Technical Review, no. 61, pp. 21–23, Jan 2006.<br />

[4] Rohde & Schwarz R○ MDS-21 Absorbing Clamp – Data<br />

sheet. [Online]. Available: http://www2.rohde-schwarz.com/file/<br />

MDS-21 EZ-24 dat en.pdf<br />

[5] D. Williams and S. Jones, “Time domain characterization and<br />

modelling <strong>of</strong> the absorbing clamp. a device for measuring radiated<br />

radio frequency power,” in Eighth International Conference on<br />

Electromagnetic Compatibility, 21-24 Sept 1992, pp. 149–159.<br />

[6] L. Fejérvári, “Simulation <strong>of</strong> wire harness radiation,” Furukawa<br />

Electric Institute <strong>of</strong> <strong>Technology</strong>, Budapest, Tech. Rep., March<br />

2009.<br />

[7] P. Kis, “Simulation <strong>of</strong> wire harness radiation,” Furukawa Electric<br />

Institute <strong>of</strong> <strong>Technology</strong>, Budapest, Tech. Rep., March 2010.<br />

[8] Comsol Multiphysics RF Module User’s Guide, COMSOL AB,<br />

November 2008.<br />

[9] K. Simonyi, Foundations <strong>of</strong> Electrical Engineering: Fields, Networks,<br />

Waves. London: Pergamon, 1963.<br />

[10] Comsol Multiphysics AC/DC Module User’s Guide, COMSOL<br />

AB, November 2008.<br />

10 9


- 248 - 15th IGTE Symposium 2012<br />

A Neural Network Approach to Sizing an Electrical<br />

Machine<br />

Steven Bielby, David A. Lowther<br />

Electrical and Computer Engineering Department, McGill <strong>University</strong>, 3480 <strong>University</strong> Street, Montreal, Quebec,<br />

Canada. H3A 2A7<br />

E-mail: david.lowther@mcgill.ca<br />

Abstract—The first stage in the design <strong>of</strong> an electrical machine, or an electromagnetic device, is usually referred to as<br />

“sizing”. It produces an approximate description (or design) <strong>of</strong> the desired device in terms <strong>of</strong> its major physical dimensions.<br />

This is traditionally performed using simple magnetic circuits or electric equivalent circuits. This paper proposes an<br />

approach based on a data base <strong>of</strong> device performance data and a neural network to estimate the values <strong>of</strong> the parameters<br />

necessary to provide a complete initial description <strong>of</strong> the device.<br />

Index Terms—Design, electrical machines, sizing, neural networks.<br />

Not only is the speed <strong>of</strong> the process an issue in that the<br />

I. INTRODUCTION<br />

design can take too much time, many <strong>of</strong> the parameters<br />

The design <strong>of</strong> any artifact is a process <strong>of</strong> searching an that are needed for a complete field solution <strong>of</strong> the device<br />

appropriate space at increasing levels <strong>of</strong> complexity. This have little or no effect on the main performance<br />

space is usually referred to as the design space. It is, parameters. For example, the magnitude <strong>of</strong> the torque in<br />

fundamentally, a high dimensional space relating the an electrical machine is dependent on the variation <strong>of</strong> the<br />

performance <strong>of</strong> a device to the values <strong>of</strong> a set <strong>of</strong> stored energy in the airgap <strong>of</strong> the device as the rotor<br />

parameters that describe the structure and physical changes position. The energy, in turn, depends on the<br />

properties <strong>of</strong> the device. The performance, <strong>of</strong> course, fields in the airgap and, to a first approximation, these<br />

might not be a single variable but several. For example, may be related to the current densities and the size <strong>of</strong> the<br />

the cost, size and weight as well as output or functional airgap. This information can be expressed through a basic<br />

capabilities might all be performance variables. The magnetic circuit.<br />

physical parameters could be the actual dimensions Consequently, the process usually employed for<br />

describing the physical structure; the material properties; relatively conventional structures relies on the knowledge<br />

or the external environment including the excitation <strong>of</strong> the designer and simple equivalent circuits for the<br />

sources, the mechanical loads, etc. Thus the design space initial steps. The design starts at an extremely coarse high<br />

can be extremely complex and the design process for a level and works downwards. The equivalent circuit<br />

device needs to be able to balance output objectives and model <strong>of</strong> an electrical machine provides a way <strong>of</strong> relating<br />

input specifications. The difficulty in design is that the electrical, mechanical and, possibly, thermal performance<br />

challenge posed by the specification is, in general, an <strong>of</strong> a device to the values <strong>of</strong> a relatively small number <strong>of</strong><br />

inverse problem, i.e. the designer is asked to produce a components and thus is easy to work with as well as<br />

physical structure which will meet objectives which are providing fast initial iterations. However, it is not always<br />

the input to the design. For example, for an electrical easy to relate the equivalent circuit values to the physical<br />

machine, the design input might well be the torque structures themselves and the knowledge base <strong>of</strong> the<br />

required from the device over a particular speed range. In designer helps to bridge this gap at the early stages <strong>of</strong> the<br />

addition, there might be electrical constraints imposed by design process. Fig. 1 illustrates the hierarchical process<br />

the form <strong>of</strong> the power supply.<br />

involved where the initial search for a design solution<br />

While it might be possible to achieve the design <strong>of</strong> an takes place at a high level with a limited set <strong>of</strong> parameters<br />

electrical machine from first principles, i.e. starting from but a large space to explore. As the design space is<br />

the equations <strong>of</strong> physics and knowledge <strong>of</strong> the properties narrowed down, the number <strong>of</strong> parameters is expanded to<br />

<strong>of</strong> various materials, this involves working at the provide a more detailed examination <strong>of</strong> the local space.<br />

maximum detail <strong>of</strong> the device. To use this approach, the At each level <strong>of</strong> the process, the analysis performed<br />

problem requires the solution <strong>of</strong> the field equations and becomes more complex and, consequently, more<br />

these require that all the physical parameters are expensive. If design is considered to be an optimization<br />

identified and given values. This generates a huge design process, then the phases <strong>of</strong> exploration and exploitation<br />

space to work in. The approach has been demonstrated re-occur at each level. At some point, the “virtual”<br />

on simple models but is computationally extremely (computer based) design is terminated because an<br />

expensive [1]. Searching such a space without an initial increase in the level <strong>of</strong> detail will have no effect on the<br />

idea <strong>of</strong> where a solution might be is extremely slow on<br />

existing computational platforms. In addition, such an<br />

required accuracy <strong>of</strong> the performance parameters .<br />

approach has difficulty including economic,<br />

manufacturing, and other constraints on the design.


Figure 1. Hierarchical Design Process<br />

The starting point at the highest level can, clearly,<br />

affect the convergence and time taken to complete the<br />

process. In an industrial organization, determining the<br />

starting point is performed through one or both <strong>of</strong> two<br />

processes. The first involves a database <strong>of</strong> previous<br />

designs [2], while the second uses simple magnetic<br />

circuit models (mentioned earlier) which vary in their<br />

effectiveness. However, if little or no design experience<br />

exists and a magnetic circuit is ineffective, for example in<br />

the case <strong>of</strong> systems with non-linearity and eddy currents,<br />

these two approaches may fail. The approach suggested<br />

in this paper is to use a neural network to map the desired<br />

machine performance onto a set <strong>of</strong> parameter values for<br />

the device.<br />

II. THE CONCEPT OF SIZING<br />

Following from the above discussion, the first stage in<br />

the design process <strong>of</strong> an electrical machine is to estimate<br />

the approximate size <strong>of</strong> the device needed to meet a set <strong>of</strong><br />

specifications. Traditionally, this is based on a few,<br />

relatively basic, rules. For example, the torque that can be<br />

delivered by an electrical machine is given in Equation 1:<br />

2<br />

T 0.<br />

5D<br />

L.(<br />

B.<br />

J )<br />

(1)<br />

Where D is the outer diameter <strong>of</strong> the rotor, J is the<br />

effective stator current sheet in amps per meter <strong>of</strong><br />

circumference, L is the length <strong>of</strong> the rotor, and B is the<br />

flux density crossing the air gap.<br />

How these quantities are created is the job <strong>of</strong> the rotor<br />

and stator structures. In a simple machine, the flux<br />

density in the airgap can be estimated from a basic<br />

magnetic circuit where the only component that provides<br />

a magnetic “resistance” to the magnetic flux is the airgap.<br />

The benefit <strong>of</strong> this approach is that it can provide<br />

approximate values for many <strong>of</strong> the key parameters in the<br />

machine design and it, in effect, locates the most likely<br />

area in the design search space for a solution. The levels<br />

<strong>of</strong> accuracy needed here are relatively low – the goal is to<br />

find a “ball park” estimate for the parameters so a<br />

solution within 10% or 20% <strong>of</strong> the real answer is good<br />

enough to begin the design process. The issue is speed –<br />

these results can be obtained very quickly and the<br />

exploration <strong>of</strong> a possible design space can be completed<br />

at a much reduced cost.<br />

However, given a need for only an approximate<br />

- 249 - 15th IGTE Symposium 2012<br />

solution but to deliver it extremely quickly, there are<br />

several alternate paradigms which could achieve this on a<br />

relatively unsophisticated computing device. In modern<br />

terms, there are two approaches which can be used. The<br />

first is to generate a surrogate model [3]. This is, in<br />

effect, the approach taken by the equivalent circuit<br />

approach, i.e. the real model is replaced by a simplified<br />

structure which performs in much the same way. The<br />

effectiveness and accuracy <strong>of</strong> the surrogate can be<br />

controlled relatively easily. The relationship between the<br />

output performance and the input parameters is <strong>of</strong>ten<br />

referred to as the “Response Surface” [4] and the<br />

accuracy or fidelity <strong>of</strong> the response surface depends on<br />

the surrogate model chosen. An alternate approach is to<br />

generate a series <strong>of</strong> points on the response surface and<br />

then to develop a curve fitting or interpolation system to<br />

estimate other points on the surface. In this case, the<br />

response surface can be modeled in a way that estimating<br />

a new point and determining the values <strong>of</strong> the input<br />

parameters for this point can be achieved very quickly.<br />

This is a variant <strong>of</strong> the approach suggested in [5].<br />

However, the development <strong>of</strong> the surrogate can be<br />

computationally expensive since full field solutions may<br />

be necessary. The gain, <strong>of</strong> course, over the simple<br />

magnetic circuit approach is that the synthesized initial<br />

design is likely to be much more detailed and to take into<br />

account more <strong>of</strong> the real behavior <strong>of</strong> the device than the<br />

simple assumption <strong>of</strong> perfect magnetic materials and only<br />

an air gap.<br />

The approach being proposed in this paper is a<br />

combination <strong>of</strong> the two “conventional” systems. The<br />

surrogate model is based on a neural network and an<br />

existing database <strong>of</strong> solutions is used to train the<br />

network, thus providing an approximation to the response<br />

surface. Such a system can deliver an initial estimate <strong>of</strong> a<br />

design with minimal computational effort and has the<br />

added advantage <strong>of</strong> improving its capabilities after each<br />

design as the new design can be added to the training<br />

database.<br />

III. NEURAL NETWORKS<br />

A neural network is an interconnected system <strong>of</strong> basic<br />

processing elements [6]. Each element performs a simple<br />

computation based on its inputs and produces an output.<br />

Each neuron “sees” a different weighted set <strong>of</strong> inputs and<br />

the outputs are combined to generate the output <strong>of</strong> the<br />

network. Fig. 2 illustrates the process.


Figure 2. Basic Neural Network Architecture<br />

The architecture shown in Fig.2. is <strong>of</strong>ten referred to as a<br />

“feed-forward network” in that the input data is fed in<br />

one direction through the network and the network<br />

operates synchronously.<br />

The operation <strong>of</strong> the network is controlled through the<br />

values <strong>of</strong> the weights on the neuron inputs and the<br />

combination <strong>of</strong> the neuron outputs. If a vector <strong>of</strong> input<br />

values, representing a point in a multi-dimensional space,<br />

is presented at the inputs, the network will respond with<br />

an appropriate output. In a sense, the network operates<br />

like an active read-only memory. The output from the<br />

memory system being determined not by addressing a<br />

particular location in the memory but based on the<br />

content <strong>of</strong> a memory location. Thus, neural networks are<br />

<strong>of</strong>ten referred to as “auto-associative” or “contentaddressable”<br />

memories. The feed-forward network is<br />

only one <strong>of</strong> several possible neural interconnections and<br />

is interesting in this application because it is able to<br />

interpolate between examples it has been shown.<br />

The process <strong>of</strong> defining the operation <strong>of</strong> a particular<br />

network is known as “training”. In this phase, the<br />

network is provided with a set <strong>of</strong> input vectors and the<br />

output corresponding to each vector. The weights seen by<br />

each neuron and their final combination into the output<br />

are adjusted to minimize the error between the required<br />

output and the one that is actually generated. This can<br />

<strong>of</strong>ten be a difficult process depending on the form <strong>of</strong> the<br />

neurons themselves.<br />

The neurons can be based around several different<br />

functions. Traditionally, neurons have used a paradigm<br />

based on a summing amplifier. Each neuron provides the<br />

weighted sum <strong>of</strong> its inputs which is then processed by a<br />

thresholding function. The outputs <strong>of</strong> all the neurons are<br />

then summed at the output. Each neuron operates over<br />

the entire input space. In this case, a neuron is said to<br />

“fire”, i.e. produce an output, when the weighted sum <strong>of</strong><br />

the inputs exceeds some threshold value. Thus the<br />

evaluation <strong>of</strong> the weights requires the satisfaction <strong>of</strong> a set<br />

<strong>of</strong> inequalities and the solution is non-unique. In<br />

addition, in order to model sophisticated functions,<br />

several layers <strong>of</strong> neurons may be needed and this can lead<br />

to difficulties in the training operation.<br />

- 250 - 15th IGTE Symposium 2012<br />

For the work described in this paper, neurons based<br />

on radial basis functions are used. In this case, the<br />

output function <strong>of</strong> the network is described by:<br />

Where ci represents the center <strong>of</strong> the area <strong>of</strong> interest <strong>of</strong> a<br />

single neuron and x is the position <strong>of</strong> the current input<br />

point in the parameter space being considered. Wi<br />

represents the trained weight <strong>of</strong> neuron i.<br />

The function, , is given by:<br />

2<br />

y<br />

<br />

2<br />

<br />

(3)<br />

(<br />

y) e<br />

Where controls the domain <strong>of</strong> influence <strong>of</strong> the neuron.<br />

The training process can thus determine the values <strong>of</strong><br />

W and for each neuron. Each neuron thus has a local<br />

effect. The determination <strong>of</strong> the weights for a network<br />

based on these functions can be expressed as an<br />

optimization problem and the approach results in a<br />

network that is easier to train.<br />

Once trained, the network can reproduce the examples<br />

that it was shown. However, there is also an emergent<br />

property in that it is able to “generalize”, i.e. it can<br />

generate outputs for input vectors it has not “seen”<br />

before. The process <strong>of</strong> building a neural network can be<br />

considered similar to fitting a surface in a multidimensional<br />

space to a set <strong>of</strong> data points. In fact, there is<br />

some commonality here with methods used in meshless<br />

systems to evaluate field solutions [7]. The network can<br />

function extremely quickly since the individual neuron<br />

operations are computationally simple and it acts as a<br />

look-up table for the unknown surface.<br />

How well the network can match the input data and<br />

corresponding outputs and how good the generalization<br />

capabilities are depends on the network design. The<br />

number <strong>of</strong> neurons can be considered to be similar to the<br />

number <strong>of</strong> basis functions used to represent the surface.<br />

If too few are used, the network will have a problem<br />

training to the presented data with sufficient accuracy; if<br />

too many are used, the network will have difficulty<br />

generalizing and may generated large errors between the<br />

known data points (a sort <strong>of</strong> high frequency oscillation<br />

between the points). For this reason, the training set is<br />

generally split into two pieces: the first is used to train<br />

the network; the second, which has not been seen by the<br />

network during training, is used to test the generalization<br />

capabilities. This can then lead to a higher level process<br />

where the network architecture, i.e. the number <strong>of</strong><br />

neurons used, is modified during the training process to<br />

try to improve the generalization performance.<br />

IV. THE PROPOSED SIZING PROCESS<br />

From the above, the process <strong>of</strong> sizing an electromagnetic<br />

device, in particular, an electrical machine, could be<br />

implemented using a neural network. This is based on the<br />

fact that the process <strong>of</strong> sizing is usually fairly limited, e.g.<br />

(2)


for a specific torque requirement and architecture <strong>of</strong><br />

machine, determine the key diameter values and the air<br />

gap size. If several designs <strong>of</strong> a specific class <strong>of</strong> machine<br />

already exist, then a neural network can be trained on this<br />

data and the generalization capability will allow it to<br />

estimate the “size” <strong>of</strong> the new device. As stated above,<br />

the goal <strong>of</strong> sizing is not to produce a perfect solution to<br />

the design problem, rather it is to get within a reasonable<br />

range in the design space <strong>of</strong> a possible solution. Thus the<br />

system does not have to be highly accurate; an error <strong>of</strong> 10<br />

or 20 percent in the performance <strong>of</strong> the proposed design<br />

is probably acceptable since a conventional optimization<br />

system can take the design from that point to completion.<br />

The process <strong>of</strong> developing a neural network based<br />

sizing system is shown in Fig. 3.<br />

Figure 3. Sizing Network Development Process.<br />

Since the goal <strong>of</strong> the sizing process is to develop an<br />

approximate synthesized prototype, the database shown<br />

in Fig.3 is used primarily to identify the major features<br />

and parameter values. Hence, in fact, a database <strong>of</strong><br />

existing designs is (a) probably too limited and will not<br />

cover the design space particularly well and (b) is<br />

unlikely to be structured to provide the information<br />

needed for sizing. Instead, a more controlled database can<br />

be constructed by using existing analysis programs.<br />

Using this approach, the database can be developed to<br />

provide effective coverage <strong>of</strong> the design space. In<br />

addition, certain parameters <strong>of</strong> the device, e.g. the<br />

- 251 - 15th IGTE Symposium 2012<br />

number <strong>of</strong> poles, the maximum frequency, etc., can be<br />

fixed and thus the network can be trained on a subset <strong>of</strong><br />

the machine design space. This lowers the dimensionality<br />

<strong>of</strong> the space and hence, simplifies the network and<br />

reduces the training time. It also simulates most existing<br />

sizing processes where certain key parameters are set in<br />

the specifications. In the event that these are not set, a<br />

higher level network can be developed to first make the<br />

choice <strong>of</strong> these key parameters before moving into the<br />

sizing process.<br />

V. A SIMPLE SIZING TEST<br />

Given the issues facing machines designers due to the<br />

costs <strong>of</strong> permanent magnets, a possible design scenario<br />

for demonstrating the effectiveness <strong>of</strong> the neural network<br />

approach is the replacement <strong>of</strong> a permanent magnet rotor<br />

with that for an induction machine while keeping the<br />

stator design constant. Thus the goal is to design a rotor<br />

structure that can produce a specific torque-speed<br />

performance. Note that, since the native torque-speed<br />

curve for an induction machine is very different to that<br />

for a permanent magnet design, the substitution is only<br />

possible with the additional use <strong>of</strong> power electronics and<br />

an external control system.<br />

Conventional sizing approaches, which work well for<br />

permanent magnet machines, are not very effective in<br />

dealing with induction machine sizing and somewhat<br />

more sophisticated models based around equivalent<br />

circuits are needed. Thus the induction machine is an<br />

ideal candidate for the process being described in this<br />

paper.<br />

The proposed system was tested on two different rotor<br />

architectures. The first was a drag-cup servo rotor where<br />

the rotor architecture is a conducting (copper) cylinder<br />

around a permeable (iron) core. The design parameters<br />

here are simple: just the thickness and radius <strong>of</strong> the<br />

conducting cylinder. The second design involved a<br />

squirrel-cage rotor which increased both the<br />

electromagnetic complexity <strong>of</strong> the problem and the<br />

number <strong>of</strong> design parameters.<br />

A. The Drag-Cup Rotor<br />

Fig.4 shows the basic design <strong>of</strong> the drag-cup rotor<br />

being considered.<br />

Figure 4. A Drag-Cup Rotor for an Induction Machine.


The typical torque-speed curve for this device is<br />

shown in Fig.5.<br />

Figure 5. Typical Torque-Speed Curve for a Drag-Cup<br />

Rotor.<br />

TABLE II Drag-Cup Rotor from Neural Net (Torque in<br />

Nm)<br />

Test#<br />

TABLE I Drag-Cup Rotor Simulations<br />

Test# Inner Outer Starting Maximum<br />

<br />

Radius<br />

(mm)<br />

Desired<br />

Start<br />

Torque<br />

Radius<br />

(mm)<br />

Desired<br />

Max<br />

Torque<br />

Start<br />

Torque<br />

Torque<br />

(Nm)<br />

Max<br />

Torque<br />

Torque<br />

(Nm)<br />

1 25 27 6.12 17.93<br />

2 25 28 6.74 25.56<br />

3 25 29 6.82 32.84<br />

4 25 30 6.65 39.25<br />

<br />

Averag<br />

eError<br />

1 6.12 17.93 6.52 17.87 3.40%<br />

2 6.74 25.56 7.02 24.85 3.44%<br />

3 6.82 32.84 6.76 31.4 2.57%<br />

4 6.65 39.25 6.58 36.31 4.23%<br />

Table I shows a typical set <strong>of</strong> parameters for the drag-cup<br />

rotor. A large range <strong>of</strong> values over each parameter was<br />

used to generate the training and testing sets for the<br />

neural network and the torque results computed using a<br />

finite element code (MagNet [7]). The network was<br />

constructed and trained using the MatLab Neural<br />

Network toolbox. Once trained, the network predictions<br />

were tested on a set <strong>of</strong> 50 samples. Each sample was also<br />

evaluated using the finite element analysis and the results<br />

were compared. The average error over the whole set was<br />

4%. Table II shows some typical results.<br />

Following on from these results, the complexity was<br />

increased by considering a squirrel-cage rotor, i.e. a<br />

structure consisting <strong>of</strong> a set <strong>of</strong> conducting bars in slots on<br />

the rotor.<br />

- 252 - 15th IGTE Symposium 2012<br />

B. The Squirrel-Cage Rotor.<br />

A range <strong>of</strong> squirrel-cage rotor designs were<br />

constructed to work with a 4 pole, 3 phase stator, shown<br />

in Fig. 6. The variables in the rotor were the number <strong>of</strong><br />

bars, the size <strong>of</strong> the bars and the diameter <strong>of</strong> the rotor.<br />

Fig. 7 shows the architecture <strong>of</strong> the squirrel-cage. A<br />

number <strong>of</strong> combinations <strong>of</strong> these parameters were<br />

produced and the torque-speed curves generated, again<br />

using the MagNet s<strong>of</strong>tware. Results were generated for a<br />

range <strong>of</strong> values <strong>of</strong> each parameter resulting in 144<br />

models in the database Table III shows the parameters<br />

and the ranges used. The number <strong>of</strong> conduction bars was<br />

set to an integer corresponding to the most commonly<br />

used values for a 4 pole system. Each rotor geometry was<br />

simulated for a range <strong>of</strong> frequencies from 0 to 60 Hz and<br />

the starting and peak torques recorded, as well as the<br />

torque-speed curve.<br />

Table III Ranges <strong>of</strong> Parameters for Squirrel-<br />

Cage Rotor<br />

Parameter Minimum Maximum<br />

Radius<strong>of</strong><br />

Conduction<br />

Bars(mm) 0.5 2<br />

Radius<strong>of</strong><br />

Rotor(mm) 28 36<br />

Thenumber<strong>of</strong>conductionbarswassettoone<br />

<strong>of</strong>15,20,30,35<br />

Figure 6. The 4 Pole, 3 Phase Stator Design used with<br />

the Sizing System.<br />

The network was developed following the process<br />

described in Fig. 3 and, once trained, was used to size a<br />

rotor for a particular specification. The neural network<br />

sizing estimates were then compared with an analysis <strong>of</strong><br />

the designed rotor in MagNet and an average error <strong>of</strong><br />

around 9% was generated over all the samples for a<br />

network with 20 neurons. Thus it is reasonable to state


that the proposed system provided a “sizing” estimate for<br />

the rotor design which was within the tolerance expected<br />

at this point in the design process.<br />

As a last test, the network architecture was varied, i.e.<br />

to determine the effect <strong>of</strong> the number <strong>of</strong> neurons on the<br />

error in prediction. The resulting errors are shown in Fig.<br />

8 as a function <strong>of</strong> the number <strong>of</strong> neurons. The data in Fig<br />

8 show the lack <strong>of</strong> approximation capability <strong>of</strong> the<br />

network for low numbers <strong>of</strong> neurons and the inability to<br />

generalize for high numbers. The ideal number for this<br />

problem appeared to be around 20 neurons in the<br />

network. It is not clear what caused the slight increase in<br />

error for a 15 neuron network and this bears further<br />

investigation.<br />

Figure 7. Basic Conductor Layout for a Squirrel Cage<br />

Rotor.<br />

Figure 8. Error between the Finite Element and Neural<br />

Network Solutions against the Number <strong>of</strong> Neurons in the<br />

Network for Starting and Maximum Torques<br />

VI. CONCLUSIONS<br />

The paper has described an approach to developing an<br />

initial prototype <strong>of</strong> an electromagnetic device based on a<br />

limited number <strong>of</strong> specifications. This is conventionally<br />

known as “sizing”. The use <strong>of</strong> a neural network together<br />

with a pre-computed database <strong>of</strong> examples, developed<br />

from a finite element analysis <strong>of</strong> a range <strong>of</strong> devices<br />

covering the design space, has been shown to be effective<br />

in developing an initial solution. The process <strong>of</strong> training<br />

the network is similar to developing the response surface<br />

- 253 - 15th IGTE Symposium 2012<br />

for the particular machine examples. The neural network<br />

acts as a form <strong>of</strong> surrogate but it is capable <strong>of</strong> providing a<br />

solution to the inverse problem unlike the more<br />

conventional usage <strong>of</strong> these techniques where the goal is<br />

to develop an effective forward model. The accuracy <strong>of</strong><br />

the neural network is within the range <strong>of</strong> existing sizing<br />

approaches and can probably be improved with a better<br />

training database.<br />

REFERENCES<br />

[1] Dyck, D.N., Lowther, D.A., “Automated Design <strong>of</strong> Magnetic<br />

Devices by Optimizing Material Distribution,” IEEE Transactions<br />

on Magnetics, Vol.32, 3, 1996, pp. 1188-1193.<br />

[2] Ouyang, J., Lowther, D.A., “A Hybrid Design Model for<br />

Electromagnetic Devices,” IEEE Transactions on Magnetics, Vol.,<br />

45, 3, 2009, pp. 1442-1445.<br />

[3] Hawe, G,I,. Sykulski, J.K., “The Consideration <strong>of</strong> Surrogate<br />

Model Accuracy in Single-Objective Electromagnetic Design<br />

Optimization,” <strong>Proceedings</strong> <strong>of</strong> the 6 th International Conference on<br />

Computational Electromagnetics, 2006, pp.1-2.<br />

[4] Wang, L., Lowther, D.A., ”Reducing the Design Space <strong>of</strong><br />

Standard Electromagnetic Devices using Bayesian Response<br />

Surfaces,” IEEE Transactions on Magnetics, Vol. 46, 2010, pp.<br />

2884-2887.<br />

[5] Hawe, G., Sykulski, J., “Considerations <strong>of</strong> Accuracy and<br />

Uncertainty with Kriging Surrogate Models in Single-Objective<br />

Electromagnetic Optimisation,” IET <strong>Proceedings</strong> on Science,<br />

Education and <strong>Technology</strong>, Vol. 1, 2007, pp.37-47.<br />

[6] Aleksander, I., Morton, H., “An Introduction to Neural<br />

Computing,” London, UK, International Thomson Computer<br />

Press, 1991.<br />

[7] Benbouza, N, Louai, F.Z., Nait-Said, N. “Application <strong>of</strong> Mexhless<br />

Petrov Galerkin (MLPG) Method in Electromagnetics using<br />

Radial Basis Functions,” <strong>Proceedings</strong> <strong>of</strong> the 4 th IET Conference on<br />

Power Electronics, Machines and Drives, 2008, pp. 650-655.<br />

[8] MagNet Users Manual, Infolytica Corporation, 2012.


- 254 - 15th IGTE Symposium 2012<br />

Exploring and Exploiting Parallelism in the Finite<br />

Element Method on Multi-core Processors: an<br />

Overview<br />

Hussein Moghnieh and David A. Lowther<br />

Department <strong>of</strong> Electrical and Computer Engineering, McGill <strong>University</strong> Montreal, Quebec, H3A 2A7, Canada<br />

E-mail: hussein.moghnieh@mail.mcgill.ca<br />

Abstract—Exploring parallelism requires identifying parts <strong>of</strong> a method or a kernel that can run concurrently. Exploiting<br />

parallelism involves utilizing techniques aimed at devising an efficient parallel implementation on a given processor.<br />

Different stages <strong>of</strong> the Finite Element Method have been found to require different approaches to explore and exploit their<br />

parallelism. While data locality is essential to gain performance, many approaches to parallelism have been found to not<br />

exhibit data locality by nature.<br />

Index Terms—Finite Element Method, incomplete Cholesky preconditioner, matrix assembly, mesh generation, multi-core<br />

processor, sparse matrix-vector multiplication.<br />

structure (i.e. maximum number <strong>of</strong> non-zeros per row and<br />

I. INTRODUCTION<br />

the average number <strong>of</strong> non-zeros per row) has been<br />

examined. The resulting matrices are shown in TABLE I.<br />

The matrix naming convention used is an indicator <strong>of</strong> the<br />

problem, element mesh size and the type <strong>of</strong> finite element<br />

formulation applied. For instance, BDC-1-0.07, indicates<br />

that the matrix is generated from the BDC problem, and a<br />

first order (i.e. 1) nodal formulation has been applied on a<br />

mesh where the maximum size <strong>of</strong> any triangular element<br />

is 0.07mm, while BDC-0-1 denotes a matrix that was<br />

assembled by applying an edge element formulation on a<br />

mesh where the maximum size <strong>of</strong> any triangular element<br />

is 1mm.<br />

Further, an initial 3D mesh <strong>of</strong> a transformer (ET)<br />

model has been refined multiple times and a first-order<br />

nodal finite element formulation has been applied on each<br />

<strong>of</strong> the refined meshes. The resulting matrices are denoted<br />

by ET and are shown in TABLE I.<br />

The introduction <strong>of</strong> the multi-core processor by IBM<br />

(i.e. the POWER4) in 2001, and later by Intel and AMD,<br />

has rekindled the interest in using parallel computing to<br />

accelerate computations in an electromagnetic (EM) field<br />

simulation s<strong>of</strong>tware running on a desktop computer.<br />

Since then, a considerable amount <strong>of</strong> research effort has<br />

been invested in investigating the methods and kernels<br />

executed in field simulation s<strong>of</strong>tware; these include mesh<br />

generation, matrix assembly, sparse matrix-vector<br />

multiplication (SMVM) and iterative solver<br />

preconditioning techniques such as incomplete LU<br />

factorization (ILU). Despite having achieved a degree <strong>of</strong><br />

performance gain, several shortcomings have reduced the<br />

effectiveness <strong>of</strong> those techniques in achieving the<br />

ultimate performance goal <strong>of</strong> a field analysis s<strong>of</strong>tware,<br />

which is the reduction <strong>of</strong> the overall time to design a<br />

device. These impediments include the problem size and<br />

structure as well as the architecture <strong>of</strong> the multi-core<br />

processor.<br />

This paper intends to illustrate the degree <strong>of</strong><br />

parallelism which might be expected in each <strong>of</strong> the design<br />

and analysis stages <strong>of</strong> a process based around the finite<br />

element method (FEM), in addition to discussing several<br />

issues and bottlenecks that arise while exploiting<br />

parallelism on a multi-core processor. In particular, it is<br />

intended to examine the gains due to parallelism on<br />

realistic electromagnetic design examples, i.e. a 2D<br />

brushless DC motor model and a 3D transformer model.<br />

II. METHODOLOGY<br />

An initial 2D mesh <strong>of</strong> a brushless DC (BDC) motor<br />

model has been refined multiple times, by setting an<br />

upper limit on the area <strong>of</strong> the elements in each refinement<br />

step, in order to create a range <strong>of</strong> typical meshes and<br />

mesh sizes. Subsequently, first order and second order<br />

nodal formulations, in addition to an edge formulation<br />

have been applied on each mesh and a matrix has been<br />

assembled in each case. The effect <strong>of</strong> applying different<br />

formulations and mesh sizes on the matrix size (i.e.<br />

degrees <strong>of</strong> freedom and number <strong>of</strong> non-zeros) and matrix<br />

TABLE I<br />

MATRICES PROPERTIES<br />

Matrix DOF NNZ Ave.<br />

(Max)<br />

nnz/row<br />

CSR size<br />

(MB)<br />

BDC-1-0.5 38,084 259,188 7 (12) 3.2<br />

BDC-1-0.07 1,194,044 8,334,798 7 (22) 100<br />

BDC-1-0.04 3,152,216 22,000,128 7 (33) 264<br />

BDC-2-3 48,031 407,733 9 (37) 5<br />

BDC-2-0.07 4,787,651 40,664,669 9 (43) 484<br />

BDC-2-0.04 12,660,592 107,560,044 9 (60) 1,280<br />

BDC-0-1 55,772 278,168 5 (5) 3.4<br />

BDC-0-0.07 3,492,389 17,931,763 5 (5) 219<br />

BDC-0-0.04 9,492,389 47,437,511 5 (5) 579<br />

ET-1-0.08 38,234 549,047 15 (36) 6.4<br />

ET-1-0.04 409,531 5,999,230 15 (31) 70<br />

ET-1-0.01 1,975,427 28,927,159 15 (39) 339<br />

Subsequently, the parallel performance and bottlenecks<br />

encountered in an efficient implementation <strong>of</strong> important<br />

FEM kernels, particularly matrix assembly, sparse<br />

matrix-vector multiplication, and preconditioning<br />

techniques based on incomplete LU factorizations, are<br />

investigated.


III. PARALLEL MATRIX ASSEMBLY<br />

The process <strong>of</strong> matrix assembly is not considered to be<br />

time consuming. It is an process, since it consists <strong>of</strong><br />

iterating once over all mesh elements. For each element,<br />

two operations are performed. The first is to approximate<br />

the solution <strong>of</strong> the field within each element which would<br />

result in a dense matrix structure for each mesh<br />

element e where u depends upon the formulation and the<br />

number <strong>of</strong> unknowns in an element. The second operation<br />

is to map each entry <strong>of</strong> the dense matrix to a global<br />

matrix A. The latter step constitutes a significant portion<br />

<strong>of</strong> the total assembly cost mainly because the global<br />

matrix A is sparse. Inserting and updating entries in a<br />

sparse matrix, even when its structure is a priori known,<br />

is not trivial, such is the case when using compressed<br />

sparse row (CSR).<br />

In the case <strong>of</strong> matrix assembly in FEM, the maximum<br />

number <strong>of</strong> non-zeros in any row can be roughly estimated<br />

since it depends on the FEM formulation used. In such a<br />

case, a more suitable choice <strong>of</strong> a sparse storage than the<br />

CSR is to use the ELLPACK sparse storage scheme [1].<br />

The ELLPACK sparse format stores a sparse matrix into two dense data structures<br />

(ELL_values and ELL_column_ind) as shown in Figure 1.<br />

ELL_values stores the values <strong>of</strong> non-zeros in each row in<br />

a condensed form and pads the remaining spaces with<br />

zeros. ELL_column_ind stores the column index <strong>of</strong> each<br />

corresponding non-zero in the ELL_values and “-1” for<br />

the padded non-zeros. The size W corresponds to the<br />

maximum number <strong>of</strong> non-zeros per row. When the<br />

number <strong>of</strong> non-zeros per row is less than W, zeros are<br />

padded to fill the remaining locations.<br />

Mutex<br />

objects<br />

Values per<br />

row counter<br />

00 01<br />

2 00 01 0 0 0 1 1 1<br />

11 14<br />

2 11 14 0 0 1 4 1 1<br />

20 22 25<br />

3 20 22 25<br />

0<br />

0 2 5 1<br />

32 33 35<br />

3 32 33 35<br />

0<br />

2 3 5 1<br />

44 45<br />

2 44 45 0 0 4 5 1 1<br />

50 51 52 55<br />

4 50 51 52 55 0 1 2 5<br />

nxn sparse matrix nxw values nxw<br />

column indices<br />

Figure 1: Synchronized ELLPACK sparse storage.<br />

ELLPACK sparse format<br />

ELL_values ELL_colum_ind<br />

The performance <strong>of</strong> parallel matrix assembly using<br />

atomic operations on multi-core processors has been<br />

investigated. Mutual exclusion (mutex) objects from the<br />

POSIX threads (Pthread) library were used to<br />

synchronize access <strong>of</strong> multiple threads to a shared<br />

resource, which in our case, is the matrix A. For this<br />

purpose, an array <strong>of</strong> mutex objects was created where<br />

each object corresponds to a row in the global matrix as<br />

shown in Figure 1. Typically, in order for a thread to add<br />

or modify entries on a row <strong>of</strong> the global matrix, it must<br />

acquire a lock on the mutex object corresponding to that<br />

row. After the thread finishes its modifications, it releases<br />

the lock to make it available for other threads. For<br />

example, a thread that is assembling an element <strong>of</strong> 3<br />

unknowns (1, 2, 3) must aggregate the total <strong>of</strong> <br />

entries in the global matrix. Each 3 <strong>of</strong> these entries is<br />

added onto the same row <strong>of</strong> the global matrix; hence, a<br />

- 255 - 15th IGTE Symposium 2012<br />

total <strong>of</strong> 3 locks are required on 3 different mutex objects.<br />

This is illustrated in Algorithm 1 (line 6).<br />

Algorithm 1: Parallel matrix assembly using atomic operations.<br />

Figure 2 shows the runtimes in seconds <strong>of</strong> the parallel<br />

assembly <strong>of</strong> 3 matrices using a first-order nodal finite<br />

element formulation on a quad-core Intel i7 processor.<br />

The sequential runtimes are small (a few seconds) despite<br />

the fact that these matrices are considered to be those <strong>of</strong><br />

realistic average size problems. The runtimes were<br />

reduced by more than 50% relative to 1-thread execution<br />

when the number <strong>of</strong> threads was 4. Notice the difference<br />

in runtimes between sequential execution (no<br />

synchronization) shown in horizontal lines and runtimes<br />

<strong>of</strong> 1 thread. This difference highlights the cost <strong>of</strong> calling<br />

the Pthread Application Programming Interface (API)<br />

times. The overhead <strong>of</strong> calling a Pthread API<br />

although it appears to be large in here, is not the main<br />

concern in multi-threaded applications. Instead it is the<br />

wait time that could incur when a thread is waiting for a<br />

mutex object to be released by another thread. In matrix<br />

assembly, this occurs when threads are simultaneously<br />

processing mesh elements that share vertices and edges.<br />

In the case <strong>of</strong> FEM, the possibility <strong>of</strong> threads waiting to<br />

acquire a lock is small since the number <strong>of</strong> shared<br />

vertices or edges is low; it is related to the average<br />

number <strong>of</strong> non-zeros per row.<br />

Figure 2: Parallel matrix assembly timings in seconds on an<br />

Intel quad-core i7-860 processor.


A. Parallel Matrix Assembly Synchronization and<br />

Cache Data Locality<br />

The time it has taken to complete the matrix assembly<br />

process in the previous experiments was very small (only<br />

a few seconds), hence, it was not possible to accurately<br />

measure the total time spent on synchronization (i.e.<br />

calling the Pthread API and waiting to acquire a mutex<br />

lock). Instead, Intel’s VTune Amplifier [2] was used to<br />

count the number <strong>of</strong> execution cycles spent on<br />

synchronization. In the case <strong>of</strong> matrix assembly using 1<br />

thread, this number constituted around 9% <strong>of</strong> the total<br />

cycles spent on matrix assembly (see Figure 3). This<br />

number reflects only the time to call the Pthread API,<br />

since there was no time or cycles wasted waiting to<br />

acquire a lock (no other threads were competing to<br />

acquire a lock). When using 4 threads, more cycles were<br />

halted during synchronization, and in this case the<br />

percentage <strong>of</strong> time wasted increased to 24% (see Figure<br />

4).<br />

91%<br />

Matrix assembly execution cycles<br />

Pthreads Lock / Unlock execution cycles<br />

Figure 3: Execution cycles <strong>of</strong> matrix assembly using 1 thread.<br />

76%<br />

Figure 4: Execution cycles <strong>of</strong> matrix assembly using 4 threads.<br />

IV. SPARSE MATRIX-VECTOR MULTIPLICATION<br />

It is well established that matrix-vector multiplication<br />

( ) exhibits a low floating-point operations<br />

(FLOP) count to memory access ratio, regardless <strong>of</strong><br />

whether A is dense or sparse [3, 4]. This low ratio <strong>of</strong><br />

FLOP/BYTE makes SMVM a memory bandwidth<br />

limited problem requiring the use <strong>of</strong> optimization<br />

techniques which efficiently use the memory hierarchy<br />

system (main memory, caches and registers).<br />

The experiments conducted and presented in this<br />

section aim at analyzing both the effectiveness and the<br />

limitation <strong>of</strong> the commonly used SMVM optimization<br />

techniques when applied on the matrix set described in<br />

TABLE I.<br />

Instead <strong>of</strong> using the ELLPACK storage described<br />

above which could incur a large number <strong>of</strong> padded zeros<br />

in matrices arising from a high order finite element<br />

formulation, a variation <strong>of</strong> this storage, known as the<br />

Hybrid (HYB) storage, is used instead. In this storage<br />

scheme, some <strong>of</strong> the non-zeros are stored in a coordinate<br />

list format (COO) so as to minimize the number <strong>of</strong><br />

padded zeros in the ELLPACK storage as illustrated in<br />

9%<br />

Matrix assembly execution cycles<br />

Pthreads Lock / Unlock execution cycles<br />

24%<br />

- 256 - 15th IGTE Symposium 2012<br />

Figure 5.<br />

00 01<br />

11 14<br />

20 22 25<br />

40<br />

41<br />

32 33 35<br />

44 45<br />

50 51 52 55<br />

ELLPACK<br />

sparse format<br />

ELL_values ELL_colum_ind<br />

00 01<br />

11 14<br />

0<br />

0<br />

0<br />

1<br />

1<br />

4<br />

1<br />

1<br />

Coordinate (COO) list<br />

sparse format<br />

20 22 25 0 2 5<br />

COO_values 45 55<br />

32 33 35 2 3 5 COO_row_ind 4 5<br />

40 41 44 0 1 4 COO_col_ind 5 5<br />

50 51 52 0 1 2<br />

Fillin<br />

Nonzeros stored in<br />

COO<br />

Figure 5: Hybrid (HYB) storage scheme. Some non-zeros are<br />

stored in a coordinate list format (COO) in order to reduce the<br />

total number <strong>of</strong> padded zeros.<br />

In order to evaluate the magnitude <strong>of</strong> the impact <strong>of</strong><br />

accessing X on SMVM performance, the multiplication<br />

by X[column] was replaced by X[i] (Algorithm 2, line 7).<br />

Although this multiplication yielded an incorrect result,<br />

the aim was to show an upper bound on performance gain<br />

in cache blocking (i.e. no cache misses on X).<br />

Algorithm 2: Modified SMVM to eliminate the effect <strong>of</strong> cache<br />

misses on .<br />

Figure 6: BDC-1: SMVM performance when using cache<br />

blocking on .<br />

Eliminating the cache misses <strong>of</strong> has increased the<br />

performance <strong>of</strong> SMVM significantly (as anticipated)<br />

when the matrix was unstructured (i.e. BDC-1) as shown<br />

in Figure 6 (Natural ordering). To further validate the<br />

results, the set <strong>of</strong> matrices in TABLE I (BDC-1) were<br />

ordered to reduced their bandwidth using the Reverse


Cuthill-McKee (RCM) technique [5]. When the matrices<br />

were ordered using RCM, the performance <strong>of</strong> SMVM<br />

using cache blocking was close to the performance <strong>of</strong><br />

SMVM without cache blocking (Figure 6), since cache<br />

misses were reduced due to the ordered access pattern on<br />

.<br />

A. Loop Setup Overhead<br />

One <strong>of</strong> the factors that has been argued to be contributing<br />

to reducing the performance <strong>of</strong> SMVM is the low number<br />

<strong>of</strong> non-zeros per row[6]. For each row <strong>of</strong> the matrix A,<br />

the inner loop <strong>of</strong> the SMVM code, whether using the<br />

CSR storage (as shown in line 5 <strong>of</strong> Algorithm 2) or using<br />

the HYB storage, iterates over the row's non-zeros and<br />

multiplies them by the corresponding entries in . When<br />

only a few non-zeros are present, the inner loop setup<br />

overhead time would dominate the calculation time and<br />

would not be able to be amortized over the short<br />

calculation time <strong>of</strong> a few non-zeros. Since the set <strong>of</strong> FEM<br />

matrices used in this work falls within this category (i.e.<br />

low per row) a test examining the degradation <strong>of</strong><br />

the SMVM performance due to the inner loop setup<br />

overhead has been carried out by replacing the inner loop<br />

<strong>of</strong> SMVM with a set <strong>of</strong> instructions which explicitly<br />

multiply each element <strong>of</strong> by its corresponding element<br />

in ; this technique is <strong>of</strong>ten referred to as “loop<br />

unrolling”. “Loop unrolling” has been made possible by<br />

the use <strong>of</strong> the ELLPACK (or Hybrid) sparse format since<br />

the number <strong>of</strong> non-zeros per row is fixed, hence the<br />

number <strong>of</strong> times an inner loop executes its inner<br />

instruction is fixed. In such a case, the inner loop can be<br />

eliminated and the instruction within the inner loop can<br />

be replaced by explicitly writing the set <strong>of</strong> instructions<br />

that would have been executed by the inner loop.<br />

Algorithm 3 illustrates a sparse matrix-vector<br />

multiplication using the ELLPACK storage. Assuming<br />

that the width <strong>of</strong> the ELLPACK storage is 7, the inner<br />

loop which multiplies the non-zeros <strong>of</strong> a row by the<br />

corresponding locations in is replaced by seven<br />

instructions. The effect <strong>of</strong> this technique on the<br />

performance <strong>of</strong> SMVM when applied on BDC-1 matrix<br />

test set is shown in Figure 7. It can be seen that while loop<br />

unrolling did increase SMVM performance, it was not as<br />

significant as the performance gain obtained from<br />

eliminating cache misses on .<br />

Algorithm 3: SMVM loop unrolling using NVIDIA's Hybrid<br />

sparse storage.<br />

- 257 - 15th IGTE Symposium 2012<br />

Figure 7: BDC-1: Loop unrolling and cache blocking (singleprecision<br />

floating-point operations).<br />

B. SMVM memory bandwidth<br />

Figure 8 shows the memory bandwidth when executing<br />

SMVM using different optimization techniques. The<br />

sustainable memory bandwidth obtained from executing<br />

the STREAM benchmark [7] on an Intel i7-860 processor<br />

is also shown on the same figure. The widely used<br />

STREAM benchmark serves as an indicator <strong>of</strong> the<br />

realistic performance <strong>of</strong> the memory subsystem <strong>of</strong> a<br />

particular processer. In this benchmark, a set <strong>of</strong> kernels is<br />

applied on a dense data structure chosen to be larger than<br />

the available cache <strong>of</strong> a particular processor.<br />

Figure 8: BDC-1: SMVM sustainable memory bandwidth<br />

(MB/s) on Intel i7-860 processor.<br />

In general, a naïve implementation <strong>of</strong> SMVM (i.e. no<br />

optimization) would work well below 50% <strong>of</strong> the<br />

STREAM benchmark sustained memory bandwidth,<br />

while an optimized SMVM (with cache blocking)<br />

attained 70% <strong>of</strong> the STREAM benchmarks. The<br />

implication <strong>of</strong> these results highlights the effect <strong>of</strong> using<br />

sparse storage, which introduces additional memory<br />

fetches due to indirect addressing which also prevents<br />

efficient memory pre-fetching by the processor. A similar<br />

observation has been found when running the same<br />

experiments on an older generation <strong>of</strong> quad-core


processor; AMD’s dual-socket, dual-core Opteron 2214<br />

processor.<br />

C. Parallel SMVM<br />

This section compares the sequential and parallel<br />

performance <strong>of</strong> SMVM kernels when applied to matrices<br />

obtained from the set described in TABLE I and a<br />

miscellaneous matrix test set obtained from “the<br />

<strong>University</strong> <strong>of</strong> Florida Sparse Matrix Collection” [8]<br />

shown in TABLE II. The latter set has been widely used in<br />

the past few years by researchers to evaluate the<br />

performance <strong>of</strong> SMVM algorithms. The results <strong>of</strong> our<br />

evaluation are shown in Figure 9.<br />

TABLE II<br />

MISCELLANEOUS MATRIX TEST SET<br />

Matrix DOF NNZ Ave. (Max)<br />

nnz/row<br />

CSR<br />

size<br />

(MB)<br />

Protein 36,417 4,344,765 120 (204) 50<br />

Sphere 83,334 6,010,480 73 (81) 69<br />

Cant. 62,451 4,007,383 65 (78) 46<br />

Tunnel 217,918 11,524,432 53 (180) 133<br />

CFD 46,835 2,374,001 50 (145) 27<br />

Ship. 140,874 7,813,404 26 (68) 42<br />

Econ. 206,500 1,273,389 7 (74) 16<br />

Epidem. 525,825 2,100,225 4 (4) 26<br />

Circuit 170,998 958,936 6 (353) 12<br />

The following observations were concluded from the<br />

results shown in Figure 9:<br />

The sequential performance <strong>of</strong> SMVM kernels when<br />

the size <strong>of</strong> a matrix fits in the available processor<br />

cache is significantly higher than when the matrix<br />

does not fit in the cache (e.g. BDC-1-0.5 and ET-0-<br />

0.5).<br />

Figure 9: Parallel SMVM using HYBRID storage (doubleprecision<br />

floating-point operations)<br />

- 258 - 15th IGTE Symposium 2012<br />

Matrices that have a high percentage number <strong>of</strong> nonzeros<br />

per row attained higher GFLOPS than matrices<br />

with short row lengths. This is not due to the overhead<br />

caused by the inner-loop <strong>of</strong> SMVM (as demonstrated<br />

in section III.A), but to the ratio <strong>of</strong> the DOF and<br />

NNZ. In general, matrices arising in FEM have high<br />

ratios <strong>of</strong> DOF over NNZ, which explains the low<br />

performance relative to other matrices. This explains<br />

also why matrices obtained from 3D first-order finite<br />

element analysis attained higher GFLOPS than<br />

matrices obtained from 2D analysis.<br />

The performance <strong>of</strong> parallel SMVM is affected by the<br />

distribution <strong>of</strong> non-zeros in a row. Matrices arising<br />

from FEM have a balanced distribution <strong>of</strong> the number<br />

<strong>of</strong> non-zeros per row, leading to better thread<br />

utilization and subsequently to higher GFLOPS.<br />

V. PRECONDITIONING TECHNIQUES:INCOMPLETE<br />

CHOLESKY AND INCOMPLETE CHOLESKY WITH FILL-INS.<br />

There are two techniques to solve a system <strong>of</strong> linear<br />

equations where is the coefficient matrix and <br />

is the right hand side vector. The first is to use direct<br />

solver methods and the second is to use iterative methods.<br />

The direct solver methods rely on decomposing the<br />

coefficient matrix, , into upper and lower triangular<br />

matrices and, where . This is a robust<br />

method. However, it is not useful for large systems, since<br />

the triangular matrices L and U lose their sparsity, as zero<br />

entries in the coefficient matrix turn into non-zero<br />

entries in and . Those new entries are referred to as<br />

fill-ins.<br />

A less robust technique is based on iterative<br />

approaches, such as the conjugate gradient method (CG).<br />

This method requires a large number <strong>of</strong> iterations over<br />

the system <strong>of</strong> linear equations to reach the solution. The<br />

number <strong>of</strong> iterations depends upon the condition number<br />

<strong>of</strong> the matrix in . This condition number can be<br />

reduced (i.e. leading to less CG iterations) if a<br />

preconditioner that is based on the incomplete<br />

factorization <strong>of</strong> is applied to the CG method[9].<br />

Incomplete factorization derives its name from the<br />

direct method discussed above. It uses the same<br />

elimination algorithm to decompose the matrix into an<br />

and , which are an approximation <strong>of</strong> and,<br />

obtained by dropping some fill-in entries. One <strong>of</strong> the<br />

dropping strategies during ILU factorization is to drop all<br />

fill-ins so that the sparsity <strong>of</strong> and matches that <strong>of</strong> the<br />

original matrix A. This dropping rule gives rise to an<br />

ILU(0) or IC(0) (incomplete Cholesky in the case <strong>of</strong><br />

symmetric matrices) preconditioner [10], where the zero<br />

denotes that no fill-ins are allowed. Incomplete Cholesky<br />

with no fill-ins has been the preconditioner <strong>of</strong> choice on a<br />

desktop computer mainly due to its ability to reduce the<br />

number <strong>of</strong> iterations <strong>of</strong> a PCG while being inexpensive to<br />

produce and to compute on a desktop computer. The<br />

structures <strong>of</strong> the factors and are a priori known,<br />

making it easy to pre-allocate the storage requirement,<br />

without the need for symbolic factorization. An efficient<br />

implementation would be to duplicate the lower part <strong>of</strong> A<br />

and then perform an in-place factorization by going in an


ordered manner over the entries <strong>of</strong> each row. A very<br />

efficient implementation is found in the SparseLib++<br />

library [11]. TABLE III shows the execution times <strong>of</strong><br />

creating an IC(0) preconditioner.<br />

Matrix Degrees <strong>of</strong><br />

freedom<br />

TABLE III<br />

INCOMPLETE CHOLESKY PERFORMANCE<br />

Upper<br />

triangle<br />

NNZ<br />

CSR size<br />

(MB)<br />

IC(0) time<br />

(sec.)<br />

BDC-1-0.5 38,084 147,636 1.9 0.0275<br />

BDC-1-0.1 632,883 2,521,428 31.3 0.4987<br />

BDC-1-0.04 3,152,216 12,576,171 155 2.594<br />

ET-0.08 38,324 293,643 3.5 0.1359<br />

ET-0.04 409,531 3,204,372 38.2 1.554<br />

ET-0.01R 2,666,039 21,100,983 252 10.3244<br />

In order to improve the convergence rate <strong>of</strong> PCG<br />

beyond that provided by using the IC(0) preconditioner,<br />

much research has focused on extending the idea <strong>of</strong> the<br />

incomplete Cholesky preconditioner by allowing fill-ins<br />

to occur. There are two heuristics used to control the<br />

amount <strong>of</strong> fill-in. The first is based on a drop tolerance<br />

criterion, known as the Incomplete LU Threshold (ILUT)<br />

through which entries are dropped if their values are<br />

below a preset threshold. The second is based on the level<br />

<strong>of</strong> fill-in known as ILU, where symbolic factorization,<br />

using graph theory, is carried out to identify the locations<br />

<strong>of</strong> the fill-ins and their level in the graph. The fill-in<br />

entries that exceed a given level are dropped. Matrix<br />

elements are assigned a level 0, hence IC(0) discards all<br />

fill-ins and the resulting factorized matrix has the same<br />

sparsity pattern as the original matrix. One way to<br />

calculate the level <strong>of</strong> a fill-in is to use the sum rule as<br />

shown in (1). This rule gives rise to a symbolic<br />

factorization algorithm described by Hysom [12] that is<br />

amenable to parallelization. The sparsity <strong>of</strong> each row in<br />

the final preconditioner can be evaluated independently<br />

from the other rows. Figure 10 demonstrates the<br />

scalability <strong>of</strong> this algorithm. Despite that, the runtime <strong>of</strong><br />

the symbolic factorization is considered to be a<br />

bottleneck in our case mainly due to the large number <strong>of</strong><br />

fill-ins that incurred in the final ILU preconditioners<br />

(where =1, 2 or 3) as shown in TABLE IV.<br />

level(i, j) min {level(i, k) level(k, j) 1} (1)<br />

1hmin{i, j}<br />

- 259 - 15th IGTE Symposium 2012<br />

Figure 10: Execution times <strong>of</strong> parallel symbolic factorization <strong>of</strong><br />

BDC-1-0.1 where . The results demonstrate that the multithreaded<br />

implementation <strong>of</strong> Hysom’s algorithm is highly<br />

scalable.<br />

Matrix IC(0)<br />

TABLE IV<br />

FILL-INS<br />

ILU(1) ILU(2) ILU(3)<br />

BDC-1-0.5 148,636 225,244 321,101 429,566<br />

(51%) (116%) (189%)<br />

BDC-1-0.1 2,521,428 3,848,266 5,544,448 7,380,993<br />

(52%) (120%) (193%)<br />

ET-0.08 293,643 700,914 1,368,930 2,555,737<br />

(139%) (366%) (770%)<br />

ET-0.04 3,204,372 7,927,979 15,963,746 30,758,154<br />

(147%) (398%) (860%)<br />

VI. PRECONDITIONER BACKWARD-FORWARD<br />

SUBSTITUTION<br />

The next step is to investigate the degree <strong>of</strong> parallelism<br />

(i.e. the number <strong>of</strong> operations that can be executed<br />

simultaneously) that can be attained when solving a<br />

preconditioner (by backward and forward substitution)<br />

within a PCG iteration obtained from the matrix BDC-1-<br />

0.5 (i.e. a 2D problem) and the matrix ET-0.08 (i.e. a 3D<br />

problem). A histogram will be used to depict the<br />

maximum degree <strong>of</strong> parallelism and the number <strong>of</strong> steps<br />

required to solve each <strong>of</strong> the preconditioners. The x-axis<br />

shows the number <strong>of</strong> steps required to solve a matrix, and<br />

the y-axis <strong>of</strong> the histogram shows the number <strong>of</strong> rows<br />

that can be solved simultaneously at a given step. Figure<br />

11 and Figure 12 show the rows dependency histograms <strong>of</strong><br />

ILU(1) and ILU(3) <strong>of</strong> the matrix BDC-1-0.5 respectively.<br />

The maximum degree <strong>of</strong> parallelism <strong>of</strong> ILU(1) was 1,151<br />

and the number <strong>of</strong> steps required to solve the<br />

preconditioner was 196. On the other hand, the ILU(3)<br />

preconditioner <strong>of</strong> the same problem had a maximum<br />

degree <strong>of</strong> parallelism equal to 453 and 399 steps were<br />

required to solve it. The more fill-ins that existed in a<br />

preconditioner, the less parallelism could be exploited.<br />

Solving a preconditioner obtained from a 3D problem<br />

is less amenable to parallelism than that obtained from a<br />

2D problem. For instance, an ILU(1) preconditioner<br />

obtained from ET-0.08 (3D electric transformer problem)<br />

can be solved in 12,559 steps where the maximum<br />

number <strong>of</strong> rows that could be solved simultaneously is<br />

only 16 (see TABLE V) and an ILU(3) preconditioner <strong>of</strong><br />

the same problem can be solved in 24,125 steps where the


maximum attainable degree <strong>of</strong> parallelism is only 16 (see<br />

TABLE VI). A 2D problem that has the same number <strong>of</strong><br />

degrees <strong>of</strong> freedom as ET-0.08 (i.e. BDC-1-0.5) was<br />

more amenable to parallelism.<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

0 20 40 60 80 100 120 140 160 180 200<br />

Figure 11: Degree <strong>of</strong> parallelism (y-axis) attained when solving<br />

ILU(1) <strong>of</strong> BDC-1-0.5. The x-axis represents the number <strong>of</strong><br />

sequential steps to finish the solve stage.<br />

500<br />

450<br />

400<br />

350<br />

300<br />

250<br />

200<br />

150<br />

100<br />

50<br />

0<br />

0 50 100 150 200 250 300 350 400<br />

Figure 12: Degree <strong>of</strong> parallelism (y-axis) when solving ILU(3)<br />

<strong>of</strong> BDC-1-0.5. The x-axis represents the number <strong>of</strong> sequential<br />

steps to finish the solve stage.<br />

TABLE V<br />

ILU(1) OF ET-0.08<br />

Preconditioner NNZ Max. Solving<br />

level and ordering<br />

degree <strong>of</strong><br />

parallelism<br />

steps<br />

ILU(1)-Natrual 700,914 69 12,559<br />

ILU(1)-AMD 673,511 207 569<br />

ILU(1)-RCM 585,646 36 3,491<br />

TABLE VI<br />

ILU(3) OF ET-0.08<br />

Preconditioner NNZ Max. Solving<br />

level and ordering<br />

degree <strong>of</strong><br />

parallelism<br />

steps<br />

ILU(3)-Natrual 2,555,737 16 24,125<br />

ILU(3)-AMD 1,937,345 119 1,690<br />

ILU(3)-RCM 1,984,093 12 10,548<br />

Since the preconditioner obtained from the 3D<br />

transformer problem exhibited a low degree <strong>of</strong><br />

parallelism, approximate minimum degree (AMD) [13]<br />

and Reverse Cuthill–McKee (RCM) orderings were first<br />

applied on the ET-0.08 matrix before generating ILU(1)<br />

and ILU(3) preconditioners. Although, it has been<br />

established in the literature that orderings to reduce fillins<br />

or increase parallelism (RCM and AMD) degrade the<br />

quality <strong>of</strong> the preconditioner and lead to more PCG<br />

- 260 - 15th IGTE Symposium 2012<br />

iterations than when using Natural ordering [14], [15], the<br />

aim <strong>of</strong> this experiment was to only focus on the<br />

implication <strong>of</strong> ordering in terms <strong>of</strong> reducing the overall<br />

solver time.<br />

The number <strong>of</strong> non-zeros in the upper triangle<br />

preconditioner, the maximum degree <strong>of</strong> parallelism and<br />

the solving steps <strong>of</strong> both preconditioners ILU(1) and<br />

ILU(3) using different orderings are summarized in<br />

TABLE V and TABLE VI respectively. AMD ordering<br />

resulted in a relatively higher parallelizable<br />

preconditioner solver than Natural and RCM orderings.<br />

On the other hand, RCM exhibited a similar degree <strong>of</strong><br />

parallelism to that <strong>of</strong> the Natural ordering but required<br />

less solve steps. The reason being that RCM reduces the<br />

bandwidth <strong>of</strong> the matrix and balances the distribution <strong>of</strong><br />

non-zeros between rows. This implies that there is a<br />

balance in the degree <strong>of</strong> parallelism among steps, which<br />

will translate into balanced threads utilization.<br />

VII. CONCLUSION<br />

A. Matrix Structure<br />

Given the dependency <strong>of</strong> the sparse storage upon the<br />

problem structure, it is important to devise matrix test<br />

sets that are relevant to the problem domain (i.e. low<br />

frequency electromagnetic analysis using the Finite<br />

Element Method). One <strong>of</strong> the advantages <strong>of</strong> matrices<br />

generated in FEM is the absence <strong>of</strong> a large discrepancy in<br />

the number <strong>of</strong> non-zeros between rows. This enables the<br />

use <strong>of</strong> a sparse storage technique such as ELLPACK that<br />

pre-allocate memory to store the coefficient matrix.<br />

B. 2D vs 3D Analysis<br />

Matrices generated from 2D finite element analysis are<br />

less dense that those generated from 3D problems.<br />

SMVM attained more GFLOPS in the case <strong>of</strong> a denser<br />

matrix (i.e. first-order finite element formulation <strong>of</strong> a 3D<br />

problem). However, ILU preconditioner generated in the<br />

case <strong>of</strong> a 3D problem was less amenable to parallelism<br />

than a preconditioner <strong>of</strong> a 2D problem.<br />

C. Matrix Ordering<br />

Overall, although parallelism can be explored and<br />

exploited in most <strong>of</strong> the examined FEM kernels, the main<br />

bottleneck remains the solver part <strong>of</strong> the FEM process.<br />

There are many sub-kernels that are executed within a<br />

large number <strong>of</strong> loops. This places a stringent<br />

requirement on sparse data structures as there is no gain if<br />

these structures change in between sub-kernels. Further,<br />

in each sub-kernel within the solver, there is a large<br />

number <strong>of</strong> simple operations to be executed. The number<br />

<strong>of</strong> operations is related to the degrees <strong>of</strong> freedom <strong>of</strong> the<br />

matrix and the operations’ complexity is related to the<br />

number <strong>of</strong> non-zeros per row. These simple operations<br />

are memory bandwidth limited requiring that each<br />

operation be optimized in terms <strong>of</strong> memory access.<br />

Hence, single thread optimization remains the most<br />

essential part <strong>of</strong> the solver’s optimization.<br />

Reverse Cuthill-McKee (RCM) ordering has been<br />

found to be beneficial assuming that it will not degrade<br />

the performance <strong>of</strong> PCG. It balances threads utilization


when solving a preconditioner and also in enhances cache<br />

performance in SMVM.<br />

REFERENCES<br />

[1] R. G. Grimes, D. M. Young, and D. R. Kincaid, "ITPACK 2.0:<br />

User's Guide," CNA-150, Center for Numerical<br />

Analysis,<strong>University</strong> <strong>of</strong> Texas,Austin,Texas<br />

,August 1979.<br />

[2] Intel, "Intel VTune Amplifier XE 2011," 2011 ed: Intel<br />

Corporation, 2011.<br />

[3] S. Toledo, "Improving the memory-system performance <strong>of</strong> sparsematrix<br />

vector multiplication," IBM Journal <strong>of</strong> Research and<br />

Development, vol. 41, pp. 711-725, 1997.<br />

[4] R. Vuduc, "Automatic performance tuning <strong>of</strong> sparse matrix<br />

kernels," PhD thesis, <strong>University</strong> <strong>of</strong> California,Berkley, 2003.<br />

[5] E. Cuthill and J. McKee, "Reducing the bandwidth <strong>of</strong> sparse<br />

symmetric matrices," presented at the <strong>Proceedings</strong> <strong>of</strong> the 1969 24th<br />

national conference, 1969.<br />

[6] S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J.<br />

Demmel, "Optimization <strong>of</strong> sparse matrix-vector multiplication on<br />

emerging multicore platforms," Parallel Computing, vol. 35, pp.<br />

178-194, 2009.<br />

[7] J. D. McCalpin, "STREAM: Sustainable memory bandwidth in<br />

high performance computers," STREAM: Sustainable Memory<br />

Bandwidth in High Performance Computers, 1991.<br />

[8] T. A. Davis and Y. Hu, "The <strong>University</strong> <strong>of</strong> Florida sparse matrix<br />

collection," ACM Transactions on Mathematical S<strong>of</strong>tware (TOMS),<br />

vol. 38, p. 1, 2011.<br />

[9] H. A. Van der Vorst, "Preconditioning by incomplete<br />

decompositions," Ph.D. Thesis, <strong>University</strong> <strong>of</strong> Utrecht, 1982.<br />

[10] J. Meijerink and V. DER VORST, "An iterative solution method<br />

for linear systems <strong>of</strong> which the coefficient matrix is a symmetric<br />

M-matrix," Mathematics <strong>of</strong> Computation, vol. 31, pp. 148-162,<br />

1977.<br />

[11] R. Pozo and K. Remington, "Sparselib++ v. 1. 5 sparse matrix class<br />

library. reference guide," NASA, 1996.<br />

[12] D. Hysom and A. Pothen, "Level-based incomplete LU<br />

factorization: Graph model and algorithms," Preprint UCRL-JC-<br />

150789, US Department <strong>of</strong> Energy, p. 17, 2002.<br />

[13] P. R. Amestoy, T. A. Davis, and I. S. Duff, "An approximate<br />

minimum degree ordering algorithm," SIAM Journal on Matrix<br />

Analysis and Applications, vol. 17, pp. 886-905, 1996.<br />

[14] I. S. Duff and G. A. Meurant, "The effect <strong>of</strong> ordering on<br />

preconditioned conjugate gradients," BIT Numerical Mathematics,<br />

vol. 29, pp. 635-657, 1989.<br />

[15] S. Doi and T. Washio, "Ordering strategies and related techniques<br />

to overcome the trade-<strong>of</strong>f between parallelism and convergence in<br />

incomplete factorizations," Parallel Computing, vol. 25, pp. 1995-<br />

2014, 1999.<br />

- 261 - 15th IGTE Symposium 2012


- 262 - 15th IGTE Symposium 2012<br />

Diagnosis <strong>of</strong> real cracks from the three spatial<br />

components <strong>of</strong> the eddy current testing signals<br />

M. Rebican∗ , L. Janousek † ,M.Smetana † , T. Strapacova † ,A.Duca∗and G. Preda∗ ∗ <strong>University</strong> Politehnica <strong>of</strong> Bucharest, Spl. Independentei 313, Bucharest 060042, Romania<br />

† Faculty <strong>of</strong> Electrical Engineering, <strong>University</strong> <strong>of</strong> Zilina, Univerzitna 1, 010 26 Zilina, Slovakia<br />

E-mail: mihai.rebican@upb.ro<br />

Abstract—This paper presents a novel approach for diagnosis <strong>of</strong> real cracks from two-dimensional eddy current testing<br />

signals by means <strong>of</strong> a stochastic method, such as tabu search. A new testing probe driving uniformly distributed eddy<br />

currents is employed for the inspection. Three spatial components <strong>of</strong> the perturbation field due to partially conductive<br />

cracks are sensed as the response signals in order to enhance information level <strong>of</strong> the inspection. The signals are simulated<br />

by a fast forward FEM-BEM solver using a database. Two crack models are proposed for the inversion: a crack with cuboid<br />

shape and a crack with more complex shape. In the both cases, the cracks have uniform conductivity. The length, depth,<br />

width and conductivity <strong>of</strong> the crack are unknown in the inversion process. Numerical results <strong>of</strong> the 3D reconstruction <strong>of</strong><br />

partially conductive cracks from simulated 2D signals with added noise are presented and discussed.<br />

Index Terms—partially conductive cracks, diagnosis, eddy current testing, tabu search.<br />

I. INTRODUCTION<br />

Eddy current testing (ECT) is one <strong>of</strong> the most common<br />

electromagnetic methods employed in non-destructive<br />

evaluation <strong>of</strong> conductive materials. The principle <strong>of</strong><br />

ECT underlies in the interaction <strong>of</strong> induced eddy currents<br />

with a structure <strong>of</strong> an examined conductive body<br />

based on the electromagnetic induction phenomena. The<br />

method is widely applied in various fields accounting for<br />

measurements <strong>of</strong> material thickness, proximity measurements,<br />

corrosion evaluation, sorting <strong>of</strong> materials based on<br />

the electromagnetic properties. However, the most wide<br />

spread area <strong>of</strong> its application in present is the detection<br />

and possible diagnosis <strong>of</strong> discontinuities.<br />

Real cracks, such as stress corrosion cracks (SCC),<br />

usually appear in steam generator (SG) tubes <strong>of</strong> pressurized<br />

water reactor (PWR) <strong>of</strong> nuclear power plants.<br />

Recently, quite satisfactory results are reported by several<br />

groups for automated evaluation <strong>of</strong> artificial slits [1] and<br />

even for several parallel notches [2] using eddy current<br />

testing (ECT). However, evaluation <strong>of</strong> real cracks from<br />

ECT signals remains still very difficult.<br />

In the case <strong>of</strong> artificial EDM notches, the width is<br />

usually considered fixed in the inversion process <strong>of</strong> ECT<br />

signals. However, for cracks with non-zero conductivity<br />

the width affects the signal and it has to be considered<br />

unknown during reconstruction [3]. It means that the<br />

additional variable should be taken into account for<br />

evaluation <strong>of</strong> a detected SCC what considerably increases<br />

ill-posedness <strong>of</strong> the inverse problem [4]. Thus, many unsatisfactory<br />

results are reported when the automated procedures<br />

originally developed for non-conductive cracks<br />

are employed in the evaluation <strong>of</strong> SCCs. It is stated<br />

that one <strong>of</strong> the possible reasons is lack <strong>of</strong> sufficient<br />

information [1].<br />

Several studies <strong>of</strong> the authors focused on enhancing<br />

information level <strong>of</strong> eddy current testing signals and on<br />

decreasing uncertainty in evaluation [5], [6]. Promising<br />

results create new challenges concerning development <strong>of</strong><br />

automatic procedures for diagnosis <strong>of</strong> real cracks.<br />

In a previous work [2], the authors developed an<br />

algorithm for reconstruction <strong>of</strong> multiple artificial slits<br />

from ECT signals by means <strong>of</strong> a stochastic optimization<br />

methods, such as tabu search. The reconstruction <strong>of</strong> multiple<br />

cracks was a 3D one. Therefore, the scheme is also<br />

appropriate for reconstruction <strong>of</strong> a partially conductive<br />

crack, when the width is not considered fixed.<br />

The paper proposes a novel approach for the threedimensional<br />

reconstruction <strong>of</strong> partially conductive cracks<br />

from simulated two-dimensional ECT signals, consisting<br />

<strong>of</strong> all the three spatial components <strong>of</strong> the perturbation<br />

field. Two crack models are proposed for the inversion.<br />

The first one has a cuboid shape and the other reflects<br />

a more complex geometry. Both the crack models consider<br />

uniform distribution <strong>of</strong> the partial conductivity. The<br />

length, depth, width and conductivity <strong>of</strong> the cracks are<br />

unknown in the inversion process <strong>of</strong> the signals. The<br />

validity <strong>of</strong> the approach is proved using perturbed ECT<br />

signals by added noise in the inversion process.<br />

II. EDDY CURRENT TESTING PROBLEM DEFINITION<br />

A plate specimen having the electromagnetic parameters<br />

<strong>of</strong> a stainless steel SUS316L is inspected in this<br />

study. The specimen has a thickness <strong>of</strong> t =10mm, a<br />

conductivity <strong>of</strong> σ =1.35 MS/m and a relative permeability<br />

<strong>of</strong> μr =1.<br />

Figure 1 shows the configuration <strong>of</strong> the plate (region<br />

Ω0) with a single surface breaking crack (shadow zone)<br />

located inside the region Ω1. The crack region Ω1 (22 ×<br />

2 × 10 mm3 ) is uniformly divided into a grid composed<br />

from nx × ny × nz (11×5×10) cells defining a possible<br />

crack geometry. The dimensions <strong>of</strong> each cell are 2.0 mm<br />

in length, 0.4 mm in width, and 1.0 mm in depth.<br />

A new eddy-current probe proposed by the authors<br />

is employed for the near-side inspection <strong>of</strong> the plate<br />

[7]. It consists <strong>of</strong> two circular exciting coils positioned<br />

apart from each other and oriented normally regarding


2<br />

1<br />

1<br />

nz<br />

Crack<br />

n y<br />

Ω 1<br />

Probe<br />

1 2 nx<br />

y<br />

x<br />

Scanning<br />

Fig. 1. Configuration <strong>of</strong> the plate specimen with a crack.<br />

the plate surface. The circular coils are connected in<br />

series but magnetically opposite to induce uniformly<br />

distributed eddy currents in the plate. The exciting coils<br />

are supplied from a harmonic source with a frequency<br />

<strong>of</strong> 5kHz and the current density 1A/mm 2 . A detection<br />

system <strong>of</strong> the probe is composed <strong>of</strong> three small circular<br />

coils oriented along three axes perpendicularly to each<br />

other [5]. The detection system is located in the center<br />

between the exciting coils to gain high sensitivity as<br />

the direct coupling between the exciting coils and the<br />

detectors is minimal at this position.<br />

Figure 2 shows the configuration <strong>of</strong> the new probe.<br />

Dimensions <strong>of</strong> the detecting coils are as follows: an inner<br />

diameter <strong>of</strong> 1.2mm, an outer diameter <strong>of</strong> 3.2mm and a<br />

winding height <strong>of</strong> 0.8mm.<br />

Two-dimensional scanning, so called C-scan, is performed<br />

over the cracked surface with a lift-<strong>of</strong>f <strong>of</strong> 1mm.<br />

The real and imaginary parts <strong>of</strong> the induced voltages in<br />

all three detecting coils corresponding to three spatial<br />

components <strong>of</strong> the perturbation electromagnetic field are<br />

sensed and recorded during the inspection.<br />

III. PARTIALLY CONDUCTIVE CRACK MODELS<br />

Partially conductive cracks with a uniform conductivity<br />

smaller than the conductivity <strong>of</strong> the base material are<br />

considered in this paper. Two crack models are proposed<br />

for the partially conductive cracks.<br />

In the first model shown in Figure 3, the crack has a<br />

cuboid shape. The crack parameter vector c consists <strong>of</strong><br />

z<br />

23<br />

x<br />

plate<br />

35<br />

14<br />

Fig. 2. ECT probe configuration.<br />

exciting<br />

coils<br />

10<br />

detectors<br />

Ω 0<br />

- 263 - 15th IGTE Symposium 2012<br />

1<br />

1<br />

2<br />

n<br />

z<br />

2<br />

ny<br />

1 2 n<br />

x<br />

Fig. 3. Crack region division - cuboid shape <strong>of</strong> the crack (model 1).<br />

6integers,c =[ix1,ix2,iy1,iy2,iz,s], whereix1 and<br />

ix2 are the indices <strong>of</strong> the first and last cells <strong>of</strong> the crack<br />

along the length <strong>of</strong> crack, iy1 and iy2 are the indices <strong>of</strong><br />

the first and last cells <strong>of</strong> the crack along the width <strong>of</strong><br />

crack, iz is the number <strong>of</strong> cells <strong>of</strong> the crack along the<br />

depth <strong>of</strong> crack, and σc = s%σ (σc - the conductivity <strong>of</strong><br />

crack, σ - the conductivity <strong>of</strong> base material).<br />

In Figure 3, for a uniform grid with 13×5×10 cells, the<br />

parameter vector is c =[6, 13, 1, 3, 4, 20]. Thus, 8×3×4<br />

cells form the crack, and the crack conductivity is σc =<br />

20%σ.<br />

The second crack model shown in Figure 4 adopts<br />

a more complex shape. The crack depth is considered<br />

as variable along the crack length. The crack parameter<br />

vector c consists <strong>of</strong> nx +3 integers, c =<br />

[iz1,iz2,...,iznx,iy1,iy2,s], whereizk, k = 1,nx is<br />

the number <strong>of</strong> cells <strong>of</strong> the crack along the depth <strong>of</strong> crack,<br />

iy1 and iy2 are the indices <strong>of</strong> the first and last cells <strong>of</strong><br />

the crack along the width <strong>of</strong> crack, and σc = s%σ (σc<br />

- the conductivity <strong>of</strong> crack, σ - the conductivity <strong>of</strong> base<br />

material).<br />

In Figure 4, for a uniform grid with 13 × 5 × 10<br />

cells, the parameter vector contains 16 integers, as c =<br />

[0, 0, 0, 0, 0, 8, 4, 1, 2, 5, 3, 6, 4, 1, 3, 30]. Thus, (8+4+1+<br />

2+5+3+6+4)× 3 cells form the crack, and the crack<br />

conductivity is σc = 30%σ.<br />

In the both models, the cracks have the same orienta-<br />

1<br />

1<br />

2<br />

n<br />

z<br />

2<br />

ny<br />

1 2 n<br />

x<br />

Fig. 4. Crack region division - complex shape <strong>of</strong> the crack (model 2).


tion. The width <strong>of</strong> crack can have the values: 0.4, 0.8,<br />

1.2, 1.6, 2 mm.<br />

IV. DIAGNOSIS OF PARTIALLY CONDUCTIVE CRACKS<br />

The fast-forward FEM-BEM analysis solver using<br />

database [8], [9] is adopted here for the ECT signals simulation.<br />

Actually, a version <strong>of</strong> the algorithm <strong>of</strong> database<br />

upgraded by the authors in previous works [2], [10],<br />

for the computation <strong>of</strong> the ECT signals due to multiple<br />

cracks is used in this paper. The database is designed for<br />

a three-dimensional defect region, and not as usually for a<br />

two-dimensional one where the crack width is considered<br />

fixed. Thus, the ECT response signals can be simulated<br />

also for partially conductive cracks with variable width<br />

using the same database generated in advance.<br />

The authors have already developed an algorithm for<br />

the reconstruction <strong>of</strong> multiple cracks from ECT signals<br />

by means <strong>of</strong> a stochastic optimization method, such as<br />

tabu search [2]. The reconstruction <strong>of</strong> multiple cracks<br />

validated by experimental data was a 3D one. Therefore,<br />

the scheme is also appropriate for the reconstruction<br />

<strong>of</strong> a partially conductive crack, when the width is not<br />

considered constant. It is well known that the width<br />

significantly affects the signal for cracks <strong>of</strong> non-zero<br />

conductivity [3].<br />

Tabu search is employed for the three-dimensional<br />

diagnosis <strong>of</strong> a partially conductive crack [2]. The error<br />

function ε to be minimized is defined as:<br />

ε(c) = <br />

j=X,Y,Z<br />

N<br />

i=1<br />

|ΔVij(c) − ΔV m<br />

ij |2<br />

, (1)<br />

N<br />

i=1<br />

|ΔV m<br />

ij |2<br />

where c is the crack parameter vector <strong>of</strong> the crack,<br />

ΔVij(c) and ΔV m<br />

ij are the simulated (reconstructed) and<br />

true (measured) induced pick-up voltages <strong>of</strong> the coils<br />

(ECT signal) for each spatial component (X, Y and Z<br />

according to the coordinate system shown in Figures 1<br />

and 2) at the i-th scanning point respectively, and N is<br />

the number <strong>of</strong> scanning points.<br />

Figures 5-7 show the simulated ECT signal for each<br />

spatial component (X, Y, Z) caused by a partially conductive<br />

crack with a cuboid shape, which has the parameters:<br />

lc =6mm, wc =0.8mm, dc =4mm, σc =5%<strong>of</strong> σ.<br />

In this paper, the simulated signals are affected by<br />

added noise before the inversion process in order to prove<br />

the validity and robustness <strong>of</strong> the proposed approach. The<br />

perturbed signal is computed as:<br />

(ΔV m<br />

i )ns =ΔV m<br />

i (1 ± ns%), (2)<br />

where ΔV m<br />

i and (ΔV m<br />

i )ns are the initial and perturbed<br />

true signals at the i-th scanning point respectively, ns is<br />

a random value <strong>of</strong> an imposed maximum level, NOISE.<br />

Figure 8 shows the perturbed ECT signal for Z component<br />

when noise <strong>of</strong> maximum level 40% is added to<br />

the simulated signal shown in Figure 7.<br />

- 264 - 15th IGTE Symposium 2012<br />

Absolute voltage [mV]<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

1<br />

1.8<br />

1.6<br />

1.4<br />

1.2<br />

15<br />

10<br />

5<br />

-20 -15 0<br />

-10 -5 0<br />

-5 y [mm]<br />

5<br />

x [mm] 10 -10<br />

15 20-15<br />

Fig. 5. X component <strong>of</strong> the simulated ECT signal.<br />

Absolute voltage [mV]<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

1<br />

1.8<br />

1.6<br />

1.4<br />

1.2<br />

15<br />

10<br />

5<br />

-20 -15 0<br />

-10 -5 0<br />

-5 y [mm]<br />

5<br />

x [mm] 10 -10<br />

15 20-15<br />

Fig. 6. Y component <strong>of</strong> the simulated ECT signal.<br />

V. NUMERICAL RESULTS AND DISCUSSION<br />

The numerical simulations <strong>of</strong> the cracks reconstruction<br />

are performed using an ordinary PC: Intel Core 2 Quad<br />

2.4GHz, 3GB RAM.<br />

In Table I the numerical results <strong>of</strong> the reconstruction<br />

are presented, when a partially conductive crack is<br />

modeled as a cuboid shape (crack model 1, Figure 3).<br />

The column denoted ”Real” gives the true dimensions<br />

(lc × wc × dc) and partial conductivity (σc in % <strong>of</strong> σ)<br />

<strong>of</strong> the crack. The results <strong>of</strong> the diagnosis are provided<br />

in the column labelled as ”Reconstructed” for various<br />

maximum levels <strong>of</strong> the noise added to simulated ECT<br />

signals (2). The error function ε(c) (1) <strong>of</strong> the diagnosis<br />

are reported, too.<br />

The time required for reconstruction <strong>of</strong> one crack is<br />

approximately 90-120 minutes. When there is no noise


Absolute voltage [mV]<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

1<br />

1.8<br />

1.6<br />

1.4<br />

1.2<br />

15<br />

10<br />

5<br />

-20 -15 0<br />

-10 -5 0<br />

-5 y [mm]<br />

5<br />

x [mm] 10 -10<br />

15 20-15<br />

Fig. 7. Z component <strong>of</strong> the simulated ECT signal.<br />

Absolute voltage [mV]<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

1<br />

1.8<br />

1.6<br />

1.4<br />

1.2<br />

15<br />

10<br />

5<br />

-20 -15 0<br />

-10 -5 0<br />

-5 y [mm]<br />

5<br />

x [mm] 10 -10<br />

15 20-15<br />

Fig. 8. Z component <strong>of</strong> the perturbed ECT signal by added noise <strong>of</strong><br />

maximum level 40%.<br />

added to signal (shown in Figures 5-7), the reconstructed<br />

crack is the same with the real crack (the error function<br />

ε =0). In the case <strong>of</strong> diagnosis from signals perturbed<br />

by noise (maximum level NOISE =10, 20, 30, 40%),<br />

TABLE I<br />

RESULTS OF THE PARTIALLY CONDUCTIVE CRACK<br />

RECONSTRUCTION FOR THE CRACK MODEL 1<br />

Crack Real Reconstructed<br />

NOISE (%) 0 10 20 30 40 40<br />

lc [mm] 6.0 6.0 6.0 6.0 6.0 6.0 6.0<br />

wc [mm] 0.8 0.8 0.8 0.8 0.8 0.8 1.2<br />

dc [mm] 4.0 4.0 4.0 4.0 4.0 4.0 4.0<br />

σc[%] 5 5 5 5 4 4 9<br />

ε · 10 −3 - 0 11 21 32 42 45<br />

- 265 - 15th IGTE Symposium 2012<br />

the crack is exactly reconstructed even if the maximum<br />

level <strong>of</strong> noise is high (30, 40%). Starting from another<br />

initial population <strong>of</strong> tabu search, and for a perturbed<br />

signal by added noise <strong>of</strong> highest level, NOISE =40%<br />

(the Z component is shown in Figure 8), the result is<br />

slightly different (the last column in Table I): the length<br />

and depth are equal to the true values; the width and<br />

partial conductivity are estimated not exactly but with<br />

good precision. However, the last two parameters are not<br />

very important from structural integrity point <strong>of</strong> view.<br />

The results clearly show that the crack parameters are<br />

estimated quite precisely from the noisy ECT signals<br />

using the proposed approach.<br />

Figures 9 and 10 show the results <strong>of</strong> three-dimensional<br />

diagnosis <strong>of</strong> the partially conductive crack described in<br />

Table I (column ”Real”) from 2D ECT signals without<br />

and with added noise <strong>of</strong> maximum level 20%, respectively,<br />

when the complex crack model (Figure 4) is<br />

employed for the inversion. The inversion procedure<br />

takes around 5-7 hours.<br />

In the case <strong>of</strong> the reconstruction from signal without<br />

noise, the crack is precisely localized and also its length<br />

width<br />

depth<br />

real reconstructed<br />

σ c=5%σ<br />

σ c=3%σ,<br />

ε=0.004<br />

Fig. 9. Reconstruction <strong>of</strong> a conductive crack from signal without noise.<br />

width<br />

depth<br />

real reconstructed<br />

σ c=5%σ<br />

σ c=4%σ,<br />

ε=0.027<br />

Fig. 10. Reconstruction <strong>of</strong> a conductive crack from perturbed signal<br />

by added noise <strong>of</strong> maximum level 20%.


width<br />

depth<br />

real reconstructed<br />

σ c=8%σ σ c=6%σ,<br />

ε=0.024<br />

Fig. 11. Reconstruction <strong>of</strong> an elliptical conductive crack.<br />

and width are exactly estimated. The depth pr<strong>of</strong>ile does<br />

not perfectly copy the true one. However, the maximum<br />

depth is accurately assessed. But, for the reconstruction<br />

from signal with added noise <strong>of</strong> maximum level <strong>of</strong> 20%,<br />

the width is smaller with a minimum value <strong>of</strong> 0.4 mm<br />

than real width.<br />

A crack with elliptical pr<strong>of</strong>ile is also reconstructed<br />

from signal without noise. The crack opening has a value<br />

<strong>of</strong> wc =0.4 mm, its surface length is lc =14mm, the<br />

maximum depth is dc = 4mm and the crack partial<br />

conductivity is adjusted to σc =8%<strong>of</strong> the base material<br />

conductivity σ. The reconstruction result is shown<br />

in Figure 11. The crack width and its surface length<br />

are accurately assessed. The estimated crack position is<br />

minimally shifted (0.4mm) in the crack width direction<br />

comparing the true position. The maximum depth is<br />

slightly overestimated <strong>of</strong> 1mm. When the signal caused<br />

by the elliptical conductive crack is perturbed by added<br />

noise <strong>of</strong> maximum level <strong>of</strong> 20%, and then is used in<br />

reconstruction, the maximum depth is overestimated <strong>of</strong><br />

2mm, but the crack width and its surface length are<br />

precisely estimated, too.<br />

The presented results proved effectiveness <strong>of</strong> the proposed<br />

novel approach <strong>of</strong> three-dimensional diagnosis <strong>of</strong><br />

partially conductive cracks, even if cracks with complex<br />

shape and signals with added noise are considered. ECT<br />

response signals gained during C-scan together with<br />

acquiring all three spatial components <strong>of</strong> the perturbation<br />

electromagnetic field significantly improve the preciseness<br />

<strong>of</strong> inversion process using tabu search stochastic<br />

method.<br />

VI. CONCLUSION<br />

A novel approach for three-dimensional diagnosis <strong>of</strong><br />

partially conductive cracks has been proposed in the<br />

paper. A special eddy current probe driving uniformly<br />

distributed eddy currents was used for the inspection <strong>of</strong><br />

a plate specimen. A detection system <strong>of</strong> the probe was<br />

designed in such a way that all three spatial components<br />

<strong>of</strong> the perturbation electromagnetic field were acquired.<br />

- 266 - 15th IGTE Symposium 2012<br />

The tabu search stochastic method was employed for the<br />

reconstruction <strong>of</strong> partially conductive cracks pr<strong>of</strong>ile from<br />

eddy current response signals gained during the C-scan<br />

<strong>of</strong> the probe. The signals were perturbed by added noise.<br />

Two crack models were proposed: a crack with cuboid<br />

shape and the other one with more complex shape.<br />

The length, depth, width and conductivity <strong>of</strong> the crack<br />

were considered unknown in the inversion process. The<br />

conductivity <strong>of</strong> the crack was uniform.<br />

The presented results proved that the proposed approach<br />

allows quite precisely reconstructing threedimensional<br />

pr<strong>of</strong>ile <strong>of</strong> a crack together with its partial<br />

conductivity from signals with added noise.<br />

Further work <strong>of</strong> the authors will concern more realistic<br />

shapes <strong>of</strong> cracks and validation with measured data from<br />

natural cracks (SCC).<br />

ACKNOWLEDGEMENTS<br />

This work has been co-funded by the Sectoral Operational<br />

Programme Human Resources Development<br />

2007-2013 <strong>of</strong> the Romanian Ministry <strong>of</strong> Labour, Family<br />

and Social Protection through the Financial Agreement<br />

POSDRU/89/1.5/S/62557.<br />

This work was supported by the Slovak Research and<br />

Development Agency under the contracts No. APVV-<br />

0349-10 and APVV-0194-07, and by grants <strong>of</strong> the Slovak<br />

Grant Agency VEGA, projects No. 1/0765/11, 1/0927/11.<br />

REFERENCES<br />

[1] N. Yusa, “Development <strong>of</strong> computational inversion techniques to<br />

size cracks from eddy current signals,” Nondestructive testing and<br />

evaluation, vol. 24, pp. 39–52, 2009.<br />

[2] M. Rebican, Z. Chen, N. Yusa, L. Janousek, and K. Miya, “Shape<br />

reconstruction <strong>of</strong> multiple cracks from ECT signals by means <strong>of</strong><br />

a stochastic method,” IEEE Transactions on Magnetics, vol. 42,<br />

pp. 1079–1082, 2006.<br />

[3] Z. Chen, M. Rebican, N. Yusa, and K. Miya, “Fast simulation <strong>of</strong><br />

ECT signal due to a conductive crack <strong>of</strong> arbitrary width,” IEEE<br />

Transactions on Magnetics, vol. 42, pp. 683–686, 2006.<br />

[4] N. Yusa, H. Huang, and K. Miya, “Numerical evaluation <strong>of</strong> the illposedness<br />

<strong>of</strong> eddy current problems to size real cracks,” NDT&E<br />

International, vol. 40, pp. 185–191, 2007.<br />

[5] L. Janousek, M. Smetana, and K. Capova, “Enhancing information<br />

level in eddy-current non-destructive inspection,” International<br />

Journal <strong>of</strong> Applied Electromagnetics and Mechanics, vol. 33, pp.<br />

1149–1155, 2010.<br />

[6] L. Janousek, M. Smetana, and M. Alman, “Decreasing uncertainty<br />

in size estimation <strong>of</strong> stress corrosion cracking from eddy-current<br />

signals,” Studies in Applied Electromagnetics and Mechanics,<br />

vol. 35, pp. 53–60, 2011.<br />

[7] L. Janousek, M. Smetana, and M. Alman, “Decline in ambiguity<br />

<strong>of</strong> partially conductive cracks depth evaluation from eddy current<br />

testing signals,” International Journal <strong>of</strong> Applied Electromagnetics<br />

and Mechanics, vol. 34, 2012 (in press).<br />

[8] Z. Chen, K. Miya, and M. Kurokawa, “Rapid prediction <strong>of</strong> eddy<br />

current testing signals using A−φ method and database,” NDT&E<br />

International, vol. 32, pp. 29–36, 1999.<br />

[9] Z. Chen, K. Aoto, and K. Miya, “Reconstruction <strong>of</strong> cracks with<br />

physical closure from signals <strong>of</strong> eddy current testing,” IEEE<br />

Transactions on Magnetics, vol. 36, pp. 1018–1022, 2000.<br />

[10] M. Rebican, N. Yusa, Z. Chen, K. Miya, T. Uchimoto, and<br />

T. Takagi, “Reconstruction <strong>of</strong> multiple cracks in an ECT roundrobin<br />

test,” International Journal <strong>of</strong> Applied Electromagnetics and<br />

Mechanics, vol. 19, no. 1-4, pp. 399–404, 2004.


- 267 - 15th IGTE Symposium 2012<br />

An Adaptive Galaxy-Based Search Approach for<br />

Electromagnetic Optimization Problems<br />

* Θ Leandro dos Santos Coelho, Θ Teodoro Cardoso Bora and † Piergiorgio Alotto<br />

* Industrial and Systems Eng. Graduate Program, Pontifical Catholic <strong>University</strong> <strong>of</strong> Parana, Curitiba, PR, Brazil<br />

Θ Department <strong>of</strong> Electrical Engineering (PPGEE), Federal <strong>University</strong> <strong>of</strong> Parana (UFPR), Curitiba, PR, Brazil<br />

† Dip. Ingegneria Industriale, Università di Padova, Italy, E-mail: piergiorgio.alotto@dii.unipd.it<br />

Abstract—Optimization metaheuristics have become very popular methods for electromagnetic device design. The Galaxybased<br />

search algorithm (GBSA) is a recently proposed algorithm, inspired by the movement <strong>of</strong> the arms <strong>of</strong> spiral galaxies in<br />

outer space. In this work, a standard and an adaptive version <strong>of</strong> GBSA (AGBSA) based on historic knowledge are applied to<br />

an analytical testcase and to Loney’s solenoid benchmark problem, showing the suitability <strong>of</strong> this technique for<br />

electromagnetic optimization. Furthermore, both algorithmic variants are compared with other well-known stochastic<br />

optimizers.<br />

Index Terms— Electromagnetic optimization, Galaxy-based search algorithm, Loney’s solenoid.<br />

I. INTRODUCTION<br />

Optimization algorithms which include stochastic<br />

components are nowadays commonly classified as<br />

metaheuristics and many <strong>of</strong> them, e.g. Particle Swarm<br />

Optimization (PSO), Genetic Algorithms (GA) and<br />

Evolution Strategies (ES), Differential Evolution (DE),<br />

just to name a few well-known ones, are known to be<br />

powerful techniques for the solution <strong>of</strong> optimization<br />

problems related to the design <strong>of</strong> electromagnetic<br />

devices. Such methods have been studied extensively in<br />

the last decades with growing interest in recent years (see<br />

e.g. [1]-[4]).<br />

A recently introduced metaheuristic which has not yet<br />

received much attention in the electromagnetic<br />

optimization community and which is starting to show<br />

interesting performances in other application areas is the<br />

the Galaxy-based search algorithm (GBSA) [6],[7].<br />

GBSA is a nature-inspired optimization method which<br />

mimics the movement <strong>of</strong> the arms <strong>of</strong> spiral galaxies in<br />

outer space.<br />

The objective <strong>of</strong> this paper is to review the basic<br />

algorithmic features <strong>of</strong> the relatively uncommon GBSA<br />

optimizer and to present a modified and improved<br />

adaptive GBSA (AGBSA) variant. Both algorithms are<br />

then tested on Loney’s solenoid benchmark problem [5],<br />

which features a rough objective function surface typical<br />

<strong>of</strong> many electromagnetic problems in which the direct<br />

problem is solved by numerical methods.<br />

The rest <strong>of</strong> this paper is organized as follows. Section<br />

II provides a detailed description <strong>of</strong> the GBSA algorithm,<br />

while section III is devoted to the application <strong>of</strong> GBSA to<br />

a multiminima analytical test problem. In Section IV, we<br />

describe Loney’s solenoid benchmark problem and<br />

presents the optimization results for the GBSA and<br />

AGBSA algorithmic variants and comparisons with other<br />

metaheuristics, finally the paper concludes with a brief<br />

discussion in Section V.<br />

II. FUNDAMENTALS OF THE GBSA ALGORITHM<br />

GBSA searches the input space using a spiral chaotic<br />

movement approximating the behavior <strong>of</strong> one arm <strong>of</strong> a<br />

spiral galaxy. This movement is driven by a chaotic<br />

process using a logistic map [7]. The main steps <strong>of</strong><br />

GBSA are given in Fig. 1, where S represents the current<br />

solution. The algorithm consists <strong>of</strong> two main<br />

S ← GenerateInitialSolution<br />

S ← LocalSearch (S)<br />

While (termination condition is not met) do<br />

Flag ← False<br />

SpiralChaoticMove (S, Flag)<br />

If Flag then<br />

S ← LocalSearch (S)<br />

Endif Endif<br />

End while while<br />

Fig. 1. Pseudo code <strong>of</strong> classical GBSA.<br />

componentes which are repeated in sequence:<br />

SpiralChaoticMove, shown in Fig. 2, and LocalSearch,<br />

shown in Fig. 3. The SpiralChaoticMove has the role <strong>of</strong><br />

searching around the current solution denoted by S. When<br />

the SpiralChaoticMove procedure finds an improved<br />

solution, it updates S with the improved solution, and the<br />

variable Flag is set to true. When Flag is true, the<br />

LocalSearch component <strong>of</strong> GBSA is activated in order to<br />

locally search around the current optimal solution.<br />

The SpiralChaoticMove component is iterated for<br />

MaxRep times. However, whenever it finds a solution<br />

better than the current optimal solution,<br />

SpiralChaoticMove is terminated and the control <strong>of</strong> the<br />

algorithm is transferred to the main procedure <strong>of</strong> GBSA.<br />

The SpiralChaoticMove component searches the space<br />

around the current best solution using a spiral movement<br />

enhanced by a chaotic variable generated by the logistic<br />

map:<br />

(1)<br />

Where, λ = 4 and x 0 = 0.19 (a sample output for the<br />

case <strong>of</strong> two degrees <strong>of</strong> freedom is given in Fig. 4). It<br />

should be mentioned that the first 5000 iterations <strong>of</strong> the<br />

logistic map are discarded in order not to include in the<br />

generating sequence the transient motion leading to the<br />

chaotic attractor.<br />

The LocalSearch component <strong>of</strong> GBSA may either<br />

find a locally optimal solution or it will exceed the<br />

maximum number <strong>of</strong> iterations kMax without<br />

improvement.<br />

The SpiralChaoticMove component <strong>of</strong> GBSA is the


input:<br />

S, the current best solution (Si is the ith component<br />

i=1:N)<br />

output:<br />

SNext is the first found solution better than S.<br />

Flag if set to true indicates that a better solution has<br />

been found.<br />

parameters:<br />

Each θi is initialised by (–1 + 2 NextChaos()).<br />

Δθ =0.01.<br />

r =0.001.<br />

Δr is set by NextChaos() at each procedure call.<br />

MaxRep is the maximum number <strong>of</strong> local iterations in<br />

SpiralChaoticMove. (e.g. 100)<br />

θ = –π<br />

While rep < MaxRep<br />

For i = 1 to N<br />

SNexti ← Si + NextChaos() r cos(θi)<br />

End<br />

If (f(SNext) ≥ f(S)) then<br />

Flag ← true<br />

Return<br />

Endif<br />

For i = 1 to N<br />

SNexti ← Si - NextChaos() r cos(θi)<br />

End<br />

If (f(SNext) ≥ f(S)) then<br />

Flag ← true<br />

Return<br />

Endif<br />

r ← r + Δr<br />

For i = 1 to N<br />

θi ←θi +Δθ<br />

End<br />

For i = 1 to N<br />

If(θi > π) then<br />

θ ← –π<br />

Endif<br />

End<br />

rep←rep +1<br />

Endwhile<br />

mechanism which is used for exploring the search space<br />

Fig. 2. Pseudo-code <strong>of</strong> the SpiralChaoticMove component <strong>of</strong><br />

GBSA<br />

in order to find the promising area which may include the<br />

optimal solution. In contrast, the LocalSearch is the<br />

GBSA component which is used to explore the promising<br />

area to find within this area the minimum <strong>of</strong> the objective<br />

function. In summary, exploration is conducted by<br />

SpiralChaoticMove while exploitation is carried out by<br />

LocalSearch.<br />

Both exploration and exploitation mechanisms are<br />

necessary for the success <strong>of</strong> any metaheuristic: without<br />

the exploitation mechanism, the metaheuristic may not be<br />

able to obtain accurate solutions whereas without the<br />

exploration mechanism, the metaheuristic may get easily<br />

trapped into a local optimum.<br />

The advantage <strong>of</strong> using the LocalSeach component<br />

with respect to some other exploitation mechanisms, such<br />

as mutation typical <strong>of</strong> Genetic Algorithms, is that the<br />

- 268 - 15th IGTE Symposium 2012<br />

input:<br />

S, the current best solution (Si is the ith component<br />

i=1:N)<br />

output:<br />

SNext is a solution better than S<br />

parameters:<br />

ΔS is the step size<br />

α is a dynamic parameter .<br />

KMax is the maximum number <strong>of</strong> local iterations in<br />

LocalSearch. (e.g. 100).<br />

For i = 1 to N<br />

a←1<br />

k←0<br />

while k < kMax<br />

SLi ←Si –α·ΔS·NextChaos()<br />

SUi ←Si +α·ΔS·NextChaos()<br />

If f(SL) < f(S) and f(SU) < f(S) then<br />

Goto Endrepeat<br />

Endif<br />

If f(SU) > f(S) then<br />

Si ← SUi<br />

SLi ← Sui<br />

α ← α + 0.01 × NextChaos()<br />

k←k+1<br />

ElseIf f(SL) > f(S) then<br />

Si ← SLi<br />

SUi ← SLi α ← α + 0.01 × NextChaos()<br />

k←k+1<br />

Else<br />

α ← α + 0.05 × NextChaos()<br />

k←k+1<br />

Endif<br />

Endwhile<br />

SLi ← Si<br />

SRi ← Si Endrepeat<br />

SNext ← S<br />

proposed local search never allows the algorithm lose the<br />

current best solution, thus increasing the greadyness <strong>of</strong><br />

the algorithm.<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

Fig. 3. Pseudo-code <strong>of</strong> the LoccalSearch component <strong>of</strong> GBSA<br />

0.2<br />

0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

Fig. 4. Sample output <strong>of</strong> the logistic map for the two dimensional<br />

case


Moreover, also the SpiralChaoticMove never loses the<br />

best solution found so far. In addition, it uses chaotic<br />

movements in order to reduce the chance <strong>of</strong> getting<br />

trapped into local optima (although this may still happen<br />

as will be seen in the paragraph devoted to the analytical<br />

test case). Due to the chaotic process, GBSA does not<br />

return to the same solution and thus diversity <strong>of</strong> the found<br />

solutions is kept high. Keeping diversity high is<br />

obviously especially important in dealing with<br />

multimodal problems.<br />

The tuning <strong>of</strong> the step size ΔS in the LocalSearch<br />

procedure is a very delicate task in the classical GBSA<br />

and the convergence properties <strong>of</strong> the method strongly<br />

depend on its specific value, which is also problemdependent.<br />

The proposed AGBSA efficiently tunes the<br />

step size using history knowledge <strong>of</strong> mean distances for<br />

the best solution in the previous iteration. The use <strong>of</strong><br />

history knowledge is typical <strong>of</strong> cultural algorithms [9].<br />

III. ANALYTICAL BENCHMARK<br />

The analytical benchmark refers to minimization<br />

<strong>of</strong> the so-called six-hump camel back function<br />

/3+ <br />

. The function has features which are typical <strong>of</strong><br />

many real problems, i.e. a bowl-shaped large-scale<br />

behavior, shown in Fig. 5a, which incorporates a<br />

relatively flat plateau with a rather rough small-scale<br />

behavior with several local minima, shown in Fig. 5b.<br />

The function has two global minima, i.e. at [-<br />

0.089842, 0.712656] and [0.089842, -0.712656] with<br />

value f=-1.031628453 and an additional four local<br />

minima.<br />

f(x1,x2)<br />

f(x1,x2)<br />

−50<br />

2<br />

6<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

−1<br />

2<br />

200<br />

150<br />

100<br />

50<br />

0<br />

1<br />

1<br />

0<br />

x2<br />

0<br />

x2<br />

−1<br />

−1<br />

−2<br />

−2<br />

−2<br />

−4<br />

Fig. 5: a) large-scale and b) small-scale behavior <strong>of</strong> the six-hump camel<br />

back function<br />

−1<br />

−2<br />

x1<br />

x1<br />

0<br />

0<br />

2<br />

1<br />

4<br />

2<br />

160<br />

140<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

0<br />

5<br />

4<br />

3<br />

2<br />

1<br />

0<br />

- 269 - 15th IGTE Symposium 2012<br />

Fig 6. shows the position <strong>of</strong> the optimal<br />

solutions obtained over 30 runs <strong>of</strong> the algorithm with<br />

maximum number <strong>of</strong> objective function evaluation set to<br />

100 (Fig. 6a) and 500 (Fig. 6b).<br />

The picture clearly shows that even with a rather<br />

small number <strong>of</strong> function evaluations, the areas <strong>of</strong> the<br />

global minima are usually correctly identified by the<br />

algorithm, while a larger number <strong>of</strong> function evaluations<br />

allows a good precision. The ability <strong>of</strong> the algorithm to<br />

escape local minima is also clearly shown, since none <strong>of</strong><br />

the runs terminated in one <strong>of</strong> the four local minima at a<br />

higher number <strong>of</strong> function evaluations, while some<br />

trapping in local minima is to be seen with a lower<br />

number <strong>of</strong> function evaluations.<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

−0.2<br />

−0.4<br />

−0.6<br />

−0.8<br />

−1<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

−0.2<br />

−0.4<br />

−0.6<br />

−0.8<br />

−1<br />

−1.5 −1 −0.5 0 0.5 1 1.5<br />

−1.5 −1 −0.5 0 0.5 1 1.5<br />

Fig. 6: Optimal solution over 30 runs : a) max. 500 function evaluations b)<br />

max. 3000 function evaluations<br />

IV. LONEY’S SOLENOID DESIGN<br />

The objective in Loney’s solenoid benchmark problem<br />

[5] is to produce a uniform magnetic flux density B within<br />

a given interval on the axis <strong>of</strong> a main solenoid (Fig. 7). The<br />

problem is described by two degrees <strong>of</strong> freedom (the<br />

separation s and the length l <strong>of</strong> the correcting coils) both<br />

constrained by box bounds


Fig. 7. Axial cross-section <strong>of</strong> Loney’s solenoid (upper half-plane).<br />

Three different basins <strong>of</strong> attraction can be recognized in<br />

the domain <strong>of</strong> the objective function F with values <strong>of</strong> F ><br />

4·10 -8 (high level region: HL), 3·10 -8 < F < 4·10 -8 (low<br />

level region: LL), and F < 3·10 -8 (very low level region -<br />

global minimum region: VL). The very low level region is<br />

a small ellipsoidally shaped area within the thin low level<br />

valley. In both VL and LL areas, small changes in one <strong>of</strong><br />

the parameters result in changes in objective function<br />

values <strong>of</strong> several orders <strong>of</strong> magnitude, as shown in Fig. 8.<br />

In the numerical tests, a stopping criterion <strong>of</strong> 3000<br />

objective function evaluations in each run was used. Tables<br />

I and II show the results over 30 runs. Table II also shows a<br />

comparison with other metaheuristics [10], [11].<br />

It can be noticed that the proposed improvement allows<br />

GBSA to become almost as good as some other wellknown<br />

stochastic optimizers, especially as far as the best<br />

solution is concerned, while improvements in the standard<br />

deviation and mean value are still required.<br />

Fig. 8. Objective function landscape and detail <strong>of</strong> the VL area<br />

Optimization<br />

Method<br />

TABLE I<br />

SIMULATION RESULTS OF F IN 30 RUNS<br />

F(s, l)·10 -8<br />

Maximum Mean Minimum StandardD<br />

(Worst)<br />

(Best) eviation<br />

GBSA 1510.646 95.960 2.121 282.727<br />

AGBSA 1510.631 88.313 2.068 283.834<br />

SOMA [10] 3.8761 3.2671 2.0595 0.5078<br />

Tribes [11] 3.9526 3.4870 2.0574 0.5079<br />

- 270 - 15th IGTE Symposium 2012<br />

TABLE II<br />

BEST SOLUTIONS FOR LONEY’S SOLENOID IN 30 RUNS<br />

Optimization separation length F(s, l)·10<br />

Method s (cm) l (cm)<br />

-8<br />

GBSA 11.8212 1.6519 2.121<br />

AGBSA 1.6072 1.5145 2.068<br />

V. CONCLUSION<br />

This paper proposes an improved AGBSA based on<br />

historic knowledge. Results on Loney’s solenoid design<br />

problem, a benchmark featuring many <strong>of</strong> the<br />

characteristics <strong>of</strong> typical electromagnetic design problems<br />

show promising results. Since the main remaining<br />

weakness <strong>of</strong> the algorithm, compared with other<br />

metaheuristics, is a rather large standard deviation <strong>of</strong><br />

optimal solutions, future research will be targeted at<br />

introducing additional improvements in order to decrease<br />

the spread <strong>of</strong> solutions.<br />

ACKNOWLEDGMENTS<br />

This work was supported by the National Council <strong>of</strong><br />

Scientific and Technologic Development <strong>of</strong> Brazil —<br />

CNPq — under Grant 476235/2011-1/PQ.<br />

REFERENCES<br />

[1] N. Al-Aaawar, T. M. Hijazi, and A. A. Arkadan, “Particle swarm<br />

optimization <strong>of</strong> coupled electromechanical systems,” IEEE<br />

Transactions on Magnetics, vol. 47, no. 5, 2011, pp. 1314-1317.<br />

[2] G. Crevecoeur, P. Sergeant, L. Dupré, and R. Van de Walle, “A<br />

two-level genetic algorithm for electromagnetic optimization,”<br />

IEEE Transactions on Magnetics, vol. 46, no. 7, 2010, pp. 2585-<br />

2595.<br />

[3] K. Watanabe, F. Campelo, Y. Iijima, K. Kawano, T. Matsuo, T.<br />

Mifune, and H. Igarashi, “Optimization <strong>of</strong> inductors using<br />

evolutionary algorithms and its experimental validation,” IEEE<br />

Transactions on Magnetics, vol. 46, no. 8, 2010, pp. 3393-3396.<br />

[4] P. Alotto. A hybrid multiobjective differential evolution method<br />

for electromagnetic device optimization. COMPEL, Vol. 30, No.<br />

6, 2011, pp.1815 – 1828.<br />

[5] P. Di Barba and A. Savini, “Global optimization <strong>of</strong> Loney’s<br />

solenoid by means <strong>of</strong> a deterministic approach,” Int. J. <strong>of</strong> Applied<br />

Electromagnetics and Mechanics, vol. 6, no. 4, pp. 247-254, 1995.<br />

[6] H. Shah-Hosseini, “Principal components analysis by the galaxybased<br />

search algorithm: a novel metaheuristic for continuous<br />

optimization,” Int. J. <strong>of</strong> Comp. Sci. and Eng., vol. 6, no. 1-2, pp.<br />

132-140, 2011.<br />

[7] H. Shah-Hosseini, “Otsu’s criterion-based multilevel thresholding<br />

by a nature-inspired metaheuristic called galaxy-based search<br />

algorithm,” Proc. <strong>of</strong> 3rd World Congr. on Nature and Biologically<br />

Inspired Computing, Salamanca, Spain, pp. 383-388, 2011.<br />

[8] G. Ciuprina, D. Ioan and I. Munteanu, “Use <strong>of</strong> intelligent-particle<br />

swarm optimization in electromagnetics,” IEEE Transactions on<br />

Magnetics, vol. 38, no. 2, pp. 1037-1040, 2002.<br />

[9] R. L. Becerra and C. A. C. Coello, “Cultural differential evolution<br />

for constrained optimization,” Comp. Methods in Appl. Mechanics<br />

and Engineering, vol. 195, no. 33-36, pp. 4303-4322, 2006.<br />

[10] L. S. Coelho and P. Alotto, “Electromagnetic optimization using a<br />

cultural self-organizing migrating algorithm approach based on<br />

normative knowledge,” IEEE Trans. on Magnetics, vol. 45, no. 3,<br />

pp. 1446-1449, 2009.<br />

[11] L. S. Coelho and P. Alotto, “Tribes optimization algorithm applied<br />

to the Loney’s solenoid,” IEEE Trans. on Magnetics, vol. 45, no.<br />

5, pp. 1526- 1529, 2009.


Abstract—Finite element modeling <strong>of</strong> a magnetic circuit used in<br />

automotive technologies is presented. A 3D magnetic analysis<br />

was performed in order to calculate the field distribution on the<br />

surface <strong>of</strong> giant magnetoresistance (GMR) sensors. Model<br />

results were compared with experiments, which showed good<br />

agreement. The validated model was further used to optimize<br />

the magnetic circuit design and to improve the working<br />

performance sensors.<br />

Index Terms— Sensors, Finite element method, Magnetic<br />

circuits, Magnetic fields, Giant Magnetoresistance..<br />

I. INTRODUCTION<br />

Magnetic sensors play an important role in automotive<br />

applications. They are reliable, cost effective with high<br />

performance and provide contactless measurements. They are<br />

majorly employed for applications such as measuring pedal<br />

position, engine transmission control, rotational speed <strong>of</strong> the<br />

wheels, and for anti-lock braking system (ABS) [1].<br />

A new type <strong>of</strong> magnetic sensor based on the Giant<br />

Magneto-resistance phenomenon (GMR) was developed by<br />

Infineon [2]. They <strong>of</strong>fer key benefits such as high sensitivity,<br />

linear operation over the sensing range, good temperature<br />

stability over a wide range and low field detection<br />

capabilities. Therefore, they are capable <strong>of</strong> being more precise<br />

on measuring the position or operating at large distances<br />

from the gear wheel in applications. Another benefit <strong>of</strong> using<br />

GMR elements is the low resistance noise. Presently, GMR<br />

sensors can be used in small fields such as 10 nT at 1 Hz and<br />

up to 10 8 nT. They can operate under temperatures between -<br />

55°C up to 150°C. Unfortunately due to GMR’s high<br />

sensitivity and their low field detection capability GMR<br />

elements can easily drive on saturation, if the detected<br />

magnetic field reached a crucial strength value. Therefore it<br />

is very important to ensure that GMR sensors always stay in<br />

their linear range.<br />

This can be achieved using an experimental procedure to<br />

measure the field distribution <strong>of</strong> the magnetic circuit.<br />

Unfortunately the experimental method is time consuming<br />

and cost expensive. In order to overcome these problems, this<br />

paper presents a model development <strong>of</strong> GMR sensors based<br />

on finite element method which can predict the field strength<br />

on the surface <strong>of</strong> GMR elements with high accuracy.<br />

- 271 - 15th IGTE Symposium 2012<br />

Implementation <strong>of</strong> a 3D magnetic circuit model<br />

for automotive applications<br />

Ioannis Anastasiadis 1, 3 , Andreas Buchinger 1 , Tobias Werth 2 ,Lukas Bellwald 1 and Kurt Preis 3<br />

1 KAI Kompetenzzentrum Automobil- und Industrieelektronik GmbH, Europastrasse 8, Villach, 9524 Austria<br />

2 Infineon Technologies Austria AG, Siemensstrasse 2, 9500 Villach, Austria<br />

3 Institute for Fundamentals and Theory in Electrical Engineering, Kopernikusgasse 24/3, A-8010 <strong>Graz</strong>, Austria<br />

II. GMR SENSOR CONCEPT<br />

Magnetoresistance is the change in resistance <strong>of</strong> a<br />

ferromagnetic material caused by an external magnetic field.<br />

The measure <strong>of</strong> magnetoresistance is usually given by the<br />

ratio ΔR/R, where R is the resistance for zero magnetic field<br />

and ΔR is the change in resistance when magnetic field<br />

changes by an amount ΔH. Usually ΔR/R value is small and<br />

hence the change in DC voltage remains low. In applications<br />

by using a Wheatstone bridge configuration to place the<br />

magnetoresistance elements, it is possible to minimize the DC<br />

<strong>of</strong>fset.<br />

In some cases <strong>of</strong> ferromagnetic multilayer’s stack (Fe/Cr)n,<br />

it was reported that at low temperatures their resistance can<br />

change up to 50 % [3]. Due to this major change in<br />

resistance, this phenomenal behavior was named as Giant<br />

Magneto-Resistance (GMR). GMR elements consist <strong>of</strong> a<br />

sequence <strong>of</strong> ferromagnetic and antiferromagnetic layers,<br />

which drastically change their resistance under an external<br />

magnetic field. The simplest GMR technology structure is the<br />

spin valve consisting <strong>of</strong> three layers, <strong>of</strong> which two<br />

ferromagnetic layers are separated by an antiferromagnetic<br />

layer [4]. One layer has a fixed magnetization direction called<br />

pinned layer-hard layer and the other layer is free to rotate<br />

with external fields and magnetization direction, termed as<br />

free layer-s<strong>of</strong>t layer. For industrial use, this pinned layer can<br />

be created in two ways. The first, using the current flow<br />

which provides heating to the layer and the second method is<br />

to use laser pulses for heating the selected layer. During<br />

cooling, the pinned magnetization is formed. In sensor<br />

technology, the pinned layer has its magnetization direction<br />

perpendicular to the free axis <strong>of</strong> the free layer. This setup<br />

gives a linear response <strong>of</strong> the change in GMR resistance when<br />

an external magnetic field is applied.<br />

GMR is a quantomechanic phenomenon created due to the<br />

orientation <strong>of</strong> conducting electrons while they pass through<br />

the GMR stack. If the spin orientation <strong>of</strong> the electrons is<br />

parallel to the magnetic orientation <strong>of</strong> the layer, they move<br />

freely and the resistance remains low. If the spin orientation<br />

is antiparallel to the orientation <strong>of</strong> the layer, resistance<br />

increases due to collisions with the atoms <strong>of</strong> the layers. For<br />

application purposes, the trigger/external field should have a<br />

magnitude bigger than the saturation field <strong>of</strong> the free layer


and smaller than the stand<strong>of</strong>f field <strong>of</strong> the pinned layer. If this<br />

is not the case, then the magnetization direction <strong>of</strong> the layers<br />

will be affected, which will change the overall characteristics<br />

<strong>of</strong> the magnetic sensor. The equation that describes the<br />

change in the resistance R <strong>of</strong> the GMR element is related to<br />

the angle θ between the magnetization directions <strong>of</strong> the free<br />

and pinned layer. In the simplest form <strong>of</strong> the GMR elements<br />

the change in resistance is proportional to the cosine <strong>of</strong> angle<br />

θ between the magnetization layers [5]:<br />

R − R||<br />

ΔR<br />

ΔR<br />

− ( 1−<br />

m1<br />

⋅ m2<br />

) = [ 1−<br />

cosθ<br />

] (1)<br />

R||<br />

2 R||<br />

2R||<br />

Where R is the resistance <strong>of</strong> the stack, R|| is the resistance<br />

<strong>of</strong> the stack in the parallel state, ΔR is the difference between<br />

the resistance <strong>of</strong> the stack in parallel and antiparallel state,<br />

m1 and m2 are the unit magnetization vectors.<br />

The magnetic-sensor consists <strong>of</strong> 4 GMR elements situated<br />

at the two edges <strong>of</strong> the sensor chip. Typical size <strong>of</strong> the GMR<br />

stacks is approximately 1 μm in length, with 1 mm depth and<br />

a thickness <strong>of</strong> few nanometers. They are connected to each<br />

other with a Whitestone bridge configuration to measure the<br />

speed signal. In addition, an extra GMR element is placed at<br />

the center <strong>of</strong> the IC to calculate the directional movement.<br />

The GMR element configuration is shown in figure 1.<br />

Fig. 1: GMR element configuration<br />

By using the above bridge configuration, it is possible to<br />

compensate the DC-<strong>of</strong>fset signal coming from the magnetic<br />

sources. The output signal is given by the following equation:<br />

R4<br />

R2<br />

Vsign = Vleft<br />

−Vright<br />

= VDD<br />

−VDD<br />

R + R R + R<br />

3 4<br />

2 1<br />

≈ Bxleft − Bxright<br />

(2)<br />

Calculating the magnetic field will provide an indication <strong>of</strong><br />

the sensor output signal. The above equation is valid, since<br />

the change in resistance <strong>of</strong> the GMR elements has a linear<br />

response with the change in magnetic field. This assumption<br />

is correct for fields around zero, but for larger applied fields<br />

the overall characteristics <strong>of</strong> GMR elements will change as<br />

they will saturate.<br />

≈<br />

- 272 - 15th IGTE Symposium 2012<br />

III. MAGNETIC CIRCUITS<br />

Typical magnetic circuits used in automotive technologies<br />

consist <strong>of</strong> a gear wheel, sensor and a magnet. This magnet,<br />

termed as back-bias magnet is the source for the circuit. The<br />

sensor and the back-bias magnet are fixed, while the wheel is<br />

subjected to rotation. The ferromagnetic gear wheel acts as an<br />

accumulator <strong>of</strong> the magnetic field –passive target- and the<br />

fluxes bend according to the position <strong>of</strong> the gear wheel, either<br />

if the static part <strong>of</strong> the circuit faces a tooth or not. This<br />

difference <strong>of</strong> the field distribution is sensed by the GMR<br />

elements and is transformed to an electrical signal as the<br />

output <strong>of</strong> the magnetic sensor. A schematic <strong>of</strong> this typical<br />

circuit is shown in Figure 2.<br />

magnetic sensor<br />

Fig.2. Basic magnetic circuit application<br />

back-bias magnet<br />

Previously, investigations and optimization process for<br />

back-bias magnets and gear wheels geometries was carried<br />

out for GMR magnetic sensors applications. [6, 7]. This paper<br />

presents the investigation and model development <strong>of</strong> the<br />

circuit as shown in Figure 3. The gear wheel consists <strong>of</strong> 44<br />

teeth with a circular pitch <strong>of</strong> 8°.18. The gear wheel is 10 mm<br />

long in y-axis dimension. The back-bias magnet structure<br />

consists <strong>of</strong> two magnets formed together with magnetization<br />

directions on the xz plane tilted at an angle <strong>of</strong> 20° in the z<br />

axis as shown in figure 3. The dimensions <strong>of</strong> the magnet are<br />

10 x 10 x 4 mm. The magnet is a ferrite with a remanence <strong>of</strong><br />

287 mT.<br />

y<br />

z<br />

x<br />

3 mm<br />

20° 20°<br />

3mm<br />

airgap<br />

4 mm<br />

3mm<br />

Fig.3: Magnetic circuit under investigation<br />

The magnetic sensor is placed between the gear wheel and<br />

the magnet. The distance from the end <strong>of</strong> the sensor to the top<br />

<strong>of</strong> a tooth is the airgap distance <strong>of</strong> the circuit. When the gear<br />

wheel is rotated, change in magnetic field distribution on the


GMR element surface takes place. By the rotation <strong>of</strong> the<br />

wheel with respect to the MS location and for a distance <strong>of</strong><br />

one pitch, the field distribution along x-axis has a sinusoidal<br />

form. It is <strong>of</strong> interest to calculate this field distribution and to<br />

compare with experimental results.<br />

IV. MODEL CREATION<br />

The field distribution on the xy plane along the surface <strong>of</strong><br />

the GMR elements was investigated. Additionally, it is<br />

necessary to check also the field strength in the normal<br />

direction (z-direction) in order to determine the sensor circuit<br />

response due to change in the airgap distance. For the above,<br />

it is necessary to investigate the magnetic circuit’s field<br />

distribution in three dimensions (3D). Because <strong>of</strong> the<br />

complexity <strong>of</strong> the problem, it is not possible to derive an<br />

analytical 3D solution. Therefore, finite element method was<br />

used to for this purpose [8]. Within this method the geometry<br />

<strong>of</strong> the problem is discretized in smaller regions, where the<br />

field distribution is calculated by means <strong>of</strong> approximated<br />

polynomial shape functions. The approach <strong>of</strong> the problem is<br />

bottom-up. Model was first created in two dimensions and<br />

then extracted in the third direction. 3D scalar magnetic<br />

element was used for this model. Figure 4 shows the field<br />

distribution for an airgap <strong>of</strong> 2 mm.<br />

Fig.4: field distribution for an airgap <strong>of</strong> 2 mm<br />

For the model creation, back-bias magnet and the gear<br />

wheel were created surrounded by air. Since the GMR<br />

elements do not interfere in the field distribution but only<br />

used to measure the magnetic field they are ignored in the<br />

model. Precautions have to be made for the magnet<br />

surrounding free space. The dimensions should be taken<br />

around 5 times the respective dimensions <strong>of</strong> the magnet for<br />

convergence reasons. Larger the model size will increase the<br />

computational time and smaller may lead to distorted<br />

calculations <strong>of</strong> the field due to calculation errors. All<br />

simulations reported in the paper were carried out using a<br />

commercial FEM tool [9].<br />

A. MS attached to back-bias magnet<br />

Initially, investigations for the case where magnetic sensor<br />

is attached to the back-bias magnet were performed. The air<br />

gap was 2 mm. For a rotation <strong>of</strong> 1 pitch, the field distribution<br />

- 273 - 15th IGTE Symposium 2012<br />

was calculated on the surface <strong>of</strong> the GMR elements. The Bx<br />

field distribution on the surface <strong>of</strong> the left GMR and center<br />

GMR element measured at a point in the center <strong>of</strong> their<br />

surface and for a rotation <strong>of</strong> 1 pitch is shown in figure 5.<br />

Bx(mT)<br />

Bx(mT)<br />

0<br />

-2<br />

-4<br />

-6<br />

-8<br />

-10<br />

-12<br />

-14<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

Bx field on the left GMR<br />

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.18<br />

angle(°)<br />

Fig. 5a:Bx field distribution on the left GMR<br />

Bx field on the center GMR<br />

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.18<br />

angle(°)<br />

Fig. 5b:Bx field distribution on the center GMR<br />

As it can be seen from Figure 5a, the Bx field on the left<br />

GMR element is approximately -10 mT which is big enough<br />

to drive GMR elements on saturation. Therefore MS should<br />

not be attached to the back-bias magnet but kept a distance<br />

between MS and magnet to prevent driving GMR elements on<br />

saturation.<br />

B. MS placed in a distance from the magnet<br />

In this case, a distance <strong>of</strong> 2 mm is kept between the<br />

magnet and the magnetic sensor as shown in figure 6.<br />

20°<br />

20°<br />

Fig.6: The new circuit geometry under investigation<br />

By setting the airgap distance also to 2 mm and considering<br />

that the package <strong>of</strong> the sensor has in normal direction 1 mm<br />

length, the total distance between the magnet back face and<br />

gear wheel teeth was 5 mm.<br />

y<br />

z<br />

x


Hence magnetization <strong>of</strong> the free layer <strong>of</strong> GMR stack follows<br />

the magnetization direction <strong>of</strong> the external plane field<br />

distribution on the surface <strong>of</strong> the element. We calculated the<br />

plane field Bx and By distribution along the surface <strong>of</strong> GMR<br />

stripes. GMR stripes have a length <strong>of</strong> approximately 1 mm in<br />

y-direction. For investigations <strong>of</strong> the plane field at the length<br />

<strong>of</strong> GMR stripe along y-direction, the Bx and By distributions<br />

were calculated at points which were equidistant spaced. The<br />

bottom point at the surface <strong>of</strong> GMR stripe is denoted to be the<br />

0 point and the next point is spaced by 0.05 mm until the last<br />

point <strong>of</strong> calculations on the top point <strong>of</strong> the stripe. The Bx and<br />

By filed distributions for those points and for two GMR<br />

elements, one which is located on the left half-bridge and<br />

another on the center <strong>of</strong> the sensor (GMR5) can be seen on<br />

figures 7 and 8.<br />

Bx(mT)<br />

By(mT)<br />

Bx(mT)<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

-2.5<br />

-3<br />

-3.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

0<br />

1<br />

1.5<br />

2<br />

2.5<br />

3<br />

3.5<br />

angle(°)<br />

4<br />

4.5<br />

5<br />

5.5<br />

6<br />

6.5<br />

7<br />

7.5<br />

8<br />

8.1818<br />

Bx @ 0mm<br />

Bx @ 0.05mm<br />

Bx@ 0.1mm<br />

Bx@ 0.15mm<br />

Bx@ 0.2mm<br />

Bx@ 0.25mm<br />

Bx@ 0.3mm<br />

Bx@0.35mm<br />

Bx@0.4mm<br />

Bx@0.45mm<br />

Bx@0.5mm<br />

Bx@ 0.55mm<br />

Bx@ 0.6mm<br />

Bx@ 0.65mm<br />

Bx@ 0.7mm<br />

Bx@ 0.75mm<br />

Bx@ 0.8mm<br />

Bx@ 0.85mm<br />

Bx@ 0.9mm<br />

Bx@ 0.95mm<br />

Bx@ 1mm<br />

Fig. 7a: Bx field distribution on a GMR on the left half-bridge<br />

0 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.182<br />

angle(°)<br />

By@ 0mm<br />

By@ 0.05mm<br />

By@ 0.1mm<br />

By@ 0.15mm<br />

By@ 0.2mm<br />

By@ 0.25mm<br />

By@ 0.3mm<br />

By@ 0.35mm<br />

By@ 0.4mm<br />

By@ 0.45mm<br />

By@ 0.5mm<br />

By@ 0.55mm<br />

By@ 0.6mm<br />

By@ 0.65mm<br />

By@ 0.7mm<br />

By@ 0.75mm<br />

By@ 0.8mm<br />

By@ 0.85mm<br />

By@ 0.9mm<br />

By@ 0.95mm<br />

By@ 1mm<br />

Fig. 7b: By field distribution on a GMR on the left half-bridge<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

0<br />

1<br />

1.5<br />

2<br />

2.5<br />

3<br />

3.5<br />

4<br />

4.5<br />

5<br />

5.5<br />

6<br />

6.5<br />

7<br />

7.5<br />

8<br />

8.18<br />

angle(°)<br />

Bx@ 0mm<br />

Bx@ 0.05mm<br />

Bx@ 0.1mm<br />

Bx@ 0.15mm<br />

Bx@ 0.2mm<br />

Bx@ 0.25mm<br />

Bx@ 0.3mm<br />

Bx@ 0.35mm<br />

Bx@ 0.4mm<br />

Bx@ 0.45mm<br />

Bx@ 0.5mm<br />

Bx@ 0.55mm<br />

Bx@ 0.6mm<br />

Bx@ 0.65mm<br />

Bx@ 0.7mm<br />

Bx@ 0.75mm<br />

Bx@ 0.8mm<br />

Bx@ 0.85mm<br />

Bx@ 0.9mm<br />

Bx@ 0.95mm<br />

Bx@ 1mm<br />

Fig. 8a: Bx field distribution on a center GMR element<br />

- 274 - 15th IGTE Symposium 2012<br />

By(mT)<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

-1.5<br />

-2<br />

0<br />

1<br />

1.5<br />

2<br />

2.5<br />

3<br />

3.5<br />

4<br />

4.5<br />

5<br />

5.5<br />

6<br />

6.5<br />

7<br />

7.5<br />

angle(°)<br />

8<br />

8.1818<br />

By@ 0mm<br />

By@ 0.05mm<br />

By@ 0.1mm<br />

By@ 0.15mm<br />

By@ 0.2mm<br />

By@ 0.25mm<br />

By@ 0.3mm<br />

By@ 0.35mm<br />

By@ 0.4mm<br />

By@ 0.45mm<br />

By@ 0.5mm<br />

By@ 0.55mm<br />

By@ 0.6mm<br />

By@ 0.65mm<br />

By@ 0.7mm<br />

By@ 0.75mm<br />

By@ 0.8mm<br />

By@ 0.85mm<br />

By@ 0.9mm<br />

By@ 0.95mm<br />

By@ 1mm<br />

Fig. 8b: By field distribution on a center GMR element<br />

The Bx fields along the GMR stripes have always the same<br />

response for a rotation <strong>of</strong> 1 pitch. On the other hand the By<br />

field is shifted each time we move from the bottom side <strong>of</strong> the<br />

stripe towards the upper side. The By fields are<br />

homogeneously distributed along the stripes and have the<br />

same values on the surfaces <strong>of</strong> all GMR stripes. Due to<br />

symmetry reasons, a GMR element situated on the right halfbridge<br />

should also have along their surface similar By and Bx<br />

distribution.<br />

C. Experimental results<br />

Experiments were performed for the configuration shown in<br />

figure 6. To compare with simulations, the gear wheel speed<br />

was set to 1.5 rpm. Such a small speed was chosen, because<br />

the finite element model was built for every corresponding<br />

magnet positions over the rotated angle. As the gear wheel<br />

rotates, the magnetic sensor provides speed and the<br />

directional signal. These measured signals are compared with<br />

simulation results. Drawback <strong>of</strong> the experimental procedure is<br />

that there is no possibility to directly derive the Bx and By<br />

distribution along the GMR stripe, but only the plane field<br />

distribution on the surface <strong>of</strong> GMR elements while gear wheel<br />

is rotated towards magnetic sensor. This experimental field<br />

distribution along the GMR surface is compared with the field<br />

distributions derived from the model.<br />

Hence Bx and By distributions are changed along the GMR<br />

stripes we have to calculated the average field that the<br />

elements are sensed and that field we have to compare it with<br />

the experimental results. Substituting the signal from the left<br />

and right Whetstone bridge, we measure the speed signal<br />

while the center GMR element shows the directional signal.<br />

The results are shown in figure 9. The signal is calculated on<br />

field distribution along the surface <strong>of</strong> the stripes.<br />

Fig. 9a: comparison <strong>of</strong> directional signal


Fig. 9b: comparison <strong>of</strong> speed signal<br />

Again, results are shown for a rotation <strong>of</strong> 1 pitch.<br />

Comparisons for the speed signal between experimental<br />

and simulation results reveal a small deviation <strong>of</strong><br />

approximately 3%. On the other hand, the comparison for the<br />

directional signal which comes from the measurements <strong>of</strong> the<br />

middle GMR element shows a bigger deviation with a mean<br />

value <strong>of</strong> 9%. Differences between simulation and<br />

experimental results are due to inaccuracies in the simulation<br />

model, such as how dense the model is. Another important<br />

issue is that in reality the geometry <strong>of</strong> gear wheel has<br />

deviations from the theoretical geometry due to construction<br />

reasons, for example each pitch distance has not exactly the<br />

same dimensions <strong>of</strong> 6mm or there may be a small deviation<br />

on the height <strong>of</strong> all the teeth <strong>of</strong> the gear wheel. Those<br />

problems can bring an error on the calculated signal.<br />

V. CONCLUSION<br />

A 3D model describing the rotation <strong>of</strong> a GMR sensor<br />

around a gear wheel was developed and verified with<br />

experiments. The model is based on finite element analysis<br />

and is used to calculate the variations <strong>of</strong> the field distribution,<br />

when the gear wheel is rotated around the stator <strong>of</strong> the<br />

magnetic circuit, MS and back-bias magnet. Calculation <strong>of</strong><br />

the field distribution was performed along the GMR element<br />

surface. In parallel, experiments were performed for the same<br />

configuration to support model development.<br />

Comparison shows a small deviation between the compared<br />

values given an indication <strong>of</strong> the valid <strong>of</strong> the model. Such a<br />

model can give us a fast and accurate estimation on the<br />

magnetic circuit’s functionality showing also the maximum<br />

airgap performance <strong>of</strong> this circuit.<br />

ACKNOWLEDGEMENTS<br />

This work was jointly funded by the Federal Ministry <strong>of</strong><br />

Economics and Labour <strong>of</strong> the Republic <strong>of</strong> Austria (contract<br />

98.362/0112-C1/10/2005) and the Carinthian Economic<br />

Promotion Fund (KWF) (contract 98.362/0112-C1/10/2005).<br />

REFERENCES<br />

[1] C.P.O. Treutler, “Magnetic sensors for automotive applications,” Elsevier,<br />

Sensors and Actuators A, vol. 91,2001, pp. 2-6<br />

[2] Dirk Hammersdchmidt, Ernst Katzmaier, et. all, “Giant magneto resistorssensor<br />

technology & automotive applications”, SAE 2005 World Congress<br />

& Exhibition, SAE, Detroit, 01-01-2005, pp. 1-16.<br />

- 275 - 15th IGTE Symposium 2012<br />

[3] M.N.Baibich, M.Broto, A. Fert, F. Nguyen Van Dau, F. Petr<strong>of</strong>f, P.<br />

Etienne, G. Creuzei, A. Frederick and J. Chazelas, “Giant<br />

Magnetoresistance <strong>of</strong> (001) Fe/(001) Cr Magnetic Superlattices,” Phys.<br />

Rev. Lett., vol. 61, num. 21, Nov. 1988, pp. 2472-2475.<br />

[4] Robert L. White, “Giant Magnetoresistance: A Primer” IEEE Trans. on<br />

Magn., vol. 28, Sept. 1992, pp. 2482-2487.<br />

[5] S.E. Russek, R.D: McMichael, M.J: Donahue and S. Kaka, “High Speed<br />

Switching and Rotational Dynamics in Small Magnetic Thin Film<br />

Devices,” Springer, Spin Dynamics in Confined Magnetic Structures 2,<br />

vol.87, 2003, pp. 93-156<br />

[6] I. Anastasiadis, T. Werth, K. Preis, “Evaluation and optimization <strong>of</strong> backbias<br />

magnets for automotive applications using Finite Element Methods”,<br />

IEEE Transactions on Magnetics, vol. 45 (March 2009) no.3, pp. 1332-<br />

1335.<br />

[7] I. Anastasiadis, T. Werth, K. Preis, “Investigation and optimization <strong>of</strong><br />

magnetic sensor gear wheels for automotive applications”, 14 th IGTE<br />

Symposium on Numerical Field Calculation in Electrical Engineering,<br />

19-22 Sept. 2010, submitted.<br />

[8] A. Bonderson,T. Rylander, P. Ingelström, Computational<br />

Electromagnetics (Book style), Springer Inc. 2005.<br />

[9] Tutorial, Electromagnetic Field Analysis Guide, ANSYS Release 11.0,<br />

ANSYS Inc Book International Inc., Canonsburg, PA, 2006


- 276 - 15th IGTE Symposium 2012


- 277 - 15th IGTE Symposium 2012


- 278 - 15th IGTE Symposium 2012


- 279 - 15th IGTE Symposium 2012


- 280 - 15th IGTE Symposium 2012


- 281 - 15th IGTE Symposium 2012


- 282 - 15th IGTE Symposium 2012<br />

Mixed Order Edge-based Finite Element Method<br />

by Means <strong>of</strong> Nonconforming Mesh Connection<br />

Yoshifumi Okamoto and Shuji Sato<br />

*Department <strong>of</strong> Electrical and Electronic Systems Engineering, Utsunomiya <strong>University</strong><br />

7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />

E-mail: okamotoy@cc.utsunomiya-u.ac.jp<br />

Abstract—The first order element is widely used as discretized order <strong>of</strong> edge-based finite element method. The second order<br />

discretization has a tendency <strong>of</strong> escape from the practical magnetic field analysis for the reason <strong>of</strong> many nonzero entries in<br />

global matrix and convergence deterioration <strong>of</strong> a linear solver. Therefore, the performance <strong>of</strong> higher order elements should<br />

be successfully utilized by the restriction <strong>of</strong> analyzed region. This paper presents the mixed order finite element method with<br />

reasonable computational costs by using nonconforming mesh connection. The higher order discretization is applied to the<br />

main region with higher accuracy, and is connected with the outer space discretized by first order element using<br />

nonconforming connection. This paper shows the detailed characteristics <strong>of</strong> nonconforming mixed order edge-based finite<br />

element analysis.<br />

Index Terms— Higher order element, higher order interpolation, mixed order edge-based finite element analysis,<br />

nonconforming mesh connection.<br />

linear combination [5] – [8]. We applied the linear<br />

combination to nonconforming connection to retain the<br />

symmetry <strong>of</strong> the global matrix. Furthermore, we newly<br />

propose the finite element analysis using the<br />

nonconforming connection between 2nd order elements,<br />

and comparisons have been made with conventional<br />

conforming analysis and mixed-order analysis.<br />

I. INTRODUCTION<br />

The edge-based finite element method is widely<br />

recognized as a powerful and practical numerical method<br />

in the design environment for the evaluation <strong>of</strong> temporal<br />

changes <strong>of</strong> electromagnetic field and various<br />

characteristics <strong>of</strong> electrical machine. Furthermore, the<br />

mesh generator has been advanced in order to product the<br />

finite element mesh for the complicated target. From<br />

these technical background, 1st order discretization is<br />

widely adopted as an edge-based finite element method.<br />

On the other hand, the 2nd order discretization has the<br />

tendency <strong>of</strong> escape from the practical magnetic field<br />

analysis owing to the many nonzero entries in global<br />

matrix and convergence deterioration <strong>of</strong> a linear solver<br />

for the algebraic equation. However, 2nd order element<br />

has better convergence characteristics to the true value <strong>of</strong><br />

physical quantity such as the magnetic energy than the<br />

1st order element when the element size is shortened.<br />

Then, it is assumed that 2nd order element should be<br />

adopted at the region where the accuracy is required and<br />

1st order element is adopted at other region. This<br />

combinatorial technique might be capable <strong>of</strong> improving<br />

the convergence characteristics <strong>of</strong> linear solvers and<br />

shortening the elapsed time with lower computational<br />

cost than conventional 2nd order discretization. The<br />

reference [1] describes the mixed order analysis based on<br />

the hierarchical elements. Furthermore, the mixed order<br />

analysis in the discontinuous Galerkin method is applied<br />

to the eddy current problem [2]. These references show<br />

the effectiveness <strong>of</strong> mixed order analysis. However, the<br />

combinatorial technique between higher order elements is<br />

not reported and the degree <strong>of</strong> nonconforming mesh<br />

connection between 2nd and 1st order is not verified.<br />

This paper shows the detailed effectiveness <strong>of</strong> mixed<br />

order edge-based finite element analysis which is realized<br />

using nonconforming mesh connection. The<br />

nonconforming mesh connection is mainly classified into<br />

two methods, in which one is the method based on<br />

Lagrange multiplier [3], [4], and the other is based on the<br />

II. MIXED ORDER FINITE ELEMENT METHOD BY MEANS<br />

OF NONCONFORMING MESH CONNECTION<br />

A. Weak form for edge-based finite element method<br />

The weak form Gi for Maxwell equation in<br />

magnetostatic field is given as follows:<br />

G<br />

i<br />

<br />

<br />

V<br />

( <br />

N ) (<br />

<br />

A)<br />

dV N J 0dV<br />

i<br />

<br />

<br />

Vm<br />

<br />

Vc<br />

( N ) <br />

B dV 0,<br />

where Ni is the edge-based shape function, is the<br />

reluctivity, A is magnetic vector potential, J0 is the<br />

current density vector, Br is the remanence, respectively.<br />

The domain for volume integral V, Vc, and Vm denote the<br />

whole region, the region for magnetizing winding, and<br />

the permanent magnet, respectively. When the magnetic<br />

nonlinearity is taken into account, Newton-Raphson (NR)<br />

method supported by the line-search based on functional<br />

(0, 1.0) [9] is adopted as the nonlinear analysis method.<br />

The ICCG method with shifted parameter [10] is used as<br />

a linear solver for the algebraic equation derived from<br />

edge-based finite element method.<br />

B. Nonconforming mesh connection<br />

This subsection describes the nonconforming mesh<br />

connection using the linear combination, which is<br />

formulated as follows:<br />

<br />

i<br />

i<br />

r<br />

(1)<br />

A N A dl<br />

,<br />

(2)<br />

b<br />

ab<br />

a<br />

k<br />

k<br />

k<br />

where Aab is the vector potential <strong>of</strong> nonconforming edge<br />

ab as shown in Figure 1 (a). In following formulation,<br />

suppose that the global coordinate at the node k is (xk, yk),


the global coordinate <strong>of</strong> node a and b is (xa, ya), (xb, yb)<br />

and the local coordinate <strong>of</strong> node a and b is (a, a), (b,<br />

b). The element shape on master side is assumed as a<br />

square or rectangle. Hence, the global coordinate (x, y)<br />

on master side element is as follows:<br />

x1<br />

x2<br />

x1<br />

x2<br />

x ,<br />

(3)<br />

2 2<br />

y1<br />

y4<br />

y1<br />

y4<br />

y ,<br />

(4)<br />

2 2<br />

where and are local coordinate on the master side<br />

element. , is given as follows:<br />

b<br />

<br />

a<br />

<br />

i ,<br />

(5)<br />

x x<br />

b<br />

a<br />

b<br />

<br />

a<br />

<br />

j ,<br />

(6)<br />

yb ya<br />

where i and j are the unit vectors in x- and y-direction.<br />

The linear combination using 1st and 2nd order mesh on<br />

the master side is mentioned below.<br />

Coefficients for the 1st order nonconforming mesh<br />

connection<br />

Adopting the 1st order element as the discretization <strong>of</strong><br />

master side, (2) becomes the expression:<br />

4<br />

A ab I ke Ake<br />

,<br />

(7)<br />

k 1<br />

where Ike is a coefficient which is evaluated by a line<br />

integral <strong>of</strong> edge-based shape function Nke along the edge<br />

ab as follows:<br />

b<br />

1<br />

dl<br />

I ke N d ( ) d<br />

,<br />

a<br />

ke l N<br />

1<br />

ke tab<br />

d<br />

(8)<br />

where tab is the unit vector in the direction ab, dl/d is the<br />

Jacobian,and is an additional parameter to perform the<br />

analytical line integrals, respectively. Using , the local<br />

coordinates and are transformed into<br />

b<br />

<br />

a a b<br />

,<br />

2 2<br />

(9)<br />

b<br />

<br />

a a<br />

<br />

b<br />

.<br />

2 2<br />

(10)<br />

Subsequently, tab and dl/d for the analytical evaluation<br />

<strong>of</strong> (8) are as follows:<br />

1<br />

tab { ( xb<br />

xa<br />

) i ( yb<br />

ya<br />

) j}<br />

,<br />

l<br />

(11)<br />

2e<br />

N 4e<br />

ab<br />

a<br />

l ab<br />

N 1e<br />

b<br />

1e<br />

N 3e<br />

3e N2e 4e<br />

2e<br />

N 7e<br />

8e<br />

N 8e<br />

a<br />

N 2e<br />

N 9e<br />

l ab<br />

5e<br />

N 10e<br />

N 1e<br />

b<br />

1e<br />

N 5e<br />

7e<br />

N 6e<br />

3e N4e 6e N3e 4e<br />

(a) (b)<br />

Figure 1. Interpolation to the slave edge ab from master<br />

side element. (a) 1st order master side element and (b)<br />

Serendipity 2nd order master side element.<br />

- 283 - 15th IGTE Symposium 2012<br />

2<br />

dl<br />

<br />

d<br />

x<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

x<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

lab<br />

,<br />

2<br />

(12)<br />

where lab is the length <strong>of</strong> edge ab. Then, the 1st order<br />

edge-based shape functions on master side surface are as<br />

follows:<br />

1<br />

( 1<br />

ke<br />

) <br />

N 4<br />

ke <br />

1<br />

( 1<br />

ke<br />

) <br />

4<br />

( 1 k 2)<br />

,<br />

( 3 k 4)<br />

(13)<br />

where ke and ke are the local coordinates on the edge k<br />

according to TABLE I. Substituting (11), (12), and (13),<br />

are substituted for (8), all components <strong>of</strong> Ike are<br />

analytically evaluated as follows:<br />

I<br />

ke<br />

1<br />

( b<br />

<br />

a ){ 2 <br />

ke ( a<br />

<br />

b )}<br />

8<br />

<br />

1<br />

( b<br />

<br />

a ){ 2 ke ( a b<br />

)}<br />

8<br />

2<br />

( 1<br />

( 3<br />

TABLE I<br />

LOCAL COORDINATES FOR 1ST ORDER<br />

EDGE-BASED SHAPE FUNCTION ON MASTER SIDE<br />

k 1 2 3 4<br />

ke 0 0 1.0 -1.0<br />

ke 1.0 -1.0 0 0<br />

k 2)<br />

. (14)<br />

k 4)<br />

Coefficients for the 2nd order nonconforming mesh<br />

connection<br />

Even in the case <strong>of</strong> the linear combination adopting the<br />

2nd order elements as the master side mesh, the<br />

derivation <strong>of</strong> the coefficients for linear combination is in<br />

the same way as 1st order nonconforming connection.<br />

Firstly, 2nd order edge-based shape functions <strong>of</strong><br />

Serendipity type [10] on the master side surface shown in<br />

Figure 1 (b) as follows:<br />

1<br />

( 1<br />

ke<br />

) ( 4<br />

ke<br />

<br />

ke<br />

) <br />

( 1 k 4)<br />

4<br />

1<br />

( 1<br />

ke<br />

) ( 4<br />

ke<br />

ke<br />

) <br />

( 5 k 8)<br />

4<br />

N ke <br />

, (15)<br />

1 2<br />

( 1<br />

) <br />

( k 9)<br />

2<br />

1 2<br />

( 1<br />

) <br />

( k 10)<br />

2<br />

where ke and ke are the local coordinates on the edge k<br />

according to TABLE II. Then, adopting the 2nd order<br />

element as master side discretization, (2) becomes next<br />

expression:<br />

10<br />

ab ke<br />

k 1<br />

A I A .<br />

(16)<br />

ke<br />

When the shape <strong>of</strong> master side element is a square or<br />

rectangle, the order <strong>of</strong> coordinates x and y can be defined<br />

TABLE II<br />

LOCAL COORDINATES FOR 2ND ORDER<br />

EDGE-BASED SHAPE FUNCTION ON MASTER SIDE<br />

k 1 2 3 4 5 6 7 8 9 10<br />

ke 0.5 -0.5 0.5 -0.5 1.0 1.0 -1.0 -1.0 0 0<br />

ke 1.0 1.0 -1.0 -1.0 0.5 -0.5 0.5 -0.5 0 0


as 1st order. Therefore, , , tab, dl/d are all same<br />

as (5), (6), (11), and (12). Substituting these equations<br />

and (15) for (8), all components <strong>of</strong> Ike are analytically<br />

obtained as follows:<br />

<br />

b <br />

a<br />

[ ke<br />

( b<br />

<br />

a ){ 4<br />

ke ( b<br />

<br />

a )<br />

48<br />

<br />

<br />

ke ( b<br />

<br />

a )} 3{<br />

2 <br />

ke ( a<br />

<br />

b )}<br />

{ 4<br />

ke ( a<br />

b<br />

) <br />

ke ( a<br />

<br />

b )}]<br />

<br />

<br />

( 1 k 4)<br />

b<br />

<br />

a<br />

[ ke ( b<br />

<br />

a ){ 4<br />

ke ( b<br />

<br />

a )<br />

48<br />

<br />

ke ( b<br />

<br />

a )} 3{<br />

2 ke ( a b<br />

)}<br />

I ke <br />

. (17)<br />

<br />

{ 4<br />

ke ( a<br />

b<br />

) ke ( a b<br />

)}]<br />

<br />

( 5 k 8)<br />

<br />

<br />

b <br />

a<br />

2<br />

{ 12 3(<br />

a<br />

<br />

b ) ( b<br />

<br />

a )}<br />

24<br />

<br />

<br />

( k 9)<br />

b<br />

<br />

a<br />

2<br />

{ 12 3(<br />

a b<br />

) ( b<br />

<br />

a )}<br />

24<br />

<br />

( k 10)<br />

In the case <strong>of</strong> nonconforming connection between 1st<br />

order elements, 1st order connection (14) is adopted.<br />

Similarly, in the case between 2nd order elements, 2nd<br />

order connection (17) is adopted. When the<br />

nonconforming connection between same order elements<br />

is performed, the coarser side is adopted as the master<br />

side <strong>of</strong> the linear combination and the finer side is<br />

adopted as slave side.<br />

On the other hand, 1st order connection (14) is adopted<br />

as the connection order for mixed order analysis, in<br />

which 1st and 2nd order elements are connected. The<br />

reasons are follows: If the 2nd order connection (17) is<br />

applied to the interface for mixed order connection, the<br />

coefficients I9e and I10e to the edges on 1st order edge 1e-<br />

4e become zero. Therefore, the diagonal component<br />

related to the connection interface may be zero, and the<br />

difficulty for solving the algebraic equation causes.<br />

III. ANALYSIS MODEL<br />

Figure 2 shows a square coil model in order to verify<br />

the performance <strong>of</strong> various nonconforming connections.<br />

The current density is determined by the electric scalar<br />

potential to be I = 1000 AT in the current input surface.<br />

The meshes for the region including a square coil and the<br />

outer space are nonconforrmally connected. The<br />

nonconforming connection is performed at the three<br />

surfaces composed <strong>of</strong> 1st surface (x = 120, y: [0, 120], z:<br />

[0, 120]), 2nd surface (x: [0, 120], y = 120, z: [0, 120]),<br />

and 3rd surface (x: [0, 120], y: [0, 120], z = 120). The<br />

range <strong>of</strong> whole region is set to x: [0, 300], y: [0, 300], and<br />

z: [0, 300], and the all element shapes are the cube to<br />

remove the error caused by the element distortion.<br />

Figure 3 shows an open type MRI model [12], in which<br />

the main object is to compute the uniform magnetic flux<br />

distribution in the imaging region with high accuracy.<br />

The nonconforming connection is performed at the three<br />

- 284 - 15th IGTE Symposium 2012<br />

surfaces composed <strong>of</strong> 1st surface (x = 400, y: [0, 400], z:<br />

[0, 400]), 2nd surface (x: [0, 400], y = 400, z: [0, 400]),<br />

and 3rd surface (x: [0, 400], y: [0, 400], z = 400). The<br />

magnetic nonlinearity <strong>of</strong> SS400 is considered in the yoke<br />

and pole piece. The remanence <strong>of</strong> two facing magnets is<br />

set to 1.2 T, and the nonlinear magnetostatic analysis is<br />

performed by NR method.<br />

IV. NUMERICAL RESULTS<br />

A. Verification using square coil<br />

Figure 4 shows the two examples <strong>of</strong> finite element<br />

meshes. The element coefficient matrix is computed by<br />

Gaussian quadrature 3×3×3 points. The linear equation is<br />

stopped when the condition ||rk||2/||b||2 < cg is satisfied,<br />

where ||rk||2 and ||b||2 are 2-norm <strong>of</strong> the residual at the k-th<br />

iteration and right side vector in the algebraic equations<br />

and cg is set to 10 -6 . is the typical element size <strong>of</strong><br />

standard model, and h is the element size <strong>of</strong> target model.<br />

Therefore, (h/) 2 <strong>of</strong> standard model becomes 1.0 as<br />

shown in Figure 4 (a). On the other hand, h in the<br />

nonconforming case (hexa-1st + hexa-1st, hexa-2nd +<br />

hexa-2nd, and hexa-2nd + hexa-1st) is defined as the<br />

element size <strong>of</strong> the inner mesh including magnetizing<br />

winding.<br />

Figure 5 shows the convergence characteristics <strong>of</strong><br />

magnetic energy. All characteristics have an asymptotic<br />

behavior as h shortens. The W values in nonconforming<br />

case (hexa-1st + hexa-1st and hexa-2nd + hexa-2nd) at<br />

(h/) 2 = 1.0 is equivalent to those in conforming case.<br />

These nonconforming characteristics are slightly<br />

detached from conforming characteristics in the range<br />

(h/) 2 < 1.0.<br />

coil(I = 1000 AT)<br />

currentoutput<br />

y<br />

z<br />

Figure 2: Square coil model.<br />

y<br />

400 130 350 400<br />

imagingregion<br />

polepiece(SS400)<br />

z<br />

unit:[mm]<br />

direction<strong>of</strong>current<br />

currentinput<br />

40<br />

x<br />

Figure 3: Open type MRI model.<br />

unit:[mm]<br />

magnet<br />

(Br = 1.2 T)<br />

yoke(SS400)<br />

x


Comparing the nonconforming characteristic (hexa-2nd +<br />

hexa-2nd) with (hexa-2nd + hexa-1st), the (hexa-2nd +<br />

hexa-2nd) characteristic is superior to the result <strong>of</strong> (hexa-<br />

2nd + hexa-1st) from the viewpoint <strong>of</strong> asymptote to the<br />

behavior <strong>of</strong> conforming hexa-2nd.<br />

TABLE III shows the effect <strong>of</strong> the size ratio on the<br />

computational accuracy. shows the ratio <strong>of</strong> the element<br />

size in outer space mesh for the element size in inner<br />

region, for example, becomes 1.5 in Figure 4 (b). In<br />

nonconforming case, the mesh for outer space is<br />

subdivided on the condition that the mesh for inner<br />

region is fixed. The number <strong>of</strong> elements for 2nd order is<br />

set to one eighth <strong>of</strong> 1st order element. When gets<br />

larger, the relative error <strong>of</strong> W tends to be worse owing to<br />

being coarse size <strong>of</strong> outer space element. The results <strong>of</strong><br />

nonconforming connection have the tendency, in which<br />

the accuracy <strong>of</strong> the nonconforming connection using 2nd<br />

order element is superior to 1st order on the whole. The<br />

outerregion<br />

y<br />

currentoutput<br />

innerregion<br />

nonconf.<br />

boundary<br />

y<br />

outerregion<br />

currentoutput<br />

innerregion<br />

z<br />

(a)<br />

z<br />

x<br />

currentinput<br />

x<br />

currentinput<br />

(b)<br />

Figure 4: Finite element meshes <strong>of</strong> a square coil model.<br />

(a) conforming (h/) 2 = 1.0 and (b) nonconforming (h/) 2<br />

= 0.444.<br />

- 285 - 15th IGTE Symposium 2012<br />

elapsed time using (nonconf. 2nd + 1st, = 2.0) is the<br />

shortest among the nonconforming results, in which the<br />

relative error <strong>of</strong> W is less than 0.1 %. Whereas the<br />

accuracy <strong>of</strong> W using (nonconf. 2nd + 2nd, = 2.0) is the<br />

best among the above mentioned nonconforming types,<br />

the elapsed time approximately quintuples against the<br />

case <strong>of</strong> (nonconf. 2nd + 1st, = 2.0). Hence, it is shown<br />

that the enough accuracy is provided by the mesh type<br />

(nonconf. 2nd + 1st, = 2.0) from the viewpoint <strong>of</strong><br />

practical analysis.<br />

Figure 6 shows the z-component <strong>of</strong> magnetic flux<br />

density on z-axis. The all characteristics coincide with the<br />

standard characteristic, and the relative error between<br />

(nonconf. hexa-2nd + hexa-1st) and (conf. hexa-2nd) is<br />

less than 0.05 %. The accuracy <strong>of</strong> magnetic flux in local<br />

area as well as that <strong>of</strong> magnetic energy is retained even in<br />

W [J]<br />

nonconf.hexa2nd+hexa2nd<br />

nonconf.hexa2nd+hexa1st<br />

0.0512<br />

0.0508<br />

0.0504<br />

stand.<br />

conf.hexa2nd<br />

conf.hexa1st<br />

0.0500<br />

0.0 0 0.2 0.4 0.6 0.8 1.0<br />

(h /) 2<br />

nonconf.<br />

hexa1st+hexa1st<br />

Figure 5: Convergence characteristics <strong>of</strong> magnetic<br />

energy.<br />

nonconf.boundary<br />

0.012 12.0<br />

inner outer<br />

B z [mT]<br />

0.010 10.0<br />

0.008 8.0<br />

0.006 6.0<br />

0.004 4.0<br />

0.002 2.0<br />

0.00 0 0.04 40 0.08 80 0.12 120<br />

z [mm]<br />

TABLE III<br />

ANALYZED RESULTS OF SQUARE COIL MODEL<br />

nonconf.hexa2nd+hexa2nd<br />

nonconf.hexa1st+hexa1st<br />

conf.hexa2nd conf.hexa1st<br />

7.72<br />

B z [mT]<br />

7.70<br />

stand.<br />

nonconf.<br />

hexa2nd+hexa1st<br />

7.68<br />

59.8 60.0 60.2<br />

z [mm]<br />

Figure 6: Distributions <strong>of</strong> Bz on z-axis <strong>of</strong> square coil<br />

model.<br />

mesh type<br />

inner<br />

NoE<br />

outer total<br />

size ratio DoF nonzero<br />

time for<br />

global matrix [s]<br />

ICCG<br />

ite.<br />

time for<br />

ICCG [s]<br />

W [mJ]<br />

relative error (%) <strong>of</strong> W<br />

vs. conf. 2nd (stand.)<br />

conf. 2nd (stand.) 37,044 1,120,581 1,157,625 1.0 13,759,410 576,830,675 224.2 *<br />

466 1042.3 *<br />

<br />

51.0553 0<br />

conf. 1st<br />

1,672,704 1,728,000 1.0 5,126,520 86,040,332 67.5 147 67.1 50.9912 0.125<br />

nonconf. 1st + 1st<br />

55,296<br />

209,088<br />

26,136<br />

264,384<br />

81,432<br />

2.0<br />

4.0<br />

775,368<br />

236,424<br />

12,873,180<br />

3,901,596<br />

10.7<br />

3.5<br />

96<br />

59<br />

6.8<br />

1.4<br />

50.9840<br />

50.9548<br />

0.140<br />

0.197<br />

3,267 58,788 8.0 170,289 2,819,055 2.6 49 0.9 50.8334 0.435<br />

conf. 2nd<br />

209,088 216,000 1.0 2,548,920 105,755,060 51.4 263 142.4 51.0505 0.009<br />

nonconf. 2nd + 2nd<br />

26,136<br />

3,267<br />

33,048<br />

10,179<br />

2.0<br />

4.0<br />

383,256<br />

115,971<br />

15,585,392<br />

4,574,483<br />

8.2<br />

2.7<br />

141<br />

98<br />

12.5<br />

3.6<br />

51.0505<br />

51.0179<br />

0.009<br />

0.073<br />

6,912 1,672,704 1,679,615 0.5 5,037,828 86,450,712 67.7 214 96.7 51.0477 0.015<br />

nonconf. 2nd + 1st<br />

209,088<br />

26,136<br />

216,000<br />

33,048<br />

1.0<br />

2.0<br />

693,576<br />

154,632<br />

13,378,746<br />

4,398,882<br />

10.2<br />

2.9<br />

114<br />

84<br />

8.4<br />

2.3<br />

51.0408<br />

51.0117<br />

0.028<br />

0.085<br />

3,267 10,179 4.0 88,497 3,312,651 2.0 84 2.4 50.8903 0.323<br />

CPU: Intel Core i7-2620M 2.7 GHz & 16 GB<br />

CPU * : Intel Core i7-3930K 4.2 GHz with over-clocked & 32 GB


outer region<br />

imaging region<br />

(inner region)<br />

y<br />

outer region<br />

imaging region<br />

(inner region)<br />

y<br />

outer region<br />

hexa-1st<br />

imaging region<br />

(inner region)<br />

hexa-1st<br />

y<br />

outer region<br />

hexa-1st<br />

imaging region<br />

(inner region)<br />

hexa-2nd<br />

y<br />

z<br />

r<br />

(a)<br />

z<br />

r<br />

(b)<br />

z<br />

r<br />

(c)<br />

z<br />

r<br />

(d)<br />

Figure 7: Finite element meshes <strong>of</strong> open type MRI<br />

model. (a) conforming (hexa-1st), (b) conforming (hexa-<br />

2nd isoparametric), (c) nonconforming (hexa-1st + hexa-<br />

1st), and (d) nonconforming (hexa-2nd + hexa-1st).<br />

B z [T]<br />

-0.20<br />

nonconf.boundary<br />

inner outer<br />

-0.22<br />

-0.24<br />

-0.26<br />

-0.28<br />

-0.30<br />

0.0 0.2 0.4 0.6 0.8<br />

r [m]<br />

x<br />

x<br />

x<br />

x<br />

- 286 - 15th IGTE Symposium 2012<br />

the nonconforming connection.<br />

B. Application <strong>of</strong> mixed order finite element analysis to<br />

open type MRI model<br />

This subsection shows the effectiveness <strong>of</strong> mixed order<br />

finite element analysis with nonconforming connection in<br />

the open type MRI model shown in Figure 3. The<br />

convergence criterion cg <strong>of</strong> ICCG method is set to 10 -3 ,<br />

and when the maximum correction <strong>of</strong> magnetic flux<br />

density is to be 10 -3 T, NR iteration is stopped.<br />

Figure 7 shows the finite element meshes for open type<br />

MRI model. Figure 7 (a), (b), (c), and (d) show the mesh<br />

for conforming hexa-1st, conforming hexa-2nd<br />

isoparametric, nonconforming (hexa-1st + hexa-1st), and<br />

nonconforming (hexa-2nd + hexa-2nd), respectively. The<br />

nonconforming mesh connection is performed at the<br />

interface between imaging region and other region in<br />

order to reduce the number <strong>of</strong> elements with keeping the<br />

accuracy <strong>of</strong> magnetic flux density in imaging region.<br />

Number <strong>of</strong> elements in conforming hexa-2nd (b) is one<br />

eighth size in conforming hexa-1st (a). The mesh for<br />

imaging region <strong>of</strong> (c) is exactly the same as that <strong>of</strong> (a),<br />

and the mesh for outer region <strong>of</strong> (c) and (d) is completely<br />

the same as that <strong>of</strong> (b).<br />

Figure 8 shows the distributions <strong>of</strong> z-direction<br />

magnetic flux density Bz on 45° direction r-axis which is<br />

located on the surface (x, y) = (0, 0) shown in Figure 7.<br />

There are some noise spikes in the characteristics <strong>of</strong><br />

(conf. hexa-1st), (nonconf. hexa-1st + hexa-1st), and<br />

(nonconf. hexa-2nd + hexa-1st). The generation <strong>of</strong> noise<br />

is likely to be caused by the element distortion <strong>of</strong> 1st<br />

order hexahedral elements. The distributions <strong>of</strong> 1st order<br />

discretization have the concave and convex owing to the<br />

interpolation <strong>of</strong> inner flux using edge-shape function.<br />

The mixed order characteristic <strong>of</strong> (nonconf. hexa-2nd +<br />

hexa-1st) seems to be combined two properties, in which<br />

the property <strong>of</strong> 2nd order is confirmed in the inner region<br />

(imaged region) and 1st order property appeared in the<br />

outer region.<br />

TABLE IV shows the analysis results for open type<br />

MRI model. Even in MRI model, the DoF <strong>of</strong> (conf. 2nd)<br />

is a half <strong>of</strong> (conf. 1st); nevertheless, the elapsed time <strong>of</strong><br />

(conf. 2nd) is longer than that <strong>of</strong> (conf. 1st). There is a<br />

possibility that the condition number <strong>of</strong> the global matrix<br />

B z [T]<br />

-0.26<br />

-0.27<br />

nonconf.hexa1st+hexa1st<br />

nonconf.hexa2nd+hexa1st<br />

conf.hexa2nd<br />

stand.<br />

conf.hexa1st<br />

-0.28<br />

0.50 0.62<br />

r [m]<br />

Figure 8: Distributions <strong>of</strong> Bz on r-axis in open type MRI model.


derived from 2nd order get worse than that <strong>of</strong> 1st order.<br />

On the other hand, the elapsed time <strong>of</strong> (nonconf. 2nd +<br />

1st) is the almost same as that <strong>of</strong> (nonconf. 1st + 1st).<br />

Furthermore, the iteration number for NR method in<br />

mixed order analysis is quite same as other mesh type.<br />

V. CONCLUSION<br />

We proposed a mixed order finite element method<br />

using nonconforming mesh connection technique. The<br />

obtained results are summarized as follows:<br />

1. We propose the nonconforming mesh connection<br />

between 2nd order elements supported by the linear<br />

combination, in which the coefficient for 2nd order<br />

connection can be derived from the line integral.<br />

2. The performances <strong>of</strong> the nonconforming connection<br />

for 2nd order elements and mixed order elements are<br />

verified using a square coil model. The accuracy <strong>of</strong> the<br />

nonconforming connection including the 2nd order<br />

discretization is superior to that <strong>of</strong> the meshes<br />

discretized by only 1st order element.<br />

3. The accuracy <strong>of</strong> mixed order analysis has the good<br />

agreement with that <strong>of</strong> conforming 2nd order<br />

discretized mesh and the mesh with 2nd order<br />

nonconforming connection.<br />

4. Mixed order analysis has superiority than the<br />

conventional conformal mesh from a point <strong>of</strong> view <strong>of</strong><br />

the elapsed time.<br />

Finally, we will investigate the effectiveness <strong>of</strong> mixed<br />

order edge-based finite element analysis including the<br />

distorted finite elements as a future works.<br />

ACKNOWLEDGMENT<br />

The authors would like to thank Mr. Y. Tominaga for<br />

helpful support. This work was supported by Japan<br />

Society for Promotion <strong>of</strong> Science (JSPS) Grant-in-Aid<br />

for Young Scientists (B) (Grant Number: 23760252).<br />

REFERENCES<br />

[1] M. Hano, T. Miyamura, and M. Hotta, “Fast and high-accuracy<br />

finite-element electromagnetic analysis by mixed-order vector<br />

elements,” The Papers <strong>of</strong> Joint Technical Meeting on Static<br />

Apparatus, SA-02-14, RM-02-14, pp. 13-18, Jan. 2002. (in<br />

Japanese)<br />

[2] P. Houston, I. Perugia, and D. Schötzau, “Nonconforming mixed<br />

finite-element approximations to time-harmonic eddy current<br />

problems,” IEEE Trans. Magn., Vol. 40, No. 2, pp. 1268-1273, Feb.<br />

2004.<br />

[3] D. Rodger, H. C. Lai, and P. J. Leonard, “Coupled elements for<br />

problems involving movement,” IEEE Trans. Magn., Vol. 26, No.<br />

2, pp. 548-550, Feb. 1990.<br />

- 287 - 15th IGTE Symposium 2012<br />

TABLE IV<br />

ANALYSIS RESULTS OF OPEN TYPE MRI MODEL<br />

mesh type<br />

inner region<br />

NoE<br />

outer space total<br />

DoF nonzero NR ite.<br />

time for<br />

global matrix [s]<br />

total<br />

ICCG ite.<br />

time for<br />

ICCG [s]<br />

conf. 2nd (stand.)<br />

conf. 1st 8,000<br />

556,528 564,528<br />

6,680,172<br />

1,662,234<br />

278,902,794<br />

27,727,823<br />

<br />

7<br />

1035.0<br />

171.7<br />

2,254<br />

1,714<br />

2268.6<br />

262.6<br />

nonconf. 1st + 1st<br />

223,699 3,663,713 7 23.3 617 14.2<br />

conf. 2nd<br />

nonconf. 2nd + 1st<br />

1,000<br />

69,566 77,566 823,308<br />

212,099<br />

33,753,786<br />

3,709,345<br />

7<br />

7<br />

127.5<br />

23.3<br />

315<br />

624<br />

526.3<br />

14.9<br />

CPU: Intel Core i7-2620M 2.7 GHz & 16 GB<br />

[4] E. Lange, F. Henrotte, and K. Hameyer, “A variational<br />

formulation for nonconforming sliding interfaces in finite element<br />

analysis <strong>of</strong> electric machines,” IEEE Trans. Magn., Vol. 46, No. 8,<br />

pp. 2755-2758, Aug. 2010.<br />

[5] C. Golovanov, J.-L. Coulomb, Y. Marechal, and G. Meunier, “3D<br />

mesh connection techniques applied to movement simulation,”<br />

IEEE Trans. Magn., Vol. 28, No. 2, pp. 3359-3362, Feb. 1992.<br />

[6] H. Kometani, S. Sakabe, and A. Kameari, “3-D analysis <strong>of</strong><br />

induction motor with skewed slots using regular coupling mesh,”<br />

IEEE Trans. Magn., Vol. 36, No. 4, pp. 1769-1773, Apr. 2000.<br />

[7] K. Muramatsu, Y. Yokoyama, N. Takahashi, A. Nafalski, and Ö.<br />

Göl, “Effect <strong>of</strong> continuity <strong>of</strong> potential on accuracy in magnetic field<br />

analysis using nonconforming mesh,” IEEE Trans. Magn., Vol. 36,<br />

No. 4, pp. 1578-1582, Apr. 2000.<br />

[8] Y. Okamoto, R. Himeno, K. Ushida, A. Ahagon, and K. Fujiwara,<br />

“Dielectric heating analysis method with accurate rotational motion<br />

<strong>of</strong> stirrer fan using nonconforming mesh connection,” IEEE Trans.<br />

Magn., Vol. 44, No. 6, pp. 806-809, Jun. 2008.<br />

[9] Y. Okamoto, K. Fujiwara, and R. Himeno, “Exact minimization <strong>of</strong><br />

energy functional for NR method with line-search technique,” IEEE<br />

Trans. Magn., Vol. 45, No. 3, pp. 1288-1291, Mar. 2009.<br />

[10] K. Fujiwara, T. Nakata, and H. Fusayasu, “Acceleration <strong>of</strong><br />

convergence characteristic <strong>of</strong> the ICCG method,” IEEE Trans.<br />

Magn., Vol. 29, No. 2, pp. 1958-1961, Mar. 1993.<br />

[11] A. Kameari, “Calculation <strong>of</strong> transient 3D eddy current using edgeelements,”<br />

IEEE Trans. Magn., Vol. 26, No. 2, pp. 466-469, Mar.<br />

1990.<br />

[12] C. Lee and K. Miyata, “Large-scale magnetic field analysis on<br />

MRI with hysteresis,” The papers <strong>of</strong> Joint Technical Meeting on<br />

Static Apparatus and Rotating Machinery, SA-06-20, RM-06-20,<br />

pp. 25-30, Jan. 2005. (in Japanese)


- 288 - 15th IGTE Symposium 2012<br />

Topology Optimization Using Parallel Search<br />

Strategy for Magnetic Devices<br />

1 Takumi Nagano, 1 Shogo Yasukawa, 1 Shinji Wakao, and 2 Yoshihumi Okamoto<br />

1 Waseda <strong>University</strong>, 3-4-1, Okubo, Shinjuku, Tokyo 169-8555, Japan<br />

2 Utsunomiya <strong>University</strong>, 7-1-2, Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />

E-mail: wakao@waseda.jp<br />

Abstract— In this paper, we propose a topology optimization method using parallel search strategy for magnetic devices.<br />

Here, we use the gradient method to minimize an object function from the viewpoint <strong>of</strong> convergence speed, i.e., steepest<br />

descent method. By applying parallel computing, we will carry out calculations simultaneously for some patterns <strong>of</strong> initial<br />

variables. With these calculation results, new patterns <strong>of</strong> initial variables are efficiently created for restarting new search<br />

processes, which results in the better topologies than previous ones. Compared with the conventional method, the proposed<br />

search method enables us to decrease the whole CPU time with keeping the optimization quality.<br />

Index terms— density method, parallel computing, topology optimization.<br />

I. INTRODUCTION<br />

Structural optimization is categorized into three types<br />

<strong>of</strong> problems, i.e., size optimization, shape optimization,<br />

and topology optimization. Topology optimization is a<br />

useful method especially in terms <strong>of</strong> weight saving. And<br />

there is possibility <strong>of</strong> discovering a new topology<br />

unthinkable by conventional methods. However, the<br />

computational load <strong>of</strong> topology optimization will be<br />

heavier than that <strong>of</strong> other optimization, because <strong>of</strong> the<br />

large search space. In this paper, we applied “density<br />

method with gradient method” to make the optimization<br />

process more efficient. In density method, topology <strong>of</strong> the<br />

target is expressed with the density value <strong>of</strong> elements<br />

which consist <strong>of</strong> the design domain [1]. And gradient<br />

method has good convergence speed as the search<br />

method. However, the result <strong>of</strong> minimization with<br />

gradient method will be mostly local minimum<br />

depending on initial variables. Therefore this paper<br />

proposes an efficient topology optimization method based<br />

on parallel computing for magnetic devices. In the<br />

proposed method, we can efficiently escape local<br />

minimums by simultaneous searches from various initial<br />

variables and creating new initial variables with the<br />

results <strong>of</strong> parallel optimization.<br />

II. PROPOSED METHOD<br />

A. Density method based on sensitivity analysis<br />

In density method, topology <strong>of</strong> the target is expressed<br />

with the density value <strong>of</strong> elements. In this paper, the<br />

target is magnetic circuit. The magnetic permeability i<br />

<strong>of</strong> ith element can be formulated as<br />

2<br />

{ 1<br />

1<br />

(1)<br />

},<br />

i 0 r i<br />

where i is the density value <strong>of</strong> ith element, and r is<br />

relative permeability <strong>of</strong> the material. In this paper, r is<br />

1000, and the property <strong>of</strong> magnetic system is regarded as<br />

linear one. In density method, the number <strong>of</strong> elements in<br />

the design domain should be large to express the detail <strong>of</strong><br />

topologies. When density method is combined with<br />

gradient method, the first derivative <strong>of</strong> object function<br />

with respect to density value (sensitivity) <strong>of</strong> each element<br />

in the design domain needs to be calculated. As the<br />

efficient sensitivity analysis, “adjoint variable method” is<br />

applied [2]. The algebraic equation <strong>of</strong> FEM can be<br />

formulated as<br />

HA G,<br />

(2)<br />

where H is whole coefficient matrix, A is unknown<br />

magnetic vector potential, and G is right side vector.<br />

(3) is obtained by differential calculus <strong>of</strong> (2) with respect<br />

to density vector .<br />

<br />

A<br />

G<br />

H<br />

H = A.<br />

(3)<br />

ρ<br />

ρ<br />

ρ<br />

(4) is obtained by multiplying H -1 to (3).<br />

<br />

A 1<br />

G<br />

H<br />

~ <br />

= H<br />

<br />

A<br />

<br />

,<br />

(4)<br />

i<br />

i<br />

i<br />

<br />

where A ~ is the solution <strong>of</strong> (2).<br />

Sensitivity <strong>of</strong> the object function W is obtained as<br />

T<br />

dW W W<br />

A<br />

= . (5)<br />

d<br />

<br />

A<br />

<br />

i<br />

i<br />

In this paper, W is defined with the value <strong>of</strong> magnetic<br />

flux density vector. Therefore, the 1 st term <strong>of</strong> (5) is<br />

invisible. By substituting (4) into (5), (6) is obtained.<br />

T<br />

W W<br />

1<br />

G<br />

H<br />

~ T G<br />

H<br />

~ <br />

H <br />

A<br />

λ <br />

A<br />

.<br />

(6)<br />

i<br />

A<br />

i<br />

i<br />

i<br />

i<br />

<br />

As the property <strong>of</strong> FEM matrix, H is symmetric. Taking<br />

account <strong>of</strong> this point, (7) is obtained by transforming (6).<br />

Hλ.<br />

A <br />

W<br />

(7) <br />

The is called “adjoint variable”. is obtained by<br />

solving (7). Finally, the whole sensitivity vector is<br />

obtained by substituting into (6).<br />

B. Basic concept <strong>of</strong> proposed method<br />

Here, we carried out parallel computing based on Open<br />

MP for topology optimization by using a computer with 8<br />

cores. The basic concept <strong>of</strong> proposed method is shown in<br />

figure 1.<br />

i


Figure 1: Basic concept <strong>of</strong> proposed method.<br />

The proposed method is based on gradient method, where<br />

the solution depends on initial values. Therefore, we<br />

regard the data set <strong>of</strong> initial topology, optimized<br />

topology, and object function value as the properties <strong>of</strong><br />

one “individual”.<br />

First, we prepare 8 individuals i.e., the number <strong>of</strong><br />

cores, by creating random initial topologies and<br />

computing their optimized topologies. Next, a new initial<br />

topology is created with property information <strong>of</strong> 2<br />

selected individuals. The process <strong>of</strong> creating a new<br />

topology with 2 individuals’ information is named<br />

“intercross”. We prepare new 8 initial topologies for the<br />

next generation, and obtain the data sets by optimizing<br />

new initial topologies. The better solution will be<br />

obtained by repeating the above cycle.<br />

In the next section, we explain how to create initial<br />

topologies in the next generation with 2 individuals.<br />

C. How to select 2 individuals for intercross<br />

Three manners <strong>of</strong> selection <strong>of</strong> 2 individuals for<br />

intercross are proposed.<br />

a) We combined 2 individuals with superior object<br />

function value to give properties <strong>of</strong> superior individuals<br />

to the next generation. In this paper, the combinations <strong>of</strong><br />

individuals in the top 4 from the viewpoint <strong>of</strong> object<br />

function superiority are selected for intercross. We create<br />

3 initial topologies in the next generations based on this<br />

selection.<br />

b) We combined 2 individuals with different optimized<br />

topologies. The subject <strong>of</strong> the selection is to make<br />

diversity for intercross. The difference between 2<br />

topologies is evaluated in the following rule.<br />

As shown in figure 2, design domain is separated into<br />

some areas. Next, we define a reference value in the i th<br />

area with the density values as<br />

area<br />

i<br />

<br />

Ni<br />

<br />

2<br />

j<br />

j (8)<br />

.<br />

N<br />

Ni is the number <strong>of</strong> elements in the i th area. We define a<br />

vector C as in (9).<br />

i<br />

C area , area ,..., area ). (9)<br />

( 1 2<br />

8<br />

- 289 - 15th IGTE Symposium 2012<br />

Figure 2: Separation <strong>of</strong> design domain.<br />

The vector C expresses the character <strong>of</strong> the optimized<br />

topology. The characteristic difference <strong>of</strong> 2 topologies A<br />

and B, is evaluated as (10).<br />

2<br />

B<br />

Diff C C . (10)<br />

AB<br />

The combination <strong>of</strong> 2 individuals with larger value <strong>of</strong><br />

(10) is selected for intercross. We create 3 initial<br />

topologies in the next generations based on this selection.<br />

c) We combined 2 individuals chosen randomly. The<br />

subject <strong>of</strong> the selection is also to make diversity <strong>of</strong><br />

intercross. We create 2 initial topologies in the next<br />

generations based on this selection.<br />

8 initial topologies in the next generation are created<br />

based on the selections a)-c).<br />

D. How to make new initial topologies<br />

Here, 2 individuals selected for intercross are named as<br />

IA and IB, and new individual is named as IC. The initial<br />

topology <strong>of</strong> IC is created in the following three kinds <strong>of</strong><br />

manners, which are adopted randomly.<br />

a) The density values <strong>of</strong> IC’s initial topology are created<br />

as the weighted mean <strong>of</strong> those <strong>of</strong> initial topologies <strong>of</strong> IA<br />

and IB.<br />

b) The density values <strong>of</strong> IC’s initial topology are created<br />

as the weighted mean <strong>of</strong> those <strong>of</strong> optimized topologies <strong>of</strong><br />

IA and IB.<br />

To inherit the properties <strong>of</strong> superior individuals to the<br />

next generation, the weight coefficients <strong>of</strong> IA and IB are<br />

formulated as<br />

,<br />

Ci<br />

Ai<br />

W<br />

<br />

W<br />

3<br />

A<br />

3<br />

3<br />

A WB<br />

,<br />

Bi<br />

A<br />

W<br />

<br />

W<br />

3<br />

B<br />

3<br />

3<br />

A WB<br />

(11)<br />

,<br />

where, WA and WB are object function values <strong>of</strong> IA and IB,<br />

and i stands for the element number.<br />

c) The density values <strong>of</strong> IC’s initial topology are created<br />

as the multiplication <strong>of</strong> initial topology <strong>of</strong> IA and<br />

optimized topology <strong>of</strong> IB.<br />

. (12)<br />

Ci<br />

Ai<br />

Bi


The distribution <strong>of</strong> density value i <strong>of</strong> 0 in optimized<br />

topology will be strongly inherited to the next generation.<br />

The probabilities <strong>of</strong> occurrence <strong>of</strong> a),b), and c) are 3/7,<br />

3/7, 1/7 respectively.<br />

III. NUMERICAL EXAMPLE<br />

A. C-shaped iron core model<br />

To demonstrate the validity <strong>of</strong> the proposed method, we<br />

carried out the optimization <strong>of</strong> the model shown in figure<br />

3.<br />

Figure 3: Magnetic force model. (unit : mm)<br />

The main subject <strong>of</strong> this optimization problem is to<br />

maximize the electromagnetic force generated in the<br />

magnetic bar lying in the right side <strong>of</strong> design domain. The<br />

density values <strong>of</strong> elements in the design domain are<br />

design variables <strong>of</strong> this optimization [3]. To evaluate the<br />

electromagnetic force, we adopt the following equation as<br />

the object function <strong>of</strong> this optimization,<br />

1<br />

2<br />

By<br />

W , (13)<br />

where, By is the y component <strong>of</strong> magnetic flux density<br />

vector generated in the target element. This model is<br />

discretized by triangular elements. The numbers <strong>of</strong><br />

elements, in the design domain and whole domain, are<br />

1,575 and 5,576 respectively.<br />

Next, the detail <strong>of</strong> the proposed method is explained.<br />

Initial topologies <strong>of</strong> the 1 st generation are created as<br />

random values i which range from 0 to 1. The initial<br />

magnetic permeability distribution is determined as to<br />

(1). The best object function value before the n th<br />

generation is Wbest, and the best object function value in<br />

the n+1 th generation is Wn+1. If Wn+1 Wbest , we update<br />

the value <strong>of</strong> Wbest as Wn+1. If the update doesn’t occur over<br />

10 generations, the calculation is terminated.<br />

- 290 - 15th IGTE Symposium 2012<br />

Figure 4: Flow chart <strong>of</strong> proposed method.<br />

Now, for the comparison with the proposed method, the<br />

optimization without intercross is also applied to this<br />

model. In the method, the initial topologies in whole<br />

generation are created as randomly as that <strong>of</strong> the 1 st<br />

generation. This method is named “random method”.<br />

2 pattern <strong>of</strong> initial topologies are prepared as those <strong>of</strong> in<br />

the 1 st generation, which are named as ITA and ITB. An<br />

example <strong>of</strong> initial topology in 1 st generation is shown in<br />

figure 5.<br />

Figure 5: Density distribution example <strong>of</strong> initial topology.


Wbest<br />

best object funciton value<br />

68<br />

67.5<br />

67<br />

66.5<br />

66<br />

65.5<br />

65<br />

64.5<br />

64<br />

63.5<br />

Initial<br />

pattern<br />

IT A<br />

IT B<br />

random method (ITA) proposed method<br />

random method (ITB) proposed method (ITB) 0 10 20<br />

generations<br />

30 40<br />

Figure 6: Convergence characteristic <strong>of</strong> each method.<br />

TABLE I<br />

THE OPTIMIZATION RESULTS OF BOTH METHODS.<br />

method<br />

total<br />

generations<br />

Wbest<br />

(IT A)<br />

(a) The obtained topology with random method<br />

from initial pattern ITA.<br />

CPU<br />

time(sec.)<br />

Random 20 65.0926 764.56<br />

Proposed 34 64.0327 347.91<br />

Random 25 64.2958 954.66<br />

Proposed 17 63.9976 229.98<br />

- 291 - 15th IGTE Symposium 2012<br />

Figure 7: Comparison <strong>of</strong> CPU time.<br />

We carry out the optimizations with these topologies by<br />

the proposed method and the random method. The<br />

convergence characteristics <strong>of</strong> both methods are shown in<br />

figure 6. In random method, the improvement <strong>of</strong> object<br />

function value is obviously inefficient, and the<br />

computational result is in the local minimum. By<br />

contrast, the proposed method enables us to improve the<br />

object function value more efficiently, and to achieve<br />

better solution than that <strong>of</strong> the random method. The<br />

object function values Wbest <strong>of</strong> both methods are shown in<br />

table 1.<br />

(b) The obtained topology with proposed method<br />

from initial pattern ITA.<br />

(c) The obtained topology with random method<br />

(d) The obtained topology with proposed method<br />

from initial pattern ITB. Figure 8: Optimization result <strong>of</strong> each method.<br />

from initial pattern ITB.<br />

frequency<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

proposed method<br />

random method<br />

2 6 10 14 18 22 26 30 34 38 42 46<br />

calculation time[CPU sec.]


The frequency distribution <strong>of</strong> calculation time for each<br />

individual is shown in Figure 7. Figure 7 indicates that<br />

the calculation time for each individual is sufficiently<br />

reduced in the proposed method because the starting<br />

points for optimization, i.e., the initial topologies, are<br />

effectively created near stationary points. We can<br />

estimate the total CPU time by multiplying the<br />

calculation time for individuals in one generation by the<br />

required number <strong>of</strong> generations. As the results, the CPU<br />

time <strong>of</strong> the proposed method is much less than that <strong>of</strong> the<br />

random method in spite <strong>of</strong> their total generations required<br />

for convergence as shown in table I.<br />

The obtained topologies by both methods are shown in<br />

Figure 8. Black elements correspond to the magnetic<br />

material with i = 1, and white elements to air elements<br />

with i = 0. Gray elements have an intermediate property<br />

between air and magnetic material with 0 < i < 1 which<br />

are named as “gray scale”. The topologies obtained by<br />

the random method strongly depend on their initial<br />

variables and contain many gray scales. On the contrary,<br />

we can obtain the similar topologies without gray scales<br />

by using the proposed method.<br />

B. Magnetic shielding model<br />

Proposed method is applied to the optimization <strong>of</strong> the<br />

shield model shown in Figure 9.<br />

y<br />

x<br />

Figure 9: shield model. (unit : mm)<br />

The main subject <strong>of</strong> the optimization problem is to<br />

minimize the magnetic flux entering into target domain<br />

generated by 2 surrounding coils. The density values <strong>of</strong><br />

elements in the design domain are design variables <strong>of</strong> the<br />

optimization. The object function is defined as (14).<br />

W <br />

2<br />

Bx<br />

<br />

t arg et<br />

domain<br />

2<br />

By<br />

.<br />

(14)<br />

- 292 - 15th IGTE Symposium 2012<br />

This model is discretized by triangular elements. The<br />

number <strong>of</strong> elements, in the design domain and the whole<br />

domain, are 3,800 and 12,106 respectively.<br />

The optimization results <strong>of</strong> the random and the proposed<br />

methods are shown in Figures 10-13.<br />

Wbest<br />

best object funtion value<br />

ferquency<br />

0.0012<br />

0.001<br />

0.0008<br />

0.0006<br />

0.0004<br />

0.0002<br />

0.45<br />

0.4<br />

0.35<br />

0.3<br />

0.25<br />

0.2<br />

0.15<br />

0.1<br />

0.05<br />

0<br />

random method<br />

proposed method<br />

0 20 40 60 80<br />

generations<br />

Figure 10: Convergence characteristic <strong>of</strong> object function.<br />

0<br />

Wbest = 4.0610 -4<br />

0<br />

24<br />

48<br />

72<br />

96<br />

120<br />

144<br />

168<br />

192<br />

216<br />

240<br />

264<br />

288<br />

312<br />

336<br />

360<br />

384<br />

408<br />

432<br />

calculation time (sec.)<br />

Wbest = 6.9010 -5<br />

random method<br />

proposed method<br />

Figure 11: Comparison <strong>of</strong> CPU time.<br />

Fig 12: Result <strong>of</strong> optimization. (random method)


Figure 13: Result <strong>of</strong> optimization. (proposed method)<br />

The computational results demonstrate the effectiveness<br />

<strong>of</strong> the proposed method compared with the random<br />

method as is the case with the magnetic force model.<br />

We can successfully obtain the multilayer structure as<br />

an effective topology <strong>of</strong> the shield as shown in Figure 13.<br />

IV. CONCLUSION<br />

In this paper, a topology optimization using parallel<br />

search strategy for magnetic devices is proposed to<br />

efficiently obtain more global solutions. In the proposed<br />

method, we effectively introduce the novel concept <strong>of</strong><br />

intercross into the density method with sensitivity<br />

analysis, which results in the CPU time reduction with<br />

keeping the optimization quality.<br />

We will investigate various algorithms <strong>of</strong> intercross,<br />

and apply the proposed method to more practical<br />

nonlinear problems as future works.<br />

REFERENCES<br />

[1] H. P. Mlejnek and R. Schirrmacher, "An engineer's approach to<br />

optimal material distribution and shape finding," Comput.<br />

Methods Appl. Mech. Eng., Vol. 106, pp. 1-26 (1993).<br />

[2] S.Gitosusastro, J.L.Coulomb and J.C. Sabonnadiere, "Performance<br />

derivative calculations and optimization process," IEEE Trans.<br />

Magn, Vol.25, No.4, pp. 2834-2839(1989)<br />

[3] Yoshihumi Okamoto, and Norio Takahashi, "Investigation <strong>of</strong><br />

Topology Optimization <strong>of</strong> Magnetic Circuit by Using Density<br />

Method", IEEJ Trans. IA, Vol.124, No.12, pp. 1228-1235(2004).<br />

[4] Jin-kyu Byun, Il-han Park, and Song-yop Hahn, "Topology<br />

optimization <strong>of</strong> electrostatic actuator using design sensitivity,"<br />

IEEE Trans. Magn. Vol.38, No. 2, pp. 1053-1056 (2002).<br />

[5] Jin-kyu Byun and Song-yop Hahn, "Application <strong>of</strong> topology<br />

optimization to electromagnetic system," International journal <strong>of</strong><br />

applied electromagnetics and mechanics, Vol. 13, No. 1-4, pp. 25-<br />

33 (2002).<br />

- 293 - 15th IGTE Symposium 2012


- 294 - 15th IGTE Symposium 2012


- 295 - 15th IGTE Symposium 2012


- 296 - 15th IGTE Symposium 2012


- 297 - 15th IGTE Symposium 2012


- 298 - 15th IGTE Symposium 2012


- 299 - 15th IGTE Symposium 2012


- 300 - 15th IGTE Symposium 2012<br />

Interaction Magnetic Force Calculation <strong>of</strong> Axial<br />

Passive Magnetic Bearing Using Magnetization<br />

Charges and Discretization Technique<br />

*Saša S. Ilić, *Ana N.Vučković and *Slavoljub Aleksić<br />

*<strong>University</strong> <strong>of</strong> Niš, Faculty <strong>of</strong> Electronic Engineering <strong>of</strong> Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia<br />

E-mail: ana.vuckovic@elfak.ni.ac.rs<br />

Abstract— The paper presents calculation <strong>of</strong> the force between two ring permanent magnets whose magnetization is axial.<br />

Such configuration corresponds to a passive magnetic bearing. The simple and fast analytical approach is used for this<br />

calculation based on magnetization charges and discretization technique. The results for interaction magnetic force obtained<br />

using proposed approach are compared with finite element method using FEMM 4.2 s<strong>of</strong>tware.<br />

Index Terms— Permanent magnet, interaction magnetic force, magnetization charges, discretization technique, Finite<br />

Element Method (FEM).<br />

charges for uniform magnetization do not exist. nˆ is the<br />

unit vector normal to surface.<br />

I. INTRODUCTION<br />

Permanent magnets are used nowadays in many<br />

applications, and the general need for dimensioning and<br />

optimizing leads to the development <strong>of</strong> calculation<br />

methods. Permanent magnets are commonly used in<br />

many electrical devices and their own quality depends on<br />

the magnet material, magnetization and dimensions. Two<br />

major kinds <strong>of</strong> applications can be identified: the ones<br />

which use block magnets and the ones which use<br />

cylindrical magnets. Block permanent magnets are easy<br />

to manufacture and to magnetize, and it’s easier to<br />

calculate magnetic field they create [1-3]. Indeed, most<br />

engineering applications need several ring permanent<br />

magnets and the determination <strong>of</strong> the magnetic force<br />

between them is thus required.<br />

Magnetic bearings are contactless suspension devices<br />

with various rotating and translational applications [4].<br />

Depending on the ring permanent magnet magnetization<br />

direction, the devices work as axial or radial bearings and<br />

thus control the position along an axis or the centering <strong>of</strong><br />

an axis. Knowledge <strong>of</strong> the interaction magnetic force is<br />

required to control devices reliably.<br />

There are numerous techniques for analyzing permanent<br />

magnet devices and different approaches for determining<br />

interaction forces between them [5-8]. Many authors<br />

are proposing simplified and robust formulations <strong>of</strong> the<br />

interaction forces created by permanent magnets. The<br />

authors generally use the Ampere's current model [9],[10]<br />

or the Columbian approach [11],[12]. Several application<br />

examples were previously presented in [13],[14] where<br />

levitation forces for magnetic bearings were calculated.<br />

II. THEORETICAL BACKGROUND<br />

Axial passive magnetic bearing [5] that is considered in<br />

the paper is presented in the Fig.1.<br />

Since the boundary condition for surface magnetization<br />

charges density has to be satisfied for both magnets,<br />

m ˆ<br />

1 n M1<br />

and m ˆ<br />

2 n M2<br />

, (1)<br />

it is obvious that fictitious surface magnetization charges<br />

[2] exist only on the bottom and the top bases <strong>of</strong> each<br />

permanent magnet, because volume magnetization<br />

Figure 1: Axial passive magnetic bearing.<br />

The simplest procedure for interaction magnetic force<br />

determination is to discretize each base <strong>of</strong> both<br />

permanent magnets into system <strong>of</strong> circular loops. The<br />

interaction force between two magnetized circular loops<br />

will be calculated first. That will be performed by<br />

calculating the magnetic field and magnetic flux density<br />

generated by the arbitrary magnetized circular loop <strong>of</strong> the<br />

upper magnet first and then the force that acts on the<br />

arbitrary loop <strong>of</strong> the lower magnet (Figure 2). Magnetic<br />

field <strong>of</strong> the upper loop will be determined by calculating<br />

the magnetic scalar potential. Using results for interaction<br />

magnetic force between two circular loops, magnetic<br />

force <strong>of</strong> the axial magnetic bearing can be obtained by<br />

summing the contribution <strong>of</strong> both magnet bases <strong>of</strong> lower<br />

and upper permanent magnets by using uniform<br />

discretization technique.<br />

The goal <strong>of</strong> this approach is to determine the interaction<br />

magnetic force between two circular loops uniformly<br />

loaded with magnetization charges Qm1 and Q m2<br />

.<br />

Dimensions and positions <strong>of</strong> the loops are presented in<br />

the Figure 2. For determining the interaction force<br />

between two circular loops, magnetic scalar potential,<br />

magnetic field and magnetic flux density generated by<br />

the upper loop will be calculated. Elementary magnetic<br />

scalar potential generated by the elementary point<br />

magnetization charge, dQ m1,is


Figure2: Two circular loops.<br />

dQm1<br />

1<br />

d m . (2)<br />

4<br />

R<br />

Qm1<br />

Qm1<br />

Since dQ<br />

m1<br />

Qm<br />

1 dl<br />

a d '<br />

d '<br />

,<br />

2a<br />

2<br />

elementary magnetic scalar potential has the following<br />

form<br />

Qm1<br />

1<br />

d m<br />

d '<br />

, (3)<br />

2<br />

8<br />

R<br />

and the resulting magnetic scalar potential generated by<br />

the upper circular loop at an arbitrary point P( r,<br />

,<br />

z)<br />

is<br />

2<br />

Qm1<br />

1<br />

m <br />

d '<br />

,<br />

2<br />

8<br />

2 2<br />

2<br />

0 r r0<br />

zz0 2r0r<br />

cos'<br />

(4)<br />

Considering the existing symmetry, in 0 plane,<br />

magnetic scalar potential has the following form<br />

<br />

Qm1<br />

1<br />

m ( r,<br />

z)<br />

<br />

d '.<br />

2<br />

4<br />

2 2<br />

2<br />

0 r r0<br />

zz0 2rr0<br />

cos '<br />

(5)<br />

Substituting θ' 2<br />

in Eq. (5), magnetic scalar<br />

potential is obtained as:<br />

2<br />

Qm1<br />

m ( r,<br />

z)<br />

<br />

2<br />

2<br />

<br />

0<br />

1<br />

2<br />

2<br />

2<br />

( r r0<br />

) 4rr0<br />

sin zz0 d <br />

. (6)<br />

After some simple operations the magnetic scalar<br />

potential can be given in the form:<br />

where<br />

m<br />

( r,<br />

z)<br />

<br />

Qm1<br />

2<br />

2<br />

2<br />

<br />

<br />

K<br />

, k <br />

2 <br />

2<br />

( r r0<br />

) <br />

2 z z<br />

<br />

1<br />

K , k K <br />

,<br />

2 <br />

2 2<br />

1<br />

k sin<br />

0<br />

0<br />

d<br />

<br />

- 301 - 15th IGTE Symposium 2012<br />

(7)<br />

is complete elliptic integral <strong>of</strong> the first kind with modulus<br />

2 4rr0<br />

k .<br />

2<br />

2<br />

( r r0<br />

) ( z z0<br />

)<br />

External magnetic field (magnetic field generated by the<br />

upper loop) at an arbitrary point can be determined as<br />

ext<br />

H ( r, z)<br />

grad<br />

m<br />

( r,<br />

z)<br />

Hr<br />

( r,<br />

z)<br />

rˆ<br />

H z ( r,<br />

z)<br />

zˆ<br />

,<br />

(8)<br />

External magnetic flux density is<br />

ext<br />

ext<br />

ext<br />

ext<br />

B ( r, z)<br />

H ( r,<br />

z)<br />

, (9)<br />

ext<br />

r<br />

ext<br />

r<br />

ext<br />

z<br />

with components<br />

ext<br />

Br <br />

<br />

<br />

<br />

2<br />

<br />

r<br />

<br />

<br />

2r<br />

and<br />

0<br />

B ( , z)<br />

B ( r,<br />

z)<br />

rˆ<br />

B ( r,<br />

z)<br />

zˆ<br />

. (10)<br />

( r,<br />

z)<br />

<br />

ext<br />

ext H<br />

( r,<br />

z)<br />

Br ( r,<br />

z)<br />

0<br />

, (11)<br />

r<br />

2 2<br />

2 <br />

rr0zz0 E,<br />

k <br />

2 <br />

2<br />

2<br />

2<br />

( r r ) zz ( r r ) zz 0<br />

( r r<br />

<br />

K<br />

, k <br />

2 <br />

0<br />

)<br />

B z<br />

0<br />

2<br />

Q<br />

m1<br />

2<br />

2<br />

<br />

<br />

0<br />

zz 0<br />

2<br />

<br />

<br />

<br />

<br />

,<br />

<br />

<br />

<br />

0<br />

ext<br />

0<br />

2<br />

<br />

(12)<br />

ext H<br />

( r,<br />

z)<br />

( r,<br />

z)<br />

0<br />

, (13)<br />

z<br />

ext Qm1<br />

Bz ( r,<br />

z)<br />

0<br />

<br />

2<br />

2<br />

<br />

zz0E, k <br />

2 <br />

2 2<br />

2<br />

2<br />

( r r ) z z ( r r ) z z<br />

0<br />

0<br />

2<br />

<br />

0<br />

0<br />

(14)<br />

<br />

2 2<br />

where E , k E 1<br />

k sin d ,<br />

2 <br />

0<br />

is complete elliptic integral <strong>of</strong> the second kind with<br />

modulus<br />

2 4rr0<br />

k <br />

2<br />

2<br />

( r r0<br />

) ( z z0<br />

)<br />

.<br />

The interaction magnetic force on elementary<br />

magnetization charge <strong>of</strong> lower circular loop<br />

Q<br />

m2<br />

Qm2<br />

dQm<br />

2 Qm2<br />

dl<br />

bd<br />

d<br />

is<br />

2b<br />

2<br />

ext<br />

d m2<br />

m m<br />

F dQ<br />

B ( r , z ) . (15)<br />

Finally, interaction magnetic force components can be<br />

expressed as:


Qm1Qm<br />

2<br />

Fr ( r,<br />

z)<br />

0<br />

<br />

2<br />

2<br />

<br />

<br />

<br />

<br />

2<br />

<br />

rm<br />

<br />

m 0 m 0 m 0<br />

<br />

<br />

K<br />

, k0<br />

<br />

2 <br />

2rm<br />

2<br />

( rm<br />

r0<br />

) m<br />

2 2<br />

2 <br />

rmr0zmz0 E,<br />

k0<br />

<br />

2 <br />

2<br />

2<br />

2<br />

( r r ) zz ( r r ) zz Qm1Qm<br />

2<br />

Fz ( r,<br />

z)<br />

0<br />

<br />

2<br />

2<br />

<br />

zmz0E, k0<br />

<br />

2 <br />

m<br />

zz 0<br />

2<br />

<br />

<br />

<br />

0<br />

2 <br />

0 <br />

<br />

<br />

2 2<br />

2<br />

2<br />

( rm<br />

r0<br />

) zm<br />

z0<br />

( rm<br />

r0<br />

) zm<br />

z0<br />

(16)<br />

- 302 - 15th IGTE Symposium 2012<br />

(17)<br />

Q<br />

( , ) m1Q<br />

F m 2<br />

z r z 0<br />

F ( r0,<br />

rm,<br />

z0,<br />

zm)<br />

2 z<br />

, (18)<br />

p<br />

2<br />

with elliptic integrals modulus<br />

2 4r0rm<br />

k0<br />

2<br />

2<br />

( rm<br />

r0<br />

) ( zi<br />

zm<br />

) <br />

.<br />

The axial component <strong>of</strong> the force (17) presents<br />

interaction force between two magnetized circular loops.<br />

The simplest procedure for levitation magnetic force<br />

determination is to discretize each bases <strong>of</strong> permanent<br />

magnets into system <strong>of</strong> circular loops, where N 1 is the<br />

number <strong>of</strong> discretized segments <strong>of</strong> each bases <strong>of</strong> upper<br />

permanent magnet and N 2 is the number <strong>of</strong> discretized<br />

segments <strong>of</strong> each bases <strong>of</strong> lower permanent magnet.<br />

Figure 3: Discretizing model.<br />

By taking into account the ring geometry <strong>of</strong> permanent<br />

magnets (Figure 3), the radius <strong>of</strong> each discretized<br />

segment <strong>of</strong> both bases <strong>of</strong> upper magnet is<br />

r n<br />

2n<br />

1<br />

a ( b a),<br />

n 1,<br />

2,<br />

,<br />

N1<br />

2N1<br />

, (19)<br />

and magnetization loop charges <strong>of</strong> upper permanent<br />

magnet bases are<br />

b a<br />

Qm n M12rn<br />

, n 1,<br />

2,...,<br />

N1<br />

N1<br />

. (20)<br />

For lower magnet bases the radius <strong>of</strong> each discretized<br />

segments is<br />

r i<br />

2i<br />

1<br />

C ( d c),<br />

i 1,<br />

2,<br />

,<br />

N2<br />

2N<br />

2<br />

. (21)<br />

Magnetization loop charges <strong>of</strong> lower permanent magnet<br />

bases are<br />

d c<br />

Qm i M 2 2ri , i 1,<br />

2,...,<br />

N2<br />

N2<br />

. (22)<br />

Using results for interaction magnetic force between<br />

two circular loops, Eqs. (17), the levitation magnetic<br />

force between two ring permanent magnets can be<br />

obtained. It can be achieved by summing the contribution<br />

<strong>of</strong> both magnet bases <strong>of</strong> lower and upper permanent<br />

magnets by using uniform discretization technique,<br />

<br />

<br />

1 2<br />

20<br />

M1M<br />

2<br />

Fz<br />

( b a)(<br />

d c)<br />

rnri<br />

<br />

N N <br />

<br />

F<br />

F<br />

zp<br />

zp<br />

1<br />

2<br />

( r , r , h,<br />

0)<br />

F<br />

n<br />

( r , r , h L , 0)<br />

F<br />

n<br />

i<br />

i<br />

1<br />

Lh zp<br />

n1<br />

i1<br />

( r , r , h,<br />

L ) <br />

n<br />

zp<br />

<br />

E<br />

, k3<br />

<br />

2 <br />

i<br />

N<br />

2<br />

N<br />

( rn<br />

, ri<br />

, h L1,<br />

L2<br />

)<br />

; (23)<br />

2<br />

0M<br />

1M<br />

2<br />

Fz<br />

<br />

( b a)(<br />

d c)<br />

<br />

N1N<br />

2<br />

<br />

<br />

N1<br />

N2<br />

hE<br />

, k1<br />

<br />

<br />

2 <br />

rn<br />

ri<br />

<br />

<br />

2 2<br />

2 2<br />

n1<br />

i1 <br />

<br />

( ri<br />

rn<br />

) h ( ri<br />

rn<br />

) h<br />

<br />

<br />

L2hE, k2<br />

<br />

2 <br />

<br />

2<br />

2<br />

2<br />

2<br />

( ri<br />

rn<br />

) L2h ( ri<br />

rn<br />

) L2h 2<br />

2<br />

2<br />

( r r ) Lh ( r r ) Lh i<br />

1<br />

LLh <br />

E<br />

, k4<br />

<br />

2 <br />

2<br />

2<br />

2<br />

( r r ) LLh ( r r ) LLh i<br />

n<br />

n<br />

where<br />

2 4rnri<br />

k1<br />

,<br />

2 2<br />

( ri<br />

rn<br />

) h<br />

2 2 4rnri<br />

k2<br />

,<br />

2<br />

ri<br />

rn<br />

L2<br />

h<br />

2 2 4rnri<br />

k3<br />

,<br />

2<br />

ri<br />

rn<br />

L1<br />

h<br />

2 2<br />

4rnri<br />

k4<br />

.<br />

2<br />

ri<br />

rn<br />

L2<br />

L1<br />

h<br />

2<br />

1<br />

2<br />

1<br />

1<br />

i<br />

n<br />

i<br />

n<br />

1<br />

2<br />

2<br />

1<br />

<br />

<br />

<br />

<br />

2 <br />

<br />

<br />

<br />

(24)<br />

III. NUMERICAL RESULTS<br />

We are working under presumption that the both ring<br />

permanent magnets are made <strong>of</strong> the same material and<br />

magnetized uniformly along their axis <strong>of</strong> symmetry, but<br />

in opposite direction, M1 M 2 M .


Distribution <strong>of</strong> magnetic flux density obtained using<br />

FEMM 4.2 s<strong>of</strong>tware [15] is presented in Fig. 6. The<br />

values <strong>of</strong> the geometrical parameters used in the<br />

numerical computation are: 2 1,<br />

L a , 2 2 L b<br />

c L2<br />

3,<br />

d / L2<br />

4,<br />

L1<br />

/ L2<br />

0.<br />

5,<br />

2 1.<br />

5,<br />

L h<br />

2 1mm<br />

L and kA/m 900 M .<br />

Convergence <strong>of</strong> the normalized interaction force,<br />

nor Fz<br />

Fz<br />

obtained using presented approach is<br />

2 2<br />

0M<br />

L2<br />

,<br />

given in Table I for magnetic bearing dimensions:<br />

a L2<br />

1, b L2<br />

2,<br />

2 3,<br />

L c , 4 2 L d , 5 . 0 / L1<br />

L2<br />

<br />

h / L2<br />

0.<br />

1 .<br />

Figure 4: Distribution <strong>of</strong> magnetic flux density for magnetic bearing<br />

obtained using FEMM 4.2 s<strong>of</strong>tware.<br />

TABLE I<br />

CONVERGENCE OF LEVITATION MAGNETIC FORCE FORCE VERSUS<br />

NUMBER OF SEGMENTS.<br />

N tot<br />

nor<br />

F z<br />

nor<br />

F z (FEM)<br />

10 -0.0732327<br />

20 -0.0741579<br />

30 -0.0743319<br />

50<br />

100<br />

-0.0744213<br />

-0.0744591<br />

-0.07491167<br />

200 -0.0744685<br />

300 -0.0744703<br />

500 -0.0744712<br />

In order to save the calculation time, the number <strong>of</strong><br />

segments is limited on N tot N1<br />

N2<br />

200 because it<br />

is not necessary to take a greater number <strong>of</strong> segments to<br />

obtain a desired accuracy.<br />

Compared results for normalized interaction magnetic<br />

force <strong>of</strong> two identical ring permanent magnets, obtained<br />

using presented analytical approach and finite element<br />

method (FEM) versus 2 L h , for parameters: , 1 2 L a<br />

2 2,<br />

L b c L2<br />

3, d L2<br />

4 and 5 . 0 / L 1 L2<br />

are<br />

given in the Table II.<br />

Comparative results for normalized interaction<br />

magnetic force <strong>of</strong> axial passive magnetic bearing versus<br />

ratios 2 L a and 2 L b , obtained using presented approach<br />

and finite element method (FEM), for parameters:<br />

c L2<br />

3, d L2<br />

4,<br />

L1<br />

/ L2<br />

0.<br />

5 and h / L2<br />

1.<br />

5 are<br />

shown in the Table III.<br />

- 303 - 15th IGTE Symposium 2012<br />

TABLE II<br />

COMPARED RESULTS FOR INTERACTION MAGNETIC VERSUS h L2<br />

h / L2<br />

nor<br />

F z<br />

nor<br />

F z (FEM)<br />

0 -0.120071 -0.120515<br />

0.1 -0.074469 -0.074911<br />

0.2 -0.025238 -0.025659<br />

0.3 0.025238 0.024873<br />

0.4 0.074469 0.074099<br />

0.5 0.120071 0.119765<br />

0.6 0.159974 0.159691<br />

0.7 0.192597 0.192327<br />

0.8 0.216978 0.216771<br />

0.9 0.232821 0.232627<br />

1.0 0.240465 0.240293<br />

1.1 0.240766 0.240654<br />

1.2 0.234930 0.234829<br />

1.3 0.224326 0.224240<br />

1.4 0.210322 0.210279<br />

1.5 0.194163 0.194138<br />

TABLE III<br />

COMPARED RESULTS FOR INTERACTION MAGNETIC FORCE VERSUS<br />

a L AND<br />

2 b L2<br />

a / L2<br />

b/<br />

L2<br />

nor<br />

F z<br />

nor<br />

F z (FEM)<br />

1.0 2.0 0.194163 0.194138<br />

1.5 2.5 0.320992 0.321208<br />

2.0 3.0 0.209864 0.210493<br />

2.5 3.5 -0.614301 -0.613222<br />

3.0 4.0 -1.341280 -1.339868<br />

3.5 4.5 -0.714116 -0.712579<br />

4.0 5.0 0.272002 0.273619<br />

4.5 5.5 0.491236 0.492676<br />

5.0 6.0 0.352195 0.353756<br />

IV. CONCLUSION<br />

Determination <strong>of</strong> the interaction forces <strong>of</strong> axial passive<br />

magnetic bearing is presented. It is preformed using<br />

magnetization charges and discretization technique.<br />

Presumption was that both magnets are made <strong>of</strong> the<br />

same material and magnetized uniformly along the<br />

magnet axis <strong>of</strong> symmetry, with the same intensity, but in<br />

opposite directions. The derived algorithm is easily<br />

implemented in any standard computer environment and<br />

it enables rapid parametric studies <strong>of</strong> the interaction<br />

force. The results <strong>of</strong> the presented approach are<br />

successfully confirmed using FEMM 4.2 s<strong>of</strong>tware. Table<br />

I shows that it is not necessary to take a great number <strong>of</strong><br />

segments (not more then 200) to obtain a desired<br />

accuracy so the computational time can be saved.<br />

Interaction forces calculation using presented approach<br />

for mentioned parameters and N tot 200 is performed<br />

with Intel Core 2 Duo CPU at 2.4GHz and 4GB RAM<br />

memory and it took less than two seconds <strong>of</strong> run time.<br />

Interaction forces are also determined on the same<br />

computer using FEMM 4.2 s<strong>of</strong>tware and the computation<br />

time was 14 minutes for about 1.8million finite elements.<br />

Therefore, the advantage <strong>of</strong> presented analytical approach<br />

is its accuracy, simplicity and time efficiency.


V. ACKNOWLEDGEMENT<br />

The work presented here was partly supported by the<br />

Serbian Ministry <strong>of</strong> Education and Science in the frame<br />

<strong>of</strong> the project TR 33008.<br />

REFERENCES<br />

[1] J. S Agashe and D. P Arnold, “A study <strong>of</strong> scaling and geometry<br />

effects on the forces between cuboidal and cylindrical magnets<br />

using analytical force solutions”, J. Phys. D: Appl. Phys. 41<br />

105001, pp.1-9, 2008.<br />

[2] A. N. Vučković, S. R. Aleksić, S. S. Ilić.: “Calculation <strong>of</strong> the<br />

Attraction and Levitation Forces Using Magnetization Charges”,<br />

The 10th International Conference on Applied Electromagnetics –<br />

PES 2011, <strong>Proceedings</strong> <strong>of</strong> full papers (CDROM), pp. 33-55, 25-<br />

29 September, Niš, Serbia, 2011.<br />

[3] G. Akoun, J. P. Yonnet.: “3d Analytical Calculation <strong>of</strong> the Forces<br />

Exerted between two Cuboidal Magnets”, IEEE Transactions on<br />

Magnetics, Vol. 20, No. 5, pp. 1962-1964, September 1984.<br />

[4] S. I. Babic, C. Akyel.: “Magnetic Force Calculation between Thin<br />

Coaxial Circular Coils in Air”, IEEE Transactions on Magnetics,<br />

Vol. 44, No. 4, pp. 445-452, April 2008.<br />

[5] V. Lemarquand, G. Lemarquand.: “Passive Permanent Magnet<br />

Bearings for Rotating Shaft: Analytical Calculation”, Magnetic<br />

Bearings, Theory and Applications, Sciyo Published book, pp. 85-<br />

116, October 2010.<br />

[6] R. Ravaud, G. Lemarquand, V. Lemarquand.: “Force and<br />

Stiffness <strong>of</strong> Passive Magnetic Bearings Using Permanent Magnets.<br />

Part 1: Axial Magnetization”, IEEE Transactions on Magnetics,<br />

Vol. 45, No. 7, pp. 2996-3002, July 2009.<br />

[7] R. Ravaud, G. Lemarquand, S. Babic, V. Lemarquand, C. Akeyel.:<br />

“Cylindrical Magnets and Coils: Fields, Forces and Inductances”,<br />

IEEE Transactions on Magnetics, Vol. 46, No. 9, pp. 3585-3590,<br />

September 2010.<br />

- 304 - 15th IGTE Symposium 2012<br />

[8] M. Greconici, Z. Ž. Cvetković, A. N. Mladenović, S. R. Aleksić,<br />

D. Vesa.: “Analytical-numerical Approach for Levitation Force<br />

Calculation <strong>of</strong> a Cylindrical Bearing with Permanent Magnets<br />

Used in an Electric Meter” <strong>Proceedings</strong> <strong>of</strong> full papers OPTIM<br />

2010, pp. 197-201, 20-21 May, Brasov, Romania, 2010.<br />

[9] Furlani, E. P., S. Reznik, & A. Kroll. 1995. A three-dimensional<br />

field solution for radially polarized cylinders. IEEE Trans. Magn.,<br />

vol. 31, no.1, pp. 844–851.<br />

[10] M. Braneshi, O. Zavalani and A. Pijetri.: “The Use <strong>of</strong> Calculating<br />

Function for the Evaluation <strong>of</strong> Axial Force between Two Coaxial<br />

Disk Coils”, 3 rd International PhD Seminar Computational<br />

Electromagnetics and Technical Application, pp. 21-30, 28<br />

August - 1 September, Banja Luka, Bosnia and Hertzegovina,<br />

2006.<br />

[11] Rakotoarison, H. L., J.-P. Yonnet, & B. Delinchant.2007. Using<br />

Coulombian Approach for Modeling Scalar Potential and<br />

Magnetic Field <strong>of</strong> a Permanent Magnet With Radial Polarization.<br />

IEEE Transactions on Magnetics, Vol. 43, No. 4, pp. 1261-1264.<br />

[12] R. Ravaud, G. Lemarquand, V. Lemarquand.: “Force and<br />

Stiffness <strong>of</strong> Passive Magnetic Bearings Using Permanent Magnets.<br />

Part 2: Radial Magnetization”, IEEE Transactions on Magnetics,<br />

Vol. 45, No. 9, pp. 3334-3342, September 2009.<br />

[13] Ana N. Vučković, Saša S. Ilić & Slavoljub R. Aleksić: Interaction<br />

Magnetic Force Calculation <strong>of</strong> Ring Permanent Magnets Using<br />

Ampere's Microscopic Surface Currents and Discretization<br />

Technique, Electromagnetics, 32:2, pp. 117-134, 2012.<br />

[14] A. N. Mladenović, S. R. Aleksić, S. S. Ilić.: “Levitation Force<br />

Calculation for Permanent Magnet Bearings Using Ampere’s<br />

Currents”, The 14 th International IGTE Symposium on Numerical<br />

Field Calculation in Electrical Engineering, <strong>Proceedings</strong> <strong>of</strong> full<br />

papers (CDROM), pp. 149-153, 19-22 September, <strong>Graz</strong>, Austria,<br />

2010<br />

[15] Meeker, D. n.d. S<strong>of</strong>tware package FEMM 4.2. Available on-line<br />

at http://www.femm.info/wiki/ Download (accessed 2 March<br />

2007).


- 305 - 15th IGTE Symposium 2012<br />

Magnet deviation measurements and<br />

their consideration in<br />

electromagnetic field simulation<br />

Peter Offermann ∗ , Isabel Coenen ∗ , David Franck ∗ and Kay Hameyer ∗<br />

∗ Institute <strong>of</strong> Electrical Machines<br />

RWTH Aachen <strong>University</strong><br />

Schinkelstrasse 4<br />

D-52062 Aachen, Germany<br />

E-mail: Peter.Offermann@IEM.rwth-aachen.de<br />

Abstract—Due to their manufacturing process arc segment magnets for the use in permanent-magnet synchronous machines<br />

(PMSM) may show deviations from their intended ideal magnetization. Using magnets with unfavourable error constellations<br />

in one rotor <strong>of</strong> a PMSM will result in a spatial unsymmetric air gap field, causing undesired parasitic effects as e.g. torque<br />

pulsations. Most manufacturer information only contain the mean values <strong>of</strong> the magnetization as well as certain guaranteed<br />

error bounds, not stating if (and how) the magnetization will vary spatial over a set <strong>of</strong> magnets. In order to allow an<br />

accurate consideration <strong>of</strong> these deviations in the machine simulation, the emitted radial field <strong>of</strong> a set <strong>of</strong> magnets has been<br />

measured and compared to their assumed magnetisation using finite element method (FEM). As a result, the measured<br />

deviations can be quantified and the influence <strong>of</strong> magnet deviations can be estimated using e.g. stochastic collocation<br />

methods in combination with the FEM.<br />

Index Terms—finite element method, magnetization errors, measurements, stochastics variations<br />

I. INTRODUCTION<br />

The simulation <strong>of</strong> an electrical machine employing<br />

the finite element method (FEM) requires the exact<br />

knowledge <strong>of</strong> the machine’s geometry, its excitations and<br />

its material properties. For machines which are manufactured<br />

in mass production, the material or geometry <strong>of</strong><br />

one specific instance <strong>of</strong> the designed machine may vary<br />

from its specified targets [1], leading in the worst case<br />

to a non-fulfilment <strong>of</strong> the rated machine’s data.<br />

For geometry variations a typical cause is the abrasion<br />

<strong>of</strong> the punching tools. Varying material properties<br />

may be caused e.g. by a stochastic jitter in the orientation<br />

<strong>of</strong> the punched stator lamination sheets, which<br />

can be tainted with anisotropy. Causes for variations<br />

in excitations can either arise from the converter or –<br />

in case <strong>of</strong> a permanent-magnet synchronous machines<br />

(PMSM) – from magnet deviations [2] with respect to<br />

their intended ideal magnetization [3]. Using magnets<br />

with unfavourable error constellations in one rotor <strong>of</strong> a<br />

PMSM will result in a spatial unsymmetric air gap field,<br />

causing undesired parasitic effects as torque pulsation [4],<br />

[5].<br />

Most manufacturer information only contain the mean<br />

values <strong>of</strong> the magnetization as well as certain guaranteed<br />

error bounds, not stating if (and how) the magnetization<br />

will vary spatial over a set <strong>of</strong> magnets. The goal <strong>of</strong><br />

this publication hence is to improve the simulation <strong>of</strong><br />

electrical machines by reducing the described epistemic<br />

uncertainty <strong>of</strong> magnet variations. Therefore, a magnet<br />

test-bench has been created, in order to measure the<br />

emitted radial field <strong>of</strong> a set <strong>of</strong> magnets. From this, the<br />

modality and probability distribution <strong>of</strong> the occurring<br />

variations have been deduced.<br />

The comparison <strong>of</strong> the magnets’ FEM-simulations<br />

with their measurements may allow the calculation <strong>of</strong><br />

improved simulation parameters for complete machine<br />

simulations. For the measured magnets, which were<br />

diametrally magnetized, three error-types have been identified:<br />

A general variation <strong>of</strong> the flux-density’s strength<br />

<strong>of</strong> up to 11.6%, a maximal local, angle deviation at the<br />

magnet’s outer borders <strong>of</strong> 8 ◦ and local errors <strong>of</strong> up to<br />

9.1%.<br />

II. MAGNETIZATION MEASUREMENT TEST-BENCH<br />

In order to obtain reliable data about possible magnetisation<br />

errors, a test bench for the evaluation <strong>of</strong> surface<br />

magnets has been built. In the following the sensor<br />

selection (sec. II-A) and the test-bench construction (sec.<br />

II-B) are described.<br />

A. Sensor selection<br />

Typical methods to measure the magnetic flux-density<br />

are Hall-sensors and Helmholtz-coils. In this paper, a<br />

Hall-sensor as depicted in fig. 1 has been selected, due<br />

to the following reasoning:<br />

For best results, both methods require that the measured<br />

magnetic field is oriented perpendicular to the<br />

measuring coil respectively Hall-sensor. This can be<br />

easier accomplished for larger sensors than for very small


devices. Hall-sensors can be miniaturized due to the fact<br />

that an interaction with a given current is measured.<br />

Therefore the concomitant reduction <strong>of</strong> the Hall-constant<br />

CH, being a consequence <strong>of</strong> a reduction in material<br />

volume, can be compensated to certain extents with an<br />

increase in the measurement current (fig. 1). This allows<br />

to measure field components nearly pointwise.<br />

d<br />

ϕ1<br />

I<br />

B<br />

ϕ2<br />

Fig. 1. Hall-sensor and its distinctive input sizes.<br />

Helmholtz-coil configurations – in contrast to Hallsensors<br />

– always measure the the overall magnetic fluxdensity.<br />

Due to this integration over the magnet’s surface<br />

flux-density, however, a pointwise selective resolution <strong>of</strong><br />

the magnetic field is no longer possible. Global angle<br />

<strong>of</strong>fsets in the magnetization can be detected with both<br />

measurement methods by either using multiple sensors<br />

respectively coils or by turning the magnet under test.<br />

For this purpose, coils are preferable, because their<br />

orientation is better adjustable and an integration over<br />

all local values for a single angle value is implemented<br />

intrinsic in the coil. Local angle errors however cannot<br />

be detected using such a setup. Lastly, coil measurements<br />

are less noise sensitive because the integration already<br />

smoothes some measurement noise.<br />

The decisive factor for Hall-sensors was the interest<br />

in local magnet variations, since most publications until<br />

now focus only on global magnet variations [6], [7] in<br />

electrical machines. Furthermore, this selection allows<br />

the analysis <strong>of</strong> possible locational misalignments <strong>of</strong> the<br />

magnets and will enable a later use <strong>of</strong> the measured<br />

variations in conformal mapping Ansatz functions [8],<br />

[9].<br />

B. Test-bench construction<br />

For the construction <strong>of</strong> the magnet test bench, Hallsensors<br />

<strong>of</strong> the type HE-244 [10] were selected. Table II-B<br />

summarizes the main features <strong>of</strong> the selected sensor:<br />

TABLE I<br />

PROPERTIES OF THE USED HALL SENSOR.<br />

value unit<br />

supply current up to 10 mA<br />

sensitivity 90 to 190 V / (A · T)<br />

linearity<br />

hall voltage typical ≤ 0.2 %<br />

Three sensors for the measurement <strong>of</strong> the magnetic<br />

field components Bx, By and Bz are located on an index<br />

- 306 - 15th IGTE Symposium 2012<br />

arm with predefined 90 degree edges, in order to achieve<br />

a good positioning. The sensors are positioned directly<br />

on adjacent edges to measure the field at approximately<br />

one point as depicted in fig. 2.<br />

y x<br />

z<br />

Fig. 2. Positions and labelling <strong>of</strong> the used Hall-sensors on the<br />

measurement anchor.<br />

The index arm itself is mounted on a gibbet, which<br />

is constructed in such a way, that it allows a position<br />

adjustment in all three dimensions. Below the index arm<br />

the magnets under test can be mounted upon a cylindric<br />

shaft which rotates around its symmetry-axis (fig. 3, 4).<br />

z<br />

encoder<br />

y<br />

x<br />

step motor<br />

magnet mounting<br />

rotation axis<br />

hall sensor<br />

magnet under test<br />

Fig. 3. Schematic scetch <strong>of</strong> the created test bench for magnet<br />

measurements.<br />

This allows the use <strong>of</strong> a connected stepper-motor to<br />

measure the field along a circular line over the magnet’s<br />

surface. To avoid field distortion by flux guidance all<br />

relevant test bench components have been constructed<br />

from aluminium. Data acquisition and the stepper-motor<br />

control are implemented using a dSpace-system in combination<br />

with a PC.<br />

III. RESULTS<br />

In this study 52 magnets with diametral magnetization<br />

and a field strength <strong>of</strong> Br = 1.04T were analysed,<br />

consisting <strong>of</strong> two equally sized groups with either northor<br />

south-pole on the outer magnet circumference. For<br />

each magnet, the Hall-voltage <strong>of</strong> the radial outwards<br />

pointing flux-density was measured 1.5mm above the<br />

magnet’s surface. The magnet’s dimensions are given in<br />

fig. 5.<br />

A. Simulations<br />

In the simulations, the magnet (as depicted in fig. 5)<br />

is surrounded by an air layer which measures ten times


Fig. 4. Photograph <strong>of</strong> the constructed magnet test bench.<br />

Br =1.04T<br />

3mm<br />

Fig. 5. Dimensions <strong>of</strong> the measured magnets.<br />

15mm<br />

the magnet’s height in every direction [11]. The applied<br />

solver implements the magnetic vector-potential formulation.<br />

All boundaries were set as Neumann conditions. The<br />

radial flux-density was sampled along a circumference <strong>of</strong><br />

1.5mm above the magnet.<br />

B. Measurements<br />

1) Repetition measurements:<br />

Repetitive measurements were executed to determine the<br />

test-bench’s measurement reproducibility. The average<br />

error between two arbitrary measurements <strong>of</strong> the same<br />

magnet is below 0.5% and mainly caused by very small<br />

positioning errors <strong>of</strong> the magnet in the tangential direction<br />

<strong>of</strong> the measurement shaft. Fig. 6 depicts five<br />

repetitive measurements <strong>of</strong> magnet #7.<br />

2) Post-processing <strong>of</strong> measurements:<br />

For data acquisition, every magnet is inserted, measured,<br />

and removed from the test-bench five times (fig. 6).<br />

Afterwards, the repetitive data <strong>of</strong> each magnet data are<br />

scanned for obvious misplacement errors. If they exist,<br />

the worst deviating measurement is removed. Thereafter,<br />

- 307 - 15th IGTE Symposium 2012<br />

V(Brad)[V ]<br />

2<br />

0<br />

−2<br />

−4<br />

−6<br />

200 220 240 260 280 300 320 340<br />

angle [ ◦ −8<br />

]<br />

Fig. 6. Five repetitive measurements <strong>of</strong> magnet #7, showing the testbench’s<br />

reproduction quality.<br />

the repetitive measurements are aligned to have their<br />

outer minima centred at around fixed value. Ultimately,<br />

the remaining, centred flux-density values <strong>of</strong> the magnet<br />

are averaged. Fig. 7 shows – for the purpose <strong>of</strong> demonstration<br />

exaggerated – examples <strong>of</strong> the described process.<br />

raw measurements<br />

delete errors<br />

x-align measurements<br />

average<br />

Fig. 7. Post-processing <strong>of</strong> measured flux-density curves.<br />

3) Variation measurements:<br />

Figure 8 presents the results <strong>of</strong> the variation measure-


ments for all magnets which have their north pole located<br />

on the outer side. Two obvious variations can be directly<br />

identified:<br />

• Strength variations in the overall remanence fluxdensity<br />

per magnet,<br />

• Strong deformations from the expected curve shape<br />

in terms <strong>of</strong> local variations.<br />

V(Brad)[V ]<br />

8<br />

6<br />

4<br />

2<br />

0<br />

−2<br />

200 220 240 260 280 300 320 340<br />

angle [ ◦ ]<br />

Fig. 8. Measured radial flux-density 1.5mm above each magnet’s<br />

centre in the magnet group ’north-up’.<br />

Fig. 9 shows accordingly the likelihood <strong>of</strong> occurrence<br />

for the radial outwards pointing flux-density over the<br />

magnet angle for the opposite magnet group. Due to the<br />

envelope shape <strong>of</strong> the resulting curve, the strong influence<br />

<strong>of</strong> the variations is even more obvious.<br />

Fig. 9. Probability <strong>of</strong> measured magnetisation strength, probabilities<br />

ranging from low (dark) to high (light).<br />

C. Comparison <strong>of</strong> measurements and simulations<br />

In order to quantify the strength <strong>of</strong> the occurring<br />

deviations in terms <strong>of</strong> changes in excitation (in contrast<br />

to changes in the resulting flux-density), the excitation<br />

<strong>of</strong> each magnet had to be reconstructed from the given<br />

- 308 - 15th IGTE Symposium 2012<br />

measurements. To solve this inverse problem [12], a<br />

straightforward approach was to compare the measured<br />

radial flux-density component <strong>of</strong> each magnet to a set<br />

<strong>of</strong> simulations. In these simulations, the magnet’s remanence<br />

flux-density Br was varied as parameter ξ1,<br />

applying the simulation conditions presented in section<br />

III-A. However, the resulting shapes did not agree to<br />

the measured curves. The employed magnetisation model<br />

was therefore extended to include a second deviation<br />

parameter ξ2, allowing an angle spread in magnetisation<br />

as given in fig. 10 and yealding the excitation given in<br />

eq. 1:<br />

⎛<br />

B(Δα, ξ1,ξ2) =Br(ξ1) · ⎝ cos(αmid<br />

⎞<br />

+Δα(ξ2))<br />

sin(αmid +Δα(ξ2)) ⎠ (1)<br />

0<br />

Δα<br />

Fig. 10. Determined second deviation parameter ξ2 (grey) from the<br />

ideal, unidirectional magnetisation.<br />

Applying both variation types, the magnet excitation<br />

parameters could be reconstructed sufficiently in most<br />

cases using the least-square minimization from eq. 2 for<br />

parameter determination:<br />

<br />

<br />

<br />

min <br />

<br />

ξ1,ξ2<br />

310 ◦<br />

<br />

α=230 ◦<br />

[Brad,sim(α, ξ1,ξ2) − Brad,mes(α)] 2<br />

<br />

<br />

<br />

<br />

(2)<br />

Fig. 11 shows the comparison <strong>of</strong> the measured radial<br />

flux-density (dashed) in comparison to the best fitting<br />

simulated curve (solid). The divergence <strong>of</strong> both curves at<br />

V(Brad)[V ]<br />

8<br />

6<br />

4<br />

2<br />

0<br />

−2<br />

200 220 240 260 280 300 320<br />

angle [ ◦ −4<br />

]<br />

Fig. 11. Measured (dashed) radial outwards pointing flux-density in<br />

comparison to its best fitting siumlation for magnet #1.


the outer side <strong>of</strong> both graphs can safely be neglected here,<br />

because they are caused by effects <strong>of</strong> the 2D-simulation<br />

and are considered as not relevant, as this area is not<br />

above, but beside the magnet.<br />

Figure 12 finally shows the comparison <strong>of</strong> measured<br />

and simulated radial outwards pointing flux-density for a<br />

magnet having a local magnetisation error. As the graph<br />

clearly shows, this behaviour cannot be reproduced by the<br />

applied model yet. The three identified error-types finally<br />

have been identified to: flux-density’s strength variations<br />

<strong>of</strong> up to 11.6%, a maximal local, angle deviation at the<br />

magnet’s outer borders <strong>of</strong> 8 ◦ and local errors <strong>of</strong> up to<br />

9.1%<br />

V(Brad)[V ]<br />

8<br />

6<br />

4<br />

2<br />

0<br />

−2<br />

200 220 240 260 280 300 320<br />

angle [ ◦ −4<br />

]<br />

Fig. 12. Measured (dashed) radial outwards pointing flux-density in<br />

comparison to its best fitting siumlation for magnet #13. Local errors<br />

cannot be reproduced yet.<br />

IV. CONCLUSIONS<br />

The presented methodology allows an accurate determination<br />

<strong>of</strong> remanence flux-density variations above the<br />

surface <strong>of</strong> a set <strong>of</strong> magnets or rotors. A comparison <strong>of</strong><br />

the measured curves with the magnet’s simulated and<br />

intended remanence flux-density reveals, in which way<br />

the used FE-magnet-models have to be adopted to be<br />

used in stochastic considerations <strong>of</strong> parameter variations<br />

in electrical machines. Necessary implementations are a<br />

scalable magnetization strength and an over the magnet<br />

changing deviation angle. Optional, local errors can be<br />

considered as well. The resulting magnet parameters<br />

finally can be used for uncertainty propagation applying<br />

appropriate tools as stochastic collocation [13] or polynomial<br />

chaos approaches [14] to propagate the magnet<br />

deviations onto output sizes <strong>of</strong> interest.<br />

V. ACKNOWLEDGEMENT<br />

The results presented in this paper have been developed<br />

in the research project Propagation <strong>of</strong> uncertainties<br />

across electromagnetic models granted by the Deutsche<br />

Forschungsgemeinschaft (DFG).<br />

- 309 - 15th IGTE Symposium 2012<br />

REFERENCES<br />

[1] M. Ci<strong>of</strong>fi, A. Formisano, and R. Martone, “Stochastic handling<br />

<strong>of</strong> tolerances in robust magnets design,” IEEE Transactions on<br />

Magnetics, vol. 40, no. 2, pp. 1252 – 1255, march 2004.<br />

[2] M.-F. Hsieh, C.-K. Lin, D. Dorrell, and P. Wung, “Modeling<br />

and effects <strong>of</strong> in-situ magnetization <strong>of</strong> isotropic ferrite magnet<br />

motors,” in Energy Conversion Congress and Exposition (ECCE),<br />

2011 IEEE, sept. 2011, pp. 3278 –3284.<br />

[3] K.-C. Kim, S.-B. Lim, D.-H. Koo, and J. Lee, “The shape<br />

design <strong>of</strong> permanent magnet for permanent magnet synchronous<br />

motor considering partial demagnetization,” IEEE Transactions<br />

on Magnetics, vol. 42, no. 10, pp. 3485 –3487, oct. 2006.<br />

[4] D. Torregrossa, A. Khoobroo, and B. Fahimi, “Prediction <strong>of</strong><br />

acoustic noise and torque pulsation in pm synchronous machines<br />

with static eccentricity and partial demagnetization using field<br />

reconstruction method,” IEEE Transactions on Industrial Electronics,<br />

vol. 59, no. 2, pp. 934 –944, feb. 2012.<br />

[5] G. Heins, T. Brown, and M. Thiele, “Statistical analysis <strong>of</strong> the<br />

effect <strong>of</strong> magnet placement on cogging torque in fractional pitch<br />

permanent magnet motors,” IEEE Transactions on Magnetics,<br />

vol. 47, no. 8, pp. 2142 –2148, aug. 2011.<br />

[6] F. Jurisch, “Production process based deviations in the orientation<br />

<strong>of</strong> anisotropic permanent magnets and their effects onto the operation<br />

performance <strong>of</strong> electrical machines and magnetic sensors –<br />

german –,” International ETG-Kontress Tagungsband, (ETG-FB<br />

107), no. 1, pp. 255–261, 2007.<br />

[7] I. Coenen, M. Herranz Gracia, and K. Hameyer, “Influence and<br />

evaluation <strong>of</strong> non-ideal manufacturing process on the cogging<br />

torque <strong>of</strong> a permanent magnet excited synchronous machine,”<br />

COMPEL, vol. 30, no. 3, pp. 876–884, 2011.<br />

[8] M. Hafner, D. Franck, and K. Hameyer, “Accounting for saturation<br />

in conformal mapping modeling <strong>of</strong> a permanent magnet<br />

synchronous machine,” COMPEL, vol. 30, no. 3, pp. 916–928,<br />

May 2011.<br />

[9] D. Zarko, D. Ban, and T. Lipo, “Analytical calculation <strong>of</strong> magnetic<br />

field distribution in the slotted air gap <strong>of</strong> a surface permanentmagnet<br />

motor using complex relative air-gap permeance,” Magnetics,<br />

IEEE Transactions on, vol. 42, no. 7, pp. 1828 – 1837,<br />

july 2006.<br />

[10] H. Electronics, “He244 series analog hall sensor - datasheet,”<br />

Download from www.hoeben.com, downloaded at 15.08.2012,<br />

November 2011.<br />

[11] P. Offermann and K. Hameyer, “Non-Linear stochastic variations<br />

in a magnet evaluated with Monte-Carlo simulation and a polynomial<br />

Chaos META-Model,” in XXII Symposium on Electromagnetic<br />

Phenomena in Nonlinear Circuits,EPNC 2012. Pula,<br />

Croatia: PTETIS Publishers, June 2012, pp. 21–22.<br />

[12] A. Mohamed Abouelyazied Abdallh, “An inverse problem based<br />

methodology with uncertainty analysis for the identification <strong>of</strong><br />

magnetic material characteristics <strong>of</strong> electromagnetic devices,”<br />

Ph.D. dissertation, Ghent <strong>University</strong>, 2012.<br />

[13] E. Rosseel, H. De Gersem, and S. Vandewalle, “Nonlinear<br />

stochastic Galerkin and collocation methods: application to a<br />

ferromagnetic cylinder rotating at high speed,” Communications<br />

in Computational Physics, vol. 8, no. 5, pp. 947–975, 2010.<br />

[14] B. Sudret, “Uncertainty propagation and sensitivity analysis<br />

in mechanical modesl – contributions to structural reliability<br />

and stochastic spectral methods,” Ph.D. dissertation, Universite<br />

BLAISE PASCAL - Clermont II, Ecole Doctorale Sciences pour<br />

l’Ingenieur, 2007.


- 310 - 15th IGTE Symposium 2012<br />

Potential <strong>of</strong> Spheroids in a Homogeneous<br />

Magnetic Field in Cartesian Coordinates<br />

Markus Kraiger∗ and Bernhard Schnizer †<br />

∗Institute for Radiopharmacy - PET Center, Helmholtz-Zentrum Dresden - Rossendorf e.V., Bautzner Landstr. 400,<br />

D-01328 Dresden - Schönfeld/Schullwitz, Germany. Email: m.kraiger@hzdr.de<br />

† Institute for Theoretical Physics - Computational Physics, Technische Universität <strong>Graz</strong>, Petersg. 16, A-8010 <strong>Graz</strong>,<br />

Austria<br />

E-mail: schnizer@itp.tu-graz.ac.at<br />

Abstract—The potential and the field <strong>of</strong> a prolate or an oblate magnetic spheroid in a static homogeneous field are computed<br />

and expressed in Cartesian coordinates. The directions <strong>of</strong> both the primary magnetic field and <strong>of</strong> the symmetry axis are<br />

completely arbitrary. These expressions are used to investigate trabecular structures built from spheroids having different<br />

symmetry axes and positions for Magnetic Resonance (MR-) Osteodensitometry.<br />

Index Terms—Prolate or oblate spheroid in homogeneous field, building flexible models for magnetic resonance imaging or<br />

spectroscopy.<br />

I. INTRODUCTION<br />

In gerneral, the potential <strong>of</strong> a magnetic spheroid in a given<br />

external magnetic field is derived in spheroidal coordinates,<br />

whose symmetry axis is the z-axis. Models <strong>of</strong> biological tissues,<br />

as e.g. trabecular bones, are arrays <strong>of</strong> such spheroids with symmetry<br />

axes having various directions. Having such applications<br />

in mind, we derived potential and field expressions for prolate<br />

and oblate spheroids in a homogeneous field. These expressions<br />

depend on Cartesian coordinates for arbitrary directions <strong>of</strong> both<br />

the field and the symmetry axes.<br />

II. METHOD OF SOLUTION<br />

A spheroid (permeability μi = μ0(1 + χi); semi-axes<br />

a, a, c) is in a medium (permeability μe = μ0(1 + χe))<br />

and a static homogeneous field H0 = (H0x,H0y,H0z) =<br />

H0(sin β cos α, sin β sin α, cos β) <strong>of</strong> arbitrary direction. At<br />

first the problem <strong>of</strong> a prolate spheroid is solved in prolate<br />

spheroidal coordinates ([1], Fig.1.06)<br />

x + iy = ep sinh η sin θe iψ<br />

(1)<br />

z = ep cosh η cos θ<br />

or in the corresponding oblate spheroidal coordinates ([1],<br />

Fig.1.07)<br />

x + iy = eo cosh η sin θe iψ<br />

(2)<br />

z = eo sinh η cos θ.<br />

for an oblate spheroid as shown e.g. in [2] to [4]. The particular<br />

solutions <strong>of</strong> the potential equation are obtained by separation<br />

giving Legendre functions and polynomials <strong>of</strong> cosh η, i sinh η<br />

respectively multiplied by Legendre polynomials <strong>of</strong> cos θ and<br />

by trigonometric functions <strong>of</strong> ψ. A solution <strong>of</strong> this problem<br />

is found by the usual method, namely by expanding the<br />

potential in the interior and in the exterior <strong>of</strong> the spheroid<br />

w.r.t. the particular solutions fulfilling the appropriate boundary<br />

conditions: i) the total potential must be finite at η =0; ii)<br />

the total potential must agree with that <strong>of</strong> the primary field<br />

(5) at η = ∞. The expansion coefficients are determined<br />

by the continuity conditions that the total potential must be<br />

continuous Φ0 +Φ σ e =Φ0 +Φ σ i and the corresponding normal<br />

component <strong>of</strong> the magnetic induction must be continuous at the<br />

interface <strong>of</strong> the two media ((7) with n = ez). The solutions<br />

contain only Legendre funtions and polynomials <strong>of</strong> order 1<br />

since the inhomogeneity (5) is <strong>of</strong> that order. Thereafter the<br />

Legendre functions and polynomials may be replaced with<br />

elementary functions <strong>of</strong> η and θ. These may be in turn expressed<br />

by functions <strong>of</strong> Cartesian coordinates by use <strong>of</strong> (1),<br />

(2) respectively and by cosh η = up(r, ez)/ √ 2, sinh η =<br />

uo(r, ez)/ √ 2, eq.(24) respectively. The expansion coefficients<br />

L σ 0 ,L σ 1 ,M σ 0 ,M σ 1 obtained from matching the two pieces <strong>of</strong><br />

the potential at the interface are first expressed in Legendre<br />

functions and polynomials <strong>of</strong> argument ηp,ηo respectively:<br />

ηp = Arcoth(cp/ap) (3)<br />

ηo = Artanh(co/ao). (4)<br />

The coefficients are also reexpressed in elementary functions<br />

<strong>of</strong> these geometrical parameters and by the magnetic susceptibilities<br />

χe,χi to give eqs.(8) to (11), (13) to (16) respectively.<br />

In the last step the potential in both domains is transformed to<br />

an arbitrary direction n <strong>of</strong> the spheroidal symmetry axis. All<br />

vectors in the potential are decomposed into vectors parallel to<br />

or perpendicular to the z-axis. Finally all vectors ez occuring<br />

in these expressions are replaced by n.<br />

This description is rather concise; full details may be found<br />

in the papers [3] and [4] and in the notebooks at the website<br />

quoted. But the next paragraph gives a complete listing <strong>of</strong> all<br />

formulas needed for the applications.<br />

III. RESULTS<br />

The primary field is homogeneous with the potential<br />

Φ0(x, y, z) = − (H0x x + H0y y + H0z z). (5)<br />

A. The potentials <strong>of</strong> the reaction fields<br />

The presence <strong>of</strong> a spheroid induces a reaction field with<br />

potential (r =(xβ)) :<br />

Φ σ k(x, y, z) =<br />

3X<br />

α,β=1<br />

H0αt σ,k<br />

αβ xβ = H0 · T σ,k · r (6)<br />

with σ = p (= prolate) or = o (= oblate) and k = e (= external)<br />

or i (= internal) to the ellipsoid<br />

Eσ := r2 − (n · r) 2<br />

a 2 σ<br />

+ (n · r)2<br />

c 2 σ<br />

=1. (7)


For p a prolate spheroid, ap < cp, the excentricity is ep =<br />

c2 p − a2 p; for an oblate one, co


the additional contribution, originating from the local field<br />

inhomogeneities, to the effective transversal relaxation rate R ∗ 2.<br />

Further, R ′ 2 ≈ γΔB with ΔB representing the field variation<br />

and γ the gyromagnetic ratio.<br />

B. Theory: Computersimulation<br />

The aim <strong>of</strong> the current simulation is to investigate effects on<br />

the induced line broadening <strong>of</strong> the resonance spectra evoked<br />

through micro cracks as examples <strong>of</strong> trabecular rarefaction.<br />

Thus, the evaluation <strong>of</strong> the magnetic field distribution was<br />

performed utilizing a two-compartment model, consisting <strong>of</strong><br />

marrow and bone. In oder to mimic the known trabecular micro<br />

structure within a vertebra [13] prolate ellipsoids were arranged<br />

appropriately within a three-dimensional unit cell.<br />

The precession frequency <strong>of</strong> spins in a homogeneous magnetic<br />

field is determined through the magnetic induction B.<br />

Hence, in a first step the reaction fields induced by the susceptibility<br />

difference between the ellipsoids (trabeculae) and the<br />

background (bone marrow) were computed [14].<br />

Introducing a sample with a different susceptibility, in the<br />

current experiment trabecular bone (χ2) is surrounded by bone<br />

marrow (χ1), the resulting magnetic induction Bz can be<br />

generally written as:<br />

Bz = μ (H0z + Mz (r)) = μ0(1 + χ)(H0z + Mz (r)) , (31)<br />

with Mz characterising the induced reaction field. Herin the<br />

units are given in the MKS-system, and susceptibility units are<br />

per unit volume.<br />

Since the transversal magnetization decay <strong>of</strong> mineralized<br />

bone is several magnitudes faster comparing to bone marrow,<br />

the received resonance signal in MR-Osteodensitometry is governed<br />

by the magnetization arising within the marrow. Thus Mz<br />

corresponds to the computed reaction fields ΔHr1,z caused by<br />

the difference in magnetic property between bone and marrow.<br />

The resulting magnetic field distribution within the unit cell<br />

was determined as the sum <strong>of</strong> the individual contributions Hzi<br />

originating from all ellipsoids n:<br />

nX<br />

ΔHr1,z (r) = Hzi (r) . (32)<br />

i=1<br />

Interactions between the trabeculae have been neglected. This<br />

assumption is valid, since interactions between such structures<br />

include susceptibility effects <strong>of</strong> the second order, which will<br />

give rise to field contributions <strong>of</strong> the order <strong>of</strong> H0 (Δχ) 2 ,or<br />

≈ H0 · 10 −12 .<br />

In a simple MR experiment, excitation followed by an<br />

acquisition period, the signal <strong>of</strong> the free induction decay (FID)<br />

can be written as:<br />

S(t) =const<br />

Z<br />

VOI<br />

with ω(r) =γBz(r) it follows:<br />

Z<br />

S(t) =const<br />

VOI<br />

d 3 r e −iω(r)t e −T2/t ; (33)<br />

d 3 r e −iγBz(r)t e −T2/t . (34)<br />

Using again expression (31) the following expression in<br />

ΔHr1,z can be found:<br />

Z<br />

S(t) =const<br />

VOI<br />

d 3 r e −iγtμ0(1+χ)(H0z+ΔHr1,z(r)) e −T2/t .<br />

(35)<br />

This integral must be extended over the entire unit cell enclosing<br />

the ellipsoids.<br />

In order to compare the simulation results with MR images<br />

the magnitude <strong>of</strong> S(t) must be found. Except for the dissipative<br />

relaxation phenomenon e −T2/t<br />

the expressions in (35) are<br />

- 312 - 15th IGTE Symposium 2012<br />

purely oscillatory in H0z. Hence, for the analysis <strong>of</strong> the signal<br />

course the essential decay can be expressed as:<br />

Z<br />

|S(t)| = const d 3 r e −iγtμ0(1+χ)ΔHr1,z(r)<br />

. (36)<br />

VOI<br />

ΔHr1,z(r) can be computed according to (32) as the sum<br />

over all the reactions fields <strong>of</strong> the individual ellipsoids, where<br />

μ0(1+χ) describes the magnetic permeability at the location r.<br />

1) Algorithm: Utilizing the expression developed for the<br />

reaction field (28) the simulation was implemented in Mathematica<br />

(Wolfram Research, Inc.). The program computed the<br />

field distribution <strong>of</strong> ΔHr1,z(r) in the sense <strong>of</strong> a histogram and<br />

generated the MR signal curve according to (36).<br />

As input parameters the spacing <strong>of</strong> the trabeculae in x-,<br />

y- and z-direction, the dimensions <strong>of</strong> the ellipsoids and the<br />

position <strong>of</strong> the symmetry axis with respect to the z-axis<br />

<strong>of</strong> the coordinate system had to be defined. Further, the<br />

susceptibilities <strong>of</strong> the bones and the background as well as the<br />

orientation <strong>of</strong> the applied homogenous main magnetic field had<br />

to be set. The results <strong>of</strong> the simulations were the histograms<br />

<strong>of</strong> the magnetic field distribution and the signal curve, which<br />

was further utilized within a fitting-procedure yielding the<br />

relaxation constant R ′ 2.<br />

2) Data fitting: Utilizing the simulated signal curves a<br />

exponential signal model was applied in order to approximate<br />

the relaxation time T ′ 2 [15]. The computed signal intensities (36)<br />

at the echo times ranging from 0 to 50 ms, 5 ms increment, were<br />

used to generate a single T ′ 2 value by means <strong>of</strong> a non linear<br />

least-squares-approximation to a two parameter fit function:<br />

S(t) =Ae −t/T ′ 2 . (37)<br />

C. Model <strong>of</strong> vertebra<br />

The three-dimensional unit cell was composed out <strong>of</strong> thirty<br />

prolate ellipsoids, fifteen aligned along the x- and z-direction<br />

each, mimicing the initial intact trabeculae. The interruptions<br />

were simulated in the way, that each trabecula was replaced by<br />

two ellipsoids, which were displaced along the x/z-axis by 50<br />

μm forming a crack. The configuration <strong>of</strong> the three-dimensional<br />

vertebra model and the applied parameter setting are given in<br />

Fig.1.<br />

Fig. 1. Depiction <strong>of</strong> the 3.75 × 3.75 × 3.75 mm 3 unit cell; the<br />

x/z aligned sets are built up <strong>of</strong> three planes displaced by 750 μm.<br />

The trabeculae in each plane were modelled with a trabecular spacing<br />

and width <strong>of</strong> 500 μm and 120 μm respectively. The trabecular micro<br />

fractures were simulated by replacing each <strong>of</strong> the intact trabeculae with<br />

two opposed shifted versions.


D. Results<br />

The resulting reaction fields Hr1 pre- and post bone rarefaction<br />

are depicted in Fig.2. Note, that the field distribution is<br />

directly affected by the shape <strong>of</strong> the micro cracks, whereby the<br />

resulting field inhomogeneities in the vicinity <strong>of</strong> the spiky edges<br />

lead to the observed major field broadening. Prior rarefaction,<br />

the inital field distribution ranged approximately around ±1<br />

A/m, afterwards field values from almost ±2 A/m were found<br />

within the three-dimensional vertebra model. The effect <strong>of</strong> the<br />

interrupted bone mesh on the MR signal decay and the resulting<br />

estimated relaxation time T ′ 2 is presented in Fig.3. The modelled<br />

cracks gave rise to a change <strong>of</strong> the initial T ′ 2 <strong>of</strong> 26.1 ms to<br />

approximately 14.4 ms.<br />

Fig. 2. Resulting field distribution <strong>of</strong> the reaction field Hr1,z within<br />

the applied three-dimensional vertebra model. The trabecular cracks<br />

causing a broadening <strong>of</strong> the distribution, resulting in a more Lorentzian<br />

like line shape. A main magnetic field H0 =2.38732 · 10 6 A/m with<br />

α =30 ◦ and β parallel z-axes, and values <strong>of</strong> χ1 = −0.62·4·π ·10 −6<br />

and χ2 = −0.9 · 4 · π · 10 −6 were applied.<br />

V. CONCLUSION<br />

The advantage <strong>of</strong> this new approach is that it is very easy<br />

to build and investigate structures built from spheroids with<br />

different axes and positions. There is no need <strong>of</strong> complicated<br />

coordinate transformations.<br />

The analytical solutions <strong>of</strong> the Laplacian potential problem<br />

<strong>of</strong> spheroids in Cartesian coordinates were successfully applied.<br />

Fig. 3. Resulting resonance signal decays affected by the reaction field<br />

Hr1,z <strong>of</strong> the vertebra model in the two situations. As a consequence<br />

<strong>of</strong> the increasing inhomogeneous reaction field a rapid signal decay in<br />

case <strong>of</strong> micro cracks is visible (green curve). The signals are normalized<br />

to the values at the first echo time TE, markers are indicating the<br />

computed signal values at TE.<br />

- 313 - 15th IGTE Symposium 2012<br />

A three-dimensional magnetostatic problem in the area <strong>of</strong> MR-<br />

Osteodensitometry, susceptibility effects in the vicinity <strong>of</strong> micro<br />

cracks, was analysed. Within vertebrae affected by pathologies<br />

such as osteoporosis horizontally arranged structures get typically<br />

interrupted at first. The novel expressions make it possible<br />

to study the bone rarefaction along such pathologies, whereby<br />

either cracks <strong>of</strong> the horizontal, the vertical or arbitrary structures<br />

are accessible for modelling.<br />

In the present work just one application <strong>of</strong> the analytical<br />

expressions, the modelling <strong>of</strong> bone disorders in the area <strong>of</strong><br />

MR-Osteodensitometry, was given. For example in the field<br />

<strong>of</strong> functional MRI the devoloped toolbox eases the analysis <strong>of</strong><br />

the BOLD (blood oxygenation level-dependent) contrast, where<br />

induced reaction fields in the surrounding <strong>of</strong> vascular networks<br />

are <strong>of</strong> great interest [16]. A fast and precise computation <strong>of</strong> the<br />

magnetic distortion is essential for improving the precision <strong>of</strong><br />

the temperature determination in techniques using the proton<br />

resonance frequency (PRF) shift method [17], [18]. Temperature<br />

mapping in the vicinity <strong>of</strong> the needle electrode is a crucial<br />

determinant <strong>of</strong> MRI guided interventional radi<strong>of</strong>requency ablations<br />

[19]. Further, in the field <strong>of</strong> metabolism studies using<br />

NMR spectroscopy (MRS) the expressions can be used in<br />

order to model specific cells introduced in solutes differing in<br />

magnetic susceptibility. [20].<br />

In summary, the authors believe that the novel formulation<br />

<strong>of</strong> solutions depending solely on the Cartesian coordinates will<br />

facilitate the modelling <strong>of</strong> countless magnetostatic problems.<br />

REFERENCES<br />

[1] P. Moon and D.E. Spencer: Field Theory Handbook. Including<br />

coordinate systems, differential equations and their solutions.<br />

Springer 1988.<br />

[2] P. W. Kuchel, and B. T. Birman, ”Perturbation <strong>of</strong> Homogeneous<br />

Magnetic Fields by Isolated Single and Confocal Spheroids. Implications<br />

for NMR Spectroscopy <strong>of</strong> Cells,” NMR in Biomedicine,<br />

vol.2 (4) pp. 151-160, 1989.<br />

[3] M. Kraiger, and B. Schnizer, ”Potential and Field <strong>of</strong> a Homogeneous<br />

Magnetic Spheroid <strong>of</strong> Arbitrary Direction in a Homogeneous<br />

Magnetic Field in Cartesian Coordinates,” to appear in COMPEL,<br />

2012.<br />

[4] M. Kraiger, and B. Schnizer, ”Reaction Fields <strong>of</strong> a Homogeneous<br />

Magnetic Spheroids <strong>of</strong> Arbitrary Direction in a Homogeneous<br />

Magnetic Field. A Toolbox for MRI and MRS <strong>of</strong> Heterogeneous<br />

Tissue.” Report ITPR-2011-021. Institute for Theoretical<br />

and Computational Physics. Technische Universität <strong>Graz</strong>, Austria.<br />

http://itp.tugraz.at/∼schnizer/MedicalPhysics/<br />

[5] F. W. Wehrli, H. K. Song, P. K. Saha, and A. C. Wright, ”Quantitative<br />

MRI <strong>of</strong> the assessment <strong>of</strong> bone structure and function,” NMR<br />

in Biomedicine, vol. 19 pp. 731-764, 2006.<br />

[6] C. A. Davis, H. K. Genant, and J. S. Dunham, ”The effects <strong>of</strong><br />

bone on proton NMR relaxation times <strong>of</strong> surrounding liquids,”<br />

Investigative Radiology, vol. 21 pp. 472-477, 1986.<br />

[7] S. Grampp, S. Majumdar, M. Jergas, P. Lang, A. Gies, and HK.<br />

Genant, ”MRI <strong>of</strong> bone marrow in the distal radius: in vivo precision<br />

<strong>of</strong> effective transverse relaxation times,” European Radiology, vol.<br />

5 pp. 43-48, 1995.<br />

[8] T. M. Link, J. C. Lin, D. Newitt, N. Meier, S. Waldt, and S.<br />

Majumdar, ”Computergestützte Strukturanalyse des trabekulären<br />

Knochens in der Osteoporosediagnostik,” Der Radiologe, vol. 38<br />

pp. 853-859 , 1998.<br />

[9] M. H. Arokoski, J. P. Arokoski, P. Vainio, L. H. Niemitukia,<br />

H. Kroeger, and J. S. Jurvelin, ”Comparison <strong>of</strong> DXA and MRI<br />

methods for interpreting femoral neck bone mineral density,”<br />

Journal <strong>of</strong> Clinical Densitometry, vol. 5 pp. 289-296. 2002.<br />

[10] H. Chung, F. W. Wehrli, J. L. Williams, and S. D. Kugelmass,<br />

”Relationship between NMR transverse relaxation, trabecular bone<br />

architecture and strength,” <strong>Proceedings</strong> <strong>of</strong> the National Academy<br />

<strong>of</strong> Sciences, vol. 90 pp. 10250-10254, 1993.<br />

[11] T. B. Brismar, T. Hindmarsh, and H. Ringertz, ”Experimental<br />

correlation between T2* and ultimate compressive strength in<br />

lumbar porcine vertebrae,” Academic Radiology, vol. 4 pp. 426-<br />

430, 1997.


[12] O. Beuf, D. C. Newitt, L. Mosekilde, and S. Majumdar, ”Trabecular<br />

Structure Assessment in Lumbar Vertebrae Specimens Using<br />

Quantitative Magnetic Resonance Imaging and Relationship with<br />

Mechanical Competence,” Journal <strong>of</strong> Bone and Mineral Research,<br />

vol. 16 pp. 1511-1519, 2001.<br />

[13] T. Hildebrand, A. Laib, R. Müller, J. Dequeker, and P. Regsegger,<br />

”Direct Three-Dimensional Morphometric Analysis <strong>of</strong> Human<br />

Cancellous Bone: Microstructural Data from Spine, Femur, Iliac<br />

Crest, and Calcaneus,” Journal <strong>of</strong> Bone and Mineral Research, vol.<br />

14 pp. 1167-1174, 1999.<br />

[14] C. J. C. Bakker, R. Bhagwandien, M. A. Moerland, and M.<br />

Fuderer, ”Susceptibility artifacts in 2D FT spin-echo and gradientecho<br />

imaging: the cylinder model revisted,” Magnetic Resonance<br />

Imaging, vol. 11 pp. 539-548, 1992.<br />

[15] A. Fransson, S. Grampp, and H. Imh<strong>of</strong>, ”Effects <strong>of</strong> trabecular<br />

bone on marrow relaxation in the tibia,” Magnetic Resonance<br />

Imaging, vol. 17 pp. 69-82, 1998.<br />

[16] S. Ogawa, T. M. Lee, A. R. Kay, and D. W. Tank, ”Brain magnetic<br />

resonance imaging with contrast dependent on blood oxygenation,”<br />

<strong>Proceedings</strong> <strong>of</strong> the National Academy <strong>of</strong> Sciences <strong>of</strong> the United<br />

States <strong>of</strong> America, vol. 87 pp. 9868-9872, 1990.<br />

[17] J. C. Hindman, ”Proton resonance shift <strong>of</strong> water in gas and liquid<br />

states,” Journal <strong>of</strong> Chemical Physics, vol. 44 pp. 4582-4592, 1966.<br />

[18] V. Rieke, and K. B. Pauly, ”MR Thermometry,” Journal <strong>of</strong><br />

Magnetic Resonance, vol. 27 pp. 376-390, 2008.<br />

[19] A. Boss, H. Graf, B. Müller-Bierl, S. Clasen, D. Schmidt, P.<br />

L. Pereira, and F. Schick, ”Magnetic susceptibility effects on the<br />

accuracy <strong>of</strong> MR temperature monitoring by the proton resonance<br />

frequency method,” Journal <strong>of</strong> Magnetic Resonance Imaging, vol.<br />

22 pp. 813-820, 2005.<br />

[20] P. W. Kuchel, ”Red cell metabolism: studies using NMR spectroscopy,”<br />

<strong>Proceedings</strong> <strong>of</strong> Australian Biochemistry Society, vol. 15<br />

pp. P5-P6, 1983.<br />

- 314 - 15th IGTE Symposium 2012


- 315 - 15th IGTE Symposium 2012<br />

Application <strong>of</strong> Signal Processing Tools for Fault<br />

Diagnosis in Induction Motors-A Review<br />

*Jawad Faiz, *Amir Masoud Takbash, *Bashir Mahdi Ebrahimi and †Subhasis Nandi<br />

*Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering,<br />

College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran<br />

†Department <strong>of</strong> Electrical and Computer Engineering, <strong>University</strong> <strong>of</strong> Victoria, Victoria, BCV8W 3P6, Canada<br />

E-mail: jfaiz@ut.ac.ir<br />

Abstract—Use <strong>of</strong> efficient signal processing tools (SPTs) to extract proper indices for fault detection in induction motors (IMs)<br />

is the essential part <strong>of</strong> any fault recognition procedure. In this paper, all utilized SPTs employed in fault identification <strong>of</strong> IMs<br />

are analyzed in details. Then, their competency and their drawbacks for extracting indices in transient and steady-state modes<br />

are criticized from different aspects. The considerable experimental results are used to certificate demonstrated discussion.<br />

Different kinds <strong>of</strong> faults including eccentricity, broken bars and bearing faults as major internal faults in IMs, are<br />

investigated.<br />

Index Terms—Fault detection, Fast Fourier Transform, Hilbert, Wavelet Transform.<br />

I. INTRODUCTION<br />

Ever increasing application <strong>of</strong> induction motors (IMs)<br />

and importance <strong>of</strong> its uninterrupted operation in<br />

production lines make it necessary to diagnose internal<br />

faults in IMs quickly and precisely. The internal faults in<br />

IMs consist <strong>of</strong> electrical and mechanical faults. Electrical<br />

faults occur in the stator and rotor. Electrical faults <strong>of</strong><br />

squirrel-cage rotor <strong>of</strong> IM consist <strong>of</strong> bars and end-rings<br />

breakage which are about 10% <strong>of</strong> the internal faults <strong>of</strong><br />

squirrel-cage IM [1]. The reasons for these faults are as<br />

follows:<br />

1. Thermal stresses due to over-load and asymmetrical<br />

dissipation <strong>of</strong> heat which may change the hot spot.<br />

2. Magnetic stresses arising from electromagnetic<br />

forces.<br />

3. Mechanical stresses due to mechanical fatigue <strong>of</strong><br />

different parts, bearing damage, etc.<br />

4. Stresses due to assembling process and centrifugal<br />

forces arising from shaft torque.<br />

Some impacts <strong>of</strong> broken bars on IM are: Increasing<br />

core losses and total losses in faulty machine [2]-[4] and<br />

asymmetrical vector diagram <strong>of</strong> rotor current [2]. Broken<br />

bars and end rings faults have been studied more than<br />

other internal faults <strong>of</strong> IM. The faulty motor is studied<br />

using experimental or modeling and simulation methods.<br />

Following analysis <strong>of</strong> faulty motor by test or modeling, a<br />

proper signal must be selected. The signals used in the<br />

fault diagnosis process, consist <strong>of</strong> mechanical and<br />

electrical signals. There are three following reasons that<br />

make the stator current an appropriate signal for fault<br />

diagnosis:1) unique effect <strong>of</strong> motor internal fault on this<br />

signal. 2) there is no need to have sensor for monitoring<br />

the signal. 3) this method is economical. After testing or<br />

modeling motor and selecting a proper signal, it must be<br />

processed and effect <strong>of</strong> the proposed fault upon the signal<br />

is determined. The signal processing methods are based<br />

on the mathematical transformations. The well-known<br />

transformations are Fourier, Wavelet, Hilbert and<br />

multiple signal classification (Music). These processors<br />

are widely used in the fault diagnosis; however, recently<br />

intelligent methods such as Genetic, Fuzzy and Neural<br />

Network algorithms have been applied to make fault<br />

diagnosis methods more efficient [5]-[7]. Thus<br />

considering fault type, load conditions and the proposed<br />

processor characteristics, a particular processor will be<br />

suitable for each case. To choose an appropriate<br />

processor, different faults and operating conditions such<br />

as load are considered.<br />

II. FAST FOURIER TRANSFORM<br />

Fourier transform expresses signal as a sum <strong>of</strong><br />

sinusoidal functions. This transform expresses a timedomain<br />

signal to frequency-domain signal. This transform<br />

determines the frequency components arising from the<br />

fault. In application <strong>of</strong> Fourier transform to a signal, the<br />

signal must have two basic features: stability and<br />

alternating. A fast Fourier transform (FFT) is a faster<br />

version <strong>of</strong> the discrete Fourier transform<br />

(DFT). Application <strong>of</strong> these processors includes sampling<br />

and applying Fourier transform. Sampling has a series <strong>of</strong><br />

rules and laws as described in [7].<br />

A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />

For broken bars and end ring fault diagnosis in IM,<br />

FFT base processor is <strong>of</strong>ten used and frequency spectrum<br />

<strong>of</strong> torque, speed, instantaneous power, body vibration and<br />

stator current signals are obtained. Torque signal has been<br />

employed as reference for fault diagnosis in [3].<br />

Harmonics 2sfs are produced in the torque frequency<br />

spectrum which used for fault detection [8]. Some<br />

references use the speed signal for fault diagnosis and<br />

here harmonics 2sfs <strong>of</strong> frequency spectrum are again<br />

proposed. Figure 1 exhibits the frequency spectrum <strong>of</strong><br />

speed signal and its variations due to the broken rotor<br />

bars [3]. Torque and speed signals depend on the external<br />

factors such as load and this makes hard to diagnose the<br />

fault. Also the procedure for acquiring these signals is<br />

important, because using sensors and other devices affect<br />

the accuracy <strong>of</strong> the operation. Another signal that is<br />

considered for fault diagnosis is the case vibration signal<br />

The reason for this vibration is air gap radial


Figure 1: Frequency spectrum <strong>of</strong> motor speed for<br />

different numbers <strong>of</strong> rotor broken bars [4].<br />

electromagnetic forces. Broken bars lead to the odd<br />

harmonics in the frequency spectrum <strong>of</strong> vibration signal.<br />

Although signal with twice supply frequency has been<br />

used for fault diagnosis, this signal is not suitable because<br />

it also appears in the healthy motor vibration frequency<br />

spectrum. The above-mentioned signal depends on the<br />

load and its detection requires a sensor [9]. On the other<br />

hand, Fourier transforms application to the transient<br />

signals such as speed and vibration does not yield<br />

accurate results. Pendulum oscillations and increment <strong>of</strong><br />

<br />

[10], [11 <br />

proposed in [11] with some simplifying assumptions. As<br />

seen, broken bars generate 2sfs <br />

[10]. Instantaneous power signal can be also utilized for<br />

broken bars and end rings fault diagnosis. Advantages <strong>of</strong><br />

using instantaneous power spectrum are listed in [12].<br />

Output voltage harmonics <strong>of</strong> motor following the power<br />

supply interruption can be used to diagnose the fault. The<br />

main idea <strong>of</strong> this method is eliminating the harmonics<br />

generated by the voltage supply [13]. However, this<br />

signal is a transient signal and FFT application on this<br />

signal leads to inaccurate fault detection. Intelligent<br />

algorithms can be used to diagnose the fault through<br />

current signal envelop [14]. The drawback <strong>of</strong> this method<br />

is that the harmonics <strong>of</strong> the envelop signal depends on the<br />

severity <strong>of</strong> the rotor bar fault as well as their locations<br />

[13]. Current signal has been considered as the most<br />

appropriate signal for internal fault diagnosis. Some<br />

references [15]-[17] use time signal <strong>of</strong> the line current for<br />

fault diagnosis but this fault is <strong>of</strong>ten detected through line<br />

current harmonics [1]-[3]. The most important harmonics<br />

used for fault diagnosis are (1±2s)fs. The amplitudes <strong>of</strong><br />

these harmonics are larger than that <strong>of</strong> other harmonics<br />

and their diagnosis is easier. Amplitude <strong>of</strong> harmonic (1-<br />

2s)fs depends on the rotor broken bars fault and its<br />

intensity and harmonic (1+2s)fs is mostly depends on the<br />

speed variations [18]. It is noted that harmonic (1-2s)fs<br />

may be disappeared when broken bars has 90 degrees<br />

increased by the broken bars fault. The reason for such<br />

amplitude rising is asymmetry <strong>of</strong> the rotor due to the fault<br />

and consequently generating a negative rotating field<br />

[22]. Table I shows these harmonics before and after the<br />

- 316 - 15th IGTE Symposium 2012<br />

TABLE I<br />

AMPLITUDES OF CURRENT SIDEBANDS FOR MOTOR WITH DIFFERENT ROTOR<br />

BROKEN BARS [4]<br />

NBB fs+2fr fs-2fr<br />

0 -58 -57<br />

1 -54 -55<br />

2 -53 -48<br />

3 -48 -42<br />

4 -46 -40<br />

Figure 2: Frequency spectrum <strong>of</strong> stator current for broken<br />

end-ring [23].<br />

fault versus number <strong>of</strong> broken bars (NBB) [3]. However,<br />

raising the fault degree produces lower changes in the<br />

amplitudes <strong>of</strong> the sidebands. The reason is increasing the<br />

number <strong>of</strong> parallel paths <strong>of</strong> currents and saturation due to<br />

asymmetry <strong>of</strong> the currents passing the bars. Influence <strong>of</strong><br />

bars inner current in the broken bars fault has been<br />

proposed in [23] and its effects consisting <strong>of</strong> harmonics<br />

amplitude reduction has been mathematically proved. In<br />

[22], broken end-ring has been considered. Figure 2<br />

presents stator current frequency spectrum for such a<br />

case. Starting current signal may be used for fault<br />

diagnosis [3] where broken bars generate harmonics<br />

(3±4k)fr in the current spectrum (Figure 3). Frequency<br />

spectrum <strong>of</strong> starting transient current signal is determined<br />

STFT in which the problem <strong>of</strong> processor with the<br />

transient signal is solved. However, dimensions <strong>of</strong> the<br />

window are fixed and therefore it has not good frequency<br />

and time resolution at the same time [24]. Sometimes<br />

current is indirectly used, for instance Park transform <strong>of</strong><br />

stator current has been used for fault diagnosis [25]. Of<br />

course, this method has some drawbacks such as no-clear<br />

fault effect and susceptible to noise, so it seems that<br />

application <strong>of</strong> this method beside other techniques such as<br />

intelligent methods is useful. However, in this case a set<br />

<strong>of</strong> full data is necessary. Park transform <strong>of</strong> stator current<br />

leads to iD+jiQ Modulus and harmonics arising from the<br />

broken bars fault in line current are as 2sfs and 4sfs. The<br />

advantage <strong>of</strong> these harmonics is that these are far from the<br />

fundamental harmonic so its detection is simple.<br />

B. Impacts <strong>of</strong> Load Variation<br />

Side-band components vary with the load torque<br />

fluctuations [26], [27]. Figure 4 shows the impact <strong>of</strong> the<br />

load upon the high and low side-bands <strong>of</strong> the stator<br />

current spatial vector [21]. Load fluctuation decreases the<br />

amplitude <strong>of</strong> low-band and increases the amplitude <strong>of</strong><br />

high-band.<br />

C. Impact <strong>of</strong> Drive<br />

In the presence <strong>of</strong> drive and closed-loop circuits the<br />

situation differs. In PWM-driven motor odd harmonics as


Figure 3: Frequency spectrum <strong>of</strong> starting current: (a)<br />

healthy motor, (b) motor with 4 rotor broken bars [3].<br />

Figure 4: Impact <strong>of</strong> load upon high and low side-bands <strong>of</strong><br />

stator current spatial vector [21].<br />

well as third harmonic are injected to the motor. These<br />

harmonics are fb=(m±2nks)fs (m=supply odd harmonicorders,<br />

n=odd harmonics due to rotor induced currents<br />

and k=integer number) and odd-order harmonics currents<br />

are induced in the rotor, that subsequently produces oddorder<br />

rotor flux in the air gap. Therefore, a new frequency<br />

pattern is introduced in faulty motors under PWM supply<br />

[28]. In the closed-loop drive, mutual effects <strong>of</strong> electrical<br />

and mechanical oscillations amplify each other and<br />

amplitude <strong>of</strong> the above-mentioned frequency spectrum<br />

increases. Figure 5 shows rotor asymmetry signature in<br />

inverter-fed motor line current spectrum for healthy motor<br />

and motor with broken bars [29].<br />

D. Impact <strong>of</strong> Broken Rotor Bars Location<br />

Rotor bar location and its impact upon the fault<br />

diagnosis have been investigated and reported in [30] and<br />

effect <strong>of</strong> the broken bars location on the waveform and<br />

frequency spectrum <strong>of</strong> stator current and side-bands<br />

components (Figure 6) have been given. Influence <strong>of</strong> the<br />

broken bars location on the amplitude <strong>of</strong> the torque<br />

harmonics has been pointed out in [28]. Amplitude <strong>of</strong><br />

torque harmonic is increased by more concentration <strong>of</strong> the<br />

broken bars [31].<br />

III. WAVELET TRANSFORM<br />

Wavelet transform is a method that transforms the<br />

signal to time and frequency spectrum. This transform is<br />

based on transforming a signal to different kinds <strong>of</strong> scaled<br />

and shifted <strong>of</strong> mother wavelet function [13]. Wavelet<br />

transform enables to show some characteristics <strong>of</strong> the<br />

- 317 - 15th IGTE Symposium 2012<br />

Figure 5: Rotor asymmetry signature in inverter-fed<br />

motor line current spectrum (a) around fundamental, (b)<br />

around fifth and seventh harmonics [29].<br />

signal such as non-continuity <strong>of</strong> high-order derivatives <strong>of</strong><br />

the function and sharp point <strong>of</strong> maximum <strong>of</strong> the function<br />

that cannot be shown by other transforms, because they<br />

eliminate these characteristics during transform [13].<br />

Considering the above-mentioned points, wavelet<br />

transform gives a detailed and fully localized view <strong>of</strong> the<br />

function. Having frequency components caused by the<br />

internal fault <strong>of</strong> the motor, this transform can concentrate<br />

on particular regions and this can enhance the precision,<br />

while Fourier series provides a general view over a period<br />

<strong>of</strong> signal [12].<br />

A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />

Various wavelet transforms have been so far used for<br />

fault diagnosis. Most <strong>of</strong> these methods are based on the<br />

sidebands components <strong>of</strong> frequency spectrum <strong>of</strong> the<br />

current signal. In [28], energy <strong>of</strong> a bandwidth is used to<br />

diagnose the fault in which the load impact is also taken<br />

into account. Since discrete wavelet transform (DWT) has<br />

a better clarity over the low frequencies, the use <strong>of</strong> the<br />

current spatial vector which has harmonics with lower<br />

frequencies will yield more precise results [32]. In [33], a<br />

method based on CWT has been used to diagnose the<br />

fault in different drives. However, there is no physical<br />

interpretation for fault diagnosis using the Figurers. In<br />

[34], power spectral density (PSD) values <strong>of</strong> details signal<br />

in any level <strong>of</strong> transform is fault diagnosis criterion.<br />

Figure 7 shows the pattern <strong>of</strong> current signal wavelet<br />

transform <strong>of</strong> healthy and rotor broken bars motor [34]. In<br />

[35], the reason for application <strong>of</strong> DWT in the papers has<br />

been noted. There are not suitable physical description for<br />

results, complicated trend and algorithm <strong>of</strong> other wavelet<br />

methods and ambiguous results. In [35], fault has been<br />

diagnosed using envelope <strong>of</strong> the starting current signal<br />

and procedure has been introduced for extracting the<br />

envelope signal considering the impact <strong>of</strong> the broken bars<br />

on the settling time and amplitude <strong>of</strong> the envelope <strong>of</strong> the<br />

starting current. Also determination <strong>of</strong> wavelet main<br />

function is important in fault diagnosis. Harmonics due to<br />

torque ripples and unbalanced voltage generate harmonics<br />

similar to that <strong>of</strong> the broken bar and this reduces the


- 318 - 15th IGTE Symposium 2012<br />

Figure 6: Impact <strong>of</strong> bars location on amplitude <strong>of</strong> sidebands; (a) three broken bars in one pole and one broken bar in<br />

adjacent pole, (b) Two broken bars in one pole and two broken bars in adjacent pole, (c) One bar under each pole [30].<br />

a b<br />

Figure 7: Pattern <strong>of</strong> current signal wavelet transform, (a)<br />

healthy, (b) rotor broken bar motor [34].<br />

accuracy <strong>of</strong> the fault diagnosis process. However, this can<br />

be solved by application <strong>of</strong> DWT transform [36]. The<br />

analytical wavelet transform (AWT) is one <strong>of</strong> the wavelet<br />

transforms which has been used to diagnose the rotor<br />

broken bars fault. Advantage <strong>of</strong> this wavelet transform is<br />

keeping the characteristics <strong>of</strong> time domain, amplitude and<br />

phase as well as frequency. Amplitude is related to the<br />

proposed signal envelope and the phase is related to the<br />

time characteristics <strong>of</strong> the signal. In [37], AWT has been<br />

used to diagnose the rotor broken bars fault by the help <strong>of</strong><br />

starting signal <strong>of</strong> the IM under low level loads.<br />

B. Impacts <strong>of</strong> Load Variation<br />

Impact <strong>of</strong> load fluctuations on wavelet coefficients <strong>of</strong><br />

the stator current spectrum <strong>of</strong> a motor under broken bars<br />

fault has been studied in [38]. Table II summarizes the<br />

variations <strong>of</strong> D4 coefficient and values <strong>of</strong> a function (that)<br />

defined in the reference. The un-decimated discrete<br />

wavelet transform (UDWT) is a type <strong>of</strong> DWT in which<br />

shift- invariant has been included. This leads to a good<br />

time precision over high frequency harmonics, and good<br />

time and frequency precision over low frequency<br />

harmonics. In addition to DWT and CWT, there is<br />

another wavelet called wavelet packet decomposition<br />

(WPD), which yields more precise results but it is time<br />

consuming method [37], [39]. Sidebands move to higherorder<br />

nodes WPD transform due to load fluctuations [39].<br />

One important point in the application <strong>of</strong> this transform is<br />

the use <strong>of</strong> a proper node for fault diagnosis. For high<br />

loads the low-order nodes and for low loads high-order<br />

nodes are investigated [39]. In [40], the impact <strong>of</strong> the<br />

drive in broken bar diagnosis using wavelet transform has<br />

been proposed. Although fault diagnosis procedure and<br />

load impact have been considered in the above-mentioned<br />

reference, the location <strong>of</strong> the broken bars has not been<br />

taken into account.<br />

TABLE II<br />

VARIATIONS OF D4 COEFFICIENT OF WAVELET TRANSFORM OF CURRENT<br />

SIGNAL OF MOTOR UNDER BROKEN GAR FAULT AGAINST LOAD [38].<br />

% <strong>of</strong> rated<br />

load<br />

Mean Current<br />

(A)<br />

Mean distortion in D4 Index2<br />

0 9.54 0.0923 0.97%<br />

33 8.92 0.3220 3.61%<br />

71 8.81 0.4044 4.59%<br />

100 8.74 0.5469 6.25%<br />

133 8.57 0.5674 6.62%<br />

IV. HILBERT PROCESSOR<br />

Due to the drawbacks <strong>of</strong> the above-mentioned<br />

processors some new processors such as Hilbert processor<br />

have been introduced. Hilbert transform similar with<br />

Fourier transform is orthogonal in respect <strong>of</strong> its main<br />

transform. In addition one function and its Hilbert<br />

transform has identical energy. One type <strong>of</strong> Hilbert<br />

transform is the Hilbert- Huang transform (HHT). In<br />

HHT the energy distribution in time-frequency domain is<br />

obtained by estimation <strong>of</strong> the local energy <strong>of</strong> signal in<br />

different times and frequencies. On contrary to other<br />

time- frequency transforms which depend on the size <strong>of</strong><br />

window and sampling frequency in Fourier transform and<br />

mother wavelet in wavelet transform, HHT is independent<br />

on aforementioned parameters, that is an advantage <strong>of</strong> this<br />

transform. No-load and light load cases in rotor broken<br />

bars and end-rings have been emphasized in [41].<br />

A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />

In no-load IM there is no harmonic arising from the<br />

load, but harmonics are very close to the fundamental<br />

frequency. Here a Hilbert vector is defined for signal and<br />

using this vector instead <strong>of</strong> the proposed signal has some<br />

advantages:<br />

1. Requirement <strong>of</strong> phase current.<br />

2. Generation <strong>of</strong> harmonic components due to fault and<br />

deletion <strong>of</strong> non-applicable harmonics.<br />

Figure 8: Hilbert modulus: (a) healthy motor, (b) motor<br />

with two broken bars [42].<br />

3. Elimination <strong>of</strong> frequency scattering.<br />

4. Nonexistence <strong>of</strong> fundamental frequency that lets to


use linear scale on the vertical axes instead <strong>of</strong><br />

logarithm scale and this clarifies the graphs.<br />

5. No need to sample with twice <strong>of</strong> Nayquest<br />

frequency.<br />

Considering the proposed low frequencies the sampling<br />

speed is reduced to 0.1 <strong>of</strong> the normal case values and this<br />

is useful in practice. In [42], a fault diagnosis method<br />

based on fundamental harmonic deletion and<br />

determination <strong>of</strong> Hilbert Modulus has been introduced.<br />

Figure 8 shows Hilbert modulus for a healthy motor<br />

and a motor with broken bars. By increasing the fault<br />

degree, this modulus becomes larger due to the<br />

harmonics. In the following part a dimensionless<br />

numerical criterion with a low dependency on the load is<br />

introduced.<br />

V. MUSIC PROCESSOR<br />

Recently some methods with high frequency precision<br />

such as Music and Root-Music have been proposed [43].<br />

These methods are used where keeping a particular<br />

frequency is necessary. So in these methods, precise<br />

information on frequency components are required [19].<br />

This mathematical transform is similar with other<br />

transforms and it converts a signal to sum <strong>of</strong> several<br />

signals with identical feature. Music transform consists <strong>of</strong><br />

transform <strong>of</strong> a signal and expressing it based on K<br />

complex sinusoidal pair and a signal e(n).<br />

C. Rotor Bars and End Ring Breakage Fault Diagnosis<br />

Combination <strong>of</strong> FFT and Music methods and a<br />

method <strong>of</strong> fundamental harmonic deletion have been used<br />

for fault diagnosis in [44]; because lonely application <strong>of</strong><br />

Music leads to error. This method provides clearer results<br />

compared to FFT method. In Figure 9 the results <strong>of</strong> two<br />

methods have been compared [44]. In [45], frequency<br />

spectrum <strong>of</strong> output voltage after disconnecting the input<br />

supply obtained by FFT and Music methods and they<br />

compared. It has been shown that a series <strong>of</strong> particular<br />

harmonics in the frequency spectrum is excited due to the<br />

broken bars and variations <strong>of</strong> these variations are seen.<br />

Reliability and low impact <strong>of</strong> noise are the advantages <strong>of</strong><br />

this method compared to FFT method [45]. Music- based<br />

methods similar with Hilbert transform are new methods<br />

which have precise results and computation is quicker<br />

than Wavelet method. However, it needs improved<br />

algorithms for deletion <strong>of</strong> the fundamental harmonic<br />

which complicates these methods. Since these methods<br />

are new, many fault diagnosis indexes have not been yet<br />

modeled by these methods.<br />

VI. CONCLUSIONS<br />

Different methods and processors used for diagnosis<br />

<strong>of</strong> rotor bars and end ring breakage fault in IMs were<br />

investigated briefly. At this end, four types <strong>of</strong> processors<br />

and their advantages and drawbacks were studied. It is<br />

clear that a single method and a common processor<br />

cannot be specified for fault detection in all conditions.<br />

Fourier processor as a most applied processor for broken<br />

- 319 - 15th IGTE Symposium 2012<br />

Figure 9: Comparison <strong>of</strong> FFT and Music-based methods<br />

for motor with 1 broken bar: (a) FFT, (b) Music [44].<br />

bars fault has weak and strong points. Its most important<br />

weakness is in processing <strong>of</strong> transient signals. To<br />

overcome this problem, application <strong>of</strong> wavelet processor<br />

was suggested which provide more detailed time and<br />

frequency view <strong>of</strong> the signal. Following wavelet packet<br />

with simultaneous high precision <strong>of</strong> time and frequency is<br />

commonly used. These processors <strong>of</strong>ten are used for<br />

broken bars fault but there are no appropriate researches<br />

about number <strong>of</strong> broken bars and their location. Other<br />

drawbacks <strong>of</strong> this method are that it is time consuming<br />

and complicated. In recent years, Hilbert-based methods<br />

with high frequency precision methods such as Music<br />

have been proposed. A common point that must be taken<br />

into account in an appropriate fault diagnosis method in<br />

industry beside on-line case; is the method must quick<br />

and at the same time have good accuracy.<br />

REFERENCES<br />

[1] J. Faiz, B. M. Ebrahimi, and M. B. B. Sharifian, “Different faults<br />

and their diagnosis techniques in three-phase squirrel-cage<br />

induction motors-a review, “ Electromagnetics, vol. 26, no. 7, pp.<br />

543–569, October 2006.<br />

[2] X. Ying, “Characteristic performance analysis <strong>of</strong> squirrel cage<br />

induction motor with broken bars,” IEEE Trans. on Magnetics,<br />

vol. 45, no. 2, pp. 759-766, Feb. 2009.<br />

[3] J. Faiz, B. M. Ebrahimi, "A new pattern for detecting broken rotor<br />

bars in induction motors during start-up," IEEE Trans. on<br />

Magnetics, vol. 44, no. 12, pp. 4673-4683 ,December 2008.<br />

[4] J. Faiz, and B. M. Ebrahimi, "Determination <strong>of</strong> number <strong>of</strong> rotor<br />

broken bars and static eccentricity degree in induction motor<br />

under mixed fault," Electromagnetics, vol. 28, no. 6, pp. 433-449,<br />

August 2008.<br />

[5] M. J. Devaney and L. Eren, "Monitoring an induction motors<br />

current and detecting bearing failure," Trans. on Instrumentation<br />

and Measurement, vol.7, no.4, pp. 30- 50 December 2004.<br />

[6] B. Li, M. Y. Chow, Y. Tipusuwan, and J. C. Hung , "Neuralnetwork-based<br />

motor rolling bearing fault diagnosis,", IEEE<br />

Trans. on Industrial Electronics, pp. 1060-1069, vol. 47, no.5,<br />

Oct. 2000.<br />

[7] J. Faiz, and I. Tabatabaei, " Extension <strong>of</strong> winding function theory<br />

for non uniform Air gap in eccentric machinery," IEEE<br />

Transaction on Magnetics, vol. 38, no. 6, pp. 3654-3657,<br />

November 2002.<br />

[8] J. F. Bangura, R. J. Povinelli, N. A. Demerdash, and R. H. Brown,<br />

"Diagnostics <strong>of</strong> eccentricities and bar/end ring connector<br />

breakages in poly phase induction motors through a combination<br />

<strong>of</strong> time-series data mining and time-stepping coupled FE–statespace<br />

techniques," IEEE Trans. on Industry Applications, vol. 39,<br />

no. 4, pp. 1005-1013, Jul./Aug. 2003.<br />

[9] P. J. Rodriguez, A. Belahcen, and A. Arkkio, "Signature <strong>of</strong><br />

electrical faults in the force distribution and vibration pattern <strong>of</strong><br />

induction motors," IEE Proc.-Electr. Power Appl., vol. 153, no. 4,<br />

pp. 523-529, July 2006.<br />

[10] C. Yeh, G. Y. Sizov, A. S. Ahmed, N. A. O. Demerdash, R. J.<br />

Povinelli, E. E. Yaz, and D. M. Ionel, "A reconfigurable motor for<br />

experimental emulation <strong>of</strong> stator winding inter turn and broken<br />

bar faults in poly-phase induction machines," IEEE Trans. on


Energy Conversion, vol. 23, no. 4, pp. 1005-1014, December<br />

2008.<br />

[11] B. Mirafzal, and N. A. O. Demerdash, "Effects <strong>of</strong> load magnitude<br />

on diagnosing broken bar faults in induction motors using the<br />

pendulous oscillation <strong>of</strong> the rotor magnetic field orientation,"<br />

IEEE Trans. on Industry Applications, vol. 41, no. 3, pp. 771-<br />

783, May/June 2005.<br />

[12] G. Didier, E. Ternisien, O. Caspary, and H. Razik, "Fault<br />

detection <strong>of</strong> broken rotor bars in induction motor using a global<br />

fault index," IEEE Trans. on Ind. Appl., vol. 42, no. 1, pp. 79-88,<br />

Jan./Feb. 2006.<br />

[13] J. Milimonfared, H. Meshginkelk, S. Nandi, A. D. Minassians,<br />

and H. A. Toliyat, "A novel approach for broken-rotor-bar<br />

detection in cage induction motors," IEEE Trans. on Ind. Appl. ,<br />

vol.35, no. 5, pp. 1000-1006, Sep./Oct. 1999.<br />

[14] A. M. D. Silva, R. J. Povinelli, and N. A. O. Demerdash,<br />

"Induction machine broken bar and end stator short-circuit fault<br />

diagnostics based on three-phase stator current envelopes," IEEE<br />

Trans. on Industrial Electronics, vol. 55, no. 3, pp. 1310-1318,<br />

March 2008.<br />

[15] C. E. Kim, Y. B. Jung, S. B. Yoon, and D. Im, "The fault<br />

diagnosis <strong>of</strong> rotor bars in squirrel cage induction motors by timestepping<br />

finite element method," IEEE Trans. on Magnetics, vol.<br />

33, no. 2, pp. 2131-2134, March 1997.<br />

[16] N. M. Elkasabgy, A. R. Eastham, and G. E. Dawson, "Detection<br />

<strong>of</strong> broken bars in the cage rotor on an induction machine," IEEE<br />

Trans. on Industry Applications, vol. 22, no. 6, pp. 165-171, Jan./<br />

Feb. 1992.<br />

[17] V. F. Pires, J. F. Martins, and A. J. Pires, "Eigen vector / Eigen<br />

value analysis <strong>of</strong> a 3D current referential fault detection and<br />

diagnosis <strong>of</strong> an induction motor," Energy Conversion and<br />

Management, vol. 51, issue. 5, pp. 901-907, May 2010.<br />

[18] S. Nandi, H. A. Toliyat, and X. Li, "Condition monitoring and<br />

fault diagnosis <strong>of</strong> electrical motors- a review," IEEE Trans. on<br />

Energy Conversion, vol. 20, no. 4, pp. 719-729, December 2005.<br />

[19] M. E. H. Benbouzid, and G. B. Kilman, "What stator current<br />

processing-based technique to use for induction motor rotor faults<br />

diagnosis?," IEEE Trans. on Energy Conversion, vol. 18, no. 2,<br />

pp. 238-244, June 2003.<br />

[20] M. Riera-Guasp, M. F. Cabanas, J. A. Antonino-Daviu, M.<br />

Pineda-Sanchez, and C. H. R Garcia, "Influence <strong>of</strong><br />

Nonconsecutive Bar Breakages in Motor Current Signature<br />

Analysis for the Diagnosis <strong>of</strong> Rotor Faults in Induction Motors,"<br />

IEEE Trans. on Energy Conversion, vol. 25, no.1, pp. 80-89,<br />

Jan./ Feb. 2010.<br />

[21] A. Bellini, F. Filippetti, G. Franceschini, C. Tassoni, and G. B.<br />

Kilman, "Quantitative evaluation <strong>of</strong> induction motor broken bars<br />

by means <strong>of</strong> electrical signature analysis," IEEE Trans. on<br />

Industry Applications, vol. 37, no. 5, pp. 1248-1255, Sep./Oct.<br />

2001.<br />

[22] J. F. Bangura, and N. A. Demerdash, "Diagnosis and<br />

characterization <strong>of</strong> effects <strong>of</strong> broken bars and connectors in<br />

squirrel-cage induction motors by a time-stepping coupled finite<br />

Element-state space modeling approach," IEEE Trans. on Energy<br />

Conversion, vol. 14, no. 4, pp. 1167-1176, December 1999.<br />

[23] R. F. Walliser, and C. F. Landy, "Determination <strong>of</strong> inter bar<br />

current effects in the detection <strong>of</strong> broken rotor bars in squirrel<br />

cage induction motors," IEEE Trans. on Energy Conversion, vol.<br />

9, no. 1, pp. 152-158, March 1994.<br />

[24] P. V. Goode, and M. Chow, "Using a neural/fuzzy system to<br />

extract heuristic knowledge <strong>of</strong> incident faults in induction motors:<br />

part II-application," IEEE Trans. on Industrial Electronics, vol.<br />

42, no. 2, pp. 139-146 , April 1995.<br />

[25] M. Haji, and H. A. Toliyat, "Pattern recognition- a technique for<br />

induction machine rotor broken bar detection," IEEE Trans. on<br />

Energy Conversion , vol. 16, no. 4, pp. 312-317, December 2001.<br />

[26] J. Faiz, B. M. Ebrahimi, and M. B. B. Sharifian, "Time Stepping<br />

Finite Element Analysis Of Broken Bars Fault In A Three-Phase<br />

Squirrel-Cage Induction Motor," Progress in Electromagnetics<br />

Research, PIER, vol. 68, pp. 53-70, 2007.<br />

[27] J. Faiz, B. M. Ebrahimi, H.A. Toliyat, and W.S. Abu-Elhaija,<br />

"Mixed – fault diagnosis in induction motors considering varying<br />

load and broken bars location," Energy Conversion and<br />

Management, vol. 51, issue. 7, pp. 1432-1441, July 2010.<br />

[28] D. F. Pires, V. F. Pires, J.F. Martins, and A.J. Pires, "Rotor cage<br />

fault diagnosis in three-phase induction motors based on a current<br />

- 320 - 15th IGTE Symposium 2012<br />

and virtual flux approach," Energy Conversion and Management,<br />

vol. 50, issue. 4, pp. 1026-1032, April 2009.<br />

[29] B. Akin, U. Orguner, H. A. Toliyat, and M. Rayner, "Low order<br />

PWM inverter harmonics contributions to the inverter-fed<br />

induction machine fault diagnosis," IEEE Trans. on Industrial<br />

electronics, vol. 55, no. 2, pp. 610-619, February 2008.<br />

[30] J. Faiz, and B. M. Ebrahimi, "Locating rotor broken bars in<br />

induction motors using finite element method," Energy<br />

Conversion and Management, vol. 50, pp. 125-131, August 2008.<br />

[31] S. H. Kia, H. Henano, and G. A. Capolino, "Diagnosis <strong>of</strong> Brokenbar<br />

Fault in Induction Machines Using Discrete Wavelet<br />

Transform Without Slip Estimation," IEEE Trans. on Industry<br />

Applications, vol. 45, no. 4, pp. 1395-1404, July/August 2009.<br />

[32] Z. K. Peng, M. R. Jackson , J. A. Rongong , F. L. Chu , and R.<br />

M. Parkin, " On the energy leakage <strong>of</strong> discrete wavelet transform,"<br />

Mechanical Systems and Signal Processing, vol. 29, pp. 330-343,<br />

2009.<br />

[33] I. P. Georgakopoulos, E. D. Mitronikas, and A. N. Safacas, "<br />

Condition Monitoring <strong>of</strong> an Inverter-Driven Induction Motor<br />

Using Wavelets," Advanced Electromechanical Motion Systems<br />

& Electric Drives Joint Symposium, 2009. ELECTROMOTION<br />

2009. 8th International Symposium on, pp.1-5, 1-3 July 2009.<br />

[34] J. Cusido, L. romeral, J. A. Ortega, A. Rosero and A. G.<br />

Espinosa, "Fault Detection in Induction Machines Using Power<br />

Spectral Density in Wavelet decomposition," IEEE Trans. on<br />

Industrial Electronics, vol. 55, no. 2, pp. 633-643, February 2008.<br />

[35] Randy Supangat, NesimiErtugrul, Wen l.Soog, Douglas A.Gray<br />

and Colin Hasen, "Broken Rotor Bar fault Detection in induction<br />

Motors Using Starting Current Analysis," EPE 2005, pp. 1-1,<br />

2005.<br />

[36] J. A. Daviu, M. R. Guasp, J. R. Floch, and P. M. Palomares, "<br />

Validation <strong>of</strong> a New Method for the Diagnosis <strong>of</strong> Rotor Bar<br />

Failures via Wavelet Transform in Industrial Induction<br />

Machines," IEEE Trans. on Industry Applications, vol. 42, no. 4,<br />

pp. 990-996, July/August 2006.<br />

[37] Z. Ye, B. Wu, and A. Sadeghian, "Current Signature Analysis <strong>of</strong><br />

Induction Motor Mechanical Faults by Wavelet Packet<br />

Decomposition," IEEE Trans. on Industrial Electronics, vol. 50,<br />

no. 4, pp. 1217- 1228, December 2003.<br />

[38] J. Faiz, B.M. Ebrahimi, B. Asaie, R. Rajabioun, and H. A.<br />

Toliyat, "A Criterion Function for Broken Bar Fault Diagnosis in<br />

Induction Motor under Load Variation using Wavelet Transform,"<br />

Electrical Machines and Systems, 2007. ICEMS. International<br />

Conference on, pp.1249-1254, 8-11 Oct. 2007.<br />

[39] S. A. Saleh, T.S. Radwan, and M.A. Rahman, "Real- Time <strong>of</strong><br />

WPT-Based Protection <strong>of</strong> Three-Phase Vs PWM Inverter-Fed<br />

Motors," IEEE Trans. on Power Delivery, vol. 22, no. 4, pp.<br />

2108-2115, October 2007.<br />

[40] M. A. S. K. Khan and M. A. Azizur Rahman, "A New Wavelet<br />

Based Diagnosis and Protection <strong>of</strong> Faults in Induction Motor<br />

Drives," Power Electronics Specialists Conference, 2008. PESC<br />

2008. IEEE, pp.1536-1541, 15-19 June 2008.<br />

[41] I. Aydin, M. Karakose, E. Akin, "A new method for early fault<br />

detection and diagnosis <strong>of</strong> broken rotor bars," Energy Conversion<br />

and Management, vol. 52, issue 4, pp. 1790-1799, April 2011.<br />

[42] L. Sun, "Detection <strong>of</strong> Rotor Bar Breaking Fault in Induction<br />

Motors Based on Hilbert Modulus Gyration Radius <strong>of</strong> Filtered<br />

Stator current Signal," Electrical Machines and Systems, 2008.<br />

ICEMS 2008. International Conference on, pp. 877-881, 17-20<br />

Oct. 2008.<br />

[43] L. A. Periera, D. Fernandes, D. S. Gazzana, F. B. Libano, and S.<br />

Haffner, "Application <strong>of</strong> Welch, burg and music methods to the<br />

detection <strong>of</strong> rotor cage faults <strong>of</strong> induction motors," Transmission<br />

& Distribution Conference and Exposition: Latin America, 2006.<br />

TDC '06. IEEE/PES, pp. 1-6, 15-18 Aug. 2006.<br />

[44] B. Xu, S. liu and L. Sun, "A novel method for the early detection<br />

<strong>of</strong> broken rotor bars in squirrel cage induction motors," Electrical<br />

Machines and Systems, 2008. ICEMS 2008. International<br />

Conference on, pp. 751-754, 17-20 Oct. 2008.<br />

[45] S. Hamdani, A. Bouzida, O. Touhami, and R. Ibtiouen,<br />

"Diagnosis <strong>of</strong> Rotor Fault in Induction Motors Using Music<br />

Analyses <strong>of</strong> Terminal Voltage after Switch <strong>of</strong>f,", 18th<br />

International Conference on Electrical Machines, pp. 1-5, 6-9<br />

Sept. 2008.0


†<br />

- 321 - 15th IGTE Symposium 2012<br />

Experimental Calibration <strong>of</strong> Numerical Model<br />

<strong>of</strong> Thermoelastic Actuator<br />

*L. Voracek, *V. Kotlan and *B. Ulrych<br />

*<strong>University</strong> <strong>of</strong> West Bohemia, Faculty <strong>of</strong> Electrical Engineering, Univerzitní 26, 30614, Pilsen, Czech Republic<br />

Abstract—A numerical model <strong>of</strong> the thermoelastic actuator for accurate settings <strong>of</strong> position is compared with the data<br />

obtained by measurements on an experimental prototype. Some disagreements between the results became the reason for a<br />

calibration <strong>of</strong> the model carried out using an appropriate iterative process.<br />

Index Terms—Actuator, FEM, Measurement, Thermoelasticity.<br />

I. INTRODUCTION<br />

Various industrial technologies work with extremely<br />

small and accurate shifts on the order <strong>of</strong> 10 –3 to 10 –6 m.<br />

One way <strong>of</strong> reaching that small shifts and exact positions<br />

is using a thermoelastic actuator. Its theoretical<br />

backgrounds are described in previous papers written by<br />

our group (i.e., [1], [2]). Recently, we also built a<br />

prototype <strong>of</strong> the device and started experimental verifying<br />

the calculated results. Certain discrepancies between the<br />

computations and measurements lead to necessity <strong>of</strong><br />

calibrating the material parameters and their temperature<br />

dependences.<br />

II. FORMULATION OF THE PROBLEM<br />

The arrangement <strong>of</strong> the device is depicted in Fig. 1.<br />

The dilatation element 2 made <strong>of</strong> a suitable electrical<br />

conductive metal is inserted into a coil 3 fixed in frame 4.<br />

The coil is supplied by harmonic current. The whole<br />

system is placed in an insulating shell 1. The device is<br />

clamped by its bottom part 5 in the basement 6 that is<br />

supposed to be perfectly stiff. The time-variable magnetic<br />

field generated by the field coil 3 induces in the dilatation<br />

element 2 eddy currents. These eddy currents produce<br />

heat and consequent geometrical changes (mainly in its<br />

longitudinal direction z ) <strong>of</strong> the dilatation element.<br />

r<br />

6 5 4 3 2 1<br />

Figure 1. The basic arrangement <strong>of</strong> the device<br />

1 –shell, 2 – dilatation element, 3 – field coil, 4 – fixing frame, 5 –front,<br />

6 – stiff wall<br />

III. MATHEMATICAL MODEL<br />

Mathematical modelling <strong>of</strong> the device represents a<br />

triply coupled problem. Its mathematical model consists<br />

<strong>of</strong> three partial differential equations describing the<br />

distribution <strong>of</strong> magnetic field, temperature field and field<br />

<strong>of</strong> thermoelastic displacements.<br />

The device does not contain any ferromagnetic part. If<br />

the field coil 3 (Fig. 1) carries harmonic current <strong>of</strong><br />

z<br />

density J , the magnetic field in the system may be<br />

ext<br />

described by the Helmholtz partial differential equation<br />

for the phasor A <strong>of</strong> the magnetic vector potential A in<br />

the form [3]<br />

curlcurlA j A J<br />

. (1)<br />

Here, symbol denotes the magnetic permeability, is<br />

the electric conductivity, stands for the angular<br />

frequency, and J is the phasor <strong>of</strong> external harmonic<br />

ext<br />

current density in the field coil. The conditions along the<br />

axis <strong>of</strong> the device and artificial boundary placed at a<br />

sufficient distance from the system are <strong>of</strong> the Dirichlet<br />

type ( A 0 ).<br />

Heat power in the system is generated by currents in<br />

the field coil and induced currents in the dilatation<br />

element. The distribution <strong>of</strong> the temperature in the system<br />

can be described by equation [4]<br />

T<br />

div grad T cp pJ<br />

,<br />

t<br />

where stands for the thermal conductivity, is the<br />

specific mass, and c p denotes the specific heat at a<br />

constant pressure. Finally, the symbol p stands for the<br />

J<br />

time average internal sources <strong>of</strong> heat represented by the<br />

volumetric Joule losses. These are given by the formula<br />

2<br />

ext<br />

(2)<br />

J<br />

p J , J J + J , J = j <br />

A , (3)<br />

ext ind ind<br />

<br />

where symbol J denotes the current density in the<br />

ind<br />

electrically conductive parts <strong>of</strong> the device.<br />

The boundary conditions should take into account<br />

both convection and radiation. But since between the<br />

dilatation element 2 and field coil 3 there is a ceramic<br />

tube characterized by a very poor thermal conductivity<br />

and all parts are placed in a Teflon insulating shell<br />

(characterized also by a poor thermal conductivity),<br />

radiation can be – with a practically negligible error –<br />

disregarded.<br />

The last considered field is the field <strong>of</strong> thermoelastic<br />

displacements. The distribution <strong>of</strong> displacements in the<br />

dilatation element 2 follows from the solution <strong>of</strong> the<br />

Lamé equation [5]


grad div <br />

3 2 T grad T<br />

u u <br />

f 0 ,<br />

where 0, 0 are coefficients associated with<br />

material parameters by the relations<br />

E E<br />

<br />

1 1 2 2 1 <br />

<br />

, .<br />

Here E is the modulus <strong>of</strong> elasticity and denotes<br />

the Poisson coefficient <strong>of</strong> the contraction. Finally, symbol<br />

u ur , u, u z represents the vector <strong>of</strong> the displacement,<br />

is the coefficient <strong>of</strong> the linear thermal dilatability <strong>of</strong><br />

T<br />

the material, and f stands for the vector <strong>of</strong> the internal<br />

volumetric forces. These consist (at least in the dilatation<br />

element 2) <strong>of</strong> the gravitational and Lorentz volumetric<br />

forces. But in comparison with the thermoelastic strains<br />

and stresses they are very small and may be neglected.<br />

The boundary conditions depend on the particular<br />

arrangement. In the solved case the displacements <strong>of</strong> the<br />

dilatation element 2 at the place <strong>of</strong> clamping are assumed<br />

to be equal to zero.<br />

IV. NUMERICAL SOLUTION<br />

The numerical solution was performed by a<br />

combination <strong>of</strong> pr<strong>of</strong>essional codes COMSOL<br />

Multiphysics and Matlab that were supplemented with a<br />

lot <strong>of</strong> own procedures and scripts. Special attention was<br />

paid to the convergence <strong>of</strong> the results (dependence <strong>of</strong> the<br />

distribution <strong>of</strong> physical fields on the density <strong>of</strong><br />

discretization meshes and in the case <strong>of</strong> electromagnetic<br />

field also on the position <strong>of</strong> the artificial boundary). Some<br />

part <strong>of</strong> solution was verified with results obtained from<br />

code Agros2D. This is a new program from the category<br />

<strong>of</strong> FEM based s<strong>of</strong>tware developed by group at our<br />

department. This program is very strong for the solution<br />

<strong>of</strong> various physical fields but nowadays it is still in a test<br />

version and some specific parts are not finished yet.<br />

Therefore it cannot be used for the nonlinear systems.<br />

V. ILLUSTRATIVE EXAMPLE AND RESULTS<br />

An illustrative example concerns a prototype <strong>of</strong> the<br />

thermoelastic actuator built in our laboratory. Its<br />

arrangement and dimensions are depicted in Fig. 2. The<br />

field coil is wound from a copper wire with diameter<br />

D w 0,63 mm and has approximately 2400 turns. This<br />

coil is supplied by a harmonic current <strong>of</strong> frequencies<br />

519 Hz, 1005 Hz and 2210 Hz, respectively, whose<br />

RMS values were 0.5 A and 1 A. The dilatation element<br />

is made <strong>of</strong> brass UNS C26000, whose nonlinear<br />

temperature-dependent characteristics <strong>of</strong> most important<br />

physical parameters are depicted in next Figs. 3–5. Other<br />

parts exhibit mainly the function <strong>of</strong> electrical and thermal<br />

insulation. All remaining parameters for computation are<br />

listed in Tab. 1. The entire device is fixed in a firm steel<br />

construction for its correct functioning.<br />

- 322 - 15th IGTE Symposium 2012<br />

(4)<br />

(5)<br />

Figure 2: Arrangement <strong>of</strong> the considered actuator (dimensions in mm):<br />

1–nylon shell, 2–brass core, 3–copper coil, 4–Teflon fixing part <strong>of</strong> the<br />

coil, 5–nylon stand, 6–metallic stand, 7–ceramic tube, 8–nylon cap<br />

Figure 3: Temperature dependence <strong>of</strong> thermal expansion coefficient for<br />

brass UNS C26000 (data from [6]).<br />

Figure 4: Temperature dependence <strong>of</strong> the thermal conductivity for brass<br />

UNS C26000 (data from [6]).<br />

Figure 5: Temperature dependence <strong>of</strong> the electrical conductivity for<br />

brass UNS C26000 (data from [6]).


Table 1: Physical properties <strong>of</strong> particular materials for the initial step <strong>of</strong><br />

simulation.<br />

brass Teflon ceramic nylon copper<br />

rel. permeability [-] 1 1 1 1 1<br />

rel. permittivity [-] 1 1 1 1 1<br />

el. conductivity [S/m] 1.5e7 0 0 0 5.7e7<br />

therm. cond. [W/(mK)] 115 0.24 1.6 0.26 395<br />

density [kg/m 3 ] 8440 2220 2500 1150 8930<br />

spec. heat cap [J/(kgK)] 375 1050 1090 1100 313<br />

Young. modulus [Pa] 9.79e10<br />

Poisson ratio [-] 0.301<br />

therm. expans. coeff. [1/K] 18.7e–6<br />

A. Measurements<br />

The physical model <strong>of</strong> the thermoelastic actuator (see<br />

Fig. 6) was designed by our group at the <strong>University</strong> <strong>of</strong><br />

West Bohemia. This is the first prototype that can help us<br />

to validate the results <strong>of</strong> several projects <strong>of</strong> devices based<br />

on the principle <strong>of</strong> thermoelasticity.<br />

Figure 6: The thermoelastic actuator – manufactured prototype.<br />

We also designed and manufactured a measurement<br />

stand intended for fixing <strong>of</strong> the device, which<br />

significantly contributes to the accuracy <strong>of</strong><br />

measurements. The rigidity <strong>of</strong> the measurement device is<br />

<strong>of</strong> extreme importance for measuring such small shifts.<br />

The measuring stand is supposed to be connected with a<br />

dynamometer in the future with the aim <strong>of</strong> making use<br />

the device for another purpose: the actuator can also act<br />

as a source <strong>of</strong> large forces produced by small shifts <strong>of</strong> the<br />

dilatation element.<br />

Figure 7: Arrangement <strong>of</strong> the measurement.<br />

The measuring circuit (see Fig. 7) consists <strong>of</strong> a<br />

measuring rack, thermoelastic actuator, function<br />

generator, amplifier, capacitance decade, auxiliary<br />

resistor and oscilloscope. The measurements <strong>of</strong> the<br />

thermoelastic actuator are carried out in several regimes<br />

- 323 - 15th IGTE Symposium 2012<br />

characterized by the above frequencies and RMS values<br />

<strong>of</strong> the field current. A sinusoidal signal delivered from<br />

the frequency generator is amplified by the amplifier to<br />

the desired value <strong>of</strong> the current at a given frequency.<br />

A small resistor connected to the thermoelastic<br />

actuator in series is used for measuring the current<br />

through the circuit. We oscilloscopically measured the<br />

voltage at the above resistor and the value <strong>of</strong> the current<br />

was determined using the Ohm law from the measured<br />

voltage and the known resistance. Using <strong>of</strong> an ammeter is<br />

inappropriate due to its internal resistance. This would<br />

increase the total resistive load and the input signal could<br />

not be amplified sufficiently by the used amplifier. The<br />

thermoelastic actuator can be considered an RL circuit<br />

and we obtain the maximum current through it by getting<br />

it into resonance. This can be achieved by adding a serial<br />

capacitor. For given values <strong>of</strong> R and L and prescribed<br />

frequency it is very easy to calculate its capacitance<br />

C using the well-known Thomson relation<br />

2 f <br />

1<br />

LC<br />

.<br />

(6)<br />

For compensation we used rolled capacitors whose<br />

capacitance was determined with respect to the given<br />

frequency. But we had to respect the available types <strong>of</strong><br />

the capacitors, which did not allow reaching exactly the<br />

desired values <strong>of</strong> capacitances from (6). And this explains<br />

the above values <strong>of</strong> frequencies that differ from<br />

“reasonable” values <strong>of</strong> 500 , 1000 , and 2000 Hz.<br />

The displacement <strong>of</strong> the top front <strong>of</strong> the dilatation<br />

element was measured using digital indicator MarCator<br />

1088 (accuracy 0.001mm) in the time interval from 0 to<br />

300 seconds with increments equal to 30 seconds. In the<br />

same intervals we measured the temperature inside the<br />

brass core. The measurements <strong>of</strong> the internal temperature<br />

were performed on the coil. The corresponding values<br />

were plotted into graphs for a comparison with the results<br />

from the mathematical model.<br />

The measurements performed on an experimental<br />

prototype were used for calibration. The measured results<br />

are shown in Figs. 8 and 9. Figure 10 shows the<br />

dependence between the temperature and displacement in<br />

the brass core 2.<br />

Figure 8: Time dependence <strong>of</strong> the temperature for the specified<br />

parameters (current RMS value 1 A and frequencies 519, 1005 and 2210<br />

Hz) <strong>of</strong> the field current).


B. Numerical simulation<br />

For numerical solution we used the previously<br />

mentioned programs and algorithms based on the FE<br />

method. At the beginning we created a geometrical model<br />

according to the technical drawing, which was used for<br />

manufacturing <strong>of</strong> the prototype. In the first step <strong>of</strong><br />

simulation we used the material properties according to<br />

the Tab. 1, whose were found in the base material<br />

datasheets.<br />

Figure 9: Time dependence <strong>of</strong> the displacement for the specified<br />

parameters (current RMS value 1 A and frequencies 519, 1005 and 2210<br />

Hz) <strong>of</strong> the field current.<br />

Figure 10: Dependence <strong>of</strong> the displacement on the temperature for the<br />

specified parameters (current RMS value 1 A and frequencies 519, 1005<br />

and 2210 Hz) <strong>of</strong> the field current.<br />

The time dependence <strong>of</strong> the temperature and<br />

displacement (Figs. 11 and 12) show the large<br />

discrepancy between the measurement and simulation.<br />

Figure 11: Time dependence <strong>of</strong> displacement for the first solution –<br />

current RMS value 1 A and selected frequency 1005 Hz.<br />

- 324 - 15th IGTE Symposium 2012<br />

These discrepancies were obviously brought about by<br />

the temperature-dependent material properties used in the<br />

numerical simulation (that differed from the real<br />

properties <strong>of</strong> those used for building the physical model)<br />

and also differences between the real geometry <strong>of</strong> the<br />

prototype and geometry used for the numerical model.<br />

Therefore, we checked all dimensions <strong>of</strong> the model and<br />

prototype. The next step was usage <strong>of</strong> some iteration<br />

processes to found new material properties to get the<br />

better agreement <strong>of</strong> the results. Especially we focused on<br />

the brass, because we did not know its exact designation<br />

and chemical composition <strong>of</strong> this material. All materials<br />

used in the prototype have the relative permeability near<br />

to one and are not ferromagnetic, therefore all used<br />

nonlinear characteristics are just the temperature<br />

dependencies (see Figs. 3, 4 and 5). After several<br />

corrections we found the satisfying values <strong>of</strong> material<br />

properties and comparing them with available database <strong>of</strong><br />

materials [6] we found that the used brass should be UNS<br />

C26000, whose characteristics are in the mentioned<br />

figures and were used for the final solution.<br />

Figure 12: Time dependence <strong>of</strong> temperature for the first solution –<br />

current RMS value 1 A and selected frequency 1005 Hz.<br />

The next two figures show an acceptable agreement<br />

between the measurement and simulation, for the<br />

frequency f 1005 Hz and RMS value <strong>of</strong> current<br />

I 1A.<br />

in<br />

Figure 13: Time dependence <strong>of</strong> displacement for the final solution –<br />

current RMS value 1 A and selected frequency 1005 Hz.


Figure 14: Time dependency <strong>of</strong> temperature for the final solution –<br />

current RMS value 1 A and selected frequency 1005 Hz.<br />

Figures 15 and 16 depict the distribution <strong>of</strong> the<br />

temperature and displacement in the dilatation element<br />

for time t 300 s , frequency f 1005 Hz and current<br />

I 1A.<br />

in<br />

Figures 17 and 18 show the final comparison <strong>of</strong> the<br />

measured data with the results <strong>of</strong> simulation for all three<br />

frequencies ( 519 Hz , 1005 Hz and 2210 Hz ).<br />

From the results is visible a small discrepancy,<br />

especially for the highest frequency. This can be brought<br />

about by the incorrect temperature-dependent<br />

characteristic <strong>of</strong> the thermal conductivity. Therefore, it is<br />

necessary to perform next steps to find the exact model.<br />

Figure 15: Graphical presentation <strong>of</strong> obtained results for total<br />

displacement <strong>of</strong> the dilatation element in time t = 300 s, for frequency<br />

f = 1005 Hz and current Iin = 1 A.<br />

- 325 - 15th IGTE Symposium 2012<br />

Figure 16: Graphical presentation <strong>of</strong> obtained results for temperature in<br />

the model <strong>of</strong> the thermoelastic actuator in time t = 300 s, for frequency<br />

f = 1005 Hz and current Iin = 1 A.<br />

Figure 17: Comparison <strong>of</strong> the final solution results <strong>of</strong> the time<br />

dependency <strong>of</strong> displacement with the measured data – for current RMS<br />

value Iin = 1 A.<br />

Figure 18: Comparison <strong>of</strong> the final solution results <strong>of</strong> the time<br />

dependency <strong>of</strong> temperature with the measured data – for current RMS<br />

value Iin = 1 A.


VI. CONCLUSION<br />

Thermoelasticity may prove to be a mighty tool in<br />

some applications where setting <strong>of</strong> accurate position is<br />

needed. The process <strong>of</strong> reaching the required dilatation is<br />

slow, but reliable.<br />

Nevertheless, accuracy <strong>of</strong> the results strongly depends<br />

on correctness <strong>of</strong> the input data as is shown in this paper.<br />

Further research will be, therefore, aimed at possibilities<br />

<strong>of</strong> their improvement. At this time we are preparing the<br />

measurements <strong>of</strong> material properties <strong>of</strong> the available<br />

materials and we want to make the brass spectroscopy to<br />

find the correct chemical composition to improve the<br />

model. And, <strong>of</strong> course, we need to improve the material<br />

properties <strong>of</strong> other materials in the model that are not so<br />

important like the brass, but they can affect the results as<br />

well.<br />

Another important aim is to accelerate the heating<br />

process. New possibilities are investigated in this<br />

direction, based on using variable amplitude <strong>of</strong> the field<br />

current, which can be realized, for example, by pulsewidth<br />

modulation.<br />

VII. ACKNOWLEDGMENT<br />

This work was supported by the <strong>University</strong> <strong>of</strong> West<br />

Bohemia grant system (project No. SGS-2012-039) and<br />

by the Grant Agency <strong>of</strong> the Czech Republic (project<br />

102/11/0498).<br />

REFERENCES<br />

[1] I. Dolezel, B. Ulrych and V. Kotlan, “Combined Actuator for<br />

Accurate Setting <strong>of</strong> Position Based on Thermoelasticity Produced<br />

by Induction Heating”, in IEEE Transaction on Industry<br />

Applications, Vol. 47, No. 5, 2011, ISSN 0093-9994, p. 2250–<br />

2256.<br />

[2] Doležel, I., Kotlan, V., Ulrych, B. Electromagnetic-thermoelastic<br />

actuator for accurate wide-range setting <strong>of</strong> position. Przeglad<br />

Elektrotechniczny, 2011, Vol. 87, No. 5, p. 22-27. ISSN: 0033-<br />

2097<br />

[3] Kuczmann, M.: Iványi, A.: The Finite Element Method in<br />

Magnetics. Akademiai Kiado, Budapest, 2008.<br />

[4] Holman, J.P.: Heat Transfer. McGrawHill, NY, 2002.<br />

[5] Boley, B., Wiener, J.: Theory <strong>of</strong> Thermal Stresses. NY, 1960.<br />

[6] MPDB Database <strong>of</strong> materials: www.jahm.com.<br />

- 326 - 15th IGTE Symposium 2012


- 327 - 15th IGTE Symposium 2012<br />

Scattering Calculations <strong>of</strong> Passive UHF-RFID<br />

Transponders<br />

*Thomas Bauernfeind, *Gergely Koczka, *Kurt Preis and *Oszkár Bíró<br />

*Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, 8010 <strong>Graz</strong>, Austria<br />

E-mail: t.bauernfeind@TU<strong>Graz</strong>.at<br />

Abstract—Beside the energy extraction capability from impinging electromagnetic waves provided by the interrogator, the<br />

signal strength <strong>of</strong> the field scattered by the transponder is also a quality criterion <strong>of</strong> UHF-RFID transponder tags. In general,<br />

the transponder antenna is conjugate complex matched to the nominal transponder IC impedance to achieve maximum<br />

energy extraction. The reverse link from the transponder to the reader is commonly not taken into account. Due to the<br />

modulation technique and a voltage limiter circuit at the analog IC frontend, the input impedance <strong>of</strong> the IC is strongly<br />

nonlinear. In the present paper a method is proposed which is able to analyze the influence <strong>of</strong> this nonlinearity on the<br />

scattered signal by separating the scattered field <strong>of</strong> the transponder to a reference scattering problem and a pure radiation<br />

problem.<br />

Index Terms— Antenna Impedance, Radar Cross Section, UHF-RFID.<br />

I. INTRODUCTION<br />

In passive backscattering applications like UHF-RFID<br />

(ultra high frequency-radio frequency identification) the<br />

communication between the interrogator unit (reader) and<br />

the transponder is established by means <strong>of</strong> modulating the<br />

radar cross section (RCS) <strong>of</strong> the transponder. In general,<br />

for UHF-RFID applications, this is done by switching the<br />

analog input impedance between two states in phase with<br />

the data stream to be transmitted, e.g. the EPC (electronic<br />

product code) value [1], [2]. Unfortunately, the analog IC<br />

input impedance is not constant, indeed it has a strong<br />

nonlinear behavior versus applied power e.g. as shown in<br />

Figure 1. This behavior is mainly caused by a voltage<br />

limiter at the transponder IC’s frontend and on the power<br />

consumption <strong>of</strong> the IC which is determined by the actual<br />

mode <strong>of</strong> operation <strong>of</strong> the transponder [3]. Hence, the<br />

transponder input impedance is a function <strong>of</strong> the induced<br />

antenna voltage [4]. To capture this nonlinear behavior,<br />

an iterative full-wave simulation <strong>of</strong> the whole channel<br />

including the reader antenna, the air volume and the tag<br />

antenna as proposed in [5] should be applied, taking into<br />

account the feedback <strong>of</strong> the tag on the reader. The IC<br />

behavior is commonly taken into account by means <strong>of</strong><br />

circuital co-simulation [6]. Due to the huge problem<br />

domain, this technique is unpractical to carry out<br />

optimization investigations. A possibility to reduce the<br />

computational costs is to describe the scattered field <strong>of</strong><br />

the nonlinearly terminated tag antenna in terms <strong>of</strong> a<br />

reference scattering field and a pure radiated field from<br />

the excited transponder antenna [7]. Since, in general, the<br />

effect <strong>of</strong> the tag on the reader field is small, the feedback<br />

on the reader is neglected in a first approximation hence,<br />

the scattered field can be calculated by superimposing<br />

those fields applying the finite element method.<br />

II. GENERALIZED SCATTERING MATRIX<br />

In Figure 2a) a typical UHF-RFID application<br />

consisting <strong>of</strong> a transponder tag and a reader antenna is<br />

shown. Following the approach described in [7], the<br />

situation at the RFID-transponder tag can be modeled as<br />

shown in Figure 2b) where the field situation is described<br />

in terms <strong>of</strong> spherical wave modes. A mathematical<br />

description <strong>of</strong> the simplified transponder model is given<br />

by the generalized scattering matrix [7], [8]:<br />

Resistance in Ohm<br />

100<br />

90<br />

80<br />

70<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

0<br />

b S S S a<br />

d S c<br />

d S S c<br />

00 01 0 N<br />

1 S 10 ... c 1 . (1)<br />

N N0NN N<br />

Real{Z_IC} un-modulated<br />

Real{Z_IC} modulated<br />

Imag{Z_IC} un-modulated<br />

Imag{Z_IC} modulated<br />

-10 -8 -6 -4 -2 0 2 4 6 8 10<br />

Pa in dBm<br />

Figure 1: Nonlinear transponder IC impedance versus applied power<br />

(NXP Ucode G2X).<br />

In (1), a relationship between the complex applied (a)<br />

and reflected (b) waves at the transmission line<br />

connecting the load impedance to the antenna and the<br />

incoming (cn) and outgoing (dn) spherical wave mode<br />

series is given.<br />

a) b)<br />

Figure 2: a) Typical RFID application. b) Simplified transponder<br />

model.<br />

0<br />

-50<br />

-100<br />

-150<br />

-200<br />

-250<br />

Reactance in Ohm


In case <strong>of</strong> an antenna driven by external waves<br />

(c1,… cN), the mode on the feed transmission line reflects<br />

at the load impedance defined by the load reflection<br />

coefficient given by<br />

Z L Z 0<br />

Z ZL Z 0 Z 0 . (2)<br />

0<br />

So the total field can be written as:<br />

N<br />

b 1 S 00 S0m c m<br />

m 1 m<br />

N<br />

S0 1 0 , (3)<br />

t<br />

dn S nn0 0 b<br />

N<br />

Snmc m<br />

m 1<br />

c<br />

1 m<br />

N<br />

S<br />

1 nm S . (4)<br />

Combining (3) and (4), the total outgoing field dn t can be<br />

calculated as:<br />

t<br />

dn S N<br />

n<br />

0<br />

S<br />

0<br />

m c m<br />

1 S00 00 m 1 N<br />

S nm nmc c m<br />

m<br />

1<br />

m<br />

0<br />

S S0<br />

. (5)<br />

In (5), S00 is the antenna port reflection coefficient given<br />

to be<br />

Z Ant A t Z 0<br />

S00<br />

Z Ant Z0<br />

0 Z 0 . (6)<br />

0<br />

The coefficients S0m are the transmission coefficients<br />

from the antenna to the transmission line, Sn0 the<br />

transmission coefficients from the transmission line to the<br />

antenna and Snm are the mode reflection coefficients<br />

directly connecting the incoming and outgoing wave<br />

modes. The first term in (5) can be thought <strong>of</strong> as only<br />

load impedance dependent and the second term as<br />

structure dependent, respectively.<br />

If one is interested in the scattered field only, one has<br />

to subtract the field in absence <strong>of</strong> the antenna (dn = cn)<br />

from (5) yielding<br />

N N<br />

s S N<br />

N<br />

n<br />

0<br />

dn S 0<br />

m c cm m c cn n S Snm<br />

nm c cm<br />

m<br />

1<br />

S00 00 m 1 m 1<br />

m<br />

0<br />

. (7)<br />

S S0<br />

Since a short circuit condition can easily be achieved, it is<br />

reasonable to use the short circuited case as reference.<br />

With = -1, one can calculate the scattered field in the<br />

short circuit case too:<br />

s S N N<br />

sc n<br />

0<br />

dn S S00m 0<br />

m c cm m c cn n S Snm<br />

nm c m<br />

1 S 00 m 1 m<br />

1<br />

m<br />

0<br />

. (8)<br />

S S0<br />

Using (7) and (8) it is now possible to rewrite (7) in terms<br />

<strong>of</strong> the reference scattered field from the short circuit<br />

condition as:<br />

1 0<br />

N<br />

s s 1 S<br />

sc<br />

n 0<br />

dn d dn n S0mcm 1<br />

S00 1 S00 m 1<br />

sc s<br />

N<br />

S0cm 1 m 0 S0 00 00 m<br />

S 00 1 S0<br />

1<br />

. (9)<br />

In (9) it is assumed that the incoming spherical wave<br />

mode series cm is known. Since the finite element method<br />

should be applied, the description in terms <strong>of</strong> the wave<br />

modes is not practical. Introducing the antenna short<br />

circuit current Isc and describing the radiated field in<br />

terms <strong>of</strong> an antenna driving current IAnt as presented in<br />

[7], one can eliminate the incoming wave mode series cm<br />

from (9) yielding:<br />

- 328 - 15th IGTE Symposium 2012<br />

0 1 1<br />

s s I<br />

sc rad sc<br />

Z 1 S<br />

rad 0 1<br />

00<br />

dn d sc sc 0 1 S<br />

sc rad<br />

00 000<br />

n d n<br />

n<br />

1...<br />

N<br />

. (10)<br />

2 I IAnt Z ZAnt 1<br />

S S00<br />

000<br />

Equation (10) is the key expression enabling the<br />

description <strong>of</strong> a scattered field in terms <strong>of</strong> a reference<br />

scattered field and a pure radiated one. Finally, using (2)<br />

and (6), (10) can be written as:<br />

s s ssc<br />

rad IscZ d L<br />

n d sc<br />

n d<br />

n<br />

I IAnt ZAnt Z L Z . (11)<br />

L<br />

The electric field E is the summation <strong>of</strong> the spherical<br />

wave mode series, so E is given by [7], [11]:<br />

IZ<br />

E E E 0 L<br />

Scattered E EShort Short E EAntenna<br />

Antenna<br />

. (12)<br />

IAnt ZAnt Z L<br />

Since, especially for UHF-RFID applications in<br />

general, the transponder antenna is conjugate complex<br />

matched to the nominal transponder IC impedance it is<br />

advantageous not to use the short circuit case as reference<br />

but rather the conjugate complex matched case. In [7], [8]<br />

it is described how to eliminate the short circuit case from<br />

(12) to get the final relationship:<br />

* *<br />

EScattered ( ZL) E Scattered ( Z Ant ) E EAntenn<br />

Antenna<br />

. (13)<br />

* *<br />

I I<br />

m I<br />

I<br />

m<br />

Ant<br />

In (13), Im * is the current at the terminal <strong>of</strong> the antenna<br />

for a conjugate complex matched transponder antenna in<br />

case <strong>of</strong> a pure scattering problem and * is the conjugate<br />

matched reflection coefficient:<br />

*<br />

* Z ZAnt AAnt t Z L<br />

Z ZAnt ZL<br />

L Z . (14)<br />

L<br />

Finally, this means that the field scattered by an antenna<br />

terminated with a certain load impedance ZL can be<br />

calculated by a superposition <strong>of</strong> a reference scattering<br />

field and a scaled radiated field <strong>of</strong> the antenna driven<br />

with a current IAnt.<br />

III. NUMERICAL INVESTIGATIONS<br />

The basic electromagnetic field problem shown in<br />

Figure 3 is analyzed with a finite element based in-house<br />

code. Introducing the magnetic vector potential A and the<br />

modified scalar potential V the electric field intensity E<br />

and the magnetic field intensity H in the time harmonic<br />

case can be written as:<br />

E j A j V<br />

, (15)<br />

1<br />

H A = A. (16)<br />

Using n1 edge basis functions Ni for the magnetic vector<br />

potential A and n2 nodal basis functions Ni for the<br />

modified electric scalar potential Vh as proposed in [9],<br />

the Galerkin equations to be solved become:<br />

N i A h hd j c c N i i A<br />

h<br />

h d<br />

j l<br />

j c N<br />

i<br />

gradV h hd N<br />

i A<br />

h hd<br />

d 0<br />

Z IC w<br />

SIBC<br />

( i 1,2,..., 12 1,2,..., , , , n )<br />

N c i<br />

c i (17)<br />

1


j c cgradN<br />

gra adNi<br />

A h d<br />

j c gradN gra adNi<br />

gradV h d 0 (18)<br />

( i 1, 12 , 2,..., 22,...,<br />

, , n2).<br />

In (17) and (18), the approximations <strong>of</strong> the potential<br />

functions are given by:<br />

n 1<br />

Ah a i N<br />

i<br />

i 1<br />

N<br />

i 1 i<br />

, (19)<br />

n<br />

2<br />

Vh VVN i<br />

N i<br />

i<br />

1 i<br />

. (20)<br />

The needed truncation <strong>of</strong> the problem domain has been<br />

realized by applying perfectly matched layers (PMLs) as<br />

proposed in [10]. For the first basic scattering<br />

investigations it was refrained from modeling the reader<br />

antenna structure e.g. as shown in Figure 2a). Instead, the<br />

actual scattering problem was excited by means <strong>of</strong> a<br />

Hertz-dipole as proposed in [4] since the Hertz-dipole can<br />

be modeled by a filament current with a given length. The<br />

main advantage <strong>of</strong> this excitation technique beside the<br />

reduction in the degree <strong>of</strong> freedom is, that the radiated<br />

power <strong>of</strong> a Hertz-dipole can be calculated analytically<br />

[11], which <strong>of</strong>fers the possibility <strong>of</strong> validating the quality<br />

<strong>of</strong> the results gathered within the post processing. On the<br />

other hand, the feedback <strong>of</strong> the transponder tag on the<br />

reader is neglected in this case. Since the influence <strong>of</strong> the<br />

transponder on the reader for UHF-RFID applications is<br />

small in general, it is assumed that this effect is negligible<br />

in a first approximation [4].<br />

The excitation <strong>of</strong> the pure radiation problem is done by<br />

impressing a voltage U0 at the feed gap <strong>of</strong> the dipole<br />

structure by prescribing a constant vector potential for the<br />

length y <strong>of</strong> the feed gap:<br />

Ee y y j j A y y U 0 . (21)<br />

As it can be seen from (17), the IC impedance is<br />

modeled with a surface impedance boundary condition<br />

(SIBC) as proposed in [12]:<br />

E Ett1 t 1 dds<br />

s<br />

U E<br />

l<br />

t 1 l l<br />

Z l<br />

t 1<br />

l<br />

IC Z<br />

SIBC , (22)<br />

I H t 2 d ds s K<br />

w<br />

t1<br />

w w<br />

t<br />

c<br />

where l is the length <strong>of</strong> the impedance geometry and w is<br />

the width. The surface impedance ZSIBC in (22) is given<br />

by the relationship <strong>of</strong> the tangential component <strong>of</strong> the<br />

electric field intensity Et1 and the tangential component<br />

Ht2 <strong>of</strong> the magnetic field intensity, assuming constant<br />

tangential components at the surface impedance.<br />

Due to the choice <strong>of</strong> the basis functions, the resulting<br />

system <strong>of</strong> equations (17) and (18) becomes singular<br />

which is not a drawback applying an iterative solver<br />

method. Unfortunately, the resulting system <strong>of</strong> equations<br />

is ill conditioned as described in [13], [14]. Hence, a<br />

direct solver method [14] has to be applied to avoid<br />

impractically long simulation durations. Due to the<br />

singularity <strong>of</strong> the system <strong>of</strong> equations, a tree-gauging [15]<br />

is needed to be able to apply the direct solver method.<br />

- 329 - 15th IGTE Symposium 2012<br />

IV. BASIC EXAMPLE<br />

The proposed method is tested on a very basic example<br />

shown in Figure 3. Applying a electric boundary<br />

condition<br />

E n 0 (23)<br />

(which is a Dirichlet boundary condition for A) in the x-zplane<br />

and a magnetic boundary condition<br />

H n 0 (24)<br />

(which is a Neumann boundary condition for A) in the yz-plane,<br />

only a quarter <strong>of</strong> the problem has to be modeled.<br />

The half wavelength dipole at a frequency <strong>of</strong> f = 1 GHz<br />

with a length <strong>of</strong> 0.5 l Ant 7.5 cm and a width <strong>of</strong><br />

0.5 w Ant 2.5 mm has a thickness <strong>of</strong> d = 1 mm and is<br />

placed at a distance <strong>of</strong> 25 cm to the Hertz-dipole<br />

excitation which is 0.8 times the wavelength . Hence,<br />

far field conditions can be assumed. In Figure 3b) a detail<br />

<strong>of</strong> the feed gap with a width <strong>of</strong> wgap = 200 μm is shown.<br />

The SIBC is connected via perfect electric conductors<br />

(PEC) to the excitation and the antenna structure.<br />

a) b)<br />

Figure 3: a) Typical RFID application. b) Simplified transponder<br />

model.<br />

The input impedance <strong>of</strong> the dipole antenna structure in<br />

the present model is calculated to be<br />

ZAnt = 100.15 + j 44.05 . Hence, the input impedance <strong>of</strong><br />

the fictitious IC must be ZIC = 100.15 – j 44.05 to<br />

fulfill the conjugate complex matching. With the given IC<br />

impedance, the needed reference field can be calculated<br />

by subtracting the field <strong>of</strong> the Hertz-dipole excitation in<br />

absence <strong>of</strong> the dipole structure from the total scattering<br />

problem as proposed in [4]. The results are shown in<br />

Figure 4a) to c).<br />

a) b) c)<br />

Figure 4: a) Scattering problem. b) Hertz-Dipole excitation.<br />

c) Scattered field from the dipole structure.<br />

Next the scattered field in case <strong>of</strong> the modulated IC<br />

impedance should be calculated with the proposed<br />

method. For the modulated IC impedance, it is assumed


that a resistance <strong>of</strong> Rmod = 150 is parallel to the IC<br />

impedance which is a typical value for UHF-RFID<br />

transponder ICs. So the modulated IC impedance is given<br />

to be ZICmod = 62.76 – j 15.6<br />

In Figure 5, the result obtained by the proposed method<br />

is compared with the result gathered by the method <strong>of</strong> [4].<br />

As it can be seen, a good qualitative agreement between<br />

the two methods is obtained. The current distribution<br />

along the dipole structure has also been investigated. This<br />

is done by comparing the magnetic field intensity in the<br />

vicinity <strong>of</strong> the antenna structure along the antenna for<br />

different phase angles in the excitation. The results are<br />

shown in Figure 6. In Figure 7, the relative error between<br />

the two methods is shown. As it can be seen, the good<br />

agreement between the two methods is also quantitative.<br />

The difference can be explained by uncertainties in the<br />

determination <strong>of</strong> the antenna input impedance ZAnt, since<br />

the scaling factor in (13) is directly related to this<br />

number.<br />

a) b)<br />

Figure 5: a) Scattered field calculated with the method from [4].<br />

b) Scattered field calculated with the proposed method.<br />

|H| in A/m<br />

1,6<br />

1,4<br />

1,2<br />

1,0<br />

0,8<br />

0,6<br />

0,4<br />

0,2<br />

0,0<br />

0 0,02 0,04 0,06 0,08<br />

Distance in m<br />

Figure 6: Current distribution along the dipole antenna in terms <strong>of</strong> the<br />

magnetic field intensity.<br />

8<br />

6<br />

4<br />

2<br />

-4<br />

-6<br />

-8<br />

-10<br />

Magnetic field intensity along the dipole antenna<br />

Scattering 0°<br />

Proposed method 0°<br />

Scattering 45°<br />

Proposed method 45°<br />

Scattering 90°<br />

Proposed method 90°<br />

Scattering 180<br />

Proposed method 180<br />

Relative error <strong>of</strong> the magnetic field intensity in %<br />

0<br />

0<br />

-2<br />

0,02 0,04 0,06 0,08<br />

Relative error 0°<br />

Relative error 45°<br />

Relative error 90°<br />

Relative error 180°<br />

Distance in m<br />

Figure 7: Relative error <strong>of</strong> the current distribution along the antenna.<br />

- 330 - 15th IGTE Symposium 2012<br />

V. CONCLUSION<br />

It has been shown on a very basic example that the<br />

scattering from passive objects like UHF-RFID<br />

transponder tags can be described in terms <strong>of</strong> a reference<br />

scattering problem and a pure radiation problem. Hence,<br />

the proposed method <strong>of</strong>fers the possibility <strong>of</strong> a certain<br />

reduction in the computational effort, since if multiple IC<br />

impedances have to be taken into account the whole<br />

channel including the reader antenna has to be simulated<br />

for the reference case only. All other IC states can be<br />

modeled as pure radiation problems without having to<br />

model the reader structure.<br />

REFERENCES<br />

[1] K. V. S. Rao, P. V. Nikitin and S. F. Sander, “Antenna design for<br />

UHF RFID tags: a review and a practical application,” IEEE<br />

Trans. on Ant. and Prop., vol. 53, no. 12, pp. 3870-3876, 2005.<br />

[2] V. Chawla and D. S. Ha, “An overview <strong>of</strong> passive RFID,” IEEE<br />

Communications Magazine, vol. 45, no. 9, pp. 11-17, Sept. 2007.<br />

[3] A. Moretto, E. Colin, C. Ripoll and S. A. Chakra, “Shunt behavior<br />

in RFID UHF tag according to ISO standard and manufacturer<br />

requirements,” <strong>Proceedings</strong> <strong>of</strong> the IEEE, vol. 98, no. 9, pp. 1550-<br />

1554, 2010.<br />

[4] T. Bauernfeind, K. Preis, G. Koczka, S. Maier and O. Biro,<br />

“Influence <strong>of</strong> the Non-Linear UHF-RFID IC Impedance on the<br />

Backscatter Abilities <strong>of</strong> a T-Match Tag Antenna Design,” IEEE<br />

Trans. on Magn., vol. 48, no. 2, pp. 755-758, 2012.<br />

[5] R. Wang and J. Jin, “A Flexible Time-Stepping Scheme for<br />

Hybrid Field-Circuit Simulation Based on the Extended Time-<br />

Domain Finite Element Method,” IEEE Trans. on Advanced<br />

Packaging, vol. 33, no. 4, pp. 769-776, 2010.<br />

[6] G. Manzi and U. Mühlmann, “Passive UHF RFID sensor /<br />

transponder antenna optimization for backscatter operation by<br />

electromagnetic-circuital co-simulation,” <strong>Proceedings</strong> <strong>of</strong> the 11 th<br />

International Conference on Telecommunications, pp. 17-22,<br />

2011.<br />

[7] R. C. Hansen, “Relationships Between Antennas as Scatterers and<br />

as Radiators,” <strong>Proceedings</strong> <strong>of</strong> the IEEE, vol. 77, no. 5, pp. 659-<br />

662, 1989.<br />

[8] R.G. Green, “Scattering from conjugate-matched antennas,” IEEE<br />

Trans. on Ant. and Prop., vol. 14, no. 1, pp. 17-21, 1966.<br />

[9] O. Biro, “Edge element formulations <strong>of</strong> eddy current problems,”<br />

Comput. Methods Appl. Mech. Eng., vol. 169, pp. 391-405, 1999.<br />

[10] I. Bardi, R. Remski, D. Perry and Z. Cendes “Plane wave<br />

scattering from frequency-selective surfaces by the finite-element<br />

method,” IEEE Trans. on Magn., vol. 38, no. 2, pp. 641-644,<br />

2002.<br />

[11] C. Balanis, Antenna Theory: Analysis and Design, Hoboken: John<br />

Wiley & Sons, 2005.<br />

[12] K. Hollaus, O. Biro, K. Preis and C. Stockreiter, “Edge finite<br />

elements coupled with a circuit for wave problems,” International<br />

Conference on Electromagnetics in Advanced Applications, pp.<br />

956-959, Torino, 2007.<br />

[13] G. Koczka, T. Bauernfeind, K. Preis and O. Biro, “Schur<br />

Complement Method Using Domain Decomposition for Solving<br />

Wave Propagation Problems,” The 10th International Workshop<br />

on Finite Elements for Microwave Engineering, FEM2010, pp. 53,<br />

Meredith, 2010.<br />

[14] G. Koczka, T. Bauernfeind, K. Preis and O. Biro, “An Iterative<br />

Domain Decomposition Method for Solving Wave Propagation<br />

Problems,” The 11th International Workshop on Finite Elements<br />

for Microwave Engineering, FEM2012, pp. 66, Estes Park, 2012.<br />

[15] R. Albanese and G. Rubinacci, “Solution <strong>of</strong> three dimensional<br />

eddy current problems by integral and differential methods,” IEEE<br />

Trans. on Magn., vol. 24, pp. 98-101, 1988.


- 331 - 15th IGTE Symposium 2012<br />

Simulation <strong>of</strong> a High Speed Reluctance Machine<br />

Including Hysteresis and Eddy Current Losses<br />

B. Schweigh<strong>of</strong>er∗ , H. Wegleiter∗ , M. Recheis∗ , and P. Fulmek †<br />

∗<strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing<br />

<strong>Graz</strong>, Austria<br />

† Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Institute <strong>of</strong> Sensor and Actuator Systems, Vienna, Austria<br />

E-mail: bernhard.schweigh<strong>of</strong>er@TU<strong>Graz</strong>.at<br />

Abstract—Flywheel energy storage systems in automotive applications require a compact design, which typically uses the<br />

rotor <strong>of</strong> the electrical machine as storage mass. In order to minimise friction losses the rotor is running in vacuum. The<br />

heat generated in the rotor can only be transferred by thermal radiation and thermal conduction through the bearings,<br />

eventually. Therefore, a precise estimation <strong>of</strong> the expected rotor losses is needed to design an efficient thermal management<br />

<strong>of</strong> the whole machine. In this paper a switched reluctance machine is analysed by finite–element simulations with focus on<br />

hysteresis losses and eddy current losses.<br />

Index Terms—Flywheel, Reluctance Motor, Hysteresis, EM Model<br />

I. INTRODUCTION<br />

Flywheel energy storage systems are an important<br />

area <strong>of</strong> research in the automotive industry, to satisfy<br />

the demands for low–emission or zero–emission by<br />

hybridisation and electrification <strong>of</strong> vehicles, trucks and<br />

buses. Typical systems have to be designed for high<br />

power and medium energy content, e.g. 120 kW and<br />

1.5 kWh. Advantages <strong>of</strong> flywheel systems are the high<br />

energy density in comparison to double–layer capacitors,<br />

the higher power density in comparison to batteries, the<br />

State–Of–Charge is always known, almost no degradation<br />

<strong>of</strong> performance with age, etc. Additionally several constraints,<br />

such as size, weight, and costs, have to be fulfilled.<br />

Typically a compact design, in which the rotor <strong>of</strong><br />

the electrical machine acts as the flywheel storage mass<br />

is chosen. Permanent magnet (PM) electrical machines<br />

seem to be the optimum choice for flywheel systems<br />

due to their high efficiency, high power density, and<br />

lower rotor losses in comparison to induction machines.<br />

Especially for automotive applications, however, several<br />

drawbacks have to be considered:<br />

– high power rare earth (RE) magnets (samariumcobalt<br />

or neodymium-iron-boron) are very expensive<br />

– even moderate temperatures may dramatically degrade<br />

the RE–magnets performance by demagnetisation<br />

or even destroy them<br />

– high mechanical stress due to centrifugal forces on<br />

the rotor in flywheel machines necessitate additional<br />

supporting structures to protect the RE–magnets<br />

– the combination <strong>of</strong> a RE–magnet–rotor with an<br />

iron stator leads to significant zero–torque losses,<br />

limiting the storage capabilities <strong>of</strong> the flywheel.<br />

Reluctance machines (RM) generally show a higher mechanical<br />

robustness. They can be produced from high–<br />

strength electrical steels to easily withstand the centrifugal<br />

forces, the high torques, and the pulse accelerations<br />

during operation in the vehicle. In comparison to PM–<br />

machines the rotational speed can be increased, leading to<br />

higher storage densities. Additionally, no electromagnetic<br />

losses have to be expected for the free running flywheel,<br />

which means no zero–torque losses. A comparison<br />

between synchronous (synRM) and switched reluctance<br />

machine (SRM) shows an advantage for the SRM due<br />

to its simple coil design and the higher power density.<br />

For the design <strong>of</strong> the SRM the rotor losses need special<br />

attention. As the rotor is operated in vacuum to avoid<br />

frictional losses, heat dissipation can only happen by<br />

thermal radiation and, eventually, by thermal conduction<br />

through conventional mechanical bearings. The stator is<br />

cooled by conventional water–cooling. For an accurate<br />

modeling <strong>of</strong> the heat flow as well as to predict the<br />

efficiency <strong>of</strong> the machine the knowledge <strong>of</strong> the losses<br />

inside the motor, especially in the rotor is needed.<br />

This paper deals with the analysis <strong>of</strong> the rotor losses in<br />

a switched reluctance machine (SRM). We have chosen<br />

an external rotor 12/8 SRM with a power <strong>of</strong> 120 kW and<br />

a rated speed <strong>of</strong> 25000 rpm.<br />

II. METHODOLOGY<br />

Losses in ferromagnetic materials are divided into the<br />

static, rate–independent hysteresis loss and dynamic rate–<br />

dependent electromagnetic losses (eddy–current losses,<br />

excess losses) [1], [2], [3]. The rate–independent iron–<br />

losses are determined by the materials’ hysteresis loop.<br />

If the course <strong>of</strong> the local vectorial flux–density and<br />

magnetic field is known, the integral over the respective<br />

vectorial hysteresis loops gives the hysteresis loss in the<br />

material. Existing models for ferromagnetic hysteresis<br />

(e.g. Jiles–Atherton, Preisach, Energetic Model) describe<br />

scalar BH–loops, only.<br />

The calculation <strong>of</strong> rate–dependent losses is usually<br />

based on the time variation <strong>of</strong> the flux density obtained<br />

from static finite element (FE) simulations. With increasing<br />

frequencies, however, the skin–depth decreases leading<br />

to an increasing deviation <strong>of</strong> the real flux density from<br />

the static simulation results. For the rate dependent losses


several formulations for sinusoidal flux densities have<br />

been proposed [1], [4], suitable only for linear materials,<br />

in the frequency domain. The correspondence between<br />

simulation results and experiments is not satisfying if the<br />

material is used at high flux densities, at saturation, due<br />

to the assumption <strong>of</strong> linear material behaviour. During the<br />

operation <strong>of</strong> a high performance electrical machine the<br />

magnetic fields and the flux densities in the core material<br />

are neither unidirectional nor sinusoidal.<br />

Loss calculations taking into account the non–linear<br />

materials’ properties, the eddy current distribution and<br />

the skin effect require an exorbitantly high computational<br />

effort for 3D dynamic FE simulations.<br />

We have prepared a 2D FE–model <strong>of</strong> the machine to<br />

calculate the distribution <strong>of</strong> the static flux density for<br />

arbitrary rotor angles and excitation patterns. An external<br />

source current is used to establish the magnetic field. The<br />

model is prepared to analyse the influence <strong>of</strong> different<br />

current pulse patterns, and to develop efficient control<br />

strategies. The non–linear ferromagnetic materials’ properties<br />

have been modelled by the scalar Energetic Model<br />

(EM) [5], which has been parameterised by Epstein frame<br />

experiments and from manufacturers data–sheet. The EM<br />

is used to derive the single valued BH–commutation<br />

curve used to characterise the material in FEM, and<br />

to calculate the static hysteresis loss for arbitrary flux<br />

density waveforms [6], [7]. The variation <strong>of</strong> the flux<br />

density with time, obtained by the FE–simulation, is<br />

used to estimate eddy current losses by approximating<br />

equations [8]. Finally, an expression for the losses in<br />

the machine can be found by integrating these local loss<br />

expressions over the whole volume.<br />

III. RELUCTANCE MACHINE FE–MODEL<br />

Fig. 1. Geometry <strong>of</strong> the reluctance machine RM. Three sets <strong>of</strong> coils<br />

produce a rotating magnetic quadrupole field. With the 12/8 ratio the<br />

principle step angle is 15 Degrees, π/12. The shown position <strong>of</strong> the<br />

external rotor is defined as α =0 ◦ , rotation direction chosen is clock–<br />

wise. Two points are chosen to evaluate the magnetic field, flux density<br />

and hysteresis loss: Point 1 at the center <strong>of</strong> a rotor tooth, Point 2 at the<br />

surrounding yoke area.<br />

We have chosen a 12/8 SRM (double 6/4 machine [9])<br />

with an external rotor. The geometrical dimensions <strong>of</strong><br />

- 332 - 15th IGTE Symposium 2012<br />

the SRM are described in Table I. With the chosen ratio<br />

<strong>of</strong> air–gap–diameter to length <strong>of</strong> the machine ≈ 12:16,<br />

the 3rd dimension has been omitted leading to a more<br />

simple 2D FEM model <strong>of</strong> the SRM Figure 1. Three sets<br />

<strong>of</strong> coils (ABC) build a rotating quadrupole field in the<br />

machine. With 12 stator teeth and 8 rotor teeth the step<br />

angle is 15 ◦ . The central steel shaft <strong>of</strong> the stator is used<br />

for cooling.<br />

Both, stator and rotor, are built up from steel sheet.<br />

The stator is made from a standard electrical steel. For<br />

the rotor material we have chosen the Vacodur50S high<br />

strenght cobalt–steel from Vacuumschmelze.<br />

The rated speed <strong>of</strong> the SRM is 25000 rpm, the stator<br />

coils switching frequency results to 10 kHz.<br />

length <strong>of</strong> motor<br />

external rotor<br />

160.0 mm<br />

No. <strong>of</strong> teeth 8<br />

material Vacodur 50S<br />

outer diameter 180.0 mm<br />

inner diameter 121.0 mm<br />

tooth depth 14.5 mm<br />

gap width<br />

stator<br />

28.0 mm<br />

No. <strong>of</strong> teeth 12<br />

material Armco electrical steel<br />

outer diameter 120.0 mm<br />

inner diameter 60.0 mm<br />

gap depth 17.4 mm<br />

tooth width 14.0 mm<br />

shaft<br />

material construction steel<br />

outer diameter 60.0 mm<br />

TABLE I<br />

GEOMETRIC DIMENSIONS OF SRM.<br />

Fig. 2. FEM: Gmsh/GetDP 2D–model for rotor position α =15 ◦ .<br />

17516 vertices and 37224 elements in a free triagonal mesh.<br />

For the FEM simulation we use two different s<strong>of</strong>tware<br />

packages: the commercial Comsol–Multiphysics [10] and<br />

the general open–source packages Gmsh/GetDP [11].<br />

Figure 2 shows the final mesh <strong>of</strong> the model with 37224<br />

triangle elements. In both FEM–packages we defined the<br />

magnetic materials’ property as BH–table, calculated by<br />

the Energetic Model.


IV. MATERIALS MODEL<br />

The static magnetic materials’ properties are described<br />

by the Energetic Model (EM) <strong>of</strong> ferromagnetic hysteresis<br />

[5]. The EM describes the non–linear, hysteretic behaviour<br />

<strong>of</strong> magnetic polarisation and magnetic field in<br />

the material based on the concept <strong>of</strong> minimising the<br />

total energy in a statistical description <strong>of</strong> the magnetic<br />

domain structure. Consequently, many physical factors<br />

influencing the magnetisation process can be included:<br />

e.g. magnetocrystalline anisotropy, internal demagnetisation,<br />

anisotropy <strong>of</strong> magnetostriction, etc. The EM can<br />

simulate major and minor hysteresis loops, it also simulates<br />

the effect <strong>of</strong> slowly evolving closed minor loops, in<br />

contrast to other models (e.g. Preisach model, wipeout–<br />

property). The simplified scalar formulation <strong>of</strong> the EM<br />

[5], [6] is perfectly prepared for integration into FEM.<br />

The parameters <strong>of</strong> the EM are found by evaluating several<br />

important points <strong>of</strong> a measured BH–loop (e.g. saturation,<br />

coercivity, remanence, initial susceptibility).<br />

The rotor material has to be chosen with respect<br />

to s<strong>of</strong>t–magnetic properties (low coercivity, high flux<br />

density, low losses), and mechanical properties, as well.<br />

A material with optimum properties for high speed<br />

rotors is the s<strong>of</strong>t–magnetic cobalt–iron alloy Vacodur50<br />

manufactured by Vacuumschmelze [12]. Due to the Co–<br />

content it exhibits a very high saturation flux density <strong>of</strong><br />

2.35 T. At a magnetic field strenght <strong>of</strong> 800 A/m more<br />

than 2.0 T are reached. The coercivity is in the range<br />

<strong>of</strong> 100–200 A/m. Figure 3 shows the BH–loop from the<br />

Vacodur50S datasheet and the corresponding results <strong>of</strong><br />

EM–simulations.<br />

Fig. 3. Hysteresis loops <strong>of</strong> Vacodur50S. Red lines: BH–loop from<br />

datasheet (Vacuumschmelze), blue line: BH–loop (virgin curve and<br />

major hysteresis loop) from EM–simulation.<br />

Vacodur50S Armco<br />

Js 2.40 2.10<br />

q 40.12 15.26<br />

k 314.95 16.31<br />

Ne 2.09e-5 3.15e-6<br />

g 11.30 23.41<br />

h 3.40 3.1e-3<br />

TABLE II<br />

EM–PARAMETERS FOR VACODUR50S AND ARMCO<br />

- 333 - 15th IGTE Symposium 2012<br />

For our simulations <strong>of</strong> the stator material we used the<br />

EM parameterised for s<strong>of</strong>t–magnetic Armco electrical<br />

steel sheets (GO, FeSi)[13]. This grain oriented FeSi–<br />

steel exhibits a coercivity as low as 7 A/m, the technical<br />

saturation is limited to 2.0 T. Epstein measurements<br />

provided the data necessary to parameterise the EM.<br />

Table II shows the parameters used for the EM simulations<br />

<strong>of</strong> rotor and stator material. The single valued<br />

BH–function, required for our FEM simulations, has been<br />

found by EM calculations <strong>of</strong> the commutation curves<br />

Fig. 4 and 5. Both FEM packages (Comsol, GetDP)<br />

use tables <strong>of</strong> the simulated BH–commutation curve to<br />

interpolate the required BH–point.<br />

Fig. 4. Hysteresis loops for Armco electrical steel. EM–simulations<br />

for several complete symmetrical loops build the commutation curve.<br />

Fig. 5. Vacodur50S: simulated symmetrical hysteresis loops build the<br />

commutation curve.<br />

V. FEM RESULTS<br />

The above described model setup (geometry and materials)<br />

was used for magnetostatic FEM simulations with<br />

current excitation in Comsol and GetDP. Both packages<br />

gave almost exactly identical results. A series <strong>of</strong> static<br />

calculations has been done to simulate the rotating machine<br />

for various coil currents. Typical results <strong>of</strong> both<br />

FEM simulations are depicted in the following figures.<br />

The excitation current was applied to Coil–set A (see<br />

Fig. 1), producing a quadrupole magnetic field. The<br />

angular rotor position has been changed in 1◦ steps, and<br />

the corresponding flux densities in 2 points <strong>of</strong> the rotor


and the flux in the coil A stator tooth have been evaluated<br />

exemplarily to estimate the rotor losses.<br />

Figure 6 shows the corresponding relative permeability<br />

values, i.e. the quotient B/(μ0H), for a moderate coil<br />

current density <strong>of</strong> 1 A/mm 2 . Different μr–scales are<br />

used for stator and rotor. As the respective range <strong>of</strong> flux<br />

densities Fig. 7 is below 300 mT, the permeability is still<br />

rising with flux density (cmp. Fig. 4 and 5), accordingly<br />

the maximum permeability is at the loci <strong>of</strong> maximum<br />

flux density.<br />

Figure 8 shows Comsol results for a coil current<br />

density <strong>of</strong> 5 A/mm 2 . The flux density at the partly overlapping<br />

stator and rotor teeth approaches the materials<br />

saturation.<br />

Fig. 6. GetDP simulation: relative magnetic permeabilities μr at coil<br />

A current density s =1A/mm 2 (I = 112 A), rotor angle α =10 ◦ .<br />

Different scales are used for rotor (Vacodur50) and stator (Armco).<br />

Fig. 7. GetDP simulation: flux density B at coil A current density<br />

s =1A/mm 2 (I = 112 A), rotor angle α =10 ◦ .<br />

The evaluation <strong>of</strong> the flux in the stator teeth is shown<br />

in Fig. 9. The line integral <strong>of</strong> B over a plane stator tooth<br />

cross–section, multiplied by the length <strong>of</strong> the machine,<br />

gives the total flux in Wb for a quarter <strong>of</strong> the machine.<br />

Perfect alignment <strong>of</strong> stator and rotor teeth at a rotor angle<br />

<strong>of</strong> 22.5 ◦ gives the steepest flux versus field curve. As the<br />

rotor teeth are moved towards the gap between the stator<br />

teeth, the demagnetising effect <strong>of</strong> the increasing length <strong>of</strong><br />

the effective air–gap is clearly visible. When the stator<br />

tooth faces a rotor gap exactly at 0 ◦ the flux vs. field<br />

characteristic becomes a rather flat, almost straight line.<br />

- 334 - 15th IGTE Symposium 2012<br />

Fig. 8. Comsol simulation: flux density B at coil A current density<br />

s =5A/mm 2 (I = 560 A), rotor angle α =10 ◦ .<br />

Fig. 9. Flux in stator tooth versus coil current for varying rotor angle.<br />

Maximum flux for α =22.5 ◦ , stator tooth exactly aligned with rotor<br />

tooth, minimum demagnetisation. Minimum flux for α =0 ◦ , stator<br />

tooth exactly between rotor teeth, maximum demagnetisation, almost<br />

linear dependence.<br />

VI. LOSSES<br />

The conventional three–term iron loss model [1], [8]<br />

contains expressions for hysteresis loss, eddy current loss,<br />

and excess loss. All these loss–contributions depend on<br />

the flux density itself and its time derivative. The local<br />

flux densities are usually estimated by finite element analyses,<br />

completely neglecting hysteresis and eddy currents,<br />

sometimes even the single valued non–linear BH curve.<br />

In a first step FEM calculations approximately determine<br />

the local flux density, from the flux density the iron losses<br />

are determined.<br />

A. Hysteresis loss<br />

In our work we use the EM hysteresis model described<br />

above, to calculate the hysteresis loss for any arbitrary<br />

course <strong>of</strong> flux densities in the material. Series <strong>of</strong> simulations<br />

for constant coil current and varying rotor angle<br />

are used to identify the course <strong>of</strong> the flux density at two<br />

chosen distinct points <strong>of</strong> the rotor (see Fig. 1). Figure 10<br />

shows the evolution <strong>of</strong> the vectorial components <strong>of</strong> the<br />

flux density at point 1 (rotor tooth) versus rotor angle<br />

when coil A is switched on. The symmetry with a change<br />

in sign at 90 ◦ and the periodicity <strong>of</strong> 180 ◦ are evident.


Both components <strong>of</strong> the B–vector, radial and tangential,<br />

and the total <strong>of</strong> the flux density are shown.<br />

Under normal operation each set <strong>of</strong> coils (ABC) is<br />

active during a rotation angle <strong>of</strong> 15 ◦ . Figure 11 shows<br />

the radial component <strong>of</strong> the flux density at point 1 (rotor<br />

tooth) when all coils A–B–C are excited sequentially,<br />

leading to a periodicity <strong>of</strong> 60 ◦ for a fixed point on the<br />

rotor. The total hysteresis loss at point 1 for a complete<br />

revolution <strong>of</strong> 360 ◦ is Wh =6·555.5 J/m 3 = 3332.8 J/m 3<br />

(Fig. 12),<br />

Fig. 10. Flux density at rotor point 1 (tooth) versus rotor angle, coil A<br />

activated with 5 A/mm 2 . The thick lines indicate the normal switched–<br />

on range for coils A.<br />

Fig. 11. Radial component <strong>of</strong> the flux density at rotor point 1 (tooth)<br />

versus rotor angle. Coils A–B–C are excitated sequentially with 5<br />

A/mm 2 .<br />

The same procedure, described above for a rotor tooth,<br />

is applied to point 2 on the yoke part <strong>of</strong> the rotor (see<br />

Fig. 1). Figure 13 shows the evolution <strong>of</strong> the vectorial<br />

components <strong>of</strong> the flux density at point 1 (rotor tooth)<br />

versus rotor angle when coil A is switched on. In the rotor<br />

yoke area there exists only a tangential component <strong>of</strong> the<br />

flux density, the radial component completely vanishes.<br />

Figure 14 shows the tangential component <strong>of</strong> the flux<br />

density at point 2 (rotor yoke) when all coils A–B–C<br />

are excited sequentially, leading to a periodicity <strong>of</strong> 60 ◦<br />

for a fixed point on the rotor. The total hysteresis loss<br />

at point 2 for a complete revolution <strong>of</strong> 360 ◦ is Wh =<br />

6 · 672.1 J/m 3 = 4032.6 J/m 3 (Fig. 15). The two extra<br />

minor loops lead to a significant increase <strong>of</strong> the hysteresis<br />

loss in the rotor’s yoke area.<br />

- 335 - 15th IGTE Symposium 2012<br />

Fig. 12. Simulated BH–loop for a complete period <strong>of</strong> the magnetisation<br />

process point 1, covering 60 ◦ rotational angle.<br />

Fig. 13. Flux density at rotor point 2 (yoke) versus rotor angle, coil A<br />

activated with 5 A/mm 2 . The thick lines indicate the normal switched–<br />

on range for coils A.<br />

Figure 16 shows the hysteresis losses at two points <strong>of</strong><br />

the rotor for a complete rotor revolution in dependence <strong>of</strong><br />

coil current. Although the amplitude <strong>of</strong> the flux density<br />

is significantly larger in the rotor tooth than in the rotor<br />

yoke, the losses in the yoke are dominant due to the<br />

existence <strong>of</strong> a pair <strong>of</strong> extra minor loops. As the yoke<br />

reaches local saturation, however, the flux density becomes<br />

distributed more uniformly, and the amplitude <strong>of</strong><br />

the minor loops decreases, accompanied by a decreasing<br />

hysteresis loss.<br />

Fig. 14. Tangential component <strong>of</strong> the flux density at rotor point 2<br />

(yoke) versus rotor angle. Coils A–B–C are excitated sequentially with<br />

5 A/mm 2 .


Fig. 15. Simulated BH–loop for a complete period <strong>of</strong> the magnetisation<br />

process at point 2, covering 60 ◦ rotational angle.<br />

Fig. 16. Hysteresis loss for one complete rotor revolution versus coil<br />

current. The losses are calculated for two points in the rotor (see Fig. 1).<br />

B. Eddy current loss<br />

Eddy current losses depend on the rate <strong>of</strong> change <strong>of</strong><br />

flux density, the electrical conductivity <strong>of</strong> the material,<br />

and the geometry. As long as the induced eddy currents<br />

are small enough to allow the flux density to completely<br />

penetrate the material, eddy current losses can be locally<br />

calculated straight forward. This criterion would<br />

allow a maximum frequency for sinusoidal excitation<br />

below 1 kHz (Vacodur50: h = 0.35 mm, μr ≈ 4000,<br />

σ =2.83 · 106 S/m). A modified expression [8] can be<br />

used to determine eddy current loss and excess loss:<br />

W ′ e ∼ = κσh2 1<br />

·<br />

12 T ·<br />

T 2 dB<br />

dvdt<br />

0 dt<br />

The time derivative dB/dt is defined by our FEM<br />

results, σ is the electrical conductivity <strong>of</strong> the rotor material,<br />

h is the thickness <strong>of</strong> the rotor steel sheets, κ is the<br />

modified coefficient for excess loss, T is the time period.<br />

Figure 17 shows the resulting eddy current losses for<br />

a complete revolution at 12000 rpm. The modified loss<br />

coefficient was chosen as κ =1. For higher frequencies<br />

the skin–effect has to be taken into account additionally.<br />

VII. CONCLUSIONS<br />

A 2D finite element model <strong>of</strong> a switched reluctance<br />

machine is presented, using the EM–hysteresis model to<br />

- 336 - 15th IGTE Symposium 2012<br />

Fig. 17. Eddy current loss per revolution at 12000 rpm versus coil<br />

current. The losses are calculated for two points in the rotor (see Fig. 1).<br />

derive the non–linear, hysteretic BH–function. The EM<br />

is parameterised from BH–loops from experiment (e.g.<br />

Epstein measurement) or data–sheet information. The<br />

FEM calculations use only the single–valued commutation<br />

curve, derived from EM simulations. Hysteresis loss<br />

is calculated based on the local flux densities resulting<br />

from 2D–FEM, by numerical integration <strong>of</strong> the respective<br />

EM BH–loops. Under the necessary assumption <strong>of</strong> a<br />

negligible influence <strong>of</strong> the skin depth, we can estimate<br />

eddy current and excess losses in the rotor, as well.<br />

VIII. ACKNOWLEDGMENT<br />

Part <strong>of</strong> this research has been supported by the Austrian<br />

FFG, project No. 824164.<br />

REFERENCES<br />

[1] G. Bertotti. Hysteresis in magnetism. Academic Press, 2008.<br />

[2] I. D. Mayergoyz. Mathematical Models <strong>of</strong> Hysteresis. New York,<br />

Springer, 1991.<br />

[3] D. C. Jiles, D. L. Atherton. J. Magn. Magn. Mater. vol. 61, pp.<br />

48–60, 1986.<br />

[4] D. Lin et.al. IEEE Trans. Magn. 40 (2), pp. 1318–1321, 2004.<br />

[5] H. Hauser. J. Appl. Phys. 96 (5), pp. 2753–2767, 2004.<br />

[6] P. Fulmek, P. Haumer, H. Wegleiter, B. Schweigh<strong>of</strong>er. COMPEL<br />

29 (6), pp. 1504–1513, 2010.<br />

[7] P. Fulmek, N. Mehboob, P. Haumer, M. Kriegisch, R. Grössinger.<br />

SMM19, Book <strong>of</strong> Abstracts, B1–16, 2009.<br />

[8] K. Yamazaki, N. Fukushima. IEEE Trans. Magn. 46 (8), 3121 –<br />

3124, 2010.<br />

[9] T. J. E. Miller. Switched Reluctance Motors and their Control.<br />

Magna Physics Publishing and Oxford Science Publications<br />

(1993)<br />

[10] Comsol Multiphysics, http://www.comsol.com.<br />

[11] C. Geuzaine, J.–F. Remacle, Gmsh, http://www.geuz.org/gmsh.<br />

P. Dular, C. Geuzaine, GetDP, http://www.geuz.org/getdp.<br />

[12] Vacodur50, S<strong>of</strong>t magnetic Cobalt Iron http://www.<br />

vacuumschmelze.com/<br />

[13] http://www.aksteel.eu/en/1-products/3-electrical-sheet/


- 337 - 15th IGTE Symposium 2012<br />

An Iterative Domain Decomposition Method for<br />

Solving Wave Propagation Problems<br />

*Gergely Koczka, *Thomas Bauernfeind, *Kurt Preis and *Oszkár Bíró<br />

*Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />

E-mail: gergely.koczka@TU<strong>Graz</strong>.at<br />

Abstract—Solving wave propagation problems with FEM results in a huge number <strong>of</strong> unknowns due to the large air volume to<br />

be modeled. These equation systems are very ill-conditioned, because <strong>of</strong> the big material differences, element-size changes and<br />

due to the fact that the system matrices are indefinite. Common iterative methods (CG, GMRES) exhibit bad convergence due<br />

to these conditions. The memory requirement is the weakness <strong>of</strong> the direct methods. The aim <strong>of</strong> this paper is to present a<br />

method, which has smaller memory requirement than the direct methods, and converging faster than iterative methods.<br />

Index Terms—no more than 4 in alphabetical order.<br />

I. INTRODUCTION<br />

It is always an open question how to solve huge<br />

indefinite, ill-conditioned equation systems efficiently.<br />

Common iterative methods (CG, GMRES) exhibit bad<br />

convergence due to these conditions [1]. Assembling the<br />

system matrix <strong>of</strong> wave propagation problems in frequency<br />

domain with finite element method (FEM) results in this<br />

kind <strong>of</strong> equation systems because <strong>of</strong> the huge material<br />

differences and element-size changes.<br />

Applying direct solver methods to overcome the<br />

problem <strong>of</strong> the ill-conditioned system <strong>of</strong> equations results<br />

in high memory requirements [2]. The aim <strong>of</strong> this paper is<br />

to present a method with smaller memory requirement<br />

than the direct methods and better convergence quality<br />

than common iterative methods.<br />

The memory requirement <strong>of</strong> the classical LU<br />

decomposition applied to sparse matrices can be reduced<br />

with fill-in reduction algorithms: the minimum degree<br />

algorithm [3] or the nested dissection algorithm [4].<br />

These algorithms are implemented in the Intel® Math<br />

Kernel Library (PARDISO: sparse linear equation system<br />

solver routine). However, the memory requirement <strong>of</strong> the<br />

direct method is higher than first order: between<br />

2<br />

Onlog n and <br />

1.3<br />

1.4<br />

O n , typically On ,<br />

<br />

O n .For<br />

huge systems a method is required which is capable <strong>of</strong><br />

decreasing it.<br />

Fig. 1. The geometrical decomposition <strong>of</strong> the domain<br />

II. DOMAIN DECOMPOSITION<br />

With geometrical domain decomposition it is possible<br />

to decrease the memory requirement <strong>of</strong> the direct method.<br />

The problem domain should be subdivided into n<br />

open disjunctive sub-domains, namely<br />

<br />

n<br />

<br />

,<br />

i1<br />

i<br />

(1)<br />

i, j 1,2,.., n : i j . (2)<br />

i j<br />

The interfaces between the domains are<br />

ij : i j ,<br />

n<br />

(3)<br />

: .<br />

(4)<br />

<br />

i, j1<br />

With these notations, the equation system can be<br />

written in block-form:<br />

AII AIxI bI ,<br />

AIA <br />

x<br />

<br />

b<br />

<br />

<br />

where the sub-matrix AII corresponds to the domains,<br />

ij<br />

(5)<br />

A to the interfaces between the domains and I A and<br />

AI to the connections <strong>of</strong> the sub-domains and the<br />

interface.<br />

The Schur-complement equation system <strong>of</strong> (5) is<br />

obtained as<br />

<br />

A A A A x b A A b .<br />

<br />

1 1<br />

I II I I<br />

II I<br />

<br />

S<br />

c<br />

(6)<br />

To build the Schur-complement matrix S ,<br />

it is<br />

necessary to have regular subsystems corresponding to<br />

the domains (the matrix AII has to be regular).<br />

The memory requirement <strong>of</strong> the method can be<br />

estimated as follows:<br />

Let us assume that the memory requirement <strong>of</strong> the


direct method is One <br />

, where ne is the number <strong>of</strong><br />

equations and 1 2.<br />

If all the sub-domains have the<br />

same number <strong>of</strong> unknowns, then<br />

1<br />

<br />

ne <br />

n nen ,<br />

n<br />

<br />

1<br />

<br />

i.e. the overall memory requirement will be decreased by<br />

a multiplier which depends on the number <strong>of</strong> the subdomains.<br />

Assembling the Schur complement matrix takes<br />

a long time. However, for solving the reduced equation<br />

system, not the full Schur complement matrix is<br />

necessary. Applying the biconjugate gradient method<br />

(BiCG) to solve the Schur complement equation system<br />

will result in an efficient iterative solver (DD-BiCG) for<br />

solving wave propagation problems.<br />

III. ANALYSIS OF THE METHOD<br />

A. Stability<br />

Since the original equation system is symmetric, the<br />

matrices corresponding to the sub-domains and the<br />

interfaces are also symmetric. To improve the<br />

conditioning <strong>of</strong> (6), the following form is used:<br />

T T<br />

I L A A A L L x L b L<br />

A A b<br />

<br />

where L <br />

T A LL 1 1 1 1 1<br />

I II I I<br />

II I<br />

<br />

1T L SL y 1<br />

L c<br />

is the Cholesky decomposition <strong>of</strong> A <br />

. Using (8) with BiCG without<br />

preconditioning is theoretically equivalent to the form (6)<br />

using A as a preconditioner. The practical examples<br />

show that the symmetric form (8) is more stable and<br />

therefore converging faster.<br />

B. Condition number<br />

The convergence speed <strong>of</strong> the BiCG depends on the<br />

condition number <strong>of</strong> the equation system.<br />

Since the A,v formulation is used to solve the<br />

Maxwell’s equations, the resulting linear equation system<br />

is singular. To formulate a regular system a tree has to be<br />

eliminated in the discretized domain. Due to the fact that<br />

the singular system is better conditioned than the regular<br />

one, it is better to work with the singular system.<br />

If and only if the original matrix is singular, the Schurcomplement<br />

matrix is also singular, if the sub-domain<br />

matrix AII is regular.<br />

1<br />

AII :<br />

1<br />

<br />

A AIAII AIv0AII AIvI 0 1<br />

v A I A<br />

<br />

v<br />

<br />

0<br />

.(9)<br />

I AII AIv <br />

<br />

So to let the Schur complement matrix be singular, it is<br />

enough to not eliminate edges on the interface.<br />

A huge advantage <strong>of</strong> this method is that the matrix<br />

,<br />

- 338 - 15th IGTE Symposium 2012<br />

(7)<br />

(8)<br />

corresponding to the sub-domains is block-diagonal, and<br />

its blocks can be inverted in parallel. The speed <strong>of</strong> one<br />

iteration-step <strong>of</strong> the DD-BiCG can be increased by<br />

choosing the sub-domains with about the same number <strong>of</strong><br />

unknowns and using a multi-core computer.<br />

IV. NUMERICAL RESULT<br />

The efficiency <strong>of</strong> this method is shown on the example<br />

<strong>of</strong> a dipole. The dipole has a length <strong>of</strong> 140 mm, a width <strong>of</strong><br />

1 mm and thickness <strong>of</strong> 20 μm. There is a 160 μm air gap<br />

in the middle (see Fig. 3.). An air volume <strong>of</strong> 250 mm<br />

radius is modeled around the antenna (see Fig. 2.). The<br />

air volume is truncated by a first order absorbing<br />

boundary condition (ABC).<br />

Fig. 2. The structure <strong>of</strong> the dipole antenna and the truncation <strong>of</strong> the air<br />

volume. 1/8 model.<br />

The voltage is prescribed in the air gap (1 V, 1.5 GHz).<br />

Modeling an eighth <strong>of</strong> the problem, using A,v formulation<br />

(A is the magnetic vector potential, v is the modified<br />

electric scalar potential), and second order hexahedral<br />

finite elements (20 nodes, 36 edges) the resulting problem<br />

has 1.986.152 edges and 669.398 nodes.<br />

Fig. 3. The structure <strong>of</strong> the dipole antenna (yellow) near the air gap and<br />

the prescribed electric field in the gap (blue). 1/8 model.<br />

The efficiency <strong>of</strong> the DD-BiCG method compared with<br />

the incomplete Cholesky preconditioned Biconjugate<br />

gradient method (IC-BiCG) is shown in Fig. 6. The<br />

convergence criterion was the global relative residual to<br />

become smaller than 10 -7 .<br />

To demonstrate the efficiency <strong>of</strong> the method, two<br />

different decompositions were tested. In the first case the<br />

domain has been subdivided into 5 sub-domains (see Fig.<br />

4.), in the second case 8 sub-domains. (Fig. 5.).


Fig. 4. The problem with five sub-domains.<br />

In the first case the first domain is the antenna, the second<br />

is a small air volume around the antenna, and the huge air<br />

volume is subdivided into three parts.<br />

Fig. 5. The problem with eight sub-domains.<br />

In the second case, the antenna and a small air volume<br />

again build the first two sub-domains, but the air volume<br />

is subdivided into six parts.<br />

Method<br />

name<br />

TABLE I<br />

Comparison <strong>of</strong> the methods<br />

Memory<br />

Requirement<br />

Iterations Run<br />

time (h)<br />

ICCG 9,0 GB 68.750 77,86<br />

DDCG<br />

5 Domains<br />

DDCG<br />

8 Domains<br />

41,8 GB 975 4,41<br />

31,8 GB 1.036 4,66<br />

LU 81,0 GB - -<br />

To solve the problem an “Intel(R) Xeon(R) CPU 2x<br />

X5570@2.93GHz 8 cores 64 GB RAM” computer was<br />

used.<br />

- 339 - 15th IGTE Symposium 2012<br />

Fig. 6. The best residuum <strong>of</strong> the methods during the iterations.<br />

Blue line: DD-BiCG (5 domains) global residuum;<br />

Red line: DD-BiCG residuum on the interface;<br />

Blue dotted line: DD-BiCG (8 domains);<br />

Red dotted line: DD-BiCG (8 domain) residuum on the interface;<br />

Black line: IC-BiCG residuum in the whole domain.<br />

V. CONCLUSION<br />

Applying the domain decomposition method for<br />

solving huge indefinite equation system iteratively results<br />

in an efficient method with reduced memory requirement<br />

compared to direct methods, and accelerates the iteration<br />

by decreasing the number <strong>of</strong> iterations. It enables an<br />

efficient parallelization technique in implementating <strong>of</strong><br />

the algorithm.<br />

The choice <strong>of</strong> the sub-domains is very important to<br />

increase the efficiency <strong>of</strong> the method. The sub-domains<br />

should have about the same number <strong>of</strong> unknowns.<br />

Increasing the number <strong>of</strong> domains decreases the memory<br />

requirement but results in a higher condition number.<br />

REFERENCES<br />

[1] O. Nevanlinna, Convergence <strong>of</strong> iterations for linear equations.<br />

Birkhauser Verlag AG, Basel, 1993, pp. viii+177<br />

[2] G. Koczka , T. Bauernfeind, K. Preis and O. Bíró, "Schur<br />

complement method using domain decomposition for solving<br />

wave propagation problems," presented at The 10th International<br />

Workshop on Finite Elements for Microwave Engineering,<br />

Meredith, New Hampshire United States, Oct. 12-13, 2010.<br />

[3] J.W.H. Liu. Modification <strong>of</strong> the Minimum-Degree algorithm by<br />

multiple elimination. ACM Transactions on Mathematical<br />

S<strong>of</strong>tware, 11(2):141-153, 1985.<br />

[4] G. Karypis and V. Kumar. A Fast and High Quality Multilevel<br />

Scheme for Partitioning Irregular Graphs. SIAM Journal on<br />

Scientific Computing, 20(1):359-392, 1998.


- 340 - 15th IGTE Symposium 2012<br />

On Effectiveness <strong>of</strong> Model Order Reduction<br />

for Computational Electromagnetism<br />

*Yuki Sato, *Hajime Igarashi<br />

*Graduate School <strong>of</strong> Information Science and <strong>Technology</strong>, Hokkaido <strong>University</strong><br />

Kita 14, Nishi 9, Kita-ku, Sapporo, 060-0814<br />

E-mail: yukisato@em-si.eng.hokudai.ac.jp<br />

Abstract— This paper presents the model reduction method based on the method <strong>of</strong> snapshots for time-domain finite element<br />

analysis <strong>of</strong> quasi-static electromagnetic fields. In this method, the snapshots <strong>of</strong> transient electromagnetic fields for relatively<br />

short periods are stored to build the variance-covariance matrix, from whose eigenvalues the basis functions for reduced<br />

analysis are constructed. In this paper, the effect <strong>of</strong> various parameters in the present method such as the number <strong>of</strong><br />

snapshots, snapshot intervals on the results <strong>of</strong> the reduced field computations is discussed.<br />

Index Terms—Model order reduction, finite element method, method <strong>of</strong> snapshots, eddy current problem.<br />

applying it to three dimensional eddy current problems.<br />

Moreover, we discuss a possible method to determine the<br />

adequate values <strong>of</strong> the parameters for this method.<br />

I. INTRODUCTION<br />

In recent years, finite element method (FEM) has<br />

widely been applied to transient analysis <strong>of</strong> quasi-static<br />

and high-frequency electromagnetic fields. However,<br />

since FE equations must be solved at each time step, it<br />

has significant computational burden. Therefore, a lot <strong>of</strong><br />

efforts have been made to reduce the computational times<br />

for analysis <strong>of</strong> transient electromagnetic fields.<br />

One <strong>of</strong> the most promising methods to shorten the<br />

computational time would be the time-period explicit<br />

error correction (TP-EEC) method [1], [2]. It has been<br />

shown that TP-EEC method applied to non-linear eddy<br />

current problems reduces the computational time for<br />

transient analysis and gives correct steady state solutions<br />

[1], [2]. However, TP-EEC method cannot be applied to<br />

analysis <strong>of</strong> non-time-periodic problems or high-frequency<br />

problems.<br />

On the other hand, there is yet another method, called<br />

the model order reduction, which can reduce the<br />

computational time for transient analysis [3], [4]. In this<br />

method, the snapshots <strong>of</strong> transient solution are stored for<br />

initial short period. Then, using these snapshotted<br />

solutions, a variance-covariance matrix is constructed and<br />

the eigenvectors <strong>of</strong> this matrix is computed. The reduced<br />

FE matrix is then constructed using the transform matrix<br />

whose column space is spanned by the dominant<br />

eigenvectors. There are a few merits in this method; it can<br />

accurately analyze the transient solutions and can be<br />

applied to non-time-periodic problems and highfrequency<br />

problems. However, in order to realize accurate<br />

analysis, it is important to determine adequate values <strong>of</strong><br />

the parameters in this method such as snapshot interval<br />

and period, and number <strong>of</strong> the basis vectors that are<br />

chosen from the eigenvectors <strong>of</strong> the variance-covariance<br />

matrix. However, the dependence <strong>of</strong> the accuracy on<br />

these parameters has not been clarified. Moreover, though<br />

the effectiveness <strong>of</strong> the model reduction for twodimensional<br />

eddy current problems has been discussed<br />

[5], that for three dimensional problems has not been<br />

shown yet.<br />

In our study, the dependence <strong>of</strong> the accuracy in the<br />

model reduction method on its parameters is evaluated by<br />

II. REDUCTION TECHNIQUE<br />

A. Time-Domain Finite Element Method<br />

The A-φ (A-V) method is used for FE analysis <strong>of</strong><br />

quasi-static electromagnetic analysis. The governing<br />

equations derived from Maxwell’s equations is expressed<br />

as<br />

A<br />

<br />

<br />

rot rotA<br />

grad J , (1)<br />

t<br />

t<br />

<br />

A <br />

<br />

div grad 0 ,<br />

(2)<br />

t<br />

t<br />

<br />

where ν is magnetic resistivity, is conductivity and J is<br />

forced current density. The vector potential A and scholar<br />

potential φ are discretized as follows<br />

e<br />

<br />

A a N ,<br />

(3)<br />

j<br />

n<br />

<br />

j<br />

j<br />

, (4)<br />

j<br />

j j N<br />

where e and n is the number <strong>of</strong> edges and nodes, and Nj<br />

and Nj are vector and scholar interpolation functions<br />

respectively. The weighted residual method with Galerkin<br />

method applied to (1) and (2) results in the FE equation<br />

given by<br />

K 0a<br />

d N<br />

S a<br />

b<br />

<br />

,<br />

t<br />

0 0<br />

<br />

<br />

d<br />

<br />

S M<br />

<br />

<br />

0<br />

(5)<br />

<br />

t <br />

<br />

where<br />

K rotN<br />

rotN<br />

dV<br />

,<br />

(6)<br />

ij<br />

<br />

V<br />

i<br />

j<br />

Nij <br />

N i N jdV<br />

,<br />

(7)<br />

V<br />

Sij <br />

N i grad N jdV<br />

,<br />

(8)<br />

V<br />

<br />

M grad N grad N dV<br />

, (9)<br />

ij<br />

V<br />

i<br />

j


i <br />

N i JdV.<br />

V<br />

(10)<br />

Moreover, time derivative is approximated by the finite<br />

difference and the unknown variables and right hand<br />

vector are interpolated as<br />

k<br />

x x<br />

k1<br />

( 1<br />

) x ,<br />

(11)<br />

k<br />

b b<br />

k1<br />

( 1<br />

) b ,<br />

(12)<br />

where x = [a φ] t , 0 ≤ θ ≤ 1 and k represents time steps.<br />

Equation (5) now becomes<br />

1 N<br />

<br />

t<br />

t<br />

<br />

S<br />

S K<br />

M<br />

<br />

<br />

0<br />

0<br />

k<br />

0<br />

<br />

<br />

<br />

x <br />

<br />

1 N<br />

<br />

t<br />

t<br />

<br />

S<br />

S K<br />

( 1<br />

)<br />

M<br />

<br />

0<br />

k<br />

k 1<br />

0<br />

k 1<br />

b ( 1<br />

) b <br />

<br />

,<br />

0<br />

<br />

x<br />

<br />

0 <br />

(13)<br />

where Δt is the time step interval. The transient solutions<br />

can be obtained by solving (13) at each time step.<br />

B. Model Reduction<br />

As mentioned above, solution <strong>of</strong> (13) at each time step<br />

is computationally expensive if the number <strong>of</strong> unknowns<br />

is large. To reduce the computational time, the reduced<br />

equation is obtained from (13) using the model reduction<br />

method. To do so, after obtaining snapshots for the initial<br />

periods by solving (13), the variance-covariance matrix<br />

Cm<br />

t<br />

m XX C (14)<br />

is constructed where<br />

1<br />

2<br />

s<br />

X [<br />

x μ x μ x μ]<br />

, (15)<br />

x i , i=1,2,..,s are snapshotted solution vectors, s is the<br />

number <strong>of</strong> snapshots (m>>s) and μ is the mean vector <strong>of</strong><br />

these solutions. Note that the matrix Cm is a dense matrix<br />

whose size is the same as that <strong>of</strong> the FE matrix.<br />

Therefore, numerical solution to the eigenvalue problem<br />

for (14) is computationally prohibitive. In order to<br />

alleviate this problem, we consider the smaller matrix <strong>of</strong><br />

s×s defined by<br />

t<br />

Cs<br />

X X<br />

(16)<br />

instead <strong>of</strong> Cm. The eigenvalues <strong>of</strong> Cm and Cs are identical<br />

except m-s zero eigenvalues <strong>of</strong> Cm as shown below. The<br />

singular value decomposition <strong>of</strong> matrix XR m×s is given<br />

by<br />

X<br />

s<br />

<br />

i1<br />

t<br />

t<br />

u v UV<br />

,<br />

(17)<br />

i<br />

i<br />

i<br />

where UR m×s , VR s×s and<br />

diag[ 1 2 s ] , (18)<br />

σ1 ≥σ2 ≥ ... ≥σs ≥ 0 and σi is singular value <strong>of</strong> X. The<br />

matrices U and V satisfy<br />

t<br />

U I ,<br />

(19)<br />

U s<br />

t<br />

V V Is<br />

,<br />

(20)<br />

where Is denotes the s×s unit matrix. Then, the matrices<br />

Cm and Cs can be decomposed as follows:<br />

C<br />

2 t<br />

U<br />

U ,<br />

(21)<br />

m<br />

- 341 - 15th IGTE Symposium 2012<br />

2 t<br />

Cs V<br />

V . (22)<br />

From eqs. (21) and (22), we find that the eigenvalues <strong>of</strong><br />

Cm and Cs are essentially identical. Moreover, since<br />

X vi iu<br />

(23)<br />

i<br />

holds, the eigenvectors <strong>of</strong> Cm can be easily obtained by<br />

solving the eigenvalue problem for Cs.<br />

Then, the dominant r eigenvectors are chosen to<br />

construct the matrix defined by<br />

W [ 1 2<br />

r ] . w w w <br />

(24)<br />

It is assumed that the original unknown variable x k R m<br />

can be expressed by the linear combination <strong>of</strong> the reduced<br />

variables y k R r in the form<br />

W .<br />

k<br />

k<br />

x y<br />

(25)<br />

Using the transform (25), the original FE equation (13),<br />

which is simply expressed by Ax=b, can be reduced to<br />

t n t n<br />

W AWy<br />

W b .<br />

(26)<br />

Since the size <strong>of</strong> the coefficient matrix W t AW is r×r<br />

(m>>s>r), eq. (26) can be solved much faster than (13).<br />

III. ANALYSIS OF BULK CONDUCTOR MODEL<br />

The bulk conductor model shown in Fig. 1 is analyzed<br />

by using the present method. The FE model has 125000<br />

nodes, 117649 elements, 367500 edges and 369036<br />

unknown variables. The conductivity and relative<br />

permeability in the magnetic material are 0.510 7 S/m<br />

and 1000. The driving frequency is set to 50 Hz and time<br />

step is ∆t=10 -4 sec. Under these conditions, as the time<br />

constant τ is estimated to be about 0.01 sec and the period<br />

T is 0.02 sec, the relation T>τ holds.<br />

A. Dependence on snapshot interval and period<br />

We change the snapshot intervals and the period during<br />

which the snapshots are taken to clarify the dependence <strong>of</strong><br />

the solution on them. In this study, the solution to eq. (13)<br />

is snapshotted from 0 to T with snapshot intervals 4∆t,<br />

2∆t and ∆t. The snapshot period is T, T/2 and T/4.<br />

Moreover, the number <strong>of</strong> the basis functions is set as r=40<br />

and 50.<br />

The time variation in the magnetic flux density |B| at<br />

the center <strong>of</strong> magnetic material is shown in Fig. 2. In this<br />

problem, we set the number <strong>of</strong> basis function as r=40. In<br />

Fig. 2(a), we find that there are no significant differences<br />

between the original solution and the solutions obtained<br />

by the conventional method and model reduction method<br />

with different snapshot intervals. Also, Fig. 2(b) plots the<br />

time changes in |B| for initial period, 0


35<br />

20<br />

15<br />

y(mm)<br />

magnetic<br />

material<br />

J(A/m 2 )<br />

TABLE I<br />

DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON SNAPSHOT<br />

INTERVAL AND PERIOD<br />

(A) SNAPSHOT INTERVAL<br />

Snapshot intervals Δt 2Δt 4Δt<br />

computational time (%) 43.2 37.3 29.6<br />

error e(%) 0.07 0.19 0.65<br />

Snapshot period<br />

(B) SNAPSHOT PERIOD<br />

T T/2 T/4<br />

computational time (%) 42.5 30.0 24.7<br />

error e(%) 0.02 0.32 0.60<br />

TABLE II<br />

DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON NUMBER OF<br />

BASIS FUNCTION<br />

Number <strong>of</strong> basis function 20 30 40<br />

computational time (%) 40.2 41.8 43.2<br />

error (%) 2.38 0.29 0.07<br />

TABLE III<br />

DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON NUMBER OF<br />

BASIS FUNCTION LONGER TIME CONSTANT<br />

Number <strong>of</strong> basis function 40 60 80<br />

computational time (%) 46.6 56.0 65.5<br />

error (%) 0.39 0.15 0.07<br />

conductor, shown in Fig. 4, obtained by the conventional<br />

method and present method in which the snapshot period<br />

is set to T/4. The discrepancy can also be found in the<br />

initial responses shown in Fig. 3(b). This suggests that the<br />

long range errors can be predicted from the initial<br />

responses. That is, by performing the analysis using the<br />

present method for initial short period changing the<br />

snapshot period, we could know the appropriate value for<br />

it.<br />

The error between the original solution and that<br />

obtained by the present method is defined by<br />

e <br />

H i<br />

i<br />

<br />

i<br />

red<br />

H<br />

H<br />

i<br />

i<br />

z(mm)<br />

x(mm)<br />

15 20 35<br />

15<br />

coil<br />

Figure1 : Bulk conductor model<br />

100(%)<br />

- 342 - 15th IGTE Symposium 2012<br />

(27)<br />

where Hi red is the magnetic field obtained by the present<br />

method and Hi is computed from the original solution.<br />

Table I and II summarize the error e evaluated at t=0.08<br />

sec where the solutions sufficiently converge to steady<br />

state and corresponding computational time. We can see<br />

that the errors e become small as the snapshot interval<br />

decreases or the snapshot period increases. However, the<br />

35<br />

5<br />

x<br />

magnetic<br />

material<br />

20<br />

coil<br />

35<br />

x(mm)<br />

Magnetic Density (T)<br />

Magnetic Density (T)<br />

2.00E-04<br />

1.50E-04<br />

1.00E-04<br />

5.00E-05<br />

0.00E+00<br />

Original solution without reduction<br />

T/4<br />

T/2<br />

T<br />

-5.00E-05<br />

0 0.02<br />

Time (s)<br />

0.04 0.06<br />

(a) |B| during 0


(a) Original solution.<br />

(b) Solution obtained by present method.<br />

Figure 4 : Eddy current distribution.<br />

computational time simultaneously increases. This means<br />

that we must determine these parameters considering both<br />

effects.<br />

B. Dependence on number <strong>of</strong> basis functions<br />

In this section, we discuss the dependence <strong>of</strong> the<br />

solutions obtained by the present method on r, the number<br />

<strong>of</strong> the basis functions. The snapshot period and interval<br />

are set to T and Δt, respectively. The analysis results are<br />

shown in Fig. 5. It can be found both in Fig. 5 (a) and (b)<br />

that the solutions obtained by the present results approach<br />

the original solution as r increases.<br />

The computational time and error e depending on r are<br />

summarized in Table II. We can see that e decreases as r<br />

increases while there are little dependence <strong>of</strong> the<br />

computational time on r.<br />

C. Bulk conductor with longer time constant<br />

To test the validity <strong>of</strong> the present method for systems<br />

with longer time constants, we increase the conductivity <br />

- 343 - 15th IGTE Symposium 2012<br />

Magnetic Density (T)<br />

Magnetic Density (T)<br />

2.00E-04<br />

1.50E-04<br />

1.00E-04<br />

5.00E-05<br />

0.00E+00<br />

-5.00E-05<br />

0 0.01 0.02 0.03 0.04 0.05<br />

(a) |B| during 0


(a) Original solution<br />

(b) Primal basis vector w1<br />

(c) Second basis vector w2<br />

(d) Fifth basis vector w5<br />

Figure 7 : Distribution <strong>of</strong> original solution and basis vector.<br />

- 344 - 15th IGTE Symposium 2012<br />

35<br />

20<br />

15<br />

Magnetic Density (T)<br />

2.50E-04<br />

2.00E-04<br />

1.50E-04<br />

1.00E-04<br />

5.00E-05<br />

Original solution without reduction<br />

Δt<br />

0.00E+00<br />

-5.00E-05<br />

2Δt<br />

4Δt<br />

0 0.01 0.02<br />

Time (s)<br />

0.03 0.04 0.05<br />

(a) |B| during 0


sec. The snapshot period is set to T/2, T/4 and T/8. The<br />

snapshot interval and the number <strong>of</strong> basis vectors are set<br />

to Δt and 40, respectively.<br />

The time change in |B| at the center <strong>of</strong> the model is<br />

shown in Fig. 9, where (a) and (b) show the relatively<br />

long-range and initial responses, respectively. Due to the<br />

structure <strong>of</strong> the stacked iron core, the time constant <strong>of</strong> this<br />

system is much smaller than that <strong>of</strong> the bulk iron shown in<br />

Fig. 1. We can see that in Fig. 9 that the solutions are in<br />

good agreement with the original solution except during<br />

the initial short period.<br />

V. CONCLUSION<br />

In this paper, the three dimensional time-domain FE<br />

analysis using the model order reduction based on the<br />

method <strong>of</strong> snapshots has been presented. Effectiveness <strong>of</strong><br />

this present method is shown for bulk conductor and<br />

stacked iron model. It has been found that the snapshot<br />

period and number <strong>of</strong> basis functions have great influence<br />

on the transient solutions obtained by the present method.<br />

It has been suggested that these parameters could be<br />

appropriately determined by performing time marching<br />

for initial some steps for the different parameter values.<br />

In future, we plan to apply the present method to nonlinear<br />

eddy current problems and high-frequency<br />

problems.<br />

REFERENCES<br />

[1] Y. Takahashi, T. Tokumasu, M. Fujita, S. Wakao, T. Iwashita,<br />

and M.Kanazawa, “Improvement <strong>of</strong> convergence characteristic in<br />

nonlinear transient eddy-current analyses using the error<br />

correction <strong>of</strong> time integration based on the time-periodic FEM and<br />

the EEC method,” (in Japanese) IEEJ Trans. PE, vol. 129, no. 6,<br />

2009.<br />

[2] H. Igarashi, Y. Watanabe and Y. Ito, ”Why Error Correction<br />

Methods Realize Fast Computations,” IEEE Trans. Magn., vol.<br />

48, no. 2, pp.415-418, 2012.<br />

[3] Krysl, P., S. Lall, and J. Marsden, “Dimensional Model Reduction<br />

in Non-linear Finite Element Dynamics <strong>of</strong> Solids and Structures,”<br />

International Journal for Numerical Methods in Engineering,<br />

vol. 51, pp479-504, 2001.<br />

[4] G. Kerschen J. Golinval, AF. Vakakis, LA. Bergman, The<br />

method <strong>of</strong> proper orthogonal decomposition for dynamical<br />

characterization and order reduction <strong>of</strong> mechanical systems: an<br />

overview, Nonlinear Dynamics, vol. 41, pp. 147 169, 2005.<br />

[5] S. Rutenkroger, B. Deken, S. Pekarek, Reduction <strong>of</strong> Model<br />

Dimension in Nonlinear Finite Element Approximations <strong>of</strong><br />

Electromagnetic, Computers in Power Electronics, 2004,<br />

<strong>Proceedings</strong>. IEEE Workshop on, pp. 20-27, Aug., 2004.<br />

[6] P. Holmes, JL. Lumley, G. Berkooz, Turbulence; Coherent<br />

Structures; Dynamical Systems and Symmetry. Cambridge<br />

<strong>University</strong> Press: Cambridge, 1996.<br />

- 345 - 15th IGTE Symposium 2012


- 346 - 15th IGTE Symposium 2012<br />

Calculation <strong>of</strong> eddy-current probe signal for a<br />

3D defect using global series expansion<br />

Sándor Bilicz, József Pávó and Szabolcs Gyimóthy<br />

Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics<br />

Department <strong>of</strong> Broadband Infocommunications and Electromagnetic Theory<br />

Goldmann Gy. tér 3.,1111 Budapest, Hungary<br />

E-mail: bilicz@evt.bme.hu<br />

Abstract—A novel eddy-current modeling technique <strong>of</strong> volumetric defects embedded in conducting plates is presented in<br />

the paper. This problem is <strong>of</strong> great interest in electromagnetic nondestructive evaluation (ENDE) and has already been<br />

exhaustively studied. The defect is modeled by a volumetric current dipole density which satisfies an integral equation.<br />

The latter is solved by the classical method <strong>of</strong> moments. This have been usually based on the volume discretisation <strong>of</strong><br />

the defect. Contrarily, –as a new contribution– we propose the use <strong>of</strong> globally defined, continuous basis functions for the<br />

expansion <strong>of</strong> the current dipole density. This global expansion lets us expect for an improvement <strong>of</strong> the numerical stability<br />

and the performance <strong>of</strong> the simulation. The proposed method is tested against both measured and synthetic data obtained<br />

by a different defect model.<br />

Index Terms—eddy-current modeling, integral equation, global expansion, moment method<br />

I. INTRODUCTION<br />

Eddy-current nondestructive testing (ECT) is a widely<br />

used technique to reveal and characterize in-material<br />

flaws (inclusions, voids, cracks, etc.) within conducting<br />

specimens. The principle <strong>of</strong> ECT is based on the local<br />

changes in the specimen’s electromagnetic (EM) parameters<br />

due to the flaw. These changes result in an EM field<br />

different from the field in the flawless case. Either the<br />

field directly, or a deduced quantity (e.g., impedance <strong>of</strong><br />

a probe coil) is measured during a nondestructive test and<br />

the acquired data are used for the flaw reconstruction.<br />

The inverse problem <strong>of</strong> nondestructive testing can be<br />

ill-posed. This means that any <strong>of</strong> the existence, unicity<br />

and stability <strong>of</strong> the solution is not necessarrily provided.<br />

Beyond these theoretical challenges, the flaw characterization<br />

can be numerically demanding as well: the<br />

inversion algorithms are <strong>of</strong>ten iterative, i.e., several flaws<br />

are to be sequentially simulated in an optimisation loop.<br />

Consequently, a key element <strong>of</strong> the inversion is a fast and<br />

reliable numerical simulation <strong>of</strong> flaws.<br />

Classical attempts <strong>of</strong> flaw modeling are the integral<br />

approaches. They can cope with the difficulties arisen by<br />

the relatively small size <strong>of</strong> flaws compared to the excited<br />

region (yielding discretisation issues). The classical work<br />

[1] presents a flaw simulation where the flawed volume is<br />

discretised by a regular grid. The yielded volume integral<br />

eqution is resolved by the Method <strong>of</strong> Moments (MoM)<br />

[2], assuming a piecewise constant approximation <strong>of</strong> the<br />

EM field over each cell <strong>of</strong> the grid. By now, this method<br />

has been implemented in commercial s<strong>of</strong>twares, e.g., [3],<br />

and has been successfully applied in inversion algorithms<br />

as well [4]. The volume integral method has recently<br />

been revisited in [5], where the EM field is expanded<br />

by means <strong>of</strong> locally defined splines. This provides the<br />

smoothness <strong>of</strong> the field, which is violated in the previous<br />

approach. Variational formalisms have also been tried<br />

with success: in [6], a Finite Element Method (FEM)<br />

scheme is presented for the separated computation <strong>of</strong> the<br />

field in the flawless specimen and the “reaction field”<br />

risen by the presence <strong>of</strong> the flaw. Coupled methods<br />

have been introduced, e.g., in [7]: a FEM code for the<br />

computation <strong>of</strong> the flawless field is coupled with a surface<br />

integral scheme <strong>of</strong> the ideally thin crack model.<br />

In [8], an ideally thin crack is considered which is<br />

modeled by a surface integral equation, again resolved<br />

by MoM, using a piecewise constant approximation.<br />

Though some <strong>of</strong> the works above were carried out<br />

decades ago, several pitfalls are still present in eddycurrent<br />

flaw modeling. Today’s challenges are mainly<br />

related to the increasing needs <strong>of</strong> flaw inversion in the<br />

sense <strong>of</strong> accuracy and speed. Beyond being small, flaws<br />

can have bad aspect ratio as well, making the volumetric<br />

models fail. It is also not straightforward how to choose<br />

between the volumetric and the ideally thin crack models<br />

for an arbitrary defect. The optimisation-based inversion<br />

schemes can badly perform if the sensitivity data are<br />

inaccurate. Another important issue is the convergence <strong>of</strong><br />

the simulation with respect to the discretisation applied.<br />

In case <strong>of</strong> a grid-discretisation, this can only be controled<br />

at the price <strong>of</strong> computational load.<br />

These challenges inspired the improvement <strong>of</strong> the<br />

MoM-based discretisation techniques <strong>of</strong> the integral<br />

equation models. The above cited formalisms are resolved<br />

by using locally defined basis functions for the<br />

expansion <strong>of</strong> the EM field. A new approach has been<br />

presented in [9], where the basis functions were globally<br />

defined (i.e., all over the surface <strong>of</strong> the ideal crack)<br />

harmonic functions. The use <strong>of</strong> such global expansion


provided considerable advantages over local expansions.<br />

In this paper, we present the use <strong>of</strong> global expansion<br />

functions for volumetric flaw modeling. In a certain<br />

sense, this is an extension <strong>of</strong> the method formalized for<br />

the ideally thin flaws in [9].<br />

II. THE VOLUME INTEGRAL METHOD<br />

Let us consider a non-magnetic, conducting specimen<br />

(to be tested against material flaws) with a homogeneous<br />

conductivity σ0. A time-harmonic source (typically, a<br />

coil) near the specimen induces eddy-currents within the<br />

conductive medium. In the presence <strong>of</strong> a flaw embedded<br />

in the volume region V , the otherwise constant conductiviy<br />

<strong>of</strong> the specimen will locally change: σ = σ(r),<br />

r ∈ V , so the EM field will change, too. The EM field<br />

can be decomposed into a so-called incident term and a<br />

defect term :<br />

E(r) =E i (r)+E d (r), (1)<br />

where only Ei (r) would exist in the flawless (σ(r) ≡ σ0)<br />

specimen, whereas Ed (r) rises due to the flaw. The latter<br />

is imagined as the field corresponding to a fictious source<br />

distribution which takes place in the flawless specimen<br />

and has exactly the same effect as imposed by the flaw<br />

[1]. Formally, let the secondary source be a current dipole<br />

density P =(σ(r) − σ0)E, r ∈ V .ThenEd (r) can be<br />

expressed as<br />

E d <br />

(r) =−jωμ0 G(r|r ′ )P(r ′ )dV ′ , (2)<br />

V<br />

where G(r|r ′ ) is the electric-electric dyadic Green’s<br />

function transforming the current density excitation at<br />

the point r ′ to the generated electric field at the point<br />

r. ω is the angular frequency <strong>of</strong> the source and μ0 is<br />

the vacuum permeability. By substituting (2) into (1),<br />

using the definition <strong>of</strong> P, we get a Fredholm-type integral<br />

equation <strong>of</strong> the second kind for the unknown current<br />

dipole density:<br />

<br />

1<br />

P(r)+jωμ0<br />

σ(r) − σ0<br />

G(r|r<br />

V<br />

′ )P(r ′ )dV ′ =<br />

= E i (r).<br />

Once the integral equation is solved, P(r) can be used<br />

to derive quantites that can be measured during the<br />

nondestructive test. In the illustrative cases that we will<br />

present in this paper, a probe coil is used for both the<br />

excitation <strong>of</strong> the field and the acquisition <strong>of</strong> the measured<br />

data via its complex impedance variation ΔZ. As a<br />

consequence <strong>of</strong> the reciprocity principle [10], ΔZ can<br />

be computed as<br />

ΔZ = − 1<br />

I2 <br />

E i (r) · P(r)dV, (4)<br />

V<br />

with I being the amplitude <strong>of</strong> the probe coil’s current.<br />

This decomposition (1) let E i (r) and G(r|r ′ ) be<br />

separately computed, which provides the well-known<br />

advantages from the viewpoint <strong>of</strong> numerical evaluation.<br />

Moreover, the formula (4) can also be easily evaluated.<br />

- 347 - 15th IGTE Symposium 2012<br />

(3)<br />

Let us also highlight that the volume integral method<br />

bears the potential pitfall <strong>of</strong> properly computing the<br />

Green’s function. Due to its singularity, a numerically<br />

stable expression <strong>of</strong> G(r|r ′ ) <strong>of</strong>ten requires special efforts,<br />

as it will be shown in Subsection III-C.<br />

III. SOLUTION OF THE INTEGRAL EQUATION<br />

A. The studied configuration<br />

We restrict our studies to a special, but practically<br />

important configuration, outlined in Fig. 1. The specimen<br />

is assumed to be a homogeneous conducting plate with<br />

a finite thickness. The dimensions <strong>of</strong> the plate in the x<br />

and y directions are assumed to be infinite. The flaw<br />

is <strong>of</strong> cuboid shape and it has four edges perpendicular<br />

to the plate surface. The flaw edges are A, B and D,<br />

respectively, and the volume V is defined as<br />

|x| ≤ A B<br />

, |y| ≤ and |z − C| ≤<br />

2 2<br />

D<br />

, (5)<br />

2<br />

where C is the center <strong>of</strong> the crack along z. The conductivity<br />

within the flaw volume V is known, σ(r), typically,<br />

σ(r) =0.<br />

r 1<br />

r2<br />

Coil<br />

x<br />

c<br />

Coil<br />

Plate<br />

z=C<br />

y<br />

c<br />

A<br />

y<br />

z<br />

B<br />

z=0 l<br />

A<br />

z=−d<br />

D<br />

Plate<br />

Flaw<br />

TOP VIEW<br />

d<br />

h<br />

SIDE VIEW<br />

Figure 1. The studied configuration. An air-cored pancake-type coil<br />

scans above the infinite plate near the flaw. For generality, a burried<br />

flaw is sketched, however, we deal with ID and OD flaws.<br />

The probe coil is actually an air-cored pancake-type<br />

probe, driven by a time-harmonic current. During the<br />

nondestructive test, the coil scans above the damaged<br />

zone and its impedance variation is measured at given<br />

coil positions.<br />

B. Global approximation <strong>of</strong> the current dipole density<br />

Let us approximate the solution P(r) <strong>of</strong> the integral<br />

equation (3) by means <strong>of</strong> a finite series. Let the basis<br />

x<br />

x


functions <strong>of</strong> this expansion be products <strong>of</strong> three factors,<br />

each depending only on one Cartesian coordinate:<br />

P(r) =<br />

M<br />

N<br />

Q<br />

m=−M n=−N q=−Q<br />

Pmnqf m x (x)f n y (y)f q z (z),<br />

(6)<br />

The key idea in this paper is the choice <strong>of</strong> the basis<br />

functions: in contrary with the classical schemes, herein<br />

each basis function is globally defined, i.e., all over the<br />

flaw volume V . Let us note that the special restrictions<br />

for the shape <strong>of</strong> the flaw are needed here. We propose<br />

the use <strong>of</strong> the following harmonic factors in the basis<br />

functions:<br />

f m <br />

1<br />

x (x) =<br />

A exp<br />

<br />

2πj mx<br />

<br />

,<br />

A<br />

f n <br />

1<br />

y (y) =<br />

B exp<br />

<br />

2πj ny<br />

<br />

,<br />

B<br />

f q <br />

1<br />

z (z) =<br />

D exp<br />

(7)<br />

<br />

<br />

q(z − C)<br />

2πj .<br />

D<br />

In fact, this leads to a three-dimensional complex Fourierseries;<br />

the integers m, n and q are the harmonic orders.<br />

The basis functions form an orthonormal set with respect<br />

to the scalar product:<br />

<br />

<br />

g(r) ,h(r) := g(r)h ⋆ (r)dV. (8)<br />

Let us notice that our choice for the basis functions<br />

provides a smooth approximation <strong>of</strong> P, instead <strong>of</strong> the<br />

piecewise constant approximation discussed in [1]. In<br />

Section IV, the advantages provided by the global expansion<br />

are discussed along with numerical examples.<br />

C. Discretisation by the Method <strong>of</strong> Moments; computation<br />

<strong>of</strong> the matrix elements<br />

The R =(2M +1)(2N+1)(2Q+1) unknown vectorial<br />

coefficients Pmnq in the series (6) are determined by<br />

means <strong>of</strong> the Method <strong>of</strong> Moments. The testing functions<br />

are the same as the basis functions (Galerkin-method) and<br />

a linear system <strong>of</strong> R vectorial equations is obtained. For a<br />

handy formalization, let the basis functions be ordered so<br />

as each triplet <strong>of</strong> harmonic orders (m, n, q) has a unique<br />

index k (1 ≥ k ≥ R) and denote the kth basis function<br />

as<br />

wk(x, y, z) =f m x (x)f n y (y)f q z (z). (9)<br />

The elements in the system matrix <strong>of</strong> the linear equations<br />

are in the form<br />

a λκ<br />

lk =(eλ · eκ) wl(r) ,wk(r)/(σ − σ0) +<br />

<br />

<br />

jωμ0 wl(r) , eλ · G(r|r<br />

V<br />

′ )(eκwk(r ′ ))dV ′ ,<br />

(10)<br />

where l and k are the indices <strong>of</strong> the test and basis<br />

functions (l, k =1, 2,...,R), eλ and eκ are the unitvectors<br />

(λ, κ = x, y, z) and , stands for the scalar<br />

product.<br />

V<br />

- 348 - 15th IGTE Symposium 2012<br />

The evaluation <strong>of</strong> the integral with respect to r ′ dV ′<br />

needs special numerical treatment due to the singularity<br />

<strong>of</strong> the Green’s function. However, in the case <strong>of</strong> the<br />

considered planar geometry and <strong>of</strong> the proposed basis<br />

functions, one can cope with the singular kernel by<br />

means <strong>of</strong> the spectral method, presented in detail in<br />

[11]. In brief, by using the 2-dimensional spatial Fouriertransform<br />

in the xy plane, the spectrum <strong>of</strong> the Green’s<br />

function can be represented as a sum <strong>of</strong> planar waves<br />

traveling along z. Due to the product-separation form (7)<br />

<strong>of</strong> the bais functions, the integral with respect to r ′ dV ′<br />

in (10) splits up to a factor depending only on z and z ′<br />

and to an other factor which is represented by its 2D<br />

Fourier-transform. The integral with respect to z ′ can<br />

be analytically evaluated in the spectral domain, and the<br />

remaining factor (to be inverse transformed) is no longer<br />

singular.<br />

As a useful consequence <strong>of</strong> the Galerkin-method,<br />

certain elements <strong>of</strong> the yielded system matrix must be<br />

the same. In (10), the volume integral with respect to<br />

rdV (due to the scalar product) and to r ′ dV ′ can be<br />

commuted. According to the reciprocity theorem, we<br />

have<br />

a λκ<br />

lk ≡ a κλ<br />

k ′ l ′, (11)<br />

for all k and l if wk(r) ≡ wk ′(r)⋆ and wl(r) ≡ wl ′(r)⋆<br />

hold, respectively. This equivalence can be applied to<br />

check the numerical computations and/or to reduce the<br />

computational load.<br />

Finally, let us notice that the presence <strong>of</strong> the probe coil<br />

is neglected in the expression <strong>of</strong> the Green’s function.<br />

This is usual and does not cause considerable error.<br />

D. Computation <strong>of</strong> the incident field<br />

The Ei (r) incident field can be analytically computed<br />

in the studied case. The pancake-type coil generates an<br />

axisymmetric field which depends only on z and r =<br />

(x − xc) 2 +(y − yc) 2 . This field can be expressed in<br />

the form <strong>of</strong> an integral <strong>of</strong> first-order Bessel-functions, as<br />

detailed in the classical work [12].<br />

More complicated probes (e.g., including ferrit core or<br />

having rectangular-shaped turns) can also be considered.<br />

However, in such cases, Ei (r) is obviously more difficult<br />

to compute (e.g., by a Finite Element Method).<br />

Once the incident field is obtained within the flaw<br />

volume V , the excitation vector <strong>of</strong> the linear system <strong>of</strong><br />

equations yielded by the MoM can be assembled from<br />

the entries<br />

b λ l = wl(r) , eλ · E i (r) , (12)<br />

where l =1, 2,...,Ris the index <strong>of</strong> testing function and<br />

eλ is the unit vector (λ = x, y, z). As a consequence <strong>of</strong><br />

the axial symmetry, bz l ≡ 0 holds for all l.<br />

E. Implementation issues<br />

The algorithms are coded in Matlab R○ . The spectral<br />

domain expression <strong>of</strong> the Green’s function is inverse<br />

transformed by a 2-dimensional Fast Fourer Transform


(FFT2) routine. The width <strong>of</strong> the FFT2’s spatial window<br />

in the xy plane is estimated from the skin depth within<br />

the conductive medium, whereas the spectral window is<br />

assigned with respect to the harmonic orders m and n,<br />

respectively.<br />

The integrals involved by the scalar products in (10)<br />

and (12) are evaluated numerically, based on a regular<br />

discretisation <strong>of</strong> the flaw volume. The number <strong>of</strong> samples<br />

along each axis is set with respect to the harmonic orders<br />

m, n and q <strong>of</strong> the basis and testing functions, respectively.<br />

IV. TEST CASES AND COMPARISONS<br />

In this section, the proposed method is illustrated<br />

and its main advantages are highlighted via numerical<br />

examples.<br />

A. Definition <strong>of</strong> the configurations<br />

The illustrative test cases are presented in Fig. 1. The<br />

air-filled rectangular flaw has constant zero conductivity.<br />

Though a buried flaw is outlined in the sketch, we present<br />

cases for ID-type (“inner defect”, opening to the top<br />

surface <strong>of</strong> the plate: C = −D/2) and OD-type (“outer<br />

defect”, C = −d + D/2) flaws only.<br />

Experimental data <strong>of</strong> the variation <strong>of</strong> the coil’s<br />

impedance (ΔZ) are available on the xc = 0 line in<br />

function <strong>of</strong> yc, at discrete coil positions. The cases #1<br />

and #2 are JSAEM Benchmarks [13], whereas case #3 is<br />

an also frequently cited TEAM Benchmark no. 15 [14].<br />

The parameters <strong>of</strong> each case are given in Tab. I.<br />

Table I<br />

NUMERICAL DESCRIPTION OF THE TEST CASES.(NOTATION IS<br />

ACCORDING TO FIG.1.)<br />

#1 #2 #3<br />

Specimen<br />

d (mm) 1.25 1.25 12.22<br />

σ0 (MS/m) 1 1 30.6<br />

Flaw<br />

A (mm) 0.21 0.21 0.28<br />

B (mm) 10 10 12.6<br />

D (mm) 0.75 0.5 5<br />

C (mm) −d + D/2 −D/2 −D/2<br />

Probe coil<br />

r1 (mm) 0.6 0.6 6.15<br />

r2 (mm) 1.6 1.6 12.4<br />

h (mm) 0.8 0.8 6.15<br />

l (mm) 0.5 1.0 0.88<br />

f (kHz) 150 300 0.9<br />

Turns 140 140 3790<br />

B. Convergence <strong>of</strong> the series<br />

One <strong>of</strong> the main advantages <strong>of</strong> the global expansion<br />

method is the easy access to the convergence with respect<br />

to the maximal harmonic orders <strong>of</strong> M, N and Q in the<br />

series <strong>of</strong> P. By adding further terms <strong>of</strong> higher orders,<br />

the previously computed elements <strong>of</strong> the system matrix<br />

remain unchanged. Consequently, convergence studies<br />

can be performed at a much lower computational cost<br />

than in the case <strong>of</strong> local basis functions (the latter needs<br />

a new grid whenever a finer discretisation is set).<br />

- 349 - 15th IGTE Symposium 2012<br />

We have computed ΔZ in the test case #1 using different<br />

MNQ maximal orders. The discrepancy between<br />

two impedance signals is expressed by the norm<br />

<br />

ΔZ := (1/K)ΣK k=1 |ΔZ(yc,k)| 2 , (13)<br />

where the number <strong>of</strong> coil positions is actually K =11<br />

and yc,k =(k−1) mm. In Fig. 2, the normalised discrepancy<br />

between the first impedance signal (MNQ = 121)<br />

and some others obtained by higher order approximations<br />

is shown. A fast convergence is experienced: e.g., there<br />

is ca. 5% discrepancy between the impedance signals<br />

computed with MNQ = 121 and with the higher orders<br />

MNQ = 163. Let us note that the variation <strong>of</strong> the current<br />

dipole density in the x-direction is smooth enough to be<br />

modeled by a first order Fourier series, i.e., the choice<br />

M =1seems to be appropriate.<br />

||ΔZ − ΔZ 121 || / ||ΔZ 121 ||<br />

0.08<br />

0.07<br />

0.06<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

0<br />

121<br />

131<br />

151161<br />

141<br />

122<br />

132<br />

152162<br />

142<br />

123<br />

133<br />

153163<br />

143<br />

Figure 2. Normalised discrepancy between impedance signals obtained<br />

by different maximal harmonic orders MNQ (marked above each bar)<br />

in test problem #1.<br />

The fast convergence is also reasoned by the behavior<br />

<strong>of</strong> the coefficients Pmnq. Again in the test case #1, we<br />

examined the coefficients <strong>of</strong> the x-directed current dipole<br />

density P x (r) (note that P x is much more dominant than<br />

P y and P z in this case) for a centered coil location (xc =<br />

yc =0). Some coefficients <strong>of</strong> the largest magnitude are<br />

plotted in Fig. 3. A fast decrease <strong>of</strong> the magnitudes is<br />

experienced as the harmonic orders increase.<br />

C. Comparison to exparimental data and to the ideally<br />

thin crack model<br />

The volumetric flaw model using global expansion<br />

is a sort <strong>of</strong> extension <strong>of</strong> the model proposed in [9].<br />

Therein, ideally thin cracks are considered and modeled<br />

by a surface layer <strong>of</strong> current dipole density. This can be<br />

imagined as if the A edge length <strong>of</strong> the crack (Fig. 1)<br />

would collapse to zero whereas the x component <strong>of</strong><br />

the total electric field E vanishes on the crack surface.<br />

For the surface current dipole density, certain boundary<br />

conditions must hold, thus, the basis functions in the


x<br />

Normalized |P |<br />

mnq<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0 1 0<br />

0 −1 0<br />

1 1 0<br />

−1 1 0<br />

1 −1 0<br />

−1 −1 0<br />

0 −2 0<br />

0 2 0<br />

1 −2 0<br />

−1 −2 0<br />

1 2 0<br />

−1 2 0<br />

0 3 0<br />

0 −3 0<br />

−1 1 1<br />

−1 −1 1<br />

1 1 1<br />

1 −1 1<br />

−1 1 −1<br />

−1 −1 −1<br />

1 1 −1<br />

1 −1 −1<br />

1 1 2<br />

−1 1 2<br />

1 −1 2<br />

−1 −1 2<br />

1 1 3<br />

1 −1 3<br />

−1 1 3<br />

−1 −1 3<br />

1 1 −2<br />

−1 1 −2<br />

1 −1 −2<br />

−1 −1 −2<br />

Figure 3. Normalized magnitudes <strong>of</strong> the coefficients <strong>of</strong> the x-directed<br />

current dipole density P x (r) in test problem #1. The 34 highest magnitudes<br />

are plotted (totally there are (2M + 1)(2N + 1)(2Q + 1) = 273<br />

coefficients, as M =1, N =6and Q =3is chosen); the triplets<br />

mnq (any <strong>of</strong> the indices can be negative as it can be seen) are marked<br />

above each bar.<br />

series expansion –e.g., for ID cracks– are products <strong>of</strong><br />

the following sine and cosine functions:<br />

g n <br />

2<br />

y (y) =<br />

B sin<br />

<br />

y + B/2<br />

nπ ,<br />

B<br />

g q <br />

2<br />

z (z) =<br />

D cos<br />

<br />

(2q − 1)π z<br />

(14)<br />

<br />

,<br />

2D<br />

with the integers n and q, as “harmonic orders”. Herein<br />

we do not deal with the surface model in detail, but<br />

only present some results for comparison, provided by<br />

the authors <strong>of</strong> [9].<br />

In Figs. 4, 5 and 6, comparisons <strong>of</strong> impedances (i)<br />

computed by our volumetric model, (ii) by the surface<br />

model and (iii) measured data (provided with the benchmarks)<br />

are presented. The comparisons let us conclude<br />

the followings:<br />

• The volumetric model can appropriately reconstruct<br />

the measured data at very low maximal harmonic<br />

orders. Let us note again that for instance, when<br />

MNQ = 122, we have only 75 (vectorial) unknowns<br />

in the series (6).<br />

• Though there is no straightforward connection between<br />

the harmonic orders <strong>of</strong> the volumetric and<br />

surface models, in the presented cases, the volumetric<br />

model provides better results in the sense <strong>of</strong> |ΔZ|<br />

than the surface model with more-or-less the same<br />

harmonic orders. In Figs. 4 and 5, the surface model<br />

appears to be unstable at NQ =44. (However, the<br />

surface model also has good convergence properties<br />

but at considerably higher N and Q which is not<br />

presented herein.)<br />

• The volumetric model tends to slightly overestimate<br />

|ΔZ| and underestimate the phase arg{ΔZ}.<br />

Whereas the phase error is acceptable in Figs. 4 and<br />

5, it becomes considerable in Fig. 6.<br />

- 350 - 15th IGTE Symposium 2012<br />

|ΔZ| (mΩ)<br />

arg{ΔZ} (rad)<br />

80<br />

60<br />

40<br />

20<br />

0<br />

1.5<br />

1<br />

0.5<br />

0<br />

JSAEM OD−60 Benchmark 150 kHz<br />

measured<br />

vol 011<br />

vol 122<br />

surf 44<br />

surf 66<br />

surf 88<br />

0 2 4 6 8 10<br />

Coil position y (mm)<br />

c<br />

Figure 4. Impedance variation in the test case #1. Legend: “vol”<br />

and “surf” refer to the volumetric and the surface model. The maximal<br />

harmonic orders MNQ and NQ used in the simulations are also given.<br />

|ΔZ| (mΩ)<br />

arg{ΔZ} (rad)<br />

150<br />

100<br />

50<br />

0<br />

3<br />

2<br />

1<br />

JSAEM ID−40 Benchmark 300 kHz<br />

measured<br />

vol 011<br />

vol 122<br />

surf 44<br />

surf 66<br />

surf 88<br />

0<br />

0 2 4 6 8 10<br />

Coil position y (mm)<br />

c<br />

Figure 5. Impedance variation in the test case #2. Legend notations<br />

are explained in Fig. 4.<br />

The computation times are quite short. The assemblation<br />

<strong>of</strong> the system matrix took, e.g., 285 s for the OD60<br />

flaw and 202 s for the ID40 flaw when a basis function<br />

set with maximal orders M =1, N =4, Q =8was<br />

used for both. Though the number <strong>of</strong> samples within the<br />

flaws for the numerical integration is the same in both<br />

cases, the OD60 flaw computation needs a wider spatial<br />

window for the FFT2 as the lower frequency yields a<br />

higher skin-depth.<br />

V. CONCLUSION AND PERSPECTIVES<br />

The classical integral equation models <strong>of</strong> volumetric<br />

flaws have been used for decades in the simulation<br />

<strong>of</strong> ECT. Though these schemes have many advantages,<br />

several bottlenecks are still present. In this paper, we<br />

proposed a new discretisation technique for the numerical<br />

solution <strong>of</strong> the volume integral equation. Instead <strong>of</strong> the<br />

locally defined, pulse basis functions, we use globally<br />

defined, harmonic basis functions for the expansion <strong>of</strong><br />

the unknown current dipole density distribution. Thanks


|ΔZ| (Ω)<br />

arg{ΔZ} (rad)<br />

20<br />

15<br />

10<br />

5<br />

0<br />

2<br />

1.5<br />

1<br />

0.5<br />

TEAM Benchmark<br />

measured<br />

vol 011<br />

vol 122<br />

surf 22<br />

surf 44<br />

surf 66<br />

0 5 10 15 20<br />

Coil position y (mm)<br />

c<br />

Figure 6. Impedance variation in the test case #3. Legend notations<br />

are explained in Fig. 4.<br />

to this choice, an improvement in the accuracy and<br />

performance <strong>of</strong> the simulation has been experienced in<br />

the test cases. The results obtained so far are promising,<br />

the research certainly needs to be continued, with special<br />

emphasis on the followings:<br />

• More parametric studies are needed for a various<br />

range <strong>of</strong> flaw sizes and frequencies to confirm that<br />

the new scheme outperforms the existing ones, and<br />

at the same time, to point out its limitations. We<br />

have not considered yet, for instance, through-plate<br />

flaws.<br />

• The method could easily be extended to the case<br />

<strong>of</strong> flaws embedded in thick plates (modeled as halfspace).<br />

The extension must be possible to the case<br />

<strong>of</strong> layered medium as well, which might be <strong>of</strong> more<br />

practical interest.<br />

<br />

• As the aspect ratio <strong>of</strong> the flaw (e.g., width length)<br />

•<br />

gets worse, the volumetric model is expected to become<br />

less accurate and the ideally thin crack model<br />

should be applied instead. However, the relation between<br />

the two models has not been exactly revealed<br />

yet. One expects the results <strong>of</strong> the volumetric model<br />

to converge to the results <strong>of</strong> the surface model as the<br />

width <strong>of</strong> the flaw collapses. This should be studied<br />

both in theoretical and in numerical senses as well.<br />

The inverse problem is <strong>of</strong>ten formalised as an<br />

optimisation task <strong>of</strong> minimizing the discrepancy<br />

between the measured and simulated data. The<br />

gradient-based schemes require the sensitivity data<br />

with respect to the parameters <strong>of</strong> the flaw. This<br />

sensitivity is accessible, e.g., via the the adjoint<br />

problem [15]. However, the numerical stability <strong>of</strong><br />

the gradient computation strongly depends on the<br />

precision <strong>of</strong> the EM field calculation near the boundaries<br />

<strong>of</strong> the flaw. The proposed global expansion<br />

<strong>of</strong> P could improve the precision in these cruical<br />

regions.<br />

- 351 - 15th IGTE Symposium 2012<br />

The authors think that the contribution <strong>of</strong> this paper<br />

can be <strong>of</strong> industrial interest as well, if the further numerical<br />

studies remain convincing about its performance.<br />

VI. ACKNOWLEDGEMENTS<br />

This research is supported by the Hungarian Science<br />

Research Fund (OTKA grant no. K105996).<br />

REFERENCES<br />

[1] J. R. Bowler, S. A. Jenkins, L. D. Sabbagh, and H. A. Sabbagh,<br />

“Eddy-current probe impedance due to a volumetric flaw,” Journal<br />

<strong>of</strong> Applied Physics, vol. 70, no. 3, pp. 1107 –1114, 1991.<br />

[2] R. F. Harrington, Field computation by moment methods.<br />

Macmillan, 1968.<br />

[3] CIVA. “CIVA: State <strong>of</strong> the art simulation platform for NDE”.<br />

[Online]. Available: http://www-civa.cea.fr<br />

[4] S. Bilicz, E. Vazquez, M. Lambert, S. Gyimóthy, and J. Pávó,<br />

“Characterization <strong>of</strong> a 3D defect using the expected improvement<br />

algorithm,” COMPEL: The International Journal for Computation<br />

and Mathematics in Electrical and Electronic Engineering,<br />

vol. 28, no. 4, pp. 851–864, 2009.<br />

[5] C. Reboud, D. Prémel, D. Lesselier, and B. Bisiaux, “New discretisation<br />

scheme based on splines for volume integral method:<br />

Application to eddy current testing <strong>of</strong> tubes,” COMPEL: The<br />

International Journal for Computation and Mathematics in Electrical<br />

and Electronic Engineering, vol. 27, no. 1, pp. 288–297,<br />

2008.<br />

[6] Z. Badics, Y. Matsumoto, K. Aoki, F. Nakayasu, M. Uesaka, and<br />

K. Miya, “Accurate probe-response calculation in eddy current<br />

NDE by finite element method,” Journal <strong>of</strong> Nondestructive<br />

Evaluation, vol. 14, pp. 181–192, 1995. [Online]. Available:<br />

http://dx.doi.org/10.1007/BF00730888<br />

[7] Y. Le Bihan, J. Pavo, M. Bensetti, and C. Marchand, “Computational<br />

environment for the fast calculation <strong>of</strong> ect probe signal by<br />

field decomposition,” Magnetics, IEEE Transactions on, vol. 42,<br />

no. 4, pp. 1411 –1414, 2006.<br />

[8] J. R. Bowler, “Eddy-current interaction with an ideal crack. I. The<br />

forward problem,” Journal <strong>of</strong> Applied Physics, vol. 75, no. 12, pp.<br />

8128–8137, 1994.<br />

[9] J. Pávó and D. Lesselier, “Calculation <strong>of</strong> eddy current testing<br />

probe signal with global approximation,” IEEE Transactions on<br />

Magnetics, vol. 42, no. 4, pp. 1419–1422, 2006.<br />

[10] R. F. Harrington, Time-harmonic electromagnetic fields.<br />

McGraw-Hill, 1961.<br />

[11] J. Pávó and K. Miya, “Reconstruction <strong>of</strong> crack shape by optimization<br />

using eddy current field measurement,” IEEE Transactions on<br />

Magnetics, vol. 30, no. 5, pp. 3407–3410, 1994.<br />

[12] C. V. Dodd and W. E. Deeds, “Analytical solutions to eddy-current<br />

probe-coil problems,” Journal <strong>of</strong> Applied Physics, vol. 39, no. 6,<br />

pp. 2829–2838, 1968.<br />

[13] T. Takagi, M. Uesaka, and K. Miya, “Electromagnetic NDE<br />

research activities in JSAEM,” in Electromagnetic Nondestructive<br />

Evaluation, ser. Studies in Applied Electromagnetics and Mechanics,<br />

T. Takagi, J. R. Bowler, and Y. Yoshida, Eds. IOS Press,<br />

1997, vol. 1, pp. 9–16.<br />

[14] T.E.A.M. Benchmark Problems. Accessed on 7.08.2012. [Online].<br />

Available: http://www.compumag.org/jsite/team.html<br />

[15] S. J. Norton and J. R. Bowler, “Theory <strong>of</strong> eddy current inversion,”<br />

Journal <strong>of</strong> Applied Physics, vol. 73, no. 2, pp. 501–512, 1993.


- 352 - 15th IGTE Symposium 2012<br />

Computation <strong>of</strong> the Motion <strong>of</strong> Conducting Bodies<br />

Using the Eddy-Current Integral Equation<br />

*Mihai Maricaru, † Ioan R. Ciric, *Horia Gavrila, *George-Marian Vasilescu and *Florea I. Hantila<br />

*Department <strong>of</strong> Electrical Engineering, Politehnica <strong>University</strong> <strong>of</strong> Bucharest, Spl. Independentei 313,<br />

Bucharest, 060042, Romania, E-mail: mihai.maricaru@upb.ro<br />

† Department <strong>of</strong> Electrical and Computer Engineering, The <strong>University</strong> <strong>of</strong> Manitoba, Winnipeg, MB R3T 5V6, Canada<br />

Abstract—The analysis <strong>of</strong> the motion <strong>of</strong> a system <strong>of</strong> solid conductors in the presence <strong>of</strong> magnetic fields is performed by<br />

solving the classical mechanics equation <strong>of</strong> motion under the action <strong>of</strong> magnetic forces. Application <strong>of</strong> the eddy-current<br />

integral equation and the usage <strong>of</strong> the local coordinates attached to the bodies in motion allow the determination <strong>of</strong><br />

electromagnetic field without being necessary to reconstruct the discretization grid at each new position <strong>of</strong> the conducting<br />

bodies. Only the submatrices associated with the coupling between the bodies in relative motion are modified in the global<br />

system matrix. A time-domain method <strong>of</strong> solution is first presented for the electromagnetic field problem, coupled with the<br />

equation <strong>of</strong> motion, which can be efficiently applied at high frequencies when the time steps are small. The eddy-current<br />

integral equation for the derivative <strong>of</strong> current density contains a term that takes into account the relative motion <strong>of</strong> the<br />

bodies. Since the electromagnetic quantities vary much more rapidly than the mechanical quantities, a second method is also<br />

proposed in this paper, where the eddy-current integral equation is solved in the frequency domain by assuming that the<br />

bodies are motionless, but by adding supplementary terms due to the actual motion <strong>of</strong> the bodies. Thus, only the average<br />

force over a period <strong>of</strong> time is now computed. This method is extremely efficient especially at higher frequencies when the<br />

time steps are very small.<br />

Index Terms—eddy-current integral equation, electrodynamics <strong>of</strong> moving conductors, levitation.<br />

I. INTRODUCTION<br />

The equation <strong>of</strong> translational motion <strong>of</strong> a solid<br />

conducting body <strong>of</strong> mass m under the action <strong>of</strong> the<br />

magnetic force F is<br />

2<br />

d r dr<br />

m F(<br />

r,<br />

, )<br />

G<br />

(1)<br />

2<br />

dt dt<br />

where r is the position vector <strong>of</strong> a point <strong>of</strong> the body, for<br />

instance <strong>of</strong> its center <strong>of</strong> gravity, is a vector<br />

representing the imposed current distribution and G is the<br />

gravitational force acting on the body. Equation (1) is<br />

discretized in time and F is determined at each time step<br />

by solving an electromagnetic field problem in the region<br />

with moving bodies. The application <strong>of</strong> the Finite<br />

Element Method requires a tremendous amount <strong>of</strong><br />

computation since it is necessary to reconstruct the<br />

discretization mesh at each time step. Moreover, the<br />

modifications <strong>of</strong> the discretization mesh are, usually,<br />

accompanied by undesired cumulative errors in the<br />

successive solutions <strong>of</strong> the electromagnetic field. A<br />

substantial improvement can be achieved when adopting<br />

hybrid Finite Element – Boundary Element Methods [1].<br />

Using the “laboratory” frame <strong>of</strong> references complicates<br />

the field problem solution due to the presence <strong>of</strong> the<br />

motional electric field intensity v0 B , where v 0 is the<br />

body velocity in this frame <strong>of</strong> references and B the<br />

magnetic induction. This disadvantage is eliminated<br />

when employing local frames <strong>of</strong> reference, attached to<br />

the bodies in motion [1], [2]. This also allows the usage<br />

<strong>of</strong> the simpler eddy-current integral equation for the<br />

bodies at rest, as it has been done in the case the<br />

velocities <strong>of</strong> the bodies are known [2]. However, in many<br />

situations the velocities <strong>of</strong> the bodies are not known, as,<br />

for instance, in the case <strong>of</strong> the electromagnetic levitation,<br />

their determination constituting one <strong>of</strong> the objectives <strong>of</strong><br />

the present work.<br />

In the case <strong>of</strong> the electromagnetic levitation, to ensure<br />

the stability <strong>of</strong> the solution it is necessary to choose a<br />

sufficiently small time step. Since for same accuracy <strong>of</strong><br />

the results the time period has to be divided practically in<br />

the same number <strong>of</strong> intervals (for example, at 50 Hz in<br />

200 intervals [1]), at higher frequencies the time step<br />

decreases. Unfortunately, as the time step decreases, the<br />

successively computed solutions tend to be very close to<br />

each other and the errors in the solution differences<br />

increase considerably, the computation procedure<br />

becoming inefficient.<br />

In the present paper, a new procedure is described for<br />

the time-domain solution <strong>of</strong> the eddy-current integral<br />

equation applicable to small time steps. As well, a<br />

technique is proposed for accelerating the determination<br />

<strong>of</strong> the trajectory <strong>of</strong> the moving bodies, based on the<br />

frequency-domain solution <strong>of</strong> the eddy-current integral<br />

equation.<br />

II. TIME-DOMAIN SOLUTION OF THE EDDY-CURRENT<br />

INTEGRAL EQUATION<br />

For two-dimensional field problems, the time-domain<br />

eddy-current integral equation for motionless conductors<br />

is<br />

<br />

d<br />

1<br />

J ( r, t)<br />

J ( r',<br />

t)<br />

ln dS'<br />

dt R<br />

<br />

d<br />

1<br />

<br />

Ji<br />

(r',<br />

t)<br />

ln dS'<br />

(2)<br />

dt<br />

R<br />

i<br />

where r and r ' are the position vectors <strong>of</strong> the observation<br />

point and <strong>of</strong> the source point, respectively, is the


egion containing the solid conductors, i is the region<br />

where the imposed current density J i is confined,<br />

0<br />

R |<br />

r r'|<br />

, , 0 being the permeability <strong>of</strong> free<br />

2<br />

space.<br />

In the three-dimensional case, the eddy-current integral<br />

equation has the form<br />

d J(<br />

r',<br />

t)<br />

J(<br />

r,<br />

t)<br />

dV '<br />

2 dt R<br />

<br />

d Ji<br />

( r',<br />

t)<br />

dV ' grad <br />

2 dt R<br />

<br />

i<br />

where is the electric scalar potential.<br />

To simplify the formulation, we consider here a<br />

two-dimensional structure with a single solid conductor.<br />

Using a frame <strong>of</strong> reference attached to the conducting<br />

body in motion, the time discretization <strong>of</strong> (3) leads to<br />

t<br />

1<br />

( J J0<br />

) ( J1<br />

J0<br />

) ln dS '<br />

2<br />

R<br />

<br />

1 <br />

- 353 - 15th IGTE Symposium 2012<br />

(3)<br />

1<br />

1<br />

J0t Ji<br />

ln dS'<br />

ln '<br />

1 Ji<br />

dS (4)<br />

0<br />

R<br />

R<br />

<br />

i1<br />

where the subscript “0” indicates the time t and the<br />

subscript “1” the time t t<br />

. Dividing (4) by t yields<br />

<br />

i0<br />

J<br />

t<br />

J<br />

1<br />

ln dS'<br />

J<br />

t<br />

1 2 t<br />

1 R<br />

2<br />

<br />

2<br />

Ji<br />

1<br />

1<br />

ln ' ( ) 1 dS <br />

ln '<br />

1 n v<br />

Ji<br />

dl (5)<br />

t<br />

R<br />

2 R<br />

i<br />

1<br />

2<br />

2<br />

i<br />

1 t<br />

where the subscript “ ” refers to the time t<br />

2<br />

2<br />

<br />

, i<br />

is the boundary <strong>of</strong> i , v is the velocity <strong>of</strong> the i in the<br />

frame <strong>of</strong> references attached to , and n is the outward<br />

unit vector normal to i<br />

. The last term in (5) is due to<br />

the relative motion <strong>of</strong> and i . Solution <strong>of</strong> (5) gives<br />

the current distribution J 1 at the time step t t<br />

in<br />

terms <strong>of</strong> that at the time step t in the form<br />

1<br />

2<br />

J<br />

<br />

J1 t<br />

J<br />

1<br />

0<br />

t<br />

<br />

2<br />

. (6)<br />

The magnetic force is evaluated by applying Ampère’s<br />

force formula, i.e.,<br />

r ri<br />

F <br />

J<br />

( r,<br />

t)<br />

Ji<br />

( ri<br />

, t)<br />

dSi<br />

dS (7)<br />

2<br />

| r r |<br />

<br />

i<br />

i<br />

0<br />

with r and r i being the position vectors <strong>of</strong> the points <strong>of</strong><br />

and i , respectively.<br />

The spatial discretization grid in the two-dimensional<br />

case is constructed by dividing the region into<br />

polygonal surface elements m , with the induced current<br />

density considered to be constant through each m . The<br />

region i is divided into surface elements i , with the<br />

k<br />

imposed current density being constant through each<br />

i . Integrating (5) over each <br />

k<br />

m yields the following<br />

J<br />

<br />

matrix equation for the vector :<br />

t<br />

t<br />

<br />

J<br />

<br />

J<br />

A B AJ<br />

i <br />

<br />

0 Bi<br />

C i<br />

2 <br />

t<br />

<br />

t<br />

<br />

1<br />

2<br />

1<br />

2<br />

1<br />

2<br />

J1 2<br />

where A is a diagonal matrix with entries Am mSm<br />

,<br />

m being the resistivity <strong>of</strong> the material for m and S m<br />

its area, and B is a symmetric matrix with its entries<br />

corresponding to the elements m <strong>of</strong> having the form<br />

B<br />

<br />

<br />

1<br />

ln dS<br />

dS<br />

S mS<br />

m,<br />

k<br />

k m = k<br />

R<br />

mk <br />

4<br />

1 2<br />

<br />

<br />

m k<br />

(8)<br />

1<br />

( n m nk<br />

) R ln dlkdlm<br />

. (9)<br />

R<br />

The entries <strong>of</strong> the matrix B i are defined as in (9), but<br />

with the elements k <strong>of</strong> being replaced with the<br />

elements i belonging to <br />

k<br />

i , while the entries <strong>of</strong> the<br />

matrix C are<br />

C<br />

m,<br />

i<br />

k<br />

<br />

1<br />

2<br />

<br />

<br />

m ik<br />

1<br />

( n m R)(<br />

ni<br />

v)<br />

ln dl<br />

k<br />

i dl<br />

k m . (10)<br />

R<br />

All integrals in (9) and (10) are evaluated by analytic<br />

expressions, the entries <strong>of</strong> the matrix B being calculated<br />

only once, but those <strong>of</strong> the matrices B i and C are to be<br />

calculated for each new position <strong>of</strong> i .<br />

Taking into account the small dimensions <strong>of</strong> the<br />

elements m , a rapid numerical computation <strong>of</strong> the force<br />

in (7) is performed using the approximation<br />

where the vector<br />

Jm<br />

F Sm Pi<br />

(11)<br />

k<br />

m k<br />

P i is expressed in the form<br />

k<br />

1<br />

P i n<br />

k<br />

i ln dl<br />

k<br />

i (12)<br />

k<br />

| rm<br />

ri<br />

|<br />

Jik <br />

ik<br />

k


with r m being the position vector <strong>of</strong> the center <strong>of</strong> the<br />

element m <strong>of</strong> and r i the position vector <strong>of</strong> the<br />

k<br />

point <strong>of</strong> integration on i<br />

. When the ratio <strong>of</strong> the linear<br />

k<br />

dimensions <strong>of</strong> i to the distance between its center and<br />

k<br />

the center <strong>of</strong> m is sufficiently small, P i can be<br />

k<br />

calculated by subdividing i in a number <strong>of</strong> elements<br />

k<br />

p in terms <strong>of</strong> this ratio and by using the summation<br />

k<br />

rm<br />

rpk<br />

P i J<br />

k ik<br />

S p (13)<br />

2 k<br />

p | rm<br />

rp<br />

|<br />

k<br />

where r p is the position vector <strong>of</strong> the center <strong>of</strong> p<br />

k<br />

k and<br />

S p the area <strong>of</strong> the element p<br />

k<br />

k . The same technique is<br />

applied for a rapid numerical calculation <strong>of</strong> the entries in<br />

the matrices B i and C in (8), making also use <strong>of</strong> the<br />

relation<br />

1 R<br />

( n i v)<br />

ln dl'<br />

v <br />

k dS'<br />

. (14)<br />

R<br />

2<br />

<br />

R<br />

i<br />

i<br />

k<br />

In the case the imposed currents are periodic, the initial<br />

distribution <strong>of</strong> the induced current can be obtained by<br />

performing a Fourier expansion and by employing the<br />

phasor form <strong>of</strong> the eddy-current integral equation (see<br />

Section IV).<br />

III. SOLUTION OF EQUATION OF MOTION<br />

Equation (1) is solved iteratively. We choose an<br />

appropiate time step t and assume that the magnetic<br />

force F has a linear variation during t . At the time t the<br />

body has a position defined by the vector r 0 and a<br />

magnetic force F 0 is exerted upon it. The iterative<br />

process is started by imposing the value F1 F0<br />

at the<br />

time t t<br />

and the position vector r 1 results from<br />

solving (1). The electromagnetic field problem is then<br />

solved for the new r 1 and a new value <strong>of</strong> the force F 1 is<br />

determined for the time t t<br />

. This operation is repeated<br />

until the difference between two successive values <strong>of</strong> the<br />

magnetic force for the time t t<br />

is sufficiently small<br />

and, then, we proceed to the next time step.<br />

IV. FREQUENCY-DOMAIN SOLUTION OF THE<br />

EDDY-CURRENT INTEGRAL EQUATION<br />

Since the region i is moving with the velocity v in<br />

the frame <strong>of</strong> reference attached to , (2) is written in the<br />

form<br />

J<br />

( r',<br />

t)<br />

1 Ji<br />

( r',<br />

t)<br />

1<br />

J ( r, t)<br />

ln dS'<br />

<br />

ln dS'<br />

t R<br />

t<br />

R<br />

<br />

i<br />

k<br />

1<br />

( n v)<br />

Ji ( r',<br />

t)<br />

ln dl'<br />

. (15)<br />

R<br />

i<br />

k<br />

- 354 - 15th IGTE Symposium 2012<br />

If the imposed currents are sinusoidal, the phasor<br />

representation <strong>of</strong> (15) is<br />

1<br />

1<br />

J ( ) j J ( ')<br />

ln dS'<br />

j J i ( ')<br />

ln dS'<br />

R<br />

R<br />

<br />

<br />

r<br />

r r <br />

<br />

1<br />

( n v)<br />

J i ( r')<br />

ln dl'<br />

(16)<br />

R<br />

<br />

i<br />

where 2<br />

f<br />

, f being the frequency, and<br />

re im<br />

J J jJ<br />

is the phasor form <strong>of</strong> the current density,<br />

with j 1<br />

. The two terms on the right side <strong>of</strong> (16)<br />

show the contribution to the induced current density due<br />

to the time variation <strong>of</strong> the imposed currents and that due<br />

to the relative motion. The same technique as in the case<br />

<strong>of</strong> the time-domain analysis is used for the space<br />

discretization <strong>of</strong> (16). One obtains the following<br />

algebraic system with complex coefficients:<br />

re im im re<br />

AJ BJ Bi<br />

Ji<br />

CJi<br />

im re re im<br />

AJ BJ Bi<br />

Ji<br />

CJi<br />

. (17)<br />

The average magnetic force over a period is evaluated<br />

using the relation<br />

<br />

<br />

* r ri<br />

<br />

Fav <br />

ReJ<br />

( r)<br />

J i ( ri<br />

) dSi<br />

dS<br />

2 (18)<br />

<br />

<br />

| r ri<br />

|<br />

i<br />

<br />

where the asterisk indicates the complex conjugate.<br />

For a multiple-conductor systems, one uses local<br />

frames <strong>of</strong> reference attached to each <strong>of</strong> the conductors.<br />

For the conductor q, occupying the region ,<br />

q 1,<br />

2,<br />

,<br />

(16) is written in the form<br />

i<br />

q<br />

1<br />

1<br />

J ( r) j<br />

J ( r')<br />

ln dS'<br />

j<br />

J ( r')<br />

ln dS'<br />

R<br />

R<br />

<br />

pq<br />

<br />

q<br />

( q)<br />

1<br />

1<br />

( n v p ) J ( r')<br />

ln dl'<br />

j<br />

J i ( r'<br />

) ln dS'<br />

R<br />

R<br />

pq<br />

<br />

p<br />

i<br />

( q)<br />

1<br />

( n<br />

vi<br />

) J i ( r')<br />

ln dl'<br />

(19)<br />

R<br />

<br />

i<br />

(q)<br />

(q)<br />

where v p and v i are, respectively, the velocities <strong>of</strong><br />

the conductor p and <strong>of</strong> i with respect to the conductor<br />

q. Equation (1) is always solved separately for each body.<br />

V. SOLUTION ACCELERATION FOR THE EQUATION OF<br />

MOTION<br />

The computation <strong>of</strong> the motion <strong>of</strong> conducting bodies<br />

can be spectacularly accelerated by using the average<br />

value <strong>of</strong> the force over a period, evaluated using the<br />

p


Figure 1: Discretization <strong>of</strong> the levitated plate.<br />

y (m)<br />

0.06<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

0.00<br />

0 1 2 3 4 5 6 7 8 9 10 11 12 13<br />

t (s)<br />

Figure 2: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />

f = 2,000 Hz.<br />

y (m)<br />

0.06<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

0.00<br />

0.0 0.1 0.2 0.3 0.4 0.5<br />

t (s)<br />

Figure 3: Detail regarding the motion at the beginning for f = 2,000 Hz.<br />

phasor representation <strong>of</strong> current density. Using the<br />

algorithm described in Section III, the time step is chosen<br />

to be a multiple <strong>of</strong> the period and is adjusted according to<br />

the force value, such that when the force decreases the<br />

time step is increased and when the force increases it is<br />

reduced.<br />

VI. ILLUSTRATIVE EXAMPLE<br />

A copper plate <strong>of</strong> width 80 mm, thickness 4 mm (see<br />

8<br />

Fig. 1), resistivity 210<br />

m<br />

and <strong>of</strong> mass density<br />

3 3<br />

8. 9<br />

10 kg / m is levitated using two coils <strong>of</strong> 200 turns<br />

each, <strong>of</strong> 10 mm 10 mm in cross section and a distance<br />

between the axes <strong>of</strong> their sides <strong>of</strong> 70 mm and 30 mm,<br />

respectively. The current direction is the same in the<br />

outer and inner coils, the current intensity in each turn<br />

being i I 2 sin 2ft<br />

, with I = 10 A and f = 2,000 Hz.<br />

- 355 - 15th IGTE Symposium 2012<br />

y<br />

y (m)<br />

0.050<br />

0.045<br />

0.040<br />

0.035<br />

0.030<br />

0.025<br />

0.020<br />

0.015<br />

0.010<br />

0.005<br />

0.000<br />

0 1 2 3 4 5 6 7<br />

t (s)<br />

Figure 4: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />

f = 2,000 Hz, with a direct current <strong>of</strong> 10 A added in the outer coil.<br />

y (m)<br />

0.045<br />

0.040<br />

0.035<br />

0.030<br />

0.025<br />

0.020<br />

0.015<br />

0.010<br />

0.005<br />

0.000<br />

0 1 2 3 4 5 6 7 8 9 10 11 12 13<br />

t (s)<br />

Figure 5: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />

f = 200 Hz.<br />

Initially, the conducting plate is located at 10 mm<br />

above the coils. It is assumed that the plate only moves in<br />

the vertical direction, but the procedures described in the<br />

paper are also applicable when more degrees <strong>of</strong> freedom<br />

are considered. The plate cross section is discretized in<br />

180 rectangular elements, as indicated in Fig. 1.<br />

For the time-domain method presented in this paper,<br />

the period was divided in 48 intervals and the motion <strong>of</strong><br />

the plate was observed during 26,000 periods, i.e. for<br />

1,248,000 time steps. The result is presented in Fig. 2,<br />

with the detailed motion at the beginning shown in Fig. 3.<br />

The computation took about 6 hours employing a 2.128<br />

GHz Intel processor notebook. A great reduction in the<br />

amount <strong>of</strong> computation is obtained by approximating the<br />

conducting region with a thin strip <strong>of</strong> thickness equal to<br />

the field depth <strong>of</strong> penetration [3]. The oscillations <strong>of</strong> the<br />

plate can be attenuated by adding a dc component in the<br />

current coils or by using a permanent magnet. If a direct<br />

current <strong>of</strong> 10 A is added in the outer coil, the plate<br />

motion becomes as it is shown in Fig. 4.<br />

For a frequency <strong>of</strong> 200 Hz, the motion <strong>of</strong> the plate is<br />

shown in Fig. 5, the attenuation <strong>of</strong> the mechanical<br />

oscillations being much stronger than for a frequency <strong>of</strong><br />

2,000 Hz.<br />

It should be remarked that the same results in Figs. 2<br />

and 3 were obtained by using the proposed<br />

frequency-domain procedure (see Sections IV and V).<br />

Only 4,393 variable time steps, <strong>of</strong> magnitude between a<br />

period and 50 periods, were necessary for determining<br />

the motion <strong>of</strong> the plate between t = 0 and t = 13 s. The


equired computation time was only 124 s, i.e., about 170<br />

times less than for the time-domain solution.<br />

VII. CONCLUSION AND REMARKS<br />

Two efficient methods are presented for computing the<br />

motion <strong>of</strong> the solid conductors under the action <strong>of</strong><br />

electromagnetic forces. Practically, for the same accuracy<br />

<strong>of</strong> the results, a tremendous reduction in the amount <strong>of</strong><br />

computation is achieved when using a frequency-domain<br />

procedure.<br />

The proposed methods can be extended to nonlinear<br />

media. For the time-domain procedure one can utilize the<br />

polarization method [4], which allows the formulation <strong>of</strong><br />

the eddy-current integral equation [2]. For the<br />

frequency-domain procedure, one can adopt the method<br />

proposed in [5], [6]. Since in some problems, for instance<br />

in electromagnetic levitation problems, the air regions are<br />

relatively large with respect to the conducting or/and<br />

ferromagnetic regions, the weight <strong>of</strong> the fundamental<br />

harmonic in the harmonic spectrum is significant and,<br />

thus, for the convergence acceleration in the polarization<br />

method one can efficiently employ the technique<br />

proposed in [7].<br />

The methods presented can be applied to<br />

three-dimensional structures as well, by adopting the<br />

eddy-current integral equation proposed in [8] and<br />

extended in [2] to nonlinear media and to moving bodies.<br />

In this case, the spatial discretization <strong>of</strong> (16) is done by<br />

decomposing the induced current density using functions<br />

<strong>of</strong> the form W<br />

, where W are edge elements. When<br />

edge elements <strong>of</strong> the first order are used, then the current<br />

density is constant inside the tetrahedral volume elements<br />

and the relations presented in this paper remain valid.<br />

Now, the integrals similar to those in (9) and (10) can<br />

only partially be evaluated analytically.<br />

Finally, it should be remarked that the procedures in<br />

- 356 - 15th IGTE Symposium 2012<br />

[2], [5], [6] and [7] allow the extension to nonlinear<br />

media <strong>of</strong> the proposed frequency-domain technique<br />

which, as illustrated in this paper, could yield a<br />

spectacular reduction in the amount <strong>of</strong> computation<br />

needed.<br />

ACKNOWLEDGMENT<br />

This work was supported in part by the Romanian<br />

Ministry <strong>of</strong> Labour, Family and Social Protection through<br />

the Financial Agreement POSDRU/89/1.5/S/62557 and<br />

by a grant <strong>of</strong> the Romanian National Authority <strong>of</strong><br />

Scientific Research, CNDI-UEFISCDI, project number<br />

PN-II-PT-PCCA-2011-3.2-0373.<br />

REFERENCES<br />

[1] S. Kurz, J. Fetzer, G. Lehner, and W. M. Rucker, “A novel<br />

formulation for 3D eddy current problems with moving bodies<br />

using a Lagrangian description and BEM-FEM coupling,” IEEE<br />

Trans. Magn., vol. 34, no. 5, pp. 3068-3073, Sep. 1998.<br />

[2] R. Albanese, F. Hantila, G. Preda, and G. Rubinacci, “Integral<br />

formulation for 3-D eddy current computation in ferromagnetic<br />

moving bodies,” Rev. Roum. Sci. Techn., Electrotechn. et Energ.,<br />

vol. 41, no. 4, pp. 421-429, 1996.<br />

[3] I. R. Ciric, F. I. Hantila, and M. Maricaru, “Field analysis for thin<br />

shields in the presence <strong>of</strong> ferromagnetic bodies,” IEEE Trans.<br />

Magn., vol. 46, no. 8, pp. 3373-3376, Aug. 2010.<br />

[4] F. Hantila, “A method <strong>of</strong> solving magnetic field in nonlinear<br />

media,” Rev. Roum. Sci. Techn., Electrotechn. et Energ., vol. 20,<br />

no. 3, pp. 397-407, 1975.<br />

[5] I. R. Ciric and F. I. Hantila, “An efficient harmonic method for<br />

solving nonlinear time-periodic eddy-current problems,” IEEE<br />

Trans. Magn., vol. 43, no. 4, pp. 1185-1188, Apr. 2007.<br />

[6] I. R. Ciric, F. I. Hantila, M. Maricaru, and S. Marinescu, “Efficient<br />

analysis <strong>of</strong> the solidification <strong>of</strong> moving ferromagnetic bodies with<br />

eddy-current control,” IEEE Trans. Magn., vol. 45, no. 3, pp.<br />

1238-1241, Mar. 2009.<br />

[7] I. R. Ciric, F. I. Hantila, and M. Maricaru, “Convergence<br />

acceleration in the polarization method for nonlinear periodic<br />

fields,” COMPEL, vol. 30, no. 6, pp. 1688-1700, 2011.<br />

[8] R. Albanese and G. Rubinacci, “Integral formulation for 3D<br />

eddy-current computation using edge elements,” IEE <strong>Proceedings</strong><br />

A , vol.135, no.7, pp.457-462, Sep. 1988.


- 357 - 15th IGTE Symposium 2012<br />

Adaptive Inductance Computation on GPUs<br />

A.G. Chiariello, A. Formisano and R. Martone<br />

Dipartimento di Ingegneria Industriale e dell’Informazione<br />

Seconda Università di Napoli, Via Roma 29, Aversa (CE), Italy<br />

E-mail: Alessandro.Formisano@Unina2.it<br />

Abstract—Inductances computation involving highly complex geometries and linear materials can be tackled by discretizing<br />

coils into simpler elements, whose magnetic behavior is analytically expressible but, to achieve reliable results, very high<br />

numbers <strong>of</strong> elements may be required. In such cases, advantages can be taken from GPU capabilities <strong>of</strong> dealing efficiently<br />

with simple computational tasks. In the paper, a code able to compute self and mutual inductances <strong>of</strong> any 3D coils, taking<br />

advantage <strong>of</strong> GPU capabilities, is presented.<br />

Index Terms— GPU, High Performance Computing, Inductance Computation<br />

I. INTRODUCTION<br />

The computation <strong>of</strong> self and mutual inductances for<br />

complex 3D shaped coils is a demanding task, since no<br />

general formulas exist. Numerical computations, in order<br />

to achieve reliable results, require large discretization<br />

efforts and, in general, multiple runs. As a consequence,<br />

in a number <strong>of</strong> practical cases the computer burden could<br />

be very high.<br />

Accuracy for generally shaped coils and computational<br />

promptness represent conflicting objectives, especially in<br />

optimal design [1, 2], and various computational<br />

paradigms have been proposed to achieve reasonable<br />

trade-<strong>of</strong>fs. One <strong>of</strong> the most promising approaches is the<br />

adoption <strong>of</strong> High-Performance Computing (HPC)<br />

architectures; among HPC approaches, an effective<br />

solution is the use <strong>of</strong> Graphic Processing Units (GPU),<br />

easily available even on desktop class hardware.<br />

On the other hand, in order to exploit at their best these<br />

peculiar architectures, a revising <strong>of</strong> simulation codes is<br />

<strong>of</strong>ten necessary, and new solutions, well suited for CPUbased<br />

computational environments, must be adopted.<br />

If assuming that no magnetic materials are present (i.e.<br />

the relative permeability rel is equal to 1 everywhere),<br />

following well established approximation formulas [3],<br />

self and mutual inductances can be computed using<br />

“segmented” approximations. The basic idea is to<br />

decompose coils into simpler elements, for which self or<br />

mutual inductances can be easily computed, eventually<br />

using closed form expressions. Superposition is then used<br />

to get the final value <strong>of</strong> coils self or mutual inductance.<br />

In this paper a method able to compute self and mutual<br />

inductances in air for generally 3D shaped massive coils,<br />

based on coil decomposition into filamentary elements,<br />

called current sticks, is presented, and its implementation<br />

on HPC environments, based on GPUs, is briefly<br />

discussed. The method is able to adapt the discretization<br />

level to the required accuracy, and was implemented in<br />

the INDIANA code (INDuctance Iterative and Adaptive<br />

Numerical Assessment).<br />

The basic objective <strong>of</strong> INDIANA is the computation <strong>of</strong><br />

self and mutual inductances <strong>of</strong> coils wound with multiple<br />

series-connected turns <strong>of</strong> conductors. In mutual<br />

inductance computation, INDIANA decomposes both<br />

coils into a suited number <strong>of</strong> current sticks, and computes<br />

the line integral along each stick <strong>of</strong> the “target” coil <strong>of</strong> the<br />

vector potential A generated by each stick <strong>of</strong> the “source”<br />

coil, with unit current. Then, the procedure, taking<br />

advantage <strong>of</strong> the concept <strong>of</strong> “partial inductance” [4] and<br />

<strong>of</strong> the linearity assumption, sums all contributions to get<br />

the final, overall required value. Self inductances are<br />

computed by using the same coil for both source and<br />

target, but extracting the singular case <strong>of</strong> “self”<br />

computations for each element, which are treated using<br />

the expression for self inductance <strong>of</strong> a current stick [3].<br />

This scheme suits quite well for GPU-based<br />

computations, as will be further discussed in Sect. III.<br />

In the following, a short overview <strong>of</strong> the relevant<br />

points in the method will be presented (Sect. II), some<br />

comments on the GPU implementations are reported<br />

(Sect. III), and a few examples are discussed to help<br />

assessing the method capabilities (Sect. IV). Finally, in<br />

two final annexes, a brief description <strong>of</strong> the GPUs<br />

architecture and programming paradigms is given.<br />

II. MATHEMATICAL FORMULATION<br />

The decomposition <strong>of</strong> massive coils into elementary<br />

components is performed in two steps, taking into<br />

account the structure <strong>of</strong> the winding, the distance between<br />

the coils, and the local curvature <strong>of</strong> each coil.<br />

As a first step, each coil is decomposed in as many<br />

filamentary conductors as required by accuracy needs.<br />

The typical figure adopted is as many as the actual<br />

conductors in the Winding Pack (WP) <strong>of</strong> each coil.<br />

However the figure can be increased according to the<br />

adopted technology (e.g. in superconducting cable in<br />

conduit conductor technology, the decomposition can be<br />

extended down to petals level) to meet the accuracy<br />

needs.<br />

As a second step, each conductor is described using an<br />

interpolating line, typically a spline, defined by a limited<br />

number <strong>of</strong> parameters, such as the coordinates <strong>of</strong> a<br />

suitable number <strong>of</strong> control points, constraining the shape<br />

<strong>of</strong> the conductor to the required geometrical accuracy. In<br />

addition the continuous curve is reduced to a collection <strong>of</strong><br />

sticks, defined by a number <strong>of</strong> suitable break points (see<br />

Fig. 1 for a schematic view).<br />

The number <strong>of</strong> sticks is selected on the basis <strong>of</strong> the<br />

accuracy required by magnetic field computation, and can<br />

vary depending on the local curvature <strong>of</strong> the interpolating<br />

curve and on the distance from field points. INDIANA<br />

performs a first guess for the distribution <strong>of</strong> break points<br />

along conductors <strong>of</strong> the source coil on the basis <strong>of</strong><br />

average curvature radius and on minimum distance


Actual<br />

Coil<br />

Current<br />

Sticks<br />

Approximation<br />

First coil: source Second coil: target<br />

Figure 1: Segmentation <strong>of</strong> coils centerline into “sticks”: source coil<br />

(left) and target coils (right).<br />

between the midpoint <strong>of</strong> the section <strong>of</strong> the source coil<br />

being considered and the target coil.<br />

If the required accuracy is not fulfilled, the inductance<br />

computation is assessed by increasing the number <strong>of</strong><br />

break points, and performing a new inductance<br />

computation. The process iterates until the convergence,<br />

in Cauchy sense within a prescribed accuracy, is<br />

achieved.<br />

The mutual inductance Mjk between the k-th stick on<br />

the source coil and j-th stick on the target coil can be<br />

computed either using the classical formulas from [3], or<br />

by line integrating (numerically) the vector potential<br />

Ak(x) generated by stick k on stick j:<br />

(1)<br />

ˆ<br />

M jk Akx j tˆ dl j<br />

<br />

j<br />

where j is straight line along the j-th stick, xj is a generic<br />

point along j, and ˆt is the stick unit vector. The<br />

expression for Ak is given in [5]:<br />

1<br />

ˆ<br />

cba A 0 a ln <br />

(2)<br />

4<br />

cba where c=j+1-xi, b=j-xi, and a=j+1-j and the coordinates<br />

<strong>of</strong> the stick tips.<br />

c<br />

xj<br />

Figure 2: Basic elements form computation <strong>of</strong> vector potential using<br />

(2b).<br />

INDIANA implements a slightly modified version <strong>of</strong><br />

(2), to treat the singularity when computing A on points<br />

on the source stick axis [6]. The number <strong>of</strong> Gauss points<br />

for numerical integration <strong>of</strong> (2) is chosen, according to<br />

the requested accuracy, on the basis <strong>of</strong> the distance<br />

between centers <strong>of</strong> source and target points, using a linear<br />

relationship based on the length <strong>of</strong> the target stick and the<br />

distance between mid points <strong>of</strong> source and target sticks.<br />

If source and target sticks are coincident, the self<br />

inductance Mkk <strong>of</strong> the k-th stick can be computed using<br />

the expression for a thin beam [3], providing the value in<br />

Henry if the stick length Lk is given in meters:<br />

4 2Lk <br />

Mkk 210 Lkln<br />

1<br />

(3)<br />

r<br />

<br />

<br />

Lk<br />

where r is the geometric mean distance and is the<br />

a<br />

Ak b<br />

A<br />

- 358 - 15th IGTE Symposium 2012<br />

arithmetic mean distance on the corresponding k-th beam<br />

cross section. Values <strong>of</strong> r and for different cross<br />

sections are given in [3], while INDIANA adopts the<br />

expression for circular cross section and long beams,<br />

where r is the radius <strong>of</strong> the cross section and /Lk is<br />

negligible.<br />

III. GPU IMPLEMENTATION<br />

In this section attention will be focused on the porting<br />

<strong>of</strong> INDIANA code on the peculiar GPUs hardware. In<br />

Appendix A, to the benefit <strong>of</strong> non experts, a short<br />

introduction to GPU’s architecture is given [7-11], while<br />

in Appendix B some hints strictly related to the<br />

peculiarity <strong>of</strong> the GPU’s hardware are provided for the<br />

interested programmers [7, 8].<br />

The typical computational GPU architecture includes a<br />

classical CPU section where the GPUs are grafted. In<br />

order to pr<strong>of</strong>itably use the parallel nature <strong>of</strong> the GPU, any<br />

code implementing numerical computations needs to be<br />

split into sequential parts, which are performed on the<br />

main CPU (or CPUs), and into the numerically intensive<br />

parts, which can be more effectively performed on the<br />

GPUs. In this way the best exploitation <strong>of</strong> GPUs and<br />

CPUs execution capabilities can be achieved.<br />

In the INDIANA code the basic computational task,<br />

i.e. the evaluation <strong>of</strong> (2), the magnetic vector potential A<br />

generated by a single stick in a single field point, can<br />

benefit <strong>of</strong> the GPUs architecture. As a matter <strong>of</strong> fact this<br />

task, which must be repeated a very high number <strong>of</strong><br />

times, can be simply assigned to a thread; then, suitable<br />

grouping <strong>of</strong> treads onto computational blocks, can be<br />

organized to exploit at best the data structure and the<br />

available resources.<br />

In order to achieve the peak performance [9], the<br />

computational kernel needs to use GPU registers; in<br />

addition, it needs to treat a large number <strong>of</strong> independent<br />

instructions to exploit the scheduler capability <strong>of</strong> the<br />

graphic card (further details can be found in Appendix<br />

B). For these reasons the code was structured following<br />

the flowchart:<br />

1) Load the field point associated<br />

to the considered thread. (Global<br />

memory access)<br />

2) Load the start and end points <strong>of</strong><br />

the sticks (Global memory access)<br />

3) Compute the contribution to the<br />

field <strong>of</strong> each stick in the bundle<br />

using (2)<br />

4) Accumulate all the contributions;<br />

in a register<br />

Last<br />

stick?<br />

Yes<br />

5) Store the computed vector field<br />

in the global memory<br />

Figure 3: Flowchart showing the INDIANA kernel computation on<br />

GPU architecture.<br />

No


The final integration step (1) is executed in the CPU<br />

side, since it is well suited to CPU characteristics, and its<br />

impact on the computational burden is quite marginal.<br />

INDIANA was implemented using the MATLAB©<br />

parallel computational toolbox, and the core <strong>of</strong> the code<br />

has been parallelized on the GPU using CUDA©, an<br />

extension <strong>of</strong> the C language for Nvidia© GPU<br />

programming. A few details are given in Appendix B.<br />

IV. EXAMPLES OF APPLICATION<br />

In this section, two groups <strong>of</strong> examples are presented.<br />

In the first group, in order to assess accuracy, results from<br />

INDIANA are compared to standard results for simple<br />

geometries where analytical formulas are available. In the<br />

second group, results from INDIANA are compared to<br />

computations from 3D finite elements to assess speed-up<br />

<strong>of</strong> computations for generally shaped coils.<br />

Computational times are all referred to an intel i7–based<br />

PC, with 8Gb ram, running Matlab© Ver.7 for the CPU<br />

computations, and CUDA© Ver. 4.2 for GPU<br />

computations.<br />

a. Accuracy Assessment<br />

Three filamentary single-turn coils have been<br />

considered in this group. The first one is a circular<br />

coil, while the other two are elliptical coils. Analytical<br />

expressions and reference figures were taken from<br />

[12]. Geometrical details are reported in Table I,<br />

while a comparison <strong>of</strong> results is reported in Table II,<br />

for increasing accuracy (expressed in Parts Per<br />

Million -p.p.m.- <strong>of</strong> the reference value), and<br />

consequently for increasing number <strong>of</strong> sticks in<br />

INDIANA calculations.<br />

z<br />

C3<br />

Figure 4: Coils used for Accuracy Assessment<br />

TABLE I<br />

COILS USED FOR ACCURACY ASSESSMENT<br />

Coil # Centre position [m] Radii (a,b) [m]<br />

C1 (0.0, 0.0, 0.0) 1 (circular)<br />

C2 (0.0, 0.0, 0.5) (1/3, 2/3)<br />

C3 (0.6, 2.0, 0.1) (1/3, 2/3)<br />

Required<br />

Figure<br />

x<br />

b<br />

TABLE II<br />

INDUCTANCES FOR INCREASING ACCURACY<br />

Required<br />

Accuracy<br />

[p.p.m.]<br />

C2<br />

a<br />

C1<br />

Number <strong>of</strong><br />

sticks<br />

Computational<br />

times [s]<br />

Reference<br />

Value [H]<br />

MC1-C2 1.0 10 3 1.13 0.30871178<br />

MC1-C2 0.1 10 4 36.9 0.30871178<br />

MC1-C3 1.0 10 3 1.17 0.03963496<br />

MC1-C3 0.1 10 4 35.8 0.03963496<br />

y<br />

- 359 - 15th IGTE Symposium 2012<br />

b. Speed Assessment<br />

For this second analysis, the mutual inductance <strong>of</strong> two<br />

coaxial circular coils has been considered. This<br />

benchmark case is treated either using INDIANA with<br />

300 sticks and the 3D FEM package COMSOL<br />

multiphysics 4.2a (27524 2 nd order tetrahedral elems.,<br />

neglecting any symmetry for the sake <strong>of</strong> generality). The<br />

two coils are described in Table III, while results are<br />

reported in Table IV.<br />

TABLE III<br />

COILS USED FOR SPEED ASSESSMENT<br />

Coil # Centre position [m] Radius [m]<br />

C1 (0.0, 0.0, 0.0) 0.20<br />

C3 (0.0, 0.0, 0.1) 0.25<br />

Method<br />

TABLE IV<br />

INDUCTANCES FOR SPEED ASSESSMENT<br />

Mutual<br />

Inductance [H]<br />

Computational time<br />

[s]<br />

INDIANA 0.2487 4<br />

3D FEM package 0.2486 490<br />

Reference Value [2] 0.2488 ---<br />

As a second speed test, the computation <strong>of</strong> mutual<br />

inductance between a massive solenoidal source coil<br />

(radius 1.7 m, length 2.0 m, thickness 0.7 m, 14 layers, 38<br />

turns per layer) and a filamentary coil (Rin=4.77 m,<br />

Rout=5.83 m, Zlow=5.02 m, Zup=5.11 m, =17.3°) used to<br />

measure flux across a test surface was considered (See<br />

Fig. 5). This test case is relevant for flux measurements in<br />

magnetic confinement fusion devices [13, 14]. The<br />

accuracy requirements on the flux measurement are rather<br />

severe, in order to achieve that accuracy a high number <strong>of</strong><br />

sticks can be needed, in these cases the GPU speed<br />

enhancement can be very useful to complete the<br />

simulation in a reasonable amount <strong>of</strong> time. All methods<br />

proved able to give the correct result <strong>of</strong> 2.5665048e-3 H.<br />

The massive source coil has been represented by as<br />

many conductors as actually present in its WP (that is,<br />

14×38), while the discretization level along each<br />

conductor has been varied to improve accuracy.<br />

Comparison <strong>of</strong> computational times for various<br />

discretization levels either for GPU computations, and for<br />

purely CPU computations for the sake <strong>of</strong> comparison, are<br />

reported in Table V.<br />

Source Coil<br />

Partial Flux<br />

Measurement<br />

Loop<br />

Figure 5: Sketch <strong>of</strong> source coil for flux generation in Tokamak devices<br />

and Partial Flux measurement loop


TABLE V<br />

SPEED UP FOR MUTUAL INDUCTANCE<br />

Computational Times [s] 1000 sticks 5000 sticks<br />

GPU tGPU 5.31 19.4<br />

CPU tCPU 35.8 125<br />

Speed up (tCPU/ tGPU) 6.74 6.44<br />

V. CONCLUSIONS<br />

A numerical code able to compute mutual inductance<br />

between couples <strong>of</strong> any massive coils has been presented.<br />

The code is called INDIANA, and is able to adaptively<br />

modify its computational parameters to achieve a trade<strong>of</strong>f<br />

between accuracy and computational speed.<br />

INDIANA code benefit <strong>of</strong> a significant acceleration<br />

(up to 7x) thanks to the GPU parallelization. This<br />

performance allows to easily meet high accuracy request<br />

in mutual inductance calculations for complex 3D shaped<br />

coils.<br />

INDIANA performance has been assessed either in<br />

terms <strong>of</strong> accuracy and speed with respect to simple<br />

geometries presented in literature or complex shapes,<br />

compared to FEM computations.<br />

Future activity will be addressed the MPI<br />

parallelization over a computer cluster where each node<br />

is equipped with GPU, in order to analyze more complex<br />

structures.<br />

ACKNOWLEDGEMENTS<br />

Authors wish to thank Mr. M. Nicolazzo from<br />

CREATE and Mr. M. Fatica from Nvidia for fruitful<br />

discussions, and valuable hints and suggestions.<br />

This work was partly supported by Seconda Università<br />

di Napoli under PRIST grant “Generazione distribuita di<br />

energia da fonti tradizionali e rinnovabili: aspetti<br />

ingegneristici e giuridici-economici-ambientali”, partly<br />

by NVIDIA Corporation and partly by<br />

ENEA/EURATOM CREATE association.<br />

REFERENCES<br />

[1] M. Ci<strong>of</strong>fi, A. Formisano, R. Martone, “Increasing design<br />

robustness in evolutionary optimisation“, COMPEL, vol. 23,<br />

pp.187-196, 2004.<br />

[2] M. Ci<strong>of</strong>fi, A. Formisano, R. Martone, G. Steiner, D. Watzenig, ”A<br />

fast method for statistical robust optimization”, IEEE Transactions<br />

on Magnetics , Vol. 42, pp. 1099-1102, 2006.<br />

[3] F. Grover, Inductance Calculation, New York: D. Van Nostrand,<br />

1946.<br />

[4] C. R. Paul, Introduction to Electromagnetic Compatibility,<br />

Hoboken (NJ): J. Wiley & Sons, 2006.<br />

[5] H. A. Haus, J. R. Melcher, Electromagnetic Fields and Energy,<br />

Englewood Cliffs, NJ: Prentice Hall, 1989.<br />

[6] J. Hanson, S. Hirshman, “Compact expressions for the Biot–<br />

Savart fields <strong>of</strong> a filamentary segment”, Phys. <strong>of</strong> Plasmas, vol. 9,<br />

pp.4410-4412, Oct. 2002.<br />

[7] D. Kirk, W. Hwu, Programming Massively Parallel Processors: A<br />

Hands-on Approach, Elsevier, 2010.<br />

[8] M. Garland et al., “Parallel computing experiences with CUDA”,<br />

IEEE Micro, vol. 28, 2008 pp. 13–27.<br />

[9] V. Volkov. “Better performance at lower occupancy”, Proceedins<br />

<strong>of</strong> NVIDIA GPU <strong>Technology</strong> Conference 2010, San Jose, USA,<br />

pp. 20-23, Sept. 2010.<br />

[10] R. Farber, CUDA Application Design and Development, Morgan<br />

Kaufmann, 2011.<br />

[11] F. Calvano, G. Rubinacci, A. Tamburrino, G. Vasilescu, S.<br />

Ventre, “Parallel MGS-QR sparsification for fast eddy current<br />

- 360 - 15th IGTE Symposium 2012<br />

NDT simulation” Studies in Applied Electromagnetics and<br />

Mechanics, vol. 36, pp. 29-36, 2012.<br />

[12] J. T. Conway, “Exact Solutions for the Mutual Inductance <strong>of</strong><br />

Circular Coils and Elliptic Coils”, IEEE Trans. on Magn., vol 48,<br />

pp. 81-94, 2012.<br />

[13] A. J. Donné et al., “Progress in ITER Physics basis, Chapetr 7:<br />

Diagnostics”, Nucl. Fusion, vol. 47, pp. S337-S384, (2007).<br />

[14] A. Formisano, J. Knaster J., R. Martone et al., “ITER nonaxisymmetric<br />

error fields induced by its magnet system”, Fusion<br />

Engineering and Design, vol. 86, pp. 1053-1056, 2011.<br />

APPENDIX A: GPU ARCHITECTURES<br />

Hardware used in computer Central Processing Unit<br />

(CPU) seems to be reaching the physical limits beyond<br />

which increase <strong>of</strong> clocking frequency or <strong>of</strong> integration<br />

scale is very hard with present technology. As a possible<br />

alternative, CPU manufactures are moving to multiplecores<br />

CPU’s, but the number <strong>of</strong> the cores is usually<br />

limited to a few tens. On the other hand, realistic<br />

treatment <strong>of</strong> real world applications gives rise to<br />

computationally demanding numerical models. In order<br />

to speed up the computations, different parallelization<br />

paradigms can be considered, a few examples being<br />

reported in [11].<br />

Recently, the Graphic Processing Units (GPUs),<br />

present on virtually all graphic cards <strong>of</strong> computers, have<br />

been proposed as data-parallel coprocessors, used to<br />

solve compute-intensive science and engineering<br />

problems, since it was observed that the mathematical<br />

processing in high resolution images are very similar to<br />

the computations usually required in numerical models <strong>of</strong><br />

physical phenomena.<br />

A modern GPU can have up to 1024 processor cores or<br />

Streaming Processors (SPs) grouped in Streaming<br />

Multiprocessors (SMs), each containing eight processor<br />

cores.<br />

In order to reduce the dimension <strong>of</strong> chip area dedicated<br />

to the control unit, each core in an SM use a parallel<br />

computation paradigm called Single Instruction, Multiple<br />

Data (SIMD), where concurrent processor execute the<br />

same code (called Kernel) on different data.<br />

The tasks submitted at each core are called threads; the<br />

threads are grouped in thread blocks (fig. 6b); the<br />

maximum dimension <strong>of</strong> a block is presently 512 threads;<br />

hence, a code need to launch a lot <strong>of</strong> thread blocks. For<br />

these reason the thread blocks are grouped into a grid <strong>of</strong><br />

thread blocks (fig. 6c). The threads in a block can be<br />

indexed using a 3D identifier, a block in a grid can be<br />

indexed using a 2D identifier.<br />

A block <strong>of</strong> threads is assigned at a SM; each SM can<br />

use 8,192 registers; the registers are the faster memory<br />

inside a GPU but they are dynamically partitioned among<br />

the threads inside the blocks; each thread can only access<br />

its own registers (fig. 6a). All threads inside a block can<br />

cooperate with the others sharing memory using an onchip<br />

low latency memory called shared memory (48 kB);<br />

the shared memory bandwidth is about 6x lower respect<br />

the register bandwidth.<br />

The GPU has a memory, called Global memory, where<br />

the CPU can upload the input data and download the<br />

result <strong>of</strong> the computation. The global memory is available<br />

at all the SMs; it is the largest memory inside a GPU, up<br />

to 6 GB in modern solutions.


APPENDIX B: GPU PROGRAMMING STRATEGY<br />

In order to obtain the best results, a programmer needs<br />

to take into account the peculiar hardware characteristic<br />

<strong>of</strong> the GPUs, in each step <strong>of</strong> the program [7, 10].<br />

The access to Global Memory are time consuming, in<br />

order to increase the access rate each time a location is<br />

accessed, many consecutive locations are accessed by the<br />

hardware.<br />

a) thread<br />

b) thread blocks<br />

…<br />

c) grid <strong>of</strong> thread blocks<br />

…<br />

Figure 6: GPU memory model<br />

A typical GPU program will follow the steps showed<br />

in Fig.7. In order to obtain the peak performance the<br />

programmer need to reorganize, in the host side (Fig.7<br />

step a), the input data in such a way that adjacent threads<br />

operate on adjacent data in global memory (Coalescing<br />

Access). In this case, the hardware combines, or<br />

coalesces, all <strong>of</strong> these accesses into a unique access to<br />

consecutive locations. An example related to geometric<br />

data is showed in figure 8. In order to allow coalescing<br />

accesses, duplication <strong>of</strong> data can be needed.<br />

Figure 7: Flowchart <strong>of</strong> a typical GPU code<br />

Register<br />

… …<br />

Shared<br />

memory<br />

a) Upload part <strong>of</strong> the input data from the<br />

CPU memory to the GPU global memory<br />

b) Use the thread and block index to select<br />

the data from the GPU global memory<br />

c) Compute the task and store the results<br />

in GPU global memory<br />

d) Download the result from the GPU global<br />

memory to the CPU memory<br />

Global memory<br />

- 361 - 15th IGTE Symposium 2012<br />

a)<br />

b)<br />

X1 Y1 Z1 X2 Y2 Z2 …. XN YN ZN<br />

Thread 1 (th1) th2 thN<br />

X1 X2 …XN Y1 Y2 … YN Z1 Z2 ZN<br />

th1 th2 thN<br />

Figure 8: a) Uncoalescing access pattern, b) Coalescing access pattern<br />

As already discussed, a thread can access three kinds<br />

<strong>of</strong> memory: global memory, shared memory and the<br />

registers. The registers are the fastest memory; hence, in<br />

order to achieve the best performance, the programmer<br />

has to use as many registers as possible.<br />

Of course, the actual speed up can be limited by the<br />

amount <strong>of</strong> on chip memory resources (as registers and<br />

shared memory) and by the shared memory bandwidth.<br />

The GPU has a sophisticated scheduler very effective<br />

in minimizing the performance loss due to access to<br />

global memory. If a sufficient number <strong>of</strong> threads is<br />

available, the scheduler can concurrently run the threads<br />

on the multiprocessor, masking the memory accesses.<br />

This approach is called in literature Thread Level<br />

Parallelism (TLP) approach [7-9].<br />

The larger is the number <strong>of</strong> threads, the fewer are the<br />

registers available per each thread (the registers are 8192<br />

and are partitioned among the threads inside a block).<br />

Unfortunately the actual availability <strong>of</strong> a limited number<br />

<strong>of</strong> registers per thread can be a bottleneck for the entire<br />

code.<br />

In order to increase the speed up, the scheduler is<br />

provided by the capability to analyze the instruction flow<br />

and evaluate how reliable could be to execute two<br />

instructions at the same time. If the answer is positive, the<br />

hardware localizes possible free units and increases the<br />

parallelism and executes more than one instruction during<br />

the same clock cycle. Then, a small number <strong>of</strong> threads<br />

per block usually are recommended (64 threads per block<br />

can be a good compromise) and, in addition, the thread<br />

have to be designed to present as many independent<br />

instructions as possible: in such a way the Instruction<br />

Level Parallelism (ILP) [7-9] increases the performance.


- 362 - 15th IGTE Symposium 2012<br />

The Reduced Basis Method Applied to<br />

Transport Equations <strong>of</strong> a Lithium-Ion Battery<br />

Stefan Volkwein∗ ∗ †<br />

, Andrea Wesche<br />

∗Universität Konstanz, Fachbereich Mathematik und Statistik , Universitätsstraße 10, D-78457 Konstanz,<br />

E-mail: stefan.volkwein@uni.konstanz.de<br />

† Adam Opel AG, Bahnh<strong>of</strong>splatz, D-65423 Rüsselsheim, E-mail: Andrea.Wesche@de.opel.com<br />

Abstract—In this paper we consider a coupled system <strong>of</strong> nonlinear parametrized partial differential equations (P 2 DEs),<br />

which models the concentrations and the potentials in lithium-ion batteries. The goal is to develop an efficient reduced<br />

basis approach for the fast and robust numerical solution <strong>of</strong> the P 2 DE system. Numerical examples illustrate the efficiency<br />

<strong>of</strong> the proposed approach.<br />

Index Terms—finite volume method, greedy algorithm, lithium-ion battery, reduced basis method<br />

I. INTRODUCTION<br />

The modelling <strong>of</strong> lithium-ion batteries has received an<br />

increasing amount <strong>of</strong> attention in the recent past. Several<br />

companies worldwide are developing such batteries<br />

for consumer electronic applications, in particular, for<br />

electric-vehicle applications. To achieve the performance<br />

and lifetime demands in this area, exact mathematical<br />

models <strong>of</strong> the battery are required. Moreover, the multiple<br />

evaluation <strong>of</strong> the battery model for different parameter<br />

settings involves a large amount <strong>of</strong> time and experimental<br />

effort. Here, the derivation <strong>of</strong> reliable mathematical<br />

models and their efficient numerical realization are very<br />

important issues in order to reduce both time and cost in<br />

the improvement <strong>of</strong> the performance <strong>of</strong> batteries.<br />

In the present work we consider a mathematical model<br />

for lithium-ion batteries which describes the transport<br />

processes by a partial differential equation system. This<br />

model is developed in the paper by Popov et al. [17]. The<br />

physical and chemical details can be found in [13] and<br />

[14]. The equation system models a physico-chemical<br />

micro-heterogeneous battery model.<br />

We discretize this by the finite volume method and the<br />

backward Euler method. The reduced basis methodology<br />

for by finite volumes discretized systems can be found<br />

in [10]. The discretized model is reduced by the reduced<br />

basis method [16]. Our numerical tests will illustrate the<br />

efficiency <strong>of</strong> this approach.<br />

A popular battery model is the one developed by Newman<br />

[4], [15], which was implemented and tested [5]. Let<br />

us also refer to the work [6], where a different battery<br />

model is derived. For an equation system which describes<br />

a physico-chemical macro-homogeneous battery model<br />

the well posedness is shown by Wu et al. [19].<br />

II. BATTERY MODEL<br />

PDE Model<br />

Let Ω ⊂ R be an open interval, which is divided in three<br />

disjunct open sub-intervals Ωc, Ωe, Ωa ⊂ R, see Figure<br />

2.1. For tend > 0 we define Q ∶= Ω ×(0,tend) and let<br />

c, φ ∶ Q → R and α, β, λ, κ ∶ R2 → R, the notations<br />

positive electrode<br />

electrolyte<br />

negative electrode<br />

Ωc Ωe Ωa<br />

BUTLER-VOLMER-equation<br />

Fig. 2.1. Structure <strong>of</strong> the considered battery domain<br />

can be found in Table 1.2 in the appendix. The transport<br />

processes in a battery, i.e. transport <strong>of</strong> mass and charge,<br />

are described by the equations [17]<br />

∂c<br />

−∇⋅(α (c, φ)∇c + β (c, φ)∇φ) =0<br />

∂t<br />

(2.1a)<br />

−∇ ⋅ (λ (c, φ)∇c + κ (c, φ)∇φ) =0 (2.1b)<br />

in Ωc ×(0,tend), Ωe ×(0,tend) and Ωa ×(0,tend), where<br />

“∇” denotes the gradient and “∇⋅” the divergence.<br />

The positive electrode is Ωc (cathode for discharge),<br />

the electrolyte Ωe and the anode Ωa (anode for discharge).<br />

Boundary/Interface Conditions<br />

The boundary conditions are<br />

∂c<br />

∂ν = 0, φ = 0 on (∂Ω ∩ ∂Ωc)×(0,tend) (2.2a)<br />

∂c<br />

= 0,<br />

∂ν<br />

∂φ I<br />

=−<br />

∂ν σa<br />

on (∂Ω ∩ ∂Ωa)×(0,tend) (2.2b)<br />

where ν is the outer unit normal vector, I ∶ R + → R<br />

(time-dependent current) and σa ∈ R + /{0} (electric conductivity<br />

multiplied with the cross section). The initial<br />

condition is given by<br />

c(x, t0 = 0) =c0 (x) ,x∈ Ω (2.3)


The interface conditions are given by<br />

−(α (c, φ)∇c + β (c, φ)∇φ)<br />

⎧⎪ I(cec,cc,φec,φc) ∣ in (∂Ωc ∩ ∂Ωe)<br />

(0,tend)<br />

= ⎨<br />

⎪⎩<br />

−I (cea,ca,φea,φa) ∣ in (∂Ωe ∩ ∂Ωa)<br />

(0,tend)<br />

−(λ (c, φ)∇c + κ (c, φ)∇φ)<br />

⎧⎪ J(cec,cc,φec,φc) ∣ in (∂Ωc ∩ ∂Ωe)<br />

(0,tend)<br />

= ⎨<br />

⎪⎩<br />

−J (cea,ca,φea,φa) ∣ in (∂Ωe ∩ ∂Ωa)<br />

(0,tend)<br />

(2.4a)<br />

(2.4b)<br />

where cea is the concentration in the electrolyte at the<br />

negative electrode interface:<br />

cea (t) =lim<br />

h→0 c (x∣ Ωe∩Ωa − h, t)<br />

and h > 0 is small enough, i.e. x∣ − h ∈ Ωe.<br />

Ωe∩Ωa<br />

Analogously<br />

cec (t) =lim<br />

h→0 c (x∣ Ωe∩Ωc + h, t)<br />

cc (t) =lim<br />

h→0 c (x∣ Ωe∩Ωc − h, t)<br />

ca (t) =lim<br />

h→0 c (x∣ Ωe∩Ωa + h, t)<br />

for sufficient small h > 0. Thevariablesφc, φa, φec, φea<br />

are defined in the same way. We write cs for the concentration<br />

in the solid part, i.e. in the negative and positive<br />

electrode, and ce for the concentration in the electrolyte,<br />

φs and φe are analogously denoted. The scalar functions<br />

I∶R 4 → R and J∶R 4 → R are defined by<br />

I(ce,cs,φe,φs) = J(ce,cs,φe,φs)<br />

√ √<br />

F<br />

√<br />

ce cs<br />

J(ce,cs,φe,φs) =k<br />

1 − cs<br />

c 0 e<br />

c 0 s<br />

cs,max<br />

⋅ 2sinh( F<br />

(φs − φe − U0 (cs)))<br />

2RT<br />

where F = 96486 A⋅s<br />

is the Faraday constant, R =<br />

mol<br />

A⋅V ⋅s<br />

8.314 is the gas constant and T > 0 [K] is the<br />

K⋅mol<br />

temperature. The function U0 ∶ R → R is the over<br />

potential and depends on the concentration c in the<br />

electrodes. The coefficient functions are defined as<br />

α (c, φ) ∶=De (c, φ)+ RT<br />

F 2<br />

(t+ (c, φ)) 2 κ (c, φ)<br />

c<br />

t+ (c, φ)<br />

β (c, φ) ∶=κ (c, φ)<br />

F<br />

λ (c, φ) ∶= RT<br />

F<br />

t+ (c, φ) κ (c, φ)<br />

c<br />

[ cm2<br />

s ]<br />

[ mol<br />

V ⋅ cm ⋅ s ]<br />

[ A ⋅ cm2<br />

mol ]<br />

where the transference number t+ is zero in the electrodes<br />

and larger than zero in the electrolyte and κ<br />

is the ionic/electric conductivity; κ, t+, De ∶ R 2 → R.<br />

To measure the battery parameters experimentally it is<br />

assumed, that they are polynomials in c and φ.<br />

The homogeneous Neumann boundary conditions for<br />

the concentration (2.2) mean that no flux <strong>of</strong> lithium(-ions)<br />

- 363 - 15th IGTE Symposium 2012<br />

can pass through. The inhomogeneous Neumann boundary<br />

condition for the potential is Ohm’s law, the homogeneous<br />

Dirichlet boundary condition have no physical<br />

meaning. It ensures the uniqueness <strong>of</strong> the solution if one<br />

exists.<br />

The interface conditions describe the exchange <strong>of</strong> the<br />

lithium-ions at the interfaces which are modeled by the<br />

Butler-Volmer-equation [1].<br />

For physical reasons we assume that<br />

c (x, t) ≥0 ∀x ∈ Ω, t∈(0,tend)<br />

We remark that the coefficient functions α and κ are<br />

larger than zero for physical reasons: the diffusivity De<br />

and the conductivity κ are larger than zero. Because <strong>of</strong><br />

the definition <strong>of</strong> the transference number t+ the coefficient<br />

functions β and λ are equal or larger than zero.<br />

Discretization <strong>of</strong> the Problem<br />

We discretize the partial differential equation system<br />

(2.1a)-(2.1b) with the appropriate boundary (2.2) and<br />

interface conditions (2.4) by the cell centered finite<br />

volume method. We divide therefore Ωc in Nc ∈ N, Ωe in<br />

Ne ∈ N and Ωa in Na ∈ N, ND = Nc+Ne+Na, equidistant<br />

control volumes <strong>of</strong> the width Δx. We use the method <strong>of</strong><br />

lines and solve the equation system for every time step.<br />

The time step size is Δt. The integrals over the spatial<br />

we approximate by the middle point rule, the integrals<br />

over the time by the backward Euler method, for details<br />

cf. [17]. These discretized equations are implemented in<br />

MATLAB 7.10.0 (R2010a).<br />

III. REDUCED BASIS METHOD<br />

Initial Point<br />

We consider a parametrized PDE which we want to<br />

solve for many parameter sets, e.g. for parametric studies.<br />

The better the numerical model approximates the physical<br />

phenomenon, the more expensive the computation<br />

gets. So in some cases e. g. a parameter analysis needs<br />

too much effort, because a single computation is too<br />

expensive. Therefore one has to develop a reduced model<br />

to get cheap solutions.<br />

The reduced basis method is based on the discretized<br />

model: the idea is to compute a “few” times an expensive<br />

solution to different parameter sets which are in the range<br />

<strong>of</strong> interest. With the knowledge <strong>of</strong> these so called “true”<br />

solutions basis vectors are computed. The approach is<br />

that the reduced solutions in the parameter set <strong>of</strong> interest<br />

are linear combinations <strong>of</strong> these basis vectors.<br />

An assumption is that the error between the “exact”<br />

analytical solution and the “true” numerical solution is<br />

small in contrast to the error between the “true” and the<br />

“reduced” solution.<br />

A big advantage <strong>of</strong> the present method is that you<br />

determine the error between the true and reduced solution<br />

during “developing” your reduced model (→ Greedy<br />

algorithm). A further property is that the method has<br />

two phases: the <strong>of</strong>fline computation in which the reduced


model is set which fulfills the given error tolerance and<br />

in which the needed true solutions are computed and<br />

the online phase in which the reduced solution(s) are<br />

computed. The <strong>of</strong>fline part is expensive and the online<br />

phase is cheap.<br />

Approach<br />

In the following we have to resolve how to choose the<br />

true solution and how to estimate the error between the<br />

reduced and true solution. The (POD-)Greedy algorithm<br />

ensures both issues.<br />

We now describe how to apply the reduced basis<br />

method on our battery model. The transport equations <strong>of</strong><br />

the battery (2.1a)-(2.1b) depend on many parameters: on<br />

geometrical parameters (e.g. the width <strong>of</strong> the electrode)<br />

on state parameters (e.g. temperature) and on battery<br />

parameters (e.g. the diffusion coefficient). We note these<br />

parameters with μ ∈Dand assume that all these different<br />

parameters are polynomials in c and φ, some are <strong>of</strong><br />

course constant and so polynomials <strong>of</strong> the degree zero.<br />

We write u N ∈X N for a piecewise linear functions,<br />

its coefficient are denoted with u N ∈ R N . The discretized<br />

problem is now the following: Find a u N ∈X N so that<br />

F N (u N ; μ) =0<br />

⇔⟨F N (u N ; μ) ,v N ⟩W =0 ∀v N ∈X N<br />

(3.1)<br />

where the mass matrix W is in the present case given by<br />

W = Δx ⋅ 1 ∈ R N×N .<br />

We approximate the finite volume space by a Ndimensional<br />

space X N which is spanned by “snapshots”.<br />

A snapshot is a true solution to a specific parameter set<br />

μ ∈D and time node t ∈(0,tend). We assume now, that<br />

we have a (orthonormalized) basis Ξ =(ξ1,...,ξN )∈<br />

R N×N <strong>of</strong> this space and that the reduced solution can be<br />

written as<br />

N<br />

u N (μ) = ∑ θ uN<br />

j (μ) ξj (3.2)<br />

j=1<br />

If we replace u N in equation (3.1) by u N <strong>of</strong> equation<br />

(3.2) and choose v N = ξi ∈ R N we get:<br />

Ξ T ⋅ W ⋅ F N (θ uN<br />

(μ)⋅Ξ; μ) =∶F N (θ uN<br />

(μ))<br />

=0 (3.3)<br />

The start vector for the coefficient vector for Newton’s<br />

method one can get by<br />

u start<br />

coeff (μ) =ΞT ⋅ W ⋅ u N (⋅,t0 = 0; μ)<br />

To find a basis we use for our time dependent problem<br />

the POD-Greedy algorithm, cf. algorithm 1 and [10]. It<br />

consists <strong>of</strong> two loops: the outer loop is the “standard”<br />

Greedy, cf. for instance [16], which finds the new parameter<br />

set to which the error between the reduced and<br />

- 364 - 15th IGTE Symposium 2012<br />

true solution is the largest. For this we need a problem<br />

specific error estimator; an error estimator for a linear<br />

by finite volumes discretized problem can be found in<br />

[10]. The inner loop reduces the trajectory u N (⋅,tn; μ ∗ )<br />

in time with the POD (proper orthogonal decomposition)<br />

algorithm. This algorithm returns for each snapshot matrix<br />

u N (μ) ℓ ∈ N eigen-/basis vectors. One can state the<br />

number <strong>of</strong> basis vectors or the projection error <strong>of</strong> the<br />

POD method. For details to the POD algorithm cf. for<br />

instance [11], [18].<br />

Usually we take an error estimator to estimate the error<br />

between the true and reduced solution to each parameter<br />

<strong>of</strong> the discretized parameter set. Until now we have no<br />

error estimator for the present problem, so we compare<br />

the true solution to a parameter set <strong>of</strong> the training set,<br />

to the reduced solution computed by the so far reduced<br />

basis vectors. The function u (i)<br />

RB (μj) is the reduced<br />

solution constructed by i basis functions evaluated at the<br />

parameter set μj ∈Dtrain ⊂D.<br />

Algorithm 1 POD-Greedy algorithm, c.f [10]<br />

Require: ● Limit the parameter range, discretize the<br />

parameter set Dtrain ={μ1,...,μNP }<br />

● Ξtrain = {u N (μ1) ,...,u N (μNP )}, uN (μi) ∈<br />

R Nx×Nt , ∀i ∈{1,...,NP }<br />

● Choose a tolerance for the Greedy: TOLGreedy<br />

● Choose the exactness for the POD basis per ∈<br />

[0, 1], it is just a measurement for the projection<br />

error (or directly choose the number <strong>of</strong> POD<br />

basis elements in each “Greedy step” ℓ).<br />

Ensure:<br />

1: Initializing:<br />

● Choose μ (1) ∈Dtrain → ˜ ξ (1) ∈ Ξtrain<br />

ξ (1) = POD( ˜ ξ (1) ) ∈ R Nx×ℓ (1), with the POD<br />

tolerance per<br />

● Set: i = 2, ɛ = 1<br />

2: while i ≤ NP and ɛGreedy > TOL do<br />

3: [μ (i) ,ɛ]=max j∈{1,...,NP } ∣u (i−1)<br />

RB (μj)−u N (μj)∣<br />

4: ˜ ξ (i) = POD(u N (μ (i) )) NT<br />

n=0 ∈ RNx×ℓ (i), where<br />

u N (μ (i) ) ∈ Ξtrain<br />

5: (ξ (1) ,...,ξ (i) )<br />

= Gram-Schmidt (ξ (1) ,...,ξ (i−1) , ˜ ξ (i) )<br />

6: end while<br />

An essential property <strong>of</strong> the reduced basis method is<br />

that you can decompose the computation into an <strong>of</strong>fline<br />

and online phase.<br />

Offline: After determination <strong>of</strong> the set <strong>of</strong> parameter<br />

sets and the accuracy <strong>of</strong> the reduced solutions,<br />

we start the Greedy algorithm to compute the<br />

basis vectors, the true solutions to the chosen<br />

parameters respectively. The <strong>of</strong>fline phase is<br />

computationally expensive and so the reduced<br />

basis method is only worth if you want to solve<br />

the equation system many times.<br />

Online: In the present case we have to compute the


coefficient vector for the basis to the different<br />

parameter sets. We get it by the damped Newton’s<br />

method. This phase is cheap.<br />

The big advantage <strong>of</strong> the reduced solution is that<br />

you know how good your reduced solution approximates<br />

the true solution. A big disadvantage is, that if you<br />

change your parameter ranges you usually have to do<br />

the expensive <strong>of</strong>fline computation again.<br />

RBM applied on the battery model<br />

In the following we explain how to apply the reduced<br />

basis method on our discretized problem (3.1).<br />

We denote by F N 1 (C, Φ) ∶R N × R N → R N equation<br />

(2.1a) with boundary and interface conditions discretized<br />

by the finite volume method, analogously F N 2 (C, Φ) ∶<br />

R N × R N → R N stands for the finite volume discretized<br />

equation (2.1b) with the boundary and interface conditions.<br />

We choose a parameter training set for C and Φ 1 :<br />

Ξ C<br />

train = (c N (μ1) ,...,c N (μNP ))<br />

Ξ Φ<br />

train = (φ N (μ1) ,...,φ N (μNP ))<br />

Let us assume that we have a basis matrix for C<br />

) ∈ RND×NBc and for Φ ΨΦ =<br />

ΨC = (ψc 1,...,ψc NBC (ψ φ<br />

1 ,...,ψφ )∈R NBφ ND×NBφ then the reduced models,<br />

reduced functions respectively, are given by<br />

(Ψ c ) T ⋅ W ⋅(F N<br />

1 (Ψ c Ccoeff , Ψ φ Φcoeff )) ! = 0 (3.4a)<br />

(Ψ φ ) T<br />

⋅ W ⋅(F N<br />

2 (Ψ c Ccoeff , Ψ φ Φcoeff )) ! = 0 (3.4b)<br />

We should add that the snapshots for Ψ c and Ψ φ are<br />

taken at the same parameter sets (outer POD-Greedy),<br />

but the number <strong>of</strong> the POD basis elements could differ<br />

to achieve the same accuracy (inner POD-Greedy).<br />

A further issue is how to choose the next parameter<br />

in the (POD-)Greedy algorithm. The error estimation we<br />

have to do for two functions c and φ. So we usually<br />

get two different parameter values μc and μφ, where<br />

the L 2 -errors <strong>of</strong> the concentration and <strong>of</strong> the potential,<br />

respectively, attain their maximum values. Then, we<br />

choose the parameter μ ∈{μc,μφ} corresponding to the<br />

greater L 2 -error <strong>of</strong> both.<br />

IV. NUMERICAL EXPERIMENTS<br />

In this section we apply the reduced basis method to<br />

the discretized equations describing the transport processes<br />

in a lithium-ion battery (3.1). The step size in<br />

spatial is Δx = 1μm, the time step size Δt = 5s and we<br />

compute Nt = 10 time steps. The Newton tolerance to<br />

compute the true solution is set 10−6 relatively and 10−9 absolutely. The discretization error <strong>of</strong> the finite volume<br />

solution is ɛFVM =O( 1<br />

100 ).<br />

1 One can also choose different training set, e.g. cf. [7]<br />

- 365 - 15th IGTE Symposium 2012<br />

pos. electrode electrolyte neg. electrode<br />

De, [ cm2<br />

s ] 1.0 ⋅ 10−9 7.5 ⋅ 10−7 3.9 ⋅ 10−10 κ, [ A<br />

]<br />

V ⋅cm<br />

c<br />

0.038 0.002 1.0<br />

0 , [ mol<br />

cm3 ]<br />

cmax, [<br />

0.020574 0.001 0.002639<br />

mol<br />

cm3 ]<br />

U0, [V ]<br />

t+, [−]<br />

k, [<br />

0.02286<br />

0.001<br />

0 0.2<br />

0.02639<br />

0<br />

0<br />

A<br />

cm2 ] 1.3716 ⋅ 10−4 5.2780 ⋅ 10−7 N⋅, [−]<br />

A⋅, [cm<br />

10 30 10<br />

2 ] (50 ⋅ 10−4 ) 2<br />

(50 ⋅ 10−4 ) 2<br />

TABLE 4.1<br />

BATTERY PARAMETERS, [17]<br />

The Newton tolerance for the reduced solution is 10 −5 .<br />

So the L 2 -error between the finite volume and reduced<br />

solution is at best less than<br />

ɛ L 2 = 10 −5 ⋅ Nx ⋅ Nt = 0.005 =∶ ɛGreedy<br />

The tolerance <strong>of</strong> the POD method is set 99%, cf. for the<br />

POD method for instance [18].<br />

Our “standard” battery parameter set is listed in table<br />

4.1, notations can be found in Table 1.2 in the appendix.<br />

We charge the battery with 1.5913 ⋅ 10 −8 A which corresponds<br />

to 1C-rate and set the temperature T = 300K.<br />

Test 1<br />

In this subsection we variate the open circuit voltage<br />

in the positive electrode: μ = Uc ∈[0.001, 4.501]. We<br />

discretize this parameter set with the equidistant step<br />

width ΔUc = 0.1V .Sowehavea46-dimensional training<br />

set for the parameter. All the other parameters are fixed,<br />

cf. Table 4.1.<br />

In Figure 4.1 and 4.2 the finite volumes solutions<br />

chosen by the first two iterations <strong>of</strong> algorithm 1 are<br />

presented. The associated parameters are Uc = 3.001V<br />

and Uc = 4.501V . The concentration seems to be less sensitive<br />

to the circuit voltage than the electrical potential.<br />

The L2-error for the concentration becomes worse after<br />

the second Greedy step, but already after the first Greedy<br />

step the error is smaller than the L2-error tolerance<br />

ɛL2 and stays smaller; the L2-error for the electrical<br />

potential gets denotative smaller with the information <strong>of</strong><br />

a second true solution, cf. Figure 4.3 and 4.4. The same<br />

observation can be done for the L∞-error which is not<br />

presented here. The basis functions are shown in Figure<br />

4.5: they have the same “structure” as the finite volume<br />

solutions for one fixed time step and there is just one<br />

basis function for each Greedy step.<br />

The speed up <strong>of</strong> the reduced solution in comparison<br />

to the true solution is 17.54.<br />

Test 2<br />

In this section we variate a few parameters:<br />

μ ={Dec,Dee,Dea,t+,kc,ka}<br />

The subscript c denotes the parameter in the positive<br />

electrode, e in the electrolyte and a in the


c [mol/cm 3 ]<br />

0.025<br />

0.02<br />

0.015<br />

0.01<br />

0.005<br />

0<br />

20 40 60<br />

x [μ m]<br />

0<br />

t [s]<br />

50<br />

U [V]<br />

0<br />

−1<br />

−2<br />

−3<br />

−4<br />

0 20 40 60<br />

x [μ m]<br />

Fig. 4.1. Test 1: Finite volume solution fort the concentration (left)<br />

and the potential (right) for Uc = 3.001V<br />

c [mol/cm 3 ]<br />

0.025<br />

0.02<br />

0.015<br />

0.01<br />

0.005<br />

0<br />

20 40 60<br />

x [μ m]<br />

0<br />

t [s]<br />

50<br />

U [V]<br />

0<br />

−2<br />

−4<br />

−6<br />

0 20 40 60<br />

x [μ m]<br />

Fig. 4.2. Test 1: Finite volume solution fort the concentration (left)<br />

and the potential (right) for Uc = 4.501V<br />

x 10−5<br />

6.5<br />

6<br />

5.5<br />

5<br />

4.5<br />

1 2 3 4<br />

U [V]<br />

0c<br />

0.08<br />

0.06<br />

0.04<br />

0.02<br />

0<br />

0<br />

0<br />

t [s]<br />

t [s]<br />

1 2 3 4<br />

U [V]<br />

0c<br />

Fig. 4.3. Test 1: L 2 -error after the first Greedy step for the<br />

concentration (left) and the potential (right).<br />

x 10−5<br />

7.019<br />

7.019<br />

7.019<br />

7.019<br />

7.019<br />

7.019<br />

1 2 3 4<br />

U [V]<br />

0c<br />

x 10−3<br />

3.6949<br />

3.6949<br />

3.6949<br />

3.6949<br />

3.6949<br />

1 2 3 4<br />

U [V]<br />

0c<br />

Fig. 4.4. Test 1: L 2 -error after the second Greedy step for the<br />

concentration (left) and the potential (right).<br />

40<br />

20<br />

0<br />

−20<br />

−40<br />

−60<br />

Basis functions for the concentration<br />

−80<br />

0 10 20 30 40 50<br />

Spatial<br />

Greedy step 1<br />

Greedy step 2<br />

30<br />

20<br />

10<br />

0<br />

−10<br />

Basis functions for the potential<br />

−20<br />

0 10 20 30 40 50<br />

Spatial<br />

50<br />

50<br />

Greedy step 1<br />

Greedy step 2<br />

Fig. 4.5. Test 1: Basis functions for the first Greedy step for the<br />

concentration (left) and the potential (right).<br />

negative electrode. We choose the following parameter<br />

set range: for the diffusion coefficients Dec ∈<br />

[1.0 ⋅ 10 −9 , 1.1 ⋅ 10 −9 ], Dee ∈ [7.5 ⋅ 10 −7 , 7.6 ⋅ 10 −7 ],<br />

- 366 - 15th IGTE Symposium 2012<br />

Dea ∈ [3.9 ⋅ 10 −10 , 4.0 ⋅ 10 −10 ], the transference number<br />

t+ ∈ [0.2, 0.3] and the reaction rates kc,ka ∈<br />

[0.02, 0.022]. We discretize the parameter set in the<br />

following way: for the diffusion coefficients we choose<br />

the boundary values, for the other parameters we also<br />

take the boundary values and a value in between. With<br />

this discretization we get a 216-dimensional trainings<br />

set. All the other parameters are fixed like in table 4.1<br />

noted, but in contrast to the previous subsection the POD<br />

tolerance is set 1 − 1 ⋅ 10 −8 %.<br />

The graphical results are listed in the Figures 4.6 and<br />

4.7: In Figure 4.6 the finite volume solutions for the first<br />

parameter set can be seen. The first parameter set is the<br />

one denoted in table 4.1. The L 2 -error <strong>of</strong> the reduced<br />

solutions in comparison to the finite volume solution<br />

is for the concentration smaller than 10 −5 and for the<br />

electrical potential 10 −4 to all 216 parameter sets. The<br />

L ∞ -error is smaller than 10 −3 for the concentration as<br />

well as for the potential after one Greedy step. The basis<br />

functions are plotted in Figure 4.7: there are four basis<br />

functions for the concentration and three for the potential.<br />

c [mol/cm 3 ]<br />

0.025<br />

0.02<br />

0.015<br />

0.01<br />

0.005<br />

0<br />

20 40 60<br />

x [μ m]<br />

0<br />

t [s]<br />

50<br />

−1<br />

−2<br />

−3<br />

−4<br />

x 10<br />

0<br />

−3<br />

−5<br />

0 20 40 60<br />

x [μ m]<br />

Fig. 4.6. Test 2: Finite volume solution fort the concentration (left)<br />

and the potential (right) for first parameter set<br />

80<br />

60<br />

40<br />

20<br />

0<br />

−20<br />

−40<br />

Basis functions for the concentration<br />

−60<br />

0 10 20 30 40 50<br />

Spatial<br />

U [V]<br />

30<br />

20<br />

10<br />

0<br />

−10<br />

−20<br />

−30<br />

0<br />

t [s]<br />

Basis functions for the potential<br />

−40<br />

0 10 20 30 40 50<br />

Spatial<br />

Fig. 4.7. Test 2: Basis functions after the first Greedy step for the<br />

concentration (left) and the potential (right).<br />

The L 2 -error between the reduced solution and the true<br />

solution is after one Greedy step smaller than the L 2 -error<br />

tolerance ɛ L 2 to all 216 parameter set. That means that<br />

just the information <strong>of</strong> one computational expensive true<br />

solution is needed to compute the reduced solutions to<br />

all 216 parameter sets with an acceptable error.<br />

The speed up <strong>of</strong> the reduced solution in comparison<br />

to the true solution is 12.37.<br />

V. DISCUSSION<br />

In the present document we do the same approach like<br />

in [10].<br />

The reduced basis approach works for the transport<br />

equation in a lithium-ion battery: in the above numerical<br />

50


Notation<br />

c, [ mol<br />

cm3 ] concentration <strong>of</strong> the lithium/lithium-ions<br />

φ, [V ] electrical potential<br />

α, [ cm2<br />

] coefficient function<br />

s<br />

β, [ mol<br />

] coefficient function<br />

V ⋅cm⋅s<br />

λ, [ A⋅cm2<br />

] coefficient function<br />

mol<br />

I, [A] current<br />

De, [ cm2<br />

]<br />

s<br />

κ, [<br />

diffusion coefficient<br />

A<br />

]<br />

V ⋅cm<br />

c<br />

electric/ionic conductivity<br />

0 , [ mol<br />

cm3 ] start concentration <strong>of</strong> lithium in the<br />

electrodes/electrolyte<br />

cmax, [ mol<br />

cm3 ]<br />

U0, [V ]<br />

t+, [−]<br />

k, [<br />

maximum <strong>of</strong> lithium the electrode can store<br />

open circuit potential<br />

transference number<br />

A<br />

cm2 ]<br />

N⋅, [−]<br />

A⋅, [cm<br />

reaction rates<br />

number <strong>of</strong> control volumes<br />

2 ] cross section<br />

TABLE 1.2<br />

NOTATIONS OF THE BATTERY PARAMETERS<br />

tests there are at most two Greedy steps needed to reach<br />

the L 2 -error tolerance ɛ L 2. If the knowledge <strong>of</strong> all finite<br />

volume solutions to the training set is needed, the method<br />

would not be sufficient. In the above numerical tests<br />

we see that the limiting factor is the potential: for the<br />

concentration one Greedy step is sufficient but for the<br />

potential we need in some cases an additional Greedy<br />

step.<br />

The application <strong>of</strong> the reduced basis method to this<br />

problem is not completed yet: We have to develop an<br />

a posteriori error estimator so that we do not have to<br />

compute all true solutions to the discretized parameter<br />

set. Further the computational time <strong>of</strong> the reduced<br />

solutions in comparison to the finite volume solutions<br />

for the presented numerical tests are fast but not so<br />

fast as it could be. In every Newton step we have to<br />

evaluate the nonlinearities completely. Also we have no<br />

affine parameter dependence. If you have a linear(ized)<br />

affine parameter dependent problem you can separate the<br />

parameter dependence from the bilinear form and from<br />

the linear form. To get an affine parameter dependent<br />

problem as well as a linearized problem we have to<br />

apply the (discrete) empirical interpolation method, cf.<br />

for instance [2], [3].<br />

VI. ACKNOWLEDGMENTS<br />

The authors gratefully acknowledge support by the<br />

Adam Opel AG. Besides Competence Center The Virtual<br />

Vehicle (<strong>Graz</strong>) supported the lecture by Mr. Volkwein<br />

within the scope <strong>of</strong> the IGTE Symposium.<br />

APPENDIX<br />

NOTATION<br />

In Table 1.2 one can find some notations for the battery<br />

parameters.<br />

- 367 - 15th IGTE Symposium 2012<br />

REFERENCES<br />

[1] P.W. Atkins, “Physikalische Chemie”, Wiley-VCH, 2., vollst.<br />

neubearb. A., 1996.<br />

[2] M. Barrault and N.C. Nguyen and Y. Maday and A.T. Patera,<br />

“An “Empirical Interpolation” Method: Application to Efficient<br />

Reduced-Basis Discretization <strong>of</strong> Partial Differential Equations”, C.<br />

R. Acad. Sci. Paris, Série I., pp. 667–672, 339, 2004.<br />

[3] S. Chaturantabut and D.C. Sorensen, “Nonlinear Model Reduction<br />

via Discrete Empirical Interpolation”, SIAM J. Sci. Comput., 32(5),<br />

pp. 2737–2764, 2010.<br />

[4] M. Doyle and T.F. Fuller and J. Newman, “Modeling <strong>of</strong> Galvanostatic<br />

Charge and Discharge <strong>of</strong> the Lithium/Polymer/Insertion Cell”,<br />

Journal <strong>of</strong> The Electrochemical Society, 140(6), pp. 1526–1533,<br />

1993.<br />

[5] C. M Doyle, “Design and Simulation <strong>of</strong> Lithium Rechargeable<br />

Batteries”, Ph.D. thesis, 1995.<br />

[6] W. Dreyer and M. Gaberscek and C. Guhlke and R. Huth and<br />

J. Jamnik, “Phase Transition and Hysteresis in a Recharchable<br />

Lithium Battery Revisited”, European J. Appl. Math., 22, pp. 267–<br />

290, 2011.<br />

[7] A.-L. Gerner and K. Veroy, “Certified reduced basis method for<br />

parameterized saddle point problems”, SIAM J. Sci. Comput.,<br />

(accepted Jul 2012).<br />

[8] M.A. Grepl and A.T. Patera, ”A Posteriori Error Bounds for<br />

Reduced-Basis Approximations <strong>of</strong> Parametrized Parabolic Partial<br />

Differential Equations”, Mathematical Modelling and Numerical<br />

Analysis, 2005, 39(1), pp. 157-181.<br />

[9] M.A. Grepl, Y. Maday, N.C. Nguyen, and A.T. Patera, ”Efficient<br />

Reduced-Basis Treatment <strong>of</strong> Nonaffine and Nonlinear Partial<br />

Differential Equations”, Mathematical Modelling and Numerical<br />

Analysis, 2007, 41(3), pp. 575-605.<br />

[10] B. Haasdonk and M. Ohlberger, “Reduced Basis Method for<br />

Finite Volume Approximations <strong>of</strong> Parametrized Linear Evolution<br />

Equations”, Math. Model. Numer. Anal., 42(2), 2008, pp. 277-302.<br />

[11] P. Holmes and J.L. Lumley and G. Berkooz and C. Rowley,<br />

“Turbulence, Coherent Structures, Dynamical Systems and Symmetry”,<br />

Cambridge Monographs on Mechanics, 2012, Cambridge<br />

<strong>University</strong> Press.<br />

[12] O. Lass and S. Volkwein, “POD Galerkin schemes for nonlinear<br />

elliptic-parabolic systems”, submitted, 2011.<br />

[13] A. Latz and J. Zausch and O. Iliev, “Modeling <strong>of</strong> Species and<br />

Charge Transport in Li–Ion Batteries Based on Non-Equilibrium<br />

Thermodynamics”, Lecture Notes in Computer Science 6046, 329–<br />

337, 2011.<br />

[14] A. Latz and J. Zausch, “Thermodynamic Consistent Transport<br />

Theory <strong>of</strong> Li-Ion Batteries”, Journal <strong>of</strong> Power Sources 196, 3296-<br />

3302, 2011.<br />

[15] J. S. Newman and K. E. Thomas-Alyea, “Electrochemical Systems”,<br />

Wiley John + Sons, 3rd ed., 2004.<br />

[16] A. T Patera and G. Rozza, Reduced Basis approximation and<br />

A Posteriori Error Estimation for Parametrized Partial Differential<br />

Equations, MIT, 2007.<br />

[17] P. Popov, Y. Vutov, S. Margenov and O. Iliev, ”Finite Volume<br />

Discretization <strong>of</strong> Equations Describing Nonlinear Diffusion in Li-<br />

Ion Batteries,” Fraunh<strong>of</strong>er ITWM report 191, 2010.<br />

[18] S. Volkwein, “Model Reduction Using Proper Orthogonal Decomposition”,<br />

lecture notes, Konstanz, 2011.<br />

[19] J. Wu and J. Xu and H. Zou, “On the Well-posedness <strong>of</strong> a<br />

Mathematical Model for Lithium-Ion Battery Systems, Methods<br />

and Applications <strong>of</strong> Analysis, 13(3), pp. 275–298, 2006.


- 368 - 15th IGTE Symposium 2012<br />

Surrogate Parameter Optimization based on<br />

Space Mapping for Lithium-Ion Cell Models<br />

Matthias K. Scharrer∗ , Bettina Suhr∗ , and Daniel Watzenig∗† ∗Kompetenzzentrum – Das Virtuelle Fahrzeug Forschungsgesellschaft mbH (ViF),<br />

Inffeldgasse 21/A/I, A-8010 <strong>Graz</strong>, Austria<br />

† Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing,<br />

Kopernikusgasse 24/4, A-8010 <strong>Graz</strong>, Austria<br />

E-mail: matthias.scharrer@v2c2.at<br />

Abstract—Optimizing batteries <strong>of</strong> electric cars is a complex and time consuming task. In order to reduce the number <strong>of</strong><br />

prototypes, development costs and time, reliable numerical models are highly required. But optimizing models reflecting the<br />

fundamental electrochemical processes is typically computationally expensive. In this paper we present a surrogate model<br />

optimization approach based on space mapping to reduce computation time. This technique is applied to the parameter<br />

estimation problem <strong>of</strong> an electrochemical cell model by linking a coarse linearized model to the accurate model. We<br />

present results <strong>of</strong> two synthetical fitting problems solved directly and by our surrogate optimization method to validate the<br />

approach. As a remarkable result 15% reduction <strong>of</strong> computation time for the one dimensional case and 25% for the two<br />

dimensional case are obtained. We discuss a simple measure that doubles the achieved reduction to 48% for the latter.<br />

The method can easily be adopted to speed up other gradient–based optimization problems. Since the used electrochemical<br />

model shows strong non–linear behaviour, the achieved speed up indicates even better performance in the case <strong>of</strong> relaxed<br />

conditions.<br />

Index Terms—multi physics, space mapping, surrogate optimization<br />

I. INTRODUCTION<br />

In terms <strong>of</strong> pollutant emissions during vehicle operation,<br />

battery–powered and hybrid vehicles are clearly<br />

more environmentally friendly than those purely based<br />

on combustion engines. In order to reduce the number <strong>of</strong><br />

expensive prototypes the fast and reliable simulations <strong>of</strong><br />

the electrical and chemical behavior <strong>of</strong> cells are becoming<br />

increasingly important. Also a more efficient operation <strong>of</strong><br />

a battery can be achieved, when simulation models are<br />

used to estimate the battery’s internal states – e.g. state–<br />

<strong>of</strong>–charge (SOC), state–<strong>of</strong>–function and state–<strong>of</strong>–health –<br />

from measurement data.<br />

Many internal variables and material properties are<br />

difficult to access or not measurable. Several approaches<br />

exist to get insight into the internal dynamic processes in<br />

a lithium–ion cell. The field was pioneered by Newman<br />

and co–workers [1], [2]. Overviews can be found in [3]<br />

and [4]. The authors focus on modeling the cell in terms<br />

<strong>of</strong> transport equations for lithium ions, chemical interaction<br />

and electronic field computations in active particles<br />

<strong>of</strong> anode and cathode. These are coupled by modeling<br />

electrode kinetics occuring on the particle surfaces <strong>of</strong><br />

such electrodes.<br />

Recently, much effort is put into estimating parameters<br />

for cell models to gain better knowledge about effects<br />

occuring during life time. In this work we focus on non–<br />

invasive methods only, i.e. methods that estimate parameters<br />

by matching predicted cell model voltages for a given<br />

current pr<strong>of</strong>ile to experimental measurements without<br />

the need to destructively open the cell. For example,<br />

Santhanagopalan and White [5] devised an online SOC–<br />

estimation method by applying an extended Kalman<br />

filter to a simplified ordinary differential equation model.<br />

In [6], [7] a gradient based method to parameter estimation<br />

is introduced – based on Levenberg–Marquardt<br />

optimization. Here Santhanagopalan et al. investigate<br />

both single and multi–particle systems and successfully<br />

identify five parameters for constant current charge and<br />

discharge cycles. Schmidt et al. estimate parameters<br />

<strong>of</strong> a single–particle model in a combined approach by<br />

performing Fisher–information based parameter analysis<br />

and applying a pattern search algorithm consecutively [8].<br />

In contrast to the usage <strong>of</strong> single–particle models (SPM)<br />

and deterministic estimation algorithms above, Forman et<br />

al. [9] focus on fitting the Doyle–Fuller–Newman (DFN)<br />

model [1] to battery cycling data by application <strong>of</strong> genetic<br />

algorithm.<br />

The above literature provides an overview <strong>of</strong> a range<br />

<strong>of</strong> different methods and models to estimate the intrinsic<br />

material properties and unknown states. In contrast to<br />

using a SPM, we try to find a mechanism that allows<br />

us to use the higher detailed DFN model, as suggested<br />

in [9]. But, as opposed to the latter, we try to further<br />

improve the speed <strong>of</strong> parameter estimation up to online<br />

application if possible.<br />

In this paper the application <strong>of</strong> the so called space<br />

mapping method – first mentioned in [10] – to the<br />

parameter estimation <strong>of</strong> the cell model is investigated.<br />

Thus, it is possible to apply a fast gradient based Gauss–<br />

Newton optimization method to the complex DFN model<br />

and use a simplified model as a surrogate to both, speed<br />

up direct model evaluation and gradient computation at<br />

once.


The remainder <strong>of</strong> this paper is structured as follows:<br />

Section II defines the simulation framework <strong>of</strong> the cell<br />

model and briefly summarizes the solution procedure. A<br />

mechanistic model describing the electrochemistry <strong>of</strong> a<br />

lithium–ion cell motivated by the DFN model [1] has<br />

been implemented as a system <strong>of</strong> coupled non–linear<br />

partial differential equations (PDEs) in one dimension<br />

[11]. In Section III the optimization problem and solution<br />

algorithm to estimate the parameters is defined.<br />

Section IV presents a framework how to replace many <strong>of</strong><br />

the time consuming evaluations <strong>of</strong> the complex forward<br />

model by a surrogate model – a linearization <strong>of</strong> the<br />

complex model in this case – and how a link between<br />

the two models is established. In Section V we present<br />

and discuss the results. Finally, Section VI summarizes<br />

and concludes the paper.<br />

II. FORMULATION OF THE PROBLEM<br />

In order to mathematically describe the internal dynamic<br />

processes in a lithium–ion cell, a mechanistic<br />

electrochemical model has been realized as a system<br />

<strong>of</strong> coupled non–linear partial differential equations in<br />

one dimension [11]. A lithium–ion cell with two porous<br />

intercalation electrodes (cathode in Ωc and anode in<br />

Ωa) and an electronically isolating separator in Ωs in<br />

between is considered. For homogenization purpose each<br />

electrode is assumed to consist <strong>of</strong> two phases. We assume<br />

spherical particles in both cathode (in Λc) and anode<br />

(in Λa), which line up continously in x direction. The<br />

liquid phase modeled in each electrode is electrolyte.<br />

In the separator Ωs we only consider electrolyte, as the<br />

solid phase in the separator does not participate in the<br />

reactions. In Figure 1 a schematic view <strong>of</strong> the modeled<br />

domain is given.<br />

Ri<br />

, a<br />

a<br />

a<br />

Ro,<br />

a<br />

a<br />

r r<br />

<br />

c <br />

s<br />

a,<br />

s c,<br />

s<br />

c<br />

<br />

<br />

<br />

Ro, c<br />

Fig. 1. Problem Domain: The spatial domains are defined as Ω=<br />

Ωa ∪ Ωs ∪ Ωc ⊂ R, Ω ′ =Ωa ∪ Ωc, Λa =Ωa × [0,Ra] ⊂ R 2 ,<br />

Λc =Ωc × [0,Rc] ⊂ R 2 , Λ=Λa ∪ Λc and Ra,Rc ∈ R.<br />

The implemented model is similar to the widely<br />

used DFN approach [1], extended to include additional<br />

aspects, e.g. from [2] and [12]. The full model will<br />

be described in [13]. It is a coupled system <strong>of</strong> non–<br />

linear partial differential equations in one dimension.<br />

The variables are potentials and concentrations for the<br />

electrolyte (ϕℓ,cℓ), for the cathode (ϕsc,csc) and for<br />

the anode (ϕsa,csa). The one–dimensional cell model<br />

considered is defined by the system (1).<br />

c<br />

Ri , c<br />

x<br />

- 369 - 15th IGTE Symposium 2012<br />

TABLE I<br />

LIST OF SYMBOLS<br />

Ai<br />

inner surface (m 2 m −3 )<br />

Dℓ<br />

diffusivity in electrolyte (m 2 s −1 )<br />

Ds diffusivity in solid (m 2 s −1 )<br />

Cdl double layer capacity (Fm −1 )<br />

F Faraday’s constant (= 96485C mol −1 )<br />

R universal gas constant (= 8.31447 Jmol −1 K −1 )<br />

T temperature (K)<br />

UOCV(cs) equilibrium potential function (V )<br />

cℓ<br />

Li + –concentration in electrolyte (mol m −3 )<br />

cℓ,0 initial Li + –concentration in electrolyte (mol m −3 )<br />

cs<br />

Li + –concentration in active particles (mol m −3 )<br />

cs,0 initial Li + –concentration in particles (mol m −3 )<br />

i(t) cell current density (Am −2 )<br />

j ∗<br />

BV Butler–Volmer current density (Am −2 k<br />

)<br />

exchange current density and reaction rate (mol m −2 s −1 )<br />

t +<br />

ℓ<br />

z<br />

transference number <strong>of</strong> cations (1)<br />

number <strong>of</strong> transferred electrons per unit (Li + : z =1)(1)<br />

αA,αK anodic/cathodic charge transfer coefficients (1)<br />

εℓ<br />

electrolyte volume fraction (1)<br />

κ (cℓ) ionic conductivity function (Sm −1 )<br />

μℓ<br />

migration coefficient<br />

ϕs<br />

electrochemical potential <strong>of</strong> the active material (V )<br />

ϕℓ<br />

electrochemical potential <strong>of</strong> the electrolyte (V )<br />

σs<br />

electronic conductivity (Sm −1 )<br />

−∇ · (σs∇ϕs) =−Aij ∗<br />

BV in Ω ′<br />

<br />

−∇ · κℓ(cℓ)∇ϕℓ + RT<br />

<br />

+ 1<br />

κℓ(cℓ)t ℓ ∇cℓ = Aij<br />

zF cℓ<br />

∗<br />

BV in Ω<br />

∂ (ɛℓcℓ)<br />

∂t<br />

−∇·<br />

<br />

Dℓ ∇cℓ + zF<br />

RT μℓcℓ∇ϕℓ<br />

<br />

= Ai<br />

zF j∗ BV in Ω<br />

∂cs 1<br />

−<br />

∂t r2 <br />

∂<br />

Dsr<br />

∂r<br />

2 <br />

∂cs<br />

=0 in Λ<br />

∂r<br />

j ∗<br />

BV =<br />

⎧ <br />

αAzF(ϕs−ϕℓ −UOCV(cs))<br />

<br />

zFk exp<br />

RT<br />

+<br />

⎪⎨<br />

<br />

−(1−αK)zF(ϕs−ϕℓ −UOCV(cs))<br />

<br />

−zFk exp<br />

RT<br />

+<br />

∂(ϕs−ϕℓ) ⎪⎩<br />

+Cdl ∂t<br />

in Ω ′<br />

(1)<br />

0 else<br />

where the system variables are defined as ·(t, x) at time<br />

t ∈ [0,T], T ∈ R and at space point x and (x, r),<br />

respectively. A comprehensive overview <strong>of</strong> symbols is<br />

given in Table I.<br />

Homogenous Neumann conditions are applied at the<br />

boundaries except for the outer boundaries <strong>of</strong> potentials<br />

and concentrations in solid phase:<br />

ϕs =0 on Γa × [0,T]<br />

−σs∇ϕs = −i (t) on Γc<br />

∂cs 1<br />

−Ds =<br />

∂r zF j∗<br />

BV on ΓRo,a ∪ ΓRo,c<br />

The concentrations are restricted by the following<br />

initial conditions:<br />

cℓ = cℓ,0<br />

cs = cs,0<br />

in Ω<br />

in Λ<br />

The potentials are consistently initialized at rest by<br />

the condition j (x, 0) = 0. The solution <strong>of</strong> this system <strong>of</strong><br />

four non–linearly coupled partial differential equations<br />

is done by application <strong>of</strong> the Finite Element Method<br />

with linear test functions for spatial discretization and<br />

Backwards Euler Method for time integration. The non–<br />

linearity is solved by a damped Newton method – see<br />

[11] for details.<br />

(2)<br />

(3)


III. PARAMETER ESTIMATION<br />

The system described in the previous section contains<br />

many parameters which cannot be measured directly. To<br />

formulate the parameter estimation problem in a general<br />

way, we merge the parameter set <strong>of</strong> interest into the<br />

parameter vector μ ∈ Pad ⊂ R m , where Pad is defined<br />

as the admissible parameter set. The basis optimization<br />

problem is introduced as<br />

μ ∗ =argminH (f (μ)) , (4)<br />

μ∈Pad where an optimal set <strong>of</strong> parameters μ ∗ ∈ Pad is sought,<br />

which minimizes a merit function H <strong>of</strong> a model response<br />

f (μ) depending on the parameters μ.<br />

Since we focus on parameter estimation based on cell<br />

voltages, we set H to compute the difference with respect<br />

to a predescribed function ˆy. We rewrite (4) to<br />

μ ∗ =argminwi (y(ti; μ) − ˆy(ti))<br />

μ∈Pad 2<br />

2<br />

where we want to minimize the difference between measured<br />

cell voltages, ˆy, and computed voltages, f (μ) =<br />

y (·; μ) =ϕs| Γc − ϕs| Γa , at predefined times, ti. Variations<br />

in time step sizes are taken into account by the<br />

weights wi.<br />

Classical optimization using this objective function<br />

yields unacceptable response times, since not only the<br />

solution <strong>of</strong> the system defined in Section II has to be<br />

computed, but additionally the derivative <strong>of</strong> the objective<br />

function with respect to every parameter in our set <strong>of</strong><br />

interest μ has to be estimated. Since this might be<br />

intractable for non–linear PDE constraint problems, we<br />

revert to numerical gradient estimation by finite differences.<br />

As execution time <strong>of</strong> a single simulation on current<br />

hardware varies between seconds and hours – depending<br />

on the prescribed input pr<strong>of</strong>ile – direct evaluation <strong>of</strong> (5)<br />

is not acceptable due to its enormous computation times.<br />

IV. PROPOSED FRAMEWORK<br />

To speed up parameter estimation, we introduce space<br />

mapping – first mentioned in [10] – to the cell model optimization.<br />

The idea behind is best described as follows:<br />

We have a very complex and accurate – fine – model<br />

that describes a process on basis <strong>of</strong> a couple <strong>of</strong> parameters.<br />

We search for an optimal set <strong>of</strong> parameters with<br />

respect to some cost function by repeatingly evaluating<br />

our model and computing model responses for intermediate<br />

sets <strong>of</strong> parameters. Since a single evaluation <strong>of</strong> the<br />

model is expensive, we replace the responses by results <strong>of</strong><br />

a much cheaper and less accurate – coarse – model (also<br />

known as surrogate model) describing the same process<br />

by using a similar parameter set. Thus we only get a<br />

vague idea <strong>of</strong> where the optimal parameters are with<br />

respect to the fine model. Finally, we link the results<br />

by evaluating the fine model and establish a mapping<br />

between the individual parameter spaces <strong>of</strong> both, the fine<br />

and the coarse model. Since this results in fewer calls <strong>of</strong><br />

the fine model, the optimization time can be reduced.<br />

- 370 - 15th IGTE Symposium 2012<br />

(5)<br />

In our case this means to substitute evaluations <strong>of</strong> the<br />

fine model u = F(μ), where u is the tuple representing<br />

the solutions to (1) and F(·) is the solution operator,<br />

by evaluations <strong>of</strong> a coarse model v = C(λ), where λ ∈<br />

Lad ⊂ R m is the parameter set <strong>of</strong> interest <strong>of</strong> the coarse<br />

model, v is the tuple representing the solutions <strong>of</strong> the<br />

coarse model and C(·) is the solution operator.<br />

A mapping function p : Pad → Lad enables us<br />

to establish a link between the two models such that<br />

the response <strong>of</strong> the coarse model c(p(μ)) is a good<br />

approximation for f(μ). Of course, the coarse model<br />

response c (·) has to be defined the same way as the<br />

fine model response f (·). Since directly evaluating the<br />

mapping function p is not possible, we introduce a new<br />

optimization problem:<br />

<br />

<br />

λ =argminc( ˜λ∈L ad<br />

˜ <br />

<br />

λ) − f(μ) 2<br />

2<br />

A problem with this approach is the computation <strong>of</strong> the<br />

Jacobian <strong>of</strong> the space mapping. Bandler suggested a time<br />

consuming way in [10]. This unnecessary big effort can<br />

be circumvented by applying Broyden’s formula [14], as<br />

discussed in [15]. The latter will be used in this paper.<br />

Using the coarse model response c (·), we reformulate<br />

the optimization problem in (4) as follows:<br />

˜μ ∗ =argminH (c (p (μ))) , (7)<br />

μ∈Pad where ˜μ ∗ is a coarse approximation <strong>of</strong> μ ∗ . By iteratively<br />

updating p(·), the solution <strong>of</strong> the surrogate problem ˜μ ∗<br />

is supposed to converge towards the real solution μ ∗ .<br />

The algorithm applied to indirectly solve the optimization<br />

problem defined in (5) by means <strong>of</strong> the space mapping<br />

is stated in Algorithm 1.<br />

Following the idea <strong>of</strong> [16], we obtain the simplified<br />

coarse model by linearizing the right hand side <strong>of</strong> the<br />

original system by approximation on the basis <strong>of</strong> Taylor<br />

series expansion:<br />

<br />

j (v; λ) ≈ ˆj (û; ˆμ)+ ∇uˆj T <br />

(û; ˆμ) (v − û)+ ∇μˆj T (û; ˆμ) (λ − ˆμ),<br />

(8)<br />

where û denotes the state <strong>of</strong> the original system for a<br />

reference parameter set ˆμ, ˆj (û;ˆμ) denotes the function<br />

j∗ BV in a working point – throughout the rest <strong>of</strong> the paper<br />

we write ˆj for a function ˆj (û;ˆμ). The parameters <strong>of</strong> the<br />

linearized model are denoted by v and λ, respectively.<br />

The non–linear factors on the left hand side, i.e. the<br />

ionic conductivity κℓ and direct occurrences <strong>of</strong> the Li + –<br />

concentrations in solution cℓ, are fixed to their initial<br />

values. In (11) the coarse model is stated as used. In<br />

addition, the boundary condition <strong>of</strong> the concentrations in<br />

solid phase cs changes to:<br />

<br />

∂cs 1 <br />

−Ds = ∇u<br />

∂r zF<br />

ˆj<br />

T <br />

v + ∇μˆj T λ + ˆ <br />

Jc on ΓRo,a ∪ ΓRo,c<br />

(9)<br />

where the constant parts <strong>of</strong> the linearization (8) are given<br />

as:<br />

<br />

ˆJc = ˆj − ∇uˆj T <br />

û − ∇μˆj T ˆμ (10)<br />

(6)


- 371 - 15th IGTE Symposium 2012<br />

<br />

∂ˆj<br />

−∇ · (σs∇ϕs) +Ai ϕs +<br />

∂ϕs<br />

∂ˆj<br />

ϕℓ +<br />

∂ϕℓ<br />

∂ˆj<br />

cℓ +<br />

∂cℓ<br />

∂ˆj<br />

cs = −Ai ∇μ<br />

∂cs<br />

ˆj<br />

T λ + ˆ <br />

Jc in Ω ′<br />

<br />

−∇ · κℓ(ĉℓ)∇ϕℓ + RT<br />

<br />

+ 1<br />

∂ˆj<br />

κℓ(ĉℓ)t ℓ ∇cℓ − Ai ϕs +<br />

zF ĉℓ<br />

∂ϕs<br />

∂ˆj<br />

ϕℓ +<br />

∂ϕℓ<br />

∂ˆj<br />

cℓ +<br />

∂cℓ<br />

∂ˆj<br />

cs = Ai ∇μ<br />

∂cs<br />

ˆj<br />

T λ + ˆ <br />

Jc in Ω<br />

∂ (ɛℓcℓ)<br />

∂t<br />

−∇·<br />

<br />

Dℓ ∇cℓ + zF<br />

RT μℓĉℓ∇ϕℓ<br />

<br />

+ Ai<br />

<br />

∂ˆj<br />

ϕs +<br />

zF ∂ϕs<br />

∂ˆj<br />

ϕℓ +<br />

∂ϕℓ<br />

∂ˆj<br />

cℓ +<br />

∂cℓ<br />

∂ˆj<br />

<br />

cs = −<br />

∂cs<br />

Ai<br />

∇μ<br />

zF<br />

ˆj<br />

T λ + ˆ <br />

Jc in Ω<br />

∂cs 1<br />

−<br />

∂t r2 <br />

∂<br />

Dsr<br />

∂r<br />

2 <br />

∂cs<br />

=0 in Λ<br />

∂r<br />

Algorithm 1 Space mapping surrogate optimization<br />

Input: Initial μ0 ∈ Pad; set i =0,λ0 = μ0, and B0 = I.<br />

1: Evaluate f (μ0) and H (f (μ0))<br />

2: repeat<br />

3: Define mapping function<br />

pi (μ) ← Bi (μ − μi) +λi<br />

4: Compute new candidate parameter<br />

˜μ ∗<br />

i+1 ← arg min H (c (pi (μ)))<br />

μ∈Pad 5: Evaluate f ˜μ ∗ ∗<br />

i+1 and H f ˜μ i+1<br />

6: if H f ˜μ ∗ <br />

i+1


implementation <strong>of</strong> Algorithm 1 reached its final residual<br />

<strong>of</strong> 4.110 −7 after 4 hours and 4 iterations which results in<br />

a reduction <strong>of</strong> runtime <strong>of</strong> 15%. This speed up is induced<br />

by the number <strong>of</strong> evaluations <strong>of</strong> the fine model being<br />

reduced from 20 to 5. But the additional 54 evaluations <strong>of</strong><br />

the coarse model in typically 189.6±2.4s during the two<br />

optimization processes undermines this large reduction.<br />

The differences in the curves resulting from inital and<br />

final parameters can be mainly related to the strong<br />

impact <strong>of</strong> the non–linearity <strong>of</strong> the equilibrium potential<br />

function U OCV(cs). Different inital Li + –concentrations<br />

cs,0 lead to a different working window <strong>of</strong> the curve,<br />

so that the shape changes. Additionally, by changing the<br />

initial amount <strong>of</strong> Li + , the available amount to move<br />

inside the system is changed. This intrinsic value can<br />

be estimated by the volume integral <strong>of</strong> the initial Li + –<br />

concentrations in the electrodes <br />

Ω εscs,0 dΩ. The limits<br />

<strong>of</strong> Li + –concentrations <strong>of</strong> each electrode – commonly<br />

referred to as full and empty, respectively – will result in<br />

the cell capacity. By modifying the cell capacity and the<br />

effective load, which can differ from the preset 0.2h−1 ,<br />

the lower cut–<strong>of</strong>f voltage is reached at different times.<br />

The model response and the reference data where padded<br />

with their final value, to match the length <strong>of</strong> one another.<br />

Because <strong>of</strong> the sharp voltage drop near the empty cell,<br />

the sensitivity <strong>of</strong> the objective function is very high.<br />

This simplifies finding the optimum and interpreting the<br />

results.<br />

The second task was to simulate a 100s short charge<br />

pulse <strong>of</strong> 0.5h−1 load to find the exchange current density<br />

and reaction rate μ = k = {ka,kc} in both anode<br />

Ωa and cathode Ωc starting from 50% SOC. As shown<br />

in [6] and confirmed by our own investigations, the<br />

exchange current density and reaction rate are showing<br />

high impact on the results. The reference value μ∗ in this<br />

case was set to 10−7 , 10−7 mol m−2s−1 , optimization<br />

was initialized at μ0 = 10−8 , 10−6 mol m−2s−1 –<br />

see Figure 3 for resulting voltage curves. In this case,<br />

stopping criteria were applied tighter as before, because<br />

<strong>of</strong> the smaller number <strong>of</strong> points in time <strong>of</strong> the problem:<br />

• absolute value <strong>of</strong> the function value<br />

H (f (μi))


TABLE II<br />

COARSE MODEL SPEED UP COMPARISON<br />

T model runtime model Total Speed<br />

evaluations runtime up<br />

1 2.95 ± 0.07s 549 (7) 1469s 1.3<br />

2 2.39 ± 0.05s 474 (7) 1241s 1.6<br />

5 2.04 ± 0.05s 474 (7) 1071s 1.8<br />

10 1.92 ± 0.04s 486 (7) 1038s 1.9<br />

20 1.87 ± 0.04s 549 (8) 1145s 1.7<br />

non–linear 11.8 ± 0.19s 165 1947s 1.0<br />

Speed up achieved by space mapping surrogate optimization <strong>of</strong><br />

kinetic rate parameters k for different reassembly periods T<br />

compared to the non–linear case (last line). Model evaluations in<br />

parentheses (·) are non–linear model evaluations.<br />

TABLE III<br />

PROGRESS OF RESIDUALS DURING OPTIMIZATION<br />

H (f(μk))<br />

i T =1 T =2 T =5 T =10 T =20<br />

0 9.566E-02<br />

1 5.159E-01 5.161E-01 5.159E-01 5.126E-01 4.261E-01<br />

2 5.276E-02 5.280E-02 5.287E-02 4.127E-02 4.179E-02<br />

3 2.414E-03 2.426E-03 2.439E-03 3.793E-03 3.802E-03<br />

4 5.399E-06 5.484E-06 5.648E-06 9.178E-06 1.049E-05<br />

5 3.229E-11 3.726E-11 4.589E-11 6.074E-11 3.258E-11<br />

6 7.749E-13 7.840E-13 7.796E-13 9.783E-13 1.035E-12<br />

7 — — — — 9.031E-13<br />

Residuals achieved by space mapping surrogate optimization <strong>of</strong><br />

kinetic rate parameters k at iterations i for different reassembly<br />

periods T .<br />

Additionally, an optimum exists for some reassembly<br />

period T , which appears in the decrease <strong>of</strong> the speed<br />

up factor at some point. This is related to the additional<br />

iteration necessary to reach the target residual threshold.<br />

Yet, the performance <strong>of</strong> each iteration <strong>of</strong> the optimization<br />

procedure shows similar progression, as can be seen in<br />

Table III for different reassembly periods which strengthens<br />

the before mentioned assumption <strong>of</strong> adaptivity <strong>of</strong> the<br />

algorithm.<br />

VI. CONCLUSION<br />

This paper shows the application <strong>of</strong> the space mapping<br />

approach to speed up estimation <strong>of</strong> the parameters <strong>of</strong> a<br />

DFN motivated battery model. This is done by substituting<br />

model evaluations by the response <strong>of</strong> a fast surrogate<br />

model. Afterwards, the obtained parameters are mapped<br />

into the original model’s parameter space by an iteratively<br />

refined mapping function.<br />

We have validated the algorithm by applying it to<br />

two synthetic fitting problems, where parameters <strong>of</strong> a<br />

simulation are recovered starting from a different point<br />

in parameter space. The one dimensional quasi–stationary<br />

case results in a reduction <strong>of</strong> 15%, whereas in the two<br />

dimensional case shows 25% as compared to the direct<br />

optimization computation times. Further simplification <strong>of</strong><br />

the coarse model increases the latter to 48%.<br />

There is another advantage that evolves out <strong>of</strong> the<br />

usage <strong>of</strong> a linear coarse model, which is the possibility<br />

to state the adjoint system, which can be used to estimate<br />

the cost functions’s exact gradient after only one<br />

- 373 - 15th IGTE Symposium 2012<br />

additional evaluation instead <strong>of</strong> one per parameter using<br />

finite differences to approximate the gradient. This way<br />

<strong>of</strong> optimization seems to be prospective, for example, for<br />

estimating parameters <strong>of</strong> a real system or optimization<br />

<strong>of</strong> the battery itself, with much less effort and increased<br />

efficiency than by using direct methods.<br />

VII. ACKNOWLEDGMENT<br />

The authors gratefully acknowledge<br />

financial support from Climate- and<br />

Energy Fund “Klima- und Energiefonds”<br />

as part <strong>of</strong> the program New Energy 2020<br />

“NEUE ENERGIEN 2020” <strong>of</strong> the Federal<br />

Province <strong>of</strong> Styria/Austria for the project in which the<br />

above presented research results were achieved.<br />

REFERENCES<br />

[1] M. Doyle, T. F. Fuller, and J. Newman, “Modeling <strong>of</strong> galvanostatic<br />

charge and discharge <strong>of</strong> the lithium/polymer/insertion cell,” J<strong>of</strong><br />

The Electrochemical Society, vol. 140, no. 6, pp. 1526–1533,<br />

1993.<br />

[2] J. Newman and K. E. Thomas-Alyea, Electrochemical Systems,<br />

3rd ed. John Wiley & Sons, Inc., Hoboken, New Jersey, 2004.<br />

[3] P. M. Gomadam et al., “Mathematical modeling <strong>of</strong> lithium-ion<br />

and nickel battery systems,” J <strong>of</strong> Power Sources, vol. 110, no. 2,<br />

pp. 267–284, 2002.<br />

[4] S. Santhanagopalan et al., “Review <strong>of</strong> models for predicting the<br />

cycling performance <strong>of</strong> lithium ion batteries,” J <strong>of</strong> Power Sources,<br />

vol. 156, no. 2, pp. 620–628, 2006.<br />

[5] S. Santhanagopalan and R. E. White, “Online estimation <strong>of</strong> the<br />

state <strong>of</strong> charge <strong>of</strong> a lithium ion cell,” J <strong>of</strong> Power Sources, vol.<br />

161, no. 2, pp. 1346–1355, 2006.<br />

[6] S. Santhanagopalan, Q. Guo, and R. E. White, “Parameter estimation<br />

and model discrimination for a lithium-ion cell,” J <strong>of</strong> The<br />

Electrochemical Society, vol. 154, no. 3, pp. A198–A206, 2007.<br />

[7] S. Santhanagopalan et al., “Parameter estimation and life modeling<br />

<strong>of</strong> lithium-ion cells,” J <strong>of</strong> The Electrochemical Society, vol.<br />

155, no. 4, pp. A345–A353, 2008.<br />

[8] A. P. Schmidt et al., “Experiment-driven electrochemical modeling<br />

and systematic parameterization for a lithium-ion battery cell,”<br />

J <strong>of</strong> Power Sources, vol. 195, no. 15, pp. 5071–5080, 2010.<br />

[9] J. C. Forman et al., “Genetic identification and fisher identifiability<br />

analysis <strong>of</strong> the doyle-fuller-newman model from experimental<br />

cycling <strong>of</strong> a lifepo4 cell,” J <strong>of</strong> Power Sources, vol. 210, no. 0, pp.<br />

263–275, 2012.<br />

[10] J. Bandler et al., “Space mapping technique for electromagnetic<br />

optimization,” IEEE Trans. on Microwave Theory and Techniques,<br />

vol. 42, no. 12, pp. 2536–2544, dec 1994.<br />

[11] F. Pichler, “Anwendung der Finite-Elemente Methode auf ein<br />

Litium-Ionen Batterie Modell,” Master Thesis, <strong>University</strong> <strong>of</strong> <strong>Graz</strong>,<br />

2011.<br />

[12] I. J. Ong and J. Newman, “Double-layer capacitance in a dual<br />

lithium ion insertion cell,” J <strong>of</strong> The Electrochemical Society, vol.<br />

146, no. 12, pp. 4360–4365, 1999.<br />

[13] M. Cifrain et al., “Elektrochemisches Zellmodell,” publication<br />

planned.<br />

[14] C. G. Broyden, “A class <strong>of</strong> methods for solving nonlinear simultaneous<br />

equations,” Mathematics <strong>of</strong> Computation, vol. 19, no. 92,<br />

pp. 577–593, Oct. 1965.<br />

[15] M. H. Bakr et al., “An introduction to the space mapping<br />

technique,” Optimization and Engineering, vol. 2, no. 4, pp. 369–<br />

384, 2001.<br />

[16] O. Lass et al., “Space mapping techniques for a structural optimization<br />

problem governed by the p-Laplace equation,” Optimization<br />

Methods and S<strong>of</strong>tware, vol. 26, no. 4-5, pp. 617–642,<br />

2011.<br />

[17] E. Jones et al., “SciPy: Open source scientific tools for Python,”<br />

2001–. [Online]. Available: http://www.scipy.org/<br />

[18] MATLAB, version 7.12 (R2011a). Natick, Massachusetts: The<br />

MathWorks Inc., 2011.


- 374 - 15th IGTE Symposium 2012<br />

Large Scale Energy Storage with Redox Flow Batteries<br />

Piergiorgio Alotto, Massimo Guarnieri, Federico Moro and Andrea Stella<br />

Dipartimento di Ingegneria Industriale, Università di Padova, Via Gradenigo 6/a, 35131 Padova, Italy<br />

E-mail: name.surname@unipd.it<br />

Abstract— The expected expansion <strong>of</strong> renewable energy sources is calling for large and efficient energy storage systems.<br />

Electrochemical ones are considered the solution <strong>of</strong> choice in most cases, since they present unique features <strong>of</strong> localization<br />

flexibility, efficiency and scalability. Among them, Redox Flow Batteries (RFBs) exhibit remarkably high potential for several<br />

reasons, including power/energy independent sizing, high efficiency, room temperature operation and extremely long life.<br />

The most developed RFBs are the all-vanadium based ones (VRB), but other research programs are underway in many<br />

countries. They aim at substantial improvements which can lead to more compact systems, capable <strong>of</strong> taking the technology<br />

to a real breakthrough in stationary grid-connected applications, but which can also prove suitable for powering electric<br />

vehicles. This paper gives an overview <strong>of</strong> the RFB technology state-<strong>of</strong>-the-art, highlights its pros and cons, and indicates<br />

current research challenges.<br />

Index Terms— Electrochemical storage, redox flow batteries, vanadium flow batteries.<br />

I. INTRODUCTION<br />

Presently, renewable sources except hydroelectric,<br />

particularly solar and wind, provide roughly 4% <strong>of</strong> the<br />

worldwide electricity production, but they are expected<br />

to grow substantially in the near future (up to 26% by<br />

2030 [1]).<br />

In contrast with conventional electrical power plants,<br />

wind, solar, and other primary renewable sources are<br />

intermittent, because the generated electrical power<br />

depends on the time <strong>of</strong> the day and on the climatic<br />

availability <strong>of</strong> resources. The integration into the grid <strong>of</strong><br />

such primary energy sources, each with different<br />

peculiarities, requires their careful design and control.<br />

Furthermore, traditional grids have not been designed for<br />

such operational conditions so that they are not always<br />

able to work satisfactorily when many renewable-source<br />

generators are connected. In fact, recent studies suggest<br />

that grids can become unstable if such sources provide<br />

more than 20% <strong>of</strong> the whole generated power without<br />

adequate energy storage.<br />

Thus, future power grids provided with relevant<br />

amounts <strong>of</strong> renewable sources call for adequate energy<br />

storage systems capable <strong>of</strong> storing production surplus<br />

during some periods and <strong>of</strong> contributing to face higher<br />

demand during others, while at the same time<br />

contributing to stabilizing the grid operation. Applied in<br />

this way, energy storage systems will allow to<br />

substantially undersize the primary power plants with<br />

respect to peak demand, since relevant quantity <strong>of</strong> power<br />

will be provided by the storage systems.<br />

Two main different application scenarios can be<br />

identified: i) “peak shaving” and “sag compensation”<br />

refer to charge/discharge cycles on short timescales (secmin)<br />

and are needed for grid stabilization; ii) “load<br />

leveling” concerns charge/discharge cycles on longer<br />

timescales (hour) and allow to improve load factor <strong>of</strong> the<br />

grid.<br />

Several recent surveys show that electrochemical<br />

storage systems will be the solution <strong>of</strong> choice for<br />

complementing intermittent photovoltaic and wind<br />

generation facilities with long-time-scale energy storage.<br />

In fact, such storage technologies feature site versatility,<br />

modularity, scalability, ease <strong>of</strong> operation, and no moving<br />

parts [2]. Worldwide important funding programs have<br />

been established for the scientific and technological<br />

Fig. 1: Discharge times vs. power for mainstream energy<br />

storage technologies<br />

development <strong>of</strong> innovative electrochemical storage<br />

systems.<br />

Among them, Redox Flow Batteries (RFBs) are<br />

particularly promising They have emerged in recent<br />

years as a very promising solution for stationary<br />

applications, in combinations with renewable sources,<br />

for applications such as peak shaving, sag compensation,<br />

and load leveling [3,4,5]. The reason for their potential<br />

success depends on the fact that, with respect to<br />

competing technologies, they cover a very wide range <strong>of</strong><br />

discharge times (energy) and powers, as shown in Fig. 1.<br />

RFBs exploit reduction and oxidation (redox) reactions<br />

<strong>of</strong> ion metals (i.e. electrochemical species) solved in<br />

aqueous or nonaqueous liquids. The storage <strong>of</strong> these<br />

solutions is performed in external tanks, potentially <strong>of</strong><br />

very high capacity, and they are circulated into the RFB<br />

battery when power exchange is required. The main<br />

appealing features <strong>of</strong> RFBs are: scalability and<br />

flexibility, independent sizing <strong>of</strong> power and energy, high<br />

round-trip efficiency, high depth <strong>of</strong> discharge (DOD),<br />

long durability, fast response times, reduced<br />

environmental impact, and absence <strong>of</strong> expansive noblemetal<br />

based catalyzers.<br />

In the rest <strong>of</strong> this paper, the most important features <strong>of</strong><br />

RFBs will be presented together with an overview <strong>of</strong> the<br />

current state-<strong>of</strong>-the art <strong>of</strong> commercial systems.<br />

Furthermore, current research and development issues <strong>of</strong><br />

RFB systems will be highlighted.


II. RFB BASICS AND FEATURES<br />

A. RFB basics<br />

RFB cells operate on the basis <strong>of</strong> electrochemical<br />

reduction and oxidation reactions <strong>of</strong> two liquid<br />

electrolytes containing ionized metals [6]. One electrode<br />

performs the reduction half-reaction <strong>of</strong> one electrolyte<br />

that releases one electron and one ion while the other<br />

electrode performs an oxidation half-reaction that<br />

recombines them into the other electrolyte (Fig. 2). Ions<br />

can then migrate from one electrode to the other (from<br />

anode to cathode) through a membrane which can not be<br />

crossed by electrons, which are instead forced to pass<br />

through the external circuit thus exchanging electric<br />

energy. Typical RFB cells must operate at room<br />

temperature in order to keep the solutions in the liquid<br />

phase. This condition implies that the ion-conducting<br />

membrane between the two electrodes is a polymeric<br />

one. Both half-cells are connected to external storage<br />

tanks where the solutions are stored and circulated by<br />

means <strong>of</strong> two suitable pumps. In order to design such a<br />

storage system based on a RFB, expertise in<br />

electrochemistry, chemistry, chemical engineering,<br />

electrical engineering, power electronics, and control<br />

engineering are required, thus calling for highly<br />

multidisciplinary research and development teams.<br />

B. RFB features<br />

RFBs can be considered as a particular type <strong>of</strong> Fuel<br />

Cell (FC), since they can generate electrical power as<br />

long as they are fed with fuel (in this case the electrolyte<br />

solutions), and indeed the cell structure is very similar to<br />

the one <strong>of</strong> Polymer Electrolyte Membrane Fuel Cells<br />

(PEMFCs). Another similarity between RFBs and FCs is<br />

that energy is stored in components, the tanks, which are<br />

physically separated from the cells themselves, were<br />

power conversion takes place.<br />

Independent sizing <strong>of</strong> power and energy in typical<br />

RFB systems is therefore possible and this feature allows<br />

for virtually unlimited capacity simply by using larger<br />

storage tanks, without altering the battery itself or<br />

Fig. 2: Schematic <strong>of</strong> a RFB energy storage system: RFB<br />

stack and electrolyte tanks are separated<br />

- 375 - 15th IGTE Symposium 2012<br />

Fig. 3: Schematic <strong>of</strong> a typical RFB cell structure with<br />

MEA (membrane-electrode assembly) and flow-by<br />

solution distribution in bipolar plates with parallel-channel<br />

layout (gaskets are not shown)<br />

power conversion devices. Compared to other<br />

electrochemical systems that incorporate energy and<br />

power in a single device, RFBs are usually more<br />

advantageous when generation for 4-6 hours or more at<br />

maximum power is required. Furthermore, RFBs can<br />

also be fully charged or discharged and left in such<br />

conditions for long periods with no negative effects.<br />

RFB cells consist <strong>of</strong> a sandwiched structure composed<br />

<strong>of</strong> electrodes and interposed proton conducting<br />

membranes that are very similar to the Membrane<br />

Electrode Assembly (MEA) typical <strong>of</strong> PEMFCs (Fig. 3).<br />

The electrolyte solutions reach the electroactive sites<br />

within the electrodes by flowing through a porous<br />

structure consisting <strong>of</strong> materials such as carbon felt. In<br />

contrast with FC storage systems, which require a<br />

specific device, i.e. the electrolyzer, for converting<br />

electrical energy into hydrogen and oxygen, RFBs can<br />

perform both conversions, from electrical to chemical<br />

and from chemical to electrical, in a single device.<br />

A second advantage <strong>of</strong> RFBs with respect to FCs is that<br />

their fuels are not hazardous gases such as hydrogen and<br />

oxygen, but much less dangerous electrolyte solutions,<br />

which makes handling and storage much simpler and<br />

cheaper. As shown in Fig. 2, only two tanks and two<br />

pumps are required for these functions.<br />

Moreover, RFBs operate by changing the metal ion<br />

valence, but the ions themselves are not consumed. This<br />

feature allows extremely long cyclic service with very<br />

low maintenance.<br />

The optimal electrolyte temperatures are in a narrow<br />

range, roughly between 15°C and 35°C, and outside this<br />

range unwanted side effects such as solution<br />

precipitation may occur. On the other hand, the<br />

temperature <strong>of</strong> the battery can be controlled rather easily<br />

by appropriately regulating the electrolyte flow. The<br />

control RFBs is also relatively easy: in fact, the cell<br />

voltage allows to monitor easily the state <strong>of</strong> charge<br />

(SOC) <strong>of</strong> the battery and, at the same time, very deep<br />

discharges are possible because no damage occurs to the<br />

morphology <strong>of</strong> the cell with such operations.<br />

Furthermore, self-discharge is prevented by the<br />

separation <strong>of</strong> the two electrolytes in two different


circuits. The very fast reaction kinetics provides very<br />

fast response times and high overloading is tolerable on<br />

short time scales.<br />

On the other hand, at present even the most advanced<br />

RFBs have relatively low power and energy densities<br />

compared to other competing electrochemical storage<br />

technologies. Consequently, RFBs tend to have large<br />

active areas and ion conducting membranes and<br />

therefore their overall size is usually bulky, making them<br />

unsuitable for mobile applications. Also, the large active<br />

areas cause high transverse gradients <strong>of</strong> the solutions<br />

that feed the electrochemically active sites, particularly<br />

when operating at high power and with high flows. This<br />

causes the current density to be far from uniform on the<br />

active areas, with average values quite lower than the<br />

maximum ones.<br />

III. RFB TECHNOLOGIES<br />

A. Fe-Cr systems<br />

The first commercial RFBs were <strong>of</strong> the Fe-Cr type,<br />

featuring open circuit voltages <strong>of</strong> about 1 V for the<br />

single cell. Test systems in the range <strong>of</strong> 10-60 kW were<br />

produced in the late 1980s by several Japanese<br />

companies including: Mitsui Engineering and<br />

Shipbuilding Co. Ltd, Kansai Electric Power Co. Inc,<br />

and Sumitomo Electric Industries Ltd (SEI). Beside<br />

relatively low energy density, the main drawbacks <strong>of</strong><br />

such systems included: slow reaction <strong>of</strong> the Cr ion,<br />

membrane aging and cell degradation due to the mixing<br />

<strong>of</strong> the two ions. Due to these problems, Fe-Cr cells are<br />

inferior to VRBs, so that they were abandoned after the<br />

emerging <strong>of</strong> the latter.<br />

B. VRB systems<br />

VRBs (Vanadium Redox Batteries), also called allvanadium<br />

RFBs, are currently the most successful RFB<br />

technology, the only one that has reached substantial<br />

quite commercial maturity. Such systems use only<br />

vanadium, dissolved in aqueous sulfuric acid (~5 M). A<br />

positive feature with respect to other RFBs is that, since<br />

they use the same metal on each electrode, the electrodes<br />

and membrane are not cross-contaminated, preventing<br />

capacity decrease and providing longer lifetimes.<br />

Exploiting the ability <strong>of</strong> vanadium to exist in solution<br />

in four different oxidation states, vanadium II-III<br />

(bivalent-trivalent) is used at one electrode while<br />

vanadium IV-V (tetravalent-pentavalent) is used at the<br />

other one. The electrochemical half-reactions are:<br />

positiveelectrode<br />

VO 2+ charge<br />

+ H2O↽⇀ + + −<br />

VO2 + 2H + e<br />

discharge<br />

negativeelectrode<br />

V 3+ + e − charge<br />

↽ ⇀V 2+<br />

discharge<br />

- 376 - 15th IGTE Symposium 2012<br />

(1)<br />

Fig. 4: Polarization curve <strong>of</strong> a RFB<br />

During charge, tetravalent vanadium ions VO2+ are<br />

oxidized to pentavalent vanadium ions VO2 + at the<br />

positive electrode, while trivalent ions V3+ are reduced<br />

to bivalent ions V2+ at the negative electrode. The<br />

hydrogen ions 2H + created at the positive electrode flow<br />

to the negative one through the membrane thus<br />

maintaining the electrical neutrality <strong>of</strong> the electrolytes.<br />

The theoretical open circuit voltage (OCV) <strong>of</strong> VRB cell<br />

is Eo =1.26 V at 25°C, but, in fact, real cells exhibit<br />

Eo =1.4 V in operating conditions. The cell voltage v in<br />

load operation differs from Eo due to the activation<br />

overpotentials η <strong>of</strong> the electrodes which are modeled by<br />

the Butler-Volmer equation:<br />

c<br />

j = j r (0,t)<br />

o exp<br />

cr *<br />

αF<br />

RT η<br />

⎛ ⎞<br />

⎝<br />

⎜<br />

⎠<br />

⎟ − c ⎡<br />

p(0,t) ⎛ (1− α )F ⎞ ⎤<br />

⎢<br />

exp η<br />

cp * ⎝<br />

⎜<br />

RT ⎠<br />

⎟ ⎥<br />

⎣⎢<br />

⎦⎥<br />

where j is the current density at the electrode, jo is the<br />

exchange current density, ci are the species<br />

concentrations at the electrochemical activity sites <strong>of</strong> the<br />

reagents and products indicated in (1) (i = r regents,<br />

i = p products), α is the transfer coefficient (with a value<br />

around 0.5), F is the Faraday constant, R is the gas<br />

constant, and T is the absolute temperature. The ci/ci<br />

fractions express the dynamic reduction <strong>of</strong> the<br />

concentrations normalized to steady state equilibrium<br />

values.<br />

According to (2) v =Eo – η is higher than Eo in the<br />

charge phase, i.e. where the current density is j 0 when electric power<br />

is released. jo is a parameter which depends on the<br />

reactions and on the physical-chemical structure <strong>of</strong> the<br />

electrodes, and is crucial for the cell operation, since the<br />

higher jo the lower η for a given j. In fact, the activation<br />

overpotentials are the major culprits for the cell’s<br />

internal losses at lower current densities (with ci/ci≅1,<br />

Fig. 4). Thus, increasing jo by means <strong>of</strong> appropriate<br />

electrode designs allows to get performance<br />

improvements and higher round trip efficiency. jo can be<br />

increased with higher concentrations, lower activation<br />

(2)


Fig. 5: Schematic <strong>of</strong> a RFB stack with side fluid feedings –<br />

series <strong>of</strong> about 100 cells with active areas as large as<br />

0.4x0.4 m 2 are usual<br />

barriers (i.e. higher activity provided by efficient<br />

catalysts), and larger effective active areas, achievable<br />

by means <strong>of</strong> highly porous electrodes (e.g.<br />

nanostructured materials).<br />

At medium current densities internal losses mainly<br />

depend on the ion conducting membrane that separates<br />

the electrodes (Fig. 3). The material <strong>of</strong> choice is a<br />

perfluorosulfonic acid polymer that allows, if properly<br />

hydrated, the transport <strong>of</strong> ions by binding cations to its<br />

sulfonic acid sites. It is a rather expensive material<br />

patented by DuPont and commercially available under<br />

the name Nafion. Electrically the membrane behaves as<br />

a linear resistor, if temperature and hydration are kept<br />

constant.<br />

At higher current densities the losses are dominated by<br />

transport phenomena in the electrode diffusion layers,<br />

which dramatically reduce the concentrations (ci/ci


system so far, intended for smoothing power output<br />

fluctuations at the Subaru Wind Villa Power Plant which<br />

is rated at 30.6 MW, is a 4 MW / 6 MWh installation<br />

built by Sumitomo Electric Industries (SEI), Japan, for J-<br />

Power in 2005. The system consists <strong>of</strong> 4 banks, each <strong>of</strong><br />

24 stacks and rated at 1 MW (which can be overloaded<br />

up to a maximum <strong>of</strong> 1.5 MW). Individual stacks consist<br />

<strong>of</strong> 108 cells, with a rated power <strong>of</strong> 45 kW each. During<br />

over 3 years <strong>of</strong> operation, the system completed more<br />

than 270,000 complete cycles, thus demonstrating its<br />

reliability.<br />

The abovementioned SEI is one <strong>of</strong> the largest<br />

manufacturers <strong>of</strong> VRB systems for the smoothing and<br />

leveling <strong>of</strong> the fluctuating power generated by wind<br />

farms. Most <strong>of</strong> such systems have been built by SEI and<br />

later by VRB Power Inc., based in Vancouver, CA,<br />

which acquired SEI patents around 2005. In 2009, all<br />

vanadium redox battery assets <strong>of</strong> VRB Power Inc. where<br />

acquired by Prudent Energy, controlled by investors<br />

from China and the U.S.A., in a plan <strong>of</strong> business<br />

expansion in China and abroad. Further important efforts<br />

in the development <strong>of</strong> commercial RFB technologies in<br />

China are those <strong>of</strong> the Chengde Wanlitong Industrial<br />

Group. The reason <strong>of</strong> this interest lies in Chinese plans<br />

to expand the exploitation <strong>of</strong> intermittent renewable<br />

energy sources, especially wind. In fact, wind power<br />

production in the country is expected to rise from about<br />

20 GW in 2010 to 100 GW in 2015, and almost $50<br />

billion per year are expected to be invested in power grid<br />

improvements in the next decade to handle this<br />

increasing amount <strong>of</strong> energy production from<br />

intermittent sources.<br />

Significant developments are also taking place in other<br />

Asian countries, e.g. Cellennium Company Ltd. <strong>of</strong><br />

Thailand produces VRB systems under license, while<br />

Samsung Electronics Co. Ltd. in South Korea is engaged<br />

in the development <strong>of</strong> RFBs with nonaqueous<br />

electrolytes.<br />

Further interesting developments are taking place in<br />

Australia, where V-Fuel Pty Ltd is pursuing innovative<br />

V-Br technology in cooperation with the <strong>University</strong> <strong>of</strong><br />

New South Wales (UNSW). Other Australian companies<br />

working on RFBs, are ZBB Energy Corp. and Redflow<br />

Ltd., both involved in the development and installation<br />

<strong>of</strong> Zn/Br2 batteries.<br />

In the U.S., the Department <strong>of</strong> Energy (DoE) launched<br />

an RFB development program which identified Ashlawn<br />

Energy, LLC for the design <strong>of</strong> a 1 MW / 8 MWh VRB<br />

test plant while Primus Power Corp. was funded to<br />

develop a 25 MW / 75 MWh system based on Zn/Cl2<br />

RFBs. Premium Power Corp. is also developing Zn/Br2<br />

batteries.<br />

In Europe, Renewable Energy Dynamics (RED-T),<br />

Ireland, Cellstrom GmbH, Austria, and RE-Fuel<br />

<strong>Technology</strong> Ltd., UK, are some <strong>of</strong> the most active<br />

companies developing and producing VRB systems.<br />

High-energy density innovative RFBs are also being<br />

investigated in Germany, where the Fraunh<strong>of</strong>er-<br />

Gesellschaft is researching nonaqueous electrolytes, and<br />

in the UK where Plurion Ltd is working on Zn-Ce<br />

systems.<br />

Overall, since the market for smart grid technologies is<br />

expected to grow significantly worldwide in the near<br />

- 378 - 15th IGTE Symposium 2012<br />

future, the market for VRB systems, which is already<br />

starting to flourish, is also expected to expand<br />

vigorously.<br />

V. RESEARCH ISSUES<br />

In spite <strong>of</strong> the previously described initial commercial<br />

successes, RFB technology has yet to witness a complete<br />

technical and commercial breakthrough and substantial<br />

R&D programs are still required in order to fully unleash<br />

its industrial potential. The next generation <strong>of</strong> systems,<br />

expected within the next 5 years, will be even more<br />

economically competitive and will be able to provide the<br />

capital and lifecycle cost reductions that are essential for<br />

widespread commercial success.<br />

The basis for more compact and efficient systems,<br />

exhibiting higher power and energy densities will be<br />

provided by non-aqueous electrolytic solutions and by<br />

improved electrode activity. Improved electrolytes will<br />

also allow to expand the operation temperature range.<br />

For example, the nonaqueous 2MW/20MWh RFB<br />

system under development at the Fraunh<strong>of</strong>er Institute<br />

will consists <strong>of</strong> 8 blocks <strong>of</strong> 7 stacks, with 100-cell<br />

stacks, and will have an output <strong>of</strong> 2 kV, 1 kA, while<br />

being fed from 2x300 m3 tanks. Further improvements<br />

will come also from nanostructured electrodes, currently<br />

under development, with increased effective surface area<br />

and hence improved exchange current density.<br />

In next generation systems, the currently common and<br />

expensive Nafion ion conducting membrane will be<br />

substituted with alternative ones having significantly<br />

reduced cost and, at the same time, lower ohmic losses<br />

in the cells. Incidentally, further material cost reduction<br />

will also be provided by higher power densities, through<br />

more compact designs.<br />

Apart from the above mentioned developments which<br />

involve mainly materials science and basic chemistry,<br />

important engineering efforts are being aimed at system<br />

scale-up and at the structural and operational<br />

optimization <strong>of</strong> flow geometries, state-<strong>of</strong>-charge<br />

monitoring and supervisor systems. Numerical modeling<br />

and simulation are instrumental in improving the current<br />

systems which are far from optimal in many respects.<br />

Multi-scale, multidimensional, multi-physic, both<br />

steady-state and dynamic, models can accurately<br />

simulate the behavior <strong>of</strong> the whole system and its<br />

components and thus speedup the development <strong>of</strong> more<br />

efficient components and systems. Many modeling<br />

problems encountered in RFB systems are similar to<br />

those posed by direct alcohol fuel cells which also<br />

consist <strong>of</strong> the same basic building blocks (MEA-based<br />

cells, bipolar plates and stacks) and are also fed with<br />

liquid solutions instead <strong>of</strong> gases, so that some <strong>of</strong> the<br />

numerical tools developed in that context [10] may be<br />

adapted to the simulation <strong>of</strong> RFB systems. Sophisticated<br />

modeling tools are also aimed at designing advanced<br />

bipolar plates with either flow-by or flow-through<br />

diffusion <strong>of</strong> the electrolytic solutions, were the aim is to<br />

minimize transverse gradients and, at the same time, to<br />

reduce longitudinal conductance for lowering the shunt<br />

currents. Advanced computational techniques are needed<br />

to deal with the very challenging numerical problems<br />

arising from cell elements which exhibit multi-physic


material behavior and high aspect ratio geometries<br />

[11,12].<br />

In the area <strong>of</strong> controls engineering, advanced control<br />

systems will provide automatic electrolyte rebalancing<br />

and capacity correction and will possibly allow the<br />

remote operation <strong>of</strong> large RFB systems. Optimized<br />

electrolyte flow-rates will also minimize pumping<br />

energy requirements, which are one <strong>of</strong> the main factors<br />

affecting the overall efficiency (together with shunt<br />

currents and internal cell losses). Such control systems<br />

will eventually cope with the conflicting requirements<br />

arising from the strong dependence <strong>of</strong> the cell voltage<br />

vs. current polarization curve on the solution flow-rates.<br />

As far as the electrical interface <strong>of</strong> RFB systems is<br />

concerned, modeling, simulation and optimization are<br />

aimed at designing supervisor and control sub-systems<br />

with proper feed-back loops and reduced response times<br />

which are required to assure improved performance for<br />

peak shaving, sag compensation and load leveling in the<br />

smart-grid context. Flexible solutions for interfacing<br />

both the DC renewable energy sources and AC grid and<br />

load can be obtained with DC/DC converters coupled to<br />

inverters. Non-linear control techniques <strong>of</strong> the inverter<br />

can allow RFB systems to provide active as well as<br />

reactive power to the smart-grid connected loads. The<br />

success in designing such power management subsystems,<br />

including both the DC/DC converter and the<br />

inverter, strongly depends on the accuracy in modeling<br />

the various components and the whole system.<br />

Further research is also needed for optimizing the<br />

technological solutions from the economical (operating<br />

earning and savings arising from the RFBs operation)<br />

and environmental (primary energy savings, carbon<br />

dioxide emission reductions) point <strong>of</strong> view. The results<br />

<strong>of</strong> these analyses will allow assessing the viability <strong>of</strong><br />

RFB technologies within the context <strong>of</strong> modern energy<br />

hubs.<br />

All the above described scientific challenges raised by<br />

RFBs require strongly interdisciplinary development<br />

programs and collaborative efforts among researchers<br />

with different and complementary expertise. If such<br />

efforts are successful the next generation <strong>of</strong> RFB<br />

systems will be low cost, highly efficiency and durable,<br />

and thus be suitable for large-scale industrial<br />

exploitation, overcoming the limitations <strong>of</strong> more<br />

conventional systems.<br />

Finally, more compact and more flexible RFB systems,<br />

such as the ones mentioned above, may one day become<br />

suitable for powering some classes <strong>of</strong> electric vehicles.<br />

VI. CONCLUSIONS<br />

Redox flow batteries (RFBs) are already a promising<br />

energy storage technology and first generation systems,<br />

based on all-vanadium solutions, have already been<br />

successfully demonstrated in test installations<br />

worldwide, and their commercial exploitation is<br />

undergoing. The next generation <strong>of</strong> systems, with<br />

increased power and energy densities, are currently<br />

under development, but further progresses in<br />

electrochemical materials and system engineering are<br />

expected to produce the final technical and commercial<br />

breakthrough. RFB systems are expected to become a<br />

- 379 - 15th IGTE Symposium 2012<br />

key technology for stationary smart-grid-oriented<br />

applications supporting the load leveling and peak<br />

shaving <strong>of</strong> intermittent renewable energy sources. Future<br />

high-density systems may also become suitable for some<br />

automotive applications.<br />

REFERENCES<br />

[1] European Commission, “Proposal for a COUNCIL<br />

DECISION<br />

establishing the Specific Programme Implementing Horizon<br />

2020 - The Framework Programme for Research and<br />

Innovation (2014-2020), COM(2011) 811 final, 2011/0402<br />

(CNS).<br />

[2] B. Dunn, H. Kamath, J.Tarascon, “Electrical Energy<br />

Storage for the Grid: A Battery <strong>of</strong> Choices”, Science, 334, pp.<br />

928-935, 2011.<br />

[3] Z. Weber, M. M. Mench, J. P. Meyers, P. N. Ross, J. T.<br />

Gostick, Q. Liu, “Redox flow batteries: a review”, J. Appl.<br />

Electrochem. 41, pp. 1137-1164, 2011.<br />

[4] T. Shigematsu, “Redox Flow Batteries for Energy Storage”,<br />

SEI Technical Review, 73, pp. 4-13, 2011.<br />

[5] C. Ponce de León, A. Frías-Ferrer, J. González-García,<br />

D.A. Szánto, F. C. Walsh, “Redox flow cells for energy<br />

conversions”, J. Power Sources, 160, pp. 716-732, 2006.<br />

[6] C. Menictas, M. Skyllas-Kazacos, “Performance <strong>of</strong><br />

vanadium-oxigen redox fuel cell”, J. Appl. Electrochem., 41,<br />

pp. 1223-1232, 2011.<br />

[7] M. Skyllas-Kazacos, G. Kazacos, G. Poon, H. Verseema,<br />

“Recent advances with UNSW vanadium-based redox flow<br />

batteries”, Int. J. Energ. Res., 34, pp. 182-189, 2010.<br />

[8] Kaneko H, Negishi A, Nozaki K, Sato K, Nakajima M<br />

(1992) Redox battery. US Patent 5318865.<br />

[9] C. Menictas, M. Skyllas-Kazacos, “Vanadium-oxygen<br />

redox fuel cell”, Final report. SERDF Grant, NSW Department<br />

<strong>of</strong> Energy, 1997.<br />

[10] V. Di Noto, M. Guarnieri, F. Moro: “A Dynamic Circuit<br />

Model <strong>of</strong> a Small Direct Methanol Fuel Cell for Portable<br />

Electronic Devices”, IEEE Tran.s Ind. Electronics, Vol. 57, N.<br />

6, pp. 1865-1873, 2010.<br />

[11] P. Alotto, M. Guarnieri, F. Moro, A. Stella: “A Proper<br />

Generalized Decomposition Approach for Fuel Cell Polymeric<br />

Membrane Modelling”, IEEE Trans. Mag., Vol. 47 No. 5, pp.<br />

1462-1465, 2011.<br />

[12] P. Alotto, M. Guarnieri, F. Moro, A. Stella: “Multi-physic<br />

3D dynamic modelling <strong>of</strong> polymer membranes with a proper<br />

generalized decomposition model reduction approach”,<br />

Electrochimica Acta, pp. 250-256, 2011.<br />

[1]


- 380 - 15th IGTE Symposium 2012<br />

Model Order Reduction via Proper Orthogonal<br />

Decomposition for a Lithium-Ion Cell<br />

B. Suhr∗ , J. Rubeˇsa∗ ∗Kompetenzzentrum - Das Virtuelle Fahrzeug Forschunggesellschaft mbH (VIF), <strong>Graz</strong>, Austria<br />

E-mail: bettina.suhr@v2c2.at<br />

Abstract—The simulation <strong>of</strong> lithium-ion batteries is a challenging research topic. Since there are many electrochemical<br />

processes involved in dis-/charging, models which aim to include these processes are in general complex and therefore slow.<br />

For many tasks, e.g. in optimization, a repeated solution <strong>of</strong> a model is necessary. In this paper a speed up in simulations,<br />

with acceptable error in results, is obtained by combining proper orthogonal decomposition with empirical interpolation<br />

method. We report a speed up factor between 10 and 15.<br />

Index Terms—electrochemical model, empirical interpolation method, model reduction, proper orthogonal decomposition<br />

I. INTRODUCTION<br />

The accurate and fast simulation <strong>of</strong> lithium-ion batteries<br />

is <strong>of</strong> a growing interest in the automotive industry.<br />

As fossil fuels are limited, more and more research is<br />

conducted on electric, especially on hybrid cars. Here, the<br />

quality and the speed <strong>of</strong> the battery simulation is a crucial<br />

point. Often battery models are simplified strongly, in<br />

a physical meaning, to obey the need for speed <strong>of</strong> on<br />

board usage or optimization purposes. In contrary, here a<br />

speed up in simulation will be gained by using model<br />

reduction via proper orthogonal decomposition (POD)<br />

combined with a fast evaluation <strong>of</strong> nonlinearities, the<br />

empirical interpolation method (EIM).<br />

Cai and White in [4] applied POD method to a battery<br />

model, but the mayor nonlinearity <strong>of</strong> the system was<br />

assumed to be constant. Starting from the full model and<br />

very fine discretization in space, a speed up factor <strong>of</strong> 4<br />

was obtained. In their work a comparison between full<br />

and reduced for only constant discharge simulations were<br />

done.<br />

We follow the work introduced by Lass and Volkwein<br />

in [7] where POD and EIM were applied to the battery<br />

model <strong>of</strong> Wu-Xu-Zou [10]. We use this procedure and<br />

apply it on more general but more complex battery model<br />

<strong>of</strong> Cifrain [5]. In our work, as in the work <strong>of</strong> Lass,<br />

no simplifications <strong>of</strong> the battery model, as mentioned<br />

previously, are necessary and a speed up factor <strong>of</strong> 15<br />

was obtained for constant discharge simulations.<br />

The paper is organized in the following manner: In<br />

Section II the nonlinear parabolic dynamical system that<br />

describes considered battery model is formulated. Section<br />

III is devoted to the reduced order model (ROM) utilizing<br />

proper orthogonal decomposition (POD) method.<br />

We describe the method <strong>of</strong> POD in general and its<br />

application to the battery system. Moreover, the empirical<br />

interpolation is introduced. In Section IV numerical<br />

results are presented. Finally, in Section V conclusions<br />

are drawn and an outlook on future work is given.<br />

II. BATTERY MODEL<br />

The battery cell consists <strong>of</strong> two electrodes, an anode<br />

and a cathode, and a separator between them. Each<br />

electrode consists <strong>of</strong> particles and an electrolyte, while<br />

in the separator we consider only the electrolyte.<br />

The mathematical model described in [5] and used<br />

here is an electrochemical model similar to the well<br />

known model <strong>of</strong> Newman [6]. It is a coupled dynamical<br />

system <strong>of</strong> four nonlinear partial differential equations.<br />

The system variables are potentials and concentrations<br />

for the electrolyte, φl,cl, for the cathode, φsc,csc, and<br />

for the anode φsa,csa. All state variables describing the<br />

potential and the liquid concentration variable are one<br />

dimensional system variables. Those four variables are<br />

time t ∈ [0,T], T ∈ R, and space x ∈ Ω dependent,<br />

where Ω ⊂ R. Two remaining variables, the variables<br />

for the solid concentration are two dimensional variable,<br />

i.e., cs := cs(x, r, t) ∈ Λ × (0,T) where Λ ⊂ R 2 . Such<br />

model is also referred to as the pseudo-two-dimensional<br />

model; see Figure 1. Considered battery model is given<br />

in the following way:<br />

∂ (φs − φl)<br />

CDSAi<br />

−∇·(σs∇φs) =−AiθjBV in Ω<br />

∂t<br />

′<br />

∂ (φs − φl)<br />

−CDSAi<br />

−∇ ·<br />

RT<br />

zF<br />

∂ (ɛlcl)<br />

∂t<br />

∂t<br />

κl(cl)t +<br />

l<br />

<br />

−∇ · Dl ∇cl + zF<br />

∂cs<br />

∂t<br />

= 1<br />

r 2<br />

−∇·(κL(cl)∇φl)+<br />

(1a)<br />

<br />

1<br />

∇cl = AiθjBV in Ω (1b)<br />

cl<br />

∂ (φs − φl)<br />

− CDSAi<br />

+<br />

∂t<br />

RT μlcl∇φl<br />

<br />

= Aiθ<br />

F jBV in Ω (1c)<br />

<br />

∂<br />

Dsr<br />

∂r<br />

2 fRK(cs) ∂cs<br />

<br />

in Λ (1d)<br />

∂r<br />

strongly coupled with<br />

⎧<br />

⎪⎨<br />

⎪⎩<br />

<br />

αAzF(φs−φ<br />

zFkθ exp<br />

l−UOCV (cs))<br />

+<br />

RT<br />

<br />

−(1−αK )zF(φs−φ<br />

−zFkθ exp<br />

l−UOCV (cs))<br />

RT<br />

in Ω ′<br />

0 in Ωs<br />

jBV =


where the spatial sub-domains are introduced<br />

as Ω = Ωc∪ Ωs ∪ Ωa, Ω ′ Λa<br />

= Ωc∪ Ωa,<br />

=Ωa × [0,Ra] ⊂ R2 , Λc =Ωc × [0,Rc] ⊂ R2 ,<br />

Λ=Λa∪ Λc and Ra,Rc ∈ R. See Section VI for a<br />

list <strong>of</strong> symbols. The system is initialized with constant<br />

values which correspond to an equilibrium state, i.e.,<br />

jBV =0. The boundary conditions are all homogeneous<br />

Neumann conditions at the external boundaries and<br />

continuous flux conditions at internal boundaries<br />

(between electrodes and separator). Exceptions are<br />

the solid potential where either a current is specified<br />

−σS ∂φS<br />

<br />

<br />

∂x = − Γa<br />

I(t)<br />

<br />

or a voltage φS<br />

= U(t). Also<br />

Acell Γa<br />

the solid concentration has a non zero boundary condition<br />

DSfRK(cS) ∂cS<br />

<br />

∂r = − j∗ BV 1 ∂(φS−φL)<br />

F − F<br />

CDS ∂t .<br />

r=Rp<br />

The nonlinear initial value problem (1) is discretized<br />

in space with the Finite Element (FE) method, in time<br />

using the implicit Euler method and linearized with the<br />

damped Newton method. The details on numerics, initial<br />

values and boundary conditions one can find in [8]. After<br />

obtaining the numerical solution <strong>of</strong> the model, the need<br />

for model reduction and speed up has occurred.<br />

<br />

<br />

<br />

Fig. 1. Pseudo-two-dimensional model.<br />

III. MODEL REDUCTION<br />

The reduction <strong>of</strong> linear dynamical systems is a classical<br />

research topic and there exist several well known<br />

methods for this task, e.g. balanced truncation, Krylov<br />

methods, reduced basis and POD. Detailed information<br />

one can found in the standard literature or in [1], [3]<br />

and [9]. A relatively new area in research is the model<br />

reduction <strong>of</strong> nonlinear dynamical systems <strong>of</strong> equations.<br />

A. The POD method.<br />

POD is a commonly used model order reduction technique,<br />

when repeated simulations are to be conducted.<br />

Relevant information is extracted from snapshots generated<br />

with the full model and saved into the POD basis.<br />

Using the POD basis a ROM will be build and for all<br />

further simulations the ROM is used.<br />

Computation <strong>of</strong> the POD basis: The solutions yj ∈<br />

Rm , j =1,...,n, <strong>of</strong> the full system we refer to as<br />

snapshots and we define the following:<br />

Y := [y 1 , ..., y n ] ∈ R m×n .<br />

<br />

- 381 - 15th IGTE Symposium 2012<br />

The goal <strong>of</strong> the POD is to find l ≤ d = dim span (Y ) ≤ n<br />

orthonormal vectors {Ψi} l i=1 in Rm that minimize the<br />

cost function and does the best approximation <strong>of</strong> Y , i.e.,<br />

J(Ψ1,...,Ψl) =<br />

n<br />

αjyj −<br />

j=1<br />

l<br />

〈yj, Ψi〉XΨi 2 , (2)<br />

i=1<br />

where x = √ x T x is the Euclidean norm, αj ≥ 0 are<br />

the weights and 〈x, y〉 L 2 is the inner product. Further,<br />

from the Lagrange functional:<br />

L(Ψ1,...,Ψl,λ11,...,λll) =J(Ψ1,...,Ψl)+<br />

l<br />

+ λij(〈Ψi, Ψj〉X − δij) ,<br />

i,j=1<br />

with the Kronecker symbol δij, we obtain two necessary<br />

optimality conditions:<br />

n<br />

1) αjyj〈yj, Ψi〉X = λiiΨi ,λij =0for i = j<br />

j=1<br />

2) 〈Ψi, Ψj〉X = δij ,i,j=1,...,l.<br />

The second condition is satisfied under the assumption<br />

that the vectors {Ψi} l i=1 are orthonormal and the first<br />

condition is equivalent with<br />

YBY T MΨi = λiΨi , i =1,...,l, (3)<br />

where M ∈ Rm×m is the mass matrix from the FE<br />

simulation, λi = λii, B := diag(α1,...,αn), α1 = δt1<br />

2 ,<br />

αj = δtj+δtj−1<br />

2 ,j=2,...,n− 1 and αn = δtn<br />

2 . There<br />

exist more possibilities to solve (3) and we will consider<br />

the following approach.<br />

a) K-Ansatz (Covariance matrix): By setting ¯ Y :=<br />

M 1<br />

2 YB 1<br />

2 we can define the symmetric matrix K, i.e.,<br />

K := ¯ Y T ¯ Y = B 1<br />

2 Y T MYB 1<br />

2 ∈ R n×n .<br />

After the application <strong>of</strong> the eigenvalue decomposition on<br />

K we obtain the following relation:<br />

Kvj = λjvj , j =1,...,n.<br />

The n eigenvalues <strong>of</strong> the matrix K we denote by λj<br />

and the eigenvectors by vj. We sort the eigenvalues in<br />

decreasing order, i.e., λ1 ≥ λ2 ≥ ... ≥ λl ≥ ... ≥ λn,<br />

and we cut the first l eigenvalues such that following<br />

holds:<br />

n<br />

λj ≤ tolerance ,<br />

j=l+1<br />

since the error <strong>of</strong> the cost function (2) can be calculated<br />

by J(Ψ1,...,Ψl) = n j=l+1 λj. Hence, the choice <strong>of</strong><br />

l strictly depends on the eigenvalues λj, j =1,...,n<br />

and how fast same decay. In Figure 2, the computed<br />

eigenvalues are shown which are scaled by trace <strong>of</strong> matrix<br />

K.<br />

The POD basis consists <strong>of</strong> l functions:<br />

Ψi(x) = 1<br />

n<br />

√ α<br />

λi<br />

1<br />

2<br />

j (¯vi)jy j (x) ∈ R m , i =1,...,l.<br />

j=1


Fig. 2. Example plot <strong>of</strong> decaying eigenvalues.<br />

Denoting the FE basis functions as ϕ, the snapshots<br />

yj have the standard Galerkin representation: yj <br />

=<br />

m<br />

k=1 yj<br />

kϕ(x). This directly yields:<br />

m<br />

Ψi(x) = ψ i kϕ(x), ψ i k = 1<br />

n m<br />

√ α<br />

λi<br />

1<br />

2 j<br />

j (¯vi)jy k .<br />

k=1<br />

j=1 k=1<br />

For later use we define (Ψi)k := ψ i k , Ψi ∈ R m ,i =<br />

1,...,l, and the matrix ˆ Ψ:=[Ψ1, ..., Ψl] ∈ R m×l .<br />

B. Application <strong>of</strong> POD to the battery model.<br />

The POD basis for the battery model is calculated<br />

for each variable separately. The snapshots <strong>of</strong> the full<br />

FE solutions are split up in the data for each <strong>of</strong> the<br />

six variables φl,cl,φsc,φsa,csc,csa and the six matrices<br />

<strong>of</strong> snapshots, Yi ∈ RNi×n , i =1,...,6, are obtained.<br />

Here n is the number <strong>of</strong> time steps and Ni is the<br />

number <strong>of</strong> degrees <strong>of</strong> freedom in the FE Ansatz for the<br />

corresponding variable, i.e., m = 6 i=1 Ni. With these<br />

prerequisites the computation <strong>of</strong> the POD basis for each<br />

<strong>of</strong> the six variables can be carried out, as described above.<br />

The differences in the dimension <strong>of</strong> the domains <strong>of</strong> the<br />

six variables, cause no further changes.<br />

Before we describe how the reduced order model can<br />

be build we need to give a quick overview on how the<br />

full FE system is solved and to introduce some notations.<br />

As the FE method is well known we will not give details<br />

about how one can transform the battery system (1) in<br />

its weak form and apply the Galerkin method on it.<br />

The system, which is discretized in space, is still<br />

continuous in time and has the following form:<br />

M ˙u + K(u)u = f(u) , (4)<br />

where u := [φl,cl,φsc,φsa,csc,csa] T , M is the mass<br />

matrix, K(u) is the stiffness matrix and f(u) is the right<br />

hand side <strong>of</strong> the system. Please note that we solve all<br />

six equations simultaneously, therefore all matrices are<br />

block matrices. For time discretization the implicit Euler<br />

method is applied on (4) and we use<br />

<br />

1<br />

M + K(uk+1) uk+1 − f(uk+1) =<br />

Δt 1<br />

Δt Muk , (5)<br />

- 382 - 15th IGTE Symposium 2012<br />

in every time step k = 1, 2, ..., n. As this system is<br />

fully discretized but still nonlinear, we apply now the<br />

Newton method in order to linearize the system (5). After<br />

introducing the following notation:<br />

<br />

1<br />

A(uk+1) := M + K(uk+1)<br />

Δt<br />

we can rewrite equation (5) and define the operator G as<br />

and F := 1<br />

Δt Muk ,<br />

G(uk+1) :=A(uk+1)uk+1 − f(uk+1) − F =0. (6)<br />

Applying the Newton method we end up calculating the<br />

Newton step<br />

Δu = −[JG(u i k+1)] −1 G(u i k+1) , (7)<br />

in every time step k until the predefined stopping criterion<br />

is met for the sequence {ui k }∞i=0 , ui+1<br />

k+1 = ui k+1 +Δu.<br />

The derivative JG is calculated using the derivatives <strong>of</strong><br />

K and f such that<br />

JG(uk+1) = 1<br />

Δt M + JK(uk+1) − Jf(uk+1). (8)<br />

1) Reduced order model for the battery: Next step<br />

is to build the reduced order model using the POD<br />

basis. We point out that it is necessary to formulate<br />

non homogeneous Dirichlet boundary conditions with a<br />

penalty method.<br />

To obtain the reduced order model the Galerkin method<br />

is used with the POD basis functions, instead <strong>of</strong> the FE<br />

basis functions as before. We denote our calculated POD<br />

basis for each <strong>of</strong> the six variables with:<br />

ˆΨi ∈ R Ni×li 6<br />

, i =1,...,6 , lPOD = li.<br />

i=1<br />

In the case <strong>of</strong> one single equation the mass matrix M <strong>of</strong><br />

the full system and the mass matrix ˆ M <strong>of</strong> the reduced<br />

system are connected via the POD basis ˆ Ψ as follows:<br />

ˆM = ˆ Ψ T M ˆ Ψ , M ∈ R m×m , Mˆ l×l<br />

∈ R . (9)<br />

This can be adapted to a system <strong>of</strong> equations using the<br />

block structure <strong>of</strong> the mass matrix. The mass matrix<br />

M ∈ R m×m in (4) is a block matrix with the following<br />

structure: Mij ∈ R Ni×Nj , i,j =1,...,6. The corresponding<br />

matrix ˆ M ∈ R lPOD×lPOD , in the reduced order<br />

model, can be obtained as follows:<br />

ˆMij = ˆ Ψ T i Mij ˆ Ψj , i,j =1,...,6 ,<br />

where ˆ Mij ∈ R li×lj for i, j =1,...,6. For the right<br />

hand side in (5) it holds analogously:<br />

ˆFi = ˆ Ψ T i Fi ∈ R li , i,j =1,...,6 ,<br />

where Fi ∈ R Ni , F =[F1, ..., F6] T ∈ R m and ˆ F =<br />

[ ˆ F1, ..., ˆ F6] T ∈ R lPOD . To improve readability we use<br />

from now on L(M) := ˆ M and L(F ):= ˆ F .<br />

There are many hidden nonlinearities in equations (7)<br />

which have to be evaluated for each time and Newton<br />

step in the full FE dimension m. This can be accessed<br />

via the relation (uk+1)i = ˆ Ψi (ûk+1)i.


The corresponding equations to (6) and (8), for the<br />

reduced order model, are<br />

L(G(ûk+1)) :=L(A( ˆ Ψûk+1))ûk+1 −L(f( ˆ Ψûk+1))<br />

−L(F ) (10a)<br />

L(JG(ûk+1)) := 1<br />

Δt L(M)+L(JK( ˆ Ψûk+1))<br />

−L(Jf( ˆ Ψûk+1)) (10b)<br />

then the Newton step and the iterations are defined by<br />

Δû = −[L(JG(ûk+1))] −1 L(G(ûk+1)) , (11a)<br />

ûk+1 = Luk +Δû. (11b)<br />

C. Application <strong>of</strong> EIM<br />

The evaluation <strong>of</strong> nonlinearities in (10) is very slow.<br />

EIM can be used to reduce number <strong>of</strong> necessary evaluations<br />

<strong>of</strong> these nonlinearities, see [2] for details on<br />

the method. Let g be a given nonlinear function, which<br />

can be evaluated point wise. Needed are snapshots <strong>of</strong><br />

this function with fast decaying eigenvalues. For the<br />

basis construction a greedy algorithm for the biggest<br />

residual is used, which stops at a given tolerance. The<br />

returned basis, Υ=[υ1,...,υlEIM ], has lEIM entries<br />

and also the indices for the evaluation <strong>of</strong> the nonlinearity<br />

p =[p1,...,plEIM ] are given. The nonlinear function g<br />

is approximated as follows:<br />

g(y) =Υc(y) =<br />

lEIM <br />

i=1<br />

υici(y). (12)<br />

To compute the coefficients c the above equality can be<br />

transformed to:<br />

P T Υc(y) =P T g(y) ⇔ c(y) =(P T Υ) −1 P T g(y) (13)<br />

with the matrix P =[ep1,...,epl ] where epi ∈ Rm<br />

EIM<br />

is the pith unit vector.<br />

Since the singular values <strong>of</strong> the nonlinearity on the<br />

right hand side <strong>of</strong> the system decay slowly, the use <strong>of</strong><br />

EIM is not useful here. Luckily, the eigenvalues on the<br />

left hand side <strong>of</strong> the system decay fast and EIM can be<br />

used. This substitutes the matrix assembly.<br />

In equation (10) the nonlinearities are part <strong>of</strong> the<br />

stiffness matrix K and its derivative JK. As both cases<br />

are handled analogously we explain the case for the<br />

stiffness matrix only. The nonlinearities in the different<br />

blocks <strong>of</strong> the stiffness matrix are dealt separately and<br />

therefore we drop one pair <strong>of</strong> indices and denote the<br />

entries <strong>of</strong> an arbitrary block <strong>of</strong> the stiffness matrix as<br />

<br />

Kij = g(y)∇ϕi∇ϕjdx, i, j =1,...,m, (14)<br />

Ω<br />

where g(y) is the nonlinear function to be approximated.<br />

When we now insert equation (12) in (14) and combine<br />

this with the reduced order stiffness matrix ˆ K = ˆ Ψ T K ˆ Ψ<br />

- 383 - 15th IGTE Symposium 2012<br />

we obtain:<br />

ˆK = ˆ Ψ T<br />

=<br />

lEIM<br />

<br />

lEIM <br />

k=1<br />

k=1<br />

<br />

<br />

ck(y) υk∇ϕi∇ϕjdx<br />

Ω<br />

ck(y) ˆ Ψ T<br />

<br />

<br />

υk∇ϕi∇ϕjdx ˆΨ . (15)<br />

<br />

Ω<br />

<br />

ij<br />

<br />

The underlined expressions can be precomputed, these<br />

are lEIM matrices <strong>of</strong> dimension lPOD×lPOD. Instead <strong>of</strong><br />

the assembly <strong>of</strong> the stiffness matrix the coefficients c have<br />

to be calculated using (13), this includes lEIM function<br />

evaluations. These coefficients are then used in the above<br />

formula where a linear combination <strong>of</strong> the precomputed<br />

matrices is calculated. When lEIM is small a massive<br />

speed up can be gained by this method.<br />

IV. RESULTS<br />

In general, our goal is to replace a slow FE simulation<br />

with a fast ROM simulation by allowing a small relative<br />

error between those two simulations. In this section we<br />

will compare the FE solutions and the ROM solutions<br />

for accuracy and speed archived by three different approaches.<br />

We will start with one very simple example.<br />

A. Simple simulation <strong>of</strong> battery.<br />

We consider a simulation where the battery is charged,<br />

discharged and charged again between 3.8V and 2.5V (all<br />

three times with 0.2C); see the solid blue line in Figure 3.<br />

To solve the full model the following spatial discretization,<br />

given in degrees <strong>of</strong> freedom (DOFs), for the single<br />

variables is used:<br />

φl φsc φsa cl csc csa total<br />

241 101 101 241 5151 5151 10986<br />

The simulated time is 61586s and 1451 time steps are<br />

used due to the adaptive time stepping algorithm. The<br />

solutions <strong>of</strong> the full FE system is plotted in Figure 4.<br />

These solutions were taken as snapshots for the POD<br />

basis computation for each <strong>of</strong> the six variables. When we<br />

cut the scaled eigenvalues, at the tolerance <strong>of</strong> 10−8 , the<br />

number <strong>of</strong> POD basis functions for the single variables<br />

is as follows:<br />

Fig. 3. Simple simulation <strong>of</strong> battery.<br />

ij<br />

ˆΨ


Fig. 4. 3d plots <strong>of</strong> full FE solution for first example.<br />

φl φsc φsa cl csc csa total<br />

5 3 3 3 14 12 42<br />

The total dimension <strong>of</strong> the ROM is therefore 42. Also<br />

EIM bases for the different nonlinearities are computed.<br />

The ROM is then solved using these POD and EIM bases.<br />

For a comparison between the FE and the ROM solution<br />

we consider the L2-error. For voltage and current the<br />

absolute error is 6.6 · 10−06 and 1.0 · 10−04 , respectively.<br />

A difference between the voltage <strong>of</strong> the FE solution and<br />

the ROM solution is shown with asterisk line in Figure 3.<br />

Also we consider the relative error <strong>of</strong> each variable which<br />

is defined for φl as:<br />

eφl :=<br />

<br />

<br />

<br />

<br />

<br />

||φFE l − φROM<br />

l || L2 (Ω)<br />

||φFE l || L2 (Ω)<br />

<br />

<br />

<br />

<br />

<br />

L 2 (0,T )<br />

, (16)<br />

and for all other variables analogously. The resulting<br />

relative errors for each variable are:<br />

φl φsc φsa<br />

2.2 · 10 −6<br />

1.3 · 10 −6<br />

cl csc csa<br />

1.7 · 10 −7<br />

4.2 · 10 −6<br />

2.6 · 10 −5<br />

5.5 · 10 −6<br />

All <strong>of</strong> these relative errors are acceptably small. Now<br />

that we checked the accuracy <strong>of</strong> the ROM solution we<br />

are interested in the speed <strong>of</strong> the computation. The full<br />

FE solution took 2828s in CPU time while the ROM<br />

solution lasted 189s, which means that a speed up <strong>of</strong><br />

factor 15 was achieved.<br />

B. More general POD and EIM bases.<br />

We need POD and EIM bases which are more general<br />

and applicable for a broader range <strong>of</strong> use cases, concerning<br />

the battery behavior. This is achieved this by building<br />

one basis out <strong>of</strong> a set <strong>of</strong> similar simulations.<br />

In battery simulation a natural choice for this set <strong>of</strong><br />

simulations is the discharge <strong>of</strong> the battery at different<br />

C-rates. To be more precise: we discharge the battery<br />

from 3.8V until 2.5V at the C-rates 0.1, 0.2, 0.5, 1,<br />

2, 3, 4 and 5C. From these set <strong>of</strong> snapshots we build<br />

the POD and EIM bases by using again the tolerance <strong>of</strong><br />

10−8 . The ROM is <strong>of</strong> a dimension <strong>of</strong> 85, compared to<br />

the full dimension <strong>of</strong> 10986 DOFs. To make sure that<br />

- 384 - 15th IGTE Symposium 2012<br />

Fig. 5. Voltage <strong>of</strong> simulations at different C-rates.<br />

Fig. 6. Voltage <strong>of</strong> simulations not used for POD basis calculation.<br />

the dynamics <strong>of</strong> the different C-rates are captured well,<br />

we repeat all discharge simulations used for building<br />

the basis with the ROM. In Figure 5 we plotted the<br />

calculated voltage <strong>of</strong> the full FE solution and the ROM<br />

solution using the new POD and EIM bases. We detected<br />

a extremely small L 2 relative error and we conclude that<br />

the results are in good agreement for all used C-rates.<br />

Next we want to find out whether C-rates not used<br />

for the basis computation, are also simulated well by the<br />

ROM. For this purpose we simulate the C-rates: 0.15,<br />

0.3 and 3.5 with the full FE system and the ROM. The<br />

results are in good accordance as can be seen in Figure 6.<br />

We conclude that all C-rates between 0.1 and 5 can be<br />

simulated well with the ROM using the improved, more<br />

general, basis.<br />

C. POD basis switching.<br />

With the single POD basis we can simulate well<br />

different discharges in the 0.1 - 5 C-rate range and now<br />

we want to use the ROM for a simulation that involves<br />

charge and discharge processes. We have the set <strong>of</strong> POD<br />

and EIM bases for a discharge. Equivalently using the<br />

same C-rates, we calculate the POD and the EIM basis for<br />

the charge processes. Using the same tolerance, 10−8 ,we


Fig. 7. Voltage <strong>of</strong> simulations where bases for ROM computation<br />

were switched.<br />

obtain 110 POD basis functions. In the ROM simulation<br />

we can then switch between these two sets <strong>of</strong> bases.<br />

This is demonstrated through the simulation <strong>of</strong><br />

charge/discharge processes were charge and discharge are<br />

simulated at C-rates not included in the bases computation.<br />

The voltage plot <strong>of</strong> the full FE solution and the<br />

ROM solution can be seen in Figure 7.<br />

The voltage curves are in good accordance, only at the<br />

switching points there occurs an error. Both the discharge<br />

and the charge POD basis span a subspace in the space <strong>of</strong><br />

all possible solutions. When switching between discharge<br />

to the charge basis (or vice versa) the ROM solution is<br />

projected from one subspace to the other and an error<br />

occurs.<br />

Below the relative errors for each variable are given,<br />

see (16) for the definition <strong>of</strong> the error norm.<br />

φl φsc φsa<br />

1.4 · 10 −5<br />

3.4 · 10 −6<br />

cl csc csa<br />

4.1 · 10 −6<br />

1.3 · 10 −5<br />

3.8 · 10 −5<br />

3.0 · 10 −5<br />

Naturally the error in this simulation is bigger than in<br />

the first example, nevertheless the results are acceptable.<br />

This method seems to be very suitable to allow a fast<br />

simulation <strong>of</strong> different processes, as pulses, rest steps,<br />

cycling etc. To find appropriate POD and EIM bases for<br />

these processes, which also give a small projection error,<br />

remains our next task.<br />

V. CONCLUSION<br />

For speeding up the numerical simulation <strong>of</strong> the<br />

lithium-ion battery model, a reduced order model was<br />

build utilizing proper orthogonal decomposition method<br />

and empirical interpolation method.<br />

Speed up factors <strong>of</strong> 10-15 (depending on the considered<br />

case) with acceptable error was obtained. The<br />

developed method <strong>of</strong> switching between POD and EIM<br />

bases for different purposes, shows good results for<br />

charging/discharging processes at different C-rates. This<br />

- 385 - 15th IGTE Symposium 2012<br />

method is very promising to allow a fast simulation <strong>of</strong><br />

different processes, e.g. pulses, rest steps, cycling etc.<br />

Our future research will aim for the construction <strong>of</strong><br />

appropriate POD bases for these processes. Also the<br />

question whether the projection error can be minimized<br />

will be considered.<br />

Acknowledgment:<br />

The authors gratefully acknowledge financial support<br />

from “Zukunftsfonds des Landes Steiermark” <strong>of</strong> the Federal<br />

Province <strong>of</strong> Styria/Austria for the project in which<br />

the above presented research results were achieved.<br />

VI. LIST OF SYMBOLS<br />

cs<br />

concentration <strong>of</strong> Li + in active material<br />

cl<br />

concentration <strong>of</strong> Li + in electrolyte<br />

Φs<br />

electrochemical potential <strong>of</strong> active material<br />

Φl<br />

electrochemical potential <strong>of</strong> electrolyte<br />

Ai<br />

inner active surface<br />

αA,αK anodic/cathodic charge transfer coefficients<br />

CDS double layer capacity<br />

Dl<br />

solution diffusivity<br />

Ds<br />

solid diffusivity<br />

εl<br />

electrolyte volume fraction<br />

jBV<br />

Butler-Volmer current density<br />

F Faraday’s constant (= 96485Cmol −1 )<br />

k exchange current density and reaction rate<br />

κl(cl) ionic conductivity function<br />

μl<br />

migration coefficient<br />

σs<br />

electronic conductivity<br />

R universal gas constant (= 8,31447 Jmol −1 K −1 )<br />

T temperature<br />

t time<br />

t +<br />

transference number<br />

UOCV (cs) equilibrium potential function<br />

USEI ohmic loss <strong>of</strong> potential due to solid electrolyte interface at<br />

ΩA<br />

z number <strong>of</strong> transfered electrons (for Li + : z =1)<br />

REFERENCES<br />

[1] A.C. Antoulas, “Approximation <strong>of</strong> Large-Scale Dynamical Systems.”<br />

in Siam, 2005.<br />

[2] M. Barrault, Y. Maday, N. Nguyen, A. Patera, “An empirical<br />

interpolation method: Application to efficient reduced basis<br />

discretization <strong>of</strong> partial differential equations.” Comptes Rendus<br />

Mathematique, 339: 667-672, 2004.<br />

[3] P. Benner, V. Mehrmann, D.C. Sorensen et. al., “Dimension Reductin<br />

<strong>of</strong> Large-Scale Systems.” Springer, 2003.<br />

[4] L. Cai and R.E. White, “An Efficient Electrochemical–Thermal<br />

Model for a Lithium-Ion Cell by Using the Proper Orthogonal<br />

Decomposition Method.” in J. Electrochem. Soc., vol. 157, pp.<br />

A1188-A1195, 2010<br />

[5] M. Cifrain et. al., “Elektrochemisches Zellmodell.” publication in<br />

preparation, 2012.<br />

[6] M. Doyle, T.F. Fuller, J. Newman, “Modeling <strong>of</strong> Galvanostatic<br />

Charge and Discharge <strong>of</strong> the Lithium/Polymer/Insertion Cell.” in<br />

J. Electrochem. Soc., vol. 140 (7), pp. 1526–1533, 1993.<br />

[7] O. Lass and S. Volkwein, “POD Galerkin schemes for nonlinear<br />

elliptic-parabolic systems.” submitted for publication in 2011<br />

[8] F. Pichler, “Anwendung der Finite-Elemente Methode auf ein<br />

Litium-Ionen Batterie Modell.” Master Thesis, <strong>University</strong> <strong>of</strong> <strong>Graz</strong>,<br />

2011.<br />

[9] S. Volkwein, “Proper orthogonal decomposition (POD) for nonlinear<br />

systems.” PhD program in Mathematics for <strong>Technology</strong><br />

Catania, 2007.<br />

[10] J. Wu, J. Xu, H. Zou, “On the well-posedness <strong>of</strong> a mathematical<br />

model for lithium-ion battery systems.” Methods and Applications<br />

<strong>of</strong> Analysis, 13:275-298, 2006.


- 386 - 15th IGTE Symposium 2012<br />

Automatic domain detection for a meshfree postprocessing<br />

in boundary element methods<br />

André Buchau, Matthias Jüttner, and Wolfgang M. Rucker<br />

Universität Stuttgart, Institut für Theorie der Elektrotechnik, Pfaffenwaldring 47, 70569 Stuttgart, Germany<br />

E-mail: andre.buchau@ite.uni-stuttgart.de<br />

Abstract—Modern advanced visualization techniques for three-dimensional electromagnetic fields evaluate field values in<br />

some points in space, which are determined only during the computation <strong>of</strong> visual objects like streamlines. Furthermore, a<br />

meshfree post-processing in boundary element methods along with a bidirectional coupling <strong>of</strong> numerical field computations<br />

with a visualization tool are advisable to reduce significantly computational costs and the total amount <strong>of</strong> stored data. However,<br />

a completely automatic domain detection method is then required. Domain data like material values are not explicitly<br />

defined due to the lack <strong>of</strong> a volume mesh but are necessary for a correct computation <strong>of</strong> field values in arbitrary points. Here,<br />

a robust and fast octree-based method is presented to determine the domain data <strong>of</strong> an evaluation point efficiently even for<br />

large and complex field problems. The computational costs <strong>of</strong> position detection <strong>of</strong> a single evaluation point are kept small by<br />

filtering <strong>of</strong> relevant boundary elements. Furthermore, position data <strong>of</strong> other evaluation points is used if possible.<br />

Index Terms—boundary element methods, domain detection methods, meshfree post-processing, octree-based schemes<br />

I. INTRODUCTION<br />

A very important step in numerical field computations<br />

is an extensive post-processing including a vivid visualization<br />

<strong>of</strong> the obtained results. Today, visualization tools<br />

like virtual and augmented reality along with modern<br />

visualization techniques enable even experienced engineers<br />

a deep insight into the physical properties <strong>of</strong> the<br />

studied problem [1, 2]. However, an expressive visualization<br />

<strong>of</strong> three-dimensional fields is still a challenge. One<br />

possibility is to use volume rendering in the case <strong>of</strong> scalar<br />

data [3]. An interesting approach, which has been presented<br />

for two-dimensional magnetic fields, is to compute<br />

visual objects that represent the topology <strong>of</strong> the vector<br />

field [4]. Further techniques, which are more commonly<br />

used, are to filter three-dimensional data first and to visualize<br />

three-dimensional fields in slices or to compute<br />

streamlines or isosurfaces. Data exchange between the<br />

numerical field computation tool and the visualization<br />

tool is normally done with the help <strong>of</strong> volume meshes <strong>of</strong><br />

all considered domains. The field values are precomputed<br />

in all nodes <strong>of</strong> this mesh and transferred to the<br />

visualization tool that performs the post-processing independently<br />

<strong>of</strong> the numerical field computation tool.<br />

The boundary element method (BEM) is a very attractive<br />

method for the solution <strong>of</strong> three-dimensional electromagnetic<br />

field problems, which consist <strong>of</strong> multiple,<br />

piece-wise homogeneous, linear media. Then, a modeling<br />

and discretization <strong>of</strong> domain surfaces suffices. Hence, the<br />

discretized model is much smaller than in volume-based<br />

methods like the finite element method (FEM). Furthermore,<br />

well-established compression techniques for the<br />

linear system <strong>of</strong> equations exist to enable a fast and efficient<br />

solution <strong>of</strong> large and complex field problems [5].<br />

However, an additional volume mesh is normally created<br />

for the post-processing [6]. Domain data like material<br />

values are assigned to the auxiliary mesh and are available<br />

for the computation <strong>of</strong> field values in the mesh nodes.<br />

The application <strong>of</strong> the fast multipole method (FMM)<br />

enables the computation <strong>of</strong> field values in a huge number<br />

<strong>of</strong> points at acceptable computational costs [7], but the<br />

amount <strong>of</strong> data, which must be stored and transferred to<br />

the visualization tool, is relatively large and a bottleneck.<br />

A better approach, which fits much more the basic<br />

concept <strong>of</strong> a BEM, is a meshfree post-processing [8].<br />

There, field values are only computed at points, which are<br />

necessary for visualization. Hence, the number <strong>of</strong> evaluation<br />

points is dramatically reduced in comparison to the<br />

classical approach, which uses an auxiliary volume mesh.<br />

Furthermore, the meshfree approach is much more flexible.<br />

Post-processing domains and visualization techniques<br />

are defined completely after the solution <strong>of</strong> the<br />

problem. The creation <strong>of</strong> an expensive volume mesh is<br />

unnecessary and the BEM is integrated into the visualization<br />

tool. However, automatic domain detection is required<br />

to make a meshfree post-processing applicable for<br />

problems, which consist <strong>of</strong> multiple domains.<br />

Here, a novel method is presented that completely automatically<br />

detects the position <strong>of</strong> an arbitrary evaluation<br />

point directly from the given boundary element mesh.<br />

The octree-based method is fast and efficient to enable<br />

extensive post-processing <strong>of</strong> large and complex field<br />

problems. A very small number <strong>of</strong> boundary elements are<br />

extracted from the total model to perform the position<br />

detection efficiently with a method that is similar to the<br />

well-known ray tracing method [9]. Furthermore, domain<br />

data <strong>of</strong> other evaluation points is used if possible.<br />

The paper is structured as follows. First, the problem<br />

<strong>of</strong> position detection directly from boundary elements is<br />

formulated and the concept <strong>of</strong> the presented method is<br />

shown. Then, a flexible octree-based scheme is introduced<br />

to enable a grouping <strong>of</strong> all boundary elements<br />

regarding their position in three-dimensional space. It is<br />

applied to determine the position <strong>of</strong> an evaluation point<br />

with the help <strong>of</strong> other evaluation points <strong>of</strong> the same domain<br />

or to filter candidate boundary elements for the<br />

actual position detection, which is described afterwards.<br />

There, a method similar to ray tracing is presented to find<br />

a boundary element that is the closest boundary element<br />

to the given evaluation point with the help <strong>of</strong> a small list<br />

<strong>of</strong> candidates. Two numerical examples have been studied<br />

to demonstrate robustness and efficiency <strong>of</strong> the presented<br />

automatic domain detection method. Finally, significance<br />

<strong>of</strong> the method to BEM and an outlook to future<br />

work are given in the conclusions.


II. NUMERICAL METHOD<br />

In general, automatic domain detection is required for<br />

each evaluation point in a meshfree BEM postprocessing.<br />

Since both the number <strong>of</strong> evaluation points<br />

and the number <strong>of</strong> boundary elements are <strong>of</strong>ten very large<br />

in practical three-dimensional problems, a fast algorithm,<br />

which exploits properties <strong>of</strong> BEM, is advisable.<br />

The concept <strong>of</strong> the presented approach is presented in<br />

the first sub-section. There, a technical formulation <strong>of</strong> the<br />

problem is given along with a comparison to ray tracing<br />

methods. The second sub-section is about the novel fast<br />

octree-based scheme. It is the key to a successful application<br />

<strong>of</strong> meshfree post-processing in large BEM problems.<br />

Finally, an efficient and general method <strong>of</strong> ray tests based<br />

on gradient search is given in the third sub-section. Note,<br />

the presented approach has been developed to achieve<br />

two goals, an efficient and robust automatic domain detection<br />

method and a flexible octree for fast and efficient<br />

post-processing computations using the fast multipole<br />

method (FMM).<br />

A. Concept <strong>of</strong> automatic position detection<br />

The aim <strong>of</strong> the presented automatic domain detection<br />

method is to determine the domain <strong>of</strong> an arbitrary<br />

evaluation point, which is defined by its global Cartesian<br />

coordinates, e. g. during the computation <strong>of</strong> a streamline,<br />

=<br />

. (1)<br />

The complete BEM problem is discretized by in total<br />

boundary elements. The shape <strong>of</strong> boundary elements<br />

and the order <strong>of</strong> their shape functions can be arbitrarily<br />

chosen for the presented domain detection method. The<br />

direction <strong>of</strong> the normal vector <strong>of</strong> a boundary element<br />

is known. Furthermore, the domain , which lies in<br />

direction <strong>of</strong> , and the domain , which lies in<br />

direction <strong>of</strong> , are stored for each boundary element.<br />

The concept <strong>of</strong> the presented approach is to construct a<br />

single ray, which starts at the given evaluation point<br />

= + , (2)<br />

where is the direction <strong>of</strong> the ray and 0 is a parameter.<br />

Then, the domain is determined with the<br />

help <strong>of</strong> the first boundary element, which is intersected<br />

by the ray (2). An example configuration is given in<br />

Fig. 1. The red point is the given evaluation point, the<br />

blue line is the ray, and the green and yellow lines are<br />

two boundary elements. Here, the domain <strong>of</strong> the evaluation<br />

point is = <strong>of</strong> the green boundary element.<br />

Fig. 1: Example <strong>of</strong> domain detection with the help <strong>of</strong> a ray<br />

The concept <strong>of</strong> domain detection is similar to the wellknown<br />

ray tracing method [9]. The main difference is that<br />

the direction <strong>of</strong> the ray is unknown. In ray tracing, the<br />

direction corresponds to the view direction. Furthermore,<br />

a single ray suffices for successful domain detection. In<br />

ray tracing <strong>of</strong> computer graphics, a ray for each pixel <strong>of</strong><br />

the screen is constructed. Hence, an adapted octree-based<br />

algorithm for BEM is presented in the next sub-section.<br />

- 387 - 15th IGTE Symposium 2012<br />

B. Fast octree-based algorithm<br />

A fast algorithm is necessary to enable an application<br />

<strong>of</strong> the automatic domain detection method to large BEM<br />

problems. The costs <strong>of</strong> a standard implementation <strong>of</strong> the<br />

concept, which has been presented in the previous subsection,<br />

are proportional to . Hence, an efficient filtering<br />

<strong>of</strong> relevant boundary elements is required to reduce<br />

computation time. Furthermore, the computational costs<br />

are proportional to the number <strong>of</strong> evaluation points , which are given by the visualization tool. However, an<br />

evaluation point is <strong>of</strong>ten close to a previous evaluation<br />

point, for instance points on a streamline. An approach to<br />

reduce the computational costs is to use domain data <strong>of</strong><br />

previous evaluation points if possible.<br />

Trees are a common method to group and select<br />

boundary elements in three-dimensional space. Here, an<br />

octree has been chosen, since the FMM is used to accelerate<br />

BEM computations and the FMM is based on an<br />

octree, too [10]. An implementation <strong>of</strong> the octree, which<br />

uses modern s<strong>of</strong>tware techniques, enables the application<br />

<strong>of</strong> the same code foundations for both domain detection<br />

and post-processing computations. Then, code is more<br />

reliable and the complete method works more robust.<br />

The first step is to initialize the octree using all boundary elements. The so-called root cube at octree<br />

level 0 is the smallest cube, which encloses all boundary<br />

elements. Its edges are parallel to the axes <strong>of</strong> the global<br />

Cartesian coordinate system. Then, the root cube is subdivided<br />

into eight equal sized cubes and the boundary<br />

elements are assigned to these so-called child cubes. Each<br />

child cube is again subdivided into child cubes. The subdivision<br />

is continued while the boundary elements <strong>of</strong> a<br />

cube can be assigned to its child cubes or the total number<br />

<strong>of</strong> octree levels is smaller than a given limit.<br />

Boundary elements are assigned to a cube, if their centroid<br />

lies inside the cube. Of course, the position <strong>of</strong><br />

boundary elements and their real dimensions must be<br />

considered. The bounding box <strong>of</strong> a cube is determined<br />

including all its boundary elements. This bounding box<br />

must completely lie inside a test cube with the same center<br />

as the considered cube and an edge length <strong>of</strong><br />


The domain detection starts with the addition <strong>of</strong> the<br />

evaluation point to the octree. If is outside the<br />

root cube <strong>of</strong> the octree, the spatial domain <strong>of</strong> the octree is<br />

enlarged by creating a new root cube. Of course, is<br />

increased in that case. The evaluation point is assigned to<br />

a cube in the same way as the boundary elements during<br />

initialization. Here, the subdivision <strong>of</strong> a cube with an<br />

evaluation point is aborted, if no boundary elements are<br />

assigned to the cube <strong>of</strong> the evaluation point. The number<br />

<strong>of</strong> evaluation points in a cube is not taken into account.<br />

Hence, the cubes <strong>of</strong> evaluation points are chosen as large<br />

as possible.<br />

An example is given in Fig. 2. The black lines with<br />

black points represent some boundary elements. The red<br />

point is an evaluation point. The thick black lines are the<br />

octree cubes, which have been created according to the<br />

above-described rules. One boundary element is assigned<br />

to one cube and the boundary elements stick slightly out<br />

<strong>of</strong> the cubes. In contrast, the cube <strong>of</strong> the evaluation point<br />

is relatively large.<br />

Fig. 2: Example <strong>of</strong> an octree for position detection<br />

A strategy is to reduce the number <strong>of</strong> actual domain<br />

detections. Hence, a goal is to determine the domain not<br />

only for the given evaluation point but also for the cube<br />

<strong>of</strong> the evaluation point if possible. Consequently, an initial<br />

cube is searched that fulfills the following conditions.<br />

The octree level <strong>of</strong> a cube <strong>of</strong> an evaluation point is<br />

maximal the finest octree level <strong>of</strong> cubes <strong>of</strong> boundary<br />

elements. While the octree level criterion is satisfied, the<br />

cube <strong>of</strong> the evaluation point is refined until no boundary<br />

elements are assigned to that cube or no boundary elements<br />

stick into this cube. To avoid expensive tests based<br />

on the real position <strong>of</strong> boundary elements, first a cube is<br />

searched that has no neighbors with boundary elements.<br />

If the cube <strong>of</strong> the evaluation point already has no neighbors<br />

with boundary elements, its parent cube is tested for<br />

the above-described rules.<br />

The black cube <strong>of</strong> the evaluation point in Fig. 2 has<br />

neighbor cubes with boundary elements. Hence, the black<br />

cube is refined and the blue cube is obtained. Since the<br />

blue cube has also neighbor cubes with boundary elements,<br />

it is refined as well and the green cube is the initial<br />

cube for domain detection.<br />

If no elements are lying inside the cube , the domain<br />

is determined for including all its child cubes. The<br />

domain data <strong>of</strong> the cube is then used for all evaluation<br />

points, which are assigned to the cube at a later moment.<br />

Neighbor cubes <strong>of</strong> the cube are at several octree levels<br />

due to the adaptive octree rules <strong>of</strong> octree initialization.<br />

Furthermore, octree structure is changing during the post-<br />

- 388 - 15th IGTE Symposium 2012<br />

processing. Hence, it is not possible to determine cube<br />

neighbors in advance. Neighbor cubes, or cubes in general,<br />

are searched by the position <strong>of</strong> the cube center. Here,<br />

the centers <strong>of</strong> possible neighbor cubes are computed and<br />

the cubes are searched starting from the root cube by<br />

simple and fast comparison <strong>of</strong> Cartesian coordinates.<br />

If the domain data <strong>of</strong> a neighbor cube <strong>of</strong> cube has<br />

been already determined, it can be used for the cube ,<br />

too. Otherwise, a cube with boundary elements <strong>of</strong> the<br />

second neighbors <strong>of</strong> cube has to be chosen for domain<br />

detection. Since no elements are assigned to the cube <br />

and its neighbor cubes, one second neighbor cube, which<br />

adjoins a neighbor cube <strong>of</strong> , suffices for domain detection.<br />

At least one second direct neighbor cube with<br />

boundary elements exists, because the cube is chosen<br />

as large as possible as described above.<br />

After a cube has been chosen, the neighbor cube<br />

<strong>of</strong> , which lies between and , is determined.<br />

First, all boundary elements, which stick into , are<br />

searched. Note, most neighbors <strong>of</strong> are without<br />

boundary elements and only neighbors, which adjoin<br />

, have to be considered. In practice, the number <strong>of</strong><br />

relevant cubes is small. Furthermore, the boundary elements,<br />

which are assigned to and which stick into<br />

, are determined. In total, a list with relevant<br />

boundary elements is obtained. To improve robustness <strong>of</strong><br />

the domain detection method, a new point for domain<br />

detection is defined inside and close to .<br />

Some examples <strong>of</strong> typical situations <strong>of</strong> domain detection<br />

are depicted in Fig. 3.<br />

Fig. 3: Some typical situations <strong>of</strong> domain detection<br />

First, the domain <strong>of</strong> the red evaluation point is determined.<br />

Since no boundary elements are assigned to the<br />

red cube, the domain <strong>of</strong> the red cube is evaluated. The<br />

black cube with the red lines is chosen as cube . The<br />

orange evaluation point inside is used for the domain<br />

detection <strong>of</strong> the read evaluation point. Next, the<br />

domain <strong>of</strong> the blue evaluation point is detected. Since the<br />

blue evaluation point lies within a cube with boundary<br />

elements, these boundary elements are used for domain<br />

detection. Finally, the domain <strong>of</strong> the green evaluation<br />

point is determined with the help <strong>of</strong> domain data <strong>of</strong> its red<br />

neighbor cube.<br />

As already mentioned, the list <strong>of</strong> relevant<br />

boundary elements for domain detection includes all<br />

boundary elements, which are assigned to a relevant cube<br />

and all boundary elements <strong>of</strong> neighbor cubes, which stick<br />

into the relevant cube. A fast method to test whether a<br />

boundary element sticks into a cube is to test an intersection<br />

<strong>of</strong> the bounding box <strong>of</strong> the cube and <strong>of</strong> the bounding


ox <strong>of</strong> the boundary element. Two examples <strong>of</strong> boundary<br />

elements inside a cube are given in Fig. 4. Although no<br />

boundary elements are assigned to the blue cube, the blue<br />

boundary element <strong>of</strong> its neighbor cube must be taken into<br />

account, since it sticks into the blue cube. In the case <strong>of</strong><br />

the red cube, one red boundary element, which is assigned<br />

to the red cube, and one red boundary element <strong>of</strong><br />

its neighbor cube have to be considered.<br />

Fig. 4: Examples <strong>of</strong> elements inside a cube<br />

In total, the presented octree-based method is fast,<br />

since is approximately independent <strong>of</strong> , at least<br />

in the case <strong>of</strong> large problems. If possible, domain data for<br />

cubes is determined. The domain detection with the help<br />

<strong>of</strong> the filtered boundary elements is described in<br />

the following sub-section.<br />

C. Domain detection from boundary elements<br />

Starting position <strong>of</strong> a domain detection from boundary<br />

elements is a very small list <strong>of</strong> boundary elements,<br />

which is determined using the octree-based method <strong>of</strong> the<br />

previous sub-section, and the given evaluation point (1).<br />

Furthermore, domain detection from boundary elements<br />

is only necessary, if domain data <strong>of</strong> octree cubes cannot<br />

be used. Hence, general applicability and extension possibilities<br />

<strong>of</strong> the following method are more important than<br />

pure efficiency considerations.<br />

The initial step is to sort the boundary elements<br />

by their distance to the evaluation point<br />

= , 0


III. NUMERICAL EXAMPLES<br />

The presented automatic domain detection method for<br />

BEM has been tested on two numerical examples. The<br />

first example is a capacitor, which can be simply discretized<br />

with different sizes <strong>of</strong> boundary elements. Hence, it<br />

is well suited for fundamental tests <strong>of</strong> the automatic domain<br />

detection method. The second example is an inductor<br />

<strong>of</strong> a micro-electro-mechanical system. It represents a<br />

typical configuration in an application <strong>of</strong> BEM with <strong>of</strong>ten<br />

changing domains in a slice. There, domain data <strong>of</strong> evaluation<br />

points cannot be easily defined and the power <strong>of</strong><br />

the presented automatic domain detection method is<br />

clearly demonstrated.<br />

The domain detection method has been implemented in<br />

C# using the .NET framework 4.0 [11]. C# is a managed<br />

language and it supports very well necessary data handling<br />

<strong>of</strong> the domain detection method. Furthermore, the<br />

interface <strong>of</strong> C# to native C++ enables the use <strong>of</strong> existing<br />

high-performance code <strong>of</strong> numerical methods, for instance<br />

the used BEM and FMM implementation. Bounding<br />

boxes including intersection tests or vector operations<br />

are standard methods <strong>of</strong> the Windows Presentation<br />

Framework (WPF), which is part <strong>of</strong> the .NET framework.<br />

Furthermore, the Windows Communication Foundation<br />

(WCF) <strong>of</strong> the .NET framework enables an interactive<br />

data exchange between different processes based on extensible<br />

markup language (XML) over hypertext<br />

transport protocol (http). Here, WCF is applied to couple<br />

the process <strong>of</strong> the visualization tool HLRS COVISE,<br />

which is developed at the High Performance Computing<br />

Center at the <strong>University</strong> <strong>of</strong> Stuttgart, with the implementation<br />

<strong>of</strong> the domain detection method. HLRS COVISE<br />

visualizes three-dimensional data with the help <strong>of</strong> virtual<br />

and augmented reality techniques [1].<br />

The surfaces <strong>of</strong> both numerical examples have been<br />

discretized with second order, quadrilateral boundary<br />

elements. The Galerkin method has been applied to indirect<br />

and direct BEM formulations. The matrix <strong>of</strong> the<br />

linear system <strong>of</strong> equations has been compressed with the<br />

help <strong>of</strong> the fast multipole method [12]. The matrix has<br />

been assembled in parallel using the OpenMP standard.<br />

The system <strong>of</strong> linear equations has been solved in parallel,<br />

too. An implementation <strong>of</strong> the BEM in combination<br />

with the FMM in C++ has been executed on a workstation<br />

with two six-core Intel Xeon E5649 2.53 GHz<br />

processors.<br />

Although the presented domain detection method supports<br />

all kinds <strong>of</strong> boundary elements, the second order,<br />

quadrilateral boundary elements have been converted into<br />

linear, triangular boundary elements for the postprocessing.<br />

The reason is that in computer graphics only<br />

linear elements, <strong>of</strong>ten only linear triangles, are well supported.<br />

Rendering and graphics processing on linear triangles<br />

is much faster than on other types <strong>of</strong> elements and<br />

linear triangles are supported by modern graphics processors.<br />

The implementation <strong>of</strong> the domain detection method<br />

has been executed on an Intel Core 2 Duo T9900<br />

3.06 GHz laptop processor using a single core. Graphical<br />

objects have been rendered on a NVIDIA Quadro FX<br />

770M graphic card.<br />

- 390 - 15th IGTE Symposium 2012<br />

A. Capacitor<br />

The electric field <strong>of</strong> a capacitor has been studied as<br />

first example. The capacitor consists <strong>of</strong> two quadratic<br />

electrodes and a homogeneous, linear, isotropic dielectric<br />

between the electrodes. The potential <strong>of</strong> the electrodes<br />

has been set to 0.5 V and -0.5 V respectively. The relative<br />

permittivity <strong>of</strong> the dielectric is 10.<br />

The capacitor has been discretized with 9600 second<br />

order, quadrilateral elements (Fig. 5). An indirect BEM<br />

formulation is applied. The Dirichlet boundary conditions<br />

are the potential at the two electrodes. The Neumann<br />

boundary condition is the continuity <strong>of</strong> the electric flux<br />

density at the surface between the dielectric and the surrounding<br />

free space domain. The corresponding linear<br />

system <strong>of</strong> equations with in total 29442 unknowns has<br />

been solved iteratively using generalized minimal residual<br />

method (GMRES) along with a Jacobi preconditioner<br />

within 91 iteration steps in approximately 3 minutes.<br />

Fig. 5: Discretized BEM model <strong>of</strong> a capacitor<br />

The original second order boundary elements have<br />

been converted into 19200 first order, triangular elements<br />

for the post-processing including the domain detection.<br />

The boundary elements are grouped by an octree, which<br />

consists <strong>of</strong> 9 octree levels and 2 elements assigned to a<br />

cube in average. The maximum number <strong>of</strong> elements <strong>of</strong> a<br />

cube is 6. The domain data in a slice in 40000 evaluation<br />

points has been determined in 23 s (Fig. 6). The red color<br />

represents the domain inside the dielectric and the green<br />

color represents the air domain. Furthermore, the two<br />

electrodes are depicted. The color at the electrodes displays<br />

the surface charge density, which equals the solution<br />

<strong>of</strong> the linear system <strong>of</strong> equations. The surface <strong>of</strong> the<br />

dielectric has been omitted for graphical reasons.<br />

The presented octree-based scheme reduces the number<br />

<strong>of</strong> boundary elements, which must be considered for<br />

correct domain detection, from 19200 to maximal 39. The<br />

domain data <strong>of</strong> 93 % <strong>of</strong> the given evaluation points could<br />

be obtained from position data <strong>of</strong> the octree cubes without<br />

expensive ray hit tests.


Fig. 6: Detected domains in a slice through the capacitor<br />

B. Inductor in micro-electro-mechanical systems<br />

The electric current inside an inductor <strong>of</strong> a microelectro-mechanical<br />

system (MEMS) has been studied as<br />

second example. The inductor has been discretized using<br />

9168 second order, quadrilateral elements (Fig. 7). The<br />

potential at the ports <strong>of</strong> the inductor has been set as Dirichlet<br />

boundary condition <strong>of</strong> a direct BEM formulation.<br />

The linear system <strong>of</strong> equations with in total 27594 unknowns<br />

has been solved within 166 iteration steps <strong>of</strong><br />

GMRES in approximately 4 minutes.<br />

Fig. 7: Discretized BEM model <strong>of</strong> a inductor in MEMS<br />

The boundary elements have been converted into<br />

18336 first order, triangular elements for the postprocessing.<br />

The domain in 40000 evaluation points,<br />

which are lying in a slice through the inductor (Fig. 8),<br />

has been detected in 106 s. The yellow color represents<br />

the domain inside the conductor <strong>of</strong> the inductor and the<br />

blue color represents the surrounding free space domain.<br />

Although the slice is <strong>of</strong>ten intersected by boundaries <strong>of</strong><br />

the inductor, the number <strong>of</strong> actual domain detections is<br />

reduced by 45 % by using domain data <strong>of</strong> the octree cubes.<br />

is maximal 26.<br />

Fig. 8: Detected domains in a slice through the inductor<br />

- 391 - 15th IGTE Symposium 2012<br />

IV. CONCLUSION<br />

A fast and efficient automatic domain detection method<br />

for a meshfree post-processing in three-dimensional<br />

boundary element methods has been presented. Relevant<br />

boundary elements are filtered with the help <strong>of</strong> an adaptive<br />

octree-based method. Hence, the number <strong>of</strong> boundary<br />

elements, which have to be considered for domain detection,<br />

is extremely reduced and approximately independent<br />

<strong>of</strong> the total number <strong>of</strong> boundary elements for large problems.<br />

The application <strong>of</strong> the octree, bounding boxes, and<br />

optimized standard libraries results in very low computational<br />

costs. As a result, the shown domain detection<br />

method enables an efficient post-processing in a large<br />

number <strong>of</strong> evaluation points even for large and complex<br />

BEM problems. Furthermore, the complete method including<br />

its implementation is very flexible and supports<br />

all types <strong>of</strong> elements. Hence, it is not only restricted to<br />

pure BEM applications, but volume elements <strong>of</strong> a volume<br />

integral equation can be used, too. Finally, the numerical<br />

examples show that domain data is detected in arbitrary<br />

chosen evaluation points reliably and efficiently.<br />

The shown method is a very important step towards<br />

flexible and powerful post-processing in boundary element<br />

methods. A direct coupling <strong>of</strong> visualization tools<br />

with a boundary element method is enabled. Postprocessing<br />

objects as streamlines or isosurfaces can be<br />

computed totally meshfree even in the case <strong>of</strong> multiple<br />

domains. The octree, which is used here, is the basis <strong>of</strong><br />

fast and efficient field computations using the fast multipole<br />

method, too.<br />

[1]<br />

REFERENCES<br />

U. Lang and U. Wössner, “Virtual and augmented reality developments<br />

for engineering applications”, <strong>Proceedings</strong> <strong>of</strong><br />

[2]<br />

ECCOMAS 2004, Jyväskylä, July 24-28, pp. 24-8., 2004<br />

A. Buchau, W. M. Rucker, U. Wössner, and M. Becker, “Augemented<br />

reality in teaching <strong>of</strong> electrodynamics”, COMPEL, vol.<br />

28, no. 4, pp. 948-963, 2009<br />

[3] D. Weiskopf, „GPU-Based Interactive Visualization Techniques“,<br />

Springer, 2006<br />

[4] S. Bachthaler, F. Sadlo, R. Weeber, S. Kantorovich, Ch. Holm,<br />

and D. Weiskopf, “Magnetic Flux Topology <strong>of</strong> 2D Point Dipoles”,<br />

Eurographics Conference on Visualization (EuroVis) 2012, vol.<br />

31, no. 3, 2012<br />

[5] A. Buchau, W. M. Rucker, O. Rain, V. Rischmüller, S. Kurz, S.<br />

Rjasanow, “Comparison Between Different Approaches for Fast<br />

and Efficient 3D BEM Computations”, IEEE Transactions on<br />

Magnetics, vol. 39, no. 3, pp. 1107-1110, 2003<br />

[6] W. Hafla, A. Weinläder, A. Bardakcioglu, A. Buchau, and W. M.<br />

Rucker, “Efficient Post-Processing with the Integral Equation<br />

Method”, COMPEL, vol. 26, no. 3, pp. 873-887, 2007<br />

[7] A. Buchau, W. Rieger, and W. M. Rucker, “Fast Field Computations<br />

with the Fast Multipole Method”, COMPEL, vol. 20, no. 2,<br />

pp. 547-561, 2001<br />

[8] A. Buchau and W. M. Rucker, “Meshfree Visualization <strong>of</strong> Field<br />

Lines in 3D”, 14 th IGTE Symposium, pp. 172-177, <strong>Graz</strong>, 2010<br />

[9] J. Goldsmith, J. Salmon, “Automatic Creation <strong>of</strong> Object Hierarchies<br />

for Ray Tracing”, IEEE Computer Graphics and Applications,<br />

vol. 7, no. 5, pp. 14-20, 1987<br />

[10] A. Buchau, Ch. J. Huber, W. Rieger, W. M. Rucker, ”Fast BEM<br />

Computations with the Adaptive Multilevel Fast Multipole Method”,<br />

IEEE Transactions on Magnetics, vol. 36, no. 4, pp. 680-684,<br />

2000<br />

[11] “.NET Framework Developer Center”, Micros<strong>of</strong>t Corporation<br />

[12] A. Buchau, W. Hafla, F. Groh, and W. M. Rucker, ”Parallelized<br />

Computation <strong>of</strong> Compressed BEM Matrices on Multiprocessor<br />

Computer Clusters”, COMPEL, vol. 24, no. 2, pp. 468-479, 2005


- 392 - 15th IGTE Symposium 2012<br />

Efficient modeling <strong>of</strong> coil filament losses in 2D<br />

L. Lehti∗ ,J.Keränen †∗ ,S.Suuriniemi∗ , T. Tarhasaari∗ , and L. Kettunen∗ ∗Tampere <strong>University</strong> <strong>of</strong> <strong>Technology</strong> - Electromagnetics, P.O. Box 692, FI-33101 Tampere, Finland<br />

† VTT Technical Research Centre <strong>of</strong> Finland, P.O. Box 1300, FI-33101 Tampere, Finland<br />

E-mail: leena.lehti@tut.fi<br />

Abstract—Practical estimates for losses in coil filaments <strong>of</strong> a FEM model are sought for. A low-dimensional function space<br />

is introduced on the filament-air interface and then suitably extended into the filament to significantly reduce the number<br />

<strong>of</strong> unknowns per filament. Careful choice <strong>of</strong> extensions enables good loss estimate accuracy. The result is a system matrix<br />

assembly block that can be used verbatim for all filaments, further reducing the cost. Both net current and voltage per<br />

length <strong>of</strong> the filament are readily available in the problem formulation.<br />

Index Terms—coil modeling, FEM, winding loss estimate<br />

I. INTRODUCTION<br />

Improving the efficiency <strong>of</strong> electrical machines is an<br />

important aspect <strong>of</strong> machine design. Modeling methods<br />

are required to be as fast and accurate as possible to<br />

help the designer optimize the machines. One important<br />

aspect <strong>of</strong> machine design are Ohmic coil loss estimates.<br />

They enable the designer to choose the placement <strong>of</strong> the<br />

conductors such that the losses are minimized.<br />

Solving for the conductor losses is not a straightforward<br />

task. The losses depend on the conductivity and<br />

the current density. The current density is affected by<br />

numerous factors, which cannot be separately solved for.<br />

The different elements include the feeding current, fields<br />

generated by neighboring conductors, placement <strong>of</strong> the<br />

permeable materials, and—depending on the frequency—<br />

skin effect.<br />

Different methods have been developed to overcome<br />

these difficulties. One method is to replace the conductivity<br />

<strong>of</strong> the material by other parameters which<br />

transform the eddy-current losses to hysteresis losses<br />

[1]. In this approach the parameters for resistance and<br />

inductance are sought for. A separate mesh for magnetic<br />

and electric problems are introduced in [2]. The results<br />

are acceptable, but the method has computationally slow<br />

segments. In [3] a cell-model is used to reduce the<br />

computational effort and to extract the resistance <strong>of</strong><br />

windings. A homogenization technique in [4] derives<br />

parameters to characterize skin and proximity effects in<br />

windings. Surface impedance methods have also been<br />

used [5], where it is assumed that the magnetic flux does<br />

not penetrate into the conducting material. Therefore the<br />

conducting material can be approximated on the surface<br />

only and the interior <strong>of</strong> the material can be ignored. This<br />

can be used for conductors with low curvature and small<br />

skin depth.<br />

We aim at a good trade-<strong>of</strong>f between moderate calculation<br />

time and accuracy <strong>of</strong> loss estimates while maintaining<br />

an explicit connection to the exterior circuit.<br />

In addition, the conductors can be placed freely, i.e. a<br />

periodical spacing is not required, as in [3] and [4].<br />

Previously [6], magnetic flux was not allowed to enter the<br />

filaments and a separable problem was achieved, i.e. the<br />

field problem was divided into the filament interiors and<br />

the exterior. The exterior and interior had only limited<br />

interaction through net current and constant stream function<br />

values on the boundary. To improve the accuracy,<br />

it is necessary to admit some magnetic flux into the<br />

filaments while solving for the exterior problem. Ignoring<br />

the magnetic energy and losses inside the filaments on<br />

the exterior problem enables subsequent solving for the<br />

interior problem, but it leads to a very small reluctance<br />

inside the filaments. When the filaments are close to each<br />

other, this is detrimental.<br />

Consequently, the filament interiors have to be coupled<br />

with the exterior problem. To save computational effort,<br />

the function space on the filament interface is significantly<br />

limited. The basis representing the result inside<br />

the filaments is spanned by solutions <strong>of</strong> eddy current<br />

problems that use the functions from the interface as<br />

boundary conditions. The use <strong>of</strong> these solutions improves<br />

the loss estimates significantly compared to [6] and the<br />

same solutions can be used for all filaments with similar<br />

cross-section. The resulting method is called an interface<br />

technique.<br />

II. EXTERIOR FORMULATION<br />

Here, we concentrate on two-dimensional cases, since<br />

they are widely used in industry. Solving for 2D problems<br />

is simple, and they provide enough accuracy for many industrial<br />

applications. Since we look for a time-harmonic<br />

solution, the materials used are assumed to be linear.<br />

However, the method could be extended to nonlinear<br />

time-domain problems with convolution techniques if the<br />

material <strong>of</strong> the filaments stays linear.<br />

A few terms are represented in Figure 1 for convenience.<br />

Exterior problem refers to Ωe and interior<br />

problem to Ωin = Ωj. The geometry contains K<br />

conducting filaments, Ωj, and the relative permeability <strong>of</strong><br />

the core is 1000. The whole domain is Ω= Ωj ∪ Ωe<br />

and on the boundary <strong>of</strong> Ω the stream function is set to


zero. This geometry, with K =25and f =50Hz, is<br />

also used as an example in the computations. The radius,<br />

r, for each filament is 0.01m to produce a challenging<br />

modeling problem (skin depth/radius ≈ 1).<br />

symmetry axis<br />

∂Ωj<br />

Ω=Ωin ∪ Ωe<br />

Ωj<br />

Ωe<br />

b · n =0<br />

Fig. 1. An example geometry for a transformer with an E-shaped core.<br />

There are 25 conductors in the coil and their net current is set to one.<br />

The core material’s relative permeability is 1000 and f =50Hz. Ω is<br />

the whole domain that consists <strong>of</strong> Ωin = Ωj and Ωe.<br />

In our previous work [6], the floating potential approach<br />

with a constant but unknown stream function on<br />

each filament boundary was used. This prevented the flux<br />

from entering the filament and the eddy currents were<br />

similar in all filaments. Now, we replace the constant<br />

potential with a low-dimensional subspace <strong>of</strong> functions,<br />

L, that enables the magnetic flux to enter the filament.<br />

The function space comprises <strong>of</strong> a constant function and<br />

trigonometric functions. Let ˆ L be the space <strong>of</strong> Whitney<br />

nodal basis function interpolations <strong>of</strong> L. We construct the<br />

following formulation for the exterior problem<br />

div 1<br />

μ grad a =0 in Ωe, (1)<br />

a =0 on ∂Ω, (2)<br />

<br />

∂Ωj<br />

a =<br />

M−1 <br />

i=0<br />

cijfi ,fi ∈ ˆ L on ∂Ωj ∀ j, (3)<br />

h · dl = Ij on ∂Ωj ∀ j. (4)<br />

Here μ is the permeability, a the stream function,<br />

M the number <strong>of</strong> functions on the interface, cij are<br />

complex scalar coefficients, h the magnetic field, and Ij<br />

the net current in j th filament. There can be multiple<br />

trigonometric functions, i.e., (3) can be written<br />

a = c0 +<br />

N<br />

(c2n−1 sin nα + c2n cos nα), (5)<br />

n=1<br />

- 393 - 15th IGTE Symposium 2012<br />

where N is the number <strong>of</strong> trigonometric functions and<br />

α an angle parameter. 1 For N =0this reduces to floating<br />

potential. On the interface, other than trigonometric<br />

functions could be used.<br />

The solution for (1)–(4) is sought for in the form<br />

a = <br />

diλi + <br />

cij ˆ fi, (6)<br />

i<br />

where each λi is a Whitney nodal basis function associated<br />

to the interior nodes <strong>of</strong> Ωe, and ˆ fi are approximated<br />

by Whitney interpolants on the boundary and the extension<br />

<strong>of</strong> these interpolants into the domain Ωe is done<br />

canonically.<br />

In practice, the function space L on the interface is<br />

reduced to ˆ L by using a projection matrix Q. InQ we<br />

have a row for each basis function <strong>of</strong> ˆ L for each filament<br />

and columns for all nodes on the boundary. The i th row<br />

<strong>of</strong> Q consists <strong>of</strong> the values <strong>of</strong> fi in the boundary nodes<br />

<strong>of</strong> one filament. In a standard system, the system matrix<br />

is formed from blocks<br />

i,j<br />

AΩΩ AΩΓ<br />

AΓΩ AΓΓ<br />

<br />

, (7)<br />

where Ω refers to the exterior <strong>of</strong> the filaments and Γ<br />

refers to the filament boundary. We use Q as follows<br />

ÃΩΩ = AΩΩ<br />

ÃΩΓ = AΩΓQ T<br />

ÃΓΩ = QAΓΩ<br />

ÃΓΓ = QAΓΓQ T<br />

(8)<br />

(9)<br />

(10)<br />

(11)<br />

to build the new system matrix à in the same format as<br />

(7). Note that the block with most nodes, i.e. ÃΩΩ, is not<br />

transformed. The dimensions <strong>of</strong> the projection matrix are<br />

(KM) × (boundary nodes).<br />

Remark 1. If the energy stored and dissipated in the<br />

filaments is neglected the exterior problem can be independently<br />

solved for. However, if we use equations (1)–<br />

(4) without including the effects <strong>of</strong> the filament interiors’,<br />

the results are not satisfactory. As an example, we have<br />

the E-magnet from Figure 1. The model lacks the reluctance<br />

and eddy current effects from the filament interiors,<br />

and thus the filaments <strong>of</strong>fer no reluctance to the flux. This<br />

effect gets more pronounced when the filaments are close<br />

to each other. In a tightly wound coil the stored energy<br />

inside the filaments is considerable and, in addition, eddy<br />

current losses inside the filaments have an effect on<br />

the exterior magnetic field. In Figure 2, flux lines are<br />

shown for an exterior solution with a constant, sine, and<br />

cosine functions. In Figure 3 a reference result with A-<br />

V-formulation and a finely discretized mesh in the whole<br />

domain is shown for comparison. The A-V-formulation<br />

provides an accurate solution, but the computational cost<br />

is high.<br />

1 Here sin and cos are to be understood as the Whitney nodal basis<br />

function interpolations <strong>of</strong> the trigonometric functions.


Fig. 2. The solution to E-magnet problem <strong>of</strong> Fig.1 with a constant,<br />

sine and cosine on the boundary. Since the interior energy is ignored,<br />

the filaments <strong>of</strong>fer a zero reluctance path for the flux and the flux gets<br />

attracted into the filaments.<br />

Fig. 3. The E-magnet problem solved in Ω with an A-V-formulation as<br />

a reference result. The solution is very accurate, but the computational<br />

cost is high.<br />

III. INTERIOR FORMULATION<br />

A. Expansion <strong>of</strong> the Basis Functions to the Interior<br />

Because <strong>of</strong> the stored and dissipated energy inside<br />

the filaments and its effect on the exterior, the interiors<br />

cannot be separated from the exterior completely. Thus,<br />

we need to have a model for the interior part and solve<br />

for it simultaneously with the exterior. We take the lowdimensional<br />

function space ˆ L on the filament interface<br />

as the starting point.<br />

For the interior problem we have<br />

div 1<br />

Vz<br />

grad a = jωσ(a +<br />

μ jω ) in Ωj,<br />

<br />

(12)<br />

h · dl = Ij on ∂Ωj ∀ j, (13)<br />

∂Ωj<br />

aj = ae on ∂Ωj ∀ j, (14)<br />

where Vz the filamentwise constant potential gradient and<br />

ae is the value <strong>of</strong> the exterior problem solution on the<br />

- 394 - 15th IGTE Symposium 2012<br />

filament boundary. 2 We want to represent the solution to<br />

this eddy-current problem inside the filament by using<br />

only the unknowns cij related to the functions <strong>of</strong> ˆ L plus<br />

one unknown associated to the net current condition (13).<br />

Note that computational effort is saved in the interiorexterior<br />

coupling, since we restrict ae to be spanned by<br />

the functions <strong>of</strong> ˆ L.<br />

Once we have extensions <strong>of</strong> ˆ L into the filament, we<br />

can assemble a FEM assembly block and corresponding<br />

excitation for the problem (12)–(14). The single block<br />

can then be efficiently used for all filaments <strong>of</strong> identical<br />

cross section. It is important to notice that even though<br />

we have to solve for one boundary value problem (BVP)<br />

in the filament per basis element <strong>of</strong> ˆ L, we only have<br />

to solve them once for filaments with the same crosssection.<br />

Usually the number <strong>of</strong> BVPs is much smaller<br />

than the number <strong>of</strong> similar filaments.<br />

To produce accurate loss estimates, we choose the<br />

extensions to be the solution to (12)–(14) with the basis<br />

<strong>of</strong> ˆ L as boundary conditions. Hence, we decompose the<br />

problem into M +1 separately solvable problems and<br />

the solution to the eddy-current problem (12)–(14) is a<br />

linear combination <strong>of</strong> these solutions. 3 These solutions<br />

are then used to extend the basis { ˆ fj} (<strong>of</strong> (6)) inside<br />

the filaments and to add K extra unknowns for the net<br />

currents to the exterior problem. Note that we are not<br />

restricted to a specific geometry, because these solutions<br />

can be obtained by any means.<br />

The first M problems to solve are<br />

div 1<br />

μ grad ai − jωσai =0 (15)<br />

with boundary condition ai = fi on ∂Ωj, wherefi∈ˆ L.<br />

The remaining one problem is used to account for the<br />

filamentwise constant potential Vz with the following<br />

div 1<br />

μ grad aξ − jωσaξ = jωσ1 (16)<br />

with aξ =0on ∂Ωj and Vz<br />

jω is spanned by the constant 1.<br />

As the term (aξ +1) equals 1 on the interface, we can use<br />

this term to impose the net current <strong>of</strong> the filament with a<br />

circulation <strong>of</strong> h, wherehis the magnetic field. Note that<br />

the restriction <strong>of</strong> a0 into Ωj qualifies as (aξ +1), so that<br />

it involves no extra cost. The solution within the filament<br />

for the electric field (divided by jω) is expressed by the<br />

space <strong>of</strong> linear combinations<br />

a + Vz<br />

jω = cξ(aξ +1)+<br />

M<br />

ciai, (17)<br />

i=0<br />

where a is the magnetic vector potential inside the<br />

filament and cξ = Vz<br />

jω .<br />

2 At first glance, it might seem like the problem is overdetermined,<br />

because (14) states a Dirichlet condition on all <strong>of</strong> the boundary.<br />

However, we have as many extra conditions (13) as we have constants<br />

Vz in (12), namely K.<br />

3 This requires the material to be linear inside the filaments.


B. Assembly Block in Entire Problem<br />

Let us see how the system matrix <strong>of</strong> the exterior<br />

problem is modified when these extended basis functions<br />

(ai’s and aξ’s) are used. We form an assembly block that<br />

is added to the system matrix from the problem<br />

div 1<br />

μ<br />

Vz<br />

grad a = jωσ(a + ) in Ω, (18)<br />

jω<br />

because we combine the solution from the interior to the<br />

exterior problem. The variational formulation for this is<br />

<br />

Ω<br />

w(div 1<br />

μ<br />

Vz<br />

grad a − jωσ(a + )) dΩ =0, (19)<br />

jω<br />

where w is a test function. After integration by parts,<br />

(19) becomes<br />

<br />

grad w 1<br />

grad a dΩ =<br />

μ<br />

<br />

Ω<br />

∂Ω<br />

w 1<br />

<br />

grad a · n dl −<br />

μ<br />

Ω<br />

wjωσ(a + Vz<br />

) dΩ. (20)<br />

jω<br />

Because <strong>of</strong> homogenous Dirichlet condition for a everywhere<br />

on ∂Ω, the boundary integrals are zero for w = ai<br />

<br />

Ω<br />

1<br />

grad ai<br />

μ grad a + jωσai(a + Vz<br />

) dΩ =0. (21)<br />

jω<br />

With weight w = aξ +1, the net current condition is<br />

imposed for each filament. Now, aξ +1 is one at the<br />

boundary <strong>of</strong> its support Ωj, and (20) becomes<br />

<br />

<br />

Ωj<br />

∂Ωj<br />

grad (aξ +1) 1<br />

grad a+<br />

μ<br />

jωσ(aξ +1)(a + Vz<br />

) dΩ =<br />

jω<br />

1 1<br />

<br />

grad a · ndl = h · dl = Ij, (22)<br />

μ<br />

∂Ωj<br />

where Ij is the imposed net current.<br />

After substituting (17) into (21) and (22), we can<br />

reduce most <strong>of</strong> the assembly block elements <strong>of</strong> (21) and<br />

(22) to an integral on the boundary <strong>of</strong> the filament. For<br />

solutions ai and aj <strong>of</strong> (15), consider an integral equation<br />

<br />

Ωj<br />

ai( div 1<br />

μ grad aj − jωσaj) dΩ =0, (23)<br />

which holds because aj is a solution <strong>of</strong> (15). After<br />

integration by parts we get<br />

<br />

<br />

∂Ωj<br />

Ωj<br />

1<br />

ai<br />

μ grad aj · n dl =<br />

grad ai<br />

1<br />

μ grad aj + jωaiσaj dΩ, (24)<br />

which occurs as a basic building block in (21) and (22).<br />

- 395 - 15th IGTE Symposium 2012<br />

∂Ωj Ωj<br />

∂Ωj<br />

Fig. 4. Extension <strong>of</strong> the basis function inside the filament with the<br />

constant boundary condition in the E-magnet example, real part in solid<br />

line and imaginary part in dashed line.<br />

∂Ωj<br />

Ωj<br />

∂Ωj<br />

Fig. 5. Extension <strong>of</strong> the basis function inside the filament with the<br />

sine boundary condition in the E-magnet example, real part in solid<br />

line and imaginary part in dashed line.<br />

C. Example: Circular conductors<br />

Circular conductors <strong>of</strong> radius a admit exact solutions<br />

for (15) in closed form for a constant, sines and cosines:<br />

a0 = J0( 1<br />

−jωσμr)<br />

J0( √ (25)<br />

−jωσμa)<br />

a2n−1 = Jn( 1<br />

−jωσμr)<br />

Jn( √ cos nφ (26)<br />

−jωσμa)<br />

a2n = Jn( 1<br />

−jωσμr)<br />

Jn( √ sin nφ (27)<br />

−jωσμa)<br />

where Jn are Bessel functions <strong>of</strong> first kind with order<br />

n. In Figure 4, we see a cross-sectional view <strong>of</strong> the<br />

extension <strong>of</strong> the constant function (a0) into the filament<br />

and in Figure 5 the extension <strong>of</strong> the sine function (a2).<br />

In Figure 6, flux lines are shown for the interface<br />

technique with one sine and cosine with the extended<br />

basis functions. The comparison to A-V-formulated case<br />

(Fig. 3) shows the flux to be very similar. The figures<br />

cannot be identical, since the function space <strong>of</strong> the interface<br />

technique is severely restricted from the interface<br />

function space <strong>of</strong> A-V-formulation and also the basis<br />

functions inside the filaments differ.


Fig. 6. The flux lines obtained with the interface technique for the<br />

E-magnet are shown. Here we have taken into account the effect <strong>of</strong><br />

the interior <strong>of</strong> the filaments to the exterior solution and have a good<br />

correlation with the reference solution.<br />

<br />

<br />

<br />

Fig. 7. Filament numbers used in Table I for the E-magnet.<br />

IV. LOSS ESTIMATES<br />

By using the interior solution (17), we compute in Ωj<br />

the time average <strong>of</strong> the losses<br />

P = 1<br />

<br />

Re{e · j<br />

2<br />

∗ } da, (28)<br />

where e = −jω(a + Vz/jω) and j = σe.<br />

Losses for selected filaments (see Figure 7) <strong>of</strong> the<br />

example in Fig. 1 are shown in Table I. Losses are<br />

computed for five different filaments with the interface<br />

technique and A-V-formulation throughout for comparison.<br />

For the interface technique, the assembly block was<br />

produced with (25)–(27). The loss estimates were also<br />

computed analytically. The reference results for the A-<br />

V-formulation were obtained with GetDP [8] to verify<br />

our loss estimated from MATLAB R○ . The greatest error<br />

is in filament 2, where the error is -4.0%.<br />

In Table II some computationally relevant figures for<br />

the reference solution and the interface technique are<br />

shown. The mesh outside the filaments is <strong>of</strong> an equal<br />

density in both methods. The system <strong>of</strong> equations was<br />

solved using the backslash-operator in MATLAB R○ and<br />

- 396 - 15th IGTE Symposium 2012<br />

Filament A-V [mW/m] Interface<br />

technique [mW/m]<br />

Error %<br />

1 1.012 1.009 0.2<br />

2 0.7185 0.7470 -4.0<br />

3 0.2634 0.2703 -2.6<br />

4 0.04252 0.04200 1.2<br />

5 0.03497 0.03480 0.5<br />

TABLE I<br />

LOSSES IN NUMBERED FILAMENTS WITH UNIT CURRENT AND<br />

f =50HZ. A-VRESULTS FROM GETDP AND INTERFACE<br />

TECHNIQUE FROM MATLAB R○ . δ/r =0.92, WHERE<br />

δ = 2/(ωσμ) AND r THE RADIUS OF THE FILAMENT.<br />

A-V Interface technique Difference (%)<br />

Nodes 153 796 55 206 -64.1<br />

DoFs 153 435 49 645 -67.6<br />

Time [s] 3.928 0.5230 -86.7<br />

nnz<br />

No. <strong>of</strong> DoFs<br />

1 280 535 367 240 -71.3<br />

in filaments 98 525 100<br />

TABLE II<br />

-99.9<br />

SOME PERFORMANCE INDICATORS FOR COMPUTATIONS.NNZ IS<br />

THE NUMBER OF NONZERO ELEMENTS.<br />

the number <strong>of</strong> nonzero elements (nnz) in the system<br />

matrix is much lower than in the A-V-formulation. Most<br />

<strong>of</strong> the saved elements are in the conducting regions and<br />

this saves computation time. Additionally, for the A-Vformulation,<br />

the number <strong>of</strong> nodes required to maintain<br />

accuracy inside the filaments has to increase significantly<br />

with increasing frequency.<br />

V. CONCLUSION<br />

An approach to model coil filament losses was proposed.<br />

We expanded the function space on the filament<br />

boundaries from the floating potential approach with<br />

trigonometric functions. We observed that the filament<br />

interiors need to be considered as well due to the<br />

significant effect <strong>of</strong> the magnetic energy stored and<br />

dissipated inside them. When interface basis functions<br />

were extended into filaments with solutions <strong>of</strong> magnetoquasi-static<br />

problems, and these were used as FEM basis<br />

functions, the loss estimates are at the most 4% away<br />

from A-V-formulated estimates. The computation time is<br />

significantly reduced in a small problem consisting <strong>of</strong> 25<br />

filaments.<br />

ACKNOWLEDGEMENTS<br />

The authors thank Pr<strong>of</strong>essor Stefan Kurz for discussion<br />

and comments.<br />

REFERENCES<br />

[1] O. Moreau, L. Popiel and J. Pages, ”Proximity Losses Computation<br />

with a 2D Complex Permeability Modelling,” IEEE Trans. Magn.,<br />

vol. 34, pp. 3616-3619, 1998.<br />

[2] H. de Gersem and K. Hameyer, ”A Multiconductor Model for<br />

Finite-Element Eddy-Current Simulation,” IEEE Trans. Magn., vol.<br />

38, pp. 533-536, 2002.<br />

[3] A. Podoltsev, I. Kucheryavaya and B. Lebedev, ”Analysis <strong>of</strong><br />

effective resistance and eddy-current losses in multiturn winding<br />

<strong>of</strong> high-frequency magnetic components,” IEEE Trans. Magn., vol.<br />

36, pp. 539-548, 2003.


[4] J. Gyselinck, R. Sabariego and P. Dular, ”Time-Domain homogenization<br />

<strong>of</strong> windings in 2-D finite element models,” IEEE Trans.<br />

Magn., vol. 43, pp. 1297-1300, 2007.<br />

[5] T. Le-Duc, G. Meunier, O. Chadebec and J.-M. Guichon, ”A<br />

new integral formulation for eddy current computation in thin<br />

conductive shells,” IEEE Trans. Magn., vol. 48, pp. 427-430, 2012.<br />

[6] L. Lehti, J. Keränen, S. Suuriniemi, and L. Kettunen, ”Subsystem<br />

separation by flux linkage in coil filament modelling,” ACOMEN<br />

2011, Liège.<br />

[7] P. Dular, W. Legros, H. De Gersem, and K. Hameyer, ”Floating<br />

potentials in various electromagnetic problems using the finite<br />

element method,” Proc. <strong>of</strong> the 4th int. workshop on electric and<br />

magnetic fields, 1998, Marseille.<br />

[8] P. Dular and C. Geuzaine, GetDP: a General Environment for the<br />

Treatment <strong>of</strong> Discrete Problems, available: http://geuz.org/getdp/<br />

- 397 - 15th IGTE Symposium 2012


- 398 - 15th IGTE Symposium 2012<br />

Optimization <strong>of</strong> Energy Storage Usage<br />

Arnel Glotic 1 , Peter Kitak 1 , Igor Ticar 1 , Adnan Glotic 2<br />

1 <strong>University</strong> <strong>of</strong> Maribor, Faculty <strong>of</strong> electrical engineering and computer science, Smetanova 17, SI-2000 Maribor,<br />

Slovenia<br />

2 Holding Slovenske elektrarne Group, Koprska ulica 92, SI-1000 Ljubljana, Slovenia<br />

Abstract — Energy storage is a physical storage for energy, like Batteries, Flywheels, Compressed Air Storages, Pumped<br />

Storages, etc. This paper presents the use <strong>of</strong> the optimization algorithm in order to achieve the optimal usage <strong>of</strong> Energy Storage.<br />

Reservoirs <strong>of</strong> cascade Hydro Power Plants have been used as model <strong>of</strong> Energy Storage, and these are known as complex<br />

optimization problems. Optimization algorithm used in this paper was the adapted differential evolution algorithm.<br />

Index Terms — Differential evolution, energy storage, optimization, hydro power plants.<br />

I. INTRODUCTION<br />

Energy storage [1] is a physical storage for energy and<br />

can be found in different types. Authors’ research has<br />

been focused to cascade hydro power plants (HPP), where<br />

each individual plant has its own reservoir and energy<br />

storage, respectively.<br />

Various combinations <strong>of</strong> reservoirs’ charging and<br />

discharging produces different amount <strong>of</strong> electricity. In<br />

order to achieve optimal production, several methods can<br />

be implemented [2], such as Lagrangian relaxion and<br />

Benders decomposition-based methods, Mixed-integer<br />

programming, Dynamic programming, Evolutionary<br />

Computing Methods, Artificial intelligence methods and<br />

Interior-point methods.<br />

Differential Evolution (DE) Algorithm [3] is an<br />

efficient and robust global optimization algorithm and<br />

therefore it has been selected in this paper as an<br />

appropriate optimization technique.<br />

Short-term optimization using DE with self-adaptive<br />

parameter settings authors in [4] has been used on four<br />

cascades HPP, where the best objective value has reached<br />

after 2000 generations. The modified DE presented in [5]<br />

includes penalty factor during the objective function<br />

evaluation, which preserves the satisfied final reservoirs<br />

levels <strong>of</strong> four cascades HPP. In [6] authors combined<br />

advantages <strong>of</strong> the two modified DE algorithms, where the<br />

grouping and shuffling operation is carried out over the<br />

population periodically.<br />

Optimization <strong>of</strong> reservoirs scheduling HPP is known as<br />

a complex problem, where large number <strong>of</strong> HPP in<br />

cascade, means much larger number <strong>of</strong> reservoirs<br />

scheduling combinations and convergence time,<br />

respectively. The main goal <strong>of</strong> this paper was to modify<br />

DE in order to be capable <strong>of</strong> reaching the global optimal<br />

solution with fast convergence. This means the adequate<br />

distribution <strong>of</strong> individual HPP electrical energy<br />

production by scheduling reservoirs in order to satisfy the<br />

demand for 24 hours. Besides satisfying the demand, the<br />

decreased usage <strong>of</strong> water quantity per electrical energy<br />

unit (m 3 /MWh) has to be also achieved. Also, the<br />

optimization results must be feasible in range <strong>of</strong> couple<br />

minutes.<br />

Mathematical model <strong>of</strong> cascade hydro power plants is<br />

E-mail: arnel.glotic@uni-mb.si<br />

described in section II, standard and modified differential<br />

evolution algorithm in section III, results in section IV,<br />

and conclusion in section V.<br />

II. MATHEMATICAL MODEL OF CASCADE HYDRO POWER<br />

PLANTS<br />

The mathematical model describes cascade HPP on<br />

Drava River in Slovenia, owned by Dravske elektrarne<br />

Maribor (DEM). DEM is a subsidiary company <strong>of</strong><br />

Holding Slovenske elektrarne (HSE), which is the biggest<br />

producer and trader with electricity in Slovenia. DEM<br />

provides approximately 25.5% <strong>of</strong> produced energy in<br />

Slovenia, with maximum output <strong>of</strong> 587 MW.<br />

The mathematical model consists <strong>of</strong> eight cascades,<br />

t<br />

where i-th HPP has natural inflow Qi ,NI in the observed<br />

hour t <strong>of</strong> the day. The first HPP in decade structure has<br />

t<br />

the inflow Qi,I <strong>of</strong> Drava River coming from Austria. The<br />

source <strong>of</strong> Drava River lies in Italy, near Austrian-Italian<br />

border.<br />

The total inflow for the first HPP in the observed hour t<br />

is,<br />

t t t<br />

Qi,TI Qi,I Qi,NI<br />

, (1)<br />

i 1, t 1,2,...24<br />

t<br />

where Qi ,TI is the sum <strong>of</strong> inflows. The total inflow for the<br />

following seven HPP is expressed as<br />

t t t<br />

Qi,TI Q( i1) Qi,NI<br />

i 2,3,...6, t 1,2,...24 , (2)<br />

t<br />

Q is the outflow <strong>of</strong> the upper HPP, expressed as<br />

where ( i 1)<br />

t t t<br />

i i,T i,O<br />

Q Q Q<br />

i 1,2...8, t 1,2,...24 , (3)<br />

which represents the sum <strong>of</strong> the flow through the turbine<br />

and the overflow in the observed hour t. The last two<br />

HPP’s, HPP 7 and HPP 8, are canal based type HPP’s<br />

where flows merge with the riverbed at the end <strong>of</strong> the<br />

canal. Both <strong>of</strong> these HPP’s have the required biological<br />

minimum flow Q i,B<br />

, which must be provided to the<br />

riverbed.


t<br />

Q1,TI<br />

V<br />

t<br />

HPP1<br />

t<br />

Q1,O<br />

HPP 1<br />

dam<br />

t 1 <br />

H V<br />

t<br />

Q1,T<br />

t<br />

Q1<br />

V2,min<br />

Total inflow for the last two HPP in chain is expressed as<br />

t t t<br />

Qi,TI Q( i1) Qi,NI Qi,B<br />

(4)<br />

i 7,8, t 1,2,...24 .<br />

Inflow water can be used for charging reservoir up to the<br />

maximal reservoir height V i,max<br />

or used in combination<br />

with flow gained from discharging reservoirs. However it<br />

must be considered that in the observed hour t the<br />

reservoirs values<br />

t<br />

V i must be between minimal or<br />

maximal allowed value <strong>of</strong> the individual reservoir. All the<br />

reservoirs also have the prescribed maximal discharging<br />

value.<br />

The hydro generator output power is expressed as<br />

<br />

t<br />

i i,1 <br />

t<br />

i<br />

2<br />

i,2 <br />

t<br />

i<br />

2<br />

i,3 <br />

t<br />

i <br />

t<br />

i i,4 <br />

t<br />

i<br />

ci,5 t<br />

Qi ci,6<br />

P c V c Q c V Q c V<br />

,(5)<br />

<br />

where c represents the hydropower generation<br />

t<br />

coefficient. In cases where the inflow Q i,TI<br />

is larger than<br />

the maximal allowed flow through the turbines <strong>of</strong> the i-th<br />

HPP and the reservoir level t<br />

V i reaches the maximal<br />

t<br />

value allowed, then the overflow Q i,O<br />

is unavoidable and<br />

it can be expressed as<br />

t t t t<br />

Qi,O Qi,TI Qi<br />

Pi,max ,<br />

i 1,2,...8, t 1,2,...24<br />

(6)<br />

t t<br />

where i i,max<br />

<br />

Q P is the flow throughout the turbines,<br />

which provides the maximal output power. Power<br />

generation consider also the head effect,<br />

t t t<br />

Hi HVi Hi,O<br />

i 1,2...8, t 1,2,...24<br />

,<br />

(7)<br />

t<br />

where H i is the difference between the inlet and outlet<br />

t<br />

H V i<br />

t<br />

is the level <strong>of</strong> reservoir at volume V i and<br />

head, <br />

t<br />

i,O<br />

H is the level <strong>of</strong> the outlet. Both levels are expressed<br />

with the polynomial <strong>of</strong> the sixth degree:<br />

- 399 - 15th IGTE Symposium 2012<br />

V2,max<br />

t<br />

t<br />

V Q<br />

2<br />

2,TI<br />

Figure 1: Layout <strong>of</strong> two hydropower plants<br />

t<br />

Q2,O<br />

HPP 2<br />

dam<br />

t 2 <br />

H V<br />

t<br />

Q2,T<br />

<br />

<br />

3<br />

t <br />

2<br />

t <br />

1<br />

t<br />

6 5 4<br />

t t t t<br />

i,O i,1 i i,2 i i,3 i<br />

H k Q k Q k Q<br />

k Q k Q k Q k<br />

i,4 i i,5 i i,6 i i,7<br />

t<br />

Q2<br />

, (8)<br />

where ki are the coefficients <strong>of</strong> the polynomial obtained<br />

by experimental measurements <strong>of</strong> each reservoirs and<br />

provided by DEM personnel.<br />

III. OPTIMIZATION ALGORITHM<br />

Differential evolution (DE) algorithm has been used as<br />

effective global optimizer and was proposed by R. Storn<br />

and K. Price [3]. The main steps <strong>of</strong> DE algorithm are<br />

initialization, mutation, crossover, evaluation and<br />

selection. The initialization step is defined as a randomly<br />

chosen population. Each individual xi <strong>of</strong> the initial<br />

population is composed <strong>of</strong> j variables:<br />

x jG , x j,upp rand(0,1) ( x j,upp xj,low<br />

)<br />

(9)<br />

j 1,2..., D<br />

xiG<br />

, x1, x2,... xD<br />

(10)<br />

i 1,... NP<br />

where UPP x j,upp<br />

and LOW j,low<br />

x are upper and lower<br />

bounds defined for each variable x j , G denotes<br />

generation, NP number <strong>of</strong> population, D number <strong>of</strong><br />

parameters or problem dimension and i the number <strong>of</strong> the<br />

population member and individual, respectively. The<br />

population size depends on number <strong>of</strong> the problem<br />

variables D and parameters <strong>of</strong> the objective function,<br />

respectively.<br />

For the proposed mathematical model the optimization<br />

algorithm has upper and lower bounds defined as minimal<br />

and maximal value <strong>of</strong> the individual reservoirs.<br />

Therefore, after the initialization, the population is<br />

composed <strong>of</strong> NP D-dimensional vectors:<br />

1 24 1 24<br />

xiG<br />

, Vi,1 ,... Vi,1 ,... Vi,8 ,... V <br />

i,8<br />

<br />

(11)<br />

i 1,2... NP<br />

where V is the volume <strong>of</strong> individual HPP reservoir in<br />

time t. At the initialization step <strong>of</strong> DE the volumes are<br />

randomly chosen for each individual HPP and for each<br />

individual hour in 24 hour period. Therefore the


dimension <strong>of</strong> the problem D is 192 and the population<br />

size is five times larger. Therefore the population size is<br />

960.<br />

The mutation stem if followed after the initialization<br />

step. For each target individual and sometimes referred to<br />

as vector x iG , , the mutant vector is created according to<br />

the selected strategy. The applied strategy in this paper is<br />

formulated as<br />

viG , xiG , Fxbest, GxiG , Fxr 1, Gxr2, G<br />

,<br />

i 1,2..., NP<br />

(12)<br />

where xr and x<br />

1 r are randomly chosen individuals from<br />

2<br />

interval [1,NP], x best,G represents the best individual <strong>of</strong><br />

the generation G and F is the weight.<br />

The following step is crossover, where for the each<br />

mutant vector a new trial vector u iG , is produced via<br />

“binary” crossover:<br />

vi, j, G if rand(0,1) CR or j jrand<br />

<br />

ui,<br />

j, G<br />

<br />

xi, j, G if rand(0,1) CR or j jrand<br />

<br />

i 1,2..., NP, j 1,2,...,<br />

D<br />

(13)<br />

where CR is crossover constant selected by the user. The<br />

j rand is a randomly chosen integer from interval [1,…D],<br />

which ensures that the trial vector obtains at least one <strong>of</strong><br />

the parameters from the mutant vector.<br />

In the last step, known as selection, DE evaluates trial and<br />

target vector, commonly referred to as parent vector:<br />

<br />

iG , if f iG , f , <br />

<br />

u u xiG<br />

<br />

xiG<br />

, 1<br />

<br />

xiG , if f uiG , f x (14)<br />

iG , <br />

i 1,2..., NP<br />

where the lower objective function value occupies the<br />

position in next generation (G+1). This comparison is<br />

made for each <strong>of</strong> NP individuals and the new population<br />

in generation G+1 is selected and steps <strong>of</strong> DE algorithm<br />

start once again in the following order; mutation,<br />

crossover, evaluation and selection. The algorithm repeats<br />

all steps until one <strong>of</strong> the stopping criterions is reached.<br />

DE control parameters F, CR and strategy are selected<br />

by the user and have an important influence on the<br />

convergence time, global or local search and manner <strong>of</strong><br />

creating new mutants. Use <strong>of</strong> the standard DE for solving<br />

the presented optimization problem may not always lead<br />

towards the global solution, regardless <strong>of</strong> the effort given<br />

in order to choose the adequate control parameters. The<br />

algorithm can be easily trapped into local optimum and<br />

also the convergence time can be drastically increased. In<br />

order to overcome these problems the modified algorithm<br />

uses self-adaptive F and CR. For the initial generation<br />

both <strong>of</strong> the control parameter are selected by the user and<br />

vary along with the iteration number according to (15)<br />

and (16):<br />

FRif f( xbest, G1) f( xbest,<br />

G)<br />

<br />

FiG<br />

, 1<br />

<br />

FiG , Otherwise.<br />

(15)<br />

<br />

i 1,2..., NP<br />

If the algorithm finds a better solution in generation G<br />

- 400 - 15th IGTE Symposium 2012<br />

compared to generation G - 1, then a randomly selected<br />

FR is employed in generation G + 1.<br />

CRR if FiG , 1<br />

FiG , and rand(0,1)<br />

<br />

CRiG<br />

, 1<br />

<br />

CRiG , Otherwise.<br />

(16)<br />

<br />

i 1,2..., NP<br />

A random CRR in generation G+1 is also provided if a<br />

FR is previously employed and at the same time a<br />

randomly selected value from interval [0, 1] is lower than<br />

0.1 . The described modification loads toward the<br />

global solution and improves the convergence time. A<br />

further improvement in convergence time can be achieved<br />

by parallel computation.<br />

The presented optimization problem is a multiobjective<br />

problem [7], where three different objectives<br />

are merged into a single one by using the weighted sum<br />

method [8]. The first goal <strong>of</strong> the optimization process is<br />

the satisfied demand for 24 hours by scheduling<br />

reservoirs <strong>of</strong> cascade HPP. The satisfied demand should<br />

be followed by the decreased usage <strong>of</strong> water quantity per<br />

3<br />

electrical energy unit ( m MWh) which represents the<br />

second objective. The third objective represents the<br />

decreased and eliminated overflow, respectively. The<br />

objective function for each individual objective is<br />

expressed as:<br />

2<br />

24 8 <br />

<br />

<br />

t t<br />

1 demand i,opt<br />

<br />

<br />

<br />

<br />

<br />

t1 i1<br />

<br />

1<br />

f W W <br />

<br />

24<br />

<br />

8 24 <br />

t Qi,T<br />

<br />

<br />

i1 t1<br />

<br />

2 <br />

<br />

8 24 <br />

t Wi,opt<br />

<br />

<br />

i1 t1<br />

<br />

8 24 <br />

t<br />

3 i,O<br />

<br />

i1 t1<br />

f<br />

f Q<br />

<br />

(17)<br />

t<br />

Demand energy Wdemand and optimal production energy<br />

t<br />

i,opt<br />

W is formulated as a product <strong>of</strong> power P and time t<br />

W PtWh (18)<br />

The unified objective function f is defined as<br />

f f1w1 f2w2 f3w3 (19)<br />

where each individual objective is normalized and<br />

weights are set according to the selected priority <strong>of</strong> the<br />

individual objective. The values selected for a given<br />

problem were 0.6, 0.15 and 0.25, respectively.<br />

IV. RESULTS<br />

The proposed modified DE algorithm has been used in<br />

order to achieve globally optimal production <strong>of</strong> the<br />

cascade <strong>of</strong> the HPP and to satisfy the demand,<br />

respectively. The test data used was a real 24 hours<br />

demand plan from SCADA. It has been shown in Table I<br />

and it is valid for the observed day in the past and<br />

practically realized by scheduling reservoirs.


Time<br />

- 401 - 15th IGTE Symposium 2012<br />

Table I: The satisfied demand by scheduling reservoirs for the dispatcher, standard and modified DE<br />

Demand scheduling the<br />

reservoirs by dispatcher<br />

(real data from SCADA )<br />

Energy<br />

( MWh )<br />

Satisfied demand by scheduling<br />

reservoirs - standard DE<br />

Satisfied demand by scheduling<br />

reservoirs - modified DE<br />

Water discharge Energy Water discharge Energy Water discharge<br />

( /h ) ( MWh ) ( / h ) ( MWh ) ( / h )<br />

1 19.8 828000 0 0 19.0 738446<br />

2 6.8 349200 6.0 284205 7.0 218980<br />

3 0 0 0 0 0 0<br />

4 0 0 0 0 0 0<br />

5 0 0 0 0 0 0<br />

6 0 0 3.0 154185 0 0<br />

7 89.2 2080800 39.0 1372293 89.0 2106432<br />

8 368.6 8355600 368.1 9151060 368.6 8696690<br />

9 417.5 10018800 416.4 10288591 417.5 10553592<br />

10 315.5 7592400 314.7 6764925 315.5 8525579<br />

11 244.2 5968800 244.0 5853691 244.3 5827843<br />

12 313.3 7408800 312.7 7426771 313.3 7453704<br />

13 322.4 7740000 321.9 7392661 322.4 7548595<br />

14 332.7 8089200 332.4 8707231 332.6 7625784<br />

15 320.1 7776000 319.6 7223638 320.2 8137026<br />

16 306.7 7315200 306.4 6806393 306.7 6557976<br />

17 403.3 9518400 403.2 9281914 403.3 9743537<br />

18 397.2 9378000 396.7 9344417 397.1 9186977<br />

19 391.9 9381600 391.4 9250750 391.9 9260050<br />

20 306.5 7452000 306.6 6604082 306.5 7079875<br />

21 290.1 7185600 289.6 6610318 290.0 7440151<br />

22 272.3 6494400 272.2 5624925 272.3 5849220<br />

23 265.2 6091200 265.1 6813626 265.2 5808105<br />

24 249.0 5763600 248.6 5207242 248.9 5301717<br />

Total 5632.2 134787600 5557.4 130162918 5631.3 133660279<br />

The scheduling has been made by the dispatch personnel<br />

<strong>of</strong> the DEM Company. This data has been used as a<br />

reference followed by optimization algorithm – the DE<br />

and the modified DE – with the objective to satisfy the<br />

given demand by determining the optimal production <strong>of</strong><br />

individual HPP during the 24 hour period. According to<br />

the results from Table I, the modified DE compared to<br />

manual dispatch saved approximately 1.12 million<br />

3<br />

m <strong>of</strong><br />

water, which equals to approximately 50 MWh less <strong>of</strong><br />

potential energy used.<br />

Authors [8] showed DE’s control parameters impact on<br />

convergence and global optimization performance. By<br />

using smaller F values, a local optimum can be reached<br />

faster, while a global one can be reached by choosing<br />

larger values. Selection <strong>of</strong> larger CR values can reduce a<br />

convergence time. In order to improve algorithm’s<br />

performance on a given optimization problem, a search<br />

for suitable DE control parameters was not successful,<br />

although the best result have been obtained by using the<br />

following parameters F = 0.5, CR = 0.8 and strategy = 3.<br />

However, the standard DE was not able to achieve the<br />

global optimum and to minimize the difference between<br />

the demand and the production down to zero,<br />

respectively.<br />

Authors [9] have shown the benefits <strong>of</strong> self-adaptive<br />

parameters control. This was a solid research direction in<br />

this paper and in order to achieve the global optimum, the<br />

modified DE with self-adjusting F and CR has been<br />

proposed. The modified DE (Fig. 2) stopped the<br />

evolution process after 200 generations with the<br />

convergence time <strong>of</strong> 280 seconds, while the standard DE<br />

(Fig. 3) stopped after 1000 generations and 1050 seconds.<br />

The stopping criterion in this case was 500 generations<br />

without <strong>of</strong> any change in objective function value.<br />

The final result at the end <strong>of</strong> the optimization process by<br />

using the modified DE algorithm is an optimal 24 h<br />

production <strong>of</strong> each individual HPP. Such a production<br />

completely satisfies the demand. The optimal 24 hours<br />

production <strong>of</strong> individual HPP is shown in Fig.4 and the<br />

corresponding reservoir volumes during the 24 hours<br />

period are shown in Fig. 5.


Objectives<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0 20 40 60 80 100 120 140 160 180 200<br />

Generation<br />

Figure 2: Convergence <strong>of</strong> the unified and three individual<br />

objective functions values by using the modified DE<br />

Electrical Enegy Production [MWh]<br />

450<br />

400<br />

350<br />

300<br />

250<br />

200<br />

150<br />

100<br />

50<br />

HPP1<br />

HPP2<br />

HPP3<br />

HPP4<br />

HPP5<br />

HPP6<br />

HPP7<br />

HPP8<br />

Demand<br />

0<br />

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24<br />

Time [h]<br />

Figure 4: Optimal production <strong>of</strong> individual HPP proposed<br />

by the modified DE<br />

Vmax<br />

Volume [m 3 ]<br />

HPP 1<br />

HPP 2<br />

HPP 3<br />

HPP 4<br />

HPP 5<br />

HPP 6<br />

HPP 7<br />

HPP 8<br />

Vmin<br />

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24<br />

Time [h]<br />

Figure 5: Charging and discharging <strong>of</strong> reservoirs during<br />

the optimal production <strong>of</strong> individual HPP<br />

V. CONCLUSION<br />

The modified DE algorithm in this paper was capable<br />

<strong>of</strong> solving complex optimization problem. As shown on<br />

the presented optimization problem, the algorithm was<br />

f<br />

f<br />

1<br />

f 2<br />

f 3<br />

- 402 - 15th IGTE Symposium 2012<br />

Objectives<br />

1.4<br />

1.2<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0 100 200 300 400 500 600 700 800 900 1000<br />

Generation<br />

Figure 3: Convergence <strong>of</strong> the unified and three individual<br />

objective functions values by using the standard DE<br />

able to satisfy the demand along with the fast<br />

convergence speed. The optimization problem was<br />

observed in period <strong>of</strong> 24 hours and includes 8 HPP’s.<br />

Therefore it has 192 variables that need to be identified.<br />

Despite this fact, the algorithm ensures the exploration <strong>of</strong><br />

large solution space in time span <strong>of</strong> several minutes and<br />

provides the global solution. The dispatch personnel is<br />

unable to explore such a large solution space and the<br />

production <strong>of</strong> individual HPP is determined on the basis<br />

<strong>of</strong> previous experiences and therefore, non-optimally. By<br />

using the modified DE algorithm, the dispatch personnel<br />

obtain the guide which improves the overall production<br />

efficiency. The algorithm can also be used to indicate<br />

whether a given demand plan is feasible.<br />

REFERENCES<br />

[1] S. Vazquez, S. M. Lukic, E. Galvan, L. G. Franquelo and J. M.<br />

Carrasco “Energy Storage System for Transport and Grid<br />

Applications”, IEEE Transactions on industrial electronics, Vol.<br />

57, pp. 3881-3895, December 2010.<br />

[2] I. A. Farhat, M.E. El-Hawary, “Optimization methods applied for<br />

solving the short-term hydrothermal coordination problem,”<br />

Electric Power System Research, pp. 1308-1320, 2009.<br />

[3] R. Storn, K. Price, “Differential Evolution – A simple and<br />

efficient adaptive scheme for global optimization over continuous<br />

spaces,” Journal <strong>of</strong> Global Optimization, pp. 341-359, 1997.<br />

[4] X. Yuan, Y. Zhang, L. Wang, Y. Yuan, “An enhanced differential<br />

evolution algorithm for daily optimal hydro generation<br />

[5]<br />

scheduling,” Computers and Mathematics with Applications, pp.<br />

2458-2468, 2008.<br />

L. Lakshminarasimman, S. Subramanian, “Short-term scheduling<br />

<strong>of</strong> hydrothermal power system cascaded reservoirs by using<br />

modified differential evolution,” IEEE Proc.-Gener. Transm.<br />

Distrib., Vol. 153, pp. 693-700, 2006.<br />

[6] Y. Li, J. Zuo, “Optimal Scheduling <strong>of</strong> Cascade Hydropower<br />

System Using Grouping Differential Evolution Algorithm,”<br />

International Conference on Computer Science and Electronic<br />

Engineering, pp. 625-629, 2012.<br />

[7] J. Grobler, A.P. Engelbrecht, V.S.S. Yadavalli, “Multi-objective<br />

DE and PSO Strategies for Production Scheduling,” IEEE<br />

Congres on Evolutionary Computation, pp. 1154-1161, 2008.<br />

[8] R. Gamperle, S.D. Muller, P. Koumoutsakos, “A parameter Study<br />

for Differential Evolution,” Conf. on Adances in Intelligent<br />

System, Fuzzy Systems, pp. 293-298,2002.<br />

[9] J. Brest, V. Zumer, M.S. Maucec, “Self-Adaptive Differential<br />

Evolution Algorithm in Constrained Real-Parameter<br />

Optimization,” IEEE Conges on Evolutionary Computation, pp.<br />

215-222, 2006.<br />

f<br />

f<br />

1<br />

f 2<br />

f 3


- 403 - 15th IGTE Symposium 2012<br />

Adaptive Surrogate Approach for Bayesian<br />

Inference in Inverse Problems<br />

M. Neumayer∗ ,H.R.B.Orlande ‡ ,M.J.Colaço ‡ , D. Watzenig∗ ,G.Steiner∗ , B. Brandstätter † ,andG.S.<br />

Dulikravich §<br />

∗Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing, <strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, <strong>Graz</strong>,<br />

Austria, † Elin Motoren GmbH, Elinmotorenstrasse 1, A-8160 Preding/Weiz, Austria, ‡ Department <strong>of</strong> Mechanical<br />

Engineering, Federal <strong>University</strong> <strong>of</strong> Rio de Janeiro, UFRJ Rio de Janeiro, RJ, Brazil, § Department <strong>of</strong> Mechanical<br />

and Materials Engineering, Florida International <strong>University</strong> Miami, Florida, U.S.A.<br />

E-mail: neumayer@TU<strong>Graz</strong>.at<br />

Abstract—Bayesian inference forms a flexible and versatile solution strategy for inverse problems. Its advantage lies in the<br />

straight forward formulation <strong>of</strong> the solution process, the ability to incorporate any existing knowledge, as well as in the<br />

output <strong>of</strong> the method itself, which provides statistical knowledge about unknown parameters. The costs <strong>of</strong> the mentioned<br />

benefits are <strong>of</strong>ten largely increased numerical efforts due to the use <strong>of</strong> sampling methods. This especially holds if the<br />

underlying physical problem requires the solution <strong>of</strong> a partial differential equation. In this paper we present a simple, yet<br />

versatile and effective strategy to accelerate Bayesian inference using an adaptive surrogate approach.<br />

Index Terms—surrogate technique, adaptive, Bayesian inference, MCMC<br />

I. INTRODUCTION<br />

Inverse problems and parameter estimation problems<br />

belong to the class <strong>of</strong> indirect measurement problems<br />

where one tries to estimate a parameter vector x ∈ R N<br />

from observations ˜ d ∈ R M [1]. They arise in many<br />

disciplines <strong>of</strong> engineering and science. The term inverse<br />

problem is most <strong>of</strong>ten associated with imaging techniques<br />

like electrical capacitance/impedance tomography (ECT<br />

and EIT), or computed tomography, but their mathematical<br />

common is their inherent ill-posed nature. Parameter<br />

estimation has not that massive association with imaging<br />

like inverse problems, but is in many ways <strong>of</strong> even larger<br />

importance in engineering. Such an example is given by<br />

the determination <strong>of</strong> material parameters from ”simple”<br />

a simple measurements setup.<br />

Formally the physical measurement process P can be<br />

denoted by P : x ↦→ ˜ d. Hereby ˜ d is the corrupted version<br />

<strong>of</strong> the otherwise noise free measurements d. Formost<br />

practical examples an additive noise model <strong>of</strong> form ˜ d =<br />

d + v is valid, where v ∈ R M follows a certain noise<br />

distribution described by a probability density function<br />

(pdf). The modern model based approach to estimate x<br />

from ˜ d maintains a model F : x ↦→ y (y ∈ R M )whichis<br />

referred to as forward map. For most real world problems<br />

this is a computer model solving the underlying partial<br />

differential equations (PDEs) for P in a numerical way.<br />

In this paper we will assume that P = F holds.<br />

Classical deterministic inversion methods then manipulate<br />

the vector x in order to minimize some useful norm<br />

<strong>of</strong> the residual vector e = y − ˜ d. In addition ill-posed<br />

problems require a regularization term for the numerical<br />

stabilization <strong>of</strong> such an optimization problem. The single<br />

result <strong>of</strong> the approach is referred to as point estimate,<br />

which we will denote by xMAP (maximum a posteriori)<br />

as the value <strong>of</strong> x which provides the smallest misfit.<br />

A contrastable approach to solve inverse problems is<br />

provided by the framework <strong>of</strong> Bayesian inference [2].<br />

Rather than providing a single result, Bayesian inference<br />

approaches provide the summary distribution π(x| ˜ d) (the<br />

posterior distribution, MAP). Out <strong>of</strong> this any statistics,<br />

like mean, variance, correlations, MAP estimates, etc.<br />

about x can be computed. The cost for this gain in<br />

information are the increased computational costs, as<br />

the approach requires numerous evaluations <strong>of</strong> F .This<br />

especially holds for the case that Markov chain Monte<br />

Carlo (MCMC) methods are applied. Thus, the practicability<br />

for the application <strong>of</strong> Bayesian methods is limited<br />

if the evaluation <strong>of</strong> F requires computational expensive<br />

operations like the numerical solution <strong>of</strong> PDEs.<br />

In this paper we will present a simple and versatile<br />

strategy to speed up Bayesian inference for inverse problems<br />

and parameter estimation problems. The speed up<br />

is provided by the use <strong>of</strong> an approximation or surrogate<br />

model [3]. The paper is structured as follows. In section<br />

II we will introduce the theory about Bayesian inversion<br />

and the exploration <strong>of</strong> the posterior distribution by the<br />

Metropolis Hastings (MH) algorithm. In section III we<br />

explain an acceleration approach for the MH which is<br />

based on the use <strong>of</strong> approximations. Finally we will<br />

present a numerical example where we estimate thermal<br />

material parameters from a heated slab.<br />

II. BAYESIAN INFERENCE AND MARKOV CHAIN<br />

MONTE CARLO<br />

In this section we will present the framework <strong>of</strong><br />

Bayesian inversion for the solution <strong>of</strong> inverse problems.<br />

Having measurements ˜ d from a measurement process P<br />

and a model F to simulate P , the solution process is


marked by the use <strong>of</strong> Bayes law [1]<br />

π(x| ˜ d)= π(˜ d|x)π(x)<br />

π( ˜ ∝ π(<br />

d)<br />

˜ d|x)π(x). (1)<br />

The law connects the so called likelihood function<br />

π( ˜ d|x) and the prior π(x) to formulate the posterior<br />

distribution π(x| ˜ d). π( ˜ d) is termed the evidence and<br />

has the role <strong>of</strong> a normalization constant to ensure the<br />

property <br />

RN π(x| ˜ d)dx =1<strong>of</strong> a pdf. Hence, it can be<br />

skipped leading to the right hand formula in equation (1).<br />

The likelihood function π( ˜ d|x) provides the probability<br />

measure for x causing the data ˜ d given the model and<br />

statistical knowledge about the measurement noise. For<br />

an additive noise model ˜ d = d + v, the likelihood is<br />

given by π( ˜ d|x) =πv(y − ˜ d), whereyisthe output<br />

<strong>of</strong> the forward map. For many practical problems zero<br />

mean white Gaussian noise, i.e. v ∝N(0, Σv), where<br />

Σv is the covariance matrix, can be assumed. Then the<br />

likelihood function becomes<br />

π( ˜ <br />

d|x) ∝ exp − 1<br />

<br />

y −<br />

2<br />

˜ T <br />

−1<br />

d Σ y − ˜ <br />

d<br />

<br />

, (2)<br />

where Σ is set to Σv.<br />

The prior π(x) provides a probability measure about x<br />

being the solution. While the design <strong>of</strong> the likelihood has<br />

to follow strict mathematical rules due to its definition<br />

the prior provides a very flexible way to incorporate<br />

expert knowledge about x. I.e. if we know that the ith<br />

component <strong>of</strong> x has a lower and an upper bound the<br />

corresponding prior is given by the uniform distribution<br />

xi ∝U(xi,min,xi,max). For the case that a mean value<br />

about the j-th component is known a Gaussian distribution<br />

xj ∝N(μxj ,σxj ) can be used to express the prior<br />

where σxj controls the deviation.<br />

The posterior π(x| ˜ d) expresses the probability for x<br />

being the solution given the data ˜ d, the model and the<br />

prior. Rather than a single result, the posterior covers<br />

all possible solutions. For a post analysis <strong>of</strong> π(x| ˜ d) one<br />

could look at a specific realization <strong>of</strong> x and evaluate its<br />

probability. However, as this procedure is not <strong>of</strong> big use<br />

some meaningful point measures have become popular.<br />

One <strong>of</strong> them is the maximum a posteriori (MAP) estimate<br />

xMAP =argmaxx π( ˜ d|x), which is the mode <strong>of</strong> the<br />

posterior. The other one is the conditional mean (CM)<br />

estimate<br />

<br />

xCM = xπ( ˜ d|x)dx, (3)<br />

R N<br />

which summarizes the complete distribution. It can be<br />

easily seen, that the MAP estimate can be found by<br />

solving an optimization problem by either maximizing<br />

the posterior, or minimizing its logarithm. This corresponds<br />

to classical regularized approaches except, that<br />

the likelihood introduces statistical knowledge about the<br />

noise. This fact results in generally higher modeling<br />

efforts when using Bayesian methods. The CM estimate<br />

requires the evaluation <strong>of</strong> a high dimensional integral.<br />

An analytic solution <strong>of</strong> the integral is <strong>of</strong>ten not possible,<br />

- 404 - 15th IGTE Symposium 2012<br />

as the integral is <strong>of</strong> high dimension and also because<br />

<strong>of</strong> the complicated interaction <strong>of</strong> the forward map. Also<br />

standard numerical schemes like the well known Gauss<br />

quadrature cannot be applied for such integrals, due to<br />

the lack <strong>of</strong> knowledge about the support. The numerical<br />

tool to solve such integrals is known as Monte Carlo<br />

integration. Hereby a set <strong>of</strong> samples x (N) from the posterior<br />

is generated, where the frequency <strong>of</strong> the samples<br />

follows the target distribution. Then the CM integral can<br />

be approximated by<br />

xCM =<br />

<br />

R N<br />

xπ( ˜ d|x)dx ≈<br />

N<br />

i=1<br />

x (N)<br />

i . (4)<br />

In the same way any other integral (also about functions<br />

<strong>of</strong> x) can be solved. The generation <strong>of</strong> samples from<br />

a distribution belongs to the discipline <strong>of</strong> computational<br />

Bayesian inference and will be discussed in the following<br />

subsection.<br />

One important aspect about the Bayesian framework<br />

which was not stated so far is the possibility to treat<br />

nuisance parameters ν in the same way as the state vector<br />

x. I.e.iftheforwardmapF is in fact a function Fν(x)<br />

it is possible to do inference about both, x and ν in<br />

the same natural way. This can be used if parameters<br />

<strong>of</strong> a measurement system are unknown or provide an<br />

uncertain factor.<br />

A. The Metropolis Hastings (MH) Algorithm<br />

Algorithms for practical computational Bayesian inference<br />

are typically sampling algorithms [4]. They can be<br />

seen as random number generators which compute independent<br />

samples from an arbitrary target distribution. For<br />

inverse problems the target distribution is the posterior.<br />

On a discrete state space the frequency <strong>of</strong> certain samples<br />

corresponds to the probability <strong>of</strong> the sample, enabling the<br />

powerful tool <strong>of</strong> Monte Carlo integration. As can be seen<br />

by equation (2), the evaluation <strong>of</strong> π( ˜ d|x) requires one<br />

evaluation <strong>of</strong> the forward map F . This already indicates<br />

the fact, that sampling methods result in generally higher<br />

computational cost. For computational inference a class<br />

<strong>of</strong> algorithms termed MCMC methods were developed,<br />

as they rely on an underlying Markov chain X. TheMH<br />

algorithm [5] is one prominent example out <strong>of</strong> this class<br />

<strong>of</strong> methods. The algorithm works as the following:<br />

1) Pick the current state x = Xn from the Markov<br />

chain.<br />

2) With proposal density q(x, x ′ ) generate a new<br />

state x ′ . <br />

3) Compute α = min 1, π(x′ | ˜ d)q(x ′ ,x)<br />

π(x| ˜ d)q(x,x ′ <br />

.<br />

)<br />

4) With probability α accept x ′ and set Xn+1 = x ′ ,<br />

otherwise reject x ′ and set Xn+1 = x.<br />

Starting from the current state x <strong>of</strong> the Markov chain<br />

(line 1) X the MH algorithm generates a proposal<br />

candidate x ′ (line 2) using the proposal kernel q(x, x ′ ).<br />

Then the acceptance ration α is evaluated in line 3 for the<br />

proposal x ′ , which requires one evaluation <strong>of</strong> the forward


map. If the proposal is accepted it becomes the new state<br />

<strong>of</strong> the Markov Chain, otherwise it gets rejected. The<br />

rejection <strong>of</strong> proposal candidates is critical with respect<br />

to the computational efficiency <strong>of</strong> the MH algorithm, as<br />

a high rejection rate, leads to a large number <strong>of</strong> forward<br />

map evaluations without generating a new state. This is<br />

strongly affected by the proposal kernel q(x, x ′ ) which<br />

drives the exploration <strong>of</strong> the posterior distribution.<br />

III. ACCELERATION OF THE MH USING SURROGATES<br />

The strategy we use to speed up the classical MH<br />

algorithm is based on the use <strong>of</strong> an approximation or<br />

surrogate F ∗ [6]. An approximation F ∗ has a considerable<br />

lower runtime with respect to F but at the cost<br />

<strong>of</strong> an approximation error e = y − y∗ . Subsequently<br />

we introduce the likelihood function π∗ ( ˜ d|x) to indicate<br />

the use <strong>of</strong> F ∗ . Then the delayed acceptance Metropolis<br />

Hastings (DAMH) algorithm [7] is given by<br />

1) Pick the current state x = Xn from the Markov<br />

chain.<br />

2) With proposal density q(x, x ′ ) generate a new<br />

state x ′ . <br />

3) Compute α = min 1, π∗ (x ′ | ˜ d)q(x ′ ,x)<br />

π∗ (x| ˜ d)q(x,x ′ <br />

.<br />

)<br />

4) With probability α accept x ′ to be a proposal for<br />

the standard MH algorithm. Otherwise set x ′ = x<br />

and return to 2. <br />

5) Compute β = min 1, π(x′ | ˜ d)q(x ′ ,x)<br />

π(x| ˜ d)q(x,x ′ <br />

.<br />

)<br />

6) With probability β accept x ′ and Xn+1 = x ′ ,<br />

otherwise reject x ′ and set Xn+1 = x.<br />

As can be seen, the DAMH algorithm consists <strong>of</strong> two<br />

nested MH algorithms (in the original MH algorithm<br />

step 3 and 4 do not exist). The DAMH tries to gain<br />

its advantage from a pre-evaluation <strong>of</strong> the proposal candidates<br />

x ′ on the distribution π∗ (x ′ | ˜ d). An evaluation<br />

<strong>of</strong> π(x ′ | ˜ d) in the inner MH is only performed if the<br />

proposal is accepted in the outer MH. In this sense the<br />

outer MH algorithm <strong>of</strong> the DAMH can be seen as a filter<br />

for bad proposals or as an improved proposal generator.<br />

It is obvious that the gain in performance gain strongly<br />

depends on the difference between π∗ (x ′ | ˜ d) and π(x ′ | ˜ d).<br />

An interesting point about the DAMH is the availability<br />

<strong>of</strong> the deterministic approximation error e = y − y∗ in line 5. This knowledge can be used to improve the<br />

algorithm by two points:<br />

• Learn about the approximation error to adapt the<br />

likelihood π∗ (x ′ | ˜ d).<br />

• Adapt the approximation F ∗ to improve the quality.<br />

The approach to incorporate knowledge about the approximation<br />

error is referred to as enhanced error model<br />

(EEM) [1]. Hereby the deterministic approximation error<br />

e is treated as a random variable. Mostly a Gaussian<br />

distribution about e is assumed, describing the error as<br />

e ∝N(μe, Σe). Then the likelihood function π∗ (x ′ | ˜ d)<br />

becomes π∗ ( ˜ d|x) ∝<br />

<br />

exp − 1<br />

<br />

y<br />

2<br />

∗ + μe − ˜ T <br />

−1<br />

d Σ y ∗ + μe − ˜ <br />

d<br />

<br />

,<br />

(5)<br />

- 405 - 15th IGTE Symposium 2012<br />

where Σ is the sum <strong>of</strong> Σv and Σe. In its original idea the<br />

distribution N (μe, Σe) is computed using samples over<br />

the prior π(x). However, with the availability <strong>of</strong> current<br />

value <strong>of</strong> en in the DAMH an adaptive approximation<br />

error model can be built by [8]<br />

μe,n = 1 <br />

(n − 1)μe,n−1 + en , (6)<br />

n<br />

Ce,n = Ce,n−1 + ene T n , (7)<br />

1 <br />

Σe,n = (n − 1)Ce,n − nμ<br />

n − 1<br />

e,nμ T <br />

e,n . (8)<br />

Due to this the likelihood π∗ ( ˜ d|x) adapts to the posterior<br />

during the runtime, which means that no sampling <strong>of</strong><br />

π(x) is necessary to built N (μe, Σe) in the priming <strong>of</strong><br />

the solution process for the data ˜ d.<br />

The second point addresses the possibility to use<br />

the knowledge about e to improve the quality <strong>of</strong> the<br />

approximation F ∗ during the runtime. A considerable<br />

simple update is possible if F ∗ is <strong>of</strong> form y ∗ = Pxa.<br />

Hereby xa denotes the augmented state vector, which<br />

holds x in an adequate form, i.e. arbitrary functions<br />

<strong>of</strong> the components <strong>of</strong> x or additional variables like the<br />

simulation time for transient problems.<br />

Thus, the approximation can be turned nonlinear with<br />

respect to x but it is linear with respect to the elements <strong>of</strong><br />

P . This is important, as for this class <strong>of</strong> approximations<br />

a number <strong>of</strong> update algorithms exist. In this work we use<br />

the least mean squares (LMS) algorithm given by [9]<br />

P n+1 = P n + γenx T a,n , (9)<br />

where γ is a step width parameter known as adaptation<br />

coefficient. The simpleness <strong>of</strong> the LMS update provides<br />

almost no computational costs and helps to improve<br />

the quality <strong>of</strong> the the approximation F ∗ for the<br />

posterior distribution. Again the initial matrix P can<br />

be determined by samples over the prior distribution<br />

π(x). For this the overdetermined equation system<br />

X aP T = Y has to be assembled and solved, where the<br />

matrix X a holds the augmented state vectors from the<br />

samples, and Y contains the exact solutions evaluated<br />

by F . There is also the possibility to run the standard<br />

MH for some time to learn about P and then switch<br />

to the DAMH. The choice <strong>of</strong> the adaptation parameter<br />

γ affects the learning speed <strong>of</strong> the LMS algorithm. For<br />

stability reasons γ has an upper limit which depends on<br />

the problem and can only be derived under restrictive<br />

conditions. However, as an MCMC algorithm provides a<br />

enormous number <strong>of</strong> evaluations it is less critical to set<br />

μ to a small value, as even this provides an improvement<br />

(although slower) to the approximation P and the LMS<br />

algorithm operates in a stable state.<br />

To use both, the update <strong>of</strong> the approximation y ∗ =<br />

Pxa and the adaptive error model, the approximation<br />

should be reevaluated for the current state vector x.<br />

This requires a second evaluation <strong>of</strong> F ∗ , but this is<br />

computational cheap due to the design <strong>of</strong> F ∗ .


IV. A NUMERICAL EXAMPLE<br />

To demonstrate our approach on a numerical example<br />

we consider an indirect measurement problem where we<br />

want to estimate thermophysical properties <strong>of</strong> a slab<br />

from a transient heat transfer experiment. We consider<br />

a slab <strong>of</strong> length L which we model by means <strong>of</strong> a<br />

1D simulation in the domain Ω : 0 ≤ x ≤ L. The<br />

slab is initially at the uniform temperature ϑ0. Onthe<br />

left side (x = 0) a uniform heat flux J is applied<br />

by an electric heater. On the right side at x = L the<br />

temperature ϑ(L, t) is measured over time. The heat on<br />

this side is exchanged by convection with the surrounding<br />

media at the temperature ϑ0. This exchange depends on<br />

a heat transfer coefficient α in Wm −2 K −1 .Thereareno<br />

heat sources within the medium and the thermophysical<br />

properties are supposed constant in the first assumption.<br />

The mathematical formulation for this heat conduction<br />

problem is given by:<br />

1 dϑ<br />

k dt = ∂2ϑ ∂x2 −λ<br />

in 0 0 (11)<br />

∂ϑ<br />

∂x + αϑ = αϑ0 at x = L, fort>0 (12)<br />

ϑ = ϑ0 for t =0,in0


σ e<br />

- 407 - 15th IGTE Symposium 2012<br />

TABLE I<br />

SUMMARY OF THE RESULTS FOR THE LINEAR CASE.<br />

Nr. Experiment σv<br />

K<br />

μλ<br />

W<br />

mK<br />

σλ<br />

W<br />

mK<br />

μα<br />

W<br />

m<br />

σα μk1<br />

σk1<br />

Tsim,r<br />

2K W<br />

m2K Ω<br />

m2 Ω<br />

m2 true 0.12 11 4.5 × 10<br />

%<br />

−3<br />

1 F 0.1 0.12 6.6 × 10−3 11.4 0.68 4.6 × 10−3 2.3 × 10−4 100<br />

2 F ∗ 1 0.1 0.13 6.3 × 10−3 10.7 0.59 4.3 × 10−3 1.7 × 10−4 3 F<br />

75<br />

∗ 2 0.1 0.12 8.4 × 10−3 10.2 0.23 4.3 × 10−3 6.0 × 10−4 4 F<br />

12<br />

∗ 3 0.1 0.12 4.3 × 10−3 12.1 0.29 4.6 × 10−3 2.3 × 10−5 5 F 0.5 0.13 7.0 × 10<br />

12<br />

−3 12.7 1.72 4.7 × 10−3 4.2 × 10−4 100<br />

6 F ∗ 1 0.5 0.13 5.9 × 10−3 9.8 1.42 4.2 × 10−3 3.5 × 10−4 7 F<br />

110<br />

∗ 2 0.5 0.13 3.1 × 10−3 10.5 1.10 4.4 × 10−3 2.9 × 10−4 8 F<br />

44<br />

∗ 3 0.5 0.13 3.3 × 10−3 10.5 1.05 4.4 × 10−3 2.6 × 10−4 44<br />

TABLE II<br />

SUMMARY OF THE RESULTS FOR THE NONLINEAR CASE.<br />

Nr. Experiment σv<br />

K<br />

μλ<br />

W<br />

mK<br />

σλ<br />

W<br />

mK<br />

μα<br />

W<br />

m<br />

σα μk1<br />

σk1<br />

μk2<br />

σk2<br />

Tsim,rel<br />

2K W<br />

m2K Ω<br />

m2 Ω<br />

m2 W<br />

mK2 W<br />

mK2 true 0.12 11 4.5 × 10<br />

%<br />

−3 0.02<br />

1 F 0.1 0.123 5.5 × 10−3 12.0 0.31 4.8 × 10−3 1.1 × 10−4 0.013 4.6 × 10−3 100<br />

2 F 0.05 0.127 3.9 × 10−3 11.3 0.19 4.6 × 10−3 7.1 × 10−5 0.015 3 × 10−3 100<br />

3 F 0.5 0.130 9.8 × 10−3 13.4 1.42 5.2 × 10−3 4.3 × 10−4 0.018 11 × 10−3 100<br />

4 F ∗ 1 0.1 0.129 4.4 × 10−3 10.7 0.27 4.4 × 10−3 1.1 × 10−4 0.016 5.6 × 10−3 5 F<br />

97.6<br />

∗ 1 0.5 0.128 5.8 × 10−3 11.9 1.24 4.8 × 10−3 3.7 × 10−4 0.012 9.4 × 10−3 6 F<br />

140<br />

∗ 3 0.1 0.122 3.1 × 10−3 11.8 0.22 4.7 × 10−3 8.1 × 10−5 0.019 4.6 × 10−3 7 F<br />

20.9<br />

∗ 3 0.5 0.128 5 × 10−3 10.8 1.29 4.6 × 10−3 4.2 × 10−4 0.017 9.4 × 10−3 72.5<br />

0.12<br />

0.1<br />

0.08<br />

0.06<br />

0.04<br />

0.02<br />

0<br />

20<br />

15<br />

10<br />

dt (s)<br />

5<br />

10<br />

8<br />

6<br />

# FE<br />

(a) Standard deviation <strong>of</strong> e over<br />

the discretization for F ∗ 1 .<br />

4<br />

2<br />

18000<br />

16000<br />

14000<br />

12000<br />

10000<br />

Fig. 1. Approximation error e for F ∗ 1 and F ∗ 2<br />

8000<br />

6000<br />

4000<br />

2000<br />

0<br />

−1.5 −1 −0.5 0 0.5 1 1.5<br />

e<br />

(b) Distribution (pdf) <strong>of</strong> e for F ∗ 2 .<br />

over the prior.<br />

how to analyze the results we refer to [4]. For<br />

the simulation we used the following parameters:<br />

L = 0.1 m, ρ = 1040 kgm −3 , c = 1350 Jkg −1 K −1 .<br />

The ambient temperature ϑ0 was set to ϑ0 = 20 ◦ C,<br />

the electrical current I was set to I = 100 A. We<br />

assumed that ϑ(L, t) is measured every 20 seconds for<br />

3000 s. The priors for the state vector are given by a<br />

Gaussian distribution with μλ = 0.13 Wm −1 K −1 and<br />

σλ =0.01 Wm −1 K −1 for λ and uniform distributions<br />

with the boundaries 1Wm −2 K −1 ≤ α ≤ 15 Wm −2 K −1 ,<br />

0.001 Ωm −2 ≤ k1 ≤ 0.01 Ωm −2 , and<br />

0Wm −1 K −2 ≤ k2 ≤ 0.04 Wm −1 K −2 (nonlinear case)<br />

for the remaining variables. The proposal generation is<br />

done by randomly selecting a component <strong>of</strong> the state<br />

vector x. Forλ the proposal is generated from the prior<br />

about λ. Forα and k1 an additive Gaussian distributed<br />

random variable with a standard deviation being 4% <strong>of</strong><br />

the range given by the prior is added to the current state.<br />

In the nonlinear case we only use the approximations<br />

F ∗ 1 and F ∗ 3 .<br />

Figure 2 depicts the output <strong>of</strong> the Markov chain for<br />

TABLE III<br />

BEHAVIOR OF THE CHAINS FOR THE LINEAR CASE.<br />

Nr. Experiment σv Acα Ac β|α Acβ τIACT<br />

K % % %<br />

1 F 0.1 16.4 X X 392<br />

2 F ∗ 1 0.1 18.0 75.8 13.7 1620<br />

3 F ∗ 2 0.1 10.3 65.8 6.8 420<br />

4 F ∗ 3 0.1 10.2 68.0 6.9 228<br />

5 F 0.5 56.3 X X 263<br />

6 F ∗ 1 0.5 54.4 82.9 45.1 110<br />

7 F ∗ 2 0.5 38.8 78.6 30.5 44<br />

8 F ∗ 3 0.5 38.2 78.6 29.8 37<br />

TABLE IV<br />

BEHAVIOR OF THE CHAINS FOR THE NONLINEAR CASE.<br />

Nr. Experiment σv Acα Ac β|α Acβ τIACT<br />

K % % %<br />

1 F 0.1 30.5 X X 57<br />

2 F 0.05 17.2 X X 63<br />

3 F 0.5 67.9 X X 132<br />

4 F ∗ 1 0.1 33.4 79.8 26.6 38<br />

5 F ∗ 1 0.5 64.6 87.1 56.3 72<br />

6 F ∗ 3 0.1 18.8 70.9 13.4 509<br />

7 F ∗ 3 0.5 56.0 86.1 48.3 121<br />

λ. From the histogram in figure 2(b) we can see the<br />

distribution. Table I and II summarize the results for the<br />

linear and the nonlinear case including the true values for<br />

the state vector x for different standard deviations σv <strong>of</strong><br />

the additive measurement noise. As it can be observed, all<br />

estimates meet the true values with reasonable accuracy.<br />

This especially holds for the linear case. For the nonlinear<br />

case some deviations occur, but that can be linked to<br />

the complexity <strong>of</strong> the problem. An interesting effect can<br />

be seen in the standard deviations <strong>of</strong> table I. Increased<br />

noise levels have a stronger effect on σk1 with respect to<br />

the other standard deviations. In the linear case a speed<br />

improvement <strong>of</strong> up to a factor <strong>of</strong> 10 for the linear, and 5


λ (Wm −1 K −1 )<br />

0.16<br />

0.15<br />

0.14<br />

0.13<br />

0.12<br />

0.11<br />

0 1 2 3 4 5 6<br />

x 10 4<br />

0.1<br />

# MCMC<br />

(a) MCMC output for λ.<br />

Fig. 2. MCMC output and analysis for λ.<br />

1000<br />

900<br />

800<br />

700<br />

600<br />

500<br />

400<br />

300<br />

200<br />

100<br />

0<br />

0.1 0.11 0.12 0.13<br />

λ (Wm<br />

0.14 0.15 0.16<br />

−1 K −1 )<br />

(b) Histogram plot for λ.<br />

for the nonlinear case could be achieved. The speed up<br />

also depends on the noise level, as the noise has direct<br />

influence on π(x| ˜ d) and thus affects the proposal kernel<br />

is important. Table III and IV provide statistics about the<br />

behavior <strong>of</strong> the algorithms. For the MH, the ratio Acα<br />

states the percentage <strong>of</strong> accepted proposal candidates. For<br />

the DAMH it states the ratio <strong>of</strong> accepted proposals in<br />

the first step and Acβ states the overall acceptance in<br />

the second step. The value Ac β|α states the acceptance<br />

in <strong>of</strong> a proposal in the second step, given an acceptance<br />

in the first step. Hence, this number provides a quality<br />

measure for the approximation F ∗ . The almost same<br />

level <strong>of</strong> Ac β|α in line 3 and 4, and line 7 and 8 in<br />

table III indicates, that the approximation already has<br />

a high quality, and that no further improvement could<br />

be achieved by the adaption. This corresponds to the<br />

observations <strong>of</strong> figure 1. The tables III and IV also<br />

explain the larger computation times for some DAMH<br />

variants with respect to the MH. In this case a high value<br />

<strong>of</strong> Acα results in a large number <strong>of</strong> evaluations <strong>of</strong> F .<br />

Thus, the sum <strong>of</strong> all evaluations <strong>of</strong> F and F ∗ increases<br />

the pure evaluation <strong>of</strong> F only. This also indicates, that<br />

the proposal kernel is yet not optimal for sampling from<br />

the posterior. The value τIACT is referred to as integrated<br />

auto correlation time (IACT). It was computed with the<br />

methods explained in [10] and provide a measure about<br />

the statistical efficiency <strong>of</strong> an MCMC algorithm by the<br />

distance between independent samples in the Markov<br />

chain. As can be seen, the DAMH variants have a lower<br />

IACT τIACT and thus are statistically more efficient.<br />

In section IV we stated the linear dependence <strong>of</strong> the<br />

parameters due to the equations (11) and (12) and that<br />

this circumstance can be observed in the results. Figure<br />

3 depicts a scatter plot for α and k1. The correlation<br />

factor can be computed with 0.98. From this we can<br />

conclude, that the measurement system is inappropriate<br />

and a second spatial distributed measurement would be<br />

required. This is a direct conclusion from the Bayesian<br />

analysis - an optimization based solution is not able to<br />

provide such inside information.<br />

VI. CONCLUSION<br />

In this work a general approach for accelerating<br />

Bayesian inference for indirect measurement problems<br />

is presented. The approach features the use <strong>of</strong> simple<br />

approximations by incorporating error knowledge and<br />

- 408 - 15th IGTE Symposium 2012<br />

k 1 (Ωm −2 )<br />

4.9<br />

4.8<br />

4.7<br />

4.6<br />

4.5<br />

4.4<br />

4.3<br />

4.2<br />

4.1<br />

x 10−3<br />

5<br />

4<br />

9.5 10 10.5 11 11.5 12 12.5 13<br />

α (Wm −2 K −1 )<br />

Fig. 3. The scatter plot <strong>of</strong> α and k1 indicates indicates a strong<br />

correlation between the variables. This indicates, that more spatial<br />

distributed measurements are required.<br />

can even be used to update approximation models during<br />

the runtime. The presented framework can be easily<br />

applied to different problem types, e.g. electrical capacitance<br />

tomography, to perform Bayesian inference for the<br />

solution <strong>of</strong> indirect measurement problems.<br />

REFERENCES<br />

[1] J. Kaipio and E. Somersalo, Statistical and Computational Inverse<br />

Problems, ser. Applied Mathematical Sciences. Springer, 2005,<br />

vol. 160.<br />

[2] J.M.BernardoandA.F.M.Smith,Bayesian Theory. New York:<br />

John Wiley & Sons, 1994 (ISBN: 0-471-92416-4).<br />

[3] A. Forrester, A. Sobester, and A. Keane, Engineering Design Via<br />

Surrogate Modelling: A Practical Guide. Wiley, 2008.<br />

[4] L. Tierney, “Markov chains for exploring posterior distributions,”<br />

Annals <strong>of</strong> Statistics, vol. 22, pp. 1701–1762, 1994.<br />

[5] W. Hastings, “Monte Carlo sampling using Markov chains and<br />

their applications,” Biometrica, vol. 57, no. 1, pp. pp. 97–109,<br />

1970.<br />

[6] H.R.B.Orlande,M.J.Colaço, and G. S. Dulikravich, “Approximation<br />

<strong>of</strong> the likelihood function in the bayesian technique for<br />

the solution <strong>of</strong> inverse problems,” Inverse Problems in Science &<br />

Engineering, vol. 16, pp. 677–692, 2008.<br />

[7] J. A. Christen and C. Fox, “Markov chain Monte Carlo Using<br />

an Approximation,” Journal <strong>of</strong> Computational and Graphical<br />

Statistics, vol. 14, no. 4, pp. 795–810, 2005. [Online]. Available:<br />

http://pubs.amstat.org/doi/abs/10.1198/106186005X76983<br />

[8] T. Cui, “Bayesian Calibration <strong>of</strong> Geothermal Reservoir Models<br />

via Markov Chain Monte Carlo,” Ph.D. dissertation, <strong>University</strong><br />

<strong>of</strong> Auckland, 2010.<br />

[9] S. Haykin, Adaptive Filter Theory (4th Edition). Prentice Hall,<br />

Sep.<br />

[10] U. Wolff, “Monte Carlo errors with less errors,” Computer Physics<br />

Communications, vol. 156, no. 2, pp. 143 – 153, 2004. [Online].<br />

Available: http://www.sciencedirect.com/science/article/B6TJ5-<br />

4B3NPMC-3/2/94bd1b60aba9b7a9ea69ac39d7372fc5


A<br />

Aleksić, Slavoljub, 73, 300<br />

Alotto, Piergiorgio, 267, 374<br />

Anastasiadis, Ioannis, 271<br />

Andjelic, Zoran, 167<br />

Arkkio, Antero, 214<br />

B<br />

Balabozov, Iosko, 59<br />

Bardi, Istvan, 1<br />

Bauernfeind, Thomas, 327, 337<br />

Bavastro, Davide, 101<br />

Belahcen, Anouar, 214<br />

Bellwald, Lukas, 271<br />

Benabou, Abdelkader, 95<br />

Besser, Bruno, 7<br />

Bielby, Steven, 248<br />

Bilicz, Sandor, 346<br />

Bíró, Oszkár, 31, 41, 144, 190, 232, 327,<br />

337<br />

Brandstätter, Bernhard, 67, 403<br />

Brochet, Pascal, 78<br />

Buchau, André, 89, 386<br />

Buchinger, Andreas, 271<br />

Burgard, Stefan, 13<br />

C<br />

Calvano, Flavio, 208<br />

Campana, Luca Giovanni, 171<br />

Canova, Aldo, 101<br />

Cardoso Bora, Teodoro, 267<br />

Chiariello, Andrea Gaetano, 357<br />

Ciric, Ioan R., 352<br />

Clénet, Stéphane, 95<br />

Coenen, Isabel, 198, 305<br />

Colaco, Marcello J., 403<br />

Cvetkovic, Nenad, 294<br />

D<br />

Dal Mut, Giorgio, 208<br />

Dessoude, Maxime, 78<br />

Di Barba, Paolo, 171<br />

Diwoky, Franz, 232<br />

dos Santos Coelho, Leandro, 267<br />

Duca, Anton, 262<br />

Düzgün, Bilal, 154<br />

Dughiero, Fabrizio, 171<br />

Dulikravich, George S., 403<br />

Dyczij-Edlinger, Romanus, 13, 19<br />

E<br />

Ebrahimi, Bashir Mahdi, 125, 131, 315<br />

Eidenberger, Norbert, 186<br />

Elistratova, Vera, 78<br />

Ellermann, Katrin, 144<br />

- 409 - 15th IGTE Symposium 2012<br />

Author Index<br />

Ertl, Michael, 181, 226<br />

F<br />

Faiz, Jawad, 125, 131, 220, 315<br />

Farle, Ortwin, 13, 19<br />

Farnleitner, Ernst, 31, 41<br />

Ferraioli, Fabrizio, 208<br />

Figueiredo, William, 175<br />

Fonteyn, Katarzyna, 214<br />

Formisano, Alessandro, 108, 208, 357<br />

Fornieles, Jesús, 7<br />

Fujita, Yoshihisa, 53<br />

Fujiwara, Koji, 113<br />

Fulmek, Paul, 331<br />

G<br />

Gavrila, Horia, 352<br />

Gergely, Koczka, 337<br />

Ghorbanian, Vahid, 125<br />

Giaccone, Luca, 101<br />

Gigov, Georgi, 63<br />

Gjonaj, Erion, 204<br />

Glotic, Adnan, 398<br />

Glotic, Arnel, 238, 398<br />

Göhner, Peter, 89<br />

Guarnieri, Massimo, 374<br />

Gueorgiev, Vultchan, 59<br />

Guimaraes, Frederico, 160<br />

Gyimóthy, Szabolcs, 242, 346<br />

H<br />

Hameyer, Kay, 198, 305<br />

Handgruber, Paul, 190<br />

Hantila, Florea I., 352<br />

Hauck, Andreas, 226<br />

Hecquet, Michel, 78<br />

Herold, Thomas, 198<br />

Hinov, Krastio, 59<br />

I<br />

Iatcheva, Ilona, 63<br />

Igarashi, Hajime, 276, 340<br />

Ikuno, Soichiro, 47, 53<br />

Ilić, Saša, 73, 300<br />

Iovine, Renato, 25<br />

Itoh, Taku, 47, 53<br />

J<br />

Janousek, Ladislav, 262<br />

Jorks, Hai Van, 204<br />

Jüttner, Matthias, 89, 386<br />

K<br />

Kaimori, Hiroyuki, 84<br />

Kaltenbacher, Manfred, 181, 226<br />

Kamitani, Atsushi, 47, 53


Karastoyanov, Dimitar, 59<br />

Kastner, Gebhard, 31, 41<br />

Katsumi, Ryuichi, 242<br />

Keränen, Janne, 392<br />

Kettunen, Lauri, 392<br />

Kiss, Péter, 242<br />

Kitak, Peter, 238, 398<br />

Klomberg, Stephan, 41<br />

Koczka, Gergely, 327<br />

Kömürgöz, Güven, 154<br />

Kotlan, Vaclav, 321<br />

Kouhia, Reijo, 214<br />

Kraiger, Markus, 310<br />

Krstic, Dejan, 294<br />

Kunov, Georgi, 63<br />

L<br />

La Spada, Luigi, 25<br />

Lambert, Nancy, 1<br />

Lehti, Leena, 392<br />

Li, Min, 160<br />

Lichtenegger, Herbert I. M., 7<br />

Lowther, David, 160, 175, 248, 254<br />

M<br />

Magele, Christian, 37<br />

Mair, Mathias, 144<br />

Manca, Michele, 101<br />

Maricaru, Mihai, 352<br />

Marignetti, Fabrizio, 208<br />

Martone, Raffaele, 108, 208, 357<br />

Metzker, Isabela, 175<br />

Miyagi, Daisuke, 84<br />

Moghnieh, Hussein, 254<br />

Mohr, Martin, 232<br />

Moro, Federico, 374<br />

N<br />

Nagano, Takumi, 288<br />

Nakata, Susumu, 53<br />

Nandi, Subhasis, 315<br />

Neumayer, Markus, 67, 403<br />

O<br />

Offermann, Peter, 305<br />

Ofner, Georg, 190<br />

Ojaghi, Mansour, 220<br />

Okamoto, Yoshifumi, 113, 282, 288<br />

Orlande, Helcio R.B., 403<br />

P<br />

Pávó, József, 242, 346<br />

Perić, Mirjana, 73<br />

Petersson, Rickard, 1<br />

Piantsop Mboo, Christelle, 198<br />

Portí, Jorge, 7<br />

Preda, Gabriel, 262<br />

Preis, Kurt, 271, 327, 337<br />

- 410 - 15th IGTE Symposium 2012<br />

R<br />

Raicevic, Nebojsa, 73<br />

Rainer, Siegfried, 144<br />

Ramarotafika, Rindra, 95<br />

Ramirez, Jaime, 160, 175<br />

Rasilo, Paavo, 214<br />

Rauscher, Michael, 89<br />

Rebican, Mihai, 262<br />

Recheis, Manes, 331<br />

Renhart, Werner, 37<br />

Rossi, Carlo Riccardo, 171<br />

Rubesa, Jelena, 380<br />

Rubinacci, Guglielmo, 208<br />

Rucker, Wolfgang M., 89, 386<br />

Ruela, Andre, 160<br />

S<br />

Sabouri, Mahdi, 220<br />

Sadovic, Salih, 167<br />

Salinas, Alfonso, 7<br />

Santos, Rafael, 175<br />

Sato, Shuji, 113, 282<br />

Sato, Yuki, 340<br />

Scharrer, Matthias, 368<br />

Schnizer, Bernhard, 310<br />

Schöberl, Joachim, 226<br />

Schrittwieser, Maximilian, 31<br />

Schweigh<strong>of</strong>er, Bernhard, 331<br />

Shimoyama, Kouske, 84<br />

Sieni, Elisabetta, 171<br />

Silva, Elizabeth, 175<br />

Silvestro, John, 1<br />

Simioli, Marco, 101<br />

Smetana, Milan, 262<br />

Sommer, Alexander, 19<br />

Sonmez, Oluş, 154<br />

Stancheva, Rumena, 63<br />

Steiner, Gerald, 67, 403<br />

Stella, Andrea, 374<br />

Stermecki, Andrej, 190, 232<br />

Štih, Željko, 137<br />

Stojanovic, Miodrag, 294<br />

Strapacova, Tatiana, 262<br />

Suhr, Bettina, 368, 380<br />

Suuriniemi, Saku, 392<br />

Szabo, Zsolt, 119<br />

T<br />

Takahashi, Norio, 84<br />

Takbash, Amir Masoud, 131, 315<br />

Tamburrino, Antonello, 208<br />

Tarhasaari, Timo, 392<br />

Ticar, Igor, 238, 398<br />

Toledo-Redondo, Sergio, 7<br />

Toratani, Tomoaki, 242<br />

Trkulja, Bojan, 137<br />

Tsuburaya, Tomonori, 113<br />

Tuerk, Christian, 37


U<br />

Ulrych, Bohus, 321<br />

V<br />

Vale, Joao Francisco, 175<br />

Varga, Gábor, 242<br />

Vasilescu, George-Marian, 352<br />

Vegni, Lucio, 25<br />

Ventre, Salvatore, 208<br />

Vizireanu, Darius, 78<br />

Volk, Adrian, 181<br />

Volkwein, Stefan, 362<br />

Voracek, Lukas, 321<br />

Vuckovic, Ana, 300<br />

Vuckovic, Dragan, 294<br />

W<br />

Wakao, Shinji, 288<br />

Watanabe, Yuta, 276<br />

Watzenig, Daniel, 67, 368, 403<br />

Wegleiter, Hannes, 331<br />

Weiland, Thomas, 204<br />

Weilharter, Bernhard, 144<br />

Werth, Tobias, 271<br />

Wesche, Andrea, 362<br />

Y<br />

Yasukawa, Shogo, 288<br />

Yatchev, Ivan, 59<br />

Z<br />

Zagar, Bernhard G., 186<br />

Zhao, Kezhong, 1<br />

Župan, Tomislav, 137<br />

- 411 - 15th IGTE Symposium 2012

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!