FIAS Scientific Report 2011 - Frankfurt Institute for Advanced Studies ...

More documents

Recommendations

Info

Lattice QCD on GPUs Collaborators: M. Bach 1 , V. Lindenstruth 1 , O. Philipsen 2 , C. Pinke 2 , C. Schäfer 2 , L. Zeidlewicz 2 1 Frankfurt Institute for Advanced Studies, 2 Institut für Theoretische Physik, Goethe-Universität Frankfurt am Main Quantum Chromodynamics (QCD) is the known theory of the strong force and part of the Standard Model of particle physics. Its phase diagram is a problem of particular interest and is investigated in current, state-of-theart collider experiments at CERN and, in the near future, at FAIR. Lattice QCD provides a first principle access to this problem. Lattice simulations require an enormous amount of computing power. They sample the phase space using Hybrid Monte Carlo techniques, requiring the fermion matrix to be inverted many times. The inversion of this sparse quadratic matrix, typically of size 10 8 × 10 8 , is the most time consuming part of the algorithm. To get physical results simulations need to be carried out at different lattice spacings and extrapolated to the continuum limit. Sparse matrix inversion is completely dominated by the memory bandwidth available in the system. Today Graphics Processing Units (GPUs) provide significantly more bandwidth than CPUs, making them a promising platform for QCD codes. However, so far GPU-enabled Lattice QCD codes have always been focused on the proprietary NVIDIA CUDA programming interface, locking the usability of the code to hardware of this one vendor. We aim at building a versatile Lattice QCD solution that can achieve optimum performance on a wide variety of modern hardware architectures. Therefore we base our solution on OpenCL, an open programming standard for parallel programming supported by all major hardware vendors. We focus on remaining with a single source source code, ensuring maintainability and easier verification of code correctness, while achieving maximum performance over a wide range of system configurations. Bandwidth GB/s 140 120 100 80 60 40 20 16^3x4 16^3x8 24^3x4 16^3x16 16^3x20 16^3x24 16^3x28 32^3x4 24^3x12 Dslash Performance 24^3x16 Lattice Size 32^3x8 24^3x20 24^3x24 32^3x12 GPU Bandwidth 140 GPU Gflops CPU Bandwidth 120 CPU Gflops 100 Dslash Performance on one AMD HD5870 and two Operon 6172 in LOEWE-CSC for a variety of lattice sizes Our first focus for performance optimization was the LOEWE-CSC supercomputer, implementing AMD GPUs. We have carefully analyzed the characteristics of the memory controller of AMD GPUs. Using that knowledge we were able to achieve more than 70 double precision Gflops in the Dslash calculation, the main part of the fermion matrix, while still being able to run the same code parallelized on CPU only system. This is competitive with NVIDIA-only solutions, however, in addition, the methods for memory controller analysis and the OpenCL based code that can be parametrized to match different memory controllers enable high performance on a wider range of hardware. Related publications in 2011: M. Bach, O. Philipsen, C. Pinke, C. Schäfer, L. Zeidlewicz, LatticeQCD using OpenCL, Proceedings of the XXIX International Symposium on Lattice Field Theory - Lattice 2011. 120 24^3x32 80 60 40 20 Gflops
How stable are transport model results to changes of resonance parameters? A UrQMD model study Collaborators: J. Gerhard 1,2 , B. Bäuchle 1,3 , V. Lindenstruth 1,2 , M. Bleicher 1,3 1 Frankfurt Institute for Advanced Studies, 2 Institut für Informatik, Johann Wolfgang Goethe-Universität, 3 Institut für Theoretische Physik, Johann Wolfgang Goethe-Universität Transport models like the Ultra-relativistic Quantum Molecular Dynamics model (UrQMD) rely as an Input heavily on measured quantities like hadron masses, the hadron decay widths, individual branching ratios, and cross sections. Unfortunately these parameters are often not known exactly, as one can see from an inspection of the Particle Data Group (PDG) tables. In the light of the current experiments with the LHC at CERN and the upcoming high precision experiments of FAIR at GSI, it is of crucial interest to validate the transport models. In this interdisciplinary study between computer science and theoretical physics, we have applied meta programming techniques to the UrQMD model in order to do cope with the combinatoric difficulties of a multidimensional parameter scan. We employed the Frankfurt LOEWE-CSC to carry out the systematic analysis on hadron masses and decay withs in order to check the stability of UrQMD. We addressed the question by simulating nucleus-nucleus interactions in an energy regime from 2 AGeV to 30 AGeV, while varying the parameters within the error estimates of the PDG (or ±10%). Although we have restricted our research to the UrQMD model in this study, the results should (at least qualitatively) also be transferable to other transport simulations based on similar physics assumptions, like the Parton-Hadron-String Dynamics or the Multi-Phase-Transport model. σπN tot [mb] 100 80 60 40 20 −10% PDG-Data +10% 0 1.0 1.2 1.4 1.6 ECM [GeV] 1.8 Mass variation 2.0 2.2 Deviation of pt spectrum [%] 30 20 10 0 −10 −20 −5% +5% N variation, Pb+Pb, 2 AGeV −30 0.0 0.2 0.4 0.6 0.8 1.0 pt [GeV/c] Left: When varying the masses of all nucleons simultaneously one can observe a strong shift of the cross sections. Right: Also the pt-spectra of outgoing pions is affected by a simultaneous shift of the nucleon masses. Related publications: 1) J. Gerhard, B. Bäuchle, V. Lindenstruth, M. Bleicher, How stable are transport model results to changes of resonance parameters? A UrQMD model study, submitted to Phys. Rev. C. 2) Klaus Aehlig, Helge Dietert, Thomas Fischbacher, Jochen Gerhard, Casimir forces via worldline numerics: Method improvements and potential engineering applications, arXiv:1110.5936v1. 121
Page 1 and 2:
FIAS Scientific Report 2011
Page 3:
FIAS Scientific Report 2011 Table o
Page 6 and 7:
Physics Research highlights 2011 To
Page 9 and 10:
1. Partner Research Centers 9
Page 11 and 12:
ExtreMe Matter Institute EMMI by Ca
Page 13 and 14:
Bernstein Focus Neurotechnology Fra
Page 15 and 16:
2. Graduate Schools 15
Page 17 and 18:
participants reported on their proj
Page 19 and 20:
Courses offered at FIGSS Summer Sem
Page 21 and 22:
3. FIAS Scientific Life 21
Page 23 and 24:
28.07.2011 Prof. Dr. Victor Flambau
Page 25 and 26:
Conferences and meetings (co)organi
Page 27 and 28:
4. Research Reports 4.1 Nuclear Phy
Page 29 and 30:
Hydrodynamics of a quark droplet Co
Page 31 and 32:
Production of hyper-nuclei in react
Page 33 and 34:
On production and properties of mul
Page 35 and 36:
Monte Carlo modeling microdosimetry
Page 37 and 38:
Monte Carlo simulation of the spall
Page 39 and 40:
Dimuon radiation within a (3+1)d hy
Page 41 and 42:
Initial state anisotropies and thei
Page 43 and 44:
Chiral hadronic model including res
Page 45 and 46:
Trace anomaly and the vector coupli
Page 47 and 48:
Dileptons from the strongly interac
Page 49 and 50:
Space-time evolution of the magneti
Page 51 and 52:
Extreme isospin in heavy nuclei Col
Page 53 and 54:
Production of heavy and superheavy
Page 55 and 56:
The phase diagram in T -µB-Nc spac
Page 57 and 58:
Critical Zeeman Fields for Unitary
Page 59 and 60:
BCS-BEC Crossover in 2D Fermi Gases
Page 61 and 62:
Applications of a chiral SU(3) mode
Page 63 and 64:
The phase diagram of black holes in
Page 65 and 66:
Black holes in short scale modified
Page 67 and 68:
Fractal dimensions and micro-struct
Page 69 and 70: Untangling the interactions between
Page 71 and 72: Stimulus information coded by spike
Page 73 and 74: Non-stationarity of neuronal activi
Page 75 and 76: Timescale of information processing
Page 77 and 78: Discovery of agency in 6 and 8-mont
Page 79 and 80: Power spectra of the natural input
Page 81 and 82: Self-organization in recurrent neur
Page 83 and 84: Learning mechanisms underlying visu
Page 85 and 86: Emerging Bayesian priors in a self-
Page 87 and 88: Non-linear generative models and th
Page 89 and 90: Preference elicitation and Bayesian
Page 91 and 92: 4.3 Biology, Chemistry, Molecules,
Page 93 and 94: DNA unzipping Collaborators: S.N. V
Page 95 and 96: Statistical mechanics of protein fo
Page 97 and 98: Phase transitions in large clusters
Page 99 and 100: Photo-processes in fullerenes and e
Page 101 and 102: Assessment of complex DNA damage Co
Page 103 and 104: Novel light sources: Crystalline un
Page 105 and 106: Monte-Carlo code for channeling dyn
Page 107 and 108: Programs for many-body descriptions
Page 109 and 110: Are 12 C radiation effects in liver
Page 111 and 112: Objective identification of residue
Page 113 and 114: Structural basis for the dual RNA-r
Page 115 and 116: The ALICE High Level Trigger Collab
Page 117 and 118: Arithmetic over Galois fields on mo
Page 119: High Performance GPU-based DGEMM an
Page 123 and 124: 5. Talks and Publications 123
Page 125 and 126: Conference and Seminar Talks 2011
Page 129 and 130: Conference and Seminar Talks 2011 W
Page 133 and 134: FIAS conference abstracts and poste
Page 135 and 136: FIAS conference abstracts and poste
Page 137 and 138: FIAS Publications 2011 - Journal pu
Page 147 and 148: FIAS publications 2011 - Conference
Page 159 and 160: FIAS publications 2011 - Patents 24
Page 161: The success of FIAS would not have
show all

FIAS Scientific Report 2011 - Frankfurt Institute for Advanced Studies ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?