Contents - Max-Planck-Institut für Physik komplexer Systeme
2.15 Probabilistic Forecasting

JOCHEN BRÖCKER, STEFAN SIEGERT, HOLGER KANTZ
A ubiquitous problem in our life (both as individuals and as a society) is having to make decisions in the face of uncertainty. Forecasters are supposed to help in this process by making statements (i.e. forecasts) about the future course of events. In order to allow the forecast user to properly assess the potential risks associated with various decisions, forecasters should, in addition, provide some information about the uncertainty associated with their forecasts. Unequivocal or "deterministic" forecasts are often misleading, as they give the false impression of high accuracy. Probabilities, in contrast, make it possible to quantify uncertainty in a well-defined and consistent manner, if interpreted correctly.
Forecasts in terms of probabilities have a long and successful history in the atmospheric sciences. In the 1950s, several meteorological offices started issuing probability forecasts, then based on synoptic information as well as local station data. On a scientific (non-operational) level, probabilistic weather forecasts were discussed much earlier still.
Evaluation of probabilistic forecasts
Since the prediction in a probabilistic forecast is a probability (distribution) whereas the observation is a single value, quantifying the accuracy of such forecasts is a nontrivial issue. This is of interest not only for quality control of operational forecasting schemes, but also for a more fundamental understanding of the predictability of dynamical systems; see for example [5]. Nowadays, probabilistic weather forecasts are often issued over a long period of time under (more or less) stationary conditions, allowing archives of forecast–observation pairs to be collected. Such archives make it possible to calculate observed frequencies and to compare them with the forecast probabilities. The probability distribution of the observation $y$, conditioned on our probability forecast being $p$, ideally coincides with $p$; a forecast having this property is called reliable or consistent. If a large archive of forecast–observation pairs is available, reliability can be tested statistically. Producing reliable probabilistic forecasts is not difficult, however: the overall climatological frequency, for example, is always a reliable forecast, but this constant forecast is not very informative. A good forecast should, in addition, discriminate between different observations.
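The reliability check described above can be sketched in a few lines: group the archive by issued probability and compare observed frequencies with the forecast probabilities. The synthetic archive below (the probability values and sample size are illustrative assumptions, not data from the text) is reliable by construction, so the frequencies should match.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic archive of N forecast-observation pairs for a binary event;
# observations are drawn with the issued probability, so the forecast
# is reliable by construction.
N = 100_000
p = rng.choice([0.1, 0.3, 0.5, 0.7, 0.9], size=N)  # issued probabilities
y = (rng.random(N) < p).astype(int)                # observations (0 or 1)

# Within each group of identical forecasts, the observed frequency of
# y = 1 should (up to sampling noise) equal the forecast probability.
for prob in np.unique(p):
    freq = y[p == prob].mean()
    print(f"forecast {prob:.1f}  observed frequency {freq:.3f}")
```

For a real archive the same binning comparison underlies reliability diagrams; with an unreliable forecast the observed frequencies would deviate systematically from the diagonal.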
The question arises how these virtuous forecast attributes (reliability and information content) can be quantified and evaluated. One therefore seeks "scoring rules" which reward both reliable and informed forecasters.
A scoring rule is a function $S(p, y)$ where $p$ is a probability forecast and $y$ a possible observation. Here, $y$ is assumed to be one of a finite number of labels, say $\{1, \dots, K\}$. A probability forecast then consists of a vector $p = (p_1, \dots, p_K)$ with $p_k \geq 0$ for all $k$ and $\sum_k p_k = 1$. The idea is that $S(p, y)$ quantifies how well $p$ succeeded in forecasting $y$. The general quality of a forecasting system is ideally measured by the mathematical expectation $E[S(p, y)]$, which can be estimated by the empirical mean

$$E[S(p,y)] \cong \frac{1}{N} \sum_{n=1}^{N} S\!\left(p^{(n)}, y^{(n)}\right) \qquad (1)$$

over a sufficiently large data set $\{(p^{(n)}, y^{(n)});\ n = 1, \dots, N\}$ of forecast–observation pairs.
Two important examples of scoring rules are the logarithmic score [8] (also called Ignorance)

$$S(p, y) = -\log(p_y), \qquad (2)$$

and the Brier score [1]

$$S(p, y) = \sum_k \left(p_k - \delta_{y,k}\right)^2. \qquad (3)$$

The convention here is that a smaller score indicates a better forecast.
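Both scores translate directly into code. The following minimal sketch implements (2) and (3) for a categorical observation (with labels shifted to $0, \dots, K-1$, as is natural in code) and estimates the expected score via the empirical mean (1); the example forecast vectors are made up for illustration.

```python
import numpy as np

def log_score(p, y):
    """Logarithmic score / Ignorance, eq. (2): minus the log of the
    probability assigned to the outcome that occurred."""
    return -np.log(p[y])

def brier_score(p, y):
    """Brier score, eq. (3): squared distance between the forecast
    vector and the one-hot indicator of the observed category."""
    delta = np.zeros_like(p)
    delta[y] = 1.0
    return np.sum((p - delta) ** 2)

# A three-category forecast and an observation of category 0.
p = np.array([0.6, 0.3, 0.1])
print(log_score(p, 0))    # -log(0.6), about 0.511
print(brier_score(p, 0))  # 0.16 + 0.09 + 0.01 = 0.26

# Empirical mean (1) over a (tiny) archive of forecast-observation pairs.
forecasts = [p, np.array([0.2, 0.5, 0.3])]
observations = [0, 1]
mean_ign = np.mean([log_score(q, y) for q, y in zip(forecasts, observations)])
```

Smaller values indicate better forecasts under both rules, in line with the convention above.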
Although these two scoring rules seem ad hoc, they share an interesting property which arguably any scoring rule should possess in order to yield consistent results. Suppose that our forecast for some observation $y$ is $q$; then the score of that forecast will be $S(q, y)$. If the correct distribution of $y$ is $p$, then the expectation value of our score is

$$E[S(q,y)] = \sum_k S(q,k)\, p_k. \qquad (4)$$

The right-hand side is referred to as the scoring function $s(q, p)$. Arguably, since $p$ is the correct distribution of $y$, the expected score of $q$ should be worse (i.e. larger, in our convention) than the expected score of $p$. This is equivalent to requiring that the divergence function

$$d(q,p) := s(q,p) - s(p,p) \qquad (5)$$
be non-negative, and zero only if $p = q$. A scoring rule with this property (for all $p, q$) is called strictly proper [6, 7]. The divergence function of the Brier score, for example, is $d(q,p) = \sum_k (q_k - p_k)^2$, demonstrating that this score is strictly proper. The Ignorance is strictly proper as well, since (5) is then just the Kullback–Leibler divergence, which is well known to be positive definite.
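Strict propriety can be illustrated numerically: for any $q \neq p$ both divergences come out positive, and both vanish at $q = p$. A small sketch (the distributions $p$ and $q$ are made-up examples):

```python
import numpy as np

def brier_divergence(q, p):
    """Divergence (5) of the Brier score: sum_k (q_k - p_k)^2."""
    return np.sum((q - p) ** 2)

def kl_divergence(q, p):
    """Divergence (5) of the Ignorance: s(q,p) - s(p,p)
    = sum_k p_k log(p_k / q_k), the Kullback-Leibler divergence."""
    return np.sum(p * np.log(p / q))

p = np.array([0.5, 0.3, 0.2])  # "correct" distribution of y
q = np.array([0.4, 0.4, 0.2])  # an imperfect forecast

print(brier_divergence(q, p), kl_divergence(q, p))  # both positive
print(brier_divergence(p, p), kl_divergence(p, p))  # both zero
```

An improper scoring rule, by contrast, could reward a forecaster for issuing $q \neq p$ even when $p$ is correct, which is exactly what propriety rules out.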
The expectation value $E[S(p,y)]$ of a strictly proper scoring rule $S$ allows for the following decomposition [4]:

$$E[S(p,y)] = s(\bar{\pi}, \bar{\pi}) + E[d(p, \pi)] - E[d(\bar{\pi}, \pi)], \qquad (6)$$
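The decomposition (6) can be verified numerically for the Brier score. The sketch below assumes, as in the standard decomposition literature, that $\pi$ denotes the conditional distribution of $y$ given the forecast $p$ and $\bar{\pi}$ its climatological average; the two-category forecast system itself is a made-up example, deliberately not reliable so that all three terms are nonzero.

```python
import numpy as np

def brier(a, y, K):
    """Brier score S(a, y), eq. (3)."""
    delta = np.zeros(K)
    delta[y] = 1.0
    return np.sum((a - delta) ** 2)

def s(a, p, K):
    """Scoring function s(a, p) = sum_k S(a, k) p_k, eq. (4)."""
    return sum(p[k] * brier(a, k, K) for k in range(K))

def d(a, p, K):
    """Divergence d(a, p) = s(a, p) - s(p, p), eq. (5)."""
    return s(a, p, K) - s(p, p, K)

K = 2
# Made-up forecast system: two forecast vectors issued with weights w,
# and the conditional distribution pi of y given each forecast.
forecasts = [np.array([0.8, 0.2]), np.array([0.3, 0.7])]
w = [0.6, 0.4]
pi = [np.array([0.7, 0.3]), np.array([0.4, 0.6])]
pibar = sum(wi * pii for wi, pii in zip(w, pi))  # climatology

expected_score = sum(wi * s(f, pii, K)
                     for wi, f, pii in zip(w, forecasts, pi))
uncertainty = s(pibar, pibar, K)                 # s(pibar, pibar)
reliability = sum(wi * d(f, pii, K)
                  for wi, f, pii in zip(w, forecasts, pi))
resolution = sum(wi * d(pibar, pii, K) for wi, pii in zip(w, pi))

# Decomposition (6): E S(p,y) = s(pibar,pibar) + E d(p,pi) - E d(pibar,pi)
print(expected_score)                                  # 0.464
print(uncertainty + reliability - resolution)          # 0.464
```

The three terms are commonly read as uncertainty, reliability, and resolution: a good forecast keeps the reliability term small while making the resolution term large.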