12.07.2015 Views

1 Studies in the History of Statistics and Probability ... - Sheynin, Oscar

1 Studies in the History of Statistics and Probability ... - Sheynin, Oscar

1 Studies in the History of Statistics and Probability ... - Sheynin, Oscar

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

conclusions are derived <strong>in</strong> a purely ma<strong>the</strong>matical way <strong>and</strong> <strong>the</strong>reforecerta<strong>in</strong>. However, on <strong>the</strong> whole everyth<strong>in</strong>g depends on <strong>the</strong> model.Statistical methods are as certa<strong>in</strong> (not more or less) as <strong>the</strong> conclusions<strong>of</strong> o<strong>the</strong>r sciences apply<strong>in</strong>g ma<strong>the</strong>matical means, for example physics,astronomy or strength <strong>of</strong> material. In practical problems <strong>the</strong>se sciencescan provide guid<strong>in</strong>g l<strong>in</strong>es but can not guarantee that we have correctlyapplied <strong>the</strong>m.1.3. Model <strong>of</strong> trend with an error. In a ma<strong>the</strong>matical model <strong>of</strong> anobservational series someth<strong>in</strong>g is always determ<strong>in</strong>ate <strong>and</strong> someth<strong>in</strong>gr<strong>and</strong>om. We will consider a model <strong>in</strong> which that seriesx 1 , x 2 , ..., x nis given by formulax i = f(t i ) + δ i . (1.7)Here t i is <strong>the</strong> value <strong>of</strong> some determ<strong>in</strong>ate variable specify<strong>in</strong>g <strong>the</strong> i-<strong>the</strong>xperiment, f(t), some determ<strong>in</strong>ate function (<strong>the</strong> trend) <strong>and</strong> δ i , ar<strong>and</strong>om variable usually called <strong>the</strong> error <strong>of</strong> that experiment. Thissituation means that <strong>the</strong> Lord determ<strong>in</strong>ed <strong>the</strong> true dependence by f(t)so that we should have observed f(t i ) <strong>in</strong> experiment i, but that <strong>the</strong> devil<strong>in</strong>serted <strong>the</strong> error δ i .For example, f(t) can represent one or ano<strong>the</strong>r coord<strong>in</strong>ate <strong>of</strong> anobject <strong>in</strong> space as dependent on time, <strong>and</strong> x i is our measurement <strong>of</strong> thatcoord<strong>in</strong>ate at moment t i . The devil’s <strong>in</strong>terference δ i can certa<strong>in</strong>ly bedeterm<strong>in</strong>ate, r<strong>and</strong>om or generally <strong>of</strong> an <strong>in</strong>determ<strong>in</strong>ate nature. Thus, <strong>the</strong>observed x i can be corrupted by a systematic error so that Eδ i is notnecessarily zero. We may assume that Eδ i = C <strong>and</strong> does not depend oni but it is also possible to consider Eδ i = φ(t i ) is a function <strong>of</strong> t i . Stillworse will happen if Eδ i depends on a variable u i which we can notcheck. In nei<strong>the</strong>r <strong>of</strong> those cases statistical treatment can elim<strong>in</strong>ate <strong>the</strong>errors.However, a sufficiently thorough plann<strong>in</strong>g <strong>of</strong> <strong>the</strong> observations canallow us to hope that <strong>the</strong> errors will be purely r<strong>and</strong>om <strong>in</strong> <strong>the</strong> sense thatstatistical homogeneity is ma<strong>in</strong>ta<strong>in</strong>ed <strong>and</strong> <strong>the</strong>re is no systematic shift:Eδ i = 0. More precisely, <strong>the</strong> systematic error will be sufficiently small<strong>and</strong> can be neglected. Such situations <strong>in</strong>deed comprise <strong>the</strong> scope <strong>of</strong> <strong>the</strong>statistical methods.After recall<strong>in</strong>g what was said <strong>in</strong> § 1.2 it becomes clear that mostsimple statistical assumptions should be imposed on <strong>the</strong> errors δ i . Most<strong>of</strong>ten <strong>the</strong>se errors are supposed to be <strong>in</strong>dependent <strong>and</strong> identicallydistributed. Normality is also usually assumed. Only one <strong>of</strong> <strong>the</strong>irdeviations from <strong>the</strong> model <strong>of</strong> sample was brought <strong>in</strong>to use: it issometimes thought that <strong>the</strong>ir variances are not equal to one ano<strong>the</strong>r butproportional to numbers assigned accord<strong>in</strong>g to some considerations.Or, it is assumed that such numbers w i called weights <strong>of</strong> observationsare known thatw 1 var δ 1 = w 2 var δ 2 = ... = w n var δ n = σ 252

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!