Treat<strong>in</strong>g observations whose results depend on many factors isfraught with an absolutely general difficulty <strong>and</strong> overcom<strong>in</strong>g it waspossibly <strong>the</strong> ma<strong>in</strong> f<strong>in</strong>d<strong>in</strong>g <strong>of</strong> <strong>the</strong> work done. The po<strong>in</strong>t is that <strong>the</strong> result<strong>of</strong> observation (<strong>in</strong> this case, <strong>the</strong> emergence <strong>of</strong> <strong>the</strong> IHD) is generallyconnected with <strong>the</strong> values <strong>of</strong> <strong>the</strong> risk factors <strong>in</strong> a barely understoodway. When <strong>the</strong>re are a few such factors, one or two, say, <strong>the</strong> data areusually divided <strong>in</strong>to <strong>in</strong>tervals accord<strong>in</strong>g to <strong>the</strong>ir value; <strong>in</strong> <strong>the</strong> mostsimple case, <strong>in</strong>to two, but this is very crude <strong>and</strong> it is better to havemore.If each factor is subdivided <strong>in</strong>to several levels, all <strong>the</strong>ircomb<strong>in</strong>ations should be applied to form <strong>the</strong> appropriate groupsprovid<strong>in</strong>g <strong>the</strong> frequencies <strong>of</strong> <strong>the</strong> IHD be<strong>in</strong>g estimates <strong>of</strong> <strong>the</strong>probabilities. These will <strong>in</strong>deed adequately describe <strong>the</strong> data(somewhat roughly because <strong>the</strong> values <strong>of</strong> <strong>the</strong> risk factors areconsidered approximately).For example, <strong>the</strong> contents <strong>of</strong> cholesterol can be considered on fourlevels [...], <strong>the</strong> values <strong>of</strong> <strong>the</strong> systolic blood pressure also on four levels[...]. We <strong>the</strong>n arrange a two-dimensional classification [...] <strong>and</strong> obta<strong>in</strong>16 groups with <strong>the</strong> frequency <strong>of</strong> <strong>the</strong> emergence <strong>of</strong> <strong>the</strong> IHD calculated<strong>in</strong> each <strong>of</strong> <strong>the</strong>m not for all 4856 observations, but for <strong>the</strong>ir number <strong>in</strong><strong>the</strong> group which is 16 times smaller <strong>in</strong> <strong>the</strong> mean. Jo<strong>in</strong><strong>in</strong>g men <strong>and</strong>women toge<strong>the</strong>r will likely be thought <strong>in</strong>admissible so that <strong>the</strong> number<strong>of</strong> observations becomes about twice smaller.In general, a modest number <strong>of</strong> observations <strong>of</strong> <strong>the</strong> order <strong>of</strong> ahundred (when hav<strong>in</strong>g a great many total number <strong>of</strong> observations) willbe left for each frequency. But what happens if we add three moregroups <strong>of</strong> different ages? And four more accord<strong>in</strong>g to <strong>the</strong> <strong>in</strong>tensity <strong>of</strong>smok<strong>in</strong>g? [..] As a result, we will obta<strong>in</strong> a classification with eachgroup conta<strong>in</strong><strong>in</strong>g at best one observation <strong>and</strong> cases <strong>of</strong> no observationsat all are not excluded. Consequently, we will be unable to determ<strong>in</strong>eany probabilities. [...]The same difficulty occurs <strong>in</strong> many technical problems concern<strong>in</strong>g<strong>the</strong> reliability <strong>of</strong> mach<strong>in</strong>ery established by several types <strong>of</strong> checks.Suppose that <strong>the</strong> results <strong>of</strong> <strong>the</strong> checks arex 1 , ..., x k (2.8)<strong>and</strong> we would like to derive <strong>the</strong> probability <strong>of</strong> failure-free work as afunction p(x 1 , ..., x k ). The attempt to achieve this by multivariateanalysis will be senseless.Let us see how this problem was solved <strong>in</strong> <strong>the</strong> Fram<strong>in</strong>gham<strong>in</strong>vestigation. As far as it is possible to judge, its solution had an<strong>in</strong>disputable part, but <strong>the</strong> o<strong>the</strong>r part was absolutely illogical. This doesnot mean that it is <strong>in</strong> essence wrong, but that it possibly needs somespecification. The first part can be thus expounded.When hav<strong>in</strong>g to do with several variables, <strong>the</strong>ir only well studiedfunction is <strong>the</strong> l<strong>in</strong>ear function; <strong>the</strong>re exists an entire pert<strong>in</strong>ent science,l<strong>in</strong>ear algebra, which also partly studies <strong>the</strong> function <strong>of</strong> <strong>the</strong> seconddegree. It would <strong>the</strong>refore be expedient to represent <strong>the</strong> unknownprobability p(x 1 , ..., x k ) by a l<strong>in</strong>ear function. This, however, isobviously impossible because probability changes from 0 to 1 whereas108
a l<strong>in</strong>ear function is not restricted. We will <strong>the</strong>refore take a necessarystep to fur<strong>the</strong>r complication suppos<strong>in</strong>g thatp(x 1 , ..., x k ) = f(a 0 + a 1 x 1 + ... + a k x k )where f is a function <strong>of</strong> one variable chang<strong>in</strong>g from 0 to 1.There still rema<strong>in</strong>s <strong>the</strong> problem <strong>of</strong> choos<strong>in</strong>g f; many considerations<strong>of</strong> simplicity show that most convenient is <strong>the</strong> so-called logisticfunction1f ( y) = .y1 + e −F<strong>in</strong>ally, chang<strong>in</strong>g notation to br<strong>in</strong>g it <strong>in</strong> correspondence with <strong>the</strong> citedwork, we have <strong>the</strong> ma<strong>in</strong> hypo<strong>the</strong>sis <strong>in</strong> <strong>the</strong> form1p( x1,..., xk) =.k1−exp[ α β x ]− −∑i=1ii(2.9)This function is called <strong>the</strong> multidimensional logistic function. Wehave certa<strong>in</strong>ly not proved that <strong>the</strong> probability sought, p, must beexpounded by (2.9), but arrived at that function without mak<strong>in</strong>g anylogical absurdities.After formulat<strong>in</strong>g <strong>the</strong> ma<strong>in</strong> hypo<strong>the</strong>sis (2.9) <strong>the</strong> parameters <strong>of</strong> thatfunction ought to be estimated <strong>and</strong> it is here that <strong>the</strong> authorsdeliberately admit a logical contradiction. They suggest <strong>the</strong> model <strong>of</strong> amultidimensional normal distribution for <strong>the</strong> results <strong>of</strong> <strong>the</strong> exam<strong>in</strong>ation(2.8). This is obviously impossible because two <strong>of</strong> <strong>the</strong> seven factors,NNo. 6 <strong>and</strong> 7, are measured <strong>in</strong> discrete units so that normality isformally impossible. Then, it is ra<strong>the</strong>r strange to suppose that age isnormally distributed. In general, unlike <strong>the</strong> small illogicalities <strong>of</strong>choos<strong>in</strong>g a statistical test when data are already available (§ 2.1), herewe see a serious corruption <strong>of</strong> logic which can only be exonerated by<strong>the</strong> result obta<strong>in</strong>ed (cf. <strong>the</strong> proverb: Victors are not judged).More precisely, <strong>the</strong> ma<strong>in</strong> hypo<strong>the</strong>sis consisted <strong>in</strong> that <strong>the</strong>re are twomany-dimensional normal totalities, one consist<strong>in</strong>g <strong>of</strong> <strong>the</strong> observations<strong>of</strong> <strong>the</strong> risk factors for those who were not taken ill dur<strong>in</strong>g <strong>the</strong> next 12years, <strong>the</strong> second concerned those who were. A problem is formulatedabout <strong>the</strong> methods <strong>of</strong> dist<strong>in</strong>guish<strong>in</strong>g <strong>the</strong>se totalities.The classical supposition <strong>of</strong> <strong>the</strong> discrim<strong>in</strong>ant analysis states that <strong>the</strong>covariance matrices <strong>of</strong> both totalities are <strong>the</strong> same which leads (ara<strong>the</strong>r remarkable fact!) to <strong>the</strong> expression (2.9) which we arrived at byconsiderations <strong>of</strong> simplicity. Nowadays this probability is understoodas <strong>the</strong> posterior probability <strong>of</strong> be<strong>in</strong>g taken ill given that <strong>the</strong>observations provided values (2.8) <strong>of</strong> risk factors. This time, however,<strong>the</strong> authors also obta<strong>in</strong>ed a method <strong>of</strong> estimat<strong>in</strong>g <strong>the</strong> parameters <strong>of</strong>(2.9). It is illogical to <strong>the</strong> same extent as <strong>the</strong> supposition <strong>of</strong> normality.Our own reason<strong>in</strong>g which first led us to <strong>the</strong> expression (2.9) wouldhave led us to a quite ano<strong>the</strong>r <strong>and</strong> more complicated <strong>in</strong> <strong>the</strong>109
- Page 1 and 2:
Studies in the History of Statistic
- Page 3 and 4:
Introduction by CompilerI am presen
- Page 5 and 6:
(Lect. Notes Math., No. 1021, 1983,
- Page 7 and 8:
sufficiently securely that a carefu
- Page 9 and 10:
is energy?) from chapter 4 of Feynm
- Page 11 and 12:
demand to apply transfinite numbers
- Page 13 and 14:
for stating that Ω consists of ele
- Page 15 and 16:
chances to draw a more suitable apa
- Page 17 and 18:
Let the space of elementary events
- Page 19 and 20:
2.3. Independence. When desiring to
- Page 21 and 22:
Eξ = ∑ aipi.Our form of definiti
- Page 23 and 24:
absolutely precisely if the pertine
- Page 25 and 26:
where x is any real number. If dens
- Page 27 and 28:
probability can be coupled with an
- Page 29 and 30:
Nowadays we are sure that no indepe
- Page 31 and 32:
λ = λ(T)with λ(T) being actually
- Page 33 and 34:
(1/B n )(m − A n )instead of the
- Page 35 and 36:
along with ξ. For example, if ξ i
- Page 37 and 38:
µ( − p0) ÷np0 (1 − p0)nhas an
- Page 39 and 40:
distribution of the maximal term |s
- Page 41 and 42:
ξ (ω) + ... + ξ (ω)n1n{ω :|
- Page 43 and 44:
P{max ξ(t) ≥ x} = 0.01, 0 ≤ t
- Page 45 and 46:
1. This example and considerations
- Page 47 and 48:
IIV. N. TutubalinTreatment of Obser
- Page 49 and 50:
structure of statistical methods, d
- Page 51 and 52:
Suppose that we have adopted the pa
- Page 53 and 54:
and the variances are inversely pro
- Page 55 and 56:
It is interesting therefore to see
- Page 57 and 58: is applied with P(t) being a polyno
- Page 59 and 60: ut some mathematical tricks describ
- Page 61 and 62: It is clear therefore that no speci
- Page 63 and 64: of various groups of machines, and
- Page 65 and 66: nnA(λ) x sin λ t, B(λ) = x cosλ
- Page 67 and 68: of the mathematical model of the Br
- Page 69 and 70: dF(λ) = f (λ) dλ, so that B( t
- Page 71 and 72: usually very little of them. Indeed
- Page 73 and 74: This is the celebrated model of aut
- Page 75 and 76: applications of the theory of stoch
- Page 77 and 78: achieved by differentiating because
- Page 79 and 80: u(x 1 , x 2 , t 1 , t 2 ) = v(x 1 ,
- Page 81 and 82: Reasoning based on common sense and
- Page 83 and 84: answering that question is extremel
- Page 85 and 86: IIIV. N. TutubalinThe Boundaries of
- Page 87 and 88: periodograms. It occurred that work
- Page 89 and 90: at point x = 1. However, preceding
- Page 91 and 92: He concludes that since the action
- Page 93 and 94: The verification of the truth of a
- Page 95 and 96: In the purely scientific sense this
- Page 97 and 98: ought to learn at once the simple t
- Page 99 and 100: the material world science had inde
- Page 101 and 102: values of (2.1) realized in the n e
- Page 103 and 104: *several dozen. The totality µ ica
- Page 105 and 106: Mendelian laws. It is not sufficien
- Page 107: example, the problem of the objecti
- Page 111 and 112: 258 - 82 - 176 cases or 68.5% of al
- Page 113 and 114: The Framingham investigation indeed
- Page 115 and 116: or, for discrete observations,IT(ω
- Page 117 and 118: What objections can be made? First,
- Page 119 and 120: eliability and queuing are known to
- Page 121 and 122: Kolman E. (1939 Russian), Perversio
- Page 123 and 124: measurement is provided. Recently,
- Page 125 and 126: which means that sooner or later th
- Page 127 and 128: The foundations of the Mises approa
- Page 129 and 130: A rather subtle arsenal is develope
- Page 131 and 132: 4.3. General remarks on §§ 4.1 an
- Page 133 and 134: BibliographyAlimov Yu. I. (1976, 19
- Page 135 and 136: processes are now going on in the s
- Page 137 and 138: obtaining a deviation from the theo
- Page 139 and 140: VIOscar SheyninOn the Bernoulli Law