of the others, then its distribution should be close to normal. I do not know any {rigorous}mathematical findings made in this direction before Chebyshev. And, although his proof ofthe {central limit} theorem has logical flaws <strong>and</strong> the formulation of his theorem lacks thenecessary restrictions, Chebyshev’s merit in solving this issue is everlasting. It consists inthat he was able, first, to develop a method of proof (the method of moments) 2 ; second, toformulate the problem of establishing the rapidity of approximation <strong>and</strong> to discoverasymptotic expansions; <strong>and</strong> third, to stress the importance of the theorem.Incidentally, I note that soon after Chebyshev’s work had appeared, Markov publishedtwo memoirs where he rigorously proved more general propositions. He applied the samemethod of investigation, – the method of moments. After Liapunov had made public tworemarkable writings on the same subject, this method apparently lost its importance;indeed, whereas Chebyshev <strong>and</strong> Markov dem<strong>and</strong>ed that the terms {of the studied sum}possessed finite moments of all orders, Liapunov was able to establish such conditions ofthe theorem that only required a restricted number of moments (up to the third order <strong>and</strong>even somewhat lower). Liapunov’s method was actually a prototype of the modern methodof characteristic functions. Not only had he managed to prove the sufficiency of hisconditions for the convergence of the distribution functions of the appropriately normed<strong>and</strong> centered sums of independent variables to the normal law; he also estimated therapidity of the convergence.Markov exerted great efforts to restore the honor of the method of moments. Hesucceeded by employing a very clever trick which is not infrequently used nowadays aswell. Its essence consists in that, instead of a sequence of given r<strong>and</strong>om variables 1 , 2 , …,we consider curtailed variables* n = n , if | n | ≤ N n , <strong>and</strong> = 0 otherwise.The number N n remains at our disposal, <strong>and</strong>, when it is sufficiently large, the equality* n = n holds with an overwhelming probability. Unlike the initial variables, the new onespossess moments of all orders, <strong>and</strong> Markov’s previous results are applicable to them. Anappropriate choice of the numbers N n ensures that the sums of * n <strong>and</strong> of the initialvariables have approaching distribution functions. Markov was thus able to show that themethod of moments allowed to derive all the Liapunov findings.[2] In 1906 Markov initiated a cycle of investigations <strong>and</strong> thus opened up a new objectof research in probability <strong>and</strong> its applications to natural sciences <strong>and</strong> technology. He beganconsidering sequences of peculiarly dependent r<strong>and</strong>om variables (or trials) n . Thedependence was such that the distribution of n , given the value taken by n–1 , does notchange once the values of k , k < n – 1, become known.Markov only illustrated the idea of these chainwise dependences, which in our timeenjoy various applications, by examples of the interchange of the vowels <strong>and</strong> consonantsin long extracts from Russian poetry (Pushkin) <strong>and</strong> prose (S.T. Aksakov). In the newcontext of r<strong>and</strong>om variables connected in chain dependences he encountered the problemsformulated by Chebyshev for sums of independent terms. The extension of the law of largenumbers to such dependent variables proved not excessively difficult, but the justificationof the central limit theorem was much more troublesome. The method of moments thatMarkov employed required the calculation of the central moments of all the integral ordersfor the sumsn(1/B n )k = 1( k – E k ), B n 2 = varnk = 1 k
<strong>and</strong> the proof of their convergence as n to the respective moments of the normaldistribution.In a number of cases Markov surmounted great calculational difficulties. On principle,even more important was that he substantiated new limit propositions, prototypes of theso-called ergodic theorems. For the Markov chains, the distribution of n as n increasesever less depends on the value taken by 1 : a remote state of the system ever less dependson its initial state.The next direction of the theory of probability developed by Markov <strong>and</strong> otherresearchers before the Great October Socialist Revolution {before the Nov. 7, 1917, newstyle, Bolshevist coup} is connected with the construction of the theory of errors.Astronomers paid much attention to this subject, <strong>and</strong> their contribution was not restrictedto methodologically improving the exposition of already known results 3 .During the 19 th , <strong>and</strong> the beginning of the 20 th century, Buniakovsky {1846},Tikhom<strong>and</strong>ritsky {1898}, Ermakov {1878}, Markov (1900) <strong>and</strong> Bernstein {1911}compiled textbooks on probability theory on the level corresponding to the contemporarystate of that science. Markov’s textbook played a considerable part in developingprobability theory in our country. He explicated a number of findings in sufficient detail,<strong>and</strong>, in the same time, in an elementary way 4 which fostered the readers’ interest not onlyin passive learning, but in active reasoning as well. Already in its first edition, Bernstein’sbook, distinguished by many peculiar traits, for a long time exerted considerable influence.Then, Slutsky (1912) acquainted his Russian readers with the new issues in mathematicalstatistics that had originated in Engl<strong>and</strong> in the first decade of the 20 th century.The works of the two mathematicians, Bernstein <strong>and</strong> Slutsky, who played an importantpart in building up new directions of research in probability theory <strong>and</strong> mathematicalstatistics in our country 5 , began to appear in the years immediately preceding theRevolution. During the first period of his work, Bernstein examined such important issuesas the refinement of the {De Moivre –} Laplace theorem, the logical justification ofprobability theory, <strong>and</strong> the transfer of its peculiar methods to problems in the theory offunctions. It was in this very period that he was able to discover a remarkable proof of theWeierstrass theorem (1912). {At the time,} Slutsky studied problems in mathematicalstatistics chiefly connected with correlation theory.[3] Thus, already before the Revolution, scientific pre-requisites for the development ofprobability theory were created in our country. And the establishment, after the Revolution,of a vast network of academic <strong>and</strong> research institutes <strong>and</strong> of academies of sciences in theUnion Republics 6 fostered the growth of scientific investigations in many cities as well asthe creation of considerable mathematical bodies <strong>and</strong> the initiation of new directions ofresearch.In the then young Central Asian University {Tashkent} Romanovsky established aprominent school of mathematical statistics <strong>and</strong> the theory of Markov chains. In Moscow,in the nation’s oldest university, the well-known school of the theory of probability wascreated on the basis of the school of the theory of functions of a real variable. It is difficultto overestimate its influence on the development of probability theory during the latestdecades. The construction of the foundation of the theory; a vast development of theclassical issues concerning limit theorems for sums of independent variables; the conceptof stochastic processes (without aftereffect; stationary <strong>and</strong> with stationary increments,branching processes); the development of methods of statistical physics; of queuing,reliability <strong>and</strong> information theories; <strong>and</strong> many other issues are the subject of research doneby Moscow specialists. The beginning of stochastic investigations in Moscow wasconnected with two outst<strong>and</strong>ing mathematicians, Khinchin <strong>and</strong> Kolmogorov.
- Page 4 and 5:
[3] I bear in mind the well-known p
- Page 6 and 7:
successes of physical statistics. B
- Page 8 and 9:
classes of independent facts whose
- Page 10 and 11:
distribution is a corollary of the
- Page 12 and 13:
examine in the first place the curv
- Page 14 and 15:
12. According to Bortkiewicz’ ter
- Page 16 and 17:
generality, the similarities taking
- Page 18 and 19:
one on another, as well as the corr
- Page 20 and 21:
is inapplicable because the right s
- Page 22 and 23:
Instead, Slutsky introduced new not
- Page 24 and 25:
abandoned in August 1936, but it is
- Page 26 and 27:
last decades, mathematicians more o
- Page 28 and 29:
charged with making the leading ple
- Page 30 and 31:
motion and a number of others) are
- Page 32 and 33:
phenomena. It is self-evident that
- Page 34 and 35:
Such new demands were formulated in
- Page 36 and 37:
The addition of independent random
- Page 38 and 39:
automatic lathes, etc. Here, the ma
- Page 40 and 41:
11. Kolmogorov, A.N. Grundbegriffe
- Page 42 and 43:
period 1 and remained, until the ap
- Page 44 and 45:
of the analytical tool rather than
- Page 46 and 47:
with probability approaching unity,
- Page 48 and 49:
logic. The ensuing vagueness in his
- Page 50 and 51:
2. Gnedenko, B.V. (1949), On Lobach
- Page 52 and 53:
will be sufficient, although not ne
- Page 54 and 55:
nlimk = 1P(| k (n) - m k (n) | > H
- Page 56 and 57:
favorite classical issue as the gam
- Page 58 and 59:
and some quite definite (not depend
- Page 60 and 61:
influenced by a construction that a
- Page 62 and 63:
P ij (1) = p ij (1) , P ij (t) =kP
- Page 64 and 65:
4.2d. Bebutov [1; 2] as well as Kry
- Page 66 and 67:
are yet no limit theorems correspon
- Page 68 and 69:
described, from the viewpoint that
- Page 70 and 71:
conditional variance and determined
- Page 72 and 73:
Romanovsky [45] and Kolmogorov [46]
- Page 74 and 75:
Let S be the general population wit
- Page 76 and 77:
Part 1. Russian/Soviet AuthorsAmbar
- Page 78 and 79: 2. On necessary and sufficient cond
- Page 80 and 81: Gnedenko, B.V., Groshev, A.V. 1. On
- Page 82 and 83: 52. ( (Mathematical Principl
- Page 84 and 85: Kozuliaev, P.A. 1. Sur la répartit
- Page 86 and 87: Obukhov, A.M. 1. Normal correlation
- Page 88 and 89: 30. Généralisations d’un théor
- Page 90 and 91: 22. Alcune applicazioni dei coeffic
- Page 92 and 93: 10. A.N. Kolmogorov. The Theory of
- Page 94 and 95: Kuznetsov, Stratonovich & Tikhonov
- Page 96 and 97: In the homogeneous case H s t = H t
- Page 98 and 99: to such a generalization. He only s
- Page 100 and 101: In the particular case of a charact
- Page 102 and 103: as it is usual for the modern theor
- Page 104 and 105: 1. {The second reference to Pugache
- Page 106 and 107: Smirnov, Romanovsky and others made
- Page 108 and 109: determined the precise asymptotic c
- Page 110 and 111: for finite values of N, M and R 2 .
- Page 112 and 113: Mikhalevich’s findings by far exc
- Page 114 and 115: Uch. Zap. = Uchenye ZapiskiUkr = Uk
- Page 116 and 117: Khinchin, A.Ya. 43. Math. Ann. 101,
- Page 118 and 119: 7. DAN 115, 1957, 49 - 52.Pinsker,
- Page 120 and 121: Anderson, T.W., Darling, D.A. (1952
- Page 122 and 123: Statistical problems in radio engin
- Page 124 and 125: observations for its power with reg
- Page 126 and 127: securing against mistakes (A.N. Kry
- Page 130 and 131: In Kiev, in the 1930s, N.M. Krylov