
[Figure: curve of the redundancy versus the number of bins K, with a dashed saturation level; vertical axis 0 to 1.2 bits, horizontal axis 0 to 100 bins.]

Fig. 1. Redundancy Hmax(K) − H({pk}) in bits of the data represented over K bins, as a function of the number of bins K. The data set is formed by N = 10^5 points xn drawn from a Gaussian probability density f(·) with standard deviation σ = x/9. The dashed line is the saturation level log(x) − Hdiff[f], which for the Gaussian density here is log[(2πe)^(−1/2) x/σ].

8. Analysis of the total description lengths

It is now interesting to analyze and compare the various total description lengths Ltotal obtained with the different possible coding strategies, which are provided by Eqs. (8), (20), (24) and (28). These total lengths share a common term N log(x/dx). In this term, dx is the precision or resolution with which the data points xn are measured or defined, while x is the total range over which the data points xn take their values. For instance, a typical situation could be data represented with 16 binary digits, for which x/dx = 2^16. This amounts to N log(x/dx) = 16N bits, which represents the description length associated with direct fixed-length coding of the N data points, with no attempt at optimizing the coding based on a probability model for the data. This common term can thus be understood as the initial description length Linitial prior to any optimized coding:

Linitial = N log(x/dx). (29)
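As a minimal numerical sketch of Eq. (29), assuming base-2 logarithms (so that lengths come out in bits) and the 16-bit representation of the example above:

    import numpy as np

    # Initial description length of Eq. (29): fixed-length coding of N data points,
    # each measured with resolution dx over a total range x (here x/dx = 2**16).
    N = 10**5
    x_over_dx = 2**16
    L_initial = N * np.log2(x_over_dx)   # = 16 * N = 1.6e6 bits
    print(L_initial)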

In the total description lengths, another common term is N log(K). With K bins to distribute the data points, log(K) can be interpreted as the maximum entropy Hmax(K) = log(K) achieved with uniform probability over the K bins. In these conditions, the total description length of Eq. (8) takes the form

Ltotal ≡ L1 = Linitial − N[Hmax(K) − H({pk})]. (30)

We recall that this total length L1 of Eq. (30) is the optimal coding length L(x|M) = − log P(x|M) for the data set x based on the histogram model M of probability density, but without including any coding cost L(M) for the model itself. In Eq. (30), H({pk}) is the entropy of the empirical probability distribution {pk} estimated from the data over the K bins. It cannot exceed the maximum entropy Hmax(K), so that in Eq. (30) the difference Hmax(K) − H({pk}) is nonnegative, and it measures the information redundancy of the data represented over K bins. In Eq. (30), this nonnegative redundancy usually expresses a reduction of the initial coding length Linitial which is afforded by nonuniform (variable-length) coding based on a probability model for the data. This reduction is possible except when the data are distributed with uniform probability, in which case the empirical entropy H({pk}) matches the maximum entropy Hmax(K), the redundancy vanishes, and no reduction of Linitial is achieved in Eq. (30), because the uniform (fixed-length) coding in Linitial already corresponds to the optimal coding. On the contrary, if the departure of the data from uniform probability is strong, then H({pk}) is much less than Hmax(K), the nonnegative redundancy Hmax(K) − H({pk}) is large, and a large reduction of Linitial is accessible in Eq. (30).
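A minimal Python sketch of Eq. (30), assuming base-2 logarithms (lengths in bits), a 16-bit initial representation as in the example above, and K equal-width bins over the data range; the function names are illustrative:

    import numpy as np

    def empirical_entropy_bits(data, K, x_range):
        # Entropy H({p_k}) in bits of the empirical distribution {p_k}
        # obtained by binning the data into K equal bins over x_range.
        counts, _ = np.histogram(data, bins=K, range=x_range)
        p = counts[counts > 0] / counts.sum()      # empty bins contribute zero to the sum
        return -np.sum(p * np.log2(p))

    def coding_length_L1(data, K, x_range, bits_per_point=16):
        # Total length L1 = Linitial - N [Hmax(K) - H({p_k})] of Eq. (30), in bits.
        N = len(data)
        L_initial = N * bits_per_point             # Eq. (29) with x/dx = 2**bits_per_point
        redundancy = np.log2(K) - empirical_entropy_bits(data, K, x_range)
        return L_initial - N * redundancy

For instance, coding_length_L1(data, K=50, x_range=(0.0, 1.0)) returns the length in bits for data confined to the range [0, 1]; for uniformly distributed data the redundancy is close to zero and L1 stays close to Linitial.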

Also in Eq. (30), when the number of bins K is increased, the maximum entropy Hmax(K) = log(K) increases as well; in addition, provided K grows while adhering to K ≪ N, the empirical entropy H({pk}) usually increases as a consequence of this increased resolution K. Moreover, the redundancy Hmax(K) − H({pk}) usually also increases with K. This redundancy grows from 0 at K = 1 to log(x) − Hdiff[f] at large K, where Hdiff[f] = − ∫ f log(f) is the differential entropy associated with the probability density f(·) of the data. A typical evolution with K of the redundancy Hmax(K) − H({pk}) is shown in Fig. 1 when f(·) is a Gaussian density. With Hmax(K) − H({pk}) an increasing function of K, the coding length L1 in Eq. (30) decreases with K, since Linitial is invariant with K. Larger K implies higher accuracy in modeling the probabilities of the data, which in turn implies more efficiency in the nonuniform coding based on these probabilities; whence the decreasing coding length L1 in Eq. (30) as K grows.
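The behaviour of Fig. 1 can be sketched along the following lines, under the assumptions (not specified in the text) that the range is taken as [0, x] with x = 1, the Gaussian density is centred at mid-range, and logarithms are in base 2 so that the redundancy is in bits:

    import numpy as np

    rng = np.random.default_rng(0)
    x = 1.0                                    # assumed total range [0, x]
    sigma = x / 9                              # standard deviation as in Fig. 1
    data = rng.normal(loc=x / 2, scale=sigma, size=10**5)   # N = 10**5 Gaussian points

    # Saturation level log2(x) - Hdiff[f] = log2[(2*pi*e)**(-1/2) * x / sigma], about 1.12 bits.
    saturation = np.log2(x / (np.sqrt(2 * np.pi * np.e) * sigma))

    for K in (2, 5, 10, 20, 50, 100):
        counts, _ = np.histogram(data, bins=K, range=(0.0, x))
        p = counts[counts > 0] / counts.sum()
        redundancy = np.log2(K) - (-np.sum(p * np.log2(p)))   # Hmax(K) - H({p_k})
        print(K, round(float(redundancy), 3), "saturation:", round(float(saturation), 3))

The printed redundancy grows with K and levels off near the saturation value, as in Fig. 1.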
