a la physique de l'information - Lisa - Université d'Angers

Recommendations

Info

Author's personal copy 3972 F. Chapeau-Blondeau, D. Rousseau / Physica A 388 (2009) 3969–3984 following way. The complete coding of the data should here include two parts. The first part is the coding of the data based on a definite probability density model to assign the code lengths. For a given data set x, the description length needed by this first part is Ldata of Eq. (7), that we can also write Ldata L(x|M), the description length of the data given a definite model M of probability density. The second part needed for a complete coding of the data is the description of the parameters that completely specify the underlying probability density model M. These parameters include the number of bins K along with the K values fk for k = 1 to K . The description length needed by this second part in charge of coding the parameters of the model M is denoted Lmodel L(M); and we shall soon see how to explicitly quantify this description length L(M). Now the complete coding of the data set x has a total description length Ltotal which sums up the two parts as Ltotal L(x|M) + L(M), (10) signifying that the total description length of the data is the description length of the data given the model plus the description length of the model. For a given data set x, the MDL principle then dictates to select the model parameters {K; fk, k = 1, . . . K} so as to minimize the total description length Ltotal of Eq. (10), i.e. {K; fk, k = 1, . . .K} = arg min {K;fk} Ltotal = arg min [L(x|M) + L(M)] . (11) {K;f k} This is an optimization principle based on optimal coding and information theory. In a prescribed class of models (histograms with regular bins here), the best model for the data is the model that, when known, enables the most efficient (shortest) coding of these data. 5. Description length for the data As already stated, the description length L(x|M) for the data given the model is supplied by Eq. (7). The term − log(dx N ) in Eq. (7) is a constant common to all models. For the purpose of discriminating among models, it is often chosen to omit this constant − log(dx N ) in the description length, with no impact on the final result concerning the model choice. However here, we prefer to maintain this term, in order to keep track of the complete value of the description length, and convey some additional insight into the modeling process beyond the choice of the model itself. So equivalently, the description length of Eq. (7) for the data given the model is written as L(x|M) = − K Nk log(fkdx). (12) k=1 Next, we have to address the quantification of the description length L(M) for the model. 6. Description length for the model parameters as independent real variables To quantify the description length L(M) of the model, a possibility is to use a procedure derived from Ref. [28]. The approach from Ref. [28] to quantify the description length L(M) of the model, considers the K model parameters fk as K independent real (continuously-valued) variables, which need to be quantized to finite precision in order to allow their coding. The histogram model for the density of the data assigns a probability pk = fkδx to bin k with width δx. Under this model also, the number Nk of data points falling in bin k has expected value E(Nk) = Npk = Nfkδx and standard deviation σ (Nk) = [Nfkδx(1−fkδx)] 1/2 , according to the properties of the binomial distribution [40]. Therefore, since fk = E(Nk)/(Nδx), for all k, estimating fk is equivalent to estimating the mean E(Nk) of random variable Nk with standard deviation σ (Nk). The value σ (fk) = σ (Nk)/(Nδx) = [fk(1−fkδx)/(Nδx)] 1/2 fixes a natural precision with which fk can be estimated and need to be coded. This determines σ (fk) as the quantization step relevant for coding the model parameters fk. One has the probability pk ∈ [0, 1] and the density fk = pkδx −1 ∈ [0, δx −1 ]. The parameter fk therefore can take its values in the interval [0, δx −1 ] and is estimated and quantized with the precision σ (fk). Accordingly, a total number δx −1 /σ (fk) of different values for fk can be distinguished and need to be coded separately, at a code length log[δx −1 /σ (fk)]. For the K parameters fk the code length results as L({fk}) = K log k=1 δx −1 σ (fk) = K 1 log(N) − 2 2 K log[fkδx(1 − fkδx)]. (13) k=1 An alternative, comparable, approach to quantify the cost of coding continuously-valued parameters is described in Ref. [1], based on a slightly more involved mathematical formulation. It turns out that quantifying the coding cost of continuously-valued model parameters is an important and recurrent step when applying the MDL principle. We review this alternative approach from Ref. [1] in the Appendix, for better appreciation of different existing variants for applying the MDL principle. With the present approach derived from Ref. [28] and proceeding through Eq. (13), the description length 153/197
Author's personal copy F. Chapeau-Blondeau, D. Rousseau / Physica A 388 (2009) 3969–3984 3973 for the model is L(M) = L({fk}), which is then added to the description of the data given the model L(x|M) of Eq. (12). The total description length Ltotal = L(x|M) + L(M) of Eq. (10) then results as Ltotal = − K Nk log(fkdx) + 1 2 log[fkδx(1 − fkδx)] k=1 + K log(N). (14) 2 The model parameters {K; fk} are then determined by minimizing the total length Ltotal of Eq. (14), under the constraint of Eq. (4). To simplify this minimization, it is possible to use an approximation as in Ref. [28]. In Eq. (14), the quantity fkδx is the probability pk of bin k under the histogram model of the probability density. The number of bins K can often be expected to be sufficiently large to assume this probability fkδx ≪ 1, authorizing the approximation log(1 − fkδx) ≈ −fkδx. (15) Under this approximation, the code length of Eq. (13) for the model parameters reduces to (in nats) L({fk}) = K 1 log(N) + 2 2 − 1 2 K log(fkδx), (16) k=1 and the minimization of Ltotal of Eq. (14) can be performed in two steps. First, at given K , the solution for the fk’s realizing, under the constraint of Eq. (4), the minimum of Ltotal, is accessible in closed form as fk = Nk + 1/2 N + K/2 1 , k = 1, . . . K. (17) δx Then, when thefk’s of Eq. (17) are plugged back into Ltotal of Eq. (14), one obtains K Ltotal = − Nk + k=1 1 log Nk + 2 1 + 2 1 2 log 1 − Nk + 1/2 N + K/2 + N + K log N + 2 K + 2 K x log(N) − N log(K) + N log . (18) 2 dx A useful equivalent expression of Eq. (18) is Ltotal = N + K H({pk}) − 2 1 K log 1 − 2 Nk + 1/2 + N + K/2 K x log(N) − N log(K) + N log , (19) 2 dx k=1 where the entropy H(·) as in Eq. (9) is with the empirical probabilitiespk = fkδx = (Nk + 1/2)/(N + K/2) deduced from Eq. (17). Moreover, in the conditions of the approximation of Eq. (15), the sum over k in Eq. (19) evaluates to −1 nat, so as to yield for Eq. (19), Ltotal = N + K H({pk}) + 2 1 K x + log(N) − N log(K) + N log . (20) 2 2 dx Eq. (18), or Eq. (19) or (20), defines a function Ltotal = Ltotal(K) of the sole (unknown) variable K , whose minimum can be numerically found to determine the minimizer K . Together this K and the fk’s of Eq. (17) form the minimum description length solution to the density estimation problem according to the approach proposed in Ref. [28]. It is to be noted that Ref. [28] rather chooses to estimate the bin probabilities pk rather than the density values fk as we do in this Section 6, and so the specific formulas may differ between both places; but the philosophy is the same, as far as we understand it in Ref. [28]. An important aspect should be emphasized concerning the approach of this Section 6 to quantify the description length L(M) of the model. The approach codes the model parameters fk, for k = 1 to K , as if they were independent and real (continuously-valued) parameters. Because of the constraint of Eq. (4), the parameters fk are not independent. Furthermore, any effective estimation of the fk’s will be performed from the integers Nk, which form a minimal sufficient statistic here. Since the K nonnegative integers Nk sum to N, there are only a finite number of feasible configurations for the Nk’s, and accordingly only a finite number of possible values for the fk’s (instead of a continuum of values as would suggest their being considered as real variables). By taking these two features (dependency and discreteness) into account, a more efficient coding could be envisaged. Also, the coding of the model parameters in this Section 6 takes the form of a lossy coding, in connection with Eq. (13), based on the quantization at a finite precision σ (fk) of the fk’s treated as continuously-valued parameters. Instead, a lossless coding could be envisaged. This we address now, by considering another way of quantifying the description length L(M) of the model. 154/197
Page 1 and 2:
Université d’Angers Laboratoire
Page 3 and 4:
Ce mémoire est dédié à mon épo
Page 5 and 6:
4 Physique de l’information 39 4.
Page 7 and 8:
1.2 Organisation du document L’es
Page 9 and 10:
4/197
Page 11 and 12:
de Rennes 1. Licence et Maîtrise
Page 13 and 14:
2.5 Activités de recherche 2.5.1 B
Page 15 and 16:
2.5.2 Encadrement Thèses Thèse so
Page 17 and 18:
2.5.3 Responsabilités Management d
Page 19 and 20:
[A20] S. BLANCHARD, D. ROUSSEAU, D.
Page 21 and 22:
In 8th Euro-American Workshop on In
Page 23 and 24:
études visaient alors à analyser
Page 25 and 26:
Fig. 3.1 réside dans le fait qu’
Page 27 and 28:
également permis de réaliser qu
Page 29 and 30:
caracteristique effective g eff (u)
Page 31 and 32:
des architectures neuronales contr
Page 33 and 34:
Dans l’ Éq.(3.10), ∆t représe
Page 35 and 36:
quantifieur, le niveau de saturatio
Page 37 and 38:
exemples présentés dans [39], l
Page 39 and 40:
• le type de validation des résu
Page 41 and 42:
présenté de façon condensée dan
Page 43 and 44:
38/197
Page 45 and 46:
pour l’étude de la turbulence pa
Page 47 and 48:
A B C Figure 4.1 : Images de coupe
Page 49 and 50:
atteint la capacité informationnel
Page 51 and 52:
* Figure 4.5 : Échelle optimale d
Page 53 and 54:
• L’“intégrale de corrélati
Page 55 and 56:
4.2.3 Analyse multifractale en imag
Page 57 and 58:
fonction de partition Z 10 30 10 25
Page 59 and 60:
exposant τ(q) 15 10 5 0 −5 −10
Page 61 and 62:
complexité colorimétrique des ima
Page 63 and 64:
dimension fractale généralisée D
Page 65 and 66:
est la moyenne du signal calculée
Page 67 and 68:
A B C Figure 4.17 : Observations in
Page 69 and 70:
A B C Figure 4.20 : Nombre moyen de
Page 71 and 72:
[14] S. Blanchard, D. Rousseau et F
Page 73 and 74:
[43] J. Chauveau, D. Rousseau et F.
Page 75 and 76:
[76] A. Humeau, B. Buard, F. Chapea
Page 77 and 78:
[100] M. D. McDonnell, N. G. Stocks
Page 79 and 80:
[131] D. Rousseau, F. Duan, J. Roja
Page 81 and 82:
76/197
Page 83 and 84:
D. ROUSSEAU and F. CHAPEAU-BLONDEAU
Page 85 and 86:
ROUSSEAU AND CHAPEAU-BLONDEAU: BAYE
Page 87 and 88:
ROUSSEAU AND CHAPEAU-BLONDEAU: BAYE
Page 89 and 90:
D. ROUSSEAU, G. V. ANAND, and F. CH
Page 91 and 92:
quantizers. Classically, the design
Page 93 and 94:
Fig. 2. Nonlinear array used as pre
Page 95 and 96:
probability of error P er 10 0 10 -
Page 97 and 98:
probability of error (expected as s
Page 99 and 100:
[3] N.H. Lu, B.A. Einstein, Detecti
Page 101 and 102:
Abstract Neurocomputing 71 (2007) 3
Page 103 and 104:
3. Assessing nonlinear transmission
Page 105 and 106:
output SNR 250 200 150 100 50 0 N=3
Page 107 and 108: precisely the scope of the present
Page 109 and 110: [4] F. Chapeau-Blondeau, G. Chauvet
Page 111 and 112: Nonlinear SNR amplification of harm
Page 113 and 114: P.R. BHAT and D. ROUSSEAU and G. V.
Page 115 and 116: It is assumed that the signal s is
Page 117 and 118: Let us de£ne the improvement in pe
Page 119 and 120: F. CHAPEAU-BLONDEAU and D. ROUSSEAU
Page 121 and 122: Contents Raising the noise to impro
Page 123 and 124: Raising the noise to improve perfor
Page 135 and 136: S. BLANCHARD, D. ROUSSEAU, D. GINDR
Page 137 and 138: 1984 OPTICS LETTERS / Vol. 32, No.
Page 139 and 140: F. CHAPEAU-BLONDEAU, D. ROUSSEAU, S
Page 141 and 142: 1288 J. Opt. Soc. Am. A/Vol. 25, No
Page 147 and 148: 36 IEEE SIGNAL PROCESSING LETTERS,
Page 149 and 150: 38 IEEE SIGNAL PROCESSING LETTERS,
Page 151 and 152: A. HISTACE and D. ROUSSEAU. Constru
Page 153 and 154: noises Z i in (3) are chosen Gaussi
Page 155 and 156: Author's personal copy Physica A 38
Page 157: Author's personal copy F. Chapeau-B
Page 161 and 162: edundancy Author's personal copy F.
Page 163 and 164: model description length Author's p
Page 165 and 166: total description length x 10 4 1.6
Page 167 and 168: Author's personal copy F. Chapeau-B
Page 169 and 170: Author's personal copy F. Chapeau-B
Page 171 and 172: J. CHAUVEAU and D. ROUSSEAU and F.
Page 173 and 174: 2 for natural images, and carry rel
Page 175 and 176: 4 Fig. 2 Random RGB color image I2(
Page 177 and 178: 6 log2[ number of boxes N(a) ] 24 2
Page 179 and 180: 8 Fig. 7 Color histogram in the RGB
Page 181 and 182: 10 log2[ number of boxes N(a) ] 8 7
Page 183 and 184: 12 17. Landgrebe, D.: Hyperspectral
Page 185 and 186: Numerical simulation of laser Doppl
Page 187 and 188: esults are meaningful to better app
Page 189 and 190: A. HUMEAU, B. BUARD, F. CHAPEAU-BLO
Page 191 and 192: 618 A Humeau et al 1. Introduction
Page 193 and 194: 620 A Humeau et al (a) (b) (c) Figu
Page 195 and 196: 622 A Humeau et al (a) (b) (c) Figu
Page 197 and 198: 624 A Humeau et al Figure 5. Averag
Page 199 and 200: 626 A Humeau et al Therefore, the m
Page 201 and 202: 628 A Humeau et al peripheral cardi
show all

a la physique de l'information - Lisa - Université d'Angers

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?