a la physique de l'information - Lisa - Université d'Angers

Recommendations

Info

Author's personal copy 3982 F. Chapeau-Blondeau, D. Rousseau / Physica A 388 (2009) 3969–3984 each axis, leading to a total number K = K D 0 of cells in D dimensions. The search for the minimization of Ltotal is then now reduced to a one-dimensional search in K0. A much simpler minimization process results, at the price of reduced resolution in the D-dimensional histograms. Other strategies can be envisaged to organize the division of the coordinate axes associated with the MDL criterion. In this same direction of multidimensional histograms with MDL, a recent study [41] introduces a new family of multidimensional histograms in the context of data mining for query answering. A local parametric model is employed to describe each multidimensional bin of the histogram, and local application of MDL is performed to optimize the parameterization. This approach of Ref. [41] is reminiscent of kernel methods, which form a different methodology for probability density estimation based on a definite choice of parametric kernel functions, and which although quite distinct from histograms, can also be handled with MDL for optimal parameterization. Beyond histogram estimation, such extensions of the MDL principle naturally offer, as suggested by the examples of Fig. 7, flexible and optimal subquantizations of multidimensional data based on an informational principle. This can be very useful for reduction of the complexity of multidimensional data sets, which are more and more pervasive in many areas of observational sciences, and with a control of the procedure obtained in an informational framework. Appendix. Quantization of continuously-valued parameters In this Appendix, we describe an alternative but comparable approach to Section 6, for quantifying the cost of coding continuously-valued model parameters, based on Ref. [1, p. 55]. The K model parameters fk take their values in [0, δx −1 ]. When considered as continuously-valued, each parameter fk has to be quantized to a finite precision to make its coding possible. A quantization step or precision hk is assumed for the quantization of parameter fk, for k = 1 to K . It results in a total number δx −1 /hk of different values for fk which can be distinguished and need to be coded separately, at a code length log(δx −1 /hk). For the K parameters fk the code length results as K −1 δx L({fk}) = log = K log(δx −1 K ) − log(hk). (A.1) k=1 hk k=1 Given the model parameters {fk}, the data x are coded as in Eq. (7) at the cost − log P(x|{fk}), which is added to the model cost of Eq. (A.1) to yield the total description length Ltotal({fk}) = − log P(x|{fk}) + L({fk}). (A.2) In Eq. (A.2), the fk’s that minimize Ltotal({fk}) also minimize − log P(x|{fk}) and are given by the fk’s of Eq. (6). This is so because in Eq. (A.2), the model code length L({fk}) does not (cannot) depend on the fk’s, since these are not known to the decoder. However, it is not the exact fk’s of Eq. (6) which are used to code the data x at the minimum cost − log P(x|{ fk}). Instead, it is a set {fk}, which is close to { fk}, and which results from quantization of the fk’s at the finite precisions hk. This induces a total description length Ltotal({fk}) slightly longer than the minimum Ltotal({ fk}). Since hk measures the maximum deviation of fk from fk, the maximum of the overcost Ltotal({fk}) above Ltotal({ fk}) can be obtained through the Taylor expansion with Ltotal({fk}) = − log P(x|{ fk}) + 1 Jij({fk}) = ∂ 2 ∂fi∂fj 2 K i=1 K Jij({fk})hihj + L({fk}), (A.3) j=1 − log P(x|{fk}), (A.4) and no contribution of the first derivatives since Ltotal({fk}) and − log P(x|{fk}) are at a minimum in {fk} = {fk}. It is then useful to express the total length of Eq. (A.3) as Ltotal({fk}) = − log P(x|{fk}) + Φ({hk}). This new function Φ({hk}), defined from Eq. (A.3), captures the dependence of Ltotal({fk}) with the precisions {hk} for coding the K parameters fk. It is then natural to select the precisions {hk} in order to minimize the cost expressed by Φ({hk}). If the quantization steps {hk} are small, long code lengths are entailed for the parameters {fk}, but also high precision is obtained in describing the probabilities of the data, allowing to come close to the minimum code length − log P(x|{fk}). On the contrary, larger quantization steps {hk} entail shorter code lengths for the parameters {fk}, but also less accuracy in describing the probabilities of the data with a coding performance further away from the minimum − log P(x|{fk}). One can thus expect an optimal configuration for the precisions {hk} that will minimize the total description length of Eq. (A.3), and that will come by nullifying the derivatives ∂Φ = ∂ K K K 1 Jij({fk})hihj − log(hk) . (A.5) 2 ∂hk ∂hk i=1 j=1 k=1 Due to the symmetry Jij = Jji, it follows from Eq. (A.5), K Jik({fk})hi − 1 = 0, (A.6) i=1 for all k = 1 to K . hk 163/197
Author's personal copy F. Chapeau-Blondeau, D. Rousseau / Physica A 388 (2009) 3969–3984 3983 To make the Jij’s explicit, one has from Eq. (5), ∂[− log P(x|{fk})]/∂fi = −Ni/fi, and then Jij({fk}) = Ni/f 2 i if i = j, and Jij({fk}) = 0 if i = j. The optimal precisions {hk} then follow from Eq. (A.6) as √ Nk hk = fk √Nk = N δx−1 , (A.7) for all k = 1 to K . The optimal precisions {hk} of Eq. (A.7) are dependent upon the optimal parameter values {fk}. For consistency of their derivation, the optimal coding precisions hk’s of Eq. (A.7) are determined by thefk’s but would not vary for coding other fk’s that would deviate from thefk’s. At the optimal {hk} of Eq. (A.7) one finds i j Jij({fk})hihj = K and a minimum for Φ({hk}) which is K/2 + K log(N) − K k=1 log(√Nk), leading to a parameter code length in Eq. (A.1) as L({fk}) = K K 1 Nk log(N) − log . (A.8) 2 2 N k=1 Also, in Eq. (A.3), the data code length − log P(x|{ fk}) is provided by Eq. (8). This leads, at the optimal {hk} of Eq. (A.7), for the total description length of Eq. (A.2), to the minimum K Ltotal({fk}) = − Nk + 1 log(Nk) + 2 K 2 + N + K log(N) + 2 K 2 k=1 or equivalently Ltotal({fk}) = N + K H({pk}) + 2 K 2 with this time the entropy estimator H({pk}) = − K k=1 Nk + 1/2 N + K/2 log Nk N K + log(N) − N log(K) + N log 2 log(N) − N log(K) + N log x dx x dx , (A.9) , (A.10) . (A.11) Eqs. (A.8), (A.9) and (A.10) are close, respectively, to Eqs. (16), (18) and (20) from Section 6. However, this approach based on Eqs. (A.8)–(A.10) suffers from the same limitation as the approach of Section 6, as explained in Section 11: the code it uses for the model parameters { fk} is not decodable by the receiver. This is so because Eq. (A.8) arranges for the parameters { fk} a code length which depends on the data through the Nk’s. Since these counts Nk are not known to the receiver when it starts its decoding task, this first step of decoding the parameters cannot take place. To circumvent this limitation, a simpler approach [1] would quantify the code length for K real independent parameters as K log(N)/2, specially at large N, to replace Eq. (A.8) or Eq. (16). But for the present histogram parameters, as we mentioned, K independent real (continuously-valued) independent parameters is not a natural assumption, and this is not needed by the alternative approach of Section 7. References [1] J. Rissanen, Stochastic Complexity in Statistical Inquiry, World Scientific, Singapore, 1989. [2] A.R. Baron, J. Rissanen, B. Yu, The minimum description length principle in coding and modeling, IEEE Transactions on Information Theory 44 (1998) 2743–2760. [3] P.D. Grünwald, The Minimum Description Length Principle, MIT Press, Cambridge, MA, 2007. [4] M. Li, P. Vitányi, An Introduction to Kolmogorov Complexity and its Applications, Springer, Berlin, 1997. [5] J. Rissanen, Modeling by shortest data description, Automatica 14 (1978) 465–471. [6] P. Grünwald, I.J. Myung, M. Pitt, Advances in Minimum Description Length: Theory and Applications, MIT Press, Cambridge, MA, 2005. [7] K. Judd, A. Mees, On selecting models for nonlinear time series, Physica D 82 (1995) 426–444. [8] L. Diambra, Maximum entropy approach to nonlinear modeling, Physica A 278 (2000) 140–149. [9] E.J. Hannan, B.G. Quinn, The determination of the order of an autoregression, Journal of the Royal Statistical Society B 41 (1979) 190–195. [10] O. Alata, C. Olivier, Choice of a 2-D causal autoregressive texture model using information criteria, Pattern Recognition Letters 24 (2003) 1191–1201. [11] P. Kontkanen, P. Myllymäki, W. Buntine, J. Rissanen, H. Tirri, An MDL framework for data clustering, in: P. Grünwald, I.J. Myung, M. Pitt (Eds.), Advances in Minimum Description Length: Theory and Applications, MIT Press, Cambridge, MA, 2005, pp. 323–353. [12] D.J. Navarro, M.D. Lee, An application of minimum description length clustering to partitioning learning curves, in: Proceedings IEEE International Symposium on Information Theory, Adelaide, Australia, 4–9 Sept. 2005, pp. 587–591. [13] T. Hediger, A. Passamante, M.E. Farrell, Characterizing attractors using local intrinsic dimensions calculated by singular-value decomposition and information-theoretic criteria, Physical Review A 41 (1990) 5325–5332. [14] J. Rissanen, MDL denoising, IEEE Transactions on Information Theory 46 (2000) 2537–2543. [15] S.C. Zhu, A. Yuille, Region competition: Unifying snakes, region growing, and Bayes/MDL for multiband image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence 18 (1998) 884–900. [16] F. Galland, N. Bertaux, P. Réfrégier, Minimum description length synthetic aperture radar image segmentation, IEEE Transactions on Image Processing 12 (2003) 995–1006. [17] T.C.M. Lee, Regression spline smoothing using the minimum description length principle, Statistics and Probability Letters 48 (2000) 71–81. [18] F. Chapeau-Blondeau, Le principe de longueur de description minimale pour la modélisation des données, ou la théorie statistique de l’information pour bien exploiter les mesures, Le Bup - Bulletin de l’Union des Professeurs de Physique et de Chimie 100 (889 (2)) (2006) 145–155. [19] R. Brown, N.F. Rulkov, E.R. Tracy, Modeling and synchronizing chaotic systems from experimental data, Physics Letters A 194 (1994) 71–76. 164/197
Page 1 and 2:
Université d’Angers Laboratoire
Page 3 and 4:
Ce mémoire est dédié à mon épo
Page 5 and 6:
4 Physique de l’information 39 4.
Page 7 and 8:
1.2 Organisation du document L’es
Page 9 and 10:
4/197
Page 11 and 12:
de Rennes 1. Licence et Maîtrise
Page 13 and 14:
2.5 Activités de recherche 2.5.1 B
Page 15 and 16:
2.5.2 Encadrement Thèses Thèse so
Page 17 and 18:
2.5.3 Responsabilités Management d
Page 19 and 20:
[A20] S. BLANCHARD, D. ROUSSEAU, D.
Page 21 and 22:
In 8th Euro-American Workshop on In
Page 23 and 24:
études visaient alors à analyser
Page 25 and 26:
Fig. 3.1 réside dans le fait qu’
Page 27 and 28:
également permis de réaliser qu
Page 29 and 30:
caracteristique effective g eff (u)
Page 31 and 32:
des architectures neuronales contr
Page 33 and 34:
Dans l’ Éq.(3.10), ∆t représe
Page 35 and 36:
quantifieur, le niveau de saturatio
Page 37 and 38:
exemples présentés dans [39], l
Page 39 and 40:
• le type de validation des résu
Page 41 and 42:
présenté de façon condensée dan
Page 43 and 44:
38/197
Page 45 and 46:
pour l’étude de la turbulence pa
Page 47 and 48:
A B C Figure 4.1 : Images de coupe
Page 49 and 50:
atteint la capacité informationnel
Page 51 and 52:
* Figure 4.5 : Échelle optimale d
Page 53 and 54:
• L’“intégrale de corrélati
Page 55 and 56:
4.2.3 Analyse multifractale en imag
Page 57 and 58:
fonction de partition Z 10 30 10 25
Page 59 and 60:
exposant τ(q) 15 10 5 0 −5 −10
Page 61 and 62:
complexité colorimétrique des ima
Page 63 and 64:
dimension fractale généralisée D
Page 65 and 66:
est la moyenne du signal calculée
Page 67 and 68:
A B C Figure 4.17 : Observations in
Page 69 and 70:
A B C Figure 4.20 : Nombre moyen de
Page 71 and 72:
[14] S. Blanchard, D. Rousseau et F
Page 73 and 74:
[43] J. Chauveau, D. Rousseau et F.
Page 75 and 76:
[76] A. Humeau, B. Buard, F. Chapea
Page 77 and 78:
[100] M. D. McDonnell, N. G. Stocks
Page 79 and 80:
[131] D. Rousseau, F. Duan, J. Roja
Page 81 and 82:
76/197
Page 83 and 84:
D. ROUSSEAU and F. CHAPEAU-BLONDEAU
Page 85 and 86:
ROUSSEAU AND CHAPEAU-BLONDEAU: BAYE
Page 87 and 88:
ROUSSEAU AND CHAPEAU-BLONDEAU: BAYE
Page 89 and 90:
D. ROUSSEAU, G. V. ANAND, and F. CH
Page 91 and 92:
quantizers. Classically, the design
Page 93 and 94:
Fig. 2. Nonlinear array used as pre
Page 95 and 96:
probability of error P er 10 0 10 -
Page 97 and 98:
probability of error (expected as s
Page 99 and 100:
[3] N.H. Lu, B.A. Einstein, Detecti
Page 101 and 102:
Abstract Neurocomputing 71 (2007) 3
Page 103 and 104:
3. Assessing nonlinear transmission
Page 105 and 106:
output SNR 250 200 150 100 50 0 N=3
Page 107 and 108:
precisely the scope of the present
Page 109 and 110:
[4] F. Chapeau-Blondeau, G. Chauvet
Page 111 and 112:
Nonlinear SNR amplification of harm
Page 113 and 114:
P.R. BHAT and D. ROUSSEAU and G. V.
Page 115 and 116:
It is assumed that the signal s is
Page 117 and 118: Let us de£ne the improvement in pe
Page 119 and 120: F. CHAPEAU-BLONDEAU and D. ROUSSEAU
Page 121 and 122: Contents Raising the noise to impro
Page 123 and 124: Raising the noise to improve perfor
Page 135 and 136: S. BLANCHARD, D. ROUSSEAU, D. GINDR
Page 137 and 138: 1984 OPTICS LETTERS / Vol. 32, No.
Page 139 and 140: F. CHAPEAU-BLONDEAU, D. ROUSSEAU, S
Page 141 and 142: 1288 J. Opt. Soc. Am. A/Vol. 25, No
Page 147 and 148: 36 IEEE SIGNAL PROCESSING LETTERS,
Page 149 and 150: 38 IEEE SIGNAL PROCESSING LETTERS,
Page 151 and 152: A. HISTACE and D. ROUSSEAU. Constru
Page 153 and 154: noises Z i in (3) are chosen Gaussi
Page 155 and 156: Author's personal copy Physica A 38
Page 157 and 158: Author's personal copy F. Chapeau-B
Page 159 and 160: Author's personal copy F. Chapeau-B
Page 161 and 162: edundancy Author's personal copy F.
Page 163 and 164: model description length Author's p
Page 165 and 166: total description length x 10 4 1.6
Page 167: Author's personal copy F. Chapeau-B
Page 171 and 172: J. CHAUVEAU and D. ROUSSEAU and F.
Page 173 and 174: 2 for natural images, and carry rel
Page 175 and 176: 4 Fig. 2 Random RGB color image I2(
Page 177 and 178: 6 log2[ number of boxes N(a) ] 24 2
Page 179 and 180: 8 Fig. 7 Color histogram in the RGB
Page 181 and 182: 10 log2[ number of boxes N(a) ] 8 7
Page 183 and 184: 12 17. Landgrebe, D.: Hyperspectral
Page 185 and 186: Numerical simulation of laser Doppl
Page 187 and 188: esults are meaningful to better app
Page 189 and 190: A. HUMEAU, B. BUARD, F. CHAPEAU-BLO
Page 191 and 192: 618 A Humeau et al 1. Introduction
Page 193 and 194: 620 A Humeau et al (a) (b) (c) Figu
Page 195 and 196: 622 A Humeau et al (a) (b) (c) Figu
Page 197 and 198: 624 A Humeau et al Figure 5. Averag
Page 199 and 200: 626 A Humeau et al Therefore, the m
Page 201 and 202: 628 A Humeau et al peripheral cardi
show all

a la physique de l'information - Lisa - Université d'Angers

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?