Actes - Société Francophone de Classification
SFC 2009

2. Model and Notations
We consider a directed binary random graph $G$, where $V$ denotes a set of $N$ fixed vertices and $X = \{X_{ij},\ (i,j) \in V^2\}$ is the set of all the random edges. We assume that $G$ does not have any self-loop, and therefore the variables $X_{ii}$ are not taken into account.

For each vertex $i \in V$, we introduce a latent vector $Z_i$ of $Q$ independent Boolean variables $Z_{iq} \in \{0, 1\}$, drawn from Bernoulli distributions:
\[
Z_i \sim \prod_{q=1}^{Q} \mathcal{B}(Z_{iq}; \alpha_q) = \prod_{q=1}^{Q} \alpha_q^{Z_{iq}} (1 - \alpha_q)^{1 - Z_{iq}}, \qquad (1)
\]
and we denote by $\alpha = \{\alpha_1, \ldots, \alpha_Q\}$ the vector of class probabilities. Note that in the case of a usual mixture model, $Z_i$ would be generated according to a multinomial distribution with parameters $(1, \alpha)$. The vector $Z_i$ would therefore have all its components set to zero except one, such that $Z_{iq} = 1$ if vertex $i$ belongs to class $q$. The model would then verify $\sum_{q=1}^{Q} Z_{iq} = \sum_{q=1}^{Q} \alpha_q = 1,\ \forall i$. In this paper, we relax these constraints using the product of Bernoulli distributions (1), allowing each vertex to belong to multiple classes. We point out that $Z_i$ can also have all its components set to zero.
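The contrast with a multinomial mixture can be seen by sampling from (1) directly. The sketch below draws latent vectors for a few vertices under illustrative values of $\alpha$ (the probabilities and sizes are hypothetical, not from the paper); rows may contain several ones (overlapping classes) or only zeros.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical class probabilities alpha for Q = 3 classes (illustrative values).
alpha = np.array([0.4, 0.2, 0.1])
N = 5  # number of vertices

# Each Z_iq is an independent Bernoulli(alpha_q) draw, as in equation (1).
# Unlike a multinomial mixture, a row of Z may have several ones (vertex in
# multiple classes) or be entirely zero (vertex in no class).
Z = (rng.random((N, len(alpha))) < alpha).astype(int)
print(Z)
```

Each row of `Z` is one latent vector $Z_i$; nothing constrains its components to sum to one.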
Given two latent vectors $Z_i$ and $Z_j$, we assume that the edge $X_{ij}$ is drawn from a Bernoulli distribution:
\[
X_{ij} \mid Z_i, Z_j \sim \mathcal{B}\bigl(X_{ij};\, g(a_{Z_i,Z_j})\bigr) = e^{X_{ij} a_{Z_i,Z_j}}\, g(-a_{Z_i,Z_j}),
\]
where
\[
a_{Z_i,Z_j} = Z_i^{\intercal} W Z_j + Z_i^{\intercal} U + V^{\intercal} Z_j + W^{*}, \qquad (2)
\]
and $g(x) = (1 + e^{-x})^{-1}$ is the logistic sigmoid function. $W$ is a $Q \times Q$ matrix, whereas $U$ and $V$ are $Q$-dimensional vectors.
The first term in (2) describes the interactions between the vertices $i$ and $j$. If $i$ belongs only to class $q$ and $j$ only to class $l$, then only one interaction term remains ($Z_i^{\intercal} W Z_j = W_{ql}$). However, the interactions can become much more complex if one or both of these two vertices belong to multiple classes. Note that the second term in (2) does not depend on $Z_j$: it models the overall capacity of vertex $i$ to connect to other vertices. By symmetry, the third term represents the global tendency of vertex $j$ to receive an edge. Finally, we use $W^{*}$ as a bias, to model sparsity.
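A minimal sketch of equation (2) and the logistic link, with hypothetical parameter values chosen for illustration: here vertex $i$ belongs only to class 1, vertex $j$ overlaps both classes, and a negative bias $W^{*}$ pushes the graph towards sparsity.

```python
import numpy as np

def sigmoid(x):
    # Logistic link g(x) = 1 / (1 + exp(-x)).
    return 1.0 / (1.0 + np.exp(-x))

def edge_probability(Zi, Zj, W, U, V, W_star):
    # a_{Zi,Zj} = Zi^T W Zj + Zi^T U + V^T Zj + W*   (equation (2))
    a = Zi @ W @ Zj + Zi @ U + V @ Zj + W_star
    return sigmoid(a)

Q = 2
W = np.array([[2.0, -1.0],
              [-1.0, 2.0]])   # class-interaction matrix (illustrative values)
U = np.array([0.5, 0.0])      # sender effect of vertex i
V = np.array([0.0, 0.5])      # receiver effect of vertex j
W_star = -3.0                 # negative bias -> sparse graph

Zi = np.array([1, 0])         # i belongs only to class 1
Zj = np.array([1, 1])         # j overlaps classes 1 and 2
print(edge_probability(Zi, Zj, W, U, V, W_star))  # g(1 + 0.5 + 0.5 - 3) = g(-1)
```

Because $Z_j$ has two non-zero components, two entries of $W$ contribute to $Z_i^{\intercal} W Z_j$, illustrating how overlapping memberships compound the interaction term.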
3. Variational Approximations<br />
The log-likelihood of the observed data set is defined through the marginalization $p(X \mid \alpha, \tilde{W}) = \sum_{Z} p(X, Z \mid \alpha, \tilde{W})$. This summation involves $2^{NQ}$ terms and quickly becomes intractable. To tackle this issue, the Expectation-Maximization (EM) algorithm has been applied to many mixture models. However, the E-step requires the computation of the posterior distribution $p(Z \mid X, \alpha, \tilde{W})$, which cannot be factorized in the case of networks. In order to obtain a tractable procedure, we propose some approximations based on global and local variational techniques.
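The $2^{NQ}$ blow-up can be made concrete by brute-force marginalization on a toy graph. The sketch below (parameter values are illustrative, not from the paper) enumerates all latent configurations for $N = 3$, $Q = 2$, i.e. $2^{6} = 64$ terms; already at $N = 100$, $Q = 5$ the sum would have $2^{500}$ terms.

```python
import itertools
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def log_joint(X, Z, alpha, W, U, V, W_star):
    # ln p(X, Z) = ln p(Z | alpha) + sum over ordered pairs i != j of
    # ln p(X_ij | Z_i, Z_j), following equations (1) and (2).
    lp = np.sum(Z * np.log(alpha) + (1 - Z) * np.log(1 - alpha))
    N = X.shape[0]
    for i in range(N):
        for j in range(N):
            if i == j:
                continue  # no self-loops: X_ii is not modeled
            a = Z[i] @ W @ Z[j] + Z[i] @ U + V @ Z[j] + W_star
            p = sigmoid(a)
            lp += X[i, j] * np.log(p) + (1 - X[i, j]) * np.log(1 - p)
    return lp

N, Q = 3, 2  # tiny on purpose: 2**(N*Q) = 64 configurations are enumerable
alpha = np.array([0.3, 0.3])
W = np.eye(Q); U = np.zeros(Q); V = np.zeros(Q); W_star = -1.0
X = np.array([[0, 1, 0],
              [0, 0, 1],
              [1, 0, 0]])  # toy adjacency matrix

# Exact marginal log-likelihood: log-sum over all 2**(N*Q) configurations of Z.
terms = []
for bits in itertools.product([0, 1], repeat=N * Q):
    Z = np.array(bits).reshape(N, Q)
    terms.append(log_joint(X, Z, alpha, W, U, V, W_star))
log_px = np.logaddexp.reduce(terms)
print(log_px)
```

The exponential enumeration in the final loop is exactly what the variational approximations below avoid.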
3.1. The q-transformation (Variational EM)

Given a distribution $q(Z)$, the log-likelihood of the observed data set can be decomposed using the Kullback-Leibler divergence (KL):
\[
\ln p(X \mid \alpha, \tilde{W}) = \mathcal{L}\bigl(q; \alpha, \tilde{W}\bigr) + \mathrm{KL}\bigl(q(Z) \,\|\, p(Z \mid X, \alpha, \tilde{W})\bigr). \qquad (3)
\]
By definition, $\mathrm{KL}(\cdot \,\|\, \cdot)$ is always non-negative, and therefore $\mathcal{L}$ is a lower bound of the log-likelihood:
\[
\ln p(X \mid \alpha, \tilde{W}) \geq \mathcal{L}\bigl(q; \alpha, \tilde{W}\bigr), \quad \forall q(Z). \qquad (4)
\]
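The decomposition (3) and the bound (4) can be checked numerically on a toy discrete latent variable. In the sketch below the joint $p(x, z)$ and the variational $q(z)$ are arbitrary illustrative numbers; the identity holds for any choice of $q$.

```python
import numpy as np

# Toy check of decomposition (3): for any q(z), ln p(x) = L(q) + KL(q || p(z|x)).
# Discrete latent z in {0, 1}; the joint p(x, z) for one fixed observation x
# is given by two illustrative numbers.
p_xz = np.array([0.12, 0.28])   # p(x, z=0), p(x, z=1)
p_x = p_xz.sum()                # marginal likelihood p(x)
post = p_xz / p_x               # exact posterior p(z | x)

q = np.array([0.7, 0.3])        # arbitrary variational distribution q(z)

# Lower bound L(q) = E_q[ ln p(x, z) - ln q(z) ].
L = np.sum(q * (np.log(p_xz) - np.log(q)))
# KL(q || p(z|x)) >= 0, with equality iff q equals the exact posterior.
KL = np.sum(q * (np.log(q) - np.log(post)))

print(np.log(p_x), L + KL)      # the two sides of (3) coincide
```

Maximizing $\mathcal{L}$ over $q$ therefore tightens the bound (4); it becomes an equality exactly when $q(Z)$ matches the true posterior, which is what makes $\mathcal{L}$ a usable surrogate for the intractable log-likelihood.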