Actes - Société Francophone de Classification

Recommendations

Info

SFC 2009 Moreover, the mixture h m is such that minimizes JS divergence within hth cluster. The final partition’s quality can be evaluated using the index obtained by ratio between divergence, analogously to the way proposed by Chavent et al. [CHA 03] 5. Computation procedure According to (4), the JS divergence among objects belonging to h th cluster has the following expression: d ( h) i = H ( m ) " ( ) H ( f ) JS h # ! ! i C h $ h $ i ( h) and after simple passages, we obtained the following expression for d : d ' p $ ( ( ) $ j = % ! H f ) + H ( ci ) " i # ! C $ % & j= " i h 1 # ( h) i JS H ( mh ) " ! ( h) JS B d JS and total JS Then, we can easily compute JS dissimilarities among objects in a cluster, computing copula entropy, marginal entropies and mixture entropy separately. To obtain these quantities, numerical integration procedure, based on adaptive methods, can be used. Subsequently, the W JS d quantity can be computed. The proposed clustering algorithm allows us to find simultaneously the best partition of symbolic objects, according to the chosen criterion, and a suitable model to describing dependence inside observations. 6. Bibliography [BOC 00] BOCK H.H., DIDAY E., Analysis of Symbolic Data, Expanatory methods for extracting statistical informations from Complex data, Studies in Classification, Data Analysis and Knowledge Organization, Springer Verlag, 2000. [CHA 03] CHAVENT M., DE CARVALHO F.A.T., LECHEVALLIER Y., VERDE R., Trois nouvelles méthodes de classification automatique des données symbolique de type intervalle, Revue de Statistique Appliquée, vol. 4, 2003, p. 5-29. [PAP 91] PAPOULIS A., Probability, Random Variables and Stochastic Process, McGraw-Hill,1991. [SHA 71] SHANNON C.E., WEAVER W., La teoria matematica delle comunicazioni, Etas Kompass, 1971. [SKL 59] SKLAR A., Fonctions de répartition à n dimension et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris, vol. 8, 1959, p. 229-231 [VER 08] VERDE E., IRPINO A., Comparing Histogram Data Using Mahalanobis-Wasserstein Distance, Proceeding in Compstat 2008: Proceedings in Computational Statistics, Heidelberg, Physica-Verlag Springer, 2008. 196 (10) (11)
Correspondence analysis with linear constraints of crossclassification tables using orthogonal polynomials Pietro Amenta Department of Analysis of Economic and Social Systems, University of Sannio Via delle Puglie, 82 82100, Benevento, Italy amenta@unisannio.it ABSTRACT. Within the context of the non-iterative procedures for performing correspondence analysis with linear constraints, a strategy is proposed to impose linear constraints in analyzing a contingency tables with one or two ordered sets of categories. At the heart of the approach is the partition of the Pearson chi-squared statistic which involves terms that summarize the association between the nominal/ordinal variables using bivariate moments based on orthogonal polynomials. Linear constraints are then included directly on suitable matrices which reflect the most important components overcoming the problem to impose linear constraints based on subjective decisions. A possible use of this constrained two-way approach for sliced three ordered sets of categories is also suggested. KEYWORDS : Ordered Correspondence Analysis, Emerson’s orthogonal polynomials, linear constraints. 1. Introduction Correspondence analysis is a widely used tool for obtaining a graphical representation of the dependence between the rows and columns of a contingency table, and it is usually performed by applying a singular value decomposition to the standardised residuals of a two-way contingency table. This decomposition ensures that the maximum information regarding the association between the two categorical variables are accounted for in a factorial plane of a correspondence plot. However, such a plot can identify those categories that are similar but does not clarify how some categories are different. In addition, the interpretation of the multidimensional representation of the row and column categories may be greatly simplified if additional information (as linear constraints) about the row and column structure of the table is available. In the classical analysis, Böckenholt and Böckenholt (hereafter B&B) [BOC 90] considered this problem (see also [BEH 09] [TAK 09] [AME 08a]). This additional information is usually imposed by making use of orthogonal polynomials which are suitable for subdividing total variation of the scores into linear, quadratic, cubic, etc., components. For instance, to obtain a linear order for the standard scores, B&B eliminates the effects of the quadratic and cubic trend by means suitable constraint matrices. Unfortunally, these constraints are commonly selected on the basis of subjective decisions without taking into account if the effects of the linear, quadratic and cubic trend are or not statistically significant, respectively. Aim of this paper is to consider a suitable extension of the B&B’s approach to contingency tables with more than one ordered sets of categories. This is achieved by using the additional information about the structure and statistically significant associations of the data given by the correspondence analysis proposed by Beh [BEH 97]. 2. Correspondence analysis of ordinal cross-classifications based on the moment decomposition Consider a two-way contingency table N describing the joint distribution of two categorical variables where the (i, j)th cell entry is given by nij for i =1, ..., I and j =1, ..., J with n = � i,j nij. The (i, j)th element 197
Page 1:
XVIèmes Rencontres de la Société
Page 5 and 6:
Préface Construire le programme sc
Page 7:
Comité de programme Président : G
Page 10 and 11:
Classification supervisée avec sec
Page 12 and 13:
DONNEES SYMBOLIQUES Extension de l'
Page 15 and 16:
Réduction non-linéaire de dimensi
Page 17 and 18:
Inférence de langages stochastique
Page 19 and 20:
Approximations en norme du supremum
Page 21 and 22:
Ordonnancement et optimisation de l
Page 23 and 24:
Forêts aléatoires : importance et
Page 25 and 26:
Adaptation des modèles d’auto-or
Page 27 and 28:
- le critère objectif évalue le d
Page 29 and 30:
Kohonen Approach for Assisted Livin
Page 31 and 32:
4. Results During the Quatra projec
Page 33 and 34:
Auto-organisation d’une structure
Page 35 and 36:
structure/problème sont les suivan
Page 37 and 38:
A Latent Logistic Model to Uncover
Page 39 and 40:
The maximum ln p(X | α, ˜ W) of L
Page 41 and 42:
Classification de variables et dét
Page 43 and 44:
Afin de contourner cette difficult
Page 45 and 46:
Données manquantes en ACM : l’al
Page 47 and 48:
Application des SVM à la classific
Page 49 and 50:
TABLE 1. Paramètres retenus pour l
Page 51 and 52:
Dissimilarity-based metric for data
Page 53 and 54:
where ! = [1 1 "1 "2] T is the norm
Page 55 and 56:
Analyse Discriminante Dissymétriqu
Page 57 and 58:
Propriété: My étant un produit s
Page 59 and 60:
Classification supervisée avec sec
Page 61 and 62:
sique, Ii est classée selon la mé
Page 63 and 64:
Discrimination sur des données arb
Page 65 and 66:
2.3. Indices de similarité sur les
Page 67 and 68:
Reliability of error estimators in
Page 69 and 70:
FIG. 1. Comparison of the true and
Page 71 and 72:
Comparaison et classification de s
Page 73 and 74:
ang k =0est associé au vecteur con
Page 75 and 76:
Apprentissage de différentes class
Page 77 and 78:
TAB. 1. Caractéristiques des donn
Page 79 and 80:
Comparaison et évaluation de métr
Page 81 and 82:
g ∈ [−0.05, 0.05] et ag ∈ [0,
Page 83 and 84:
Analyse de la stabilité d’une pa
Page 85 and 86:
algorithme de partionnement Ak en k
Page 87 and 88:
Indice de distance sur les structur
Page 89 and 90:
Imbrication de deux partitions semi
Page 91 and 92:
Détermination du nombre de classes
Page 93 and 94:
à la disjonction floue des degrés
Page 95 and 96:
Distance de compression et classifi
Page 97 and 98:
On appelle fermé minimal de E, tou
Page 99 and 100:
!"#$%&'(&#)*+"#$%&**","#$)##&%-."#$
Page 101 and 102:
!"#$%#&'()'*'+#$,'-.$/%#'0'()'1'-.$
Page 103 and 104:
Une méthode de partitionnement pou
Page 105 and 106:
3. Applications Dans cette section,
Page 107 and 108:
Classification hiérarchique de don
Page 109 and 110:
j et x je i =1si l’individu i pre
Page 111 and 112:
Structure des réseaux phylogénét
Page 113 and 114:
Ce théorème de décomposition en
Page 115 and 116:
Résumés de textes par extraction
Page 117 and 118:
alors que TEXTRANK décrit un proce
Page 119 and 120:
Analyse de graphes de données text
Page 121 and 122:
minimal de toute triangulation mini
Page 123 and 124:
Estimation des paramètres d’une
Page 125 and 126:
La dérivée f ′ d ln Γ(x) (δ)
Page 127 and 128:
Vers une discrétisation locale pou
Page 129 and 130:
2.2. Les treillis dichotomiques et
Page 131 and 132:
Combiner treillis de Galois et anal
Page 133 and 134:
FIG. 1. Résultats de l’AFM et tr
Page 135 and 136:
An approach based on Formal Concept
Page 137 and 138:
In this paper, we only consider num
Page 139 and 140:
Tatouages et motivations pour se fa
Page 141 and 142:
FIGURE 1 - !"#$%&'%(&"(()*+"$+),(&%
Page 143 and 144:
Approche pour le suivi des changeme
Page 145 and 146:
Total of clusters 0 1 2 3 4 5 6 7 8
Page 147 and 148:
Classification des émotions dans u
Page 149 and 150:
3. Méthode Conformément à la pro
Page 151 and 152:
Utilisation de RandomForest pour la
Page 153 and 154:
par pouvoir discriminant décroissa
Page 155 and 156:
Une méthode de combinaison de rés
Page 157 and 158:
Avant d’expliquer la manière don
Page 159 and 160: Consensus de partitions : une appro
Page 161 and 162: une partition de score nul. Le cas
Page 163 and 164: Analyse en Composantes Principales
Page 165 and 166: modalité), la matrice de variance
Page 167 and 168: Une méthode d’ACP de données en
Page 169 and 170: B n = Mn ( ZnZ n '! " n" n '), On d
Page 171 and 172: Régression - corrélation : un poi
Page 173 and 174: où les b j i (resp. les b0i )dési
Page 175 and 176: Classification non-supervisée de d
Page 177 and 178: Algorithm 1 CoFKM Entrée : Ensembl
Page 179 and 180: Classification floue de données in
Page 181 and 182: x " i xˆ # x " i " iMAX ! x ! ! iM
Page 183 and 184: !"#$%&'()&"*+ #,'+ #$-,*#(*.,'+ %".
Page 185 and 186: !"#$%&'#()*+,-+."(+/&,01/1."$(-"2$+
Page 187 and 188: Classification sous contraintes gé
Page 189 and 190: 3. Introduction du Modèle Nous pro
Page 191 and 192: 177
Page 193 and 194: 179
Page 195 and 196: New LISA indices for spatio-tempora
Page 197 and 198: semi-definite positive by construct
Page 199 and 200: K-mean clustering of misaligned fun
Page 201 and 202: Moreover, define the labelling func
Page 203 and 204: Multiple Comparison Procedures for
Page 205 and 206: 3. Simulation study and results A s
Page 207 and 208: Dynamic clustering of data describe
Page 209: H where i p ( ) ( j f H f ) i = ! j
Page 213 and 214: where UT U = I, VT V = I and Λ is
Page 215 and 216: Applying Differential Geometric LAR
Page 217 and 218: Let ru (β A(γ)) = (ru(β1(γ0)),
Page 219 and 220: Catégorisation de documents à l
Page 221 and 222: 3.2. Les résultats Les documents o
Page 223 and 224: Essais de classification par l’in
Page 225 and 226: Pour la modélisation de l’interm
Page 227 and 228: Index R. Abdesselam 41 J. Aguilar-M
show all

Actes - Société Francophone de Classification

Create successful ePaper yourself

Delete template?

Save as template?