Proceedings of GO 2005, pp. 67–69.

Globally optimal prototypes in kNN classification methods∗

E. Carrizosa,1 B. Martín-Barragán,1 F. Plastria,2 and D. Romero-Morales3

1 Universidad de Sevilla, Spain. {ecarrizosa,belmart}@us.es
2 Vrije Universiteit Brussel, Belgium. Frank.Plastria@vub.ac.be
3 University of Oxford, United Kingdom. dolores.romero-morales@sbs.ox.ac.uk

Abstract

The Nearest Neighbor classifier has been shown to be a powerful tool for multiclass classification. In order to alleviate its main drawbacks (high storage requirements and time-consuming queries), a series of variants, such as the Condensed or the Reduced Nearest Neighbor, have been suggested over the last four decades.

In this note we explore both the theoretical properties and the empirical behavior of another such variant, in which the Nearest Neighbor rule is applied after selecting a set of so-called prototypes, whose cardinality is fixed in advance, by minimizing the empirical misclassification cost.

The problem is shown to be NP-hard. Mixed Integer Programming (MIP) formulations are given, theoretically compared, and solved by a standard MIP solver for problems of small size. Large problem instances are solved by a variable neighborhood metaheuristic, yielding good classification rules in reasonable time.

Keywords: Data Mining, Classification, Optimal Prototype Subset, Nearest Neighbor, Integer Programming.

1. Introduction

In a classification problem, one has a database with individuals of |C| different classes, and one wants to derive a classification rule, i.e., a procedure which labels every future entry v as a member of one of the |C| existing classes.

Roughly speaking, classification procedures can be divided into two types: parametric and non-parametric.
Parametric procedures assume that each individual from class c ∈ C is associated with a random vector with known distribution, perhaps up to some parameters to be estimated (e.g., data are multivariate normal vectors with unknown mean µc and covariance matrix Σc), and use the machinery of Statistics as their main technique; see e.g. [21].

For complex databases with no evident distributional assumptions on the data (typically the case for databases with a mixture of quantitative and qualitative variables), non-parametric methods, such as the one described in this talk, are needed.

In recent years there has been increasing interest in deriving (non-parametric) classification rules via Mathematical Programming. Most such methods require, for each individual i, a vector vi of n numerical variables. In particular, this assumes the variables to be ratio-scaled,

∗ Partially supported by projects BFM2002-04525-CO2-02, Ministerio de Ciencia y Tecnología, Spain, and FQM-329, Junta de Andalucía, Spain.
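The prototype-based variant summarized in the abstract (a 1-NN rule applied to a prototype subset of fixed cardinality, chosen to minimize the empirical misclassification count) can be illustrated with a minimal sketch. This is a brute-force enumeration, feasible only for tiny instances, which is consistent with the NP-hardness result: the paper's actual approaches are MIP formulations and a variable neighborhood metaheuristic. All names and the restriction of prototypes to training points are illustrative assumptions, not the paper's formulation.

```python
from itertools import combinations

def nn_label(v, prototypes, labels):
    # 1-NN rule: label v by the class of its nearest prototype
    # (squared Euclidean distance; a modeling assumption here).
    dists = [sum((a - b) ** 2 for a, b in zip(v, p)) for p in prototypes]
    return labels[dists.index(min(dists))]

def optimal_prototypes(X, y, k):
    # Exhaustively search all subsets of k training points and keep the
    # one whose induced 1-NN rule misclassifies the fewest training
    # individuals (the empirical misclassification cost of the text,
    # with unit costs assumed).
    best_idx, best_err = None, None
    for idx in combinations(range(len(X)), k):
        protos = [X[i] for i in idx]
        labs = [y[i] for i in idx]
        err = sum(nn_label(v, protos, labs) != c for v, c in zip(X, y))
        if best_err is None or err < best_err:
            best_idx, best_err = idx, err
    return best_idx, best_err

# Tiny two-class example: two well-separated clusters.
X = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (6.0, 5.0)]
y = ['a', 'a', 'b', 'b']
idx, err = optimal_prototypes(X, y, 2)
```

On this toy instance the search returns one prototype per class and zero training errors, reducing storage from four stored points to two while preserving the decision, which is exactly the trade-off motivating the Condensed/Reduced family of variants.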
