Abstract book (pdf) - ICPR 2010

More documents

Recommendations

Info

13:30-16:30, Paper WeBCT9.34 Comparison of Multidimensional Data Access Methods for Feature-Based Image Retrieval Arslan, Serdar, Middle East Tech. Univ. Açar, Esra, Middle East Tech. Univ. Saçan, Ahmet, Middle East Tech. Univ. Toroslu, Ismail Hakkı , Middle East Tech. Univ. Yazıcı, Adnan, Middle East Tech. Univ. Within the scope of information retrieval, efficient similarity search in large document or multimedia collections is a critical task. In this paper, we present a rigorous comparison of three different approaches to the image retrieval problem, including cluster-based indexing, distance-based indexing, and multidimensional scaling methods. The time and accuracy trade-offs for each of these methods are demonstrated on a large Corel image database. Similarity of images is obtained via a feature-based similarity measure using four MPEG-7 low-level descriptors. We show that an optimization of feature contributions to the distance measure can identify irrelevant features and is necessary to obtain the maximum accuracy. We further show that using multidimensional scaling can achieve comparable accuracy, while speeding-up the query times significantly by allowing the use of spatial access methods. 13:30-16:30, Paper WeBCT9.35 A Pixel-Based Evaluation Method for Text Detection in Color Images Anthimopoulos, Marios, National Center for Scientific Res. “Demokritos” Vlissidis, Nikolaos, National Center for Scientific Res. “Demokritos” Gatos, B., National Center for Scientific Res. “Demokritos” This paper proposes a performance evaluation method for text detection in color images. The method, contrary to previous approaches, is not based on the inexplicitly defined text bounding boxes for the evaluation of the text detection result but considers only the text pixels detected by binarizing the image and applying a color inversion if needed. Moreover, in order to gain independence from the chosen binarization algorithm, the method uses the skeleton of the binarized image. The results produced by the proposed evaluation protocol proved to be quite representative and reasonable compared to the corresponding optical result. 13:30-16:30, Paper WeBCT9.36 Active Boosting for Interactive Object Retrieval Lechervy, Alexis, ETIS, CNRS, ENSEA, Univ. Cergy-Pontoise Gosselin, Philippe Henri, CNRS Precioso, Frederic, ETIS, CNRS, ENSEA, Univ. Cergy-Pontoise This paper presents a new algorithm based on boosting for interactive object retrieval in images. Recent works propose online boosting algorithms where weak classifier sets are iteratively trained from data. These algorithms are proposed for visual tracking in videos, and are not well adapted to online boosting for interactive retrieval. We propose in this paper to iteratively build weak classifiers from images, labeled as positive by the user during a retrieval session. A novel active learning strategy for the selection of images for user annotation is also proposed. This strategy is used to enhance the strong classifier resulting from boosting process, but also to build new weak classifiers. Experiments have been carried out on a generalist database in order to compare the proposed method to a SVM based reference approach. 13:30-16:30, Paper WeBCT9.37 Geotagged Photo Recognition using Corresponding Aerial Photos with Multiple Kernel Learning Keita, Yaegashi, Univ. of Electro-Commnications Keiji, Yanai, Univ. of Electro-Commnications In this paper, we treat with generic object recognition for geotagged images. As a recognition method for geotagged photos, we have already proposed exploiting aerial photos around geotag places as additional image features for visual recognition of geotagged photos. In the previous work, to fuse two kinds of features, we just concatenate them. Instead, in this paper, we introduce Multiple Kernel Learning (MKL) to integrate both features of photos and aerial images. MKL can estimate the contribution weights to integrate both kinds of features. In the experiments, we confirmed effectiveness of usage of aerial photos for recognition of geotagged photos, and we evaluated the weights of both features estimated by MKL for eighteen concepts. - 236 -
13:30-16:30, Paper WeBCT9.38 Efficient Semantic Indexing for Image Retrieval Pulla, Chandrika, International Inst. of Information Tech. Hyderabad Karthik, Suman, International Inst. of Information Tech. Hyderabad Jawahar, C. V., IIIT Semantic analysis of a document collection can be viewed as an unsupervised clustering of the constituent words and documents around hidden or latent concepts. This has shown to improve the performance of visual bag of words in image retrieval. However, the enhancement in performance depends heavily on the right choice of number of semantic concepts. Most of the semantic indexing schemes are also computationally costly. In this paper, we employ a bipartite graph model (BGM) for image retrieval. BGM is a scalable data structure that aids semantic indexing in an efficient manner. It can also be incrementally updated. BGM uses \textbf{tf-idf} values for building a semantic bipartite graph. We also introduce a graph partitioning algorithm that works on the BGM to retrieve semantically relevant images from a database. We demonstrate the properties as well as performance of our semantic indexing scheme through a series of experiments. We also compare our methods with incremental pLSA. 13:30-16:30, Paper WeBCT9.39 Improving and Aligning Speech with Presentation Slides Swaminathan, Ranjini, Univ. of Arizona Thompson, Michael E., Univ. of Arizona Fong, Sandiway, Univ. of Arizona Efrat, Alon, Univ. of Arizona Amir, Arnon Barnard, Kobus, Univ. of Arizona We present a novel method to correct automatically generated speech transcripts of talks and lecture videos using text from accompanying presentation slides. The approach finesses the challenges of dealing with technical terms which are often outside the vocabulary of speech recognizers. Further, we align the transcript to the slide word sequence so that we can improve the organization of closed captioning for hearing impaired users, and improve automatic highlighting or magnification for visually impaired users. For each speech segment associated with a slide, we construct a sequential Hidden Markov Model for the observed phonemes that follows slide word order, interspersed with text not on the slide. Incongruence between slide words and mistaken transcript words is accounted for using phoneme confusion probabilities. Hence, transcript words different from aligned high probability slide words can be corrected. Experiments on six talks show improvement in transcript accuracy and alignment with slide words. 13:30-16:30, Paper WeBCT9.40 The ImageCLEF Medical Retrieval Task at ICPR 2010 - Information Fusion Kalpathy-Cramer, Jayashree, Oregon Health & Science Univ. Müller, Henning, Univ. of Applied Sciences An increasing number of clinicians, researchers, educators and patients routinely search for medical information on the Internet as well as in image archives. However, image retrieval is far less understood and developed than text-based search. The ImageCLEF medical image retrieval task is an international benchmark that enables researchers to assess and compare techniques for medical image retrieval using standard test collections. Although text retrieval is mature and well researched, it is limited by the quality and availability of the annotations associated with the images. Advances in computer vision have led to methods for using the image itself as search entity. However, the success of purely content-based techniques has been limited and these systems have not had much clinical success. On the other hand a combination of text- and content-based retrieval can achieve improved retrieval performance if combined effectively. Combining visual and textual runs is not trivial based on experience in ImageCLEF. The goal of the fusion challenge at ICPR is to encourage participants to combine visual and textual results to improve search performance. Participants were provided textual and visual runs, as well as the results of the manual judgments from ImageCLEFmed 2008 as training data. The goal was to combine textual and visual runs from 2009. In this paper, we present the results from this ICPR contest. 13:30-16:30, Paper WeBCT9.41 Unified Approach to Detection and Identification of Commercial Films by Temporal Occurrence Pattern Putpuek, Narongsak, Chulalongkorn Univ. - 237 -
Page 3 and 4:
CONTENTS Organizing Committees 2 Tr
Page 5 and 6:
Arun Ross West Virginia University
Page 7 and 8:
Track IV: Biometrics and Human Comp
Page 9 and 10:
Message from the Technical Program
Page 11 and 12:
- 9 -
Page 13 and 14:
- 11 -
Page 15 and 16:
- 13 -
Page 17 and 18:
- 15 -
Page 19 and 20:
Technical Program for Monday August
Page 21 and 22:
09:00-09.30, MoOT10 Anadolu Auditor
Page 23 and 24:
11:20-11:40, Paper MoAT2.2 2D Shape
Page 25 and 26:
sponding feature points between the
Page 27 and 28:
This paper concentrates on speech d
Page 29 and 30:
12:00-12:20, Paper MoAT6.4 AR-PCA-H
Page 31 and 32:
tistical models fitted to large dat
Page 33 and 34:
16:10-16:30, Paper MoBT2.3 A Recurs
Page 35 and 36:
MoBT4 Dolmabahçe Hall A Ocular Bio
Page 37 and 38:
equation for estimating alphas of p
Page 39 and 40:
15:30-15:50, Paper MoBT7.1 Dyslexia
Page 41 and 42:
15:00-17:10, Paper MoBT8.3 Fast Odo
Page 43 and 44:
15:00-17:10, Paper MoBT8.11 Abnorma
Page 45 and 46:
process. This method is an importan
Page 47 and 48:
paper describes a general framework
Page 49 and 50:
work machine environments, in which
Page 51 and 52:
15:00-17:10, Paper MoBT8.44 Road Ch
Page 53 and 54:
the backgrounds with random landsca
Page 55 and 56:
Research on complex shape recogniti
Page 57 and 58:
image observations. Experimental re
Page 59 and 60:
15:00-17:10, Paper MoBT9.17 Possibi
Page 61 and 62:
15:00-17:10, Paper MoBT9.25 Gaussia
Page 63 and 64:
15:00-17:10, Paper MoBT9.34 CDP Mix
Page 65 and 66:
15:00-17:10, Paper MoBT9.42 Learnin
Page 67 and 68:
This paper proposes a solution to t
Page 69 and 70:
15:00-17:10, Paper MoBT9.59 Tensor
Page 71 and 72:
Technical Program for Tuesday Augus
Page 73 and 74:
TuAT1 Marmara Hall Object Detection
Page 75 and 76:
complexity of the co-association ma
Page 77 and 78:
with Shape from Shading, we fully r
Page 79 and 80:
09:40-10:00, Paper TuAT5.3 Adding A
Page 81 and 82:
is integrate the features at a desc
Page 83 and 84:
TuAT8 Lower Foyer Structural Method
Page 85 and 86:
09:00-11:10, Paper TuAT8.8 Backgrou
Page 87 and 88:
A Discrete Labelling Approach to At
Page 89 and 90:
09:00-11:10, Paper TuAT8.25 Backgro
Page 91 and 92:
09:00-11:10, Paper TuAT8.33 A Fast
Page 93 and 94:
09:00-11:10, Paper TuAT8.41 A Varia
Page 95 and 96:
matching 3D CAD models, shape match
Page 97 and 98:
TuAT9 Upper Foyer Biometrics Poster
Page 99 and 100:
09:00-11:10, Paper TuAT9.9 Robust R
Page 101 and 102:
09:00-11:10, Paper TuAT9.17 Probabi
Page 103 and 104:
09:00-11:10, Paper TuAT9.24 Attacki
Page 105 and 106:
09:00-11:10, Paper TuAT9.31 An Effi
Page 107 and 108:
evaluation of micropattern represen
Page 109 and 110:
signed a new encoding method. The e
Page 111 and 112:
09:00-11:10, Paper TuAT9.54 Face Re
Page 113 and 114:
some applications such as stereo ma
Page 115 and 116:
14:50-15:10, Paper TuBT2.5 Compress
Page 117 and 118:
13:50-14:10, Paper TuBT4.2 Localize
Page 119 and 120:
In this article, a new fragile, bli
Page 121 and 122:
14:10-14:30, Paper TuBT7.3 Retinal
Page 123 and 124:
similarities among face carvings ar
Page 125 and 126:
16:00-16:20, Paper TuCT3.2 Level-Se
Page 127 and 128:
A new method to extract dashed line
Page 129 and 130:
16:00-16:20, Paper TuCT6.2 Modeling
Page 131 and 132:
Ramos, Daniel, Univ. Autonoma de Ma
Page 133 and 134:
paper, we proposes a one-shot scann
Page 135 and 136:
13:30-16:30, Paper TuBCT8.16 Visual
Page 137 and 138:
13:30-16:30, Paper TuBCT8.24 Discri
Page 139 and 140:
13:30-16:30, Paper TuBCT8.32 Action
Page 141 and 142:
13:30-16:30, Paper TuBCT8.40 Shape
Page 143 and 144:
13:30-16:30, Paper TuBCT8.48 Accura
Page 145 and 146:
13:30-16:30, Paper TuBCT9.3 Prototy
Page 147 and 148:
13:30-16:30,Paper TuBCT9.11 Writing
Page 149 and 150:
13:30-16:30,Paper TuBCT9.20 Unsuper
Page 151 and 152:
13:30-16:30,Paper TuBCT9.28 Documen
Page 153 and 154:
didate word of the document with wh
Page 155 and 156:
string as an input to the transduce
Page 157 and 158:
to previous pre-processing such as
Page 159 and 160:
Technical Program for Wednesday Aug
Page 161 and 162:
WeAT1 Marmara Hall Tracking and Sur
Page 163 and 164:
ing that is constructed in a tangen
Page 165 and 166:
09:20-09:40, Paper WeAT4.2 Von Mise
Page 167 and 168:
10:00-10:20, Paper WeAT5.4 Face Hal
Page 169 and 170:
09:00-09:20, Paper WeAT7.1 Multi-Vi
Page 171 and 172:
Optimal sharpness differs from imag
Page 173 and 174:
09:00-11:10, Paper WeAT8.14 Automat
Page 175 and 176:
09:00-11:10, Paper WeAT8.22 On the
Page 177 and 178:
09:00-11:10, Paper WeAT8.29 Multi-S
Page 179 and 180:
09:00-11:10, Paper WeAT8.38 An Adap
Page 181 and 182:
09:00-11:10, Paper WeAT8.47 Robust
Page 183 and 184:
09:00-11:10, Paper WeAT8.56 Enhanci
Page 185 and 186:
called 22q11.2 Deletion Syndrome. G
Page 187 and 188: cally presented simultaneously. Twe
Page 189 and 190: separate atlas, and identifying the
Page 191 and 192: 09:00-11:10, Paper WeAT9.25 3D Reco
Page 193 and 194: eling the image background with a p
Page 195 and 196: 09:00-11:10, Paper WeAT9.41 Segment
Page 197 and 198: This paper presents a new approach
Page 199 and 200: 13:50-14:10, Paper WeBT3.2 Gait Lea
Page 201 and 202: WeBT5 Topkapı Hall B Feature Extra
Page 203 and 204: 14:10-14:30, Paper WeBT6.3 Script I
Page 205 and 206: 14:50-15:10, Paper WeBT7.5 Gaussian
Page 207 and 208: 16:00-16:20, Paper WeCT2.2 Direct P
Page 209 and 210: 17:00-17:20, Paper WeCT3.5 Length I
Page 211 and 212: 16:20-16:40, Paper WeCT5.3 Developm
Page 213 and 214: WeCT7 Dolmabahçe Hall C Handwritin
Page 215 and 216: 13:30-16:30, Paper WeBCT8.3 Nonline
Page 217 and 218: We present a method of performing k
Page 219 and 220: Tefas, Anastasios, Aristotle Univ.
Page 221 and 222: For classifier ensembles, an effect
Page 223 and 224: native power more efficiently becau
Page 225 and 226: and iterations. To label object can
Page 227 and 228: 13:30-16:30, Paper WeBCT8.53 Spike-
Page 229 and 230: dicates the possible locations of t
Page 231 and 232: In the off-line one, an alphabet, o
Page 233 and 234: image that it is currently displaye
Page 235 and 236: Answering to a query like when a pa
Page 237: 13:30-16:30, Paper WeBCT9.30 A Mult
Page 241 and 242: Technical Program for Thursday Augu
Page 243 and 244: ThAT1 Marmara Hall Object Detection
Page 245 and 246: 10:00-10:20, Paper ThAT2.4 Rectifyi
Page 247 and 248: optimization problem which returns
Page 249 and 250: 10:20-10:40, Paper ThAT5.5 On Selec
Page 251 and 252: 09:20-09:40, Paper ThAT7.2 HMM-Base
Page 253 and 254: to obtain a background image. We ha
Page 255 and 256: sidering the fact that the parts in
Page 257 and 258: Jeon, Moongu, Gwangju Inst. of Scie
Page 259 and 260: fields of view. The nodes in the to
Page 261 and 262: 09:00-11:10, Paper ThAT8.38 Robust
Page 263 and 264: We propose a scene classification m
Page 265 and 266: 09:00-11:10, Paper ThAT8.53 Efficie
Page 267 and 268: letters that can be combined to cre
Page 269 and 270: Vector Machines. Results show that
Page 271 and 272: 09:00-11:10, Paper ThAT9.19 EEG-Bas
Page 273 and 274: algorithm that translates brain sig
Page 275 and 276: 09:00-11:10, Paper ThAT9.35 Off-Lin
Page 277 and 278: In this paper, we present a calibra
Page 279 and 280: curacy for imagined sign. Pairwise
Page 281 and 282: gaze estimation improved by 61:06%
Page 283 and 284: For the interpretation of a visual
Page 285 and 286: camera parameters by minimizing an
Page 287 and 288: combination of Gaussians (LCG), Mar
Page 289 and 290:
and video frames were used to evalu
Page 291 and 292:
Mezghani, Neila, Centre de Recherch
Page 293 and 294:
ThCT2 Anadolu Auditorium Classifica
Page 295 and 296:
16:40-17:00, Paper ThCT3.4 Boosted
Page 297 and 298:
16:00-16:20, Paper ThCT5.2 Panorami
Page 299 and 300:
extend this work by using functiona
Page 301 and 302:
prediction of any one supposed expe
Page 303 and 304:
portant characteristic of the image
Page 305 and 306:
13:30-16:30, Paper ThBCT8.19 Word C
Page 307 and 308:
This paper proposes an incremental
Page 309 and 310:
13:30-16:30, Paper ThBCT8.34 Hetero
Page 311 and 312:
13:30-16:30, Paper ThBCT8.43 Local
Page 313 and 314:
13:30-16:30, Paper ThBCT8.52 Combin
Page 315 and 316:
13:30-16:30, Paper ThBCT8.59 Discri
Page 317 and 318:
In this paper, an anisotropic diffu
Page 319 and 320:
13:30-16:30, Paper ThBCT9.12 Image
Page 321 and 322:
of the inner product matrices into
Page 323 and 324:
done on MIT-BIH Arrhythmia database
Page 325 and 326:
13:30-16:30, Paper ThBCT9.36 Fence
Page 327 and 328:
ness and quality of the tracking co
Page 329 and 330:
13:30-16:30, Paper ThBCT9.52 An Emp
show all

Abstract book (pdf) - ICPR 2010

Create successful ePaper yourself

Delete template?

Save as template?