06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Müller, Henning, Univ. of Applied Sciences Sierre, Switzerland<br />

In this paper, classical approaches such as maximum combinations (combMAX), sum combinations (comb-SUM) and<br />

the product of the maximum and a nonzero number (combMNZ) were employed and the trade off between two fusion effects<br />

(chorus and dark horse effects) was studied based on the sum of n maximums. Various normalization strategies were<br />

tried out. The fusion algorithms are evaluated using the best four visual and textual runs of the ImageCLEF medical image<br />

retrieval task 2008 and 2009. The results show that fused runs outperform the best original runs and multi-modality fusion<br />

statistically outperforms single modality fusion. The logarithmic rank penalization shows to be the most stable normalization.<br />

The dark horse effect is in competition with the chorus effect and each of them can produce best fusion performance<br />

depending on the nature of the input data.<br />

16:40-17:00, Paper TuCT5.4<br />

Perceptual Image Retrieval by Adding Color Information to the Shape Context Descriptor<br />

Rusiñol, Marçal, Univ. Autònoma de Barcelona<br />

Nourbakhsh, Farshad, Computer Vision Center / Univ. Autònoma de Barcelona<br />

Karatzas, Dimosthenis, Univ. Autonoma de Barcelona<br />

Valveny, Ernest, Computer Vision Center / Univ. Autònoma de Barcelona<br />

Llados, Josep, Computer Vision Center<br />

In this paper we present a method for the retrieval of images in terms of perceptual similarity. Local color information is<br />

added to the shape context descriptor in order to obtain an object description integrating both shape and color as visual cues.<br />

We use a color naming algorithm in order to represent the color information from a perceptual point of view. The proposed<br />

method has been tested in two different applications, an object retrieval scenario based on color sketch queries and a color<br />

trademark retrieval problem. Experimental results show that the addition of the color information significantly outperforms<br />

the sole use of the shape context descriptor.<br />

17:00-17:20, Paper TuCT5.5<br />

Weighted Boundary Points for Shape Analysis<br />

Zhang, Jing, Univ. of South Florida<br />

Kasturi, Rangachar, Univ. of South Florida<br />

Shape analysis is an active and important branch in computer vision research field. In recent years, many geometrical, topological,<br />

and statistical features have been proposed and widely used for shape-related applications. In this paper, based on<br />

the properties of Distance Transform, we present a new shape feature, weight of boundary point. By computing the shortest<br />

distances between boundary points and distance contours of a transformed shape, every boundary point is assigned a weight,<br />

which contains the interior structure information of the shape. To evaluate the proposed new shape feature, we tested the<br />

weighted boundary points on shape matching and shape decomposition. The experimental results demonstrated the validity.<br />

TuCT6 Dolmabahçe Hall B<br />

Speech and Speaker Recognition Regular Session<br />

Session chair: Shinoda, Koichi (Tokyo Institute of Technology)<br />

15:40-16:00, Paper TuCT6.1<br />

Dimension-Decoupled Gaussian Mixture Model for Short Utterance Speaker Recognition<br />

Stadelmann, Thilo, Univ. of Marburg<br />

Freisleben, Bernd, Univ. of Marburg<br />

The Gaussian Mixture Model (GMM) is often used in conjunction with Mel-frequency cepstral coefficient (MFCC) feature<br />

vectors for speaker recognition. A great challenge is to use these techniques in situations where only small sets of training<br />

and evaluation data are available, which typically results in poor statistical estimates and, finally, recognition scores. Based<br />

on the observation of marginal MFCC probability densities, we suggest to greatly reduce the number of free parameters in<br />

the GMM by modeling the single dimensions separately after proper preprocessing. Saving about 90% of the free parameters<br />

as compared to an already optimized GMM and thus making the estimates more stable, this approach considerably improves<br />

recognition accuracy over the baseline as the utterances get shorter and saves a huge amount of computing time both in training<br />

and evaluation, enabling real-time performance. The approach is easy to implement and to combine with other short-utterance<br />

approaches, and applicable to other features as well.<br />

- 126 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!