06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

of the inner product matrices into diagonal matrices and orthonormal and into diagonal and upper triangular matrices.<br />

Next we describe the estimation of the inner product matrices from measured data as an optimization process on the homogeneous<br />

space of upper triangular matrices. We show that the decomposition leads to simple forms of partial derivatives<br />

that are commonly used in optimization algorithms. Using the group theoretical parametrization ensures also that all intermediate<br />

estimates of the inner product matrix are symmetric and positive definite. Finally we apply the method to a<br />

problem from psychophysics where the color perception properties of an observer are characterized with the help of color<br />

matching experiments. We will show that measurements from color weak observers require the enforcement of the positive-definiteness<br />

of the matrix with the help of the manifold optimization technique.<br />

13:30-16:30, Paper ThBCT9.21<br />

Rethinking Algorithm Design and Development in Speech Processing<br />

Stadelmann, Thilo, Univ. of Marburg<br />

Wang, Yinghui, Univ. of Marburg<br />

Smith, Matthew, Univ. of Hannover<br />

Ewerth, Ralph, Univ. of Marburg<br />

Freisleben, Bernd, Univ. of Marburg<br />

Speech processing is typically based on a set of complex algorithms requiring many parameters to be specified. When<br />

parts of the speech processing chain do not behave as expected, trial and error is often the only way to investigate the reasons.<br />

In this paper, we present a research methodology to analyze unexpected algorithmic behavior by making (intermediate)<br />

results of the speech processing chain perceivable and intuitively comprehensible by humans. The workflow of the<br />

process is explicated using a real-world example leading to considerable improvements in speaker clustering. The described<br />

methodology is supported by a software toolbox available for download.<br />

13:30-16:30, Paper ThBCT9.22<br />

Phone-Conditioned Suboptimal Wiener Filtering<br />

Gonzalez-Caravaca, Guillermo, Univ. Autonoma de Madrid<br />

Toledano, Doroteo, Univ. Autonoma de Madrid<br />

Puertas, Maria, Univ. Autonoma de Madrid<br />

A novel way of managing the compromise between noise reduction and speech distortion in Wiener filters is presented. It<br />

is based on adjusting the amount of noise reduced, and therefore the speech distortion introduced, on a phone-by-phone<br />

basis. We show empirically that optimal Wiener filters produce different amounts of speech distortion for different phones.<br />

Therefore we propose a phone-conditioned suboptimal Wiener filter that uses different amounts of noise reduction for<br />

each phone, based on a previous estimation of the amount of distortion introduced. Speech recognition results have shown<br />

that phone conditioning suboptimal Wiener filtering can provide almost a 5% additional relative improvement in word<br />

accuracy over comparable optimal Wiener filtering.<br />

13:30-16:30, Paper ThBCT9.23<br />

Geodesic Active Fields on the Sphere<br />

Zosso, Dominique, École Pol. Fédérale de Lausanne<br />

Thiran, Jean-Philippe, École Pol. Fédérale de Lausanne<br />

In this paper, we propose a novel method to register images defined on spherical meshes. Instances of such spherical<br />

images include inflated cortical feature maps in brain medical imaging or images from omni directional cameras. We apply<br />

the Geodesic Active Fields (GAF) framework locally at each vertex of the mesh. Therefore we define a dense deformation<br />

field, which is embedded in a higher dimensional manifold, and minimize the weighted Polyakov energy. While the<br />

Polyakov energy itself measures the hyper area of the embedded deformation field, its weighting allows to account for the<br />

quality of the current image alignment. Iteratively minimizing the energy drives the deformation field towards a smooth<br />

solution of the registration problem. Although the proposed approach does not necessarily outperform state-of-the-art<br />

methods that are tightly tailored to specific applications, it is of methodological interest due to its high degree of flexibility<br />

and versatility.<br />

- 319 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!