06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

foregrounds in multi-views based on simply selecting segments corresponding to the real foreground in only one image,<br />

further improved foregrounds are extracted by back-projecting 3D objects reconstructed based on the foreground extracted<br />

in the previous step into segments of each image in multi-views. These two steps are iteratively performed until the energy<br />

function is optimized. In the experiments, more accurate boundaries were obtained, although the proposed method used a<br />

simple 3D reconstruction method.<br />

09:00-11:10, Paper ThAT8.42<br />

Human Pose Estimation for Multiple Persons based on Volume Reconstruction<br />

Luo, Xinghan, Utrecht Univ.<br />

Berendsen, Berend<br />

Tan, Robby T., Utrecht Univ.<br />

Veltkamp, R. C., Utrecht Univ.<br />

Most of the development of pose recognition focused on a single person. However, many applications of computer vision<br />

essentially require the estimation of multiple people. Hence, in this paper, we address the problems of estimating poses of<br />

multiple persons using volumes estimated from multiple cameras. One of the main issues that causes the multiple person<br />

from multiple cameras to be problematic is the present of ghost; volumes. This problem arises when the projections of<br />

two different silhouettes of two different persons onto the 3D world overlap in a place where in fact there is no person in<br />

it. To solve this problem, we first introduce a novel principal axis-based framework to estimate the 3D ground plane positions<br />

of multiple people, and then use the position cues to label the multi-person volumes (voxels), while considering<br />

the voxel connectivity. Having labeled the voxels, we fit the volume of each person with a body model, and determine the<br />

pose of the person based on the model. The results on real videos demonstrate the accuracy and efficiency of our approach.<br />

09:00-11:10, Paper ThAT8.43<br />

3D Articulated Shape Segmentation using Motion Information<br />

Kalafatlar, Emre, Koç Univ.<br />

Yemez, Yucel, Koç Univ.<br />

We present a method for segmentation of articulated 3D shapes by incorporating the motion information obtained from<br />

time-varying models. We assume that the articulated shape is given in the form of a mesh sequence with fixed connectivity<br />

so that the inter-frame vertex correspondences, hence the vertex movements, are known a priori. We use different postures<br />

of an articulated shape in multiple frames to constitute an affinity matrix which encodes both temporal and spatial similarities<br />

between surface points. The shape is then decomposed into segments in spectral domain based on the affinity<br />

matrix using a standard K-means clustering algorithm. The performance of the proposed segmentation method is demonstrated<br />

on the mesh sequence of a human actor.<br />

09:00-11:10, Paper ThAT8.44<br />

Online Learning with Self-Organizing Maps for Anomaly Detection in Crowd Scenes<br />

Feng, Jie, Peking Univ.<br />

Zhang, Chao, Peking Univ.<br />

Hao, Pengwei, Queen Mary Univ. of London<br />

Detecting abnormal behaviors in crowd scenes is quite important for public security and has been paid more and more attentions.<br />

Most previous methods use offline trained model to perform detection which can’t handle the constantly changing<br />

crowd environment. In this paper, we propose a novel unsupervised algorithm to detect abnormal behavior patterns in<br />

crowd scenes with online learning. The crowd behavior pattern is extracted from the local spatio-temporal volume which<br />

consists of multiple motion patterns in temporal order. An online self-organizing map (SOM) is used to model the large<br />

number of behavior patterns in crowd. Each neuron can be updated by incrementally learning the new observations. To<br />

demonstrate the effectiveness of our proposed method, we have performed experiments on real-world crowd scenes. The<br />

online learning can efficiently reduce the false alarms while still be able to detect most of the anomalies.<br />

09:00-11:10, Paper ThAT8.45<br />

Scene Classification using Spatial Pyramid of Latent Topics<br />

Ergul, Emrah, Turkish Naval Academy<br />

Arica, Nafiz, Turkish Naval Academy<br />

- 260 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!