06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

13:30-16:30, Paper WeBCT8.40<br />

Image Categorization by Learned Nonlinear Subspace of Combined Visual-Words and Low-Level Features<br />

Han, Xian-Hua, Ritsumeikan Univ.<br />

Chen, Yen-Wei, Ritsumeikan Univ.<br />

Ruan, Xiang, Omron Corparation<br />

Image category recognition is important to access visual information on the level of objects and scene types. This paper<br />

presents a new algorithm for the automatic recognition of object and scene classes. Compact and yet discriminative visual-words<br />

and low-level-features object class subspaces are automatically learned from a set of training images by a Supervised<br />

Nonlinear Neighborhood Embedding (SNNE) algorithm, which can learn an adaptive nonlinear subspace by<br />

preserving the neighborhood structure of the visual feature space. The main contribution of this paper is two fold: i) an<br />

optimally compact and discriminative feature subspace is learned by the proposed SNNE algorithm for different feature<br />

space (visual-word and low-level features). ii) An effective merge of different feature subspace can be implemented simply.<br />

High classification accuracy is demonstrated on different database including the scene databas (Simplicity) and object<br />

recognition database (Caltech). We confirm that the proposed strategy is much better than state-of-the-art methods for different<br />

databases.<br />

13:30-16:30, Paper WeBCT8.41<br />

Can Motion Segmentation Improve Patch-Based Object Recognition?<br />

Ulges, Adrian, DFKI<br />

Breuel, Thomas -<br />

Patch-based methods, which constitute the state of the art in object recognition, are often applied to video data, where<br />

motion information provides a valuable clue for separating objects of interest from the background. We show that such<br />

motion-based segmentation improves the robustness of patch-based recognition with respect to clutter. Our approach,<br />

which employs segmentation information to rule out incorrect correspondences between training and test views, is demonstrated<br />

empirically to distinctly outperform baselines operating on unsegmented images. Relative improvements reach<br />

50% for the recognition of specific objects, and 33% for object category retrieval.<br />

13:30-16:30, Paper WeBCT8.42<br />

Semi-Supervised and Interactive Semantic Concept Learning for Scene Recognition<br />

Han, Xian-Hua, Ritsumeikan Univ.<br />

Chen, Yen-Wei, Ritsumeikan Univ.<br />

Ruan, Xiang, Omron Corparation<br />

In this paper, we present a novel semi-supervised and interactive concept learning algorithm for scene recognition by local<br />

semantic description. Our work is motivated by the continuing effort in content-based image retrieval to extract and to<br />

model the semantic content of images. The basic idea of the semantic modeling is to classify local image regions into semantic<br />

concept classes such as water, sunset, or sky [1]. However, labeling concept sampling manually for training semantic<br />

model is fairly expensive, and the labeling results is, to some extent, subjective to the operators. In this paper, by using<br />

the proposed semi-supervised and interactive learning algorithm, training samples and new concepts can be obtained accurately<br />

and efficiently. Through extensive experiments, we demonstrate that the image concept representation is well<br />

suited for modeling the semantic content of heterogenous scene categories, and thus for recognition and retrieval. Furthermore,<br />

higher recognition accuracy can be achieved by updating new training samples and concepts, which are obtained<br />

by the novel proposed algorithm.<br />

13:30-16:30, Paper WeBCT8.43<br />

Dense Structure Inference for Object Classification in Aerial LIDAR Dataset<br />

Kim, Eunyoung, Univ. of Southern California<br />

Medioni, Gerard, Univ. of Southern California<br />

We present a framework to classify small freeform objects in 3D aerial scans of a large urban area. The system first identifies<br />

large structures such as the ground surface and roofs of buildings densely built in the scene, by fitting planar patches<br />

and grouping adjacent patches similar in pose together. Then, it segments initial object candidates which represent the<br />

visible surface of an object using the identified structures. To deal with sparse density in points representing each candidate,<br />

we also propose a novel method to infer a dense 3D structure from the given sparse and noisy points without any meshes<br />

- 222 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!