Abstract book (pdf) - ICPR 2010
Abstract book (pdf) - ICPR 2010
Abstract book (pdf) - ICPR 2010
- TAGS
- abstract
- icpr
- icpr2010.org
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
13:30-16:30, Paper WeBCT9.38<br />
Efficient Semantic Indexing for Image Retrieval<br />
Pulla, Chandrika, International Inst. of Information Tech. Hyderabad<br />
Karthik, Suman, International Inst. of Information Tech. Hyderabad<br />
Jawahar, C. V., IIIT<br />
Semantic analysis of a document collection can be viewed as an unsupervised clustering of the constituent words and documents<br />
around hidden or latent concepts. This has shown to improve the performance of visual bag of words in image retrieval.<br />
However, the enhancement in performance depends heavily on the right choice of number of semantic concepts.<br />
Most of the semantic indexing schemes are also computationally costly. In this paper, we employ a bipartite graph model<br />
(BGM) for image retrieval. BGM is a scalable data structure that aids semantic indexing in an efficient manner. It can also<br />
be incrementally updated. BGM uses \textbf{tf-idf} values for building a semantic bipartite graph. We also introduce a<br />
graph partitioning algorithm that works on the BGM to retrieve semantically relevant images from a database. We demonstrate<br />
the properties as well as performance of our semantic indexing scheme through a series of experiments. We also<br />
compare our methods with incremental pLSA.<br />
13:30-16:30, Paper WeBCT9.39<br />
Improving and Aligning Speech with Presentation Slides<br />
Swaminathan, Ranjini, Univ. of Arizona<br />
Thompson, Michael E., Univ. of Arizona<br />
Fong, Sandiway, Univ. of Arizona<br />
Efrat, Alon, Univ. of Arizona<br />
Amir, Arnon<br />
Barnard, Kobus, Univ. of Arizona<br />
We present a novel method to correct automatically generated speech transcripts of talks and lecture videos using text<br />
from accompanying presentation slides. The approach finesses the challenges of dealing with technical terms which are<br />
often outside the vocabulary of speech recognizers. Further, we align the transcript to the slide word sequence so that we<br />
can improve the organization of closed captioning for hearing impaired users, and improve automatic highlighting or magnification<br />
for visually impaired users. For each speech segment associated with a slide, we construct a sequential Hidden<br />
Markov Model for the observed phonemes that follows slide word order, interspersed with text not on the slide. Incongruence<br />
between slide words and mistaken transcript words is accounted for using phoneme confusion probabilities. Hence,<br />
transcript words different from aligned high probability slide words can be corrected. Experiments on six talks show improvement<br />
in transcript accuracy and alignment with slide words.<br />
13:30-16:30, Paper WeBCT9.40<br />
The ImageCLEF Medical Retrieval Task at <strong>ICPR</strong> <strong>2010</strong> - Information Fusion<br />
Kalpathy-Cramer, Jayashree, Oregon Health & Science Univ.<br />
Müller, Henning, Univ. of Applied Sciences<br />
An increasing number of clinicians, researchers, educators and patients routinely search for medical information on the<br />
Internet as well as in image archives. However, image retrieval is far less understood and developed than text-based search.<br />
The ImageCLEF medical image retrieval task is an international benchmark that enables researchers to assess and compare<br />
techniques for medical image retrieval using standard test collections. Although text retrieval is mature and well researched,<br />
it is limited by the quality and availability of the annotations associated with the images. Advances in computer vision<br />
have led to methods for using the image itself as search entity. However, the success of purely content-based techniques<br />
has been limited and these systems have not had much clinical success. On the other hand a combination of text- and content-based<br />
retrieval can achieve improved retrieval performance if combined effectively. Combining visual and textual<br />
runs is not trivial based on experience in ImageCLEF. The goal of the fusion challenge at <strong>ICPR</strong> is to encourage participants<br />
to combine visual and textual results to improve search performance. Participants were provided textual and visual runs,<br />
as well as the results of the manual judgments from ImageCLEFmed 2008 as training data. The goal was to combine<br />
textual and visual runs from 2009. In this paper, we present the results from this <strong>ICPR</strong> contest.<br />
13:30-16:30, Paper WeBCT9.41<br />
Unified Approach to Detection and Identification of Commercial Films by Temporal Occurrence Pattern<br />
Putpuek, Narongsak, Chulalongkorn Univ.<br />
- 237 -