06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

features widely used in existing DSI solutions, this new feature is much more robust to variations in slant, font and style<br />

of printed documents. Experimental results show that the method achieves promising identification performances.<br />

13:30-16:30,Paper TuBCT9.48<br />

Using Spatial Relations for Graphical Symbol Description<br />

K. C., Santosh, INRIA – LORIA and INPL<br />

Wendling, Laurent, Univ. Paris Descartes<br />

Lamiroy, Bart, LORIA – INPL<br />

In this paper, we address the use of unified spatial relations for symbol description. We present a topologically guided directional<br />

relation signature. It references a unique point set instead of one entity in a pair, thus avoiding problems related<br />

to erroneous choices of reference entities and preserves symmetry. We experimentally validate our method on showing its<br />

ability to serve in a symbol retrieval application, based only on a spatial relational descriptor that represents the links between<br />

the decomposed structural patterns called “vocabulary” in a spatial relational graph.<br />

13:30-16:30,Paper TuBCT9.49<br />

Automatic Discrimination between Confusing Classes with Writing Styles Verification in Arabic Handwritten Numeral<br />

Recognition<br />

He, Chun Lei, Concordia Univ.<br />

Lam, Louisa, Concordia Univ.<br />

Suen, Ching Y.<br />

In handwriting recognition, confusing/conflicting writing styles can result in irreducible errors, so the study of writing<br />

style consistencies is important for applications. In Arabic Handwritten Numeral Recognition, most errors occur between<br />

samples of classes two and three due to their very similar shapes in some writing styles. In this paper, an automated writing<br />

style detection process is effectively implemented in the pair-wise verification of samples in these two classes. As a result,<br />

the recognition results have improved significantly with a reduction by 25% of previous errors. With rejection, when the<br />

LDA (Linear Discriminant Analysis) measurement rejection threshold is adjusted to maintain the same error rate, the<br />

recognition rate increases from 96.87% to 97.81%.<br />

13:30-16:30,Paper TuBCT9.50<br />

Random Subspace Method in Text Categorization<br />

Gangeh, Mehrdad, Univ. of Waterloo<br />

Kamel, Mohamed S, Univ. of Waterloo<br />

Duin, Robert, TU Delft<br />

In text categorization (TC), which is a supervised technique, a feature vector of terms or phrases is usually used to represent<br />

the documents. Due to the huge number of terms in even a moderate-size text corpus, high dimensional feature space is<br />

an intrinsic problem in TC. Random subspace method (RSM), a technique that divides the feature space to smaller ones<br />

each submitted to a (base) classifier (BC) in an ensemble, can be an effective approach to reduce the dimensionality of the<br />

feature space. Inspired by a similar research on functional magnetic resonance imaging (fMRI) of brain, here we address<br />

the estimation of ensemble parameters, i.e., the ensemble size (L) and the dimensionality of feature subsets (M) by defining<br />

three criteria: usability, coverage, and diversity of the ensemble. We will show that relatively medium M and small L yield<br />

an ensemble that improves the performance of a single support vector machine, which is considered as the state-of-the-art<br />

in TC.<br />

13:30-16:30,Paper TuBCT9.51<br />

Shape-DNA: Effective Character Restoration and Enhancement for Arabic Text Documents<br />

Caner, Gulcin, Pol. Rain, Inc.<br />

Haritaoglu, Ismail, Pol. Rain, Inc.<br />

We present a novel learning-based image restoration and enhancement technique for improving character recognition performance<br />

of OCR products for degraded documents or documents/text captured with mobile devices such as cameraphones.<br />

The proposed technique is language independent and can simultaneously increase the effective resolution and<br />

restore broken characters with artifacts due to image capturing device such as a low quality/low resolution camera, or due<br />

- 154 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!