06.02.2013 Views

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

Abstract book (pdf) - ICPR 2010

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

13:30-16:30,Paper TuBCT9.16<br />

Offline Arabic Handwriting Identification using Language Diacritics<br />

Lutf, Mohammed, Huazhong Univ. of Science and Tech.<br />

You, Xinge, Huazhong Univ. of Science and Tech.<br />

Li, Hong, Huazhong Univ. of Science and Tech.<br />

In this paper, we present an approach for writer identification using off-line Arabic handwriting. The proposed method introduced<br />

Arabic writing in a new form, by presenting Arabic writing in its basic components instead of alphabetic. We<br />

split the input document into two parts: one for the letters and the other for the diacritics, we extract all diacritics from the<br />

input image and calculate the LBP histogram for each diacritic then concatenate these histograms to use it as handwriting<br />

features. We use the IFN/ENIT database in the experiments reported here and our tests involve 287 writers. The results<br />

show that our method is very effective and makes the handling of the Arabic handwriting more easily than before.<br />

13:30-16:30,Paper TuBCT9.17<br />

Removing Rule-Lines from Binary Handwritten Arabic Document Images using Directional Local Profile<br />

Shi, Zhixin, SUNY at Buffalo<br />

Setlur, Srirangaraj, Univ. at Buffalo<br />

Govindaraju, Venu, Univ. at Buffalo<br />

In this paper, we present a novel approach for detecting and removing pre-printed rule-lines from binary handwritten<br />

Arabic document images. The proposed technique is based on a directional local profiling approach for the detection of<br />

the rule-line locations. Then a refined adaptive vertical run-length search is designed for removing the rule-line pixels<br />

without much damaging to the text. They are also tolerate to the variations in the rule-lines such as broken lines, orientation<br />

changes and variation in the thickness of the rule-lines. Analysis of experimental results on the DARPA MADCAT Arabic<br />

handwritten document data indicates that the method is robust and is capable of correctly removing rule-lines.<br />

13:30-16:30,Paper TuBCT9.18<br />

A Bag-of-Pages Approach to Unordered Multi-Page Document Classification<br />

Gordo, Albert, Univ. Autònoma de Barcelona<br />

Perronnin, Florent, Xerox Res. Centre Europe<br />

We consider the problem of classifying documents containing multiple unordered pages. For this purpose, we propose a<br />

novel bag-of-pages document representation. To represent a document, one assigns every page to a prototype in a code<strong>book</strong><br />

of pages. This leads to a histogram representation which can then be fed to any discriminative classifier. We also consider<br />

several refinements over this initial approach. We show on two challenging datasets that the proposed approach significantly<br />

outperforms a baseline system.<br />

13:30-16:30,Paper TuBCT9.19<br />

Fast Seamless Skew and Orientation Detection in Document Images<br />

Konya, Iuliu Vasile, Fraunhofer IAIS<br />

Eickeler, Stefan, Fraunhofer IAIS<br />

Seibert, Christoph, Fraunhofer IAIS<br />

Reliable and generic methods for skew detection are a necessity for any large-scale digitization projects. As one of the<br />

first processing steps, skew detection and correction has a heavy influence on all further document analysis modules, such<br />

as geometric and logical layout analysis. This paper introduces a generic, scale-independent algorithm capable of accurately<br />

detecting the global skew angle of document images within the range [-90,90] degrees. By using the same framework, the<br />

algorithm is then extended for Roman script documents so as to cope with the full range [-180,180) degrees of possible<br />

skew angles. Despite its generality, the improved algorithm is very fast and requires no explicit parameters. Experiments<br />

on a combined test set comprising around 110000 real-life images show the accuracy and robustness of the proposed<br />

method.<br />

- 146 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!