Download full issue (PDF 2.1MB) - Council of European ...
Download full issue (PDF 2.1MB) - Council of European ...
Download full issue (PDF 2.1MB) - Council of European ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Information Technologies for Visually Impaired People<br />
a) b) c)<br />
Figure 3 : Structure <strong>of</strong> a Braille Character. a) Generator<br />
Symbol, b) Letter “a”, c) Number “1”.<br />
concepts <strong>of</strong> reality. Moreover, the system cleans up the introduced<br />
hand drawn predefined shapes, improving the final<br />
appearance <strong>of</strong> an edited document.<br />
From a scientific point <strong>of</strong> view both applications belong<br />
to the two main lines <strong>of</strong> the Document Analysis research<br />
field stated in the above paragraphs: text and graphics<br />
analysis. The aim <strong>of</strong> the translation <strong>of</strong> Braille texts is a<br />
particular case <strong>of</strong> OCR named OBR (Optical Braille Recognition)<br />
[5] and the interactive recognition <strong>of</strong> graphical<br />
shapes designed by means <strong>of</strong> a digital tablet, is a case <strong>of</strong><br />
graphical symbol recognition [3].<br />
In the rest <strong>of</strong> the paper we further explain both applications<br />
and their technological basis. Section 2 presents the<br />
Braille translator, Section 3 explains the application <strong>of</strong><br />
graphical shape recognition, and finally Section 4 is devoted<br />
to the conclusions.<br />
2 Translator from Braille to View<br />
In this section we present the framework wherein the<br />
application has been developed and some <strong>of</strong> the technological<br />
principles forming the basis <strong>of</strong> its construction.<br />
2.1Purpose and Application Framework<br />
Unlike the majority <strong>of</strong> other published systems which<br />
use papers made by a Braille printer, the system presented<br />
in this paper works with handwritten documents by means<br />
<strong>of</strong> a Perkins typewriter, see Figure 1. Nowadays, a visually<br />
impaired child who has not yet started to use a computer to<br />
create his/her documents, uses this kind <strong>of</strong> typewriter. When<br />
the child is integrated into a school where neither teachers<br />
nor students are able to understand Braille, someone has to<br />
translates these documents, in order for them to be shared<br />
or evaluated. This task is done by assistant teachers who go<br />
to the schools periodically. A simultaneous translation <strong>of</strong><br />
these papers allows the children to share them with their<br />
environment in a more dynamil way. Their teachers can<br />
correct their homework or exams in a more direct and agile<br />
way, and they can give their marks at the same time as the<br />
rest <strong>of</strong> the students. Their classmates can understand their<br />
homework and this facilitates the integration <strong>of</strong> the child<br />
with them. Finally their families can also follow up the<br />
progress <strong>of</strong> their children as they can understand their homework.<br />
The system presents another more generic advantage,<br />
the possibility to "photocopy" a Braille text. To duplicate a<br />
Braille text means that it has to be reprinted with its relief<br />
necessitating recognition <strong>of</strong> the original text and reprint it<br />
with relief. Nowadays this process can only be done by<br />
manually retyping the text, but with this system it is possible<br />
to scan, analyze and save the text in a digital format<br />
without retyping it.<br />
The system consists <strong>of</strong> a personal computer connected<br />
to a scanner. Directly from the application it is possible to<br />
scan a Braille document, recognize the Braille characters<br />
and translate them. The Braille notation has different interpretation<br />
styles. Thus it allows writing, apart from plain text,<br />
music, chemical symbols, mathematical formulae, etc. Each<br />
<strong>of</strong> these domains has its own symbols and placement rules,<br />
and some common symbols that have different meanings<br />
depending on the domain in which they appear. It allows a<br />
code based in six dots to have a high representational power<br />
(There is another codification based on eight dots which is<br />
not considered in this paper). However, since the analyzed<br />
documents are made by little children using a Perkins typewriter,<br />
the possibility exists <strong>of</strong> having some imperfections<br />
such as "deleted" points, i.e. points flattened that are not<br />
"read" by the fingers but usually visible in the scanned image,<br />
see Figure 2 b). It is also possible that a paper read (i.e.<br />
touched) several times, can have some stains on it. Sometimes<br />
the sheets are taken out <strong>of</strong> the typewriter to be read<br />
and then reintroduced to continue typing; this may result in<br />
some misalignments in the characters and in their separation<br />
with the rest <strong>of</strong> the lines. Finally the pressure exerted<br />
when the child is enthusiastically tapping the keys can create<br />
some little holes in the Braille relief points, these holes<br />
will appear as additional shadows in the image, see Figure<br />
2a). As a result there are papers with different kinds <strong>of</strong> imperfections:<br />
stains that can be misunderstood with real<br />
Braille points, "deleted" points that can appear as non-deleted<br />
points, double points produced by the holes in the<br />
paper, misalignment <strong>of</strong> lines and rows, etc.<br />
2.2OBR Technology<br />
An OCR application is s<strong>of</strong>tware that converts a digital<br />
image <strong>of</strong> text, acquired by a scanner or digital camera, to a<br />
text file that can be edited by a text editor [6] [7]. Nowadays<br />
there are a number <strong>of</strong> commercial OCR tools, sometimes<br />
distributed with scanners, which perform this function.<br />
OCR avoids the need to retype printed texts. From a<br />
technological point <strong>of</strong> view, an OCR system consists <strong>of</strong> three<br />
phases: low level process, segmentation/detection <strong>of</strong> the<br />
document structure and character recognition or classification.<br />
The low level process applies techniques <strong>of</strong> image<br />
processing to enhance digital images acquired from paper<br />
(using contrast enhancement, filters to delete shadows, stains<br />
and different kinds <strong>of</strong> noise etc.). The segmentation process<br />
separates and labels the components forming the physical<br />
structure <strong>of</strong> the document which are structured in a hierarchical<br />
way as: paragraphs, lines, words and characters.<br />
Finally, the recognition or classification step identifies, for<br />
each image zone containing a character, which character it<br />
is. The latter process compares a set <strong>of</strong> features extracted<br />
from the image [8] with the models or patterns previously<br />
56 UPGRADE Vol. VIII, No. 2, April 2007 © Novática