29.05.2013 Views

RR_03_02

RR_03_02

RR_03_02

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

assemblies to lower-level assemblies via 'bubbles' that<br />

are labeled. Each labeled bubble shows a single assembly.<br />

Accordingly, there is a tremendous need for layout<br />

recognizers that can accurately segment the drawings into<br />

their individual assemblies and automatically set up<br />

hyper-links among the assemblies of different levels.<br />

Figure 6: Parts pages require segmentation<br />

In [2], we discussed our research on recognizing the<br />

layout of parts pages in vector format. Applying<br />

geometric proximity algorithms we achieved accuracies<br />

on the order of 84% for grouping art into the proper<br />

segments. Incorporating heuristics regarding the location<br />

of detail labels, the recognition of leader lines and<br />

attaching text to those leader lines increased our layout<br />

recognition accuracy to -97%. Failures generally occur<br />

when parts of a detail are deliberately offset from the<br />

main body to show separation within the detail or when<br />

there is considerable space between the body of the detail<br />

and an orientation arrow (Figure 7). To achieve<br />

accuracies much closer to 100% will require recognition<br />

of the objects being presented, re-mapping the technical<br />

illustration to the real-world object.<br />

4. Raster vs. vector considerations<br />

Layout recognition research has predominantly<br />

focused on the understanding of the layout of raster<br />

3<br />

images. While some of the examples we have discussed<br />

do occur as<br />

Figure 7: Orientation arrows can cause<br />

segmentation errors<br />

raster drawings, most are vector images produced in<br />

modern illustration tools. Vector graphics are generally<br />

easier to analyze than raster - there is no need for optical<br />

character recognition, for example. However vector<br />

graphics present unique challenges that are not present in<br />

raster images and require a very different approach. For<br />

example, techniques involving histograms or ink density<br />

translate poorly, if at all, to vector diagrams. We need<br />

new recognition algorithms based on the rendering<br />

characteristics of vector graphics. This is a fertile area for<br />

new research.<br />

5. Summary<br />

Aerospace companies depend heavily on a wide<br />

variety of technical documentation in tremendous<br />

quantities. We have presented just a few of the types of<br />

layout recognitIOn challenges found in technical<br />

documentation. Solutions must be highly scalable and<br />

extremely accurate. We hope to inspire new research in<br />

this relatively unexplored area of document recognition.<br />

6. References<br />

[1]707,727-777 Standard Wiring Practices Manual, 06-<br />

54446, The Boeing Company<br />

[2] L.S . Baum, J.H.Boose, and R.I. Kelley, "Graphics<br />

Recognition for a Large-Scale Airplane Information<br />

System", Graphics Recognition, Algorithms and Systems,<br />

Second International Workshop, GREC '97, Springer,<br />

Nancy, France, August 1997, pp. 291-30 I.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!