RR_03_02
RR_03_02
RR_03_02
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
assemblies to lower-level assemblies via 'bubbles' that<br />
are labeled. Each labeled bubble shows a single assembly.<br />
Accordingly, there is a tremendous need for layout<br />
recognizers that can accurately segment the drawings into<br />
their individual assemblies and automatically set up<br />
hyper-links among the assemblies of different levels.<br />
Figure 6: Parts pages require segmentation<br />
In [2], we discussed our research on recognizing the<br />
layout of parts pages in vector format. Applying<br />
geometric proximity algorithms we achieved accuracies<br />
on the order of 84% for grouping art into the proper<br />
segments. Incorporating heuristics regarding the location<br />
of detail labels, the recognition of leader lines and<br />
attaching text to those leader lines increased our layout<br />
recognition accuracy to -97%. Failures generally occur<br />
when parts of a detail are deliberately offset from the<br />
main body to show separation within the detail or when<br />
there is considerable space between the body of the detail<br />
and an orientation arrow (Figure 7). To achieve<br />
accuracies much closer to 100% will require recognition<br />
of the objects being presented, re-mapping the technical<br />
illustration to the real-world object.<br />
4. Raster vs. vector considerations<br />
Layout recognition research has predominantly<br />
focused on the understanding of the layout of raster<br />
3<br />
images. While some of the examples we have discussed<br />
do occur as<br />
Figure 7: Orientation arrows can cause<br />
segmentation errors<br />
raster drawings, most are vector images produced in<br />
modern illustration tools. Vector graphics are generally<br />
easier to analyze than raster - there is no need for optical<br />
character recognition, for example. However vector<br />
graphics present unique challenges that are not present in<br />
raster images and require a very different approach. For<br />
example, techniques involving histograms or ink density<br />
translate poorly, if at all, to vector diagrams. We need<br />
new recognition algorithms based on the rendering<br />
characteristics of vector graphics. This is a fertile area for<br />
new research.<br />
5. Summary<br />
Aerospace companies depend heavily on a wide<br />
variety of technical documentation in tremendous<br />
quantities. We have presented just a few of the types of<br />
layout recognitIOn challenges found in technical<br />
documentation. Solutions must be highly scalable and<br />
extremely accurate. We hope to inspire new research in<br />
this relatively unexplored area of document recognition.<br />
6. References<br />
[1]707,727-777 Standard Wiring Practices Manual, 06-<br />
54446, The Boeing Company<br />
[2] L.S . Baum, J.H.Boose, and R.I. Kelley, "Graphics<br />
Recognition for a Large-Scale Airplane Information<br />
System", Graphics Recognition, Algorithms and Systems,<br />
Second International Workshop, GREC '97, Springer,<br />
Nancy, France, August 1997, pp. 291-30 I.