Master Thesis - Fachbereich Informatik
Master Thesis - Fachbereich Informatik
Master Thesis - Fachbereich Informatik
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
138 APPENDIX A. PROFILE ANALYSIS IMPLEMENTATION DETAILS<br />
A.2. Profile Subsampling<br />
In many computer vision tasks it is common to perform a specific operation on lower<br />
resolution images than the input to increase computation speed. For example one could<br />
simply discard every second row or column two obtain an image of half size of the original<br />
image. However, to avoid a violation of the sampling theorem it is important to apply<br />
a low-pass filter operation on the data before. This mechanism can be used to generate<br />
pyramids of images at different resolutions or scales. Each layer in the pyramid has half<br />
the size of the layer above with the top layer corresponding to the original size. Before<br />
subsampling the data a Gaussian smoothing operation is performed to suppress higher<br />
frequencies. Thus, such pyramids are called Gaussian Pyramids in the literature [24].<br />
The same can be applied to one-dimensional signals such as gray level profiles. In<br />
this application, experiments have shown the information about the tube boundaries is<br />
conserved a coarser scale. Thus, a subsampled version two levels down the pyramid instead<br />
of the original profile is used in praxis. The data to be processed after this step is only<br />
a fourth of the input. Obviously, the profile analysis can be accelerated by this step.<br />
Experiments investigating whether the profile subsampling could replace step one in the<br />
profile analysis, i.e. the smoothing with a large mean kernel, came to the conclusion that<br />
in connection with transparent tubes and dark printing, the strong contrast of the letters<br />
could be misclassified as tube boundary. The system tries to detect the real tube location<br />
in a certain region around the wrong position and is likely to fail. The mean filter instead<br />
is able to reduce the influence of the lettering and must not be replaced.<br />
A.3. Scan Lines<br />
As mentioned in Section 4.4.2, the profile to be evaluated is based on the normalized sum<br />
of Nscan scan lines equally distributed over the global ROI. The reason why a single scan<br />
line is not sufficient is shown in Figure A.1(b). Three sample profiles at different heights<br />
(61, 80 and 100) are selected to visualize the influence of the printing. One can see the<br />
strong contrast at the letters as well as a poor contrast at the right tube boundary. Since<br />
it is non-deterministic of whether the printing of a particular tube is visible in an image,<br />
one has to consider the worst case. This is a scan line passing through the printing at as<br />
many positions possible. The global mean of the resulting profile is much lower in this case<br />
and it is possible that the intensity of the tube at regions outside the printing is wrongly<br />
classified as background. The result of this effect is shown in Figure A.1(d). On the other<br />
hand, the usage of several scan lines decreases the influence of the printing significantly.<br />
The probability that more than a few scan lines will pass through the printing is low.<br />
For example, among the sample tubes used for testing of the prototype, the coverage of<br />
the printing is about 16% with respect to the diameter. Thus, it is very likely to have<br />
more than one scan line passing through tube regions without printing. In total, the<br />
influence of the printing decreases with the number of scan lines. However, Figure A.1(c)<br />
shows 11 scan lines equally distributed over the global ROI in y-direction are sufficient<br />
to yield almost equal results as with considering all rows of the ROI. Here, the profile<br />
consisting of 11 scan lines is shifted, i.e. the intensity values are lower compared to the<br />
profile calculated from all ROI rows (90 in this example). This is due to the location of