PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

3.2. Scale invariant detectors

where k is a constant multiplicative factor. This means that D(x, y, σ) is simply the difference of two neighboring discrete scale-space representations of the image I.

The scale-space for DOG detection is defined as follows. It consists of a pre-defined number of partitions, called octaves. Each new octave starts with a σ twice as large as the starting σ of the previous octave. Each octave is partitioned into s discrete scale-space representations, where s is an integer. With this condition the parameter k is defined as $k = 2^{1/s}$, so that σ doubles after s steps. For each octave the image I is re-sampled down to half the size of the previous image. Re-sampling is done by simply selecting every other pixel of the image, which is computationally efficient. Re-sampling each time σ doubles is consistent with scale-space theory. The difference of Gaussian function D(x, y, σ) is then produced by subtracting neighboring scale-space slices within each octave.

The next step after computing D(x, y, σ) is the detection of its local extrema, i.e. its local minima and maxima. Every pixel of the scale-space representation is checked for being an extremum of D(x, y, σ). If a pixel is an extremum, it is selected as a DOG-keypoint. If the extremum is located in one of the re-sampled octaves, its x and y coordinates have to be converted to the scale of the original image. The characteristic scale of the DOG-point is the σ of the scale-space slice on which the extremum was found. For extremum detection all 26 neighboring pixels in scale-space are examined. A pixel is a local maximum if its value is higher than the values of all its neighbors, and a local minimum if its value is smaller than the values of all its neighbors. The 26 neighbors are defined by an 8-connected neighborhood in scale-space.
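The scale-space bookkeeping described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used in the thesis: all function names, the nested-list layout `dog[slice][y][x]`, the starting value `sigma0` and the strict-inequality extremum test are choices made for the example. Gaussian smoothing itself is omitted, so the slices passed to `difference_of_gaussians` are assumed to be blurred already. The extremum test inspects the full 3x3x3 block around a pixel, which gives the 26 neighbors.

```python
def octave_sigmas(sigma0, s, octave):
    """Sigma of each of the s slices in one octave, using k = 2**(1/s)."""
    k = 2.0 ** (1.0 / s)
    start = sigma0 * 2.0 ** octave  # each octave starts at twice the previous sigma
    return [start * k ** i for i in range(s)]


def downsample(image):
    """Halve the image size by keeping every other pixel in both directions."""
    return [row[::2] for row in image[::2]]


def difference_of_gaussians(slices):
    """Subtract neighboring (already blurred) slices pixel by pixel."""
    return [[[hi_px - lo_px for lo_px, hi_px in zip(lo_row, hi_row)]
             for lo_row, hi_row in zip(lo, hi)]
            for lo, hi in zip(slices, slices[1:])]


def is_extremum(dog, i, y, x):
    """True if dog[i][y][x] is strictly above or below all 26 neighbors.

    The indices must be interior: one slice below and above, and a
    one-pixel border, have to exist around (i, y, x).
    """
    value = dog[i][y][x]
    neighbors = [dog[i + di][y + dy][x + dx]
                 for di in (-1, 0, 1)
                 for dy in (-1, 0, 1)
                 for dx in (-1, 0, 1)
                 if (di, dy, dx) != (0, 0, 0)]
    return value > max(neighbors) or value < min(neighbors)


def to_original_coords(x, y, octave):
    """Map pixel coordinates in a re-sampled octave back to the original image.

    Assumes each re-sampling step kept the pixels with even indices.
    """
    return x * 2 ** octave, y * 2 ** octave
```

With `s = 3` and an assumed `sigma0 = 1.6`, `octave_sigmas(1.6, 3, 1)` starts at 3.2, twice the starting value of octave 0, and a keypoint found at pixel (5, 7) in octave 2 maps back to (20, 28) in the original image.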
The 26 neighbors consist of the 8 neighbors in the same slice, 9 neighbors on the upper scale level and 9 neighbors on the lower scale level. Detection in this way yields keypoints with pixel accuracy only. In a step subsequent to detection, sub-pixel keypoint localization is performed. This step ensures that keypoints are located exactly on corners or edges. To gain sub-pixel accuracy a 3D quadratic function is fitted to the local scale-space region. The keypoint is finally localized at the interpolated maximum or minimum of the quadratic function (for more details see [13]).

However, not all detected extrema are suited to act as keypoints. Extrema with low contrast are not well suited. Scale-space extrema also tend to be located on edges, where they are not well localized along the edge itself. A final filtering step eliminates such ambiguous detections. Edge responses are eliminated by eigenvalue analysis of the Hessian matrix H at the keypoint location. The process is very similar to corner detection using the Hessian matrix. The ratio of the responses along the two principal directions is considered, and the keypoint is eliminated if one direction is significantly stronger than the other. The ratio is approximated by the ratio of the squared trace to the determinant. If

$$\frac{\operatorname{trace}(H)^2}{\det(H)} < \frac{(r+1)^2}{r} \qquad (3.15)$$

the location is accepted as a DOG-keypoint, where r = 10 is a reasonable value in many situations.

The necessary steps of the DOG-detector can be implemented very efficiently. The DOG-detector is therefore a candidate of choice for building a real-time system. Figure 3.6(a) shows examples of DOG-keypoints. Each keypoint is represented by its center point (yellow cross) and its characteristic scale, drawn as a circle around the center point.
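The two post-processing steps, the quadratic fit and the edge test of criterion 3.15, can be illustrated as follows. This is a hedged sketch rather than the procedure of [13] or of the thesis: the function names, the finite-difference derivatives on a 3x3x3 neighborhood `d[slice][y][x]`, the use of Cramer's rule, and the rejection of locations with a non-positive determinant are assumptions made for the example.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def subpixel_offset(d):
    """Offset (dx, dy, ds) of the extremum of a quadratic fitted to d.

    d is a 3x3x3 neighborhood d[slice][y][x] centered on the detected
    pixel d[1][1][1]. Returns None if the Hessian is singular.
    """
    c = d[1][1][1]
    # Gradient by central differences.
    gx = (d[1][1][2] - d[1][1][0]) / 2.0
    gy = (d[1][2][1] - d[1][0][1]) / 2.0
    gs = (d[2][1][1] - d[0][1][1]) / 2.0
    # Second derivatives by finite differences.
    hxx = d[1][1][2] - 2.0 * c + d[1][1][0]
    hyy = d[1][2][1] - 2.0 * c + d[1][0][1]
    hss = d[2][1][1] - 2.0 * c + d[0][1][1]
    hxy = (d[1][2][2] - d[1][2][0] - d[1][0][2] + d[1][0][0]) / 4.0
    hxs = (d[2][1][2] - d[2][1][0] - d[0][1][2] + d[0][1][0]) / 4.0
    hys = (d[2][2][1] - d[2][0][1] - d[0][2][1] + d[0][0][1]) / 4.0
    hessian = [[hxx, hxy, hxs], [hxy, hyy, hys], [hxs, hys, hss]]
    rhs = [-gx, -gy, -gs]
    det = det3(hessian)
    if abs(det) < 1e-12:
        return None
    # Solve hessian * offset = -gradient with Cramer's rule.
    offset = []
    for col in range(3):
        m = [row[:] for row in hessian]
        for r in range(3):
            m[r][col] = rhs[r]
        offset.append(det3(m) / det)
    return tuple(offset)


def passes_edge_test(hxx, hyy, hxy, r=10.0):
    """Criterion 3.15 on the 2x2 spatial Hessian: trace^2 / det < (r + 1)^2 / r."""
    trace = hxx + hyy
    det = hxx * hyy - hxy * hxy
    if det <= 0.0:
        # Assumption: curvatures of opposite sign are rejected outright.
        return False
    return trace * trace / det < (r + 1.0) ** 2 / r
```

For a blob-like location with `hxx = hyy = -2` and `hxy = 0` the ratio is 16 / 4 = 4, well below the threshold of 12.1 for r = 10, so the point is kept. For an edge-like location with `hxx = -20`, `hyy = -1` the ratio is 441 / 20 = 22.05 and the point is discarded.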
3.2.4 Salient region detector

The salient region detector was proposed by Kadir and Brady [52]. Like the other scale invariant detectors, it detects a location and a characteristic scale. The major difference lies in how the location is selected. The goal is to detect salient image regions. As a measure of saliency, Kadir and Brady propose the entropy of the gray-value distribution within an image region. The entropy H of an image region is defined by

$$H = -\sum_i p(d_i) \log_2 p(d_i) \qquad (3.16)$$

where p(d_i) is the probability of gray-value d_i in the image region. The values p(d_i) can be computed from the histogram of the image region. The histogram counts how often each gray-value occurs. The entropy can then be computed from the normalized histogram counts. The goal is to select regions which show a distributed histogram. A distributed histogram indicates a highly textured, and thus salient, region. A peaked histogram indicates low texture and many similar gray-values. Figure 3.5 depicts examples of peaked and distributed histograms.

Figure 3.5: Example of peaked and distributed histograms. The image patch corresponding to the peaked histogram shows low texture. The distributed histogram corresponds to a highly textured region.

Peaked and distributed histograms can be distinguished by their entropy value H. Distributed histograms show a larger entropy value than peaked histograms. To detect salient regions the entropy is computed for different window sizes. Different window sizes lead to different histograms. Consider an image with a homogeneous background showing a textured object. Computing the histogram for the object yields a distributed histogram. If the window size for the histogram is increased, the window contains more and more of the homogeneous background and the histogram changes from distributed to peaked.
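Equation 3.16 translates directly into code. The following minimal sketch computes the entropy of a list of gray-values from its normalized histogram. The function name `patch_entropy` and the flat-list input are assumptions made for the example.

```python
import math
from collections import Counter


def patch_entropy(gray_values):
    """Entropy (in bits) of the gray-value distribution of an image region."""
    counts = Counter(gray_values)  # histogram: occurrences of each gray-value
    n = len(gray_values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# A homogeneous patch has a single histogram peak, a textured patch a spread-out one.
peaked = [128] * 16
distributed = list(range(16))
```

Here `patch_entropy(peaked)` is zero, since only one gray-value occurs, while `patch_entropy(distributed)` is 4 bits, the maximum for 16 equally likely gray-values.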
Such changes indicate salient regions. More precisely, a peak in the function H(w) indicates a salient region. The window size w at the peak of H(w) can be seen as the characteristic scale of the salient region.

The algorithm can be summarized as follows. First, compute the entropy value for multiple window sizes at every pixel location. Then search for a peak in H(w) at every pixel location and select the locations which show such a peak. The selected locations can be stored as triplets ⟨x, y, s⟩, where x, y is the location in the image and s is the window size, i.e. the scale. Each triplet corresponds to an entropy value computed at location x, y with window size s. For lots of pixel
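The summarized algorithm can be sketched as a brute-force loop over pixels and window sizes. This is an illustration under stated assumptions, not the detector of Kadir and Brady [52]: it uses square windows clipped at the image border, takes the window size as a half-width, treats a strict interior maximum of H(w) as a peak, and all names are hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    """Entropy (in bits) of the gray-value histogram of `values`."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def window_values(image, x, y, half):
    """Gray-values in the square window of half-width `half` around (x, y)."""
    height, width = len(image), len(image[0])
    return [image[j][i]
            for j in range(max(0, y - half), min(height, y + half + 1))
            for i in range(max(0, x - half), min(width, x + half + 1))]


def salient_triplets(image, half_sizes):
    """Return (x, y, s, H) wherever H(w) has a strict peak at window size s."""
    found = []
    for y in range(len(image)):
        for x in range(len(image[0])):
            h = [entropy(window_values(image, x, y, s)) for s in half_sizes]
            for k in range(1, len(h) - 1):
                if h[k] > h[k - 1] and h[k] > h[k + 1]:
                    found.append((x, y, half_sizes[k], h[k]))
    return found


# Homogeneous 7x7 background with a textured 3x3 object in the center.
image = [[0] * 7 for _ in range(7)]
value = 1
for j in range(2, 5):
    for i in range(2, 5):
        image[j][i] = value
        value += 1
triplets = salient_triplets(image, [0, 1, 2])
```

At the center pixel (3, 3) the entropy rises from zero for a single pixel to log2(9) for the window that exactly covers the textured object, and drops again once the window takes in the homogeneous background, so the result contains a triplet with s = 1 at that location.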