PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

More documents

Recommendations

Info

2.5. Summary 19 compromise the localization algorithm. Any new method which provides a more reliable landmark matching will therefore increase the overall localization performance. Localization despite large occlusions: Occlusions of the robots view will occur frequently if the robot is operating in a crowded environment. The view to landmarks will therefore be quite often limited. Localization algorithms should therefore be capable of computing an accurate pose from only a minimal number of detected landmark matches. The methods described in [56, 96] require about 10-20 landmark matches for a reliable pose estimate, quite a high number to met in a crowded and heavy occluded environment. Automatic map interpretation: Automatic map interpretation is a necessity to allow mobile robots to interact autonomously with the world and to carry out more complex tasks than vacuum cleaning. Nowadays systems can already get confused by a simple door. Assume that the mobile robot maps a room with an open door. In the map this will be reflected as an opening to traverse. Imagine that the other day the robot is heading towards the door and finds it closed. A simple localization algorithm will believe in a false position estimate. If however the robot knows about the functionality it can reason that the door has an open and a closed state and thus gets not confused. Well working service robot need to know even more about the environment. They need to know the names of the objects, functionalities of the objects, which objects are moveable, etc. Clearly this goes hand in hand with research in object recognition, but it should be thought about how the world representation of a mobile robot can support achieving this goal. Authors World map Sensor system Map features Landmark matching Map building Global localization (#landmarks ∗ ) Pose representation Se, Lowe, Little [96] sparse metric stereo 3D points + SIFT feature matching SLAM tri-angulation, map-alignment (>= 10) 2D (3DOF) Karlsson et al. [56] sparse metric monocular 3D points + SIFT + appearance feature matching SLAM 3D-2D (>= 4) 2D (3DOF) Davison et al. [21] sparse metric active stereo 3D points correlation SLAM tri-angulation (>= 3) 3D (6DOF) Bosse et al. [10] sparse metric omnidirectional 3D points + 3D lines + vanishing points nearest neighbor batch map matching (approx.30) 3D (6DOF) Goedeme et al. [39] topological monocular, omnidirectional 2D lines + color descriptor + intensity descriptor feature matching batch line matching and voting topological location Kosaka et al. [59] sparse metric, CAD-model monocular 3D lines nearest neighbor manual - 2D (3DOF) Hayet et al. [45] sparse metric monocular quadrangular 3D planes + PCA descriptor feature matching batch 3D-2D (1) 3D (6DOF) Table 2.1: Main characteristics of the revised literature approaches. ( ∗ necessary landmark matches for robust pose estimation)
Chapter 3 Local detectors Research on local detectors can be dated back to 1977 when Hans Moravec has described an interest operator which is today known as the Moravec operator [79]. In [80] Hans Moravec described obstacle avoidance and navigation for a mobile robot. He was using his interest operator to detect interest points in stereo image pairs and images from different viewpoints to use them as features to build a 3D map of the environment. Feature matching was achieved with correlation of 6 × 6 pixel image patches around the detected feature locations. The Moravec operator is based on the auto-correlation function, that is measuring the gray-level difference between a window and a shifted window in four directions. Calculating the sum of squared distances in the window gives a measure for every shift. The values are high, if the graylevel variance is high (textured) and low if there is low gray-level variance (e.g. homogeneous region). If the measures for every direction are high, the pixel location is a good candidate for an interest point. The smallest measure is then used as a quality measure for the interest point. In most cases the detected locations lie on edges and corners. For such cases a little shift already causes a difference. However, an obvious deficiency is the anisotropic behavior because there is only a discrete set of shifts. This basic idea was carried on leading to the well known Harris corner detector [40]. The idea got re-formulated using the structure tensor [9] and the second moment matrix respectively, leading to different variants of corner detectors [30, 61, 91, 107]. Other approaches [7, 57] use the second derivatives (Hessian matrix [115]) instead of the first derivatives. All these approaches can be considered belonging to one class of simple interest point detectors. They all have in common to detect a location only. That means that for a subsequent task like image matching via cross-correlation the size of the necessary matching window has to be chosen independently. This limitation shows up if dealing with images which show scale change. Although the detector might be able to detect the corresponding location, the correlation window will not contain the same gray-values and the matching will fail. This limitation was addressed by estimating a proper scale for every detected interest point. With this information the scale of the matching window can be normalized and cross-correlation would again work. The first work going into this direction was done by Tony Lindeberg [64] in 1998. Other approaches followed shortly by David Lowe [66] or Krystian Mikolajczyk [72]. This class of interest operators is usually called scale-invariant interest operators. However, research again went one step further. According to the success of interest operators which are invariant to scale change methods were sought to create interest operators invariant to a larger class of image transformations. This was driven mostly by developments in wide-baseline image matching where significant perspective distortions occur. Research therein led to a new class of interest detectors, affine-invariant detectors. In most cases such a detection consists of a 20
Page 1 and 2: Graz University of Technology Insti
Page 3 and 4: Abstract Visual map building and lo
Page 5 and 6: Contents 1 Introduction to mobile r
Page 7 and 8: CONTENTS vi 7.1.5 Sub-map linking .
Page 9 and 10: 1.1. Localization and map building
Page 11 and 12: 1.3. What has already been achieved
Page 13 and 14: 1.5. How can it get solved? 6 fully
Page 15 and 16: 1.6. Contribution of this thesis 8
Page 17 and 18: 1.7. Structure of the thesis 10 com
Page 19 and 20: 2.2. Localization from point featur
Page 21 and 22: 2.2. Localization from point featur
Page 23 and 24: 2.4. Localization from plane featur
Page 25: 2.5. Summary 18 or not. Clearly thi
Page 29 and 30: 3.1. Interest point detectors 22 fu
Page 31 and 32: 3.2. Scale invariant detectors 24 r
Page 33 and 34: 3.2. Scale invariant detectors 26 (
Page 35 and 36: 3.2. Scale invariant detectors 28 3
Page 37 and 38: 3.3. Affine invariant detectors 30
Page 47 and 48: 3.4. Comparison of the described me
Page 49 and 50: 3.4. Comparison of the described me
Page 51 and 52: 44 But using a plane to plane homog
Page 53 and 54: 4.2. Representation of the detectio
Page 55 and 56: 4.3. Detection correspondence 48 th
Page 57 and 58: 4.4. Point transfer using the trifo
Page 59 and 60: 4.5. Ground truth generation 52 usi
Page 61 and 62: 4.6. Experimental evaluation 54 Fig
Page 63 and 64: 4.6. Experimental evaluation 56 rep
Page 65 and 66: 4.6. Experimental evaluation 58 MSE
Page 67 and 68: 4.6. Experimental evaluation 60 mat
Page 69 and 70: 4.6. Experimental evaluation 62 vie
Page 71 and 72: 4.6. Experimental evaluation 64 vie
Page 73 and 74: 4.6. Experimental evaluation 66 rel
Page 75 and 76: Chapter 5 Maximally Stable Corner C
Page 77 and 78:
5.1. The MSCC detector 70 (a) (b) (
Page 79 and 80:
5.2. Region representation 72 400 (
Page 81 and 82:
5.3. Computational complexity 74 6.
Page 83 and 84:
5.5. Detection examples 76 paramete
Page 85 and 86:
5.5. Detection examples 78 Figure 5
Page 87 and 88:
5.6. Detector evaluation: Repeatabi
Page 89 and 90:
5.7. Combining MSCC with other loca
Page 91 and 92:
Page 93 and 94:
Page 95 and 96:
Page 97 and 98:
Page 99 and 100:
6.1. Wide-baseline region matching
Page 101 and 102:
6.1. Wide-baseline region matching
Page 103 and 104:
6.2. Piece-wise planar scene recons
Page 105 and 106:
Page 107 and 108:
Page 109 and 110:
Page 111 and 112:
Page 113 and 114:
Page 115 and 116:
Page 117 and 118:
Chapter 7 Living in a piecewise pla
Page 119 and 120:
7.1. Map building 112 s,R,t sub-map
Page 121 and 122:
7.1. Map building 114 x = (x 1 ...x
Page 123 and 124:
7.1. Map building 116 distance is u
Page 125 and 126:
7.1. Map building 118 normalization
Page 127 and 128:
7.2. Localization 120 where N = |D
Page 129 and 130:
7.2. Localization 122 registration
Page 131 and 132:
7.2. Localization 124 (a) (b) (c) F
Page 133 and 134:
7.2. Localization 126 3D structure
Page 135 and 136:
7.2. Localization 128 other landmar
Page 137 and 138:
Chapter 8 Map building and localiza
Page 139 and 140:
8.2. Map building experiments 132 8
Page 141 and 142:
8.2. Map building experiments 134 D
Page 143 and 144:
8.2. Map building experiments 136 (
Page 145 and 146:
8.2. Map building experiments 138 F
Page 147 and 148:
8.2. Map building experiments 140 F
Page 149 and 150:
8.3. Localization experiments 142 8
Page 151 and 152:
8.3. Localization experiments 144 F
Page 153 and 154:
8.3. Localization experiments 146 F
Page 155 and 156:
8.3. Localization experiments 148 8
Page 157 and 158:
8.3. Localization experiments 150 (
Page 159 and 160:
Chapter 9 Conclusion More than 25 y
Page 161 and 162:
9.1. Future work 154 Map building a
Page 163 and 164:
9.1. Future work 156 information ca
Page 165 and 166:
A.1. Projective ellipse transfer 15
Page 167 and 168:
A.1. Projective ellipse transfer 16
Page 169 and 170:
A.2. Affine approximation of ellips
Page 171 and 172:
Appendix B The trifocal tensor and
Page 173 and 174:
Bibliography [1] S. Atiya and G. Ha
Page 175 and 176:
168 [31] F. Fraundorfer and H. Bisc
Page 177 and 178:
170 [61] U. Köthe. Edge and juncti
Page 179 and 180:
172 [92] F. Schaffalitzky and A. Zi
show all

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?