PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

More documents

Recommendations

Info

7.1. Map building 111 point landmark. In fact, already a single plane match allows pose estimation while this is not possible with a single 3D point landmark. Additional geometric constraints: Landmarks which are located on one and the same 3D plane are connected by geometric constraints. Plane projective relations are much more restrictive than general projective relations. A planar homography can be used very efficiently to verify feature matches geometrically. Feature reduction: By selecting only landmarks located on 3D planes the number of stored features in the map is reduced significantly. The map uses less memory and the computation time for feature matching of course depends on the number of features. It also increases the robustness and reliability. Non-planar features may change in appearance more significantly than planar features under viewpoint changes. Such landmarks are the reason for ambiguities in feature matching, and mis-matches will occur more frequently which cause problems in pose estimation. Easier matching of planar landmarks: State-of-the-art wide-baseline methods assume that landmarks undergo a planar projective transformation under viewpoint change. Approximating the projective transformation by an affine transformation to create viewpoint normalized descriptors are the currently most advanced matching methods. Landmarks located on 3D corners strongly violate the just mentioned assumption. Such features would cause troubles for matching algorithms and should not be stored as landmarks in the map. Increased accuracy: The accuracy of the 3D reconstruction can be increased with plane information. 3D point reconstructions are coupled by geometric constraints and the 3D coordinates can be optimized to be arranged exactly as a plane. In the following a batch method for map building is presented. Input for the method is an image sequence acquired from a mobile robot equipped with a single perspective camera. The camera needs to be calibrated beforehand. Structure-from-motion algorithms and wide-baseline stereo methods are applied to build the piece-wise planar world representation. The created map can then be used for purely vision based global localization. A mobile robot equipped with a single perspective camera can estimate its pose in respect to the world map from a single camera image. The localization approach to be presented is in analogy to [56] as it computes the robot pose from 3D-2D point correspondences. The novelty is the use of small planar patches as 3D landmarks and that the pose can be computed from a single landmark correspondence. This allows to do localization under extreme conditions, where other methods which require usually a high number of correspondences would normally fail. The novel localization approach is presented in the second part of this chapter. 7.1 Map building The world is represented as a network of linked metric sub-maps (see Figure 7.1 for illustration). Each sub-map has its own local coordinate system and each link between two sub-maps represents a rigid transformation (containing rotation, translation and scaling) connecting both local coordinate system. Thus it is possible to express a position within a specific sub-map from each local coordinate system. Furthermore each sub-map contains the transformation into
7.1. Map building 112 s,R,t sub-map s,R,t sub-map s,R,t s,R,t world coordinate system s,R,t sub-map Figure 7.1: The world is represented as a network of linked metric sub-maps. one global world coordinate system yielding one big metric world representation. Robot localization is done in the scope of a single sub-map. The pose is initially expressed within the local coordinate system but can be transferred into the global world coordinate system with the corresponding transformation. A single sub-map is created by 3D reconstruction from a shortbaseline image pair. The links between the sub-maps are established via wide-baseline feature matching. Map building is treated as an off-line process. Images are acquired by one or multiple robots (either controlled manually or using additional sensors, e.g. a laser range finder). From this unordered pile of images the environment map is constructed within three steps. In a first step the image pile is partitioned into smaller piles containing similar images which will correspond to sub-maps. Next, single sub-maps are created using two images of each smaller pile. In a last step the individual sub-maps are linked to form the complete world representation. The such created map can now be used on a mobile robot only equipped with a single camera for global localization within the mapped environment. In the following the three steps are outlined in detail. Global localization within the proposed map is dealt with subsequently. 7.1.1 Sub-map identification Starting point is a large set of images I 1 ...I n taken at a high frame rate. We assume that the ordering of the images is not known, i.e. that we do not know which images are subsequent to others. The task of this step is to partition the whole set, into sub-sets C 1 ...C c containing images with a short-baseline variation only. Each partition will than act as a sub-map. The partitioning is done by means of clustering. A global similarity criteria is used to group visually similar images into clusters. The requirement for the images in each partition is that it is possible
Page 1 and 2:
Graz University of Technology Insti
Page 3 and 4:
Abstract Visual map building and lo
Page 5 and 6:
Contents 1 Introduction to mobile r
Page 7 and 8:
CONTENTS vi 7.1.5 Sub-map linking .
Page 9 and 10:
1.1. Localization and map building
Page 11 and 12:
1.3. What has already been achieved
Page 13 and 14:
1.5. How can it get solved? 6 fully
Page 15 and 16:
1.6. Contribution of this thesis 8
Page 17 and 18:
1.7. Structure of the thesis 10 com
Page 19 and 20:
2.2. Localization from point featur
Page 21 and 22:
2.2. Localization from point featur
Page 23 and 24:
2.4. Localization from plane featur
Page 25 and 26:
2.5. Summary 18 or not. Clearly thi
Page 27 and 28:
Chapter 3 Local detectors Research
Page 29 and 30:
3.1. Interest point detectors 22 fu
Page 31 and 32:
3.2. Scale invariant detectors 24 r
Page 33 and 34:
3.2. Scale invariant detectors 26 (
Page 35 and 36:
3.2. Scale invariant detectors 28 3
Page 37 and 38:
3.3. Affine invariant detectors 30
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48:
3.4. Comparison of the described me
Page 49 and 50:
3.4. Comparison of the described me
Page 51 and 52:
44 But using a plane to plane homog
Page 53 and 54:
4.2. Representation of the detectio
Page 55 and 56:
4.3. Detection correspondence 48 th
Page 57 and 58:
4.4. Point transfer using the trifo
Page 59 and 60:
4.5. Ground truth generation 52 usi
Page 61 and 62:
4.6. Experimental evaluation 54 Fig
Page 63 and 64:
4.6. Experimental evaluation 56 rep
Page 65 and 66:
4.6. Experimental evaluation 58 MSE
Page 67 and 68: 4.6. Experimental evaluation 60 mat
Page 69 and 70: 4.6. Experimental evaluation 62 vie
Page 71 and 72: 4.6. Experimental evaluation 64 vie
Page 73 and 74: 4.6. Experimental evaluation 66 rel
Page 75 and 76: Chapter 5 Maximally Stable Corner C
Page 77 and 78: 5.1. The MSCC detector 70 (a) (b) (
Page 79 and 80: 5.2. Region representation 72 400 (
Page 81 and 82: 5.3. Computational complexity 74 6.
Page 83 and 84: 5.5. Detection examples 76 paramete
Page 85 and 86: 5.5. Detection examples 78 Figure 5
Page 87 and 88: 5.6. Detector evaluation: Repeatabi
Page 89 and 90: 5.7. Combining MSCC with other loca
Page 99 and 100: 6.1. Wide-baseline region matching
Page 101 and 102: 6.1. Wide-baseline region matching
Page 103 and 104: 6.2. Piece-wise planar scene recons
Page 117: Chapter 7 Living in a piecewise pla
Page 121 and 122: 7.1. Map building 114 x = (x 1 ...x
Page 123 and 124: 7.1. Map building 116 distance is u
Page 125 and 126: 7.1. Map building 118 normalization
Page 127 and 128: 7.2. Localization 120 where N = |D
Page 129 and 130: 7.2. Localization 122 registration
Page 131 and 132: 7.2. Localization 124 (a) (b) (c) F
Page 133 and 134: 7.2. Localization 126 3D structure
Page 135 and 136: 7.2. Localization 128 other landmar
Page 137 and 138: Chapter 8 Map building and localiza
Page 139 and 140: 8.2. Map building experiments 132 8
Page 141 and 142: 8.2. Map building experiments 134 D
Page 143 and 144: 8.2. Map building experiments 136 (
Page 145 and 146: 8.2. Map building experiments 138 F
Page 147 and 148: 8.2. Map building experiments 140 F
Page 149 and 150: 8.3. Localization experiments 142 8
Page 151 and 152: 8.3. Localization experiments 144 F
Page 153 and 154: 8.3. Localization experiments 146 F
Page 155 and 156: 8.3. Localization experiments 148 8
Page 157 and 158: 8.3. Localization experiments 150 (
Page 159 and 160: Chapter 9 Conclusion More than 25 y
Page 161 and 162: 9.1. Future work 154 Map building a
Page 163 and 164: 9.1. Future work 156 information ca
Page 165 and 166: A.1. Projective ellipse transfer 15
Page 167 and 168: A.1. Projective ellipse transfer 16
Page 169 and 170:
A.2. Affine approximation of ellips
Page 171 and 172:
Appendix B The trifocal tensor and
Page 173 and 174:
Bibliography [1] S. Atiya and G. Ha
Page 175 and 176:
168 [31] F. Fraundorfer and H. Bisc
Page 177 and 178:
170 [61] U. Köthe. Edge and juncti
Page 179 and 180:
172 [92] F. Schaffalitzky and A. Zi
show all

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

Create successful ePaper yourself

Delete template?

Save as template?