PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

More documents

Recommendations

Info

Chapter 6 Wide-baseline methods 1 This chapter deals with image matching and 3D reconstruction for wide-baseline scenarios. The first section provides a solution for the problem of detecting corresponding regions in images from far different viewpoints. The proposed method builds upon the detection of affine invariant interest regions. Projective transformations introduced by the wide baseline get reduced by affine normalization. In the proposed method the projective distortion subsequently gets completely removed until both image patches are registered. In the registered image patches point correspondences have simply the same pixel coordinates and by knowing the applied transformations the pixel coordinates in the original image frame can be computed. The second section describes a method to recover scene planes of arbitrary position and orientation from oriented images using homographies. Given at least 2 wide-baseline images a piece-wise planar 3D reconstruction can be computed. Furthermore the input images are segmented into planar image parts. Planar regions are reconstructed using only sparse, affineinvariant sets of corresponding seed regions. These regions are iteratively expanded and refined using plane-induced homographies. 3D reconstruction needs a calibrated setup, while the planar segmentation is possible for uncalibrated images too. 6.1 Wide-baseline region matching In the following the wide-baseline region matching method is described which is a key technique for the proposed map building and localization framework. The algorithm is a key ingredient of the plane segmentation and reconstruction method described in Section 6.2.1. The plane segmentation and reconstruction method is used to build piece-wise planar sub-maps (see Section 7.1). Another application of this algorithm is in linking sub-maps into a complete world map (see Section 7.1.5). It is also a key component of the global localization algorithm presented in section 7.2. The method has been designed in a way to exhibit the following properties: 1. Highly reliable matches, i.e. the algorithms produces a low number of outliers. 2. Exact point correspondences, i.e. with sub-pixel accuracy. 3. High number of point correspondences 1 Based on the publication: F. Fraundorfer, K. Schindler, and H. Bischof. Piecewise planar scene reconstruction from sparse correspondences. Image and Vision Computing, 24(4):395–406, 2006 [38] 91
6.1. Wide-baseline region matching 92 Property 1 is achieved by a 2-step approach. In a first step, tentative correspondences are identified by nearest neighbor matching in feature space. The tentative matches however still contain a lot of outliers. In a second step the tentative matches are verified by area based matching, calculating the correlation over the whole interest region. This step ensures with maximal certainty the correctness of the match. To achieve property 2 matching patches get exactly registered onto each other, by an iterative registration procedure. Registration is performed with sub-pixel accuracy which results in highly accurate point correspondences. Unlike other approaches this algorithm does not simple use the center point of a region match as final correspondence. Instead, within the matched and registered image regions, new point correspondences are detected. Each matched image region yields about 20-50 new point correspondences (property 3). The registration is done by computing the inter-image homography for each region which maps one region exactly onto the other. Therefore the method is restricted to planar interest regions only. In fact, non-planar matches will be rejected by this method. 6.1.1 Matching and registration Let us now have a close look at the details of the method. It is a 2-step approach consisting of generating tentative matches and verification (see Algorithm 2 for a compact description). First we will describe the generation of the tentative matches. Input is a wide-baseline image pair I and I ′ . In each of the images interest regions are detected. We denote the set of interest regions in I with L and in I ′ with L ′ . The method is not restricted to one special detector, every affine interest region detector (see [76] for examples) is possible. After detection a local affine frame (LAF) is computed for every region in L and L ′ . Next the interest regions are normalized using the LAF. Normalization tries to remove the perspective distortion of a viewpoint change and two corresponding regions will appear almost identical. Some normalization methods create multiple normalized images for a single interest region. The multiple appearances are simply added to the region set. For the set of normalized regions L and L ′ SIFT descriptors are extracted and stored in D and D ′ . Each entry in D and D ′ is a vector of length 128 describing the appearance of a normalized patch using orientation histograms. Corresponding interest regions can now be found by nearest neighbor search in this 128-dimensional feature space. For efficient matching a KD-tree K is built with the feature vectors in D ′ . Corresponding interest regions for the entries in D are now found by querying the KD-tree. The corresponding region for D i is the closest feature in D ′ return by the KD-tree query. As distance metric the Euclidean distance is used. To avoid random matches a measure based on the ratio of the nearest to the second closest feature vector is used. A match is accepted if d 0 d 1 < d th , (6.1) where d 0 is the Euclidean distance between the query feature and the nearest neighbor. d 1 is the distance from the query feature to the second closest feature vector. d th is a user set threshold. According to [67] an appropriate threshold is 0.8. We denote correspondences detected in this way as tentative matches. T is the set of tentative matches with T i = (L i , L ′ j ) and is the prerequisite for the verification step. The tentative matches T are now verified by area based matching. Correspondence is checked by normalized cross-correlation. This procedure is quite slow, but it is applied to the set of tentative matches only, which is significantly smaller than the initial set of detected regions. The cross-correlation is calculated on a registered pair of
Page 1 and 2:
Graz University of Technology Insti
Page 3 and 4:
Abstract Visual map building and lo
Page 5 and 6:
Contents 1 Introduction to mobile r
Page 7 and 8:
CONTENTS vi 7.1.5 Sub-map linking .
Page 9 and 10:
1.1. Localization and map building
Page 11 and 12:
1.3. What has already been achieved
Page 13 and 14:
1.5. How can it get solved? 6 fully
Page 15 and 16:
1.6. Contribution of this thesis 8
Page 17 and 18:
1.7. Structure of the thesis 10 com
Page 19 and 20:
2.2. Localization from point featur
Page 21 and 22:
2.2. Localization from point featur
Page 23 and 24:
2.4. Localization from plane featur
Page 25 and 26:
2.5. Summary 18 or not. Clearly thi
Page 27 and 28:
Chapter 3 Local detectors Research
Page 29 and 30:
3.1. Interest point detectors 22 fu
Page 31 and 32:
3.2. Scale invariant detectors 24 r
Page 33 and 34:
3.2. Scale invariant detectors 26 (
Page 35 and 36:
3.2. Scale invariant detectors 28 3
Page 37 and 38:
3.3. Affine invariant detectors 30
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48: 3.4. Comparison of the described me
Page 49 and 50: 3.4. Comparison of the described me
Page 51 and 52: 44 But using a plane to plane homog
Page 53 and 54: 4.2. Representation of the detectio
Page 55 and 56: 4.3. Detection correspondence 48 th
Page 57 and 58: 4.4. Point transfer using the trifo
Page 59 and 60: 4.5. Ground truth generation 52 usi
Page 61 and 62: 4.6. Experimental evaluation 54 Fig
Page 63 and 64: 4.6. Experimental evaluation 56 rep
Page 65 and 66: 4.6. Experimental evaluation 58 MSE
Page 67 and 68: 4.6. Experimental evaluation 60 mat
Page 69 and 70: 4.6. Experimental evaluation 62 vie
Page 71 and 72: 4.6. Experimental evaluation 64 vie
Page 73 and 74: 4.6. Experimental evaluation 66 rel
Page 75 and 76: Chapter 5 Maximally Stable Corner C
Page 77 and 78: 5.1. The MSCC detector 70 (a) (b) (
Page 79 and 80: 5.2. Region representation 72 400 (
Page 81 and 82: 5.3. Computational complexity 74 6.
Page 83 and 84: 5.5. Detection examples 76 paramete
Page 85 and 86: 5.5. Detection examples 78 Figure 5
Page 87 and 88: 5.6. Detector evaluation: Repeatabi
Page 89 and 90: 5.7. Combining MSCC with other loca
Page 97: 5.7. Combining MSCC with other loca
Page 101 and 102: 6.1. Wide-baseline region matching
Page 103 and 104: 6.2. Piece-wise planar scene recons
Page 117 and 118: Chapter 7 Living in a piecewise pla
Page 119 and 120: 7.1. Map building 112 s,R,t sub-map
Page 121 and 122: 7.1. Map building 114 x = (x 1 ...x
Page 123 and 124: 7.1. Map building 116 distance is u
Page 125 and 126: 7.1. Map building 118 normalization
Page 127 and 128: 7.2. Localization 120 where N = |D
Page 129 and 130: 7.2. Localization 122 registration
Page 131 and 132: 7.2. Localization 124 (a) (b) (c) F
Page 133 and 134: 7.2. Localization 126 3D structure
Page 135 and 136: 7.2. Localization 128 other landmar
Page 137 and 138: Chapter 8 Map building and localiza
Page 139 and 140: 8.2. Map building experiments 132 8
Page 141 and 142: 8.2. Map building experiments 134 D
Page 143 and 144: 8.2. Map building experiments 136 (
Page 145 and 146: 8.2. Map building experiments 138 F
Page 147 and 148: 8.2. Map building experiments 140 F
Page 149 and 150:
8.3. Localization experiments 142 8
Page 151 and 152:
8.3. Localization experiments 144 F
Page 153 and 154:
8.3. Localization experiments 146 F
Page 155 and 156:
8.3. Localization experiments 148 8
Page 157 and 158:
8.3. Localization experiments 150 (
Page 159 and 160:
Chapter 9 Conclusion More than 25 y
Page 161 and 162:
9.1. Future work 154 Map building a
Page 163 and 164:
9.1. Future work 156 information ca
Page 165 and 166:
A.1. Projective ellipse transfer 15
Page 167 and 168:
A.1. Projective ellipse transfer 16
Page 169 and 170:
A.2. Affine approximation of ellips
Page 171 and 172:
Appendix B The trifocal tensor and
Page 173 and 174:
Bibliography [1] S. Atiya and G. Ha
Page 175 and 176:
168 [31] F. Fraundorfer and H. Bisc
Page 177 and 178:
170 [61] U. Köthe. Edge and juncti
Page 179 and 180:
172 [92] F. Schaffalitzky and A. Zi
show all

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?