PHD Thesis - Institute for Computer Graphics and Vision - Graz ...
PHD Thesis - Institute for Computer Graphics and Vision - Graz ...
PHD Thesis - Institute for Computer Graphics and Vision - Graz ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
6.1. Wide-baseline region matching 92<br />
Property 1 is achieved by a 2-step approach. In a first step, tentative correspondences are<br />
identified by nearest neighbor matching in feature space. The tentative matches however still<br />
contain a lot of outliers. In a second step the tentative matches are verified by area based<br />
matching, calculating the correlation over the whole interest region. This step ensures with<br />
maximal certainty the correctness of the match.<br />
To achieve property 2 matching patches get exactly registered onto each other, by an iterative<br />
registration procedure. Registration is per<strong>for</strong>med with sub-pixel accuracy which results in highly<br />
accurate point correspondences.<br />
Unlike other approaches this algorithm does not simple use the center point of a region<br />
match as final correspondence. Instead, within the matched <strong>and</strong> registered image regions, new<br />
point correspondences are detected. Each matched image region yields about 20-50 new point<br />
correspondences (property 3). The registration is done by computing the inter-image homography<br />
<strong>for</strong> each region which maps one region exactly onto the other. There<strong>for</strong>e the method is<br />
restricted to planar interest regions only. In fact, non-planar matches will be rejected by this<br />
method.<br />
6.1.1 Matching <strong>and</strong> registration<br />
Let us now have a close look at the details of the method. It is a 2-step approach consisting of<br />
generating tentative matches <strong>and</strong> verification (see Algorithm 2 <strong>for</strong> a compact description). First<br />
we will describe the generation of the tentative matches. Input is a wide-baseline image pair I<br />
<strong>and</strong> I ′ . In each of the images interest regions are detected. We denote the set of interest regions<br />
in I with L <strong>and</strong> in I ′ with L ′ . The method is not restricted to one special detector, every affine<br />
interest region detector (see [76] <strong>for</strong> examples) is possible. After detection a local affine frame<br />
(LAF) is computed <strong>for</strong> every region in L <strong>and</strong> L ′ . Next the interest regions are normalized using<br />
the LAF. Normalization tries to remove the perspective distortion of a viewpoint change <strong>and</strong> two<br />
corresponding regions will appear almost identical. Some normalization methods create multiple<br />
normalized images <strong>for</strong> a single interest region. The multiple appearances are simply added to<br />
the region set. For the set of normalized regions L <strong>and</strong> L ′ SIFT descriptors are extracted <strong>and</strong><br />
stored in D <strong>and</strong> D ′ . Each entry in D <strong>and</strong> D ′ is a vector of length 128 describing the appearance<br />
of a normalized patch using orientation histograms. Corresponding interest regions can now be<br />
found by nearest neighbor search in this 128-dimensional feature space. For efficient matching a<br />
KD-tree K is built with the feature vectors in D ′ . Corresponding interest regions <strong>for</strong> the entries<br />
in D are now found by querying the KD-tree. The corresponding region <strong>for</strong> D i is the closest<br />
feature in D ′ return by the KD-tree query. As distance metric the Euclidean distance is used.<br />
To avoid r<strong>and</strong>om matches a measure based on the ratio of the nearest to the second closest<br />
feature vector is used. A match is accepted if<br />
d 0<br />
d 1<br />
< d th , (6.1)<br />
where d 0 is the Euclidean distance between the query feature <strong>and</strong> the nearest neighbor. d 1 is the<br />
distance from the query feature to the second closest feature vector. d th is a user set threshold.<br />
According to [67] an appropriate threshold is 0.8. We denote correspondences detected in this<br />
way as tentative matches. T is the set of tentative matches with T i = (L i , L ′ j ) <strong>and</strong> is the<br />
prerequisite <strong>for</strong> the verification step. The tentative matches T are now verified by area based<br />
matching. Correspondence is checked by normalized cross-correlation. This procedure is quite<br />
slow, but it is applied to the set of tentative matches only, which is significantly smaller than<br />
the initial set of detected regions. The cross-correlation is calculated on a registered pair of