18.07.2014 Views

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

PHD Thesis - Institute for Computer Graphics and Vision - Graz ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.1. Wide-baseline region matching 92<br />

Property 1 is achieved by a 2-step approach. In a first step, tentative correspondences are<br />

identified by nearest neighbor matching in feature space. The tentative matches however still<br />

contain a lot of outliers. In a second step the tentative matches are verified by area based<br />

matching, calculating the correlation over the whole interest region. This step ensures with<br />

maximal certainty the correctness of the match.<br />

To achieve property 2 matching patches get exactly registered onto each other, by an iterative<br />

registration procedure. Registration is per<strong>for</strong>med with sub-pixel accuracy which results in highly<br />

accurate point correspondences.<br />

Unlike other approaches this algorithm does not simple use the center point of a region<br />

match as final correspondence. Instead, within the matched <strong>and</strong> registered image regions, new<br />

point correspondences are detected. Each matched image region yields about 20-50 new point<br />

correspondences (property 3). The registration is done by computing the inter-image homography<br />

<strong>for</strong> each region which maps one region exactly onto the other. There<strong>for</strong>e the method is<br />

restricted to planar interest regions only. In fact, non-planar matches will be rejected by this<br />

method.<br />

6.1.1 Matching <strong>and</strong> registration<br />

Let us now have a close look at the details of the method. It is a 2-step approach consisting of<br />

generating tentative matches <strong>and</strong> verification (see Algorithm 2 <strong>for</strong> a compact description). First<br />

we will describe the generation of the tentative matches. Input is a wide-baseline image pair I<br />

<strong>and</strong> I ′ . In each of the images interest regions are detected. We denote the set of interest regions<br />

in I with L <strong>and</strong> in I ′ with L ′ . The method is not restricted to one special detector, every affine<br />

interest region detector (see [76] <strong>for</strong> examples) is possible. After detection a local affine frame<br />

(LAF) is computed <strong>for</strong> every region in L <strong>and</strong> L ′ . Next the interest regions are normalized using<br />

the LAF. Normalization tries to remove the perspective distortion of a viewpoint change <strong>and</strong> two<br />

corresponding regions will appear almost identical. Some normalization methods create multiple<br />

normalized images <strong>for</strong> a single interest region. The multiple appearances are simply added to<br />

the region set. For the set of normalized regions L <strong>and</strong> L ′ SIFT descriptors are extracted <strong>and</strong><br />

stored in D <strong>and</strong> D ′ . Each entry in D <strong>and</strong> D ′ is a vector of length 128 describing the appearance<br />

of a normalized patch using orientation histograms. Corresponding interest regions can now be<br />

found by nearest neighbor search in this 128-dimensional feature space. For efficient matching a<br />

KD-tree K is built with the feature vectors in D ′ . Corresponding interest regions <strong>for</strong> the entries<br />

in D are now found by querying the KD-tree. The corresponding region <strong>for</strong> D i is the closest<br />

feature in D ′ return by the KD-tree query. As distance metric the Euclidean distance is used.<br />

To avoid r<strong>and</strong>om matches a measure based on the ratio of the nearest to the second closest<br />

feature vector is used. A match is accepted if<br />

d 0<br />

d 1<br />

< d th , (6.1)<br />

where d 0 is the Euclidean distance between the query feature <strong>and</strong> the nearest neighbor. d 1 is the<br />

distance from the query feature to the second closest feature vector. d th is a user set threshold.<br />

According to [67] an appropriate threshold is 0.8. We denote correspondences detected in this<br />

way as tentative matches. T is the set of tentative matches with T i = (L i , L ′ j ) <strong>and</strong> is the<br />

prerequisite <strong>for</strong> the verification step. The tentative matches T are now verified by area based<br />

matching. Correspondence is checked by normalized cross-correlation. This procedure is quite<br />

slow, but it is applied to the set of tentative matches only, which is significantly smaller than<br />

the initial set of detected regions. The cross-correlation is calculated on a registered pair of

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!