18.07.2014 Views

Semantic Interpretation of Digital Aerial Images Utilizing ...

Semantic Interpretation of Digital Aerial Images Utilizing ...

Semantic Interpretation of Digital Aerial Images Utilizing ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

84 Chapter 4. From 3D to the Fusion <strong>of</strong> Redundant Pixel Observations<br />

to the ground plane (we simply set the height coordinate to a fixed value). Hence, a collection<br />

<strong>of</strong> multiple sampled points results in a highly redundant pool <strong>of</strong> image candidates<br />

for the different modalities. In Figure 4.3, the corresponding image patches, transformed<br />

to a common (orthographic) view, are given for color, height and semantic classification,<br />

where we only display the most dominant object class. Due to missing data in the individual<br />

height fields (e.g.caused by non-stationary objects or occlusions) the initial alignment<br />

causes undefined areas, artifacts or outliers in the novel view.<br />

In the following various regularized approaches, individually adapted to the considered<br />

modality, are introduced to improve the final result with respect to these problems.<br />

4.3 Fusion <strong>of</strong> Redundant Intensity Information<br />

This section highlights our fusion model, that allows us to efficiently fuse redundant color<br />

and height field observations into a single, high-quality result. Note that an integration<br />

<strong>of</strong> multiple images at a city-scale demands fast and sophisticated methods, we thus make<br />

use <strong>of</strong> energy minimization techniques defined in the continuous domain. These methods<br />

provide a high parallelization capability and obtain in general a globally optimal solution.<br />

The basic concept <strong>of</strong> energy minimization methods is to formulate the solution <strong>of</strong> a given<br />

problem as an optimum <strong>of</strong> an functional, by taking into account <strong>of</strong> both the distance<br />

<strong>of</strong> the solution to the observed data and the smoothness itself. First, we briefly discuss<br />

related work. Then, we derive our fusion model, which is based on TV-L 1 denoising<br />

model [Nikolova, 2004], capable to handle multiple input observations. In order to exploit<br />

the particular characteristics <strong>of</strong> the involved modalities we individually adapt the fusion<br />

model for color and height information. In case <strong>of</strong> color we replace the TV-norm by a<br />

wavelet-based regularization, capable to preserve fine image structures and produces a<br />

natural fusion result.<br />

4.3.1 Background<br />

The challenging task <strong>of</strong> reconstructing an original image from given (noisy) observations<br />

is known to be an inverse ill-posed problem. In order to solve such optimization<br />

problems, additional assumptions, such as the smoothness <strong>of</strong> the solution, have to be considered.<br />

The optimization problem can be generally formulated as the minimization <strong>of</strong><br />

the functional<br />

min<br />

u<br />

{ R(u) + λ D(u, f) } , (4.1)<br />

where u is the sought solution, the function R(u) denotes the regularization, which forces<br />

a smooth solution. Depending on the problem R(u) can be defined for different types

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!