15.01.2013 Views

U. Glaeser

U. Glaeser

U. Glaeser

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

The first hypothesis (the less plausible of the two) can be easily modeled with temporal linear filters. The<br />

second, more interesting behavior can be modeled with a simple comparator network.<br />

28.4 Image Sequence Representation<br />

What Does “Representation” Mean?<br />

The term “representation” may require some explanation. Perhaps the best way to do so is to consider<br />

some examples of familiar representations. For simplicity, 2-D examples will be used. Extension to 3-D<br />

is relatively straightforward.<br />

The Pixel Representation<br />

The pixel representation is so common and intuitive that it is usually considered to be “the image.” More<br />

precisely, however, it is a linear sum of weighted impulses:<br />

© 2002 by CRC Press LLC<br />

(28.25)<br />

where u(m, n) is the image, u(m′, n′) are the coefficients of the representation (numerically equal to the<br />

pixel values in this case), and the δ(m − m′, n − n′) play the role of basis functions.<br />

The DFT<br />

The next most familiar representations (at least to engineers) is the DFT, in which the image is expressed<br />

in terms of complex exponentials:<br />

where 0 ≤ m, n ≤ N − 1 and<br />

N−1 N−1<br />

∑ ∑<br />

u( m, n)<br />

= u( m′, n′ )δ( m– m′, n– n′ )<br />

u( m, n)<br />

m′=0 n′=0<br />

N−1 N−1<br />

1<br />

= ------ v( h, k)WN<br />

∑∑<br />

N 2<br />

h=0 k=0<br />

W N<br />

e j2π/N –<br />

(28.26)<br />

(28.27)<br />

In this case v(h, k) are the coefficients of the representation and the 2-D complex exponentials<br />

are the basis functions.<br />

The choice of one representation over the other (pixel vs. Fourier) for a given application depends on<br />

the image characteristics that are of most interest. The pixel representation makes the spatial organization<br />

of intensities in the image explicit. Because this is the basis of the visual stimulus, it seems more “natural.”<br />

The Fourier representation makes the composition of the image in terms of complex exponentials<br />

(“frequency components”) explicit. The two representations emphasize their respective characteristics<br />

(spatial vs. frequency) to the exclusion of all others. If a mixture of characteristics is desired, different<br />

representations must be used.<br />

Spatial/Spatial-Frequency Representations<br />

=<br />

– hm −kn<br />

WN<br />

hm<br />

WN – –<br />

kn<br />

WN<br />

A natural mixture is to combine frequency analysis with spatial location. An example of a 1-D representation<br />

of this type (a time/frequency representation) is a musical score. The need to know not only what the<br />

frequency content of a signal is, but where in the signal the frequency components exist is common to many<br />

signal, image, and image sequence processing tasks [7]. A variety of approaches [8,9] can be used to develop<br />

a representation to facilitate these tasks. The most intuitive approach is the finite-support Fourier<br />

transform.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!