08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

was included, we would just multiply by the index <strong>of</strong> summation.<br />

Convert the equation to matrix form by defining z kj = 1 √ n<br />

exp(− 2πi<br />

n ki j) and write<br />

f = Zx. Actually instead <strong>of</strong> x, write the vector consisting <strong>of</strong> the nonzero elements <strong>of</strong> x.<br />

By its definition, x ≠ 0. To prove the lemma we need to show that f is nonzero. This will<br />

be true provided Z is nonsingular. If we rescale Z by dividing each column by its leading<br />

entry we get the Vandermonde determinant which is nonsingular.<br />

Theorem 10.7 Let n x be the number <strong>of</strong> nonzero elements in x and let n f be the number<br />

<strong>of</strong> nonzero elements in the Fourier transform <strong>of</strong> x. Let n x divide n. Then n x n f ≥ n.<br />

Pro<strong>of</strong>: If x has n x nonzero elements, f cannot have a consecutive block <strong>of</strong> n x zeros. Since<br />

n x divides n there are n n x<br />

blocks each containing at least one nonzero element. Thus, the<br />

product <strong>of</strong> nonzero elements in x and f is at least n.<br />

Fourier transform <strong>of</strong> spikes prove that above bound is tight<br />

To show that the bound in Theorem 10.7 is tight we show that the Fourier transform<br />

<strong>of</strong> the sequence <strong>of</strong> length n consisting <strong>of</strong> √ n ones, each one separated by √ n − 1 zeros,<br />

is the sequence itself. For example, the Fourier transform <strong>of</strong> the sequence 100100100 is<br />

100100100. Thus, for this class <strong>of</strong> sequences, n x n f = n.<br />

Theorem 10.8 Let S ( √ n, √ n) be the sequence <strong>of</strong> 1’s and 0’s with √ n 1’s spaced √ n<br />

apart. The Fourier transform <strong>of</strong> S ( √ n, √ n) is itself.<br />

Pro<strong>of</strong>: Consider the columns 0, √ n, 2 √ n, . . . , ( √ n − 1) √ n. These are the columns for<br />

which S ( √ n, √ n) has value 1. The element <strong>of</strong> the matrix Z in the row j √ n <strong>of</strong> column<br />

k √ n, 0 ≤ k < √ n is z nkj = 1. Thus, for these rows Z times the vector S ( √ n, √ n) = √ n<br />

and the 1/ √ n normalization yields f j<br />

√ n = 1.<br />

For rows whose index is not <strong>of</strong> the form j √ n, the row b, b ≠ j √ n, j ∈ {0, √ n, . . . , √ n − 1},<br />

the elements in row b in the columns 0, √ n, 2 √ n, . . . , ( √ n − 1) √ n are 1, z b , z 2b , . . . , z (√ n−1)b<br />

)<br />

and thus f b = √ 1<br />

n<br />

(1 + z b + z 2b · · · + z (√ n−1)b<br />

= √ 1 z √ nb −1<br />

n<br />

= 0 since z b√n = 1 and z ≠ 1<br />

z−1 .<br />

Uniqueness <strong>of</strong> l 1 optimization<br />

Consider a redundant representation for a sequence. One such representation would be<br />

representing a sequence as the concatenation <strong>of</strong> two sequences, one specified by its coordinates<br />

and the other by its Fourier transform. Suppose some sequence could be represented<br />

as a sequence <strong>of</strong> coordinates and Fourier coefficients sparsely in two different ways. Then<br />

by subtraction, the zero sequence could be represented by a sparse sequence. The representation<br />

<strong>of</strong> the zero sequence cannot be solely coordinates or Fourier coefficients. If<br />

y is the coordinate sequence in the representation <strong>of</strong> the zero sequence, then the Fourier<br />

portion <strong>of</strong> the representation must represent −y. Thus y and its Fourier transform would<br />

have sparse representations contradicting n x n f ≥ n. Notice that a factor <strong>of</strong> two comes in<br />

341

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!