Scalar and Vector Quantization

National Chiao Tung University 

Chun-Jen Tsai 

4/26/2012

Basic Concept of Quantization 

Quantization is the process of representing a large, 

possibly infinite, set of values with a smaller set 

Example: real-to-integer conversion 

Source: real numbers in the range [–10.0, 10.0] 

Quantizer: Q(x) = ⎣x + 0.5⎦ 

[–10.0, 10.0] → { –10, –9, …, –1, 0, 1, 2, …, 9, 10}

The set of inputs and outputs of a quantizer can be 

scalars (scalar quantizer) or vectors (vector quantizer) 
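The real-to-integer example above can be tried directly. The following is a minimal sketch, assuming the quantizer is exactly Q(x) = ⌊x + 0.5⌋; the function name `quantize` and the sample values are illustrative only.

```python
import math

def quantize(x: float) -> int:
    """Round-to-nearest quantizer Q(x) = floor(x + 0.5)."""
    return math.floor(x + 0.5)

# A few inputs from [-10.0, 10.0] (illustrative values).
samples = [-10.0, -3.7, -0.5, 0.49, 2.5, 9.99]
print([quantize(x) for x in samples])
```

Every input in [–10.0, 10.0] lands on one of the 21 integers from –10 to 10, so an infinite set of inputs is represented by a finite set of outputs.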


The Quantization Problem 

Encoder mapping 

 

 

 

Map a range of values to a codeword 

Irreversible mapping 

If source is analog → A/D converter 

Decoder mapping 

 

 

 

Map the codeword to a (fixed) value representing the range 

Knowledge of the source distribution can help us pick a 

better value representing each range 

If output is analog → D/A converter 

Informally, the encoder mapping is called the 

quantization process, and the decoder mapping is 

called the inverse quantization process 


Quantization Examples 

3-bit Quantizer 

 

Encoder (A/D) Decoder (D/A) 

Digitizing a sine wave 


Quantization Function 

A quantizer describes the relation between the 

encoder input values and the decoder output values 

 

Example of a quantization function: 


Quantization Problem Formulation 

Input: 

 

 

Output: 

 

 

X – random variable

$f_X(x)$ – probability density function (pdf)

$\{b_i\}$, i = 0..M: decision boundaries

$\{y_i\}$, i = 1..M: reconstruction levels

Discrete processes are often approximated by 

continuous distributions 

 

 

E.g.: Laplacian model of pixel differences 

If source is unbounded, then first/last decision 

boundaries = ±∞ (they are often called “saturation” values) 


Quantization Error 

If the quantization operation is denoted by Q(·), then

$Q(x) = y_i$ iff $b_{i-1} < x \le b_i$.

The mean squared quantization error (MSQE) is then

$$\sigma_q^2 = \int_{-\infty}^{\infty} (x - Q(x))^2 f_X(x)\,dx = \sum_{i=1}^{M} \int_{b_{i-1}}^{b_i} (x - y_i)^2 f_X(x)\,dx.$$

Quantization error is also called quantization noise or

quantizer distortion, e.g., additive noise model:

Quantizer input + Quantization noise = Quantizer output


Quantized Bitrate with FLC 

If the number of quantizer outputs is M, then the

bitrate of the quantizer output is $R = \lceil \log_2 M \rceil$

Example: M = 8 → R = 3

Quantizer design problem: 

 

Given an input pdf $f_X(x)$ and the number of levels M in the

quantizer, find the decision boundaries $\{b_i\}$ and the

reconstruction levels $\{y_i\}$ so as to minimize the mean

squared quantization error


Quantized Bitrate with VLC 

For VLC representation of quantization intervals, the 

bitrate depends on decision boundary selection 

Example: eight-level quantizer: 

$$R = \sum_{i=1}^{M} l_i P(y_i) = \sum_{i=1}^{M} l_i \int_{b_{i-1}}^{b_i} f_X(x)\,dx,$$

where $l_i$ is the length of the codeword assigned to output $y_i$.


Optimization of Quantization 

Rate-optimized quantization 

Given: Distortion constraint $\sigma_q^2 \le D^*$

Find: $\{b_i\}$, $\{y_i\}$, binary codes

Such that: R is minimized

Distortion-optimized quantization

Given: Rate constraint $R \le R^*$

Find: $\{b_i\}$, $\{y_i\}$, binary codes

Such that: $\sigma_q^2$ is minimized


Uniform Quantizer 

All intervals are of the same size 

 

Boundaries are evenly spaced (step size: Δ), except for the outermost

intervals

Reconstruction 

 

Usually the midpoint is selected as the representing value 

Quantizer types: 

 

 

Midrise quantizer: zero is not an output level 

Midtread quantizer: zero is an output level 
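The two quantizer types can be written as one-line functions. This is a sketch only: it assumes an unbounded quantizer (no saturation at the outermost intervals) and midpoint reconstruction, and the names `midrise` and `midtread` are illustrative.

```python
import math

def midrise(x: float, delta: float) -> float:
    """Midrise: outputs are odd multiples of delta/2, so zero is never an output."""
    return (math.floor(x / delta) + 0.5) * delta

def midtread(x: float, delta: float) -> float:
    """Midtread: outputs are integer multiples of delta, so zero is an output."""
    return math.floor(x / delta + 0.5) * delta

for x in (-0.6, -0.1, 0.1, 0.6):
    print(x, midrise(x, 1.0), midtread(x, 1.0))
```

With Δ = 1, small inputs around zero map to ±0.5 under the midrise rule and to 0 under the midtread rule.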


Midrise vs. Midtread Quantizer 

Midrise Midtread 


Uniform Quantization of Uniform Source 

If the source is uniformly distributed in $[-X_{\max}, X_{\max}]$

and is quantized by an M-level uniform quantizer,

then the quantization step size is

$$\Delta = \frac{2X_{\max}}{M},$$

and the distortion is

$$\sigma_q^2 = 2\sum_{i=1}^{M/2} \int_{(i-1)\Delta}^{i\Delta} \left(x - \frac{2i-1}{2}\Delta\right)^2 \frac{1}{2X_{\max}}\,dx = \frac{\Delta^2}{12}.$$

This is quite difficult to evaluate! 


Alternative MSQE Derivation 

We can also compute the “power” of quantization 

error q = x – Q(x), q ∈ [–Δ/2, Δ/2] by: 

$$\sigma_q^2 = \frac{1}{\Delta}\int_{-\Delta/2}^{\Delta/2} q^2\,dq = \frac{\Delta^2}{12}.$$


The SNR of Quantization 

For n-bit uniform quantization of a uniform source on

$[-X_{\max}, X_{\max}]$, the SNR is 6.02n dB, where $n = \log_2 M$:

$$10\log_{10}\frac{\sigma_s^2}{\sigma_q^2} = 10\log_{10}\left(\frac{(2X_{\max})^2}{12}\cdot\frac{12}{\Delta^2}\right) = 10\log_{10}\left(\frac{(2X_{\max})^2}{12}\cdot\frac{12}{\left(2X_{\max}/M\right)^2}\right) = 10\log_{10} M^2 = 20\log_{10} 2^n = 6.02n\ \text{dB}.$$
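The 6.02n dB rule can be checked numerically. The sketch below assumes a midrise uniform quantizer and replaces the uniform source by a dense, evenly spaced grid of sample points on [–X_max, X_max]; the function name and the grid size are illustrative choices.

```python
import math

def uniform_snr_db(n_bits: int, x_max: float = 1.0, n_points: int = 100_000) -> float:
    """SNR (dB) of an n-bit midrise uniform quantizer on a uniform source."""
    m = 2 ** n_bits
    delta = 2.0 * x_max / m
    signal = noise = 0.0
    for k in range(n_points):
        # Evenly spaced points stand in for the uniform pdf.
        x = -x_max + (k + 0.5) * 2.0 * x_max / n_points
        idx = min(int((x + x_max) / delta), m - 1)   # interval index 0..m-1
        y = -x_max + (idx + 0.5) * delta             # midpoint reconstruction
        signal += x * x
        noise += (x - y) ** 2
    return 10.0 * math.log10(signal / noise)

for n in (2, 4, 6):
    print(n, round(uniform_snr_db(n), 2), round(6.02 * n, 2))
```

Each extra bit halves Δ, which divides the noise power Δ²/12 by four and adds about 6.02 dB.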


Example: Quantization of Sena 

Darkening and contouring effects of quantization 

8 bits / pixel 





Quantization of Non-uniform Sources 

Given a non-uniform source, x ∈[–100, 100], 

P(x∈[–1, 1]) = 0.95, and we want to design an 8-level 

(3-bit) quantizer. 

A naïve approach uses a uniform quantizer (∆ = 25):

 

95% of sample values represented by only two numbers: 

–12.5 and 12.5, with a maximal quantization error of 12.5 

and minimal error of 11.5 

If we instead use ∆ = 0.3 (the two end intervals become huge):

The maximal error is now 98.95 (i.e. 100 – 1.05); however, 95% of

the time the error is at most 0.15
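The numbers in this example can be reproduced with a small saturating quantizer. This is a sketch under the assumption that both designs are 8-level midrise quantizers whose outermost intervals extend to the ends of the source range; `midrise_saturating` is a hypothetical helper name.

```python
import math

def midrise_saturating(x: float, delta: float, levels: int = 8) -> float:
    """Midrise uniform quantizer; inputs beyond the outer boundary saturate."""
    half = levels // 2
    idx = max(-half, min(half - 1, math.floor(x / delta)))
    return (idx + 0.5) * delta

# Delta = 25: every x in [-1, 1] maps to -12.5 or 12.5.
print(midrise_saturating(1.0, 25.0), abs(1.0 - midrise_saturating(1.0, 25.0)))

# Delta = 0.3: fine resolution near zero, large overload error at the extremes.
print(midrise_saturating(0.2, 0.3), 100.0 - midrise_saturating(100.0, 0.3))
```

With ∆ = 25 the error for x = 1 is 11.5; with ∆ = 0.3 the outermost output is 1.05, so x = 100 is reconstructed with an error of 98.95.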


Optimal ∆ that minimizes MSQE 

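Given the pdf $f_X(x)$ of the source, let's design an M-level

mid-rise uniform quantizer that minimizes MSQE:

$$\sigma_q^2 = \underbrace{2\sum_{i=1}^{M/2-1} \int_{(i-1)\Delta}^{i\Delta} \left(x - \frac{2i-1}{2}\Delta\right)^2 f_X(x)\,dx}_{\text{granular error}} + \underbrace{2\int_{(M/2-1)\Delta}^{\infty} \left(x - \frac{M-1}{2}\Delta\right)^2 f_X(x)\,dx}_{\text{overload error}}.$$

[Figure: quantization error x – Q(x) versus input, showing the granular error and overload error regions]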


Solving for Optimum Step Sizes 

Given an $f_X(x)$ and M, we can solve for ∆ numerically:

$$\frac{d\sigma_q^2}{d\Delta} = -\sum_{i=1}^{M/2-1} (2i-1) \int_{(i-1)\Delta}^{i\Delta} \left(x - \frac{2i-1}{2}\Delta\right) f_X(x)\,dx - (M-1)\int_{(M/2-1)\Delta}^{\infty} \left(x - \frac{M-1}{2}\Delta\right) f_X(x)\,dx = 0.$$

Optimal uniform quantizer ∆ for different sources:
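One way to solve for ∆ numerically is to evaluate the MSQE directly and search over candidate step sizes, instead of finding the root of the derivative. The sketch below swaps in that brute-force search; it assumes a zero-mean, unit-variance Gaussian source, midpoint-rule integration truncated at 10 standard deviations, and a candidate grid with spacing 0.01. All names are illustrative.

```python
import math

def gaussian_pdf(x: float) -> float:
    """Zero-mean, unit-variance Gaussian pdf (assumed source model)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def msqe(delta: float, levels: int, pdf, x_hi: float = 10.0, n: int = 2000) -> float:
    """MSQE of a midrise uniform quantizer for a symmetric pdf (granular + overload)."""
    half = levels // 2
    h = x_hi / n
    total = 0.0
    for k in range(n):
        x = (k + 0.5) * h
        idx = min(int(x / delta), half - 1)   # the outermost interval absorbs the overload
        y = (idx + 0.5) * delta
        total += (x - y) ** 2 * pdf(x) * h
    return 2.0 * total                        # symmetry: double the positive half

def best_step(levels: int, pdf, lo: float = 0.1, hi: float = 2.0, steps: int = 190) -> float:
    """Grid search for the step size with the smallest MSQE."""
    candidates = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    return min(candidates, key=lambda d: msqe(d, levels, pdf))

delta = best_step(8, gaussian_pdf)
print(delta, 10.0 * math.log10(1.0 / msqe(delta, 8, gaussian_pdf)))
```

A smaller ∆ lowers the granular term but raises the overload term, so the search settles on the value where the two balance.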


Overload/Granular Regions 

Selection of the step size must trade off between 

overload noise and granular noise 


Variance Mismatch Effects (1/2) 

Effect of variance mismatch on the performance of a 

4–bit uniform quantizer 


Variance Mismatch Effects (2/2) 

The MSQE as a function of variance mismatch with a 

4–bit uniform quantizer. 


Distribution Mismatch Effects 

Given a 3-bit quantizer, the effect of distribution

mismatch for different sources (SNR errors in dB):

 

 

From left to right, we assume that the sources are uniform,

Gaussian, Laplacian, and Gamma, and compute the

optimum MSQE step size for a uniform quantizer

The resulting ∆ gets larger from left-to-right 

→ if there is a mismatch, larger than “optimum” ∆ gives better performance 

→ 3-bit quantizer is too coarse 


Adaptive Quantization 

We can adapt the quantizer to the statistics of the 

input (mean, variance, pdf) 

Forward adaptive (encoder-side analysis) 

 

 

 

 

Divide input source in blocks 

Analyze block statistics 

Set quantization scheme 

Send the scheme to the decoder via side channel 

Backward adaptive (decoder-side analysis) 

 

 

 

Adaptation based on quantizer output only 

Adjust ∆ accordingly (encoder-decoder in sync) 

No side channel necessary 


Forward Adaptive Quantization (FAQ) 

Choosing analysis block size is a major issue 

Block size too large 

 

 

Not enough resolution 

Increased latency 

Block size too small 

 

More side channel information 

Assuming a mean of zero, signal variance is

estimated by

$$\hat{\sigma}_q^2 = \frac{1}{N}\sum_{i=0}^{N-1} x_{n+i}^2.$$
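The forward adaptive scheme can be sketched as a per-block loop: estimate the variance, derive a step size from it, and quantize the block. The code below is an illustration under several assumptions that are not in the slides: the step size is set to `step_per_sigma` times the estimated standard deviation (0.586 is roughly the MSQE-optimal 8-level step for a unit-variance Gaussian), and the side information is sent unquantized.

```python
import math

def block_variance(block):
    """Zero-mean variance estimate: (1/N) * sum of x^2 over the block."""
    return sum(x * x for x in block) / len(block)

def quantize_midrise(x, delta, levels):
    """Saturating midrise uniform quantizer."""
    half = levels // 2
    idx = max(-half, min(half - 1, math.floor(x / delta)))
    return (idx + 0.5) * delta

def forward_adaptive(samples, block_size=128, levels=8, step_per_sigma=0.586):
    """Quantize block by block; returns (reconstruction, per-block side info)."""
    out, side_info = [], []
    for start in range(0, len(samples), block_size):
        block = samples[start:start + block_size]
        sigma = math.sqrt(block_variance(block))
        # Fall back to an arbitrary step for an all-zero block.
        delta = step_per_sigma * sigma if sigma > 0 else 1.0
        side_info.append(sigma)   # would be quantized and sent to the decoder
        out.extend(quantize_midrise(x, delta, levels) for x in block)
    return out, side_info

# A quiet block followed by a loud block (synthetic test signal).
signal = [0.01 * math.sin(i) for i in range(128)] + [10.0 * math.sin(i) for i in range(128)]
recon, sigmas = forward_adaptive(signal)
print(sigmas)
```

The quiet block gets a step size about a thousand times smaller than the loud block, which a fixed quantizer could not provide.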


Speech Quantization Example (1/2) 

16-bit speech samples → 3-bit fixed quantization 


Speech Quantization Example (2/2) 

16-bit speech samples → 3-bit FAQ 

 

 

Block = 128 samples 

8-bit variance quantization 


FAQ Refinement 

So far, we have assumed a uniform pdf over the maximal range.

We can refine this by computing the range of the distribution

adaptively for each block

Example: Sena image, 8x8 blocks, 2x8-bit for range 

per block, 3-bit quantizer 

Original 8 bits/pixel 

Quantized 3.25 bits/pixel 


Backward Adaptive Quantization (BAQ) 

Key idea: only the encoder sees the input source. If we do

not want to use a side channel to tell the decoder how

to adapt the quantizer, we can only use the quantized

output to adapt the quantizer

Possible solution: 

 

 

 

Observe the number of output values that fall in the outer levels

and the inner levels

If they match the assumed pdf, ∆ is good 

If too many values fall in outer levels, ∆ should be enlarged, 

otherwise, ∆ should be reduced 

Issue: estimation of pdf requires large observations? 


Jayant Quantizer 

N. S. Jayant showed in 1973 that ∆ adjustment based 

on few observations still works fine: 

 

 

If current input falls in the outer levels, expand step size 

If current input falls in the inner levels, contract step size 

The total product of expansions and contractions should be 1

Each decision interval k has a multiplier $M_k$

If input $s_{n-1}$ falls in the k-th interval, the step size is multiplied by $M_k$

Inner-level $M_k < 1$, outer-level $M_k > 1$

 

Step size adaptation rule:

$$\Delta_n = M_{l(n-1)}\,\Delta_{n-1},$$

where l(n–1) is the quantization interval at time n–1.


Output Levels of 3-bit Jayant Quantizer 

The multipliers are symmetric: 

$M_0 = M_4$, $M_1 = M_5$, $M_2 = M_6$, $M_3 = M_7$


Example: Jayant Quantizer 

$M_0 = M_4 = 0.8$, $M_1 = M_5 = 0.9$

$M_2 = M_6 = 1.0$, $M_3 = M_7 = 1.2$, $\Delta_0 = 0.5$

Input: 0.1, –0.2, 0.2, 0.1, –0.3, 0.1, 0.2, 0.5, 0.9, 1.5 
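The example can be traced with a short simulation. This sketch assumes a 3-bit midrise quantizer in which levels 0 to 3 are the positive intervals from innermost to outermost and levels 4 to 7 are the corresponding negative intervals (the slide's figure with the level numbering is not reproduced in the text, so this indexing is an assumption). The step-size limits `delta_min` and `delta_max` are illustrative values.

```python
import math

def jayant_quantize(samples, delta0, multipliers, delta_min=1e-4, delta_max=1e4):
    """Backward-adaptive (Jayant) quantizer; returns (step, level, output) per sample."""
    half = len(multipliers)          # number of levels on each side of zero
    delta = delta0
    trace = []
    for x in samples:
        k = min(int(abs(x) / delta), half - 1)     # magnitude index, saturating
        y = math.copysign((k + 0.5) * delta, x)    # midrise reconstruction
        level = k if x >= 0 else k + half          # assumed level numbering
        trace.append((delta, level, y))
        # Next step size depends only on the transmitted level, so the
        # decoder can track it without side information.
        delta = min(delta_max, max(delta_min, delta * multipliers[k]))
    return trace

inputs = [0.1, -0.2, 0.2, 0.1, -0.3, 0.1, 0.2, 0.5, 0.9, 1.5]
for step, level, out in jayant_quantize(inputs, 0.5, [0.8, 0.9, 1.0, 1.2]):
    print(f"step={step:.6f} level={level} output={out:.6f}")
```

The step size shrinks while the inputs stay in the inner levels and starts to grow again once the last three inputs reach the outermost level.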


Picking Jayant Multipliers 

We must select Δ min and Δ max to prevent underflow 

and overflow of step sizes 

Selection of multipliers

The total product of expansions/contractions should be 1:

$$\prod_{k=0}^{M} M_k^{n_k} = 1,$$

where $n_k$ is the number of inputs that fall in interval k.

Scaled to the probability of events in each interval, we have

$$\prod_{k=0}^{M} M_k^{P_k} = 1, \quad \text{where } P_k = \frac{n_k}{N},\ N = \sum_{k=0}^{M} n_k.$$

Pick γ > 1, and let $M_k = \gamma^{l_k}$; we have

$$\sum_{k=0}^{M} l_k P_k = 0,$$

→ γ and $l_k$ are chosen, $P_k$ is known.


Example: Ringing Problem 

Use a 2-bit Jayant quantizer to quantize a square 

wave 

$P_0 = 0.8$, $P_1 = 0.2$ → pick $l_0 = -1$, $l_1 = 4$ (so that $l_0 P_0 + l_1 P_1 = 0$), γ ~ 1.


Jayant Quantizer Performance 

To avoid overload errors, we should expand ∆ rapidly

and contract ∆ moderately

Robustness over changing input statistics

Jayant is about 1 dB lower

than the fixed quantizer


Non-uniform Quantization 

For a uniform quantizer, the decision boundaries

are determined by a single parameter ∆.

We can certainly reduce quantization errors further

if each decision boundary can be selected freely


pdf-optimized Quantization 

Given $f_X(x)$, we can try to minimize MSQE:

$$\sigma_q^2 = \sum_{i=1}^{M} \int_{b_{i-1}}^{b_i} (x - y_i)^2 f_X(x)\,dx.$$

Set the derivative of $\sigma_q^2$ w.r.t. $y_j$ to zero and solve for $y_j$;

we have:

$$y_j = \frac{\int_{b_{j-1}}^{b_j} x f_X(x)\,dx}{\int_{b_{j-1}}^{b_j} f_X(x)\,dx}.$$

$y_j$ is the center of mass of $f_X$ in $[b_{j-1}, b_j)$

If the $y_j$ are determined, the $b_j$'s can be selected as:

$$b_j = \frac{y_{j+1} + y_j}{2}.$$
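The two conditions above (centroid for each level, midpoint for each boundary) can be applied alternately until they stop changing. The sketch below uses that alternating fixed-point iteration, which is a different procedure from the search over the initial value of $y_1$ described on the Lloyd-Max slides. It assumes a unit-variance Gaussian pdf truncated to [–6, 6] and midpoint-rule integration; all names are illustrative.

```python
import math

def gaussian_pdf(x: float) -> float:
    """Zero-mean, unit-variance Gaussian pdf (assumed source model)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def centroid(pdf, lo: float, hi: float, n: int = 1000) -> float:
    """Center of mass of pdf on [lo, hi] by midpoint-rule integration."""
    h = (hi - lo) / n
    num = den = 0.0
    for k in range(n):
        x = lo + (k + 0.5) * h
        w = pdf(x) * h
        num += x * w
        den += w
    return num / den

def alternate_conditions(pdf, levels: int, x_min: float = -6.0, x_max: float = 6.0,
                         iters: int = 60):
    """Alternate the midpoint rule for boundaries and the centroid rule for levels."""
    step = (x_max - x_min) / levels
    y = [x_min + (i + 0.5) * step for i in range(levels)]   # uniform initial guess
    b = [x_min] + [0.0] * (levels - 1) + [x_max]
    for _ in range(iters):
        for j in range(1, levels):
            b[j] = 0.5 * (y[j - 1] + y[j])                   # midpoint condition
        y = [centroid(pdf, b[j], b[j + 1]) for j in range(levels)]  # centroid condition
    return b, y

boundaries, recon = alternate_conditions(gaussian_pdf, 4)
print([round(v, 4) for v in boundaries])
print([round(v, 4) for v in recon])
```

For four levels the iteration settles near reconstruction levels ±0.45 and ±1.51 with an inner boundary near ±0.98, which are narrower near zero than a uniform design would give.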


Lloyd-Max Algorithm (1/3) 

The Lloyd-Max algorithm solves for $y_j$ and $b_j$ iteratively until

an acceptable solution is found

Example:

For a midrise quantizer, $b_0 = 0$ and $b_{M/2}$ is the largest input,

so we only have to find

$\{b_1, b_2, \ldots, b_{M/2-1}\}$ and

$\{y_1, y_2, \ldots, y_{M/2-1}\}$.



Lloyd-Max Algorithm (2/3)

Begin with j = 1; we want to find $b_1$ and $y_1$ by

$$y_1 = \frac{\int_{b_0}^{b_1} x f_X(x)\,dx}{\int_{b_0}^{b_1} f_X(x)\,dx}.$$

Pick a value for $y_1$ (e.g. $y_1 = 1$), solve for $b_1$, and

compute $y_2$ from $b_1 = (y_2 + y_1)/2$, i.e.

$$y_2 = 2b_1 - y_1,$$

and $b_2$ by

$$y_2 = \frac{\int_{b_1}^{b_2} x f_X(x)\,dx}{\int_{b_1}^{b_2} f_X(x)\,dx}.$$

Continue the process until all $\{b_j\}$ and $\{y_j\}$ are found



Lloyd-Max Algorithm (3/3)

If the initial guess of $y_1$ does not fulfill the termination

condition:

$$\left|\hat{y}_{M/2} - y_{M/2}\right| \le \varepsilon,$$

where

$$\hat{y}_{M/2} = 2b_{M/2-1} - y_{M/2-1}, \qquad y_{M/2} = \frac{\int_{b_{M/2-1}}^{b_{M/2}} x f_X(x)\,dx}{\int_{b_{M/2-1}}^{b_{M/2}} f_X(x)\,dx},$$

we must pick a different $y_1$ and repeat the process.


Example: pdf-Optimized Quantizers 

We can achieve gain over the uniform quantizer 

(9.24) (7.05) 

(12.18) (9.56) 

(14.27) (11.39) 


Mismatch Effects 

Non-uniform quantizers also suffer mismatch effects. 

To reduce the effect, one can use an adaptive nonuniform 

quantizer, or an adaptive uniform quantizer 

plus companded quantization techniques 

Variance mismatch on a 4-bit Laplacian non-uniform quantizer. 


Companded Quantization (CQ) 

In companded quantization, we adjust (i.e. re-scale) 

the intervals so that the size of each interval is in 

proportion to the probability of inputs in each interval 

equivalent to a non-uniform quantizer 


Example: CQ (1/2) 

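The compressor function:

$$c(x) = \begin{cases} 2x & \text{if } -1 \le x \le 1, \\ \dfrac{2x}{3} + \dfrac{4}{3} & \text{if } x > 1, \\ \dfrac{2x}{3} - \dfrac{4}{3} & \text{if } x < -1. \end{cases}$$

The uniform quantizer:

step size ∆ = 1.0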


Example: CQ (2/2) 

The expander function:

$$c^{-1}(x) = \begin{cases} \dfrac{x}{2} & \text{if } -2 \le x \le 2, \\ \dfrac{3x}{2} - 2 & \text{if } x > 2, \\ \dfrac{3x}{2} + 2 & \text{if } x < -2. \end{cases}$$

The equivalent non-uniform 

quantizer 
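The compressor, uniform quantizer, and expander can be chained to see the equivalent non-uniform quantizer. This is a sketch that assumes the uniform quantizer with ∆ = 1.0 is an unbounded midrise quantizer; the function names are illustrative.

```python
import math

def compress(x: float) -> float:
    """Compressor c(x): slope 2 inside [-1, 1], slope 2/3 outside."""
    if -1.0 <= x <= 1.0:
        return 2.0 * x
    return 2.0 * x / 3.0 + math.copysign(4.0 / 3.0, x)

def expand(y: float) -> float:
    """Expander c^{-1}(y): inverse of the compressor."""
    if -2.0 <= y <= 2.0:
        return y / 2.0
    return 1.5 * y - math.copysign(2.0, y)

def uniform_midrise(x: float, delta: float = 1.0) -> float:
    """Uniform midrise quantizer (assumed type) with step size delta."""
    return (math.floor(x / delta) + 0.5) * delta

def companded_quantize(x: float) -> float:
    """Compress, quantize uniformly, then expand."""
    return expand(uniform_midrise(compress(x)))

for x in (0.2, 0.9, 1.4, 3.0, -3.0):
    print(x, companded_quantize(x))
```

Inside [–1, 1] the effective step size is 0.5, while outside it is 1.5, so the frequently occurring small inputs are quantized more finely.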


Remarks on CQ 

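If the number of quantizer levels is large and the input is

bounded by $x_{\max}$, it is possible to choose a c(x) such

that the SNR of CQ is independent of the input pdf:

$$\text{SNR} = 10\log_{10}(3M^2) - 20\log_{10}\alpha,$$

where $c'(x) = x_{\max} / (\alpha |x|)$ and α is a constant.

Two popular CQ for telephones: µ-law and A-law

µ-law compressor

$$c(x) = x_{\max}\,\frac{\ln\left(1 + \mu\frac{|x|}{x_{\max}}\right)}{\ln(1 + \mu)}\,\operatorname{sgn}(x).$$

A-law compressor

$$c(x) = \begin{cases} \dfrac{A|x|}{1 + \ln A}\,\operatorname{sgn}(x), & 0 \le \dfrac{|x|}{x_{\max}} \le \dfrac{1}{A}, \\ x_{\max}\,\dfrac{1 + \ln\frac{A|x|}{x_{\max}}}{1 + \ln A}\,\operatorname{sgn}(x), & \dfrac{1}{A} \le \dfrac{|x|}{x_{\max}} \le 1. \end{cases}$$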
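The µ-law compressor is easy to evaluate directly. In the sketch below, the expander is obtained by algebraically inverting the compressor formula (it is not given in the slides), and µ = 255 is an assumed, commonly used value; both function names are illustrative.

```python
import math

def mu_law_compress(x: float, x_max: float = 1.0, mu: float = 255.0) -> float:
    """c(x) = x_max * ln(1 + mu*|x|/x_max) / ln(1 + mu) * sgn(x)."""
    magnitude = x_max * math.log1p(mu * abs(x) / x_max) / math.log1p(mu)
    return math.copysign(magnitude, x)

def mu_law_expand(y: float, x_max: float = 1.0, mu: float = 255.0) -> float:
    """Inverse of mu_law_compress, derived by solving c(x) = y for x."""
    magnitude = (x_max / mu) * math.expm1(abs(y) / x_max * math.log1p(mu))
    return math.copysign(magnitude, y)

for x in (0.001, 0.01, 0.1, 1.0):
    print(x, round(mu_law_compress(x), 4))
```

Small amplitudes are stretched strongly (an input of 0.01 maps to roughly 0.23), so a uniform quantizer applied after the compressor spends most of its levels on low-amplitude inputs.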


Entropy Coding of Quantizer Outputs 

The levels of the quantizer are the alphabet of the entropy

coder; for an M-level quantizer, FLC needs $\lceil \log_2 M \rceil$ bits

per output

Example of VLC coded output of minimum MSQE 

quantizers: 

 

Note: the non-uniform quantizer has higher entropies, since

high-probability regions use smaller step sizes


Vector Quantization 

Vector quantization groups source data into vectors 

 

At both the encoder and decoder, we have a set of vectors 

called the codebook. Each code-vector in the codebook is 

assigned a binary index. 


Why Vector Quantization (1/2)? 

Correlated multi-dimensional data have limited valid 

ranges 

[Figure: two regions of the plot are labeled "useless"]


Why Vector Quantization (2/2)? 

Looking at the data from a higher dimension allows us

to better fit the quantizer structure to the joint pdf

 

Example: quantize the Laplacian source data two at a time: 

11.44 dB 11.73 dB 


Vector Quantization Rule 

Vector quantization (VQ) of X may be viewed as the 

classification of the outcomes of X into a discrete 

number of sets or cells in N-space 

 

Each cell is represented by a vector output $Y_j$

Given a distance measure d(x, y), we have

VQ output: $Q(X) = Y_j$ iff $d(X, Y_j) < d(X, Y_i)$, ∀ i ≠ j.

Quantization region: $V_j = \{X : d(X, Y_j) < d(X, Y_i),\ \forall i \ne j\}$.

[Figure: a quantization region $V_j$ and its output point $Y_j$ in the $(x_1, x_2)$ plane]
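The quantization rule above amounts to a nearest-neighbor search over the codebook. The sketch below assumes squared Euclidean distance for d and breaks ties in favor of the lowest index; the codebook values are made up for illustration.

```python
def squared_distance(x, y):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

def vq_encode(x, codebook):
    """Return the index j of the code-vector Y_j closest to x."""
    return min(range(len(codebook)), key=lambda j: squared_distance(x, codebook[j]))

def vq_decode(index, codebook):
    """Look up the code-vector for a received index."""
    return codebook[index]

# Hypothetical 2-D codebook with three code-vectors.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 0.0)]
index = vq_encode((0.9, 1.2), codebook)
print(index, vq_decode(index, codebook))
```

Only the index is transmitted, so a codebook of K vectors of dimension N costs ⌈log₂ K⌉ / N bits per sample with fixed-length coding.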


Codebook Design 

The set of quantizer output points in VQ is called the codebook 

of the quantizer, and the process of placing these output points 

is often referred to as codebook design 

The k-means algorithm is often used to classify the outputs 

 

 

 

 

Given a large set of output vectors from the source, known as the 

training set, and an initial set of k representative patterns 

Assign each element of the training set to the closest 

representative pattern 

After an element is assigned, the representative pattern is updated 

by computing the centroid of the training set vectors assigned to it 

When the assignment process is complete, we will have k groups of 

vectors clustered around each of the output points 


The Linde-Buzo-Gray Algorithm 

1. Start with an initial set of reconstruction values

$\{Y_i^{(0)}\}$, i = 1..M, and a set of training vectors $\{X_n\}$, n = 1..N.

Set k = 0, $D^{(-1)} = 0$. Select threshold ε.

2. The quantization regions $\{V_i^{(k)}\}$, i = 1..M, are given by

$V_i^{(k)} = \{X_n : d(X_n, Y_i) < d(X_n, Y_j),\ \forall j \ne i\}$, i = 1, 2, …, M.

3. Compute the average distortion $D^{(k)}$ between the

training vectors and their representative values

4. If $|D^{(k)} - D^{(k-1)}| / D^{(k)} < \varepsilon$, stop; otherwise, continue

5. Let k = k + 1. Update $\{Y_i^{(k)}\}$, i = 1..M, with the average

value of each quantization region $V_i^{(k-1)}$. Go to step 2.
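The five steps can be sketched in a few lines. The code below assumes squared Euclidean distance, keeps the old code-vector when a region receives no training vectors (the steps above do not say how to handle empty regions), and adds an iteration cap as a safety limit; the names and the toy training set are illustrative.

```python
def squared_distance(x, y):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

def lbg(training, codebook, eps=1e-3, max_iter=100):
    """Iterate region assignment and centroid update until distortion settles."""
    codebook = [tuple(c) for c in codebook]
    dim = len(codebook[0])
    prev = None
    dist = 0.0
    for _ in range(max_iter):
        # Step 2: assign each training vector to its nearest code-vector.
        cells = [[] for _ in codebook]
        total = 0.0
        for x in training:
            j = min(range(len(codebook)), key=lambda i: squared_distance(x, codebook[i]))
            cells[j].append(x)
            total += squared_distance(x, codebook[j])
        # Step 3: average distortion.
        dist = total / len(training)
        # Step 4: stop when the relative change in distortion is small.
        if dist == 0.0 or (prev is not None and abs(prev - dist) / dist < eps):
            break
        prev = dist
        # Step 5: move each code-vector to the centroid of its region.
        codebook = [
            tuple(sum(v[d] for v in cell) / len(cell) for d in range(dim)) if cell else c
            for cell, c in zip(cells, codebook)
        ]
    return codebook, dist

# Two well-separated clusters (toy training set).
training = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
print(lbg(training, [(0, 0), (10, 10)]))
```

The result depends on the initial codebook and the training set, since the iteration only converges to a local minimum of the distortion.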


Example: Codebook Design 

Initial state 

Final state 


Impact of Training Set 

The training sets used to construct the codebook 

have significant impact on the performance of VQ 

Images quantized at 0.5 

bits/pixel, codebook size 256 
