A Modified SPIHT Algorithm for Image Coding With a Joint MSE and ...

722 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 3, MARCH 2006

Fig. 13. Gaussian noise at -3 dB SNR is added to the finest LH wavelet coefficients of the right half texture, to obtain the synthetic textures shown in the left half.

Fig. 14. Classification errors as a function of MSE are compared among MSPIHT (λ = 50000), MSPIHT (λ = 0), and the original SPIHT. The bit rates are 0.2, 0.3, 0.4, 0.5, and 0.6 bpp. Results are shown for data in Fig. 13.

Fig. 15. Gaussian noise at -3.5 dB SNR is added to the second-level LH wavelet coefficients of the right half texture, to obtain the synthetic textures shown in the left half.

Fig. 16. Classification errors as a function of MSE are compared among MSPIHT (λ = 50000), MSPIHT (λ = 0), and the original SPIHT. The bit rates are from 0.2 to 0.6 bpp at intervals of 0.05 bpp. Results are shown for synthesized data in Fig. 15.

As indicated above, the coefficient reweighting can disturb zerotrees. This implies that the performance of the MSPIHT algorithm may depend on which particular wavelet coefficients are scaled to be large (that is, where they sit in the wavelet tree). In the previous example the significant wavelet coefficients (those to which noise was added) were at the finest level. We now consider a synthetic image as in the left half of Fig. 15, but now the white Gaussian noise is added to the coefficients in the second LH band (for [...] levels, as in the previous example). In this case, the SNR, as defined above, is -3.5 dB. Results are presented in Fig. 16. We again consider [...] for the MSPIHT results for which classification is emphasized. In Fig. 16, we note that MSPIHT for [...] provides excellent classification performance for low MSE (highest bpp), better than MSPIHT with [...]. This is attributed to the fact that the coefficients that are important are relatively large in amplitude (lower scale than in Fig. 14). These large-amplitude coefficients, which are also important for classification, are reconstructed well by SPIHT and MSPIHT with [...].
However, at lower bpp (higher MSE), the classification improvement of MSPIHT with [...] is more evident. Note that in Fig. 14, for which the small-amplitude finest-level coefficients are important for classification, the MSPIHT with [...] yields better classification performance for all MSE (bpp) considered.

D. Modified SPIHT and Bayes VQ

The Bayes tree-structured vector quantization (B-TSVQ) algorithm introduced by Oehler and Gray [15] is a joint compression and classification technique. It combines classification and compression into a single vector quantizer by incorporating a Bayes risk term into the distortion measure. For large block sizes, B-TSVQ performance approaches the theoretical rate-distortion bound [5]. TSVQ has three limitations: computational complexity, the need to know the posterior probability, and the need for a large training set. In image processing, the VQ block size is usually 4 × 4 or smaller because of computational constraints [20].

CHANG AND CARIN: MODIFIED SPIHT ALGORITHM FOR IMAGE CODING 723

Fig. 17. Training imagery for B-TSVQ, with labeled “urban” and “rural” classes.

In our first comparison, we consider the measured imagery in Fig. 4, with the “true” segmentation defined as discussed above (dictated by the results of the HMT segmentation). The MSPIHT results are extensions from Fig. 12, for [...] and [...]. The MSPIHT results are computed adaptively on the imagery in Fig. 4, without any a priori training data. By contrast, the B-TSVQ requires training data to design the tree-structured codebook and to build the associated classifier (a look-up table that maps a code to a texture). In Fig. 17, we present separate training data, from the “rural” and “urban” classes, used to train the B-TSVQ algorithm. These data are distinct examples from the same USC-SIPI database from which Fig. 4 was acquired. We consider a bit rate of 0.35 bit/pixel. To achieve this bit rate we run the required number of MSPIHT iterations, while for B-TSVQ the bit rate is dictated by the number of codes and the size of each block. We here consider a codebook of size 49, and each block is of size 4 × 4. The MSPIHT classification is based upon two wavelet levels, to be consistent with the 4 × 4 blocks used by B-TSVQ. However, to improve coding efficiency, the MSPIHT is run for [...] levels (only two of which are used in the classifier). To run MSPIHT with [...], we first run the iterative algorithm in Section III-C for [...].
The wavelet and scaling coefficients are then weighted as so determined. The subsequent three wavelet levels are then performed on these weighted coefficients. We also show MSPIHT results for a classifier based on three levels, corresponding to 8 × 8 blocks. The results in Fig. 18, for both MSPIHT and B-TSVQ, are computed by controlling the Lagrange multiplier that dictates the balance between concentrating on MSE and classification error. The results indicate that MSPIHT has better compression performance than B-TSVQ (smaller MSE), but B-TSVQ has more sensitivity to the Lagrangian-driven tradeoff between MSE and classification (although in these results the B-TSVQ MSE does not change substantially as the Lagrange multiplier changes).

Fig. 18. Classification error as a function of MSE, at a bit rate of 0.35 bpp. For both B-TSVQ and MSPIHT the variation in classification error and MSE is controlled by adjusting a respective Lagrange multiplier. Results are shown for data in Fig. 4.

Fig. 19. Classification error as a function of MSE, with increasing Lagrangian multipliers, compared between B-TSVQ and MSPIHT at a bit rate of 0.35 bpp. Results are shown for data in Fig. 13.

To complement the results discussed above (Fig. 4), for which there may be some uncertainty as to actual “truth,” we also consider synthetic data. In Fig. 19, we study the tradeoff between MSE and classification error for B-TSVQ and MSPIHT for the bit rate 0.35 bit/pixel, for the data in Fig. 13, which for B-TSVQ corresponds to a codebook of size 49, for blocks of size 4 × 4. Again, the MSPIHT classification is based upon two wavelet levels, to be consistent with the 4 × 4 blocks used by B-TSVQ, and to improve coding efficiency the MSPIHT is run for [...] levels. We also show MSPIHT results for a classifier based on three levels, corresponding to 8 × 8 blocks. Separate training data with the same statistics were used to build the codes for the B-TSVQ.
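
The link between codebook size, block size, and the 0.35 bit/pixel rate quoted above can be checked with a short calculation. The sketch below is illustrative only and is not taken from the paper: it assumes every block is coded with a single fixed-length codebook index (a pruned, variable-rate TSVQ would differ), and it assumes a dyadic wavelet transform in which a classifier using a given number of levels corresponds to a square pixel block whose side is 2 raised to that number. The function names are hypothetical.

```python
import math


def fixed_rate_vq_bpp(codebook_size: int, block_side: int) -> float:
    """Bits per pixel when each block_side x block_side block is coded
    with one fixed-length index into the codebook (assumption)."""
    bits_per_block = math.log2(codebook_size)
    return bits_per_block / (block_side * block_side)


def block_side_for_levels(levels: int) -> int:
    """Side of the pixel block matched to `levels` dyadic wavelet levels
    (assumption: each level halves the resolution in both directions)."""
    return 2 ** levels


rate = fixed_rate_vq_bpp(codebook_size=49, block_side=4)
print(f"{rate:.3f} bpp")                                    # 0.351 bpp
print(block_side_for_levels(2), block_side_for_levels(3))   # 4 8
```

Under these assumptions, a codebook of 49 entries on 4 × 4 blocks gives log2(49)/16, or about 0.351 bit/pixel, which agrees with the stated 0.35 bit/pixel. Two and three wavelet levels map to 4 × 4 and 8 × 8 blocks, as in the text.
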
When comparing results for 4 × 4 blocks, we note that the MSPIHT algorithm performs best for high classification error (lower MSE), with this attributed to the fact that the MSPIHT algorithm is effectively employing larger block sizes