Signal Analysis Research (SAR) Group - RNet - Ryerson University

Recommendations

Info

formant referenced by the black arrow. The watermark is em- Normalized frequency 0.5 0.25 0 0 1000 2000 3000 4000 Normalized time Fig. 3. Log–spectrogram of the test signal. Male utterance of the word “bingo” with a sampling rate of 8 kHz. bedded as described in Sec. 3.1. First, the data (τ)2 = 010011 is used to generate the amplitude of the watermark by means of the Equ. 16. Then the insertion zone is manually choseninordertodene the warping operator used to generate the time–warped watermark. Finally, the time–warped watermark is added to the original signal. Result of the watermark embedding is shown in Fig. 4. As can be seen, the time– Normalized frequency 0.5 0.25 0 0 1000 Normalized time (a) Original signal Normalized frequency 0.5 0.25 0 0 1000 Normalized time (b) Watermarked signal Fig. 4. Zoomed log–spectrogram of the original and watermarked test signal. warped watermark is very close to the original formant. As expected, the frequency spread decreases very fast, thanks to the smoothness of the amplitude of the watermark sequence. We found the embedding strategy satisfactory since we were not able to guess wether the signal was watermarked or not during blind tests. In order to provide a more objective comparison criterion, we make use of the “Auditory Toolbox” [4] to generate auditory representations of original and watermarked signals which is a pseudo–time–frequency representation based on physiological aspects of human hears. Auditory representations of original and watermarked signals are depicted in Fig. 5. Both representations are very similar which conrms stealthiness of the watermark. Next step consists in the recovery of the watermark sequence. We tested the proposed approach on the true watermarked signal, and on two different deteriorated versions of the watermarked signal : the rst is a MP3 compression attack, and the second is an additive Gaussian noise attack with a signal–to–noise ratio of 0 dB. Results of the matched ltering estimation are presented in Tab. 1. Results of the estimation step show that the watermark is perfectly extracted and has resisted to the MP3 attack as well as the white–noise attack. II 204 27 Channels 120 60 0 0 1000 Normalized time (a) Original signal Channels 120 60 0 0 1000 Normalized time (b) Watermarked signal Fig. 5. Auditory representation of the original signal and the watermarked signal. τ τ1 τ2 τ3 τ4 τ5 τ6 True 0 1 0 0 1 1 No attack 0 1 0 0 1 1 MP3 attack 0 1 0 0 1 1 Noise attack 0 1 0 0 1 1 Table 1. Results of the estimation of the set {τi} by matched ltering. 5. CONCLUSION In this paper we have proposed a new watermarking method for speech signals, based on the time–warping signal processing concept. We have shown that it is possible to exploit physiological aspects of the human hear in order to carry information while keeping stealthiness of the inserted watermark. Then, we have developed a complete extraction method based on time–varying lter, time–warping operators and match ltering, to recover the watermark sequence. Numerical results show that the watermark is likely not hearable and numerical information carried by the watermark are retrievable. Future work will include a close study the robustness of the method against various attacks. For real applications, another topic is the unsupervised embedding of the watermark according to the position of formant. This issue is left for future work. 6. REFERENCES [1] M. Arnold, “Audio watermarking: Features, applications and algorithms,” in IEEE International Conference on Multimedia and Expo, New York, USA, July 2000. [2] R. Baraniuk, “Unitary equivalence: A new twist on signal processing,” IEEE Trans. on Signal Processing, vol. 43, no. 10, pp. 2269–2282, Oct. 1995. [3] M.C. Botte, G. Canevet, L. Demany, and C. Sorin, Psychoacoustique et perception auditive, Inserm, 1989. [4] M. Slanley, “Auditory toolbox, version 2.0,” Avaiable at http://www.slaney.org/malcolm/pubs.html, 1994. Authorized licensed use limited to: Ryerson University Library. Downloaded on July 7, 2009 at 11:18 from IEEE Xplore. Restrictions apply.
Chirp-based image watermarking as error-control coding Behnaz Ghoraani, and Sridhar Krishnan Dept. of Elec. & Comp. Eng., Ryerson University, Toronto, Canada E-mail: bghoraan@rnet.ryerson.ca, and krishnan@ee.ryerson.ca Abstract In this paper, we use post processing methods to compensate the bit errors occurred in watermark embedding and extracting. Forward error correction (FEC)-based and chirp-based techniques are applied to encode and shape the embedded watermark message so that even at the presence of some bit error rates (BERs) in the extracted watermark, the watermarking algorithm be able to successfully estimate the correct embedded watermark message. Repetition and Bose-Chaudhuri-Hocquenghem (BCH) codings are used as two well-known FEC schemes, and discrete polynomial transform (DPPT) and Hough-Radon transform (HRT) are utilized as two chirp detectors in chirp-based watermarking. Robustness of all the proposed post processing methods are tested for checkmark benchmark attacks, and we found that the chirp-based watermarking using the DPPT chirp detector offers the highest watermark extraction rate, and the best bit error compensation even at BERs of higher than 17%. 1. Introduction The worldwide trend of using the Internet to electronically distribute multimedia offers lot of advantages such as huge cost reduction and considerable time saving of transportation process to both owners and consumers. However, the available methods for distribution of multimedia lacks the privacy and ownership proof. One of the suggested solutions to protect the copyrights and prevent illegal use of the multimedia is watermarking. Watermarking is embedding a hidden signature into the multimedia signal containing information about the content authentication, the access control and copy protection, and the identification and traitor tracing in the case of illegally distribution. The embedded watermark signal should be imperceptible, and do not effect the quality of the multimedia content. Also, since users normally apply many signal manipulations such as lossy compressions on a multimedia signal, the watermark should be robust to these typical signal operations. However, even us- Proceedings of the 2006 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP'06) 0-7695-2745-0/06 $20.00 © 2006 ing the robustest embedding techniques, there will be some bit errors in the received watermark message. Therefore, the watermark detection process encounters some difficulties in extracting the exact watermark message, and retrieving the hidden information. Shaping the embedded signature in a certain way that the extractor could compensate the bit error rate (BER) of the message using the prior knowledge about the watermark structure can be useful in estimating the exact watermark message. In this study, we focus on utilizing chirps as watermark message structures [1][7], and comparing its results with forward error correction (FEC) schemes. To do the experiments, we use spread spectrum method to embed the watermark messages to discrete cosine transform (DCT) coefficients of the image. After extracting the watermark message from the watermarked image, there is a post processing stage. We utilize BCH and repetition codings for FECbased post processing, and discrete polynomial transform (DPPT) and Hough-Radon transform (HRT) detectors in chirp-based watermarking. In this paper, we present the results of chirp-based and FEC-based post processings, and show that the chirp-based watermarking is found comparable with the FEC schemes. 2. Watermarking method In this study we use spread-spectrum watermarking scheme which is a correlation method that embeds pseudorandom sequence and detects watermark by calculating correlation between pseudo-random noise sequence and watermarked signal. Spread-spectrum scheme is the most popular scheme and has been studied well in literature [2]. The spread-spectrum method can be applied in time domain or transformed frequency domain. We utilized DCT coefficients, which are widely used in compression applications and are easier to impose human visual system (HVS) constraints on them. Figure 1 shows the block diagram of the watermarking embedding and extraction schemes [7]. As mentioned earlier, because of the intentional and nonintentional signal processings there will be some BERs in the received message. In the next section we try to compen- Authorized licensed use limited to: Ryerson University Library. Downloaded on July 7, 2009 at 11:31 from IEEE Xplore. Restrictions apply. 28
Page 1 and 2: Signal Analysis Research (SAR) Grou
Page 3 and 4: 2006.05 Support Vector Machines Bas
Page 5 and 6: 2001.05 Instantaneous Mean Frequenc
Page 7 and 8: Class A Class B TWFB Mapping Discri
Page 9 and 10: as an added advantage of this appro
Page 11 and 12: Combining Vocal Source and MFCC Fea
Page 13 and 14: probability distribution for calcul
Page 15 and 16: Proceedings of the 29th Annual Inte
Page 17 and 18: Fig. 1. Typical images of the small
Page 19 and 20: This full text paper was peer revie
Page 21 and 22: Frequency 1 0.9 0.8 0.7 0.6 0.5 0.4
Page 23 and 24: Frequency TABLE I RESULTS WITH LINE
Page 25 and 26: Emotion Recognition Using Novel Spe
Page 27 and 28: points [1, 2, 3] (Fig. 2). SVM is b
Page 29 and 30: A WATERMARKING METHOD FOR SPEECH SI
Page 31: In order to exploit the masking eff
Page 35 and 36: 3.2.2 Discrete Polynomial Phase Tra
Page 37 and 38: 2006 International Joint Conference
Page 39 and 40: Fig 2 Two-dimensional mapping (Left
Page 41 and 42: p3 Feature Extraction u Database +
Page 43 and 44: 1424403677/06/$20.00 ©2006 I
Page 45 and 46: = − = K ≤
Page 47 and 48: DISCRETE POLYNOMIAL TRANSFORM FOR D
Page 49 and 50: 3.2. Watermark detection Fig.2 show
Page 51 and 52: IMPROVING POSITION ESTIMATES FROM A
Page 53 and 54: _ . r W-. n r We tried two ways of
Page 55 and 56: Acknowledgements This work of Ryers
Page 57 and 58: ing two consecutive keys. Another c
Page 59 and 60: Table 1. Experimental Results for F
Page 61 and 62: D as { gγ γ ∈ Γ, g = 1} = γ ,
Page 63 and 64: frequency parameters issued from th
Page 65 and 66: necessity of prediction, controllin
Page 67 and 68: 4. Experimental Results The objecti
Page 73: Proceedings of the 2005 IEEE Engine
Page 78 and 79: the whole spectrum but on non-overl
Page 80 and 81: Identification Rate 1 0.9 0.8 0.7 0
Page 82 and 83:
signals. On some occasions, when us
Page 84 and 85:
For our first test, we recorded two
Page 86 and 87:
Frequency bands F4 F3 F2 F1 ME5 s1
Page 88 and 89:
as F1, F2, F3 and F4 as shown in Fi
Page 90 and 91:
Figure. 1. Motion Vector magnitudes
Page 92 and 93:
All the above descriptor were quant
Page 94 and 95:
2. METHODOLOGY 2.1. Local Discrimin
Page 96 and 97:
. each group, the posterior probabi
Page 98 and 99:
A NOVEL ROBUST IMAGE WATERMARKING U
Page 100 and 101:
where n is the distortion component
Page 102 and 103:
A Novel Way of Lossless Compression
Page 104 and 105:
111. IMPLEMENTATION As we have pres
Page 106 and 107:
CONTENT BASED AUDIO CLASSIFICATION
Page 108 and 109:
Entropy 4.5 4 3.5 3 2.5 2 1.5 1 0.5
Page 110 and 111:
MODIFIED LOCAL DISCRIMINANT BASES A
Page 112 and 113:
Tree Decomposition (0,0) (1,0) (1,1
Page 114 and 115:
RADIO OVER MULTIMODE FIBER FOR WIRE
Page 116 and 117:
3. COMPARISON OF MULTIMODE FIBER AN
Page 118 and 119:
SUB-DICTIONARY SELECTION USING LOCA
Page 120 and 121:
0 : SI 92 r3 54 =-ax Tirnep"idfh :
Page 122 and 123:
Proceedings of the 25h Annual Inter
Page 124 and 125:
estimated noise generated as the ou
Page 126 and 127:
ROBUST AUDIO WATERMARKING USING A C
Page 128 and 129:
where 0 is the angle of the ray pat
Page 130 and 131:
TIME-FREQUENCY FILTERING OF INTERFE
Page 132 and 133:
3. INTERFERENCE DETECTION. nouen AN
Page 134 and 135:
A GENERAL PERCEPTUAL TOOL FOR EVALU
Page 136 and 137:
Fig. 2. Snapshot of the GUI used fo
Page 138 and 139:
Non-Stationary Noise Cancellation i
Page 140 and 141:
signal and noise spectra overlap, f
Page 142 and 143:
Fig. 4: Output of the Comb filter.
Page 144 and 145:
Indoor infrared transmission suffer
Page 146 and 147:
The inverse wavelet transform is de
Page 148 and 149:
The simulations has been done using
Page 150 and 151:
Figure 6: The original Gaussian noi
Page 152 and 153:
147 Authorized licensed use limited
Page 154 and 155:
Page 156 and 157:
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Figure 2: Block diagram of the FBB
Page 172 and 173:
Figure 4: Benign mammogram before F
Page 174 and 175:
. Table 1: Compression ratios of be
Page 176 and 177:
Most Cohen’s class TFD derived fr
Page 178 and 179:
the desired frequency marginal m(w)
Page 180 and 181:
constructing an adaptive TFD and ex
Page 182 and 183:
Proceedings of the 22"d Annual EMBS
Page 184 and 185:
are: Proceedings of the 22"d Annual
Page 186 and 187:
may be written as M-1 = (z,grn)gm,
Page 188 and 189:
e 1 os 07- Ohzo5- 04- 09- 02- 01- O
Page 190 and 191:
using the denoised signals. As an i
Page 192 and 193:
signals are: 1) model-based TFD, 2)
Page 194 and 195:
compared to that in the MP dictiona
Page 196 and 197:
fied. In addition to being positive
Page 198 and 199:
08 [3] S. Peleg and B. Friedlander.
Page 200 and 201:
Proceedings - 19th International Co
Page 202 and 203:
Proceedings - 19th International Co
Page 204 and 205:
the x and y axes correspond to the
Page 206 and 207:
Fig. 5. Results with synthetic sign
Page 208 and 209:
the users and is explained in next
Page 210 and 211:
,-t 1 2 J 4 5 e 7 e e 10 SNR (I" de
Page 212 and 213:
'il! (c) (a) Weakea User 2nd Strong
Page 214 and 215:
18th Annual International Conferenc
Page 216 and 217:
quisition board and Lab Windows (Na
Page 218 and 219:
Table 1: Comparison of different cl
Page 220:
K. Umapathy and S. Krishnan, Low bi
show all

Signal Analysis Research (SAR) Group - RNet - Ryerson University

Create successful ePaper yourself

Delete template?

Save as template?