23.10.2014 Views

Applications of Gabor multipliers to sound characterization and ...

Applications of Gabor multipliers to sound characterization and ...

Applications of Gabor multipliers to sound characterization and ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

<strong>Applications</strong> <strong>of</strong> <strong>Gabor</strong> <strong>multipliers</strong> <strong>to</strong> <strong>sound</strong><br />

<strong>characterization</strong> <strong>and</strong> transformation<br />

Adrien Merer<br />

PhD at Labora<strong>to</strong>ire de Mécanique et d’Acoustique - CNRS Marseille<br />

Supervisors: Sølvi Ystad <strong>and</strong> Richard Kronl<strong>and</strong>-Martinet<br />

MULAC kick-<strong>of</strong>f meeting - September 2008


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Plan<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

1 Introduction<br />

2 Genealogy<br />

3 Morphing<br />

4 Conclusion


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Motivation<br />

Characterization <strong>of</strong> perceptual <strong>sound</strong> categories<br />

S1 S2 → Common points?<br />

Is it possible <strong>to</strong> synthesize or transform <strong>sound</strong>s <strong>to</strong> give them<br />

such a property.<br />

Hypothesis<br />

Brain processing responsible <strong>of</strong> <strong>sound</strong>s categorization <strong>and</strong><br />

comparison are based on signal description aiming at s<strong>to</strong>ring the<br />

less information as possible <strong>and</strong> the less ambiguous as possible.<br />

Listeners might use a combination <strong>of</strong> <strong>sound</strong> features, so called<br />

invariants, <strong>to</strong> determine specific aspects <strong>of</strong> <strong>sound</strong>.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Motivation<br />

Characterization <strong>of</strong> perceptual <strong>sound</strong> categories<br />

S1 S2 → Common points?<br />

Is it possible <strong>to</strong> synthesize or transform <strong>sound</strong>s <strong>to</strong> give them<br />

such a property.<br />

Hypothesis<br />

Brain processing responsible <strong>of</strong> <strong>sound</strong>s categorization <strong>and</strong><br />

comparison are based on signal description aiming at s<strong>to</strong>ring the<br />

less information as possible <strong>and</strong> the less ambiguous as possible.<br />

Listeners might use a combination <strong>of</strong> <strong>sound</strong> features, so called<br />

invariants, <strong>to</strong> determine specific aspects <strong>of</strong> <strong>sound</strong>.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Motivation<br />

Characterization <strong>of</strong> perceptual <strong>sound</strong> categories<br />

S1 S2 → Common points?<br />

Is it possible <strong>to</strong> synthesize or transform <strong>sound</strong>s <strong>to</strong> give them<br />

such a property.<br />

Hypothesis<br />

Brain processing responsible <strong>of</strong> <strong>sound</strong>s categorization <strong>and</strong><br />

comparison are based on signal description aiming at s<strong>to</strong>ring the<br />

less information as possible <strong>and</strong> the less ambiguous as possible.<br />

Listeners might use a combination <strong>of</strong> <strong>sound</strong> features, so called<br />

invariants, <strong>to</strong> determine specific aspects <strong>of</strong> <strong>sound</strong>.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Motivation<br />

Characterization <strong>of</strong> perceptual <strong>sound</strong> categories<br />

S1 S2 → Common points?<br />

Is it possible <strong>to</strong> synthesize or transform <strong>sound</strong>s <strong>to</strong> give them<br />

such a property.<br />

Hypothesis<br />

Brain processing responsible <strong>of</strong> <strong>sound</strong>s categorization <strong>and</strong><br />

comparison are based on signal description aiming at s<strong>to</strong>ring the<br />

less information as possible <strong>and</strong> the less ambiguous as possible.<br />

Listeners might use a combination <strong>of</strong> <strong>sound</strong> features, so called<br />

invariants, <strong>to</strong> determine specific aspects <strong>of</strong> <strong>sound</strong>.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Motivation<br />

Characterization <strong>of</strong> perceptual <strong>sound</strong> categories<br />

S1 S2 → Common points?<br />

Is it possible <strong>to</strong> synthesize or transform <strong>sound</strong>s <strong>to</strong> give them<br />

such a property.<br />

Hypothesis<br />

Brain processing responsible <strong>of</strong> <strong>sound</strong>s categorization <strong>and</strong><br />

comparison are based on signal description aiming at s<strong>to</strong>ring the<br />

less information as possible <strong>and</strong> the less ambiguous as possible.<br />

Listeners might use a combination <strong>of</strong> <strong>sound</strong> features, so called<br />

invariants, <strong>to</strong> determine specific aspects <strong>of</strong> <strong>sound</strong>.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Motivation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Example<br />

From recording <strong>of</strong> same notes played on different saxophones<br />

<strong>and</strong> clarinets:<br />

→ find signal similarities <strong>of</strong> saxophone <strong>sound</strong>s <strong>and</strong> differences<br />

with clarinets <strong>sound</strong>s<br />

→ use those signals characteristics <strong>to</strong> synthesize<br />

saxophone/clarinets <strong>sound</strong>s.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Motivation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Example<br />

From recording <strong>of</strong> same notes played on different saxophones<br />

<strong>and</strong> clarinets:<br />

→ find signal similarities <strong>of</strong> saxophone <strong>sound</strong>s <strong>and</strong> differences<br />

with clarinets <strong>sound</strong>s<br />

→ use those signals characteristics <strong>to</strong> synthesize<br />

saxophone/clarinets <strong>sound</strong>s.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Classical method<br />

- Computing <strong>of</strong> signal descrip<strong>to</strong>rs from timbre literature (spectral<br />

centroid, attack time, spectral variation...)<br />

- Correlation between descrip<strong>to</strong>rs <strong>and</strong> perceptual data<br />

(using regression, or machine learning methods)<br />

Problems<br />

- Time depend properties cannot be well described<br />

- Same descrip<strong>to</strong>rs are used for all problems whereas each<br />

problem is specific <strong>and</strong> might necessitates new descrip<strong>to</strong>rs.<br />

We expect GM <strong>to</strong> highlight time-fequency patterns responsible<br />

<strong>of</strong> perceptual categorization


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Classical method<br />

- Computing <strong>of</strong> signal descrip<strong>to</strong>rs from timbre literature (spectral<br />

centroid, attack time, spectral variation...)<br />

- Correlation between descrip<strong>to</strong>rs <strong>and</strong> perceptual data<br />

(using regression, or machine learning methods)<br />

Problems<br />

- Time depend properties cannot be well described<br />

- Same descrip<strong>to</strong>rs are used for all problems whereas each<br />

problem is specific <strong>and</strong> might necessitates new descrip<strong>to</strong>rs.<br />

We expect GM <strong>to</strong> highlight time-fequency patterns responsible<br />

<strong>of</strong> perceptual categorization


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Classical method<br />

- Computing <strong>of</strong> signal descrip<strong>to</strong>rs from timbre literature (spectral<br />

centroid, attack time, spectral variation...)<br />

- Correlation between descrip<strong>to</strong>rs <strong>and</strong> perceptual data<br />

(using regression, or machine learning methods)<br />

Problems<br />

- Time depend properties cannot be well described<br />

- Same descrip<strong>to</strong>rs are used for all problems whereas each<br />

problem is specific <strong>and</strong> might necessitates new descrip<strong>to</strong>rs.<br />

We expect GM <strong>to</strong> highlight time-fequency patterns responsible<br />

<strong>of</strong> perceptual categorization


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Classical method<br />

- Computing <strong>of</strong> signal descrip<strong>to</strong>rs from timbre literature (spectral<br />

centroid, attack time, spectral variation...)<br />

- Correlation between descrip<strong>to</strong>rs <strong>and</strong> perceptual data<br />

(using regression, or machine learning methods)<br />

Problems<br />

- Time depend properties cannot be well described<br />

- Same descrip<strong>to</strong>rs are used for all problems whereas each<br />

problem is specific <strong>and</strong> might necessitates new descrip<strong>to</strong>rs.<br />

We expect GM <strong>to</strong> highlight time-fequency patterns responsible<br />

<strong>of</strong> perceptual categorization


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Using <strong>Gabor</strong> Masks<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

m<br />

Morphing<br />

Conclusion<br />

Mask does not provide more information than <strong>Gabor</strong><br />

transforms.<br />

Estimation <strong>of</strong> time-frequency shift is necessary but not<br />

sufficient.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Using <strong>Gabor</strong> Masks<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

m<br />

Morphing<br />

Conclusion<br />

Mask does not provide more information than <strong>Gabor</strong><br />

transforms.<br />

Estimation <strong>of</strong> time-frequency shift is necessary but not<br />

sufficient.


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Genealogy<br />

From {s 1 , ...s n }, build a <strong>sound</strong> that contains common<br />

properties.<br />

S 1 S 2 S 3 ......<br />

❅ m 1 m 2 <br />

<br />

❅ <br />

❅❘<br />

m<br />

S ✠<br />

3 <br />

<br />

1,2 <br />

❅<br />

<br />

m<br />

❅ 1,2 <br />

. ................<br />

❅❘ ✠<br />

S 1,2,3<br />

Characterization is contained both in masks <strong>and</strong> <strong>sound</strong> S 1..n<br />

(the so called ”son”).


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Genealogy<br />

From {s 1 , ...s n }, build a <strong>sound</strong> that contains common<br />

properties.<br />

S 1 S 2 S 3 ......<br />

❅ m 1 m 2 <br />

<br />

❅ <br />

❅❘<br />

m<br />

S ✠<br />

3 <br />

<br />

1,2 <br />

❅<br />

<br />

m<br />

❅ 1,2 <br />

. ................<br />

❅❘ ✠<br />

S 1,2,3<br />

Characterization is contained both in masks <strong>and</strong> <strong>sound</strong> S 1..n<br />

(the so called ”son”).


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Algorithm<br />

GM is considered as the transformation that makes the <strong>sound</strong><br />

”thinner”<br />

We want <strong>to</strong> keep only common points<br />

1: x0: initialization;<br />

2: repeat<br />

3: Compute m1;<br />

Compute m2;<br />

x0 = m1∗x1+m2∗x2<br />

2<br />

update error<br />

4: until error < thresold<br />

error = ‚ x<br />

i<br />

0 − x i+1 ‚ 2<br />

0


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Algorithm<br />

GM is considered as the transformation that makes the <strong>sound</strong><br />

”thinner”<br />

We want <strong>to</strong> keep only common points<br />

1: x0: initialization;<br />

2: repeat<br />

3: Compute m1;<br />

Compute m2;<br />

x0 = m1∗x1+m2∗x2<br />

2<br />

update error<br />

4: until error < thresold<br />

error = ‚ x<br />

i<br />

0 − x i+1 ‚ 2<br />

0


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Questions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

• Highly dependent on initialization<br />

→ Many different informations can be introduced with<br />

initialization <strong>sound</strong> (we can use harmonic comb or impulse<br />

train)<br />

• We can also initialize using two masks!<br />

→ Problem must be better defined.<br />

- Necessitate <strong>to</strong> know exactly what is expected as the ”son”<br />

...


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Questions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

• Highly dependent on initialization<br />

→ Many different informations can be introduced with<br />

initialization <strong>sound</strong> (we can use harmonic comb or impulse<br />

train)<br />

• We can also initialize using two masks!<br />

→ Problem must be better defined.<br />

- Necessitate <strong>to</strong> know exactly what is expected as the ”son”<br />

...


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

- Classical methods are based on analysis/synthesis process <strong>and</strong><br />

simple interpolation between synthesis parameters.<br />

Examples<br />

- Additive synth: Impacts Glass → Metal<br />

- Using <strong>Gabor</strong> Multipliers Impacts Glass → Metal 2


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

- Classical methods are based on analysis/synthesis process <strong>and</strong><br />

simple interpolation between synthesis parameters.<br />

Examples<br />

- Additive synth: Impacts Glass → Metal<br />

- Using <strong>Gabor</strong> Multipliers Impacts Glass → Metal 2


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing using GM<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

N-root <strong>of</strong> mask<br />

Morphing<br />

Conclusion<br />

m<br />

x a √ √ ✲ x b<br />

❳❳ ❳❳ m<br />

✘<br />

❳❳3<br />

✘ ✘✿<br />

❅ m<br />

n√<br />

❅❅❘ m x 1 ab✘✘✘ 2<br />

x 1<br />

n ab .............. ✒<br />

<br />

• x i<br />

N ab ≠ x i N ba<br />

even for i = N/2 (<strong>and</strong> even values <strong>of</strong> N)<br />

• For noisy signals, we can hear numerical artifacts.<br />

Initial / morphing


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing using GM<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

N-root <strong>of</strong> mask<br />

Morphing<br />

Conclusion<br />

m<br />

x a √ √ ✲ x b<br />

❳❳ ❳❳ m<br />

✘<br />

❳❳3<br />

✘ ✘✿<br />

❅ m<br />

n√<br />

❅❅❘ m x 1 ab✘✘✘ 2<br />

x 1<br />

n ab .............. ✒<br />

<br />

• x i<br />

N ab ≠ x i N ba<br />

even for i = N/2 (<strong>and</strong> even values <strong>of</strong> N)<br />

• For noisy signals, we can hear numerical artifacts.<br />

Initial / morphing


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing using GM<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

N-root <strong>of</strong> mask<br />

Morphing<br />

Conclusion<br />

m<br />

x a √ √ ✲ x b<br />

❳❳ ❳❳ m<br />

✘<br />

❳❳3<br />

✘ ✘✿<br />

❅ m<br />

n√<br />

❅❅❘ m x 1 ab✘✘✘ 2<br />

x 1<br />

n ab .............. ✒<br />

<br />

• x i<br />

N ab ≠ x i N ba<br />

even for i = N/2 (<strong>and</strong> even values <strong>of</strong> N)<br />

• For noisy signals, we can hear numerical artifacts.<br />

Initial / morphing


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing using GM<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

N-root <strong>of</strong> mask<br />

Morphing<br />

Conclusion<br />

m<br />

x a √ √ ✲ x b<br />

❳❳ ❳❳ m<br />

✘<br />

❳❳3<br />

✘ ✘✿<br />

❅ m<br />

n√<br />

❅❅❘ m x 1 ab✘✘✘ 2<br />

x 1<br />

n ab .............. ✒<br />

<br />

• x i<br />

N ab ≠ x i N ba<br />

even for i = N/2 (<strong>and</strong> even values <strong>of</strong> N)<br />

• For noisy signals, we can hear numerical artifacts.<br />

Initial / morphing


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Morphing using GM<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

N-root <strong>of</strong> mask<br />

Morphing<br />

Conclusion<br />

m<br />

x a √ √ ✲ x b<br />

❳❳ ❳❳ m<br />

✘<br />

❳❳3<br />

✘ ✘✿<br />

❅ m<br />

n√<br />

❅❅❘ m x 1 ab✘✘✘ 2<br />

x 1<br />

n ab .............. ✒<br />

<br />

• x i<br />

N ab ≠ x i N ba<br />

even for i = N/2 (<strong>and</strong> even values <strong>of</strong> N)<br />

• For noisy signals, we can hear numerical artifacts.<br />

Initial / morphing


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Deterministic <strong>and</strong> S<strong>to</strong>chastic parts<br />

Idea is not <strong>to</strong> use phase for reconstruction <strong>of</strong> s<strong>to</strong>chastic part <strong>of</strong><br />

the signal, hence <strong>to</strong> define <strong>and</strong> separate it.<br />

How <strong>to</strong> quantify r<strong>and</strong>omness in a frequency b<strong>and</strong>?<br />

R<strong>and</strong>omness<br />

• Linear regression <strong>of</strong> phase<br />

• St<strong>and</strong>ard deviation <strong>of</strong> phase<br />

• Define a threshold <strong>of</strong> linearity → Depends on windows size<br />

Normal morphing Without phase S/D separation<br />

x 1 (Sax) x 2 (Clarinet)<br />

To do: separation for only parts <strong>of</strong> each modulation


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Current directions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Combining morphing <strong>and</strong> genealogy, an idea is <strong>to</strong> build a<br />

pro<strong>to</strong>type <strong>of</strong> a given <strong>sound</strong> category as a weighted morphing<br />

between all the <strong>sound</strong>.<br />

S 6<br />

S 3<br />

S 1<br />

✚✚<br />

P ❦ S 7<br />

S 5<br />

❩<br />

S 2 S 4


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Current directions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Combining morphing <strong>and</strong> genealogy, an idea is <strong>to</strong> build a<br />

pro<strong>to</strong>type <strong>of</strong> a given <strong>sound</strong> category as a weighted morphing<br />

between all the <strong>sound</strong>.<br />

S 6<br />

S 3<br />

S 1<br />

✚✚<br />

P ❦ S 7<br />

S 5<br />

❩<br />

S 2 S 4


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Current directions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Perceptual data<br />

The concept <strong>of</strong> pro<strong>to</strong>type is <strong>of</strong>ten used in study about<br />

categorization <strong>and</strong> it is possible <strong>to</strong> ask listeners <strong>to</strong> judge a level<br />

<strong>of</strong> ”pro<strong>to</strong>typicity” for each <strong>sound</strong> inside a category.<br />

Such rating can be used as a weight/distance for each <strong>sound</strong><br />

Problem<br />

How <strong>to</strong> morph <strong>sound</strong>s <strong>to</strong>gether?<br />

Initialize P <strong>and</strong> all morphings between each <strong>sound</strong> <strong>and</strong> P?


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Current directions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Perceptual data<br />

The concept <strong>of</strong> pro<strong>to</strong>type is <strong>of</strong>ten used in study about<br />

categorization <strong>and</strong> it is possible <strong>to</strong> ask listeners <strong>to</strong> judge a level<br />

<strong>of</strong> ”pro<strong>to</strong>typicity” for each <strong>sound</strong> inside a category.<br />

Such rating can be used as a weight/distance for each <strong>sound</strong><br />

Problem<br />

How <strong>to</strong> morph <strong>sound</strong>s <strong>to</strong>gether?<br />

Initialize P <strong>and</strong> all morphings between each <strong>sound</strong> <strong>and</strong> P?


<strong>Applications</strong><br />

<strong>of</strong> <strong>Gabor</strong><br />

<strong>multipliers</strong> <strong>to</strong><br />

<strong>sound</strong> <strong>characterization</strong><br />

<strong>and</strong><br />

transformation<br />

Current directions<br />

Adrien Merer<br />

Introduction<br />

Genealogy<br />

Morphing<br />

Conclusion<br />

Perceptual data<br />

The concept <strong>of</strong> pro<strong>to</strong>type is <strong>of</strong>ten used in study about<br />

categorization <strong>and</strong> it is possible <strong>to</strong> ask listeners <strong>to</strong> judge a level<br />

<strong>of</strong> ”pro<strong>to</strong>typicity” for each <strong>sound</strong> inside a category.<br />

Such rating can be used as a weight/distance for each <strong>sound</strong><br />

Problem<br />

How <strong>to</strong> morph <strong>sound</strong>s <strong>to</strong>gether?<br />

Initialize P <strong>and</strong> all morphings between each <strong>sound</strong> <strong>and</strong> P?

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!