02.11.2014 Views

第二章语音信号基础知识

第二章语音信号基础知识

第二章语音信号基础知识

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

•<br />

•<br />

•<br />

•<br />


2.1 <br />

<br />

•<br />

•<br />


2.1.1


2.1.1 <br />

•<br />

<br />

<br />

<br />


2.1.1 <br />

• <br />

• <br />

<br />


2.1.1 <br />

• <br />

<br />

<br />

<br />

<br />


2.1.2 <br />

• <br />

<br />

<br />

- <br />

- <br />

- <br />

<br />

<br />

<br />

-


2.1.2 <br />

• <br />

<br />

<br />

• <br />

<br />


2.1.2 <br />

• <br />

<br />

<br />

• <br />

<br />


2.1.2 <br />

• <br />

<br />

• <br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

( )


2.1.2 <br />

T p<br />

<br />

T p<br />

f<br />

p<br />

<br />

— <br />

<br />

<br />

60~200Hz<br />

200~450Hz


2.1.3 <br />

•<br />

<br />

<br />

•<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

•<br />

<br />

<br />

<br />


2.1.3 <br />

<br />

<br />

<br />

<br />

20<br />

<br />

13cm<br />

8.5cm<br />

<br />

<br />

<br />

<br />

17cm<br />

<br />

( )


2.2 <br />

<br />

,<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

•Voiced Speech<br />

•Unvoiced Speech<br />

• Plosive Speech)


“ ”


• <br />

•<br />

<br />

•<br />

•<br />

<br />

<br />

……<br />

<br />

5


1500<br />

1000<br />

<br />

500<br />

0<br />

-500<br />

-1000<br />

-1500<br />

0 100 200 300 400 500<br />

<br />

40<br />

20<br />

<br />

0<br />

(dB<br />

)<br />

-20<br />

F1<br />

F2<br />

FFT<br />

F3<br />

F4<br />

-40<br />

0<br />

0.1 0.2 0.3 0.4 0.5


2.3 <br />

•<br />

<br />

•<br />

<br />

<br />

<br />

•<br />

<br />

<br />

<br />

•<br />


•<br />

•<br />

<br />

<br />

<br />

•<br />

<br />

<br />

•<br />

•<br />

<br />

<br />

[sh] “<br />

” “”


• <br />

•<br />

Vowel:<br />

<br />

<br />

•<br />

<br />

<br />

•<br />

<br />

•<br />

•<br />

•<br />

“ — ”<br />

C— V<br />


9<br />

1~4<br />

<br />

<br />

( )<br />

<br />

<br />

( )<br />

6~9<br />

<br />

5<br />

<br />

<br />

<br />

<br />

<br />

<br />

1<br />

<br />

2<br />

4<br />

<br />

5<br />

9<br />

3<br />

6<br />

7<br />

<br />

8<br />

<br />

<br />

<br />

<br />

7


“ ”<br />

<br />

“ ”


2.4 <br />

<br />

<br />

<br />

<br />

<br />

<br />

+


A V<br />

<br />

GZ<br />

<br />

<br />

<br />

A U<br />

<br />

VZ<br />

<br />

RZ Sn<br />

• /<br />

•<br />

N =<br />

f s<br />

f<br />

•<br />

0 1<br />

f s<br />

= 8Hzf p<br />

= 50 ~ 450Hz<br />

N<br />

0<br />

= 18 160<br />

<br />

0<br />

p


GZ<br />

<br />

<br />

<br />

<br />

<br />

<br />

12dB<br />

<br />

)<br />

)(1<br />

(1<br />

1<br />

)<br />

( 1<br />

2<br />

1<br />

1<br />

−<br />

−<br />

−<br />

−<br />

=<br />

z<br />

g<br />

z<br />

g<br />

Z<br />

G<br />

2<br />

1<br />

, g<br />

g<br />

<br />

<br />

<br />

1<br />

<br />

<br />

<br />

<br />

<br />

v<br />

A<br />

U<br />

A<br />

<br />

V(z) <br />

<br />

<br />

<br />

V(z)


α = 1 0<br />

V ( Z)<br />

= p<br />

∑<br />

i=<br />

0<br />

1<br />

α<br />

z<br />

i<br />

i<br />

α <br />

p<br />

i<br />

p<br />

<br />

<br />

p<br />

p 2<br />

=812<br />

p<br />

ω k<br />

r<br />

k<br />

exp( ± jω<br />

k<br />

= 1, p<br />

2<br />

<br />

R(Z)<br />

),<br />

k<br />

<br />

p<br />

R(<br />

z)<br />

=<br />

(1 −<br />

rz<br />

−1<br />

), r<br />

≈ 1


)<br />

(<br />

),<br />

( Z<br />

R<br />

Z<br />

G<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

U<br />

V<br />

p<br />

A<br />

A<br />

f<br />

,<br />

, p<br />

i<br />

i ,<br />

2,<br />

1,<br />

, L<br />

=<br />

α<br />

<br />

U/V<br />

U<br />

V<br />

p<br />

A<br />

A<br />

f<br />

,<br />

,<br />

<br />

U/V5ms<br />

p<br />

i<br />

i ,<br />

2,<br />

1,<br />

, L<br />

=<br />

α<br />

<br />

0~30ms<br />

<br />

<br />

<br />

<br />

<br />

<br />

•<br />

<br />

•<br />

<br />


2.5 <br />

•<br />


2.5.1


2.5.2 <br />

<br />

<br />

<br />

1<br />

<br />

16 Hz~16 kHz<br />

20 kHz<br />

10 kHz<br />

<br />

0~120dB SPL<br />

Sound Power Level<br />

(0dB SPL)<br />

<br />

−16<br />

2<br />

10 W / cm


1kHz<br />

4dB, 10kHz<br />

15dB, 40kHz<br />

50dB<br />

<br />

<br />

120dB140dB<br />

<br />

<br />

<br />

<br />

<br />

325<br />

<br />

<br />

40dB100Hz~500Hz<br />

1.8Hz500Hz~16kHz ∆f<br />

= f × 0.035 <br />

<br />

<br />

2ms


2<br />

<br />

Loudness Level)<br />

<br />

“”Phon<br />

1<br />

1kHz<br />

120<br />

110<br />

0 100<br />

90<br />

80<br />

70<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

0<br />

<br />

Hz<br />

<br />

Fletcher-Munson <br />

<br />

<br />

dB<br />

<br />

<br />

1kHz<br />

<br />

<br />

<br />

1kHz<br />

<br />

<br />

1<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

2<br />

3~4kHz


“”Sone<br />

1 Sone<br />

40dB 1kHz<br />

k<br />

k Sone<br />

dB<br />

N L<br />

N<br />

= 0 .063×<br />

10<br />

0.03L<br />

L = 33 .33×<br />

lg N + 40<br />

<br />

1 Sone = 40 Phon<br />

N<br />

1<br />

L<br />

<br />

33.33×<br />

lg 2 ≈ 10 Phon<br />

120<br />

110<br />

0 100<br />

90<br />

80<br />

70<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

Hz<br />

<br />

Fletcher-Munson <br />

0


3Pitch<br />

•<br />

•<br />

<br />

<br />

<br />

•<br />

Mel<br />

•<br />

40dB 1kHz<br />

1000Mel<br />

T mel<br />

1000 ⎤<br />

⎢<br />

⎡ f<br />

= log 1 + log 2<br />

⎥<br />

⎣ 1000⎦


4<br />

•<br />

1 50 20~100 80<br />

2 150 100~200 100<br />

<br />

3 250 200~300 100<br />

<br />

5 450 400~510 110<br />

•<br />

6 570 510~630 120<br />

<br />

7 700 630~770 140<br />

<br />

8 840 770~920 150<br />

9 1000 920~1080 160<br />

<br />

•<br />

<br />

<br />

<br />

<br />

•<br />

Bark<br />

20~16kHz<br />

24Bark<br />

<br />

Z(Bark) f(Hz)<br />

<br />

Z<br />

≅ 26.81<br />

f (1960 + f ) − 0.53<br />

<br />

Bark<br />

. <br />

<br />

(Hz)<br />

<br />

(Hz)<br />

(Hz)<br />

4 350 300~400 100<br />

10 1170 1080~1270 190<br />

11 1370 1270~1480 210<br />

12 1600 1480~1720 240<br />

13 1850 1720~2000 280<br />

14 2150 2000~2320 320<br />

15 2500 2320~2700 380<br />

16 2900 2700~3150 450<br />

17 3400 3150~3700 550<br />

18 4000 3700~4400 700<br />

19 4800 4400~5300 900<br />

20 5800 5300~6400 1100<br />

21 7000 6400~7700 1300<br />

22 8500 7700~9500 1800<br />

23 10500 9500~12000 2500<br />

24 13500 12000~15500 3500

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!