16.01.2015 Views

University of Maryland

University of Maryland

University of Maryland

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Making Phonological<br />

Decisions<br />

William J. Idsardi<br />

<strong>University</strong> <strong>of</strong> <strong>Maryland</strong><br />

idsardi@umd.edu<br />

Mayfest 2006<br />

1


Resnick<br />

2


3 topics<br />

Finding vowel categories/features<br />

Finding vowel harmony<br />

Finding word boundaries<br />

3


What is<br />

phonology<br />

for<br />

Swingley<br />

4


Long-term Memory<br />

Audition<br />

Articulation<br />

5


Long-term Memory<br />

Lex Phon<br />

Surface<br />

Audition<br />

Phonetics<br />

Articulation<br />

6


Using Statistics<br />

Magic!<br />

Data summarization and compression<br />

(not exemplars)<br />

Decision making under uncertainty<br />

(not fuzziness)<br />

Measures <strong>of</strong> confidence<br />

Statistics over induced representations<br />

(2nd order, ...)<br />

7


Terminology<br />

Knowledge OF Language<br />

= Decisions about Grammar<br />

Knowledge ABOUT Language<br />

= Statistics supporting KOL<br />

8


Finding phonemes<br />

9


Minimal pair conundrum<br />

No reliable minimal pair judgements<br />

until ~ 18 mos.<br />

“Minimal pairs are a parlour game”<br />

(François Dell)<br />

What are children doing from 12-18 mos.<br />

10


Recursive<br />

binary<br />

divisions.<br />

Dresher<br />

11


Classical Manchu Representations<br />

These percepts are not representations. The learner has not yet<br />

identified any contrasts in the vowel system.<br />

Assume contrastive features are determined in order:<br />

[i<br />

]<br />

/V/<br />

[´]<br />

[u]<br />

[U]<br />

[ç]<br />

[a]<br />

12


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low]<br />

[i<br />

]<br />

/ˆ/<br />

[u]<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

13


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low]<br />

[i<br />

]<br />

/ˆ/<br />

[u]<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

13


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] >[coronal]<br />

[i<br />

]<br />

/I/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

14


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] >[coronal]<br />

[i<br />

]<br />

/I/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

14


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] >[coronal]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

15


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] >[coronal]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

[´]<br />

/A/<br />

[a]<br />

[ç]<br />

15


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

/A/<br />

[´]<br />

[labial]<br />

/O/<br />

[ç]<br />

[a]<br />

16


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

/A/<br />

[´]<br />

[labial]<br />

/O/<br />

[ç]<br />

[a]<br />

16


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

/A/<br />

[´]<br />

[labial]<br />

/O/<br />

[ç]<br />

[a]<br />

17


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

/A/<br />

[´]<br />

[a]<br />

[labial]<br />

/ç/<br />

18


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial]<br />

/i/<br />

[coronal]<br />

[u]<br />

/U/<br />

[U]<br />

[low]<br />

/A/<br />

[´]<br />

[a]<br />

[labial]<br />

/ç/<br />

[low]<br />

18


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

/u/<br />

/U/<br />

[low]<br />

/´/<br />

/a/<br />

[labial]<br />

/ç/<br />

[low]<br />

19


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

/u/<br />

/U/<br />

[low]<br />

/´/<br />

/a/<br />

[labial]<br />

/ç/<br />

[low]<br />

19


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

/u/<br />

/U/<br />

[low]<br />

/´/<br />

/a/<br />

[labial]<br />

/ç/<br />

[low]<br />

20


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

/u/<br />

/U/<br />

[low]<br />

/´/<br />

/a/<br />

[labial]<br />

/ç/<br />

[low]<br />

20


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

/u/<br />

/U/<br />

[low]<br />

/´/<br />

/a/<br />

[labial]<br />

/ç/<br />

ç/<br />

[low]<br />

21


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

/u/ [u]<br />

/U/<br />

/´/ [labial]<br />

/a/<br />

/ç/<br />

[low]<br />

22


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

/u/ [u]<br />

[ATR]<br />

[ATR]<br />

/´/ [labial]<br />

[low]<br />

/ç/<br />

[low]<br />

[low]<br />

/a/<br />

/U/<br />

22


Classical Manchu Vowels<br />

Contrastive Values:<br />

[low] > [coronal] > [labial > [ATR]<br />

/i/<br />

[coronal]<br />

[ATR]<br />

[low]<br />

[low]<br />

/´/<br />

/a/<br />

/u/<br />

[ATR]<br />

[labial]<br />

/ç/<br />

[low]<br />

/U/<br />

23


If bimodal<br />

then active.<br />

Maye<br />

24


100<br />

75<br />

50<br />

25<br />

0<br />

100<br />

-150<br />

-137.5<br />

-125<br />

-112.5<br />

-100<br />

-87.5<br />

-75<br />

-62.5<br />

-50<br />

-37.5<br />

-25<br />

-12.5<br />

0<br />

12.5<br />

25<br />

37.5<br />

50<br />

62.5<br />

75<br />

87.5<br />

100<br />

112.5<br />

125<br />

-150 -100 -50 0 50 100 150<br />

137.5<br />

150<br />

b<br />

p<br />

75<br />

50<br />

25<br />

b<br />

p<br />

ph<br />

0<br />

-150 -100 -50 0 50 100 150<br />

Lisker & Abramson (1964, 1970)<br />

25


100 100<br />

75 75<br />

50 50<br />

25 25<br />

b<br />

p<br />

0<br />

0<br />

-150<br />

-150<br />

-137.5<br />

-137.5 -125<br />

-125<br />

-112.5<br />

-112.5<br />

-100<br />

-100<br />

-87.5<br />

-87.5<br />

-75<br />

-75<br />

-62.5<br />

-62.5<br />

-50<br />

-50<br />

-37.5<br />

-37.5<br />

-25<br />

-25<br />

-12.5<br />

-12.5<br />

00<br />

12.5 12.5<br />

25 25<br />

37.5 37.5<br />

50 50<br />

62.5 62.5<br />

75 75<br />

87.5 87.5<br />

100 100<br />

112.5 112.5<br />

125 125<br />

137.5 137.5<br />

150<br />

150<br />

-150 -100 -50 0 50 100 150<br />

100 100<br />

75 75<br />

50 50<br />

25 25<br />

b<br />

p<br />

ph<br />

0<br />

0<br />

-150 -100 -50 0 50 100 150<br />

Lisker & Abramson (1964, 1970)<br />

25


100 100<br />

75 75<br />

50 50<br />

25 25<br />

b<br />

p<br />

0<br />

0<br />

-150<br />

-150<br />

-137.5<br />

-137.5 -125<br />

-125<br />

-112.5<br />

-112.5<br />

-100<br />

-100<br />

-87.5<br />

-87.5<br />

-75<br />

-75<br />

-62.5<br />

-62.5<br />

-50<br />

-50<br />

-37.5<br />

-37.5<br />

-25<br />

-25<br />

-12.5<br />

-12.5<br />

00<br />

12.5 12.5<br />

25 25<br />

37.5 37.5<br />

50 50<br />

62.5 62.5<br />

75 75<br />

87.5 87.5<br />

100 100<br />

112.5 112.5<br />

125 125<br />

137.5 137.5<br />

150<br />

150<br />

-150 -100 -50 0 50 100 150<br />

100 100<br />

75 75<br />

50 50<br />

25 25<br />

b<br />

p<br />

ph<br />

0<br />

0<br />

-150 -100 -50 0 50 100 150<br />

Lisker & Abramson (1964, 1970)<br />

25


100 100<br />

75 75<br />

50 50<br />

25 25<br />

good discrimination<br />

b<br />

p<br />

0<br />

0<br />

-150<br />

-150<br />

-137.5<br />

-137.5 -125<br />

-125<br />

-112.5<br />

-112.5<br />

-100<br />

-100<br />

-87.5<br />

-87.5<br />

-75<br />

-75<br />

-62.5<br />

-62.5<br />

-50<br />

-50<br />

-37.5<br />

-37.5<br />

-25<br />

-25<br />

-12.5<br />

-12.5<br />

00<br />

12.5 12.5<br />

25 25<br />

37.5 37.5<br />

50 50<br />

62.5 62.5<br />

75 75<br />

87.5 87.5<br />

100 100<br />

112.5 112.5<br />

125 125<br />

137.5 137.5<br />

150<br />

150<br />

-150 -100 -50 0 50 100 150<br />

100 100<br />

75 75<br />

50 50<br />

25 25<br />

good discrimination<br />

b<br />

p<br />

ph<br />

0<br />

0<br />

-150 -100 -50 0 50 100 150<br />

Lisker & Abramson (1964, 1970)<br />

25


100 100<br />

75 75<br />

50 50<br />

poor discrimination<br />

good discrimination<br />

b<br />

p<br />

25 25<br />

0<br />

0<br />

-150<br />

-150<br />

-137.5<br />

-137.5 -125<br />

-125<br />

-112.5<br />

-112.5<br />

-100<br />

-100<br />

-87.5<br />

-87.5<br />

-75<br />

-75<br />

-62.5<br />

-62.5<br />

-50<br />

-50<br />

-37.5<br />

-37.5<br />

-25<br />

-25<br />

-12.5<br />

-12.5<br />

00<br />

12.5 12.5<br />

25 25<br />

37.5 37.5<br />

50 50<br />

62.5 62.5<br />

75 75<br />

87.5 87.5<br />

100 100<br />

112.5 112.5<br />

125 125<br />

137.5 137.5<br />

150<br />

150<br />

-150 -100 -50 0 50 100 150<br />

100 100<br />

75 75<br />

50 50<br />

25 25<br />

good discrimination<br />

good discrimination<br />

b<br />

p<br />

ph<br />

0<br />

0<br />

-150 -100 -50 0 50 100 150<br />

Lisker & Abramson (1964, 1970)<br />

25


Familiarized to<br />

Unimodal vs. Bimodal Distributions<br />

occurrence<br />

per block <strong>of</strong><br />

familiarization<br />

16<br />

12<br />

8<br />

Not always for<br />

phonological<br />

non-contrast<br />

4<br />

0<br />

1 2 3 4 5 6 7 8<br />

Token Number<br />

/da/<br />

/ta/<br />

26


Greek Vowels<br />

F1 x F2 space for Greek vowel tokens<br />

k-means clustering<br />

criteria for stopping (over-fitting penalty)<br />

procedure finds four vowels<br />

(approximately i, e, a, o+u)<br />

27


Bimodality is not<br />

always so clear.<br />

Swingley<br />

28


Learning from the<br />

Primordial soup<br />

Elizabeth Allyn Smith<br />

Kathleen Currie Hall<br />

29


Greek Vowels<br />

F1 x F2 space for Greek vowel tokens<br />

k-means clustering<br />

criteria for stopping (over-fitting penalty)<br />

procedure finds four vowels<br />

(approximately i, e, a, o+u)<br />

30


English: Peterson and Barney<br />

3000<br />

F2<br />

2000<br />

1000<br />

front<br />

back<br />

200 400 600 800 1000 1200<br />

F1<br />

high ------------low<br />

31


Combine approaches<br />

Normalize<br />

Recursive binary division algorithm<br />

Make Ø/[F] divisions<br />

Using compactness <strong>of</strong> clustering on [F] as<br />

criteria<br />

32


Normalization<br />

Don’t use F1 and F2<br />

Use F1/F3 and F2/F3<br />

Corrects for male/female/child differences<br />

On-line rather than statistical normalization<br />

33


Raw data ANOVA<br />

F0<br />

300<br />

200<br />

100<br />

m f<br />

F1<br />

1300<br />

1000<br />

700<br />

400<br />

100<br />

m f<br />

F2<br />

3000<br />

2000<br />

1000<br />

mf<br />

F3<br />

4000<br />

3000<br />

2000<br />

f<br />

m<br />

a<br />

c<br />

a<br />

c<br />

a<br />

c<br />

a<br />

c<br />

main effects (gender, age) p < 0.001 for all<br />

interaction (gender*age) p < 0.01 for F0, F2<br />

34


Ratio with F3 ANOVA<br />

F1/F3<br />

0.40<br />

0.30<br />

0.20<br />

0.10<br />

a<br />

c<br />

mf<br />

F2/F3<br />

0.9<br />

0.7<br />

0.5<br />

0.3<br />

0.1<br />

a<br />

c<br />

m f<br />

no main effects (p > 0.24)<br />

marginal interaction for F2/F3 (p = 0.0919)<br />

35


A little improvement<br />

0.9<br />

F2<br />

3000<br />

2000<br />

F2/F3<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

1000<br />

0.3<br />

0.2<br />

200 400 600 800 1000 1200<br />

F1<br />

.10 .20 .30 .40<br />

F1/F3<br />

36


Best split for front<br />

0.9<br />

How compact<br />

F2/F3<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

Text<br />

.10 .20 .30 .40<br />

F1/F3<br />

front ✔<br />

back<br />

37


Trapping the problem<br />

Number <strong>of</strong> contrasts is bounded:<br />

From below by phonology (too few)<br />

From above by phonetics (too many)<br />

Actual answer is trapped in-between<br />

38


Look for<br />

phonological<br />

activity.<br />

Dresher<br />

39


Context<br />

sensitive<br />

contrasts.<br />

Maye<br />

40


Frequent<br />

frames<br />

Mintz<br />

41


Two different contrastive hierarchies:<br />

a. [high] > [labial]<br />

b. [labial] > [high]<br />

[–high]<br />

[+high]<br />

[–labial]<br />

[+labial]<br />

/a/ [–labial] [+labial]<br />

[–high]<br />

[+high]<br />

/u/<br />

/i/<br />

/u/<br />

/a/<br />

/i/<br />

We might expect that the two vowel systems will pattern differently. For<br />

example, system (a) might show alternations or neutralization between /i/<br />

and /u/; in system (b) /i/ might be more closely related to /a/.<br />

42


Diagnosing vowel harmony<br />

CHILDES transcripts (need sane orthography)<br />

Turkish<br />

need a comparison non-VH (e.g. Polish)<br />

(Auto)correlation between adjacent vowels<br />

Compare same and different vowels<br />

within- and between- utterances<br />

43


CHILDES Turkish<br />

50,000<br />

37,500<br />

25,000<br />

47,340<br />

1:2.27<br />

Chi-square test: p < 0.0001<br />

Fisher’s exact test: p < 0.0001<br />

20,869<br />

12,500<br />

1:5.29<br />

9,119<br />

0<br />

within utterance<br />

same vowel<br />

1,723<br />

across utterances<br />

different vowels<br />

44


Not done<br />

Cue for vowel harmony not the same as rule<br />

for vowel harmony<br />

Cue: more “same” vowels within utterances<br />

Rules: [front] harmony<br />

! ! ! [round] harmony when [high]<br />

45


Finding other rules<br />

Find other active features<br />

Tohono O’odham palatalization:<br />

t, d, s → tʃ, dʒ, ʃ / _ i, u, ɨ<br />

[coronal] → [laminal] / _ [high]<br />

correlation between [laminal] and [high]<br />

So [laminal], [high] are active<br />

Arbitrary correlations<br />

46


Finding words<br />

47


Transitional<br />

probabilities!<br />

Saffran<br />

48


“UG, statistics or both”<br />

Look between<br />

stresses!<br />

Yang<br />

49


Virtuous circularity<br />

“We find that the best<br />

heuristics make use <strong>of</strong><br />

structural information<br />

obtained by parsing<br />

input sentences during<br />

the course <strong>of</strong> learning”<br />

Sakas<br />

50


Bayes!<br />

Johnson<br />

51


Bayes’s Rule<br />

p(A&B) = p(A|B) * p(B)<br />

p(A&B) = p(B|A) * p(A)<br />

p(A|B) * p(B) = p(B|A) * p(A)<br />

p(A|B) = p(B|A) * p(A) / p(B)<br />

52


Bayesian phonology<br />

p(UR|SR) = p(SR|UR) * p(UR) / p(SR)<br />

p(UR|SR) p(SR|UR): analysis by synthesis<br />

p(UR): UG as Bayesian priors<br />

Poeppel<br />

53


Finding words from<br />

vowel harmony<br />

p(#|XY) = p(XY|#) * p(#) / p(XY)<br />

Estimate p(XY|#) from p(XY|Utterance)<br />

Put some prior expectations on p(#)<br />

Get p(XY) from data<br />

54


a ï o u e i ö ü<br />

a ï o u e i ö ü<br />

Some info in there<br />

% # % no #<br />

% # % no #<br />

100<br />

75<br />

50<br />

25<br />

100<br />

75<br />

50<br />

25<br />

0<br />

0<br />

ï<br />

o<br />

55


I’m next!<br />

(after lunch)<br />

Hudson Kam<br />

56


Discussion<br />

57


Cohorts and neighbors<br />

Features<br />

Analysis by synthesis<br />

Multi-resolution<br />

3 passes<br />

58


Detail<br />

Multi-resolution<br />

Both coarse and fine info<br />

Exemplar: k-nearest neighbors<br />

coarseness in retrieval<br />

Substatistics<br />

fineness calculated in coarse addresses<br />

59

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!