University of Maryland
University of Maryland
University of Maryland
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Making Phonological<br />
Decisions<br />
William J. Idsardi<br />
<strong>University</strong> <strong>of</strong> <strong>Maryland</strong><br />
idsardi@umd.edu<br />
Mayfest 2006<br />
1
Resnick<br />
2
3 topics<br />
Finding vowel categories/features<br />
Finding vowel harmony<br />
Finding word boundaries<br />
3
What is<br />
phonology<br />
for<br />
Swingley<br />
4
Long-term Memory<br />
Audition<br />
Articulation<br />
5
Long-term Memory<br />
Lex Phon<br />
Surface<br />
Audition<br />
Phonetics<br />
Articulation<br />
6
Using Statistics<br />
Magic!<br />
Data summarization and compression<br />
(not exemplars)<br />
Decision making under uncertainty<br />
(not fuzziness)<br />
Measures <strong>of</strong> confidence<br />
Statistics over induced representations<br />
(2nd order, ...)<br />
7
Terminology<br />
Knowledge OF Language<br />
= Decisions about Grammar<br />
Knowledge ABOUT Language<br />
= Statistics supporting KOL<br />
8
Finding phonemes<br />
9
Minimal pair conundrum<br />
No reliable minimal pair judgements<br />
until ~ 18 mos.<br />
“Minimal pairs are a parlour game”<br />
(François Dell)<br />
What are children doing from 12-18 mos.<br />
10
Recursive<br />
binary<br />
divisions.<br />
Dresher<br />
11
Classical Manchu Representations<br />
These percepts are not representations. The learner has not yet<br />
identified any contrasts in the vowel system.<br />
Assume contrastive features are determined in order:<br />
[i<br />
]<br />
/V/<br />
[´]<br />
[u]<br />
[U]<br />
[ç]<br />
[a]<br />
12
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low]<br />
[i<br />
]<br />
/ˆ/<br />
[u]<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
13
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low]<br />
[i<br />
]<br />
/ˆ/<br />
[u]<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
13
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] >[coronal]<br />
[i<br />
]<br />
/I/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
14
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] >[coronal]<br />
[i<br />
]<br />
/I/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
14
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] >[coronal]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
15
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] >[coronal]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
[´]<br />
/A/<br />
[a]<br />
[ç]<br />
15
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
/A/<br />
[´]<br />
[labial]<br />
/O/<br />
[ç]<br />
[a]<br />
16
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
/A/<br />
[´]<br />
[labial]<br />
/O/<br />
[ç]<br />
[a]<br />
16
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
/A/<br />
[´]<br />
[labial]<br />
/O/<br />
[ç]<br />
[a]<br />
17
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
/A/<br />
[´]<br />
[a]<br />
[labial]<br />
/ç/<br />
18
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial]<br />
/i/<br />
[coronal]<br />
[u]<br />
/U/<br />
[U]<br />
[low]<br />
/A/<br />
[´]<br />
[a]<br />
[labial]<br />
/ç/<br />
[low]<br />
18
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
/u/<br />
/U/<br />
[low]<br />
/´/<br />
/a/<br />
[labial]<br />
/ç/<br />
[low]<br />
19
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
/u/<br />
/U/<br />
[low]<br />
/´/<br />
/a/<br />
[labial]<br />
/ç/<br />
[low]<br />
19
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
/u/<br />
/U/<br />
[low]<br />
/´/<br />
/a/<br />
[labial]<br />
/ç/<br />
[low]<br />
20
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
/u/<br />
/U/<br />
[low]<br />
/´/<br />
/a/<br />
[labial]<br />
/ç/<br />
[low]<br />
20
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
/u/<br />
/U/<br />
[low]<br />
/´/<br />
/a/<br />
[labial]<br />
/ç/<br />
ç/<br />
[low]<br />
21
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
/u/ [u]<br />
/U/<br />
/´/ [labial]<br />
/a/<br />
/ç/<br />
[low]<br />
22
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
/u/ [u]<br />
[ATR]<br />
[ATR]<br />
/´/ [labial]<br />
[low]<br />
/ç/<br />
[low]<br />
[low]<br />
/a/<br />
/U/<br />
22
Classical Manchu Vowels<br />
Contrastive Values:<br />
[low] > [coronal] > [labial > [ATR]<br />
/i/<br />
[coronal]<br />
[ATR]<br />
[low]<br />
[low]<br />
/´/<br />
/a/<br />
/u/<br />
[ATR]<br />
[labial]<br />
/ç/<br />
[low]<br />
/U/<br />
23
If bimodal<br />
then active.<br />
Maye<br />
24
100<br />
75<br />
50<br />
25<br />
0<br />
100<br />
-150<br />
-137.5<br />
-125<br />
-112.5<br />
-100<br />
-87.5<br />
-75<br />
-62.5<br />
-50<br />
-37.5<br />
-25<br />
-12.5<br />
0<br />
12.5<br />
25<br />
37.5<br />
50<br />
62.5<br />
75<br />
87.5<br />
100<br />
112.5<br />
125<br />
-150 -100 -50 0 50 100 150<br />
137.5<br />
150<br />
b<br />
p<br />
75<br />
50<br />
25<br />
b<br />
p<br />
ph<br />
0<br />
-150 -100 -50 0 50 100 150<br />
Lisker & Abramson (1964, 1970)<br />
25
100 100<br />
75 75<br />
50 50<br />
25 25<br />
b<br />
p<br />
0<br />
0<br />
-150<br />
-150<br />
-137.5<br />
-137.5 -125<br />
-125<br />
-112.5<br />
-112.5<br />
-100<br />
-100<br />
-87.5<br />
-87.5<br />
-75<br />
-75<br />
-62.5<br />
-62.5<br />
-50<br />
-50<br />
-37.5<br />
-37.5<br />
-25<br />
-25<br />
-12.5<br />
-12.5<br />
00<br />
12.5 12.5<br />
25 25<br />
37.5 37.5<br />
50 50<br />
62.5 62.5<br />
75 75<br />
87.5 87.5<br />
100 100<br />
112.5 112.5<br />
125 125<br />
137.5 137.5<br />
150<br />
150<br />
-150 -100 -50 0 50 100 150<br />
100 100<br />
75 75<br />
50 50<br />
25 25<br />
b<br />
p<br />
ph<br />
0<br />
0<br />
-150 -100 -50 0 50 100 150<br />
Lisker & Abramson (1964, 1970)<br />
25
100 100<br />
75 75<br />
50 50<br />
25 25<br />
b<br />
p<br />
0<br />
0<br />
-150<br />
-150<br />
-137.5<br />
-137.5 -125<br />
-125<br />
-112.5<br />
-112.5<br />
-100<br />
-100<br />
-87.5<br />
-87.5<br />
-75<br />
-75<br />
-62.5<br />
-62.5<br />
-50<br />
-50<br />
-37.5<br />
-37.5<br />
-25<br />
-25<br />
-12.5<br />
-12.5<br />
00<br />
12.5 12.5<br />
25 25<br />
37.5 37.5<br />
50 50<br />
62.5 62.5<br />
75 75<br />
87.5 87.5<br />
100 100<br />
112.5 112.5<br />
125 125<br />
137.5 137.5<br />
150<br />
150<br />
-150 -100 -50 0 50 100 150<br />
100 100<br />
75 75<br />
50 50<br />
25 25<br />
b<br />
p<br />
ph<br />
0<br />
0<br />
-150 -100 -50 0 50 100 150<br />
Lisker & Abramson (1964, 1970)<br />
25
100 100<br />
75 75<br />
50 50<br />
25 25<br />
good discrimination<br />
b<br />
p<br />
0<br />
0<br />
-150<br />
-150<br />
-137.5<br />
-137.5 -125<br />
-125<br />
-112.5<br />
-112.5<br />
-100<br />
-100<br />
-87.5<br />
-87.5<br />
-75<br />
-75<br />
-62.5<br />
-62.5<br />
-50<br />
-50<br />
-37.5<br />
-37.5<br />
-25<br />
-25<br />
-12.5<br />
-12.5<br />
00<br />
12.5 12.5<br />
25 25<br />
37.5 37.5<br />
50 50<br />
62.5 62.5<br />
75 75<br />
87.5 87.5<br />
100 100<br />
112.5 112.5<br />
125 125<br />
137.5 137.5<br />
150<br />
150<br />
-150 -100 -50 0 50 100 150<br />
100 100<br />
75 75<br />
50 50<br />
25 25<br />
good discrimination<br />
b<br />
p<br />
ph<br />
0<br />
0<br />
-150 -100 -50 0 50 100 150<br />
Lisker & Abramson (1964, 1970)<br />
25
100 100<br />
75 75<br />
50 50<br />
poor discrimination<br />
good discrimination<br />
b<br />
p<br />
25 25<br />
0<br />
0<br />
-150<br />
-150<br />
-137.5<br />
-137.5 -125<br />
-125<br />
-112.5<br />
-112.5<br />
-100<br />
-100<br />
-87.5<br />
-87.5<br />
-75<br />
-75<br />
-62.5<br />
-62.5<br />
-50<br />
-50<br />
-37.5<br />
-37.5<br />
-25<br />
-25<br />
-12.5<br />
-12.5<br />
00<br />
12.5 12.5<br />
25 25<br />
37.5 37.5<br />
50 50<br />
62.5 62.5<br />
75 75<br />
87.5 87.5<br />
100 100<br />
112.5 112.5<br />
125 125<br />
137.5 137.5<br />
150<br />
150<br />
-150 -100 -50 0 50 100 150<br />
100 100<br />
75 75<br />
50 50<br />
25 25<br />
good discrimination<br />
good discrimination<br />
b<br />
p<br />
ph<br />
0<br />
0<br />
-150 -100 -50 0 50 100 150<br />
Lisker & Abramson (1964, 1970)<br />
25
Familiarized to<br />
Unimodal vs. Bimodal Distributions<br />
occurrence<br />
per block <strong>of</strong><br />
familiarization<br />
16<br />
12<br />
8<br />
Not always for<br />
phonological<br />
non-contrast<br />
4<br />
0<br />
1 2 3 4 5 6 7 8<br />
Token Number<br />
/da/<br />
/ta/<br />
26
Greek Vowels<br />
F1 x F2 space for Greek vowel tokens<br />
k-means clustering<br />
criteria for stopping (over-fitting penalty)<br />
procedure finds four vowels<br />
(approximately i, e, a, o+u)<br />
27
Bimodality is not<br />
always so clear.<br />
Swingley<br />
28
Learning from the<br />
Primordial soup<br />
Elizabeth Allyn Smith<br />
Kathleen Currie Hall<br />
29
Greek Vowels<br />
F1 x F2 space for Greek vowel tokens<br />
k-means clustering<br />
criteria for stopping (over-fitting penalty)<br />
procedure finds four vowels<br />
(approximately i, e, a, o+u)<br />
30
English: Peterson and Barney<br />
3000<br />
F2<br />
2000<br />
1000<br />
front<br />
back<br />
200 400 600 800 1000 1200<br />
F1<br />
high ------------low<br />
31
Combine approaches<br />
Normalize<br />
Recursive binary division algorithm<br />
Make Ø/[F] divisions<br />
Using compactness <strong>of</strong> clustering on [F] as<br />
criteria<br />
32
Normalization<br />
Don’t use F1 and F2<br />
Use F1/F3 and F2/F3<br />
Corrects for male/female/child differences<br />
On-line rather than statistical normalization<br />
33
Raw data ANOVA<br />
F0<br />
300<br />
200<br />
100<br />
m f<br />
F1<br />
1300<br />
1000<br />
700<br />
400<br />
100<br />
m f<br />
F2<br />
3000<br />
2000<br />
1000<br />
mf<br />
F3<br />
4000<br />
3000<br />
2000<br />
f<br />
m<br />
a<br />
c<br />
a<br />
c<br />
a<br />
c<br />
a<br />
c<br />
main effects (gender, age) p < 0.001 for all<br />
interaction (gender*age) p < 0.01 for F0, F2<br />
34
Ratio with F3 ANOVA<br />
F1/F3<br />
0.40<br />
0.30<br />
0.20<br />
0.10<br />
a<br />
c<br />
mf<br />
F2/F3<br />
0.9<br />
0.7<br />
0.5<br />
0.3<br />
0.1<br />
a<br />
c<br />
m f<br />
no main effects (p > 0.24)<br />
marginal interaction for F2/F3 (p = 0.0919)<br />
35
A little improvement<br />
0.9<br />
F2<br />
3000<br />
2000<br />
F2/F3<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
1000<br />
0.3<br />
0.2<br />
200 400 600 800 1000 1200<br />
F1<br />
.10 .20 .30 .40<br />
F1/F3<br />
36
Best split for front<br />
0.9<br />
How compact<br />
F2/F3<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
Text<br />
.10 .20 .30 .40<br />
F1/F3<br />
front ✔<br />
back<br />
37
Trapping the problem<br />
Number <strong>of</strong> contrasts is bounded:<br />
From below by phonology (too few)<br />
From above by phonetics (too many)<br />
Actual answer is trapped in-between<br />
38
Look for<br />
phonological<br />
activity.<br />
Dresher<br />
39
Context<br />
sensitive<br />
contrasts.<br />
Maye<br />
40
Frequent<br />
frames<br />
Mintz<br />
41
Two different contrastive hierarchies:<br />
a. [high] > [labial]<br />
b. [labial] > [high]<br />
[–high]<br />
[+high]<br />
[–labial]<br />
[+labial]<br />
/a/ [–labial] [+labial]<br />
[–high]<br />
[+high]<br />
/u/<br />
/i/<br />
/u/<br />
/a/<br />
/i/<br />
We might expect that the two vowel systems will pattern differently. For<br />
example, system (a) might show alternations or neutralization between /i/<br />
and /u/; in system (b) /i/ might be more closely related to /a/.<br />
42
Diagnosing vowel harmony<br />
CHILDES transcripts (need sane orthography)<br />
Turkish<br />
need a comparison non-VH (e.g. Polish)<br />
(Auto)correlation between adjacent vowels<br />
Compare same and different vowels<br />
within- and between- utterances<br />
43
CHILDES Turkish<br />
50,000<br />
37,500<br />
25,000<br />
47,340<br />
1:2.27<br />
Chi-square test: p < 0.0001<br />
Fisher’s exact test: p < 0.0001<br />
20,869<br />
12,500<br />
1:5.29<br />
9,119<br />
0<br />
within utterance<br />
same vowel<br />
1,723<br />
across utterances<br />
different vowels<br />
44
Not done<br />
Cue for vowel harmony not the same as rule<br />
for vowel harmony<br />
Cue: more “same” vowels within utterances<br />
Rules: [front] harmony<br />
! ! ! [round] harmony when [high]<br />
45
Finding other rules<br />
Find other active features<br />
Tohono O’odham palatalization:<br />
t, d, s → tʃ, dʒ, ʃ / _ i, u, ɨ<br />
[coronal] → [laminal] / _ [high]<br />
correlation between [laminal] and [high]<br />
So [laminal], [high] are active<br />
Arbitrary correlations<br />
46
Finding words<br />
47
Transitional<br />
probabilities!<br />
Saffran<br />
48
“UG, statistics or both”<br />
Look between<br />
stresses!<br />
Yang<br />
49
Virtuous circularity<br />
“We find that the best<br />
heuristics make use <strong>of</strong><br />
structural information<br />
obtained by parsing<br />
input sentences during<br />
the course <strong>of</strong> learning”<br />
Sakas<br />
50
Bayes!<br />
Johnson<br />
51
Bayes’s Rule<br />
p(A&B) = p(A|B) * p(B)<br />
p(A&B) = p(B|A) * p(A)<br />
p(A|B) * p(B) = p(B|A) * p(A)<br />
p(A|B) = p(B|A) * p(A) / p(B)<br />
52
Bayesian phonology<br />
p(UR|SR) = p(SR|UR) * p(UR) / p(SR)<br />
p(UR|SR) p(SR|UR): analysis by synthesis<br />
p(UR): UG as Bayesian priors<br />
Poeppel<br />
53
Finding words from<br />
vowel harmony<br />
p(#|XY) = p(XY|#) * p(#) / p(XY)<br />
Estimate p(XY|#) from p(XY|Utterance)<br />
Put some prior expectations on p(#)<br />
Get p(XY) from data<br />
54
a ï o u e i ö ü<br />
a ï o u e i ö ü<br />
Some info in there<br />
% # % no #<br />
% # % no #<br />
100<br />
75<br />
50<br />
25<br />
100<br />
75<br />
50<br />
25<br />
0<br />
0<br />
ï<br />
o<br />
55
I’m next!<br />
(after lunch)<br />
Hudson Kam<br />
56
Discussion<br />
57
Cohorts and neighbors<br />
Features<br />
Analysis by synthesis<br />
Multi-resolution<br />
3 passes<br />
58
Detail<br />
Multi-resolution<br />
Both coarse and fine info<br />
Exemplar: k-nearest neighbors<br />
coarseness in retrieval<br />
Substatistics<br />
fineness calculated in coarse addresses<br />
59