20.07.2013 Views

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Stabler - Lx 185/209 2003<br />

octave:21> gplot [1:10] result title "x",\<br />

> result using 1:3 title "y", result using 1:4 title "z"<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

path <strong>of</strong> a Markov chain<br />

0<br />

1 2 3 4 5 6 7 8 9 10<br />

(67) Notice that the initial distributi<strong>on</strong> and transiti<strong>on</strong> matrix can be represented by a finite state machine<br />

with no vocabulary and no final states:<br />

0.7<br />

0.2<br />

0.2<br />

0.7<br />

s<br />

1<br />

0.2<br />

s<br />

2<br />

0.1<br />

s<br />

3<br />

0.1<br />

(68) Notice that no Markov chain can be such that after a sequence <strong>of</strong> states acccc there is a probability <strong>of</strong><br />

0.8 that the next symbol will be an a, thatis,<br />

P(a|acccc) = 0.8<br />

when it is also the case that<br />

P(b|bcccc) = 0.8<br />

This follows from the requirement menti<strong>on</strong>ed in (61) that in each row i, the sum <strong>of</strong> the transiti<strong>on</strong><br />

probabilities from that state <br />

qj∈ΩX P(qj|qi) = 1, and so we cannot have both P(b|c) = 0.8andP(a|c) =<br />

0.8.<br />

(69) Chomsky (1963, p337) observes that the Markovian property that we see in state sequences does not<br />

always hold in regular languages. For example, the following finite state machine, to which we have<br />

141<br />

0.7<br />

1<br />

0.1<br />

x<br />

y<br />

z

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!