20.07.2013 Views

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Stabler - Lx 185/209 2003<br />

It is essentially a “dynamic programming” method and it is included here because <strong>of</strong> its simplicity.<br />

It is doubtful, however, that it will find practical use, for three reas<strong>on</strong>s:<br />

1. n3 time is too much to allow for parsing.<br />

2. The method uses an amount <strong>of</strong> space proporti<strong>on</strong>al to the square <strong>of</strong> the input length.<br />

3. The method <strong>of</strong> the next secti<strong>on</strong> (Earley’s algorithm) does at least as well in all respects as<br />

this <strong>on</strong>e, and for many grammars does better.<br />

7.1.1 CKY example 1<br />

ip → dp i1<br />

dp → lambs<br />

i1 → i0 vp<br />

i0 → will<br />

vp → v0 dp<br />

v0 → eat<br />

dp → oats<br />

dp<br />

lambs<br />

The axioms can be regarded as specifying a finite state machine representati<strong>on</strong> <strong>of</strong> the input:<br />

lambs<br />

0 1<br />

will<br />

Given an n state finite state machine representati<strong>on</strong> <strong>of</strong> the input, computing the CKY closure can be regarded as filling in<br />

the “upper triangle” <strong>of</strong> an n × n matrix, from the (empty) diag<strong>on</strong>al up: 32<br />

2<br />

eat<br />

ip<br />

i0<br />

will<br />

3<br />

oats<br />

i1<br />

v0<br />

eat<br />

4<br />

vp<br />

dp<br />

oats<br />

0 1 2 3 4<br />

0 (0,1):dp<br />

(0,1):lambs (0,4):ip<br />

1 (1,2):i0<br />

(1,2):will (1,4):i1<br />

2 (2,3):v0<br />

(2,3):eat (2,4):vp<br />

3 (3,4):dp<br />

(3,4):oats<br />

4<br />

32 CKY tables and other similar structures <strong>of</strong> intermediate results are frequently c<strong>on</strong>structed by matrix operati<strong>on</strong>s. This idea has been<br />

important in complexity analysis and in attempts to find the fastest possible parsing methods (Valiant, 1975; Lee, 1997). Extensi<strong>on</strong>s <strong>of</strong><br />

matrix methods to more expressive grammars are c<strong>on</strong>sidered by Satta (1994) and others.<br />

104

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!