20.07.2013 Views

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

Notes on computational linguistics.pdf - UCLA Department of ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Stabler - Lx 185/209 2003<br />

6.6 Assessment <strong>of</strong> the GLC (“stack based”) parsers<br />

6.6.1 Terminati<strong>on</strong><br />

(50) We have not found any recogniti<strong>on</strong> method that is guaranteed to terminate (i.e. has a finite search space)<br />

<strong>on</strong> any input, even when the grammar has left recursi<strong>on</strong> and empty categories. In fact, it is obvious that<br />

we do not want to do this, since a c<strong>on</strong>text free grammar can have infinitely ambiguous strings.<br />

6.6.2 Coverage<br />

(51) The GLC recogniti<strong>on</strong> methods are designed for CFGs. Human languages have structures that are <strong>on</strong>ly<br />

very inelegantly handled by CFGs, and structures that seem bey<strong>on</strong>d the power <strong>of</strong> CFGs, as we menti<strong>on</strong>ed<br />

earlier (Savitch et al., 1987).<br />

6.6.3 Ambiguity (local and global) vs. glcola(k) parsing<br />

(52) Ambiguity is good.<br />

If you know which Clint<strong>on</strong> I am talking about, then I do not need to say “William Jeffers<strong>on</strong> Clint<strong>on</strong>.”<br />

Doing so violates normal c<strong>on</strong>venti<strong>on</strong>s about being brief and to-the-point in c<strong>on</strong>versati<strong>on</strong> (Grice, 1975),<br />

and c<strong>on</strong>sequently calls for some special explanati<strong>on</strong> (e.g. pomposity, or a desire to signal formality).<br />

A full name is needed when <strong>on</strong>e Clint<strong>on</strong> needs to be distinguished from another. For most <strong>of</strong> us<br />

n<strong>on</strong>-Clint<strong>on</strong>s, in most c<strong>on</strong>texts, using just “Clint<strong>on</strong>” is enough, even though the name is semantically<br />

ambiguous.<br />

(53) For the same reas<strong>on</strong>, it is no surprise that standard Prolog uses the list c<strong>on</strong>structor “.” both as a functi<strong>on</strong><br />

to build lists and as a predicate whose “pro<strong>of</strong>” triggers loading a file. Some dialects <strong>of</strong> Prolog also use<br />

“/” in some c<strong>on</strong>texts to separate a predicate name from its arity, and in other c<strong>on</strong>texts for divisi<strong>on</strong>. This<br />

kind <strong>of</strong> multiple use <strong>of</strong> an expressi<strong>on</strong> is harmless in c<strong>on</strong>text, and allows us to use shorter expressi<strong>on</strong>s.<br />

(54) There is a price to pay in parsing, since structural ambiguities must be resolved. Some <strong>of</strong> these ambiguities<br />

are resolved definitively by the structure <strong>of</strong> the sentence; other ambiguities persist throughout a<br />

whole sentence and are resolved by discourse c<strong>on</strong>text. It is natural to assume that these various types<br />

<strong>of</strong> ambiguity are resolved by similar mechanisms in human language understanding, but <strong>of</strong> course this<br />

is an empirical questi<strong>on</strong>.<br />

(55) Global ambiguity (unresolved by local structure) How much structural ambiguity do sentences <strong>of</strong><br />

human languages really have? 28 We can get a first impressi<strong>on</strong> <strong>of</strong> how serious the structural ambiguity<br />

problem is by looking at simple artificial grammars for these c<strong>on</strong>structi<strong>on</strong>s.<br />

a. PP attachment in [ VP V D N PP1 PP2 ...]<br />

C<strong>on</strong>sider the grammar:<br />

VP → VNPPP ∗<br />

NP → DNPP ∗<br />

A grammar like this cannot be directly expressed in standard c<strong>on</strong>text free form. It defines a c<strong>on</strong>text free<br />

language, but it is equivalent to the following infinite grammar:<br />

np → dn vp→ vnp<br />

np → dnpp vp→ vnppp<br />

np → d n pp pp vp → vnppppp<br />

np → d n pp pp pp vp → vnppppppp<br />

np → d n pp pp pp pp vp → vnppppppppp<br />

np → d n pp pp pp pp pp vp → vnppppppppppp<br />

… …<br />

28 Classic discussi<strong>on</strong>s <strong>of</strong> this point appear in, Church and Patil (1982) and Langendoen, McDaniel, and Langsam (1989).<br />

96

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!