13.11.2014 Views

Introduction to Computational Linguistics

Introduction to Computational Linguistics

Introduction to Computational Linguistics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

23. Some Metatheorems 96<br />

we take ⃗x = a k b b c k . We shall have <strong>to</strong> find appropriate ⃗v and ⃗y. First, it is easy<br />

<strong>to</strong> see that ⃗v⃗y must contain the same number of a, b and c (can you see why?).<br />

The product is nonempty, so at least one a, b and c must occur. We show that<br />

⃗u⃗v 2 ⃗w⃗y 2 ⃗z L. Suppose, ⃗v contains a and b. Then ⃗v 2 contains a b before an a.<br />

Contradiction. Likewise, ⃗v cannot contain a and c. So, ⃗v contains only a. Now<br />

⃗y contains b and c. But then ⃗y 2 contains a c before a b. Contradiction. Hence,<br />

⃗u⃗v 2 ⃗w⃗y 2 ⃗z L.<br />

Now, the Pumping Lemma is not an exact characterization. Here is a language<br />

that satisfies the test but is not context free:<br />

(249) C := {⃗x⃗x : ⃗x ∈ A ∗ }<br />

This is known as the copy language. We shall leave it as an assignment <strong>to</strong> show<br />

that C fulfills the conclusion of the pumping lemma. We concentrate on showing<br />

that it is not context free. The argument is a bit involved. Basically, we take a<br />

string as in the pumping lemma and fix a decomposition so that ⃗u⃗v j ⃗w⃗y j ⃗z ∈ C for<br />

every j. The idea is now that no matter what string y 0 y 1 · · · y p−1 we are given, if it<br />

contains a constituent A:<br />

(250) y 0 g 1 · · · y p−1 • y p · · · y q−1 • y j · · · y q−1<br />

then we can insert the pair ⃗v, ⃗y like this:<br />

(251) y 0 y 1 · · · y p−1 • x p · · · x j−1 ◦ y i · · · y j−1 • x l · · · x m−1 y p · · · y q−1<br />

Now one can show that since there are a limited number of nonterminals we are<br />

bound <strong>to</strong> find a string which contains such a constituent where inserting the strings<br />

is inappropriate.<br />

Natural languages do contain a certain amount of copying. For example,<br />

Malay (or Indonesian) forms the plural of a noun by reduplicating it. Chinese<br />

has been argued <strong>to</strong> employ copying in yes-no questions.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!