13.11.2014 Views

Introduction to Computational Linguistics

Introduction to Computational Linguistics

Introduction to Computational Linguistics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

20. Greibach Normal Form 79<br />

Definition 27 A tree domain is a set T of strings of natural numbers such that (i)<br />

if ⃗x if a prefix of ⃗y ∈ T then also ⃗x ∈ T, (b) if ⃗x j ∈ T and i < j then also ⃗xi ∈ T.<br />

We define ⃗x > ⃗y if ⃗x is a proper prefix of ⃗y and ⃗x ⊏ ⃗y iff ⃗x = ⃗ui⃗v and ⃗y = ⃗u j⃗w for<br />

some sequences ⃗u, ⃗v, ⃗w and numbers i < j.<br />

The depth–first search traverses the tree domain in the lexicographical order, the<br />

breadth–first search in the numerical order. Let the following tree domain be<br />

given.<br />

<br />

<br />

ε<br />

❅<br />

❅<br />

❅<br />

❅<br />

0 1 2<br />

✂ ❇<br />

✂ ❇<br />

00<br />

✂<br />

10<br />

❇<br />

11 20<br />

The depth–first linearization is<br />

(202) ε, 0, 00, 1, 10, 11, 2, 20<br />

The breadth–first linearization, however, is<br />

(203) ε, 0, 1, 2, 00, 10, 11, 20<br />

Notice that with these linearizations the tree domain ω ∗ cannot be enumerated.<br />

Namely, the depth–first linearization begins as follows.<br />

(204) ε, 0, 00, 000, 0000, . . .<br />

So we never reach 1. The breadth–first linearization goes like this.<br />

(205) ε, 0, 1, 2, 3, . . .<br />

So, we never reach 00. On the other hand, ω ∗ is countable, so we do have a<br />

linearization, but it is more complicated than the given ones.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!