Introduction to Computational Linguistics
Introduction to Computational Linguistics
Introduction to Computational Linguistics
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
23. Some Metatheorems 93<br />
This proof can be simplified using another result. Consider a map µ : A → B ∗ ,<br />
which assigns <strong>to</strong> each letter a ∈ A a string (possibly empty) of letters from B. We<br />
extend µ <strong>to</strong> strings as follows.<br />
(231) µ(x 0 x 1 x 2 · · · x n−1 ) = µ(x 0 )µ(x 1 )µ(x 2 ) · · · µ(x n−1 )<br />
For example, let B = {c, d}, and µ(c) := anti ⌢ □ and µ(d) := missile ⌢ □. Then<br />
(232)<br />
(233)<br />
(234)<br />
µ(d) = missile ⌢ □<br />
µ(cdd) = anti missile missile ⌢ □<br />
µ(ccddd) = anti anti missile missile missile ⌢ □<br />
So if M = {c k d k+1 : k ∈ N} then the language above is the µ–image of M (modulo<br />
the blank at the end).<br />
Theorem 36 Let µ : A → B ∗ and L ⊆ A ∗ be a regular language. Then the set<br />
{µ(⃗v) : ⃗v ∈ L} ⊆ B ∗ also is regular.<br />
However, we can also do the following: let ν be the map<br />
(235) a ↦→ c, n ↦→ ε, t ↦→ ε, i ↦→ ε, m ↦→ d, s ↦→ ε, l ↦→ ε, e ↦→ ε, □ ↦→ ε<br />
Then<br />
(236) ν(anti) = c, ν(missile) = d, ν(□) = ε<br />
So, M is also the image of L under ν. Now, <strong>to</strong> disprove that L is regular it is enough<br />
<strong>to</strong> show that M is not regular. The proof is similar. Choose a number k. We show<br />
that the conditions are not met for this k. And since it is arbitrary, the condition<br />
is not met for any number. We take the string c k d k+1 . We try <strong>to</strong> decompose it in<strong>to</strong><br />
⃗u⃗v⃗w such that ⃗u⃗v j ⃗w ∈ M for any j. Three cases are <strong>to</strong> be considered. (Case a)<br />
⃗v = c p for some p (which must be > 0):<br />
(237) cc · · · c • c · · · c • c · · · cdd · · · d<br />
Then ⃗u⃗w = c k−p d k+1 , which is not in M. (Case b) ⃗v = d p for some p > 0. Similarly.<br />
(Case c) ⃗v = c p d q , with p, q > 0.<br />
(238) cc · · · c • c · · · cd · · · d • dd · · · d