Introduction to Computational Linguistics
Introduction to Computational Linguistics
Introduction to Computational Linguistics
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
15. Finite State Transducers 57<br />
And for a set S ⊆ A ∗<br />
(151) R T [S ] := {⃗y : exists ⃗x ∈ S : ⃗x R T ⃗y}<br />
This is the set of all strings over B that are the result of translation via T of a string<br />
in S . In the present case, notice that every string over A has a translation, but not<br />
every string over B is the translation of a string. This is the case if and only if it<br />
has even length.<br />
The translation need not be unique. Here is a finite state machine that translates<br />
a in<strong>to</strong> bc ∗ .<br />
(152)<br />
A := {a}, B := {b, c}, Q := {0, 1}, i 0 := 0, F := {1},<br />
δ := {〈0, a, 1, b〉, 〈1, ε, 1, c〉}<br />
This au<strong>to</strong>ma<strong>to</strong>n takes as only input the word a. However, it outputs any of b, bc,<br />
bcc and so on. Thus, the translation of a given input can be highly underdetermined.<br />
Notice also the following. For a language S ⊆ A ∗ , we have the following.<br />
⎧<br />
⎪⎨ bc ∗<br />
(153) R T [S ] = ⎪⎩ ∅<br />
if a ∈ S<br />
otherwise.<br />
This is because only the string a has a translation. For all words ⃗x a we have<br />
R T (⃗x) = ∅.<br />
The transducer can also be used the other way: then it translates words over B<br />
in<strong>to</strong> words over A. We use the same machine, but now we look at the relation<br />
(154) R ⌣ T (⃗y) := {⃗x : ⃗x R T ⃗y}<br />
(This is the converse of the relation R T .) Similarly we define<br />
(155)<br />
(156)<br />
R ⌣ T (⃗y) := {⃗x : ⃗x R T ⃗y}<br />
R ⌣ T [S ] := {⃗x : exists ⃗y ∈ S : ⃗x R T ⃗y}<br />
Notice that there is a transducer U such that R U = R ⌣ T .<br />
We quote the following theorem.<br />
Theorem 21 (Transducer Theorem) Let T be a transducer from A <strong>to</strong> B, and let<br />
L ⊆ A ∗ be a regular language. Then R T [L] is regular as well.