13.11.2014 Views

Introduction to Computational Linguistics

Introduction to Computational Linguistics

Introduction to Computational Linguistics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

15. Finite State Transducers 57<br />

And for a set S ⊆ A ∗<br />

(151) R T [S ] := {⃗y : exists ⃗x ∈ S : ⃗x R T ⃗y}<br />

This is the set of all strings over B that are the result of translation via T of a string<br />

in S . In the present case, notice that every string over A has a translation, but not<br />

every string over B is the translation of a string. This is the case if and only if it<br />

has even length.<br />

The translation need not be unique. Here is a finite state machine that translates<br />

a in<strong>to</strong> bc ∗ .<br />

(152)<br />

A := {a}, B := {b, c}, Q := {0, 1}, i 0 := 0, F := {1},<br />

δ := {〈0, a, 1, b〉, 〈1, ε, 1, c〉}<br />

This au<strong>to</strong>ma<strong>to</strong>n takes as only input the word a. However, it outputs any of b, bc,<br />

bcc and so on. Thus, the translation of a given input can be highly underdetermined.<br />

Notice also the following. For a language S ⊆ A ∗ , we have the following.<br />

⎧<br />

⎪⎨ bc ∗<br />

(153) R T [S ] = ⎪⎩ ∅<br />

if a ∈ S<br />

otherwise.<br />

This is because only the string a has a translation. For all words ⃗x a we have<br />

R T (⃗x) = ∅.<br />

The transducer can also be used the other way: then it translates words over B<br />

in<strong>to</strong> words over A. We use the same machine, but now we look at the relation<br />

(154) R ⌣ T (⃗y) := {⃗x : ⃗x R T ⃗y}<br />

(This is the converse of the relation R T .) Similarly we define<br />

(155)<br />

(156)<br />

R ⌣ T (⃗y) := {⃗x : ⃗x R T ⃗y}<br />

R ⌣ T [S ] := {⃗x : exists ⃗y ∈ S : ⃗x R T ⃗y}<br />

Notice that there is a transducer U such that R U = R ⌣ T .<br />

We quote the following theorem.<br />

Theorem 21 (Transducer Theorem) Let T be a transducer from A <strong>to</strong> B, and let<br />

L ⊆ A ∗ be a regular language. Then R T [L] is regular as well.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!