08.01.2013 Views

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

300 Peter Leupold, Victor Mitrana, and José M.Sempere<br />

<strong>of</strong> all strings (words) over an alphabet V is denoted by V ∗ and V + = V ∗ \{ε},<br />

where ε denotes the empty string. The length <strong>of</strong> a string x is denoted by |x|,<br />

hence |ε| = 0, while the number <strong>of</strong> all occurrences <strong>of</strong> a letter a in x is denoted<br />

by |x|a . For an alphabet V = {a1,a2,...,ak} (we consider an ordering on V ),<br />

the Parikh mapping associated with V is a homomorphism ΨV from V ∗ into<br />

the monoid <strong>of</strong> vector addition on IN k , defined by ΨV (s) =(|s|a1, |s|a2,...,|s|ak );<br />

moreover, given a language L over V , we define its image through the Parikh<br />

mapping as the set ΨV (L) ={ΨV (x) | x ∈ L}. A subset X <strong>of</strong> IN k is said to be<br />

linear if there are the vectors c0,c1,c2,...,cn ∈ IN k ,forsomen≥0such that<br />

X = {c0 + �n i=1 xici | xi ∈ IN, 1 ≤ i ≤ n}. A finite union <strong>of</strong> linear sets is called<br />

semilinear. For any positive integer n we write [n] fortheset{1, 2,...,n}.<br />

Let V be an alphabet and X ∈{IN}∪{[k] | k ≥ 1}. Forastringw∈V + ,we<br />

set<br />

DX(w) ={uxxv | w = uxv, u, v ∈ V ∗ ,x∈ V + , |x| ∈X}.<br />

We now define recursively the languages:<br />

D 0 X(w) ={w}, D i X(w) = �<br />

DX(x), i≥ 1,<br />

D ∗ X (w) =� D<br />

i≥0<br />

i X (w).<br />

x∈D i−1<br />

X (w)<br />

The languages D∗ IN (w) andD∗ [k] (w), k ≥ 1, are called the unbounded duplication<br />

language and the k-bounded duplication language, respectively, defined by w. In<br />

other words, for any X ∈{IN}∪{[k] | k ≥ 1}, D∗ X (w) is the smallest language<br />

L ′ ⊆ V ∗ such that w ∈ L ′ and whenever uxv ∈ L ′ , uxxv ∈ L ′ holds for all<br />

u, v ∈ V ∗ , x ∈ V + ,and|x| ∈X.<br />

A natural question concerns the place <strong>of</strong> unbounded duplication languages<br />

in the Chomsky hierarchy. In [7] it is shown that the unbounded duplication<br />

language defined by any word over a two-letter alphabet is regular, while [13]<br />

shows that these are the only cases when the unbounded language defined by a<br />

word is regular. By combining these results we have:<br />

Theorem 1. [7,13] The unbounded duplication language defined by a word w is<br />

regular if and only if w contains at most two different letters.<br />

3 Unbounded Duplication Languages<br />

We do not know whether or not all unbounded duplication languages are contextfree.<br />

A straightforward observation leads to the fact that all these languages are<br />

linear sets, that is, the image <strong>of</strong> each unbounded duplication language through<br />

the Parikh mapping is linear. Indeed, if w ∈ V + , V = {a1,a2,...,an}, then one<br />

can easily infer that<br />

ΨV (D ∗ IN(w)) = {ΨV (w)+<br />

n�<br />

i=1<br />

xie (n)<br />

i<br />

| xi ∈ IN, 1 ≤ i ≤ n},

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!