08.01.2013 Views

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Methods for Constructing Coded DNA Languages 243<br />

For words representing DNA sequences we use the following convention. A<br />

word u over ∆ denotes a DNA strand in its 5 ′ → 3 ′ orientation. The Watson-<br />

Crick complement <strong>of</strong> the word u, also in orientation 5 ′ → 3 ′ is denoted with<br />

← u. For example if u = AGGC then ← u= GCCT. There are two types <strong>of</strong> unwanted<br />

hybridizations: intramolecular and intermolecular. The intramolecular<br />

hybridization happens when two sequences, one being a reverse complement <strong>of</strong><br />

the other appear within the same DNA strand (see Fig. 1). In this case the DNA<br />

strand forms a hairpin.<br />

u u<br />

v x<br />

(a) (b)<br />

w=uvux, u = k, v = m<br />

Fig. 1. Intramolecular hybridization (θ-subword compliance): (a) the reverse complement<br />

is at the beginning <strong>of</strong> the 5 ′ end, (b) the reverse complement is at the end <strong>of</strong> the<br />

3 ′ .The3 ′ end<strong>of</strong>theDNAstrandisindicatedwithanarrow.<br />

Two particular intermolecular hybridizations are <strong>of</strong> interest (see Fig. 2). In<br />

Fig. 2 (a) the strand labeled u is a reverse complement <strong>of</strong> a subsequence <strong>of</strong> the<br />

strand labeled v, and in the same figure (b) represents the case when u is the<br />

reverse complement <strong>of</strong> a portion <strong>of</strong> a concatenation <strong>of</strong> v and w.<br />

(a)<br />

u u<br />

v<br />

Fig. 2. Two types <strong>of</strong> intermolecular hybridization: (a) (θ-compliant) one code word is<br />

a reverse complement <strong>of</strong> a subword <strong>of</strong> another code word, (b) (θ-free) a code word is a<br />

reverse complement <strong>of</strong> a subword <strong>of</strong> a concatenation <strong>of</strong> two other code words. The 3 ′<br />

endisindicatedwithanarrow.<br />

Throughout the rest <strong>of</strong> the paper, we concentrate on finite sets X ⊆ Σ ∗ that<br />

are codes such that every word in X + can be written uniquely as a product<br />

<strong>of</strong> words in X. Inotherwords,X ∗ is a free monoid generated with X. For<br />

the background on codes we refer the reader to [5]. We will need the following<br />

definitions:<br />

Pref(w) ={u |∃v ∈ Σ ∗ ,uv = w}<br />

Suff(w) ={u |∃v ∈ Σ ∗ ,vu= w}<br />

Sub(w) ={u |∃v1,v2 ∈ Σ ∗ ,v1uv2 = w}<br />

We define the set <strong>of</strong> prefixes, suffixes and subwords <strong>of</strong> a set <strong>of</strong> words. Similarly,<br />

we have Suffk(w) = Suff(w) ∩ Σ k ,Prefk(w) =Pref(w) ∩ Σ k and Subk(w) =<br />

Sub(w) ∩ Σ k .<br />

v<br />

(b)<br />

w

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!