08.01.2013 Views

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Methods for Constructing Coded DNA Languages 251<br />

Let A be the adjacency matrix <strong>of</strong> this automaton with the characteristic<br />

equation µ 2k−2 − µ k−1 (p − 1) k−2 − (p − 1) k−2 =0.Thisimplies(p− 1) k−2 =<br />

µ k−1 − µk−1<br />

µ k−1 . We are interested in the largest real value for µ. Sinceµ>0, we<br />

+1<br />

have 0 < µk−1<br />

µ k−1 +1<br />

< 1 which implies (p − 1) k−2<br />

k−1 3 2<br />

3 > 2. Hence the entropy <strong>of</strong> X ∗ in this case is greater than log 2.<br />

4 Concluding Remarks<br />

In this paper we investigated theoretical properties <strong>of</strong> languages that consist <strong>of</strong><br />

DNA based code words. In particular we concentrated on intermolecular and<br />

intramolecular cross hybridizations that can occur as a result that a Watson-<br />

Crick complement <strong>of</strong> a (sub)word <strong>of</strong> a code word is also a (sub)word <strong>of</strong> a code<br />

word. These conditions are necessary for a design <strong>of</strong> good codes, but certainly<br />

may not be sufficient. For example, the algorithms used in the programs developed<br />

by Seeman [23], Feldkamp [10] and Ruben [22], all check for uniqueness <strong>of</strong><br />

k-length subsequences in the code words. Unfortunately, none <strong>of</strong> the properties<br />

from Definition 21 ensures uniqueness <strong>of</strong> k-length words. Such code word properties<br />

remain to be investigated. We hope that the general methods <strong>of</strong> designing<br />

such codewords will simplify the search for “good” codes. Better characterizations<br />

<strong>of</strong> good code words that are closed under Kleene ∗ operation may provide<br />

even faster ways for designing such codewords. Although the Proposition 36<br />

provides a rather good design <strong>of</strong> code words, the potential repetition <strong>of</strong> certain<br />

subwords is not desirable. The most challenging questions <strong>of</strong> characterizing and<br />

designing good θ-k-codes that avoids numerous repetition <strong>of</strong> subwords remains<br />

to be developed.<br />

Our approach to the question <strong>of</strong> designing “good” DNA codes has been from<br />

the formal language theory aspect. Many issues that are involved in designing<br />

such codes have not been considered. These include (and are not limited to)<br />

the free energy conditions, melting temperature as well as Hamming distance<br />

conditions. All these remain to be challenging problems and a procedure that<br />

includes all or majority <strong>of</strong> these aspects will be desirable in practice. It may be<br />

the case that regardless <strong>of</strong> the way the codes are designed, the ultimate test for<br />

the “goodness” <strong>of</strong> the codes will be in the laboratory.<br />

Acknowledgment<br />

This work has been partially supported by the grant EIA-0086015 from the<br />

National Science Foundation, USA.<br />

References<br />

1. R.L. Adler, D. Coppersmith, M. Hassner, Algorithms for sliding block codes -an application<br />

<strong>of</strong> symbolic dynamics to information theory, IEEE Trans. Inform. Theory<br />

29 (1983), 5-22.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!