08.01.2013 Views

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Digital Information Encoding on DNA 153<br />

guarantee good performance in test tube protocols. Other than greedy “generate<br />

and filter” methods common in evolutionary algorithms [8], the only systematic<br />

procedure to obtain code sets for DNA computing by analytic methods is the<br />

template method developed in [2]. An application <strong>of</strong> the method requires the use<br />

<strong>of</strong> a binary word, so-called template, in combination with error-correcting codes<br />

from information theory [26], and produces codewords set designs with DNA<br />

molecules <strong>of</strong> size up to 32−mers (more below.)<br />

This paper explores novel methods for encoding information in DNA strands.<br />

The obvious approach is to encode strings into DNA strands. They can be stored<br />

or used so that DNA molecules can self-assemble fault-tolerantly for biomolecular<br />

computation [9,11,19,18]. In Section 2, a binary analog <strong>of</strong> DNA is introduced as<br />

a framework for discussing encoding problems. Section 3.2 describes a new technique,<br />

analogous to the tensor product techniques used in quantum computing<br />

for error-correcting codes [29], to produce appropriate methods to encode information<br />

in DNA-like strands and define precisely what “appropriate” means. It is<br />

also shown how these error-preventing codes for binary DNA (BNA, for short)<br />

can be easily translated into codeword sets <strong>of</strong> comparable quality for DNAbased<br />

computations. Furthemore, two independent evaluations are discussed <strong>of</strong><br />

the quality <strong>of</strong> these codes in ways directly related to their performance in test<br />

tube reactions for computational purposes with DNA. We also compare them to<br />

code sets obtained using the template method.<br />

Direct encoding into DNA strands is not a very efficient method for storage<br />

or processing <strong>of</strong> massive amounts (over terabytes) <strong>of</strong> abiotic data because<br />

<strong>of</strong> the enormous implicit cost <strong>of</strong> DNA synthesis to produce the encoding sequences.<br />

A more indirect and more efficient approach is described in Section 4.<br />

Assuming the existence <strong>of</strong> a large basis <strong>of</strong> noncrosshybridizing DNA molecules,<br />

as obtained above, theoretical and experimental results are presented that allow<br />

a preliminary assesment <strong>of</strong> the reliability and potential capacity <strong>of</strong> this method.<br />

These new methods can be regarded as a different implementation <strong>of</strong> Tom Head’s<br />

idea <strong>of</strong> aqueous computing for writing on DNA molecules [22,21], although only<br />

hybridization is involved. Section 5 summarizes the results and presents some<br />

preliminary conclusions about the technical feasibility <strong>of</strong> these methods.<br />

2 Binary Models <strong>of</strong> DNA<br />

DNA molecules can only process information by intermolecular reactions, usually<br />

hybridization in DNA-based computing. Due to the inherent uncertainty<br />

in biochemical processes, small variations in strand composition will not cause<br />

major changes in hybridization events, with consequent limitations on using<br />

similar molecules to encode different inputs. Input strands must be ”far apart”<br />

from each other in hybridization affinity in order to ensure that only desirable<br />

hybridizations occur. The major difficulty is that the hybridization affinity between<br />

DNA strands is hard to quantify. Ideally, the Gibbs energy released in the<br />

process is the most appropriate criterion, but its exact calculation is difficult,<br />

even for pairwise interactions among small oligos (up to 60−mers), and using

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!