08.01.2013 Views

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Digital Information Encoding on DNA<br />

Max H. Garzon 1 , Kiranchand V. Bobba 1 ,andBryanP.Hyde 2<br />

1 Computer Science, The University <strong>of</strong> Memphis<br />

Memphis, TN 38152-3240, U.S.A.<br />

{mgarzon, kbobba}@memphis.edu<br />

2 SAIC-Scientific Applications International Corporation<br />

Huntsville, AL 35805, U.S.A.<br />

brian.p.hyde@saic.com<br />

Abstract. Novel approaches to information encoding with DNA are<br />

explored using a new Watson-Crick structure for binary strings more<br />

appropriate to model DNA hybridization. First, a Gibbs energy analysis<br />

<strong>of</strong> codeword sets is obtained by using a template and extant errorcorrecting<br />

codes. Template-based codes have too low Gibbs energies that<br />

allow cross-hybridization. Second, a new technique is presented to construct<br />

arbitrarily large sets <strong>of</strong> noncrosshybridizing codewords <strong>of</strong> high<br />

quality by two major criteria. They have a large minimum number <strong>of</strong><br />

mismatches between arbitrary pairs <strong>of</strong> words and alignments; moreover,<br />

their pairwise Gibbs energies <strong>of</strong> hybridization remain bounded within a<br />

safe region according to a modified nearest-neighbor model that has been<br />

verified in vitro. The technique is scalable to long strands <strong>of</strong> up to 150mers,<br />

is in principle implementable in vitro, and may be useful in further<br />

combinatorial analysis <strong>of</strong> DNA structures. Finally, a novel method to encode<br />

abiotic information in DNA arrays is defined and some preliminary<br />

experimental results are discussed. These new methods can be regarded<br />

as a different implementation <strong>of</strong> Tom Head’s idea <strong>of</strong> writing on DNA<br />

molecules [22], although only through hybridization.<br />

1 Introduction<br />

Virtually every application <strong>of</strong> DNA computing [23,1,17] requires the use <strong>of</strong> appropriate<br />

sequences to achieve intended hybridizations, reaction products, and<br />

yields. The codeword design problem [4,19,3] requires producing sets <strong>of</strong> strands<br />

that are likely to bind in desirable hybridizations while minimizing the probability<br />

<strong>of</strong> erroneous hybridizations that may induce false positive outcomes. A fairly<br />

extensive literature now exists on various aspects and approaches <strong>of</strong> the problem<br />

(see [4] for a review). Approaches to this problem can be classified as evolutionary<br />

[7,15,9] and conventional design [6,19]. Both types <strong>of</strong> method require the use<br />

<strong>of</strong> a measure <strong>of</strong> the quality <strong>of</strong> the codewords obtained, through either a fitness<br />

function or a quantifiable measure <strong>of</strong> successful outcomes in test tubes.<br />

Although some algorithms have been proposed for testing the quality <strong>of</strong> codeword<br />

sets in terms <strong>of</strong> being free <strong>of</strong> secondary structure [4,10], very few methods<br />

have been proposed to systematically produce codes <strong>of</strong> high enough quality to<br />

N. Jonoska et al. (Eds.): <strong>Molecular</strong> <strong>Computing</strong> (Head Festschrift), <strong>LNCS</strong> <strong>2950</strong>, pp. 152–166, 2004.<br />

c○ Springer-Verlag Berlin Heidelberg 2004

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!