LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

More documents

Recommendations

Info

Digital Information Encoding on DNA Max H. Garzon 1 , Kiranchand V. Bobba 1 ,andBryanP.Hyde 2 1 Computer Science, The University of Memphis Memphis, TN 38152-3240, U.S.A. {mgarzon, kbobba}@memphis.edu 2 SAIC-Scientific Applications International Corporation Huntsville, AL 35805, U.S.A. brian.p.hyde@saic.com Abstract. Novel approaches to information encoding with DNA are explored using a new Watson-Crick structure for binary strings more appropriate to model DNA hybridization. First, a Gibbs energy analysis of codeword sets is obtained by using a template and extant errorcorrecting codes. Template-based codes have too low Gibbs energies that allow cross-hybridization. Second, a new technique is presented to construct arbitrarily large sets of noncrosshybridizing codewords of high quality by two major criteria. They have a large minimum number of mismatches between arbitrary pairs of words and alignments; moreover, their pairwise Gibbs energies of hybridization remain bounded within a safe region according to a modified nearest-neighbor model that has been verified in vitro. The technique is scalable to long strands of up to 150mers, is in principle implementable in vitro, and may be useful in further combinatorial analysis of DNA structures. Finally, a novel method to encode abiotic information in DNA arrays is defined and some preliminary experimental results are discussed. These new methods can be regarded as a different implementation of Tom Head’s idea of writing on DNA molecules [22], although only through hybridization. 1 Introduction Virtually every application of DNA computing [23,1,17] requires the use of appropriate sequences to achieve intended hybridizations, reaction products, and yields. The codeword design problem [4,19,3] requires producing sets of strands that are likely to bind in desirable hybridizations while minimizing the probability of erroneous hybridizations that may induce false positive outcomes. A fairly extensive literature now exists on various aspects and approaches of the problem (see [4] for a review). Approaches to this problem can be classified as evolutionary [7,15,9] and conventional design [6,19]. Both types of method require the use of a measure of the quality of the codewords obtained, through either a fitness function or a quantifiable measure of successful outcomes in test tubes. Although some algorithms have been proposed for testing the quality of codeword sets in terms of being free of secondary structure [4,10], very few methods have been proposed to systematically produce codes of high enough quality to N. Jonoska et al. (Eds.): Molecular Computing (Head Festschrift), LNCS 2950, pp. 152–166, 2004. c○ Springer-Verlag Berlin Heidelberg 2004
Digital Information Encoding on DNA 153 guarantee good performance in test tube protocols. Other than greedy “generate and filter” methods common in evolutionary algorithms [8], the only systematic procedure to obtain code sets for DNA computing by analytic methods is the template method developed in [2]. An application of the method requires the use of a binary word, so-called template, in combination with error-correcting codes from information theory [26], and produces codewords set designs with DNA molecules of size up to 32−mers (more below.) This paper explores novel methods for encoding information in DNA strands. The obvious approach is to encode strings into DNA strands. They can be stored or used so that DNA molecules can self-assemble fault-tolerantly for biomolecular computation [9,11,19,18]. In Section 2, a binary analog of DNA is introduced as a framework for discussing encoding problems. Section 3.2 describes a new technique, analogous to the tensor product techniques used in quantum computing for error-correcting codes [29], to produce appropriate methods to encode information in DNA-like strands and define precisely what “appropriate” means. It is also shown how these error-preventing codes for binary DNA (BNA, for short) can be easily translated into codeword sets of comparable quality for DNAbased computations. Furthemore, two independent evaluations are discussed of the quality of these codes in ways directly related to their performance in test tube reactions for computational purposes with DNA. We also compare them to code sets obtained using the template method. Direct encoding into DNA strands is not a very efficient method for storage or processing of massive amounts (over terabytes) of abiotic data because of the enormous implicit cost of DNA synthesis to produce the encoding sequences. A more indirect and more efficient approach is described in Section 4. Assuming the existence of a large basis of noncrosshybridizing DNA molecules, as obtained above, theoretical and experimental results are presented that allow a preliminary assesment of the reliability and potential capacity of this method. These new methods can be regarded as a different implementation of Tom Head’s idea of aqueous computing for writing on DNA molecules [22,21], although only hybridization is involved. Section 5 summarizes the results and presents some preliminary conclusions about the technical feasibility of these methods. 2 Binary Models of DNA DNA molecules can only process information by intermolecular reactions, usually hybridization in DNA-based computing. Due to the inherent uncertainty in biochemical processes, small variations in strand composition will not cause major changes in hybridization events, with consequent limitations on using similar molecules to encode different inputs. Input strands must be ”far apart” from each other in hybridization affinity in order to ensure that only desirable hybridizations occur. The major difficulty is that the hybridization affinity between DNA strands is hard to quantify. Ideally, the Gibbs energy released in the process is the most appropriate criterion, but its exact calculation is difficult, even for pairwise interactions among small oligos (up to 60−mers), and using
Page 1 and 2:
Lecture Notes in Computer Science 2
Page 3 and 4:
Nata˘sa Jonoska Gheorghe Păun Grz
Page 5 and 6:
Thomas J. Head
Page 7 and 8:
VIII Preface portant to keep in min
Page 9 and 10:
X Table of Contents Formal Properti
Page 11 and 12:
Solving Graph Problems by P Systems
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
where Solving Graph Problems by P S
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Writing Information into DNA Masano
Page 35 and 36:
Writing Information into DNA 25 Ham
Page 37 and 38:
Writing Information into DNA 27 Fig
Page 39 and 40:
Writing Information into DNA 29 Dea
Page 41 and 42:
5 Results 5.1 DNA Code for the Engl
Page 43 and 44:
Writing Information into DNA 33 3.
Page 45 and 46:
Writing Information into DNA 35 101
Page 47 and 48:
Balance Machines: Computing = Balan
Page 49 and 50:
+ .. . + Balance Machines: Computin
Page 51 and 52:
x Balance Machines: Computing = Bal
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
1 Balance Machines: Computing = Bal
Page 59 and 60:
Eilenberg P Systems with Symbol-Obj
Page 61 and 62:
2 Definitions Definition 1. A strea
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
5 Conclusions Eilenberg P Systems w
Page 71 and 72:
Molecular Tiling and DNA Self-assem
Page 73 and 74:
3 Molecular Self-assembly Processes
Page 75 and 76:
Page 77 and 78:
Page 79 and 80:
Page 81 and 82:
Page 83 and 84:
Page 85 and 86:
8 Hierarchical Tiling Molecular Til
Page 87 and 88:
Page 89 and 90:
Page 91 and 92:
References Molecular Tiling and DNA
Page 93 and 94:
Page 95 and 96:
On Some Classes of Splicing Languag
Page 97 and 98:
Page 99 and 100:
Page 101 and 102:
Page 103 and 104:
Page 105 and 106:
Page 107 and 108:
Page 109 and 110:
Page 111 and 112: On Some Classes of Splicing Languag
Page 113 and 114: ˆxa ˆby ✬ ✩✬ ✩ a b x y
Page 115 and 116: On Some Classes of Splicing Languag
Page 117 and 118: The Power of Networks of Watson-Cri
Page 129 and 130: Fixed Point Approach to Commutation
Page 135 and 136: Here the last one holds if and only
Page 139 and 140: � � � � � Fixed Point App
Page 143 and 144: Remarks on Relativisations and DNA
Page 149 and 150: Splicing Test Tube Systems and Thei
Page 151 and 152: Splicing Test Tube Systems 141 the
Page 153 and 154: Splicing Test Tube Systems 143 to a
Page 155 and 156: Splicing Test Tube Systems 145 6. R
Page 157 and 158: Splicing Test Tube Systems 147 (c)
Page 159 and 160: I0 Ai i Splicing Test Tube Systems
Page 161: Splicing Test Tube Systems 151 4. J
Page 165 and 166: Digital Information Encoding on DNA
Page 177 and 178: DNA-based Cryptography Ashish Gehan
Page 179 and 180: DNA-based Cryptography 169 concern.
Page 181 and 182: DNA-based Cryptography 171 The one-
Page 183 and 184: DNA-based Cryptography 173 of DNA t
Page 185 and 186: DNA-based Cryptography 175 Fig. 3.
Page 187 and 188: DNA-based Cryptography 177 the comp
Page 189 and 190: DNA-based Cryptography 179 can be c
Page 191 and 192: DNA-based Cryptography 181 4.4 DNA-
Page 193 and 194: DNA-based Cryptography 183 Fig. 7.
Page 195 and 196: DNA-based Cryptography 185 offer li
Page 197 and 198: DNA-based Cryptography 187 30. C. M
Page 199 and 200: Splicing to the Limit Elizabeth Goo
Page 201 and 202: Splicing to the Limit 191 molecular
Page 203 and 204: Splicing to the Limit 193 Discussio
Page 205 and 206: Example 9. The splicing rules are S
Page 207 and 208: −2α � kNNk = −2αMN, k α
Page 209 and 210: Finally we define the limit languag
Page 211 and 212: 6 Conclusion Splicing to the Limit
Page 213 and 214:
Formal Properties of Gene Assembly
Page 215 and 216:
(a) (b) Formal Properties of Gene A
Page 217 and 218:
Page 219 and 220:
Page 221 and 222:
Page 223 and 224:
n-Insertion on Languages Masami Ito
Page 225 and 226:
n-Insertion on Languages 215 3. ∀
Page 227 and 228:
n-Insertion on Languages 217 Theore
Page 229 and 230:
Transducers with Programmable Input
Page 231 and 232:
Page 233 and 234:
1 01 s 0 Transducers with Programma
Page 235 and 236:
order β l Transducers with Program
Page 237 and 238:
Page 239 and 240:
Page 241 and 242:
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
start tile Transducers with Program
Page 249 and 250:
Page 251 and 252:
Methods for Constructing Coded DNA
Page 253 and 254:
Page 255 and 256:
u u Methods for Constructing Coded
Page 257 and 258:
Page 259 and 260:
Page 261 and 262:
Page 263 and 264:
Page 265 and 266:
On the Universality of P Systems wi
Page 267 and 268:
Page 269 and 270:
Page 271 and 272:
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
An Algorithm for Testing Structure
Page 279 and 280:
Page 281 and 282:
Page 283 and 284:
Page 285 and 286:
Page 287 and 288:
Page 289 and 290:
Definition 1. Define the relation
Page 291 and 292:
280 Manfred Kudlek Definition 5. Co
Page 293 and 294:
On Languages of Cyclic Words 283 Th
Page 295 and 296:
On Languages of Cyclic Words 285 No
Page 297 and 298:
On Languages of Cyclic Words 287 Th
Page 299 and 300:
A DNA Algorithm for the Hamiltonian
Page 301 and 302:
Page 303 and 304:
Page 305 and 306:
Page 307 and 308:
Formal Languages Arising from Gene
Page 309 and 310:
Page 311 and 312:
Page 313 and 314:
Page 315 and 316:
Page 317 and 318:
Page 319 and 320:
A Proof of Regularity for Finite Sp
Page 321 and 322:
Page 323 and 324:
Page 325 and 326:
--- ◦ ❄ ⊗ γ ✲ A Proof of R
Page 327 and 328:
Page 329 and 330:
The Duality of Patterning in Molecu
Page 331 and 332:
The Duality of Patterning in Molecu
Page 333 and 334:
Membrane Computing: Some Non-standa
Page 335 and 336:
where: Membrane Computing: Some Non
Page 337 and 338:
where: Membrane Computing: Some Non
Page 339 and 340:
Page 341 and 342:
Page 343 and 344:
Page 345 and 346:
Page 347 and 348:
Page 349 and 350:
The P Versus NP Problem Through Cel
Page 351 and 352:
Page 353 and 354:
Page 355 and 356:
Page 357 and 358:
Page 359 and 360:
Page 361 and 362:
Page 363 and 364:
Realizing Switching Functions Using
Page 365 and 366:
Realizing Switching Functions Using
Page 367 and 368:
AND Gate Realizing Switching Functi
Page 369 and 370:
NOR Gate Realizing Switching Functi
Page 371 and 372:
Plasmids to Solve #3SAT Rani Siromo
Page 373 and 374:
Plasmids to Solve #3SAT 363 A comme
Page 375 and 376:
Plasmids to Solve #3SAT 365 from th
Page 377 and 378:
Communicating Distributed H Systems
Page 379 and 380:
Page 381 and 382:
Page 383 and 384:
Proof Communicating Distributed H S
Page 385 and 386:
Page 387 and 388:
Page 389 and 390:
Page 391 and 392:
Page 393 and 394:
Page 395 and 396:
Books: Publications by Thomas J. He
Page 397 and 398:
Publications by Thomas J. Head 387
Page 399 and 400:
Publications by Thomas J. Head 389
show all

LNCS 2950 - Aspects of Molecular Computing (Frontmatter Pages)

Create successful ePaper yourself

Delete template?

Save as template?