01.04.2015 Views

Sequence Comparison.pdf

Sequence Comparison.pdf

Sequence Comparison.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6.6 Generalizations of Spaced Seeds 113<br />

Table 6.4 Top-ranked spaced seeds for different similarity levels and weights. The sign ‘−’ means<br />

that the corresponding seed is not among the top 10 seeds for the similarity level and weight. This<br />

is reproduced from the paper [48] of Choi, Zeng, and Zhang; reprinted with permission of Oxford<br />

University Press.<br />

W<br />

Good spaced seeds<br />

Rank under<br />

a similarity level (%)<br />

65 70 75 80 85 90<br />

9 11*11*1*1***111<br />

11*1*11***1*111<br />

11*11**1*1**111<br />

10 11*11***11*1*111<br />

111**1*1**11*111<br />

11*11**1*1*1**111<br />

11 111*1**1*1**11*111<br />

111**1*11**1*1*111<br />

11*1*1*11**1**1111<br />

12 111*1*11*1**11*111<br />

111*1**11*1*11*111<br />

111**1*1*1**11*1111<br />

13 111*1*11**11**1*1111<br />

111*1**11*1**111*111<br />

111*11*11**1*1*1111<br />

14 111*111**1*11**1*1111<br />

1111*1**11**11*1*1111<br />

1111*1*1*11**11*1111<br />

15 1111**1*1*1*11**11*1111<br />

111*111**1*11**1*11111<br />

111*111*1*11*1**11111<br />

1111*11**11*1*1*11*1111<br />

16<br />

1111**11*1*1*11**11*1111<br />

1111*1**11*1*1**111*1111<br />

111*111*1**111*11*1111<br />

1 1 1 1 1 1<br />

2 2 2 2 2 3<br />

4 4 4 4 4 4<br />

1 1 1 1 1 1<br />

2 2 4 6 8 9<br />

8 6 2 2 2 5<br />

1 1 2 2 2 3<br />

2 2 1 1 1 1<br />

6 3 3 5 5 6<br />

1 1 1 1 1 1<br />

2 2 2 5 3 2<br />

6 3 3 2 4 4<br />

2 1 1 2 2 2<br />

7 2 2 1 1 1<br />

1 4 5 7 8 8<br />

2 1 1 1 1 1<br />

5 2 2 3 3 6<br />

1 3 7 - - -<br />

- 5 1 1 1 1<br />

- 1 2 5 5 4<br />

1 2 - - - -<br />

7 1 2 6 - -<br />

- 7 1 1 1 3<br />

1 9 - - - -<br />

T = { j 1 , j 2 ,···, j t }<br />

the set of its transition positions. Two sequences S 1 and S 2 exhibit a match of the<br />

transition seed P in positions x and y if, for 1 ≤ k ≤ m, S 1 [x−L Q +i k ]=S 2 [y−L Q +<br />

i k ] and, for 1 ≤ k ≤ t, S 1 [x − L Q + j k ]=S 2 [y − L Q + j k ], or two residues S 1 [x − L Q +<br />

j k ] and S 2 [y − L Q + j k ] are both pyrimidines or purines.<br />

The analytical studies presented in this chapter can be generalized to transition<br />

seeds in a straightforward manner. As a result, good transition seeds can be found<br />

using each of the approaches discussed in Section 6.5.1. Hedera and Mandala that<br />

were mentioned earlier can be used for transition seed design.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!