01.04.2015 Views

Sequence Comparison.pdf

Sequence Comparison.pdf

Sequence Comparison.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

160 8 Scoring Matrices<br />

k variables with m constraints to a problem in k + m variable with no constraints.<br />

The new objective function is a linear combination of the original objective function<br />

and the m constraints in which the coefficient of each constraint is a scalar variable<br />

called the Lagrange multiplier.<br />

Here we introduce 20 Lagrange multipliers α i for the constraints in (8.13), 19<br />

Lagrange multipliers β j for the first 19 constraints in (8.14), and additional Lagrange<br />

multiplier γ for the constraint in (8.15). To simplify our description, we define β 20 =<br />

0. Consider the Lagrangian<br />

F ((Q ij ),(α i ),(β j ),γ)<br />

= D(Q ij )+∑α i<br />

(P i −∑<br />

i<br />

j<br />

[ ( )<br />

q ij<br />

+γ ∑q ij ln<br />

ij<br />

P i P<br />

j<br />

′<br />

Q ij<br />

)<br />

+∑<br />

−∑Q ij ln<br />

ij<br />

)<br />

β j<br />

(P j ′ −∑Q ij<br />

j<br />

i<br />

( )]<br />

Q ij<br />

P i P ′ j<br />

. (8.16)<br />

Setting the partial derivative of the Lagrangian F with respect to each of the Q ij<br />

equal to 0, we obtain that<br />

( ) ( ( ( ) ))<br />

Qij<br />

Q ij<br />

ln + 1 − α i + β j + γ ln<br />

q ij P i P<br />

j<br />

′ + 1 = 0. (8.17)<br />

The multidimensional Newtonian method may be applied to equations (8.13) –<br />

(8.15) and (8.17) to obtain the unique optimal solution (Q ij ).<br />

After (Q ij ) is found, we calculate the associated scoring matrix (S ij ) as<br />

( )<br />

S ij = 1 λ ln Q ij<br />

P i P<br />

j<br />

′ ,<br />

which has the same λ as the original scoring matrix (s ij ). The constraint (8.17) may<br />

be rewritten as<br />

Q ij = e (αi−1)/(1−γ) e (β j+γ)/(1−γ) q 1/(1−γ) (<br />

ij Pi P j<br />

′ ) −γ/(1−γ) .<br />

Table 8.4 PAM substitution scores (bits) calculated from (8.18) in the uniform model.<br />

PAM distance Match score Mismatch score Information per position<br />

5 1.928 -3.946 1.64<br />

30 1.588 -1.593 0.80<br />

47 1.376 -1.096 0.51<br />

70 1.119 -0.715 0.28<br />

120 0.677 -0.322 0.08

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!