12.07.2015 Views

The dissertation of Andreas Stolcke is approved: University of ...

CHAPTER 6. EFFICIENT PARSING WITH STOCHASTIC CONTEXT-FREE GRAMMARS

c) The relation $X \Rightarrow_L^* Y$ is defined as the reflexive, transitive closure of $X \to_L Y$, i.e., $X \Rightarrow_L^* Y$ iff $X = Y$ or there is a nonterminal $Z$ such that $X \to_L Z$ and $Z \Rightarrow_L^* Y$.

d) The probabilistic reflexive, transitive left-corner relation $R_L = R_L(G)$ is a matrix of probability sums $R(X \Rightarrow_L^* Y)$. Each $R(X \Rightarrow_L^* Y)$ is defined as a series

\[
R(X \Rightarrow_L^* Y) = P(X = Y) + P(X \to_L Y) + \sum_{Z_1} P(X \to_L Z_1)\, P(Z_1 \to_L Y) + \sum_{Z_1, Z_2} P(X \to_L Z_1)\, P(Z_1 \to_L Z_2)\, P(Z_2 \to_L Y) + \cdots
\]

Alternatively, $R_L$ is defined by the recurrence relation

\[
R(X \Rightarrow_L^* Y) = \delta(X, Y) + \sum_{Z} P(X \to_L Z)\, R(Z \Rightarrow_L^* Y)
\]

($\delta(X, Y)$ denotes the Kronecker delta, defined as unity if $X = Y$, and zero otherwise).

The recurrence for $R_L$ can be conveniently written in matrix notation

\[
R_L = I + P_L R_L ,
\]

from which the closed-form solution is derived:

\[
R_L = (I - P_L)^{-1} .
\]

An existence proof for $R_L$ is given in Section 6.4.8. Appendix 6.6.3.1 shows how to speed up the computation of $R_L$ by inverting only a reduced version of the matrix $I - P_L$.

The significance of the matrix $R_L$ for the Earley algorithm is that its elements are the sums of the probabilities of the potentially infinitely many prediction paths leading from a state ${}_kX \to \lambda . Z \mu$ to a predicted state ${}_iY \to .\nu$, via any number of intermediate states.

$R_L$ can be computed once for each grammar, and used for table-lookup in the following, modified prediction step.

Prediction (probabilistic, transitive)

\[
i:\ {}_kX \to \lambda . Z \mu \quad [\alpha, \gamma] \;\Longrightarrow\; i:\ {}_iY \to .\nu \quad [\alpha', \gamma']
\]

for all productions $Y \to \nu$ such that $R(Z \Rightarrow_L^* Y)$ is non-zero. Then

\[
\alpha' \mathrel{+}= \alpha \, R(Z \Rightarrow_L^* Y) \, P(Y \to \nu) \tag{6.3}
\]

\[
\gamma' = P(Y \to \nu) \tag{6.4}
\]

Note the new $R(Z \Rightarrow_L^* Y)$ factor in the updated forward probability, which accounts for the sum of all path probabilities linking $Z$ to $Y$. For $Z = Y$ this covers the case of a single step of prediction; $R(Y \Rightarrow_L^* Y) \geq 1$ always, since $R_L$ is defined as a reflexive closure.
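The closed-form inverse and the table-lookup prediction step can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the dissertation: the toy grammar, the function names `left_corner_matrix`, `reflexive_transitive_closure` and `predict`, and the use of plain Gauss-Jordan elimination over exact fractions instead of the reduced-matrix speed-up mentioned in the text. The sketch builds the left-corner matrix (the total probability of productions for one nonterminal whose right-hand side starts with another), inverts the identity minus that matrix, and then applies the updates of equations (6.3) and (6.4) for a single predicting state.

```python
from fractions import Fraction

def left_corner_matrix(nonterminals, productions):
    """P_L[X][Y]: total probability of X-productions whose RHS starts with Y."""
    index = {nt: i for i, nt in enumerate(nonterminals)}
    n = len(nonterminals)
    p_l = [[Fraction(0)] * n for _ in range(n)]
    for lhs, rhs, prob in productions:
        if rhs and rhs[0] in index:  # left corner is a nonterminal
            p_l[index[lhs]][index[rhs[0]]] += prob
    return p_l

def reflexive_transitive_closure(p_l):
    """R_L = (I - P_L)^(-1), computed by Gauss-Jordan elimination."""
    n = len(p_l)
    # Augmented matrix [I - P_L | I]; the right half becomes the inverse.
    aug = [[Fraction(int(i == j)) - p_l[i][j] for j in range(n)]
           + [Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("I - P_L is singular; R_L does not exist")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def predict(alpha, z, nonterminals, productions, r_l):
    """Contribution of ONE predicting state with Z after the dot.

    Returns {(Y, nu): (alpha', gamma')}. In a full parser the alpha'
    values from all predicting states at the same position are summed.
    """
    index = {nt: i for i, nt in enumerate(nonterminals)}
    predicted = {}
    for lhs, rhs, prob in productions:
        r = r_l[index[z]][index[lhs]]
        if r != 0:
            predicted[(lhs, rhs)] = (alpha * r * prob, prob)
    return predicted

# Hypothetical toy grammar with a left-recursive rule.
NONTERMINALS = ["S", "B"]
PRODUCTIONS = [
    ("S", ("S", "a"), Fraction(1, 2)),
    ("S", ("B", "c"), Fraction(1, 4)),
    ("S", ("a",), Fraction(1, 4)),
    ("B", ("b",), Fraction(1)),
]

P_L = left_corner_matrix(NONTERMINALS, PRODUCTIONS)
R_L = reflexive_transitive_closure(P_L)
PREDICTED = predict(Fraction(1), "S", NONTERMINALS, PRODUCTIONS, R_L)
```

In this toy grammar the left-recursive rule S → S a has probability 1/2, so the S-to-S entry of the closure is the geometric series 1 + 1/2 + 1/4 + ... = 2. A single lookup therefore stands in for the infinitely many prediction loops, and the diagonal entry is at least 1, as the text states for a reflexive closure.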
