
The dissertation of Andreas Stolcke is approved: University of ...


CHAPTER 6. EFFICIENT PARSING WITH STOCHASTIC CONTEXT-FREE GRAMMARS

This section summarizes the necessary modifications to process null productions correctly, using the previous description as a baseline. Our treatment of null productions follows the (non-probabilistic) formulation of Graham et al. (1980), rather than the original one in Earley (1970).

6.4.7.1 Computing $\epsilon$-expansion probabilities

The main problem with null productions is that they allow multiple prediction-completion cycles in between the scanning steps (since null productions do not have to be matched against one or more input symbols). Our strategy will be to collapse all predictions and completions due to chains of null productions into the regular prediction and completion steps, not unlike the way recursive predictions/completions were handled in Section 6.4.5.

A prerequisite for this approach is to precompute, for all nonterminals $X$, the probability that $X$ expands to the empty string. Note that this is another recursive problem, since $X$ itself may not have a null production, but expand to some nonterminal $Y$ that does.

Computation of $P(X \Rightarrow^{*} \epsilon)$ for all $X$ can be cast as a system of non-linear equations, as follows. For each $X$, let $e_X$ be an abbreviation for $P(X \Rightarrow^{*} \epsilon)$. By way of example, suppose $X$ has productions

$$
\begin{array}{rcll}
X & \rightarrow & \epsilon & [p_1] \\
  & \rightarrow & Y_1 Y_2 & [p_2] \\
  & \rightarrow & Y_3 Y_4 Y_5 & [p_3] \\
  & \vdots & &
\end{array}
$$

The semantics of context-free rules imply that $X$ can only expand to $\epsilon$ if all of the RHS nonterminals in one of $X$'s productions expand to $\epsilon$. Translating to probabilities, we obtain the equation

$$
e_X = p_1 + p_2 \, e_{Y_1} e_{Y_2} + p_3 \, e_{Y_3} e_{Y_4} e_{Y_5} + \cdots
$$

In other words, each production contributes a term in which the rule probability is multiplied by the product of the $e$ variables corresponding to the RHS nonterminals, unless the RHS contains a terminal (in which case the production contributes nothing to $e_X$ because it cannot possibly lead to $\epsilon$).

The resulting non-linear system can be solved by iterative approximation. Each variable $e_X$ is initialized to $P(X \rightarrow \epsilon)$, and then repeatedly updated by substituting in the equation right-hand sides, until the desired level of accuracy is attained. Convergence is guaranteed since the $e_X$ values are monotonically increasing and bounded above by the true values $P(X \Rightarrow^{*} \epsilon) \le 1$. For grammars without cyclic dependencies among $\epsilon$-producing nonterminals this procedure degenerates to simple backward substitution. Obviously the system has to be solved only once for each grammar.

The probability $e_X$ can be seen as the precomputed inner probability of an expansion of $X$ to the empty string, i.e., it sums the probabilities of all Earley paths that derive $\epsilon$ from $X$. This is the justification for the way these probabilities can be used in modified prediction and completion steps, described next.
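The iterative approximation described above can be illustrated with a minimal Python sketch. It is not taken from the dissertation: the grammar encoding (a dict from each nonterminal to a list of `(probability, rhs)` pairs, with the empty tuple standing for a null production), the function name `epsilon_probabilities`, and the parameters `tol` and `max_iter` are all assumptions made for illustration. Each variable starts at the probability of the nonterminal's null production and is then updated by substituting the current values into the equation right-hand sides.

```python
from math import prod


def epsilon_probabilities(grammar, tol=1e-12, max_iter=10_000):
    """Approximate P(X =>* epsilon) for every nonterminal X.

    `grammar` is a hypothetical encoding: nonterminal -> list of
    (rule probability, tuple of RHS symbols). Any RHS symbol that is
    not a key of `grammar` is treated as a terminal.
    """
    nonterminals = set(grammar)

    # Initialization: probability of X's null production(s), 0 if none.
    e = {x: sum(p for p, rhs in rules if not rhs)
         for x, rules in grammar.items()}

    for _ in range(max_iter):
        new_e = {}
        for x, rules in grammar.items():
            total = 0.0
            for p, rhs in rules:
                # A production containing a terminal contributes nothing.
                if all(sym in nonterminals for sym in rhs):
                    # prod over an empty RHS is 1, so a null production
                    # contributes exactly its rule probability.
                    total += p * prod(e[sym] for sym in rhs)
            new_e[x] = total
        delta = max((abs(new_e[x] - e[x]) for x in nonterminals), default=0.0)
        e = new_e
        if delta < tol:  # desired accuracy reached
            break
    return e


# Toy grammar (invented for this example):
#   S -> A B   [1.0]
#   A -> eps   [0.5] | a [0.5]
#   B -> A A   [0.4] | b [0.6]
toy = {
    "S": [(1.0, ("A", "B"))],
    "A": [(0.5, ()), (0.5, ("a",))],
    "B": [(0.4, ("A", "A")), (0.6, ("b",))],
}
e = epsilon_probabilities(toy)
# By hand: e_A = 0.5, e_B = 0.4 * 0.5 * 0.5 = 0.1, e_S = 0.5 * 0.1 = 0.05
```

The toy grammar has no cyclic dependencies among its null-producing nonterminals, so the iteration settles after a few sweeps, which matches the remark that the procedure degenerates to backward substitution in that case. For a cyclic rule such as X → X X, the same loop keeps refining the values until the change falls below `tol`.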
