12.07.2015 Views

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

CHAPTER 6. EFFICIENT PARSING WITH STOCHASTIC CONTEXT-FREE GRAMMARS 163th<strong>is</strong> incongruity into an advantage by using the LR parser as a probabil<strong>is</strong>tic model its own right, and showhow LR probabilities can be extended to capture non-context-free contingencies.<strong>The</strong> problem <strong>of</strong> capturing more complex d<strong>is</strong>tributional constraints in natural language <strong>is</strong> clearlyimportant, but well beyond the scope <strong>of</strong> th<strong>is</strong> chapter. We simply remark that it should be possible to define‘interesting’ non-standard probabilities in terms <strong>of</strong> Earley parser actions so as to better model non-context-freephenomena. Apart from such considerations, the choice between LR methods and Earley parsing <strong>is</strong> a typicalspace-time trade<strong>of</strong>f. Even though an Earley parser runs with the same linear time and space complexity as anLR parser on grammars <strong>of</strong> the appropriate LR class, the constant factors involved will be much in favor <strong>of</strong> theLR parser as almost all the work has already been compiled into its transition and action table. However, thesize <strong>of</strong> LR parser tables can be exponential in the size <strong>of</strong> the grammar (due to the number <strong>of</strong> potential itemsubsets). Furthermore, if the generalized LR method <strong>is</strong> used for dealing with non-determin<strong>is</strong>tic grammars(Tomita 1986) the runtime on arbitrary inputs may also grow exponentially.<strong>The</strong> bottom line <strong>is</strong> that each application’s needs have to be evaluated against the pros and cons<strong>of</strong> both approaches to find the best solution. From a theoretical point <strong>of</strong> view, the Earley approach hasthe inherent appeal <strong>of</strong> being the more general (and exact) solution to the computation <strong>of</strong> the various SCFGprobabilities.6.7.4 Other related work<strong>The</strong> literature on Earley-based probabil<strong>is</strong>tic parsers <strong>is</strong> sparse, presumably because <strong>of</strong> the precedentset by the Inside/Outside algorithm, which <strong>is</strong> more naturally formulated as a bottom-up algorithm.Schabes (1991) shows that Earley’s algorithm can be augmented with the computation <strong>of</strong> inner andouter probabilities, in much the same way as presented here. However, the algorithm presented <strong>is</strong> not fullygeneral as it <strong>is</strong> restricted to sentences with bounded ambiguity, i.e., there are no prov<strong>is</strong>ions for handling unitproduction cycles. Schabes focuses on grammar estimation (using generalized inside and outside probabilities)and only mentions the potential for computing prefix probabilities.Magerman & Marcus (1991) are interested primarily in scoring functions to guide a parser efficientlyto the most prom<strong>is</strong>ing parses. <strong>The</strong>y use Earley-style top-down prediction only to suggest worthwhile parses,not to compute prec<strong>is</strong>e probabilities. 24Both Nakagawa (1987) and Päseler (1988) use a non-probabil<strong>is</strong>tic Earley parser augmented with‘word match’ scoring. Though not truly probabil<strong>is</strong>tic, these algorithms are similar to the Viterbi versiondescribed here, in that they find a parse that optimizes the accumulated matching scores (without regard torule probabilities). Prediction and completion loops do not come into play since no prec<strong>is</strong>e inner or forwardprobabilities are computed.Dan Jurafsky (personal communication) had written an Earley parser for the Berkeley RestaurantProject (BeRP) speech understanding system that computed forward probabilities for restricted grammars(without left-corner or unit production recursion). <strong>The</strong> parser now uses the methods described here to provide24 <strong>The</strong>y argue that prec<strong>is</strong>e probabilities are inappropriate in natural language parsing.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!