The dissertation of Andreas Stolcke is approved: University of ...
CHAPTER 6. EFFICIENT PARSING WITH STOCHASTIC CONTEXT-FREE GRAMMARS 138

Lemma 6.4 Let

$$ {}_{k_1}X_1 \to \lambda_1 X_2. \;\Longrightarrow\; {}_{k_2}X_2 \to \lambda_2 X_3. \;\Longrightarrow\; \cdots \;\Longrightarrow\; {}_{k_c}X_c \to \lambda_c X_{c+1}. $$

be a completion cycle, i.e., $k_1 = k_c$, $X_1 = X_c$, $\lambda_1 = \lambda_c$, $X_2 = X_{c+1}$. Then it must be the case that $\lambda_1 = \lambda_2 = \cdots = \lambda_c = \epsilon$, i.e., all productions involved are unit productions $X_1 \to X_2, \ldots, X_c \to X_{c+1}$.

Proof. For all completion chains it is true that the start indices of the states are monotonically ordered along the chain, $k_1 \ge k_2 \ge \cdots \ge k_c$ (a state can only complete an expansion that started at the same or a previous position). From $k_1 = k_c$ it follows that $k_1 = k_2 = \cdots = k_c$. Because the current position (dot) also refers to the same input index in all states, all nonterminals $X_1, X_2, \ldots, X_c$ have been expanded into the same substring of the input between $k_1$ and the current position. By assumption the grammar contains no nonterminals that generate $\epsilon$,¹³ therefore we must have $\lambda_1 = \lambda_2 = \cdots = \lambda_c = \epsilon$, q.e.d.

We now formally define the relation between nonterminals mediated by unit productions, analogous to the left-corner relation.

Definition 6.6 The following definitions are relative to a given SCFG $G$.

a) Two nonterminals $X$ and $Y$ are said to be in a unit-production relation $X \to Y$ iff there exists a production for $X$ that has $Y$ as its RHS.

b) The probabilistic unit-production relation $P_U = P_U(G)$ is the matrix of probabilities $P(X \to Y)$.

c) The relation $X \Rightarrow^* Y$ is defined as the reflexive, transitive closure of $X \to Y$, i.e., $X \Rightarrow^* Y$ iff $X = Y$ or there is a nonterminal $Z$ such that $X \to Z$ and $Z \Rightarrow^* Y$.

d) The probabilistic reflexive, transitive unit-production relation $R_U = R_U(G)$ is the matrix of probability sums $R(X \Rightarrow^* Y)$. Each $R(X \Rightarrow^* Y)$ is defined as a series

$$
\begin{aligned}
R(X \Rightarrow^* Y) &= P(X = Y) + P(X \to Y) + \sum_{Z_1} P(X \to Z_1)\, P(Z_1 \to Y) \\
&\quad + \sum_{Z_1, Z_2} P(X \to Z_1)\, P(Z_1 \to Z_2)\, P(Z_2 \to Y) + \cdots \\
&= \delta(X, Y) + \sum_{Z} P(X \to Z)\, R(Z \Rightarrow^* Y),
\end{aligned}
$$

where $\delta(X, Y)$ is 1 if $X = Y$ and 0 otherwise.

As before, a matrix inversion can compute the relation $R_U$ in closed form:

$$ R_U = (I - P_U)^{-1}. $$

¹³ Even with null productions, these would not be used for Earley transitions, see Section 6.4.7.
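As an illustration of the closed form $R_U = (I - P_U)^{-1}$, the following is a minimal Python sketch, not code from the dissertation. The three nonterminals `S`, `A`, `B` and their unit-production probabilities are hypothetical, chosen so that `A` and `B` form a unit-production cycle. The helper names (`identity`, `invert`, `unit_closure`) are likewise assumptions made for this example. Exact rational arithmetic is used so the result can be compared directly with the series definition.

```python
from fractions import Fraction


def identity(n):
    """Return the n x n identity matrix with Fraction entries."""
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]


def invert(m):
    """Invert a square matrix of Fractions by Gauss-Jordan elimination."""
    n = len(m)
    # Augment m with the identity matrix: [m | I].
    aug = [list(row) + id_row for row, id_row in zip(m, identity(n))]
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero pivot.
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so that the pivot becomes 1.
        scale = 1 / aug[col][col]
        aug[col] = [x * scale for x in aug[col]]
        # Clear the pivot column in every other row.
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    # The right half of the augmented matrix now holds the inverse.
    return [row[n:] for row in aug]


def unit_closure(p_u):
    """Compute R_U = (I - P_U)^(-1) for a unit-production matrix P_U."""
    n = len(p_u)
    eye = identity(n)
    i_minus_p = [[eye[i][j] - p_u[i][j] for j in range(n)] for i in range(n)]
    return invert(i_minus_p)


# Hypothetical grammar fragment (only unit productions are listed):
#   S -> A  with probability 1/2
#   A -> B  with probability 1/2
#   B -> A  with probability 1/4
NONTERMINALS = ["S", "A", "B"]
P_U = [
    [Fraction(0), Fraction(1, 2), Fraction(0)],  # row for S
    [Fraction(0), Fraction(0), Fraction(1, 2)],  # row for A
    [Fraction(0), Fraction(1, 4), Fraction(0)],  # row for B
]
R_U = unit_closure(P_U)

for name, row in zip(NONTERMINALS, R_U):
    print(name, [str(x) for x in row])
```

In this example the entry for `A` to `A` is 8/7, which matches the geometric series 1 + 1/8 + 1/64 + ... obtained by going around the `A`, `B` cycle zero or more times. Entries of $R_U$ are sums of probabilities over unit-production chains, so they can exceed 1.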