12.07.2015 Views

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

,,CHAPTER 7. -GRAMS FROM STOCHASTIC CONTEXT-FREE GRAMMARS 178Word error (%)Bigram estimated from raw data 33.7Bigram computed from SCFG 32.9by Monte-Carlo sampling 33.3Bigram from SCFG plus data 29.6SCFG 29.6Mixture Bigram-SCFG 28.8Table 7.2: Speech recognition accuracy using various language models.the other hand, <strong>is</strong> a standard weighted mixture <strong>of</strong> two d<strong>is</strong>tinct submodels, as described in Section 2.3.1.<strong>The</strong> experiments therefore support the argument made earlier that more soph<strong>is</strong>ticated languagemodels, even if far from perfect, can ¢ improve -gram estimates obtained directly from sample data. Wealso see that the bulk <strong>of</strong> the improvement does not come from using a SCFG alone, but from smoothing thebigram stat<strong>is</strong>tics through the constraints imposed by the SCFG (possibly combined with a mixture <strong>of</strong> the twolanguage models).7.7 SummaryWe have described an algorithm to compute in closed form the d<strong>is</strong>tribution ¢ <strong>of</strong> -grams for aprobabil<strong>is</strong>tic language given by a stochastic context-free grammar. <strong>The</strong> algorithm <strong>is</strong> based on computingsubstring expectations, which can be expressed as systems <strong>of</strong> linear equations derived from the grammar.L<strong>is</strong>ted below are the steps <strong>of</strong> the ¢ complete -gram-from-SCFG computation. For concreteness we give theversion specific to bigrams (¢ 6 2).1. Compute the prefix (left-corner) and suffix (right-corner) probabilities for each (nonterminal,word)pair.2. Compute the coefficient matrix and right-hand sides for the systems <strong>of</strong> linear equations, as per equations(7.4) and (7.5).3. LU decompose the coefficient matrix.4. Compute the unigram expectations for each word in the grammar, by solving the LU system for theunigram right-hand sides computed in step 2.5. Compute the bigram expectations for each word pair by solving the LU system for the bigram right-handsides computed in step 2.unigram expectation ( 1.9%0 .6. Compute each bigram probability +-, ( 2. ( 10 , by dividing the bigram expectation ( 1( 2.9%0 by the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!