12.07.2015 Views

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

The dissertation of Andreas Stolcke is approved: University of ...

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CHAPTER 8. FUTURE DIRECTIONS 1828.2 No<strong>is</strong>y samplesA related problem concerns the potential presence <strong>of</strong> no<strong>is</strong>e in the sample, i.e., samples randomlyaltered by an independent process, such as random replacement <strong>of</strong> symbols. Plain model merging algorithmswill simply try to model the language resulting from the composition <strong>of</strong> the und<strong>is</strong>torted language and theno<strong>is</strong>e process. If the probabilities <strong>of</strong> d<strong>is</strong>tortions by no<strong>is</strong>e are small enough one could expect to recover theoriginal language model by pruning low probability parts <strong>of</strong> the induced model structures.Pruning techniques have indeed been applied successfully in conjunction with the HMM mergingalgorithm as applied to multiple-pronunciation models (Section 3.6.2) in cases were the training corpus wasknown to contain outlier (m<strong>is</strong>labeled) samples, as reported by Wooters (1993). Again, a formal characterization<strong>of</strong> the conditions under which th<strong>is</strong> method <strong>is</strong> successful would be obviously useful.8.3 More informative search heur<strong>is</strong>tics and biases<strong>The</strong> work so far clearly shows that, as expected, increased model expressiveness requires a morevaried repertoire <strong>of</strong> search operators, which in turn necessitates increasingly non-local search strategies.Carefully chosen heur<strong>is</strong>tics and macro operators can counter the increase in search complexity in some cases.Th<strong>is</strong> ra<strong>is</strong>es the question <strong>of</strong> what principled sources <strong>of</strong> search bias might be used. A main theme<strong>of</strong> the present studies was that a considerable amount <strong>of</strong> lingu<strong>is</strong>tic structure can be found by model merging,based solely on non-informative priors and ‘dumb’ search strategies. However, lingu<strong>is</strong>tic theories with stronga priori bases should help in cases where the uninformed approaches turn out to be insufficient. A prerequ<strong>is</strong>itefor th<strong>is</strong> approach <strong>is</strong> that the predictions <strong>of</strong> the lingu<strong>is</strong>tic theory can be cast into effective search heur<strong>is</strong>tics.8.4 Induction by model specializationPure model merging <strong>is</strong> based on operators that leave the weak generative capacity <strong>of</strong> the modelsunchanged or produce a strictly more inclusive model. We already mentioned an alternative, inverse approachin Chapter 3: model splitting or specialization. Given the local nature <strong>of</strong> the search process, we expectimprovements from adding operators that can effectively undo previous merging steps.Also, a learning algorithm that goes from general-to-specific incorporates a different global bias inthe face <strong>of</strong> local search: it will tend to err on the side <strong>of</strong> the too general, rather than the too specific model.Having both types <strong>of</strong> biases available allows choosing or mixing them according to the needs <strong>of</strong> particularapplications. For example, if it <strong>is</strong> important that the resulting model have high coverage <strong>of</strong> new data, onemight prefer over-general models.<strong>The</strong> approaches in the literature that make use <strong>of</strong> state splittingare so far based entirely on likelihoodcriteria. However, the Bayesian evaluation methods used in th<strong>is</strong> thes<strong>is</strong> are clearly separable from the mergingand search components <strong>of</strong> the overall approach, and should apply to other systems as well. A first practical step

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!