12.07.2015 Views

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.3 <strong>The</strong> Toolkit 18Labelled Utterancesth ih s ih s p iy t shsh t iy s z ih s ih thTranscriptionsth ih s ih s p iy t shsh t iy s z ih s ih thUnlabelled UtterancesHInitHRestHCompVHERestHHEdSub-WordHMMsFig. 2.3Training Sub-word HMMsOnce an initial set of models has been created, the tool HERest is used to perform embeddedtraining using the entire training set. HERest performs a single Baum-Welch re-estimation of thewhole set of HMM phone models simultaneously. For each training utterance, the correspondingphone models are concatenated and then the forward-backward algorithm is used to accumulate thestatistics of state occupation, means, variances, etc., for each HMM in the sequence. When all ofthe training data has been processed, the accumulated statistics are used to compute re-estimatesof the HMM parameters. HERest is the core <strong>HTK</strong> training tool. It is designed to process largedatabases, it has facilities for pruning to reduce computation and it can be run in parallel across anetwork of machines.<strong>The</strong> philosophy of system construction in <strong>HTK</strong> is that HMMs should be refined incrementally.Thus, a typical progression is to start with a simple set of single Gaussian context-independentphone models and then iteratively refine them by expanding them to include context-dependencyand use multiple mixture component Gaussian distributions. <strong>The</strong> tool HHEd is a HMM definitioneditor which will clone models into context-dependent sets, apply a variety of parameter tyingsand increment the number of mixture components in specified distributions. <strong>The</strong> usual processis to modify a set of HMMs in stages using HHEd and then re-estimate the parameters of themodified set using HERest after each stage. To improve performance for specific speakers thetools HEAdapt and HVite can be used to adapt HMMs to better model the characteristics ofparticular speakers using a small amount of training or adaptation data. <strong>The</strong> end result of whichis a speaker adapted system.<strong>The</strong> single biggest problem in building context-dependent HMM systems is always data insufficiency.<strong>The</strong> more complex the model set, the more data is needed to make robust estimates of itsparameters, and since data is usually limited, a balance must be struck between complexity andthe available data. For continuous density systems, this balance is achieved by tying parameterstogether as mentioned above. Parameter tying allows data to be pooled so that the shared parameterscan be robustly estimated. In addition to continuous density systems, <strong>HTK</strong> also supports

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!