12.07.2015 Views

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

The HTK Book Steve Young Gunnar Evermann Dan Kershaw ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 2An Overview of the <strong>HTK</strong> ToolkitDarpa TIM ITNISTEntropic<strong>The</strong> basic principles of HMM-based recognition were outlined in the previous chapter and anumber of the key <strong>HTK</strong> tools have already been mentioned. This chapter describes the softwarearchitecture of a <strong>HTK</strong> tool. It then gives a brief outline of all the <strong>HTK</strong> tools and the way thatthey are used together to construct and test HMM-based recognisers. For the benefit of existing<strong>HTK</strong> users, the major changes in recent versions of <strong>HTK</strong> are listed. <strong>The</strong> following chapter will thenillustrate the use of the <strong>HTK</strong> toolkit by working through a practical example of building a simplecontinuous speech recognition system.2.1 <strong>HTK</strong> Software ArchitectureMuch of the functionality of <strong>HTK</strong> is built into the library modules. <strong>The</strong>se modules ensure thatevery tool interfaces to the outside world in exactly the same way. <strong>The</strong>y also provide a centralresource of commonly used functions. Fig. 2.1 illustrates the software structure of a typical <strong>HTK</strong>tool and shows its input/output interfaces.User input/output and interaction with the operating system is controlled by the library moduleHShell and all memory management is controlled by HMem. Math support is provided by HMathand the signal processing operations needed for speech analysis are in HSigP. Each of the file typesrequired by <strong>HTK</strong> has a dedicated interface module. HLabel provides the interface for label files,HLM for language model files, HNet for networks and lattices, HDict for dictionaries, HVQ forVQ codebooks and HModel for HMM definitions.14

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!