Sphinx Guideline version 1
Sphinx Guideline version 1
Sphinx Guideline version 1
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Chapter1.<br />
Introduction<br />
CMU <strong>Sphinx</strong> is one of the most popular speech recognition applications for Linux and it can correctly<br />
capture words. It also gives the developers the ability to build speech systems, interact with<br />
voice and build something unique and useful. Speech recognition in <strong>Sphinx</strong> is based on Hidden<br />
Markov model (HMM).<br />
An HMM-based system, like all other speech recognition systems, functions by first learning the<br />
characteristics (or parameters) of a set of sound units, and then using what it has learned about the<br />
units to find the most probable sequence of sound units for a given speech signal. The process of<br />
learning about the sound units is called training. The process of using the knowledge acquired to<br />
deduce the most probable sequence of units in a given signal is called decoding, or simply recognition.<br />
Accordingly, you will need those components of the SPHINX system that you can use for training<br />
and for recognition. In other words, you will need the SPHINX Trainer and a SPHINX decoder.<br />
CMU <strong>Sphinx</strong> toolkit has a number of packages for different tasks and applications. It’s sometimes<br />
confusing what to choose. To clean-up, here is the list:<br />
• <strong>Sphinx</strong>base — support library<br />
• <strong>Sphinx</strong>3 v8 — adjustable, modifiable recognizer<br />
• CMUclmtk — language model tools<br />
• <strong>Sphinx</strong>train — acoustic model training tools<br />
You can download CMU <strong>Sphinx</strong> Toolkit from sourceforge.<br />
For installing <strong>Sphinx</strong>, Follow the guide:<br />
Installation guide.<br />
1