12.12.2012 Views

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

19.3 Klatt durations Klatt rules from book.<br />

19.4 CART durations Tree based durations<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

19.1 Default durations<br />

If parameter Duration_Method is set to Default, the simplest duration model is used. All segments are 100<br />

milliseconds (this can be modified by Duration_Stretch, and/or the localised Token related dur_stretch<br />

feature).<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

19.2 Average durations<br />

If parameter Duration_Method is set to Averages then segmental durations are set to their averages. The<br />

variable phoneme_durations should be an a-list of phones and averages in seconds. The file<br />

`lib/mrpa_durs.scm' has an example for the mrpa phoneset.<br />

If a segment is found that does not appear in the list a default duration of 0.1 seconds is assigned, and a warning<br />

message generated.<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

19.3 Klatt durations<br />

If parameter Duration_Method is set to Klatt the duration rules from the Klatt book (allen87, chapter 9). This<br />

method requires minimum and inherent durations for each phoneme in the phoneset. This information is held in the<br />

variable duration_klatt_params. Each member of this list is a three-tuple, of phone name, inherent duration<br />

and minimum duration. An example for the mrpa phoneset is in `lib/klatt_durs.scm'.<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

19.4 CART durations<br />

Two very similar methods of duration prediction by CART tree are supported. The first, used when parameter<br />

Duration_Method is Tree simply predicts durations directly for each segment. The tree is set in the variable<br />

duration_cart_tree.<br />

The second, which seems to give better results, is used when parameter Duration_Method is Tree_ZScores.<br />

In this second model the tree predicts zscores (number of standard deviations from the mean) rather than duration<br />

directly. (This follows campbell91, but we don't deal in syllable durations here.) This method requires means and<br />

standard deviations for each phone. The variable duration_cart_tree should contain the zscore prediction tree<br />

and the variable duration_ph_info should contain a list of phone, mean duration, and standard deviation for<br />

each phone in the phoneset.<br />

An example tree trained from 460 sentences spoken by Gordon is in `lib/gswdurtreeZ'. Phone means and<br />

standard deviations are in `lib/gsw_durs.scm'.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!