Festival Speech Synthesis System: - Speech Resource Pages
(set! rab_diphone_dir "/projects/festival/lib/voices/english/rab_diphone")
(set! rab_lpc_group
      (list
       '(name "rab_lpc_group")
       (list 'index_file
             (path-append rab_diphone_dir "group/rablpc16k.group"))
       '(alternates_left ((i ii) (ll l) (u uu) (i@ ii) (uh @) (a aa)
                          (u@ uu) (w @) (o oo) (e@ ei) (e ei)
                          (r @)))
       '(alternates_right ((i ii) (ll l) (u uu) (i@ ii)
                           (y i) (uh @) (r @) (w @)))
       '(default_diphone @-@@)
       '(grouped "true")))
(us_diphone_init rab_lpc_group)
20.4 Making groupfiles
The function us_make_group_file makes a group file from the currently selected US diphone database. It
loads all diphones in the database and saves them in the named file. An optional second argument, given as a
feature list, specifies how the group file is saved. There are three possible options:
track_file_format
The format for the coefficient files. By default this is est_binary; currently the only other alternative is
est_ascii.
sig_file_format
The format for the signal parts of the database. By default this is snd (Sun's Audio format). This was
chosen as it has the smallest header and supports various sample formats. Any format supported by the
Edinburgh Speech Tools is allowed.
sig_sample_format
The format for the samples in the signal files. By default this is mulaw, which is suitable when the signal files
are LPC residuals. LPC residuals have a much smaller dynamic range than plain PCM files. Because the mulaw
representation is half the size (8 bits) of standard PCM (16 bits), this significantly reduces the size of the
group file while only marginally altering the quality of synthesis (in experiments the effect was not
perceptible). However, when saving group files where the signals are not LPC residuals (e.g. in PSOLA), the
default mulaw is not recommended and short should probably be used.
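As a rough illustration of the options above, the following sketch saves a group file with 16-bit samples instead of the default mulaw, as might be done for non-residual signals. The output path is hypothetical, and the exact shape of the feature list (a list of name/value pairs with string values) is an assumption based on the option names described here; a US diphone database is assumed to have been selected already.

```scheme
;; Sketch only: assumes a US diphone database is currently selected.
;; "group/example16k.group" is a hypothetical output path.
(us_make_group_file
 "group/example16k.group"
 '((track_file_format "est_binary")   ; default coefficient format
   (sig_file_format "snd")            ; default signal file format
   (sig_sample_format "short")))      ; 16-bit samples rather than mulaw
```

Leaving out the second argument should give the defaults listed above (est_binary, snd, mulaw).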
20.5 UniSyn module selection
When selecting a voice, a UniSyn database may be selected as follows:
(set! UniSyn_module_hooks (list rab_diphone_const_clusters))
(set! us_abs_offset 0.0)
(set! window_factor 1.0)
(set! us_rel_offset 0.0)
(set! us_gain 0.9)
(Parameter.set 'Synth_Method 'UniSyn)
(Parameter.set 'us_sigpr 'lpc)
(us_db_select rab_db_name)
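In practice these settings would normally sit inside a voice definition function. The sketch below shows one way that might look. The function name voice_example_diphone is hypothetical, and the phone set, lexicon and prosody setup that a full voice needs are omitted. It assumes rab_diphone_const_clusters and rab_db_name are defined elsewhere, as in the listing above.

```scheme
;; Sketch only: voice_example_diphone is a hypothetical name.
(define (voice_example_diphone)
  "(voice_example_diphone)
Select a UniSyn diphone database for waveform synthesis (illustrative)."
  ;; Phone set, lexicon, duration and intonation setup omitted.
  (set! UniSyn_module_hooks (list rab_diphone_const_clusters))
  (set! us_abs_offset 0.0)
  (set! window_factor 1.0)
  (set! us_rel_offset 0.0)
  (set! us_gain 0.9)
  (Parameter.set 'Synth_Method 'UniSyn)   ; use the UniSyn synthesizer
  (Parameter.set 'us_sigpr 'lpc)          ; LPC signal processing
  (us_db_select rab_db_name)              ; make this database current
  (set! current-voice 'example_diphone)
  'example_diphone)
```

Calling (voice_example_diphone) would then make the database current before any utterance is synthesized.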