12.12.2012 Views

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

(set! rab_diphone_dir "/projects/festival/lib/voices/english/rab_diphone")<br />

(set! rab_lpc_group<br />

(list<br />

'(name "rab_lpc_group")<br />

(list 'index_file<br />

(path-append rab_diphone_dir "group/rablpc16k.group"))<br />

'(alternates_left ((i ii) (ll l) (u uu) (i@ ii) (uh @) (a aa)<br />

(u@ uu) (w @) (o oo) (e@ ei) (e ei)<br />

(r @)))<br />

'(alternates_right ((i ii) (ll l) (u uu) (i@ ii)<br />

(y i) (uh @) (r @) (w @)))<br />

'(default_diphone @-@@)<br />

'(grouped "true")))<br />

(us_dipohone_init rab_lpc_group)<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

20.4 Making groupfiles<br />

The function us_make_group_file will make a group file of the currently selected US diphone database. It<br />

loads in all diphone sin the dtabaase and saves them in the named file. An optional second argument allows<br />

specification of how the group file will be saved. These options are as a feature list. There are three possible options<br />

track_file_format<br />

The format for the coefficient files. By default this is est_binary, currently the only other alternative is<br />

est_ascii.<br />

sig_file_format<br />

The format for the signal parts of the of the database. By default this is snd (Sun's Audio format). This was<br />

choosen as it has the smallest header and supports various sample formats. Any format supported by the<br />

Edinburgh <strong>Speech</strong> Tools is allowed.<br />

sig_sample_format<br />

The format for the samples in the signal files. By default this is mulaw. This is suitable when the signal files<br />

are LPC residuals. LPC residuals have a much smaller dynamic range that plain PCM files. Because mulaw<br />

representation is half the size (8 bits) of standard PCM files (16bits) this significantly reduces the size of the<br />

group file while only marginally altering the quality of synthesis (and from experiments the effect is not<br />

perceptible). However when saving group files where the signals are not LPC residuals (e.g. in PSOLA) using<br />

this default mulaw is not recommended and short should probably be used.<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

20.5 UniSyn module selection<br />

In a voice selection a UniSyn database may be selected as follows<br />

(set! UniSyn_module_hooks (list rab_diphone_const_clusters ))<br />

(set! us_abs_offset 0.0)<br />

(set! window_factor 1.0)<br />

(set! us_rel_offset 0.0)<br />

(set! us_gain 0.9)<br />

(Parameter.set 'Synth_Method 'UniSyn)<br />

(Parameter.set 'us_sigpr 'lpc)<br />

(us_db_select rab_db_name)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!