12.12.2012 Views

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

Festival Speech Synthesis System: - Speech Resource Pages

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

supported to allow smaller databases if desired. A full English LPC plus residual database at 8k ulaw is about 3<br />

megabytes, while a full 16 bit version at 16k is about 8 megabytes.<br />

Group files are created with the Diphone.group command which takes a database name and an output filename<br />

as an argument. Making group files can take some time especially if they are large. The group_type parameter<br />

specifies raw or ulaw for encoding signal files. This can significantly reduce the size of databases.<br />

Group files may be partially loaded (see access strategies) at run time for quicker start up and to minimise run-time<br />

memory requirements.<br />

[ < ] [ > ] [ > ] [Top] [Contents] [Index] [ ? ]<br />

21.4 Diphone_Init<br />

The basic method for describing a database is through the Diphone_Init command. This function takes a single<br />

argument, a list of pairs of parameter name and value. The parameters are<br />

name<br />

An atomic name for this database.<br />

group_file<br />

The filename of a group file, which may itself contain parameters describing itself<br />

type<br />

The default value is pcm, but for distributed voices this is always lpc.<br />

index_file<br />

A filename containing the diphone dictionary.<br />

signal_dir<br />

A directory (slash terminated) containing the pcm waveform files.<br />

signal_ext<br />

A dot prefixed extension for the pcm waveform files.<br />

pitch_dir<br />

A directory (slash terminated) containing the pitch mark files.<br />

pitch_ext<br />

A dot prefixed extension for the pitch files<br />

lpc_dir<br />

A directory (slash terminated) containing the LPC coefficient files and residual files.<br />

lpc_ext<br />

A dot prefixed extension for the LPC coefficient files<br />

lpc_type<br />

The type of LPC file (as supported by the speech tools)<br />

lpc_frame_offset<br />

The number of frames "missing" from the beginning of the file. Often LPC parameters are offset by one<br />

frame.<br />

lpc_res_ext<br />

A dot prefixed extension for the residual files<br />

lpc_res_type<br />

The type of the residual files, this is a standard waveform type as supported by the speech tools.<br />

lpc_res_offset<br />

Number of seconds "missing" from the beginning of the residual file. Some LPC analysis technique do not<br />

generate a residual until after one frame.<br />

samp_freq<br />

Sample frequency of signal files<br />

phoneset<br />

Phoneset used, must already be declared.<br />

num_diphones<br />

Total number of diphones in database. If specified this must be equal or bigger than the number of entries in<br />

the index file. If it is not specified the square of the number of phones in the phoneset is used.<br />

sig_band<br />

number of sample points around actual diphone to take from file. This should be larger than any windowing

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!