Festival Speech Synthesis System: - Speech Resource Pages


automatically created (one for each different architecture the system is compiled on). Scripts are added to this top level directory itself.

`./lib/voices/'
By default this contains the voices used by Festival, including their basic Scheme set-up functions as well as the diphone databases.

`./lib/dicts/'
This contains various lexicon files distributed as part of the system.

`./config/'
This contains the basic `Makefile' configuration files for compiling the system (run-time configuration is handled by Scheme in the `lib/' directory). The file `config/config', created as a copy of the standard `config/config-dist', is the installation-specific configuration. In most cases a simple copy of the distribution file will be sufficient.

`./src/'
The main C++/C source for the system.

`./src/lib/'
Where `libFestival.a' is built.

`./src/include/'
Where include files shared between various parts of the system live. The file `festival.h' provides access to most parts of the system.

`./src/main/'
Contains the top level C++ files for the actual executables. This is the directory where the executable binary `festival' is created.

`./src/arch/'
The main core of the Festival system. At present everything is held in a single sub-directory, `./src/arch/festival/', which contains the basic core of the synthesis system itself. It contains Lisp front ends to the core utterance architecture and phonesets, basic tools such as client/server support and ngram support, and an audio spooler.

`./src/modules/'
In contrast to the `arch/' directory, this contains the non-core parts of the system. A set of basic example modules is included with the standard distribution. These are the parts that do the synthesis; the other parts are just there to make module writing easier.

`./src/modules/base/'
This contains some basic simple modules that weren't quite big enough to deserve their own directory.
Most importantly it includes the Initialize module, called by many synthesis methods, which sets up an utterance structure and loads in initial values. This directory also contains phrasing, part of speech, and word (syllable and phone construction from words) modules.

`./src/modules/Lexicon/'
This is not really a module in the true sense (the Word module is the main user of this). It contains functions to construct, compile, and access lexicons (entries of words, part of speech and pronunciations). It also contains a letter-to-sound rule system.

`./src/modules/Intonation/'
This contains various intonation systems, from the very simple to quite complex parameter-driven intonation systems.

`./src/modules/Duration/'
This contains various duration prediction systems, from the very simple (fixed duration) to quite complex parameter-driven duration systems.

`./src/modules/UniSyn/'
A basic diphone synthesizer system, supporting a simple database format (which can be grouped into a more efficient binary representation). It is multi-lingual, and allows multiple databases to be loaded at once. It offers a choice of concatenation methods for diphones: residual excited LPC or PSOLA (TM) (which is not distributed).

`./src/modules/Text/'
Various text analysis functions, particularly the tokenizer and utterance segmenter (from arbitrary files). This directory also contains the support for text modes and SGML.

`./src/modules/donovan/'
An LPC based diphone synthesizer. Very small and neat.

`./src/modules/rxp/'
The Festival/Scheme front end to an XML parser written by Richard Tobin from the University of Edinburgh's Language Technology Group. rxp is now part of the speech tools rather than just Festival.

`./src/modules/parser'
A simple interface to the Stochastic Context Free Grammar parser in the speech tools library.
`./src/modules/diphone'
An optional module containing the previously used diphone synthesizer.

`./src/modules/clunits'
A partial implementation of a cluster unit selection algorithm as described in black97c.

`./src/modules/Database rjc_synthesis'
This consists of a new set of modules for doing waveform synthesis. They are intended to be unit-size independent (e.g. diphone, phone, non-uniform unit). Selection, prosodic modification, joining and signal processing are also separately defined. Unfortunately this code has not been exercised enough to be considered stable for use in the default synthesis method, but those working on new synthesis techniques may be interested in integrating with these new modules. They may be updated before the next full release of Festival.

`./src/modules/*'
Other optional directories may be present here, containing various research modules not yet part of the standard distribution. See below for descriptions of how to add modules to the basic system.

One intended use of Festival is to offer a software system where new modules may be easily tested in a stable environment. We have tried to make the addition of new modules easy, without requiring complex modifications to the rest of the system.

All of the basic modules should really be considered merely as example modules. Without much effort all of them could be improved.

27.2 Writing a new module

This section gives a simple example of writing a new module, showing the basic steps that must be done to create and add a new module that is available for the rest of the system to use. Note that many things can now be done solely in Scheme, and really only low-level, very intensive things (like waveform synthesizers) need be coded in C++.

27.2.1 Example 1: adding new modules

The example here is a duration module which sets durations of phones for a given list of averages.
To make this example more interesting, all durations in accented syllables are increased by a factor of 1.5. Note that this is just an example for its own sake; this (and much better techniques) could easily be done within the system as it is at present using a hand-crafted CART tree.

Our new module, called Duration_Simple, can most easily be added to the `./src/modules/Duration/' directory in a file `simdur.cc'. You can worry about the copyright notice, but after that you'll probably need the following includes

    #include

The module itself must be declared in a fixed form: it receives a single LISP form (an utterance) as an argument and returns that LISP form at the end. Thus our definition will start

    LISP FT_Duration_Simple(LISP utt) {

Next we need to declare an utterance structure and extract it from the LISP form. We also make a few other variable declarations
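
Independently of the Festival source, the rule this example implements (average phone durations, stretched in accented syllables) can be sketched in standalone C++. The sketch below is not the manual's `simdur.cc' and uses none of Festival's API. The `Phone' struct, the `simple_durations' function, the fallback duration for phones missing from the table, and the reading of "increased by 1.5" as multiplication by 1.5 are all assumptions made for illustration.

```cpp
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for one segment of an utterance. Festival's real
// utterance and item structures are deliberately not used here.
struct Phone {
    std::string name;  // phone label
    bool accented;     // true if the phone belongs to an accented syllable
    double end;        // end time in seconds, filled in by simple_durations
};

// Walk the phones in order and assign cumulative end times.
// Each phone gets its average duration from `averages`; phones in accented
// syllables are multiplied by `stretch` (assumed multiplicative).
// Phones not found in the table use `default_dur` (an assumed fallback).
inline void simple_durations(std::vector<Phone>& phones,
                             const std::map<std::string, double>& averages,
                             double stretch = 1.5,
                             double default_dur = 0.1)
{
    double end = 0.0;
    for (Phone& p : phones) {
        auto it = averages.find(p.name);
        double dur = (it != averages.end()) ? it->second : default_dur;
        if (p.accented)
            dur *= stretch;
        end += dur;
        p.end = end;
    }
}
```

Calling `simple_durations' on a vector of phones with a table of averages fills in each phone's `end' field in a single pass. A module written against Festival would do the same walk over the utterance's segments and return the utterance as a LISP form, as described above.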