Festival Speech Synthesis System: - Speech Resource Pages

These types are used when calling the function utt.synth; individual modules may also be called explicitly by hand if required.

Because we expect waveform synthesis methods to themselves become complex, with a defined set of functions to select, join, and modify units, we now support an additional notion of SynthTypes. Like UttTypes, these define a set of functions to apply to an utterance. They may be defined using the defSynthType function. For example

    (defSynthType Festival
      (print "synth method Festival")

      (print "select")
      (simple_diphone_select utt)

      (print "join")
      (cut_unit_join utt)

      (print "impose")
      (simple_impose utt)
      (simple_power utt)

      (print "synthesis")
      (frames_lpc_synthesis utt)
      )

A SynthType is selected by naming it as the value of the parameter Synth_Method.

During the application of the function utt.synth, three hooks are applied. These allow additional control of the synthesis process.

before_synth_hooks
    Applied before any modules are applied.

after_analysis_hooks
    Applied at the start of Wave_Synth, when all text, linguistic and prosodic processing has been done.

after_synth_hooks
    Applied after all modules have been applied.

These are useful for things such as altering the volume of a voice that happens to be quieter than others, or outputting information for a talking head before waveform synthesis occurs, so that preparation of the facial frames and synthesis of the waveform may be done in parallel. (See `festival/examples/th-mode.scm' for an example use of these hooks for a talking head text mode.)

14.3 Example utterance types

A number of utterance types are currently supported. It is easy to add new ones, but the standard distribution includes the following.

Text
    Raw text as a string.

        (Utterance Text "This is an example")

Words
    A list of words.

        (Utterance Words (this is an example))

    Words may be atomic or lists if further features need to be specified.
    For example, to specify a word and its part of speech you can use

        (Utterance Words (I (live (pos v)) in (Reading (pos n) (tone H-H%))))

    Note: the use of the tone feature requires an intonation mode that supports it. Any feature and value named in the input will be added to the Word item.

Phrase
    This allows explicit phrasing and features on Tokens to be specified. The input consists of a list of phrases, each of which contains a list of tokens.

        (Utterance Phrase
          ((Phrase ((name B))
             I saw the man
             (in ((EMPH 1)))
             the park)
           (Phrase ((name BB))
             with the telescope)))

    ToBI tones and accents may also be specified on Tokens, but these will only take effect if the selected intonation method uses them.

Segments
    This allows specification of segments, durations and F0 target values.

        (Utterance Segments
          ((# 0.19 )
           (h 0.055 (0 115))
           (@ 0.037 (0.018 136))
           (l 0.064 )
           (ou 0.208 (0.0 134) (0.100 135) (0.208 123))
           (# 0.19)))

    Note that the times are in seconds, NOT milliseconds. The format of each segment entry is segment name, duration in seconds, and a list of target values. Each target value is a pair: the position of the point within the segment (in seconds) and the F0 value in Hz.

Phones
    This allows a simple specification of a list of phones. Synthesis applies fixed durations (specified in FP_duration, default 100 ms) and monotone intonation (specified in FP_F0, default 120 Hz). This may be used for simple checks of waveform synthesizers etc.

        (Utterance Phones (# h @ l ou #))

    Note that the function SayPhones allows synthesis and playing of lists of phones through this utterance type.

Wave
    A waveform file. Synthesis here simply involves loading the file.

        (Utterance Wave fred.wav)

Others are supported, as defined in `lib/synthesis.scm', but are used internally by various parts of the system. These include Tokens, used in TTS, and SegF0, used by utt.resynth.

14.4 Utterance modules

The module is the basic unit that does the work of synthesis. Within Festival there are duration modules, intonation modules, wave synthesis modules etc. As stated above, the utterance type defines the set of modules which are to be applied to the utterance. These modules in turn create relations and items so that ultimately a waveform is generated, if required.

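As a rough illustration of modules being applied to an utterance one at a time, the following sketch steps a Text utterance through its modules by hand rather than calling utt.synth. The module list is an assumption: it follows what the Text type in `lib/synthesis.scm' is understood to contain in a standard distribution, so check it against your own installation. The variable name utt1 is purely illustrative, and the comments describe the stages only loosely.

```scheme
;; Sketch only: apply the modules of the Text utterance type by hand.
;; ASSUMPTION: this module list mirrors the Text type defined in
;; lib/synthesis.scm; it may differ between Festival versions.
(set! utt1 (Utterance Text "This is an example"))

;; Text and token processing
(Initialize utt1)
(Text utt1)
(Token_POS utt1)
(Token utt1)

;; Linguistic processing
(POS utt1)
(Phrasify utt1)
(Word utt1)
(Pauses utt1)

;; Prosodic processing
(Intonation utt1)
(PostLex utt1)
(Duration utt1)
(Int_Targets utt1)

;; Waveform generation, then playback
(Wave_Synth utt1)
(utt.play utt1)
```

Stopping part way through this sequence is one way to inspect the relations and items that each module has created so far.
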
Many of the chapters in this manual are solely concerned with particular modules in the system. Note that many modules have internal choices, such as which duration method or which intonation method to use. Such general choices are often made through the Parameter system. Parameters may be set for different features such as Duration_Method, Synth_Method, etc. Formerly the values for these parameters were atomic values but now