Festival Speech Synthesis System: - Speech Resource Pages

Also, if syllabification is required, there is an opportunity to run a set of "letter-to-sound" rules on the input (actually an arbitrary re-write rule system). If the variable lex_lts_set is set, the LTS ruleset of that name is applied to the flat input before syllabification. This allows simple predictable changes, such as converting final r into a longer vowel for English RP from American-labelled lexicons.

A list of all matching entries in the addenda and the compiled lexicon may be found with the function lex.lookup_all. This function takes a word and returns all matching entries irrespective of part of speech.

You can optionally intercept words as they are looked up, and after they have been found, through pre_hooks and post_hooks for each lexicon. This allows a function or list of functions to be applied to a word and features before lookup, or to the resulting entry after lookup. The following example shows how to add voice-specific entries to a general lexicon without affecting other voices that use that lexicon.

For example, suppose we were trying to use a Scottish English voice with the US English (cmu) lexicon. A number of entries will be inappropriate, but we can redefine some entries thus

    (set! cmu_us_awb::lexicon_addenda
          '(
            ("edinburgh" n (((eh d) 1) ((ax n) 0) ((b r ax) 0)))
            ("poem" n (((p ow) 1) ((y ax m) 0)))
            ("usual" n (((y uw) 1) ((zh ax l) 0)))
            ("air" n (((ey r) 1)))
            ("hair" n (((hh ey r) 1)))
            ("fair" n (((f ey r) 1)))
            ("chair" n (((ch ey r) 1)))))

We can then define a function that checks whether the word looked up is in the speaker-specific exception list and, if so, uses that entry instead.

    (define (cmu_us_awb::cmu_lookup_post entry)
      "(cmu_us_awb::cmu_lookup_post entry)
    Speaker specific lexicon addenda."
      (let ((ne (assoc_string (car entry) cmu_us_awb::lexicon_addenda)))
        (if ne ne entry)))

Then, for the particular voice set up, we need to add both a selection part and a reset part, following the FestVox conventions for voice set up.
    (define (cmu_us_awb::select_lexicon)
      ...
      (lex.select "cmu")
      ;; Get old var for reset and to append our function to it
      (set! cmu_us_awb::old_cmu_post_hooks
            (lex.set.post_hooks nil))
      (lex.set.post_hooks
       (append cmu_us_awb::old_cmu_post_hooks
               (list cmu_us_awb::cmu_lookup_post)))
      ...
    )

    ...

    (define (cmu_us_awb::reset_lexicon)
      ...
      ;; reset CMU's post_hooks back to original
      (lex.set.post_hooks cmu_us_awb::old_cmu_post_hooks)
      ...
    )

The above isn't the most efficient way, as the word is looked up first and only then checked against the speaker-specific list.

The pre_hooks functions are called with two arguments, the word and features, and should return a pair of word and features.

13.4 Letter to sound rules

Each lexicon may define what action to take when a word cannot be found in the addenda or the compiled lexicon. There are a number of options, which will hopefully be added to as more general letter to sound rule systems are added.

The method is set by the command

    (lex.set.lts.method METHOD)

where METHOD can be any of the following

`Error'
    Throw an error when an unknown word is found (default).

`lts_rules'
    Use an externally specified set of letter to sound rules (described below). The name of the rule set to use is defined with the lex.lts.ruleset function. This method runs one set of rules on an exploded form of the word and assumes the rules return a list of phonemes (in the appropriate set). If multiple instances of rules are required, use the function method described next.

`none'
    This returns an entry with a nil pronunciation field. This will only be valid in very special circumstances.

`FUNCTIONNAME'
    Call the LISP function of this name. The function is given two arguments: the word and the part of speech. It should return a valid lexical entry.

The basic letter to sound rule system is very simple but is powerful enough to build reasonably complex letter to sound rules. Although we've found trained LTS rules better than hand-written ones (for complex languages), where no data is available and rules must be hand written, the following rule formalism is much easier to use than that generated by the LTS training system (described in the next section).
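As a minimal sketch of the `FUNCTIONNAME' method in the list of methods above, the following defines a fallback function and registers it with lex.set.lts.method. The function name my_unknown_word_lts and the fixed placeholder pronunciation are hypothetical, and the phone ax is assumed to exist in the current phoneset; a real voice would compute a pronunciation rather than return a constant one.

```scheme
;; Hypothetical fallback for out-of-vocabulary words (illustration only).
;; It receives the word and its part of speech and returns a lexical
;; entry of the form (WORD POS SYLLABLES), as the function method requires.
(define (my_unknown_word_lts word features)
  "(my_unknown_word_lts WORD FEATURES)
Return a placeholder entry for a WORD missing from the lexicon."
  ;; Assumption: ax is a valid phone in the current phoneset.
  (list word features '(((ax) 0))))

;; Register the function as the letter to sound method of the
;; currently selected lexicon.
(lex.set.lts.method 'my_unknown_word_lts)
```

With this in place, looking up a word that is in neither the addenda nor the compiled lexicon should call my_unknown_word_lts instead of throwing an error.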
The basic form of a rule is as follows

    ( LEFTCONTEXT [ ITEMS ] RIGHTCONTEXT = NEWITEMS )

The interpretation is that if ITEMS appear in the specified right and left context, then the output string is to contain NEWITEMS. Any of LEFTCONTEXT, RIGHTCONTEXT or NEWITEMS may be empty. Note that NEWITEMS is written to a different "tape" and hence cannot feed further rules (within this ruleset). An example is

    ( # [ c h ] C = k )

The special character # denotes a word boundary, and the symbol C denotes the set of all consonants; sets are declared before rules. This rule states that a ch at the start of a word followed by a consonant is to be rendered as the k phoneme.

Symbols in contexts may be followed by the symbol * for zero or more occurrences, or + for one or more occurrences.

The symbols in the rules are treated as set names if they are declared as such, or otherwise as symbols in the input/output alphabets. The symbols may be more than one character long, and the names are case sensitive.
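The rule form above can be tried out with a small ruleset. The following is a sketch only: the ruleset name toy_ch, the contents of the set C, and the output symbols are made up for illustration and are not taken from any distributed voice. It assumes the lts.ruleset form takes a name, a list of sets and a list of rules, that rules are tried in order, and that lts.apply runs a named ruleset on a word.

```scheme
;; Hypothetical toy ruleset (illustration only), in the shape
;; (lts.ruleset NAME SETS RULES).
(lts.ruleset
 toy_ch
 ;; Sets: C is a deliberately small set of consonants
 ( (C b c d f g h k l m n p r s t) )
 ;; Rules: the more specific ch rule comes before the general one
 (
  ( # [ c h ] C = k )   ; word-initial ch before a consonant
  ( [ c h ] = ch )      ; any other ch
  ( [ r ] = r )
  ( [ o ] = ow )
  ( [ m ] = m )
  ( [ e ] = )           ; empty NEWITEMS: e is silent in this toy example
  ( [ i ] = ih )
  ( [ p ] = p )
  ))

;; Apply the rules to two words
(lts.apply "chrome" 'toy_ch)
(lts.apply "chip" 'toy_ch)
```

In "chrome" the ch is word initial and followed by r, which is in C, so the first rule should apply; in "chip" the following i is not in C, so the general ch rule should apply instead. Every letter of the input needs some matching rule, which is why the toy set includes one rule per remaining letter.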