<strong>Proceedings</strong>, FONETIK <strong>2009</strong>, Dept. of Linguistics, Stockholm UniversityResponses (%)1007550250Bottom Middle TopRankTraditional Adapted Gesture 70%Figure 6. Rank distributions for the traditional,adapted and gesture 70% systems.The outcome of the experiment should be consideredwith some caution due to the selectionof the subject group. However, the results indicatethat the gesture system has an advantageover the other two systems and that the adaptedsystem is ranked higher than the traditional system.The maximum rankings are 64%, 72% and71% for the traditional, adapted and gesturesystems, respectively. Our initial hypothesiswas that these systems would be ranked withthe traditional system at the bottom and thegesture system at the top. This is in fact true in58% of the cases with a standard deviation of21%. One subject contradicted this hypothesisin only one out off 12 cases while another subjectdid the same in as many as 9 cases. Thehypothesis was confirmed by all subjects forone utterance and by only one subject for anotherone.The adapted system is based on data fromthe diphone unit library and was created toform a homogeneous base for combining rulebasedand unit-based synthesis as smoothly aspossible. It is interesting that even these firststeps, creating the adapted system, are regardedto be an improvement. The diphone library hasnot yet been matched to the dialect of the referencespeaker, and a number of diphones aremissing.Final remarksThis paper describes our work on building formantsynthesis systems based on both rulegeneratedand database driven methods. Thetechnical and perceptual evaluations show thatthis approach is a very interesting path to explorefurther at least in a research environment.The perceptual results showed an advantage innaturalness for the gesture system which includesboth speaker adaptation and a diphonedatabase of formant gestures, compared to boththe traditional reference system and the speakeradapted system. However, it is also apparentfrom the synthesis quality that a lot of workstill needs to be put into the automatic buildingof a formant unit library.AcknowledgementsThe diphone database was recorded using theWaveSurfer software. David Öhlin contributedin building the diphone database. We thankJohn Lindberg and Roberto Bresin for makingthe evaluation software available for the perceptualranking. The SpeeCon database wasmade available by Kjell Elenius. We thank allsubjects for their participation in the perceptualevaluation.ReferencesAcero, A. (1999) “Formant analysis and synthesisusing hidden Markov models”, In:Proc. of Eurospeech'99, pp. 1047-1050.Carlson, R., and Granström, B. (2005) “Datadrivenmultimodal synthesis”, SpeechCommunication, Volume 47, Issues 1-2,September-October 2005, Pages 182-193.Carlson, R., Granström, B., and Karlsson, I.(1991) “Experiments with voice modellingin speech synthesis”, Speech Communication,10, 481-489.Carlson, R., Granström, B., Hunnicutt, S.(1982) “A multi-language text-to-speechmodule”, In: Proc. of the 7th InternationalConference on Acoustics, Speech, and SignalProcessing (ICASSP’82), Paris, France,vol. 3, pp. 1604-1607.Carlson, R., Sigvardson, T., Sjölander, A.(2002) “Data-driven formant synthesis”, In:Proc. of <strong>Fonetik</strong> 2002, Stockholm, Sweden,STL-QPSR 44, pp. 69-72Großkopf, B., Marasek, K., v. d. Heuvel, H.,Diehl, F., Kiessling, A. (2002) “SpeeCon -speech data for consumer devices: Databasespecification and validation”, Proc. LREC.Hertz, S. (2002) “Integration of Rule-BasedFormant Synthesis and Waveform Concatenation:A Hybrid Approach to Text-to-Speech Synthesis”, In: Proc. IEEE 2002Workshop on Speech Synthesis, 11-13, September2002 Santa Monica, USA.90
<strong>Proceedings</strong>, FONETIK <strong>2009</strong>, Dept. of Linguistics, Stockholm UniversityHögberg, J. (1997) “Data driven formant synthesis”,In: Proc. of Eurospeech 97.Holmes, W. J. and Pearce, D. J. B. (1990)“Automatic derivation of segment modelsfor synthesis-by-rule”, Proc ESCA Workshopon Speech Synthesis, Autrans, France.Lee, M., van Santen, J., Möbius, B., Olive, J.(1999) “Formant Tracking Using SegmentalPhonemic Information”, In: Proc.of Eurospeech‘99, Vol. 6, pp. 2789–2792.Mannell, R. H. (1998) “Formant diphone parameterextraction utilising a labeled singlespeaker database”, In: Proc. of ICSLP 98.Mori, H., Ohtsuka, T., Kasuya, H. (2002) “Adata-driven approach to source-formanttype text-to-speech system”, In ICSLP-2002, pp. 2365-2368.Öhlin, D. (2004) “Formant Extraction for DatadrivenFormant Synthesis”, (in Swedish).Master Thesis, TMH, KTH, Stockholm.Öhlin, D., Carlson, R. (2004) “Data-drivenformant synthesis”, In: Proc. <strong>Fonetik</strong> 2004pp. 160-163.Sigvardson, T. (2002) “Data-driven Methodsfor Paameter Synthesis – Description of aSystem and Experiments with CART-Analysis”, (in Swedish). Master Thesis,TMH, KTH, Stockholm, Sweden.Sjölander, A. (2001) “Data-driven FormantSynthesis” (in Swedish). Master Thesis,TMH, KTH, Stockholm.Sjölander, K. (2003) “An HMM-based Systemfor Automatic Segmentation and Alignmentof Speech”, In: Proc. of <strong>Fonetik</strong> 2003,Umeå Universitet, Umeå, Sweden, pp. 93–96.Talkin, D. (1989) “Looking at Speech”, In:Speech Technology, No 4, April/May 1989,pp. 74–77.91
- Page 1 and 2:
Department of LinguisticsProceeding
- Page 3 and 4:
Proceedings, FONETIK 2009, Dept. of
- Page 5 and 6:
Proceedings, FONETIK 2009, Dept. of
- Page 7 and 8:
Proceedings, FONETIK 2009, Dept. of
- Page 9 and 10:
Proceedings, FONETIK 2009, Dept. of
- Page 11 and 12:
Proceedings, FONETIK 2009, Dept. of
- Page 13 and 14:
Proceedings, FONETIK 2009, Dept. of
- Page 15 and 16:
Proceedings, FONETIK 2009, Dept. of
- Page 17 and 18:
Proceedings, FONETIK 2009, Dept. of
- Page 19 and 20:
Proceedings, FONETIK 2009, Dept. of
- Page 21 and 22:
Proceedings, FONETIK 2009, Dept. of
- Page 23 and 24:
Proceedings, FONETIK 2009, Dept. of
- Page 25 and 26:
Proceedings, FONETIK 2009, Dept. of
- Page 27 and 28:
Proceedings, FONETIK 2009, Dept. of
- Page 29 and 30:
Proceedings, FONETIK 2009, Dept. of
- Page 31 and 32:
Proceedings, FONETIK 2009, Dept. of
- Page 33 and 34:
Proceedings, FONETIK 2009, Dept. of
- Page 35 and 36:
Proceedings, FONETIK 2009, Dept. of
- Page 37 and 38:
Proceedings, FONETIK 2009, Dept. of
- Page 39 and 40: Proceedings, FONETIK 2009, Dept. of
- Page 41 and 42: Proceedings, FOETIK 2009, Dept. of
- Page 43 and 44: Proceedings, FONETIK 2009, Dept. of
- Page 45 and 46: Proceedings, FONETIK 2009, Dept. of
- Page 47 and 48: Proceedings, FONETIK 2009, Dept. of
- Page 49 and 50: Proceedings, FONETIK 2009, Dept. of
- Page 51 and 52: Proceedings, FONETIK 2009, Dept. of
- Page 53 and 54: Proceedings, FONETIK 2009, Dept. of
- Page 55 and 56: Proceedings, FONETIK 2009, Dept. of
- Page 57 and 58: Proceedings, FONETIK 2009, Dept. of
- Page 59 and 60: Proceedings, FONETIK 2009, Dept. of
- Page 61 and 62: Proceedings, FONETIK 2009, Dept. of
- Page 63 and 64: Proceedings, FONETIK 2009, Dept. of
- Page 65 and 66: Proceedings, FONETIK 2009, Dept. of
- Page 67 and 68: Proceedings, FONETIK 2009, Dept. of
- Page 69 and 70: Proceedings, FONETIK 2009, Dept. of
- Page 71 and 72: Proceedings, FONETIK 2009, Dept. of
- Page 73 and 74: Proceedings, FONETIK 2009, Dept. of
- Page 75 and 76: Proceedings, FONETIK 2009, Dept. of
- Page 77 and 78: Proceedings, FONETIK 2009, Dept. of
- Page 79 and 80: Proceedings, FONETIK 2009, Dept. of
- Page 81 and 82: Proceedings, FONETIK 2009, Dept. of
- Page 83 and 84: Proceedings, FONETIK 2009, Dept. of
- Page 85 and 86: Proceedings, FONETIK 2009, Dept. of
- Page 87 and 88: Proceedings, FONETIK 2009, Dept. of
- Page 89: Proceedings, FONETIK 2009, Dept. of
- Page 93 and 94: Proceedings, FONETIK 2009, Dept. of
- Page 95 and 96: Proceedings, FONETIK 2009, Dept. of
- Page 97 and 98: Proceedings, FONETIK 2009, Dept. of
- Page 99 and 100: Proceedings, FONETIK 2009, Dept. of
- Page 101 and 102: Proceedings, FONETIK 2009, Dept. of
- Page 103 and 104: Proceedings, FONETIK 2009, Dept. of
- Page 105 and 106: Proceedings, FONETIK 2009, Dept. of
- Page 107 and 108: Proceedings, FONETIK 2009, Dept. of
- Page 109 and 110: Proceedings, FONETIK 2009, Dept. of
- Page 111 and 112: Proceedings, FONETIK 2009, Dept. of
- Page 113 and 114: Proceedings, FONETIK 2009, Dept. of
- Page 115 and 116: Proceedings, FONETIK 2009, Dept. of
- Page 117 and 118: Proceedings, FONETIK 2009, Dept. of
- Page 119 and 120: Proceedings, FONETIK 2009, Dept. of
- Page 121 and 122: Proceedings, FONETIK 2009, Dept. of
- Page 123 and 124: Proceedings, FONETIK 2009, Dept. of
- Page 125 and 126: Proceedings, FONETIK 2009, Dept. of
- Page 127 and 128: Proceedings, FOETIK 2009, Dept. of
- Page 129 and 130: Proceedings, FOETIK 2009, Dept. of
- Page 131 and 132: Proceedings, FOETIK 2009, Dept. of
- Page 133 and 134: Proceedings, FOETIK 2009, Dept. of
- Page 135 and 136: Proceedings, FOETIK 2009, Dept. of
- Page 137 and 138: Proceedings, FONETIK 2009, Dept. of
- Page 139 and 140: Proceedings, FONETIK 2009, Dept. of
- Page 141 and 142:
Proceedings, FONETIK 2009, Dept. of
- Page 143 and 144:
Proceedings, FONETIK 2009, Dept. of
- Page 145 and 146:
Proceedings, FONETIK 2009, Dept. of
- Page 147 and 148:
Proceedings, FONETIK 2009, Dept. of
- Page 149 and 150:
Proceedings, FONETIK 2009, Dept. of
- Page 151 and 152:
Proceedings, FONETIK 2009, Dept. of
- Page 153 and 154:
Proceedings, FONETIK 2009, Dept. of
- Page 155 and 156:
Proceedings, FONETIK 2009, Dept. of
- Page 157 and 158:
Proceedings, FONETIK 2009, Dept. of
- Page 159 and 160:
Proceedings, FONETIK 2009, Dept. of
- Page 161 and 162:
Proceedings, FONETIK 2009, Dept. of
- Page 163 and 164:
Proceedings, FONETIK 2009, Dept. of
- Page 165 and 166:
Proceedings, FONETIK 2009, Dept. of
- Page 167 and 168:
Proceedings, FONETIK 2009, Dept. of
- Page 169 and 170:
Proceedings, FONETIK 2009, Dept. of
- Page 171 and 172:
Proceedings, FONETIK 2009, Dept. of
- Page 173 and 174:
Proceedings, FONETIK 2009, Dept. of
- Page 175 and 176:
Proceedings, FONETIK 2009, Dept. of
- Page 177 and 178:
Proceedings, FONETIK 2009, Dept. of
- Page 179 and 180:
Proceedings, FONETIK 2009, Dept. of
- Page 181 and 182:
Proceedings, FONETIK 2009, Dept. of
- Page 183 and 184:
Proceedings, FONETIK 2009, Dept. of
- Page 185 and 186:
Proceedings, FONETIK 2009, Dept. of
- Page 187 and 188:
Proceedings, FONETIK 2009, Dept. of
- Page 189 and 190:
Proceedings, FONETIK 2009, Dept. of
- Page 191 and 192:
Proceedings, FONETIK 2009, Dept. of
- Page 193 and 194:
Proceedings, FONETIK 2009, Dept. of
- Page 195 and 196:
Proceedings, FONETIK 2009, Dept. of
- Page 197 and 198:
Proceedings, FONETIK 2009, Dept. of
- Page 199 and 200:
Proceedings, FONETIK 2009, Dept. of
- Page 201 and 202:
Proceedings, FONETIK 2009, Dept. of
- Page 203 and 204:
Proceedings, FOETIK 2009, Dept. of
- Page 205 and 206:
Proceedings, FOETIK 2009, Dept. of
- Page 207 and 208:
Proceedings, FONETIK 2009, Dept. of
- Page 209 and 210:
Proceedings, FONETIK 2009, Dept. of
- Page 211 and 212:
Proceedings, FONETIK 2009, Dept. of
- Page 213 and 214:
Proceedings, FONETIK 2009, Dept. of
- Page 215 and 216:
Proceedings, FONETIK 2009, Dept. of
- Page 217 and 218:
Proceedings, FONETIK 2009, Dept. of
- Page 219 and 220:
Proceedings, FONETIK 2009, Dept. of
- Page 221 and 222:
Proceedings, FONETIK 2009, Dept. of
- Page 223 and 224:
Proceedings, FONETIK 2009, Dept. of
- Page 225 and 226:
Proceedings, FONETIK 2009, Dept. of
- Page 227:
Department of LinguisticsPhonetics