Text-to-speech man-machine interface in embedded systems
Text-to-speech man-machine interface in embedded systems
Text-to-speech man-machine interface in embedded systems
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Concatenative TTS synthesis• Two ma<strong>in</strong> types of <strong>in</strong>ven<strong>to</strong>ries:• Fixed <strong>in</strong>ven<strong>to</strong>ry – only one sample of each unit• The data-base is small (typically 1500+)• They rely extensively on unit modification• Unit-selection <strong>systems</strong> - <strong>man</strong>y samples of the same unit• Usually conta<strong>in</strong> hours of recorded <strong>speech</strong> material• They require very little or no unit modification• The most elaborate of these is Japanese ATR’s XIMERA, whichuses a 170 hour, 25,5 GB database of recorded <strong>speech</strong>!MBROLAdiphoneFestivaldiphoneFestivalunit-selectionAT&T’s Next-Genunit-selectionATR’s CHATRunit-selection