11.07.2015 Views

Text-to-speech man-machine interface in embedded systems

Text-to-speech man-machine interface in embedded systems

Text-to-speech man-machine interface in embedded systems

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Concatenative TTS synthesis• Two ma<strong>in</strong> types of <strong>in</strong>ven<strong>to</strong>ries:• Fixed <strong>in</strong>ven<strong>to</strong>ry – only one sample of each unit• The data-base is small (typically 1500+)• They rely extensively on unit modification• Unit-selection <strong>systems</strong> - <strong>man</strong>y samples of the same unit• Usually conta<strong>in</strong> hours of recorded <strong>speech</strong> material• They require very little or no unit modification• The most elaborate of these is Japanese ATR’s XIMERA, whichuses a 170 hour, 25,5 GB database of recorded <strong>speech</strong>!MBROLAdiphoneFestivaldiphoneFestivalunit-selectionAT&T’s Next-Genunit-selectionATR’s CHATRunit-selection

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!