A Study of the ITU-T G.729 Speech Coding Algorithm ...

Open MASTER THESIS | Date: 04-09-28 | Rev: PA1

Figure 2: Human speech production.

2.2.2 Speech Sounds

One way of analyzing speech is to use phonetics, which is the study of speech sounds and their production. Every language has a limited set of linguistic units called phonemes. A phoneme is a sound, and several consecutive phonemes form a word. For example, the word dog consists of the three phonemes d/ao/g. Most languages have 20-50 phonemes, each phoneme representing a sound. The 42 phonemes in the English language are listed in Table 2 [15].

A phone is an acoustic realization of a phoneme. If the realization of a phoneme is context dependent, it is called an allophone. In most languages, the phonemes can be divided into two groups: vowels and consonants. The set of vowels and consonants is language specific. The hierarchy of the English phonemes is depicted in Figure 3 [25].

2.2.3 Speech-Signal Waveform Characteristics

The range of frequencies that humans are able to hear is called the audio spectrum. It extends from 20 Hz to 20 kHz, although most humans hear a much narrower band, especially with increasing age. Frequencies above the human hearing limit are called ultrasonic, while frequencies below the limit are called infrasonic.

Speech-signal waveform characteristics are approximately constant over short time periods. A speech signal is commonly assumed to be wide-sense stationary and ergodic in the autocorrelation for segments of 10-30 ms of speech. In speech analysis, it is therefore common to extract short-time segments of speech, called frames, to enable simple speech modeling. For a smoother transition between analysis frames, the latter are extracted with overlap, which
Table 2: English phonemes

Phoneme  Word examples            Description
-------  -----------------------  ------------------------------------
iy       feel, eve, me            front close unrounded
ih       fill, hit, lid           front close unrounded
ae       at, carry, gas           front close unrounded
aa       father, ah, car          back open unrounded
ah       cut, bud, up             open-mid back unrounded
ao       dog, lawn, caught        open-mid back rounded
ay       tie, ice, bite           diphthong with quality: aa + ih
ax       ago, comply              central close-mid
ey       ate, day, tape           front close-mid unrounded
eh       pet, berry, ten          front open-mid unrounded
er       turn, fur, meter         central open-mid unrounded
ow       go, own, tone            back close-mid rounded
aw       foul, how, our           diphthong with quality: aa + uh
oy       toy, coin, oil           diphthong with quality: ao + ih
uh       book, pull, good         back close-mid unrounded
uw       tool, crew, moo          back close rounded
b        big, able, tab           voiced bilabial plosive
p        put, open, tap           voiceless bilabial plosive
d        dig, idea, wad           voiced alveolar plosive
t        talk, sat                voiceless alveolar plosive
t        meter                    alveolar flap
g        gut, angle, tag          voiced velar plosive
k        cut, ken, take           voiceless velar plosive
f        fork, after, if          voiceless labiodental fricative
v        vat, over, have          voiced labiodental fricative
s        sit, cast, toss          voiceless alveolar fricative
z        zap, lazy, haze          voiced alveolar fricative
th       thin, nothing, truth     voiceless dental fricative
dh       then, father, scythe     voiced dental fricative
sh       she, cushion, wash       voiceless postalveolar fricative
zh       genre, azure             voiced postalveolar fricative
l        lid                      alveolar lateral approximant
l        elbow, sail              velar lateral approximant
r        red, part, far           retroflex approximant
y        yacht, yard              palatal sonorant glide
w        with, away               labiovelar sonorant glide
hh       help, ahead, hotel       voiceless glottal fricative
m        mat, amid, aim           bilabial nasal
n        no, end, pan             alveolar nasal
ng       sing, anger              velar nasal
ch       chin, archer, march      voiceless alveolar affricate: t + sh
jh       joy, agile, edge         voiced alveolar affricate: d + zh
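
The short-time framing described in Section 2.2.3 can be illustrated with a small sketch. It is not taken from the thesis: the function names, the 8 kHz sampling rate, the 20 ms frame length and the 10 ms hop are assumptions chosen only so that the frame length falls inside the 10-30 ms range mentioned in the text. The second function shows one common way (a biased time average) to estimate the autocorrelation within a single frame, which is what the ergodicity assumption permits.

```python
def split_into_frames(samples, sample_rate_hz, frame_ms=20.0, hop_ms=10.0):
    """Split a sequence of samples into overlapping short-time frames.

    frame_ms and hop_ms are illustrative assumptions, not values from the
    thesis. With hop_ms < frame_ms, consecutive frames overlap.
    """
    frame_len = int(round(sample_rate_hz * frame_ms / 1000.0))
    hop_len = int(round(sample_rate_hz * hop_ms / 1000.0))
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame and hop must span at least one sample")

    frames = []
    start = 0
    # Only complete frames are kept; a trailing partial frame is dropped.
    while start + frame_len <= len(samples):
        frames.append(list(samples[start:start + frame_len]))
        start += hop_len
    return frames


def short_time_autocorrelation(frame, lag):
    """Biased time-average estimate of the autocorrelation at a given lag.

    Within one frame the signal is treated as wide-sense stationary, so the
    time average over the frame stands in for the ensemble average.
    """
    n = len(frame)
    if n == 0 or lag < 0 or lag >= n:
        raise ValueError("lag must satisfy 0 <= lag < len(frame)")
    return sum(frame[i] * frame[i + lag] for i in range(n - lag)) / n


if __name__ == "__main__":
    # One second of a dummy ramp signal at an assumed 8 kHz sampling rate.
    signal = [float(i) for i in range(8000)]
    frames = split_into_frames(signal, sample_rate_hz=8000)
    print(len(frames), len(frames[0]))
```

With these assumed settings each frame holds 160 samples and consecutive frames share 80 samples, so the second half of one frame is the first half of the next.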