
Figure 4: Symbol emission in the DAR-HMM; s_i is the state (symbol subcategory), v_j are the observed symbols.

The DAR-HMM extends the classical HMM in that the symbol emission probability depends on the present state and the last observed symbol, as depicted in Figure 4.

The DAR-HMM for symbolic prediction can be described using a parameter vector λ = ⟨Π₀, A, B⟩, where Π₀[N] is the vector of initial subcategory probabilities π_i(0), A[N][N] is the matrix of subcategory transition probabilities a_{ii′}, and B[N][M][M+1] is the emission matrix¹ with symbol probabilities $b^{ii'}_{kk'}$ and $b^i_k$ (see Appendix A for details). In CABA²L, this λ vector has been estimated using a dataset of Bliss sentences. To do so, we have adopted a variation of the Baum-Welch algorithm, an iterative algorithm based on the Expectation-Maximization method (Bilmes, 1998; Dempster et al., 1977), adapting this technique to the specific case of the DAR-HMM (see Figure 5).
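For concreteness, here is a minimal sketch of how the λ parameter vector could be laid out in code; the class and attribute names, and the convention that the last index of B covers the "no previous symbol" case, are our own assumptions rather than the paper's:

```python
import numpy as np

class DARHMMParams:
    """Illustrative container for the DAR-HMM parameter vector
    lambda = <Pi0, A, B> (names and index layout are assumptions)."""

    def __init__(self, n_states: int, n_symbols: int):
        # Pi0[N]: initial subcategory probabilities pi_i(0)
        self.pi0 = np.full(n_states, 1.0 / n_states)
        # A[N][N]: subcategory transition probabilities a_{ii'}
        self.A = np.full((n_states, n_states), 1.0 / n_states)
        # B[N][M][M+1]: probability of emitting a symbol in state i;
        # the extra column (index M) models emission when no symbol has
        # been observed yet (b^i_k), the others condition on the last symbol
        self.B = np.zeros((n_states, n_symbols, n_symbols + 1))
```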

Since the Baum-Welch algorithm is a greedy algorithm that can be trapped in local minima, the initial estimate of the λ parameter vector is a fundamental aspect. No theoretical solution to this issue exists in the literature; in practice, a random or uniformly distributed initialization for A and Π₀ has been verified to be adequate. In particular, we adopt a uniform distribution as the initial estimate for Π₀, and a distribution based on knowledge about the phenomenon for A.

¹ From an implementation point of view, matrix B could represent the main issue of this model (e.g., with N = 30 subcategories and M ≈ 2000 symbols, the number of cells is of the order of 10⁸, about 400 MB). However, B can be considered a sparse matrix, since only a fraction of the symbols can be emitted from each subcategory, so the number of non-zero cells is approximately lower than 10⁴; ad-hoc data structures, such as heaps or priority queues, and optimized algorithms can be used to overcome memory occupancy and access speed issues.
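As a sketch of such a sparse representation (a dictionary-of-dictionaries stands in here for the heap or priority-queue structures mentioned above; the layout and function names are our assumptions):

```python
# Sparse emission matrix: {state: {last_symbol: {symbol: probability}}}.
# Only the symbols actually emittable from a subcategory are stored, so
# memory stays close to the number of non-zero cells instead of the
# dense N * M * (M + 1) size.
B_sparse: dict[int, dict[int, dict[int, float]]] = {}

def set_emission(state: int, last_symbol: int, symbol: int, prob: float) -> None:
    B_sparse.setdefault(state, {}).setdefault(last_symbol, {})[symbol] = prob

def get_emission(state: int, last_symbol: int, symbol: int) -> float:
    # Entries absent from the sparse structure are exact zeros.
    return B_sparse.get(state, {}).get(last_symbol, {}).get(symbol, 0.0)
```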


Only arcs connecting subcategories in the semantic model of the language (see Section 3) should have a probability a_{ii′} ≠ 0. However, we have assigned a very low probability to the arcs between symbols and states that are not connected in the semantic network, so as not to preclude the training algorithm from eventually discovering unforeseen correlations.
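A sketch of this initialization, assuming the semantic network is available as a set of connected subcategory pairs (the function name and the epsilon value are illustrative choices, not the paper's):

```python
import numpy as np

def init_pi0_and_A(n_states: int, semantic_arcs: set[tuple[int, int]],
                   epsilon: float = 1e-6) -> tuple[np.ndarray, np.ndarray]:
    """Uniform Pi0; A follows the semantic network, with a very low but
    non-zero probability on arcs absent from the network so that training
    can still discover unforeseen correlations."""
    pi0 = np.full(n_states, 1.0 / n_states)
    A = np.full((n_states, n_states), epsilon)
    for i, j in semantic_arcs:
        A[i, j] = 1.0  # uniform weight over connected subcategories
    A /= A.sum(axis=1, keepdims=True)  # normalize each row to a distribution
    return pi0, A
```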

The initial estimation of the B matrix is more critical, so we have used the Segmental k-Means technique (Juang and Rabiner, 1990; Juang et al., 1986) to obtain a more reliable estimate. This process considers a subset of the sentences composing the dataset and, for each one, it looks for the best sequence of subcategories using the Viterbi algorithm in order to update the symbol emission probabilities.
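A high-level sketch of this step, assuming a viterbi() routine that returns the best subcategory sequence for a sentence; the count-and-normalize re-estimation and all names are our simplified rendering of the technique:

```python
import numpy as np

def segmental_kmeans_B(sentences, params, viterbi, n_iters=5):
    """Re-estimate emission probabilities from Viterbi alignments (sketch).

    sentences: list of symbol-index sequences;
    viterbi(sentence, params) -> best state (subcategory) sequence.
    """
    no_prev = params.B.shape[2] - 1  # index meaning "no previous symbol"
    for _ in range(n_iters):
        counts = np.zeros_like(params.B)
        for sentence in sentences:
            states = viterbi(sentence, params)
            prev = no_prev
            for state, symbol in zip(states, sentence):
                counts[state, symbol, prev] += 1.0
                prev = symbol
        # Turn counts into probabilities over the emitted symbol;
        # rows never visited keep their previous estimate.
        totals = counts.sum(axis=1, keepdims=True)
        params.B = np.where(totals > 0,
                            counts / np.maximum(totals, 1e-12), params.B)
```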

Given the initial values λ₀ of the model parameters, we use a modified Baum-Welch algorithm to estimate the model parameters from a real dataset, through a sequence of temporary λ parameter estimates.

As in any learning algorithm, the main issue is avoiding the overfitting phenomenon (Caruana et al., 2001), so we would like to stop the training phase according to the generalization error (i.e., the error on new samples) rather than just observing the training error (i.e., the error on the training set). To this end, we have used the K-fold cross-validation technique (Amari et al., 1995): the whole set of sentences is divided into K similar subsets; at each iteration, K − 1 subsets are used for parameter estimation and the remaining validation set is used to evaluate the convergence of the model's generalization error. In other words, we calculate the errors of the model in predicting the sentences of the validation set, which it has never seen, and we analyze the validation error function during the training iterations of the Baum-Welch algorithm until it reaches its minimum.
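A sketch of this setup; the helpers baum_welch_step() and prediction_error() are hypothetical stand-ins for the paper's modified Baum-Welch iteration and its error measure:

```python
def kfold_validation_errors(sentences, K, init_params, baum_welch_step,
                            prediction_error, n_iters):
    """For each fold, train on K-1 subsets and record the validation
    error after every training iteration (illustrative sketch)."""
    folds = [sentences[k::K] for k in range(K)]  # K similar subsets
    histories = []
    for k in range(K):
        val_set = folds[k]
        train_set = [s for j, fold in enumerate(folds) if j != k for s in fold]
        params = init_params()
        errors = []
        for _ in range(n_iters):
            params = baum_welch_step(params, train_set)       # one EM iteration
            errors.append(prediction_error(params, val_set))  # generalization error
        histories.append(errors)
    return histories
```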

In order to identify the iteration at which the error function reaches its minimum, several practical techniques can be adopted, but none of them assures reaching the global minimum. We have chosen to adopt a termination criterion based on the generalization loss method (Prechelt, 1996). Given

$$\mathrm{Err}_{\mathrm{Opt}}(t) = \min_{t' \leq t} \mathrm{Err}_{\mathrm{Val}}(t'),$$

the minimum validation error obtained up to time t, consider

$$GL(t) \triangleq 100 \left( \frac{\mathrm{Err}_{\mathrm{Val}}(t)}{\mathrm{Err}_{\mathrm{Opt}}(t)} - 1 \right),$$

which represents the relative increase of the current validation error over the minimum observed so far. The training phase is stopped whenever the generalization loss GL becomes bigger than a given threshold τ: GL(t) > τ.
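The stopping rule transcribes directly into code; err_val below is the per-iteration validation error sequence and tau the threshold (names are ours):

```python
def generalization_loss_stop(err_val, tau):
    """Return the first iteration t at which GL(t) > tau, where
    GL(t) = 100 * (ErrVal(t) / ErrOpt(t) - 1) and ErrOpt(t) is the
    lowest validation error observed up to t; None if never exceeded."""
    err_opt = float("inf")
    for t, err in enumerate(err_val):
        err_opt = min(err_opt, err)
        gl = 100.0 * (err / err_opt - 1.0)
        if gl > tau:
            return t  # stop training here
    return None  # error may stabilize without GL ever exceeding tau
```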

In this approach, the error function could stabilize after a local minimum without GL(t) ever exceeding the threshold. In order to face this issue, we have added to
