Back Room Front Room 2

More documents

Recommendations

Info

MINING THE RELATIONSHIPS IN THE FORM OF THE PREDISPOSING FACTORS the number of Download significantly increases are the same as its predisposing factors but after that the number of all of the other attributes decreases. 7.1.2 Slope Direction of the Download is Negative Table 3: Summary of the result in case the slope direction of the Download is negative previous current post P/V up down down Bugs0 up down down Bugs1 up down up Support0 up down down Support1 up down up Patches0 up down up Patches1 up down up Tracker0 up down down Tracker1 up down up Tasks0 down down down Tasks1 down down down CVS down down down From these results, we find that the predisposing factors of the number of the Download significantly decreases are the number of almost all of the other attributes increases, except only the number of Tasks0, Tasks1 and CVS decrease. And the coincident factors of the number of Download significantly decrease are the number of all of the other attributes decrease at the same time interval. After that the number P/V, Bugs0, Support0, Tracker0, Tasks0, Tasks1, CVS decrease and the number of Bugs1, Support1, Patches0, Patches1, Tracker1 increase . 8 PERFORMANCE Our methods consume time to find the predisposing factor and the co-incident factor of the reference event just in O(n) where n is the number of the total records. The most time consuming is the time for calculating the slope (the data change) of every two adjacent time points of the same project which take time O(n). And we have to spend time to select the reference event by using the threshold which takes time O(n). We have to spend time to group records around the reference event (at the previous time point, the current time point and the post time point) which is O(n). And the time for counting the number of events of the other attributes at each time point around the current time point is O(n). The time in overall process can be approximate to O(n), which is not exponential. So our methods are good enough to apply in the big real life data set. In our experiments we use PC PentiumIV 1.6 GHz, RAM 1 GB. The operating system is MS WindowsXP Professional. We implement these algorithms in Perl 5.8 on command line. The data set test has 1,097,341 records, 41, 540 projects with total 17 attributes. The number of attributes of consideration is 13 attributes. The size of this data set is about 48 MB. We want to see if our program consume running time in linear scale with the size of the data or not. Then we test with the different number of records in each file and run each file at a time. The result is shown in Graph 2. 50 45 40 35 30 25 20 15 10 5 0 time(sec) 6000 8000 10000 12000 14000 16000 time(sec) 141 Graph 2: Running time (in seconds) and the number of records to be run at a time From this result confirm us that our algorithm consumes execution time in linear scale with the number of records. 8.1 Accuracy Test with Synthetic Data Sets We synthesize 4 data sets as follow 1. Correct complete data set 2. Put 5 % of noise in the first data set 3. Put 20 % of noise in the first data set 4. Put 50 % of noise in the first data set We set the data change threshold as 10. The result is almost all of four data sets correct, except only at the third data set with 20 % of noise, there is only one point in the result different from the others, that is, the catalyst at the current point changes to be positive slope instead of steady.
142 ENTERPRISE INFORMATION SYSTEMS VI 9 CONCLUSION The combination of the existing methods to be our new algorithm can be used to mine the predisposing factor and co-incident factor of the reference event very well. As seen in our experiments, our proposed algorithm can be applied to both the synthetic and the real life data set. The performance of our algorithm is also good. They consume execution time just in linear time scale and also tolerate to the noise data. 10 DISCUSSION The threshold is the indicator to select the event which is the significant change of the data of the attribute of consideration. When we use the different thresholds in detecting the events, the results can be different. So setting the threshold of the data change have to be well justified. It can be justified by looking at the data and observing the characteristic of the attributes of interest. The users have to realize that the results they get can be different depending on their threshold setting. The threshold reflects the degree of importance of the predisposing factor and the coincident factor to the reference event. If the degree of importance of an attribute is very high, just little change of the data of that attribute can make the data of the reference attribute change very much. So for this reason setting the threshold value is of utmost importance for the accuracy and reliability of the results. REFERENCES Agrawal, R. and Srikant R., 1995. Mining Sequential Patterns. In Proceedings of the IEEE International Conference on Data Engineering, Taipei, Taiwan. Bettini, C., Wang S., et al. 1998. Discovering Frequent Event Patterns with Multiple Granularities in Time Sequences. In IEEE Transactions on Knowledge and Data Engineering 10(2). Blum, R. L., 1982. Discovery, Confirmation and Interpretation of Causal Relationships from a Large Time-Oriented Clinical Databases: The Rx Project. Computers and Biomedical Research 15(2): 164-187. Dasgupta, D. and Forrest S., 1995. Novelty Detection in Time Series Data using Ideas from Immunology. In Proceedings of the 5th International Conference on Intelligent Systems, Reno, Nevada. Guralnik, V. and Srivastava J., 1999. Event Detection from Time Series Data. In KDD-99, San Diego, CA USA. Hirano, S., Sun X., et al., 2001. Analysis of Time-series Medical Databases Using Multiscale Structure Matching and Rough Sets-Based Clustering Technique. In IEEE International Fuzzy Systems Conference. Hirano, S. and Tsumoto S., 2001. A Knowledge-Oriented Clustering Technique Based on Rough Sets. In 25th Annual International Computer Software and Applications Conference (COMPSAC'01), Chicago, Illinois. Hirano, S. and Tsumoto S., 2002. Mining Similar Temporal Patterns in Long Time-Series Data and Its Application to Medicine. In IEEE: 219-226. Kantardzic, M., 2003. Data Mining Concepts, Models, Methods, and Algorithms. USA, IEEE Press. Keogh, E., Chu S., et al., 2001. An Online Algorithm for Segmenting Time Series. In Proceedings of IEEE International Conference on Data Mining, 2001. Keogh, E., Lonardi S., et al., 2002. Finding Surprising Patterns in a Time Series Database in Linear Time and Space. In Proceedings of The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '02), Edmonton, Alberta, Canada. Last, M., Klein Y., et al., 2001. Knowledge Discovery in Time Series Databases. In IEEE Transactions on Systems, Man, and Cybernetics 31(1): 160-169. Lu, H., Han J., et al., 1998. Stock Movement Prediction and N-Dimensional Inter-Transaction Association Rules. In Proc. of 1998 SIGMOD'98 Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD'98) ,, Seattle, Washington. Mannila, H., Toivonen H., et al., 1997. Discovery of frequent episodes in event sequences. In Data Mining and Knowledge Discovery 1(3): 258-289. Roddick, J. F. and Spiliopoulou M., 2002. A Survey of Temporal Knowledge Discovery Paradigms and Methods. In IEEE Transactions on Knowledge and Data Mining 14(4): 750-767. Salam, M. A., 2001. Quasi Fuzzy Paths in Semantic Networks. In Proceedings 10th IEEE International Conference on Fuzzy Systems, Melbourne, Australia. Tung, A., Lu H., et al., 1999. Breaking the Barrier of Transactions: Mining Inter-Transaction Association Rules. In Proceedings of the Fifth International on Knowledge Discovery and Data Mining [KDD 99], San Diego, CA. Ueda, N. and Suzuki S., 1990. A Matching Algorithm of Deformed Planar Curves Using Multiscale Convex/Concave Structures. In JEICE Transactions on Information and Systems J73-D-II(7): 992-1000. Weiss, S. M. and Indurkhya N., 1998. Predictive Data Mining. San Francisco, California, Morgn Kaufmann Publsihers, Inc.
Page 1 and 2:
www.dbeBooks.com - An Ebook Library
Page 3 and 4:
A C.I.P. Catalogue record for this
Page 5 and 6:
vi TABLE OF CONTENTS PART 1 - DATAB
Page 7 and 8:
viii TABLE OF CONTENTS A WIRELESS A
Page 9 and 10:
CONFERENCE COMMITTEE Honorary Presi
Page 11 and 12:
Hung, P. (AUSTRALIA) Jahankhani, H.
Page 13 and 14:
Invited Papers
Page 15 and 16:
2 ENTERPRISE INFORMATION SYSTEMS VI
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
10 ENTERPRISE INFORMATION SYSTEMS V
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
EVOLUTIONARY PROJECT MANAGEMENT: MU
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
MANAGING COMPLEXITY OF ENTERPRISE I
Page 45 and 46:
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
ASSESSING EFFORT PREDICTION MODELS
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
ORGANIZATIONAL AND TECHNOLOGICAL CR
Page 77 and 78:
Page 79 and 80:
ASAP Processes W Project Kickoff A
Page 81 and 82:
Page 83 and 84:
since it means changing the relevan
Page 85 and 86:
ACME-DB: AN ADAPTIVE CACHING MECHAN
Page 87 and 88:
ACME-DB: AN ADAPTIVE CACHING MECHAN
Page 89 and 90:
Hit Rate (%) ACME-DB: AN ADAPTIVE C
Page 91 and 92:
5.3.6 Effect on all Policies by Int
Page 93 and 94:
ACKNOWLEDGEMENTS We would like to t
Page 95 and 96:
This paper describes the data sampl
Page 97 and 98:
The major limit A of the operation
Page 99 and 100:
RELATIONAL SAMPLING FOR DATA QUALIT
Page 101 and 102: TOWARDS DESIGN RATIONALES OF SOFTWA
Page 103 and 104: ((Brown et al., 1998), pp. 159-190,
Page 105 and 106: sive data filtering, presentation (
Page 107 and 108: • implementation changes in servi
Page 109 and 110: MEMORY MANAGEMENT FOR LARGE SCALE D
Page 111 and 112: Data Rate [bytes/sec] 1600000 14000
Page 113 and 114: E.g., DV Camcorder Camera Packets (
Page 115 and 116: Procedure CheckOptimal (n rs 1 ...n
Page 117 and 118: MEMORY MANAGEMENT FOR LARGE SCALE D
Page 119 and 120: PART 2 Artificial Intelligence and
Page 121 and 122: 110 ENTERPRISE INFORMATION SYSTEMS
Page 127 and 128: DYNAMIC MULTI-AGENT BASED VARIETY F
Page 151: 140 ENTERPRISE INFORMATION SYSTEMS
Page 169 and 170: AN EXPERIENCE IN MANAGEMENT OF IMPR
Page 179 and 180: ANALYSIS AND RE-ENGINEERING OF WEB
Page 181 and 182: Module p0 Customer Itinerary Send I
Page 183 and 184: departments: the marketing dept. se
Page 185 and 186: • An acyclic module M is usable,
Page 187 and 188: BALANCING STAKEHOLDER’S PREFERENC
Page 189 and 190: The scenarios will provide an abstr
Page 191 and 192: BALANCING STAKEHOLDER'S PREFERENCES
Page 193 and 194: BALANCING STAKEHOLDER'S PREFERENCES
Page 195 and 196: A POLYMORPHIC CONTEXT FRAME TO SUPP
Page 197 and 198: A POLYMORPHIC CONTE XT FRAME TO SUP
Page 203 and 204:
FEATURE MATCHING IN MODEL-BASED SOF
Page 205 and 206:
combination of solution domains - a
Page 207 and 208:
• features for all the concepts:
Page 209 and 210:
Existence In both steps it is possi
Page 211 and 212:
matching strategies are maximal, mi
Page 213 and 214:
TOWARDS A META MODEL FOR DESCRIBING
Page 215 and 216:
elementary message (ae-message) can
Page 217 and 218:
problems on a pragmatic level, whic
Page 219 and 220:
TOWARDS A META MODEL FOR DESCRIBING
Page 221 and 222:
INTRUSION DETECTION SYSTEMS USING A
Page 223 and 224:
INTRUSION DETECTION SYSTEMS USING A
Page 225 and 226:
(k� 1) (k) (k�1) (k) p � w
Page 227 and 228:
Table 5: Performance Comparison of
Page 229 and 230:
A USER-CENTERED METHODOLOGY TO GENE
Page 231 and 232:
carries out the second phase of the
Page 233 and 234:
1 1 1 1 2 PROCESS STORE ENTITY EDGE
Page 235 and 236:
constraints rules. KOGGE (Ebert et
Page 237 and 238:
PART 4 Software Agents and Internet
Page 239 and 240:
230 ENTERPRISE INFORMATION SYSTEMS
Page 241 and 242:
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
Page 249 and 250:
Page 251 and 252:
Page 253 and 254:
Page 255 and 256:
Page 257 and 258:
Page 259 and 260:
Page 261 and 262:
Page 263 and 264:
Page 265 and 266:
Page 267 and 268:
Page 269 and 270:
Page 271 and 272:
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
Page 279 and 280:
Page 281 and 282:
Page 283 and 284:
Page 285 and 286:
CABA 2 L A BLISS PREDICTIVE COMPOSI
Page 287 and 288:
y disables evidencing severe mental
Page 289 and 290:
Figure 4: Symbols emission in DAR-H
Page 291 and 292:
they evidence a generalization erro
Page 293 and 294:
A METHODOLOGY FOR INTERFACE DESIGN
Page 295 and 296:
and mobility loss coupled with redu
Page 297 and 298:
Tradeoff: The default input is poss
Page 299 and 300:
Sessions on the VABS which are held
Page 301 and 302:
A CONTACT RECOMMENDER SYSTEM FOR A
Page 303 and 304:
A CONTACT RECOMMENDER SYSTEM FOR A
Page 305 and 306:
- "Going to the sources". The user
Page 307 and 308:
Here are the matrix W and P for our
Page 309 and 310:
EMOTION SYNTHESIS IN VIRTUAL ENVIRO
Page 311 and 312:
theme turns out to be surprisingly
Page 313 and 314:
6 FROM FEATURES TO SYMBOLS 6.1 Face
Page 315 and 316:
sions are described by different em
Page 317 and 318:
Electronic Imaging 2000 Conference
Page 319 and 320:
to the accessibility problem for th
Page 321 and 322:
Figure 3: Hierarchical View of the
Page 323 and 324:
4 CONCLUSIONS AND FUTURE WORK On th
Page 325 and 326:
PERSONALISED RESOURCE DISCOVERY SEA
Page 327 and 328:
Page 329 and 330:
profile, the user defines his curre
Page 331 and 332:
Page 333 and 334:
Abdelkafi, N. .....................
show all

Back Room Front Room 2

Create successful ePaper yourself

Delete template?

Save as template?