CoSIT 2017


136 Computer Science & Information Technology (CS & IT)

defects. Projects 1, 2 and 3 show high values of R² for data sets containing more than 500 defects. The differences in results between the projects should be further investigated.

Figure 5: Learning curve - R² as a function of the sample set size

In Figure 6 we compared five sets of data fields for each of the projects used in our experiments. The first set, Base Fields, comprises the following basic fields: Detector, Assignee, Priority, Severity and Project Name. The next two field sets extend the base set with the additional fields specified in the legend of Figure 6; the fourth set included all standard fields, which are common to all projects; and the last set contained all fields, including user-defined fields, which vary significantly between projects. As shown in Figure 6, R² scores increased as the number of fields available for learning increased, particularly when the system was allowed to use user-defined fields. It is important to note that any field may be discarded by the algorithm, either in the learning phase itself or during pre-processing.

While analyzing the results and comparing the different projects, we also examined the feature importances (i.e. the weight of each feature computed by the model). This examination showed that user-defined fields played a crucial role in determining the amount of time to fix a defect. In all projects, at least one such field was ranked among the three leading features. We also found that the date a defect last entered an Open state is significant for the prediction. This result supports the assumption mentioned briefly in Section , where we explained how the time a defect was opened or detected may be important. Examining feature importances also showed that fields that are intuitively thought to be significant for predicting defect handling time are not necessarily so.
Fields such as Severity, Priority, Assignee and Detector were not found to be noticeably important in most projects. These results support those described in [16].
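The R² scores reported above can be illustrated with a minimal sketch, assuming the standard definition of the coefficient of determination (1 minus the ratio of the residual sum of squares to the total sum of squares). The function name `r2_score` and the sample handling times are hypothetical and chosen only for illustration; they are not taken from the paper's data or code.

```python
from statistics import mean


def r2_score(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot (standard definition)."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    avg = mean(actual)
    # Residual sum of squares: error of the model's predictions.
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # Total sum of squares: error of always predicting the mean.
    ss_tot = sum((a - avg) ** 2 for a in actual)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all actual values are equal")
    return 1.0 - ss_res / ss_tot


# Hypothetical handling times (illustrative values, not from the paper).
actual_times = [2.0, 5.0, 9.0, 14.0]
predicted_times = [3.0, 5.0, 8.0, 13.0]
score = r2_score(actual_times, predicted_times)
print(round(score, 3))
```

Under this definition a score of 1 means the predictions match the observed handling times exactly, while a score of 0 means the model does no better than always predicting the mean handling time.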
Figure 6: R² score as a function of the fields used for learning features

To accurately compare our results to the work done in [7], we ran an experiment in which the target field was calculated in hours and the defects were categorized into Fast and Slow defects using the median of fixing time. Similarly, in this experiment we calculated the median for each project and partitioned the defects into two classes. We used the measures described in [7] and a Random Forest Classifier with parameters close to those used for our regressor (described in Section 1). The results presented in Table 3 are the average over all projects in our dataset, alongside the average results over all projects presented in the compared work when the initial defect data was used. Their data set contained six open source projects, with a similar number of defects to the current paper. Our results show an improvement over the results reported in the compared study.

Table 3: Performance comparison

7. VALIDITY

In this section we discuss the validity of our work by addressing the threats to validity of software engineering research proposed by Yin in [17].

Construct Validity. Our construct validity threats are mainly due to inaccuracies in calculating handling times, based on information stored in defect tracking systems. This is due to the "human factor": developers are expected to update the systems, often manually, to reflect the real work process, and have different expertise levels and variable work availability and productivity.

Internal Validity. Methods such as cross-validation were used to make sure the results are as accurate as possible. In cases where a phenomenon had alternative explanations (e.g. comparison
