18.12.2012 Views

Proceedings

Proceedings

Proceedings

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Another indicator which is often calculated is Pred(25), which is the percentage of<br />

projects for which MRE is less than 25% (see, e.g., Shepperd and Schofield, 1997,<br />

and Vinay Kumar et al., 2008):<br />

1 n<br />

Pr ed(<br />

25)<br />

= × ∑ ( MRE ≤ 0,<br />

25)<br />

(10)<br />

n 1=<br />

i<br />

The last measure is the average square root error index (ASREI):<br />

N<br />

[ ( ) ] Yˆ<br />

= ∑ i − i Y<br />

1<br />

ASREI (10)<br />

N<br />

i=<br />

1<br />

Where N is the number of projects in the sample. ASREI is a robust indicator which is<br />

commonly used to evaluate goodness of fit on data (see, e.g., Hale et al., 2000; Smith<br />

et al., 2001; Pendharkar et al., 2008).<br />

Since LL-CD model predicts log effort, for the calculation of the accuracy measures<br />

corresponding to this model we take the anti-log values to compute the value of<br />

model-predicted effort. This procedure is commonly used in studies that consider loglinear<br />

models (see, e.g., Pendharkar, 2008).<br />

The four measures are calculated both using the original sample and a jackknife<br />

procedure. Jackknife is a validation technique consisting in removing each case and<br />

then using the rest of the cases to predict the removed one. This method has been used<br />

by some of the prior researchers (e.g., Shepperd and Schofield, 1997). The reason is to<br />

detect the overfitting problems that nonlinear/nonparametrical models often suffer.<br />

3. SAMPLE AND VARIABLES<br />

In this research we used the Desharnais dataset, which comprises 81 commercial<br />

projects developed by a Canadian software house in the late 80s. This dataset was<br />

initially compiled by Desharnais (1989) and is currently available at Sayyad Shirabad<br />

and Menzies (2005). It has been extensively used by researchers on software effort<br />

costs (Shepperd and Schofield, 1997; Burgess and Lefley, 2001; Oliveira et al., 2010;<br />

Azzeh et al., 2011, among many others).<br />

Despite the projects in the Desharnais database are more than 20 years old it is<br />

noticeable that it is still used in scientific research. One of its advantages is that all the<br />

projects were developed by the same software firm and the time span is relatively<br />

short. This leads to a better assessment of the models, as it guarantees the ceteris<br />

paribus requirement which must be fulfilled in the estimation of econometric models.<br />

That is, the variability of the sample with regard to organizational and technological<br />

factors which are not explicitly represented by variables in the dataset is lesser than in<br />

other datasets which comprise projects from several houses / several decades.<br />

Projects n. 38, 44, 66 and 75 of the Desharnais dataset have missing data. As well as<br />

many of the previous researchers that used this dataset, we discarded these projects.<br />

So, our final sample comprises the 77 cases with complete information.<br />

~ 686 ~

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!