03.04.2017 Views

CoSIT 2017

Fourth International Conference on Computer Science and Information Technology ( CoSIT 2017 ), Geneva, Switzerland - March 2017

Fourth International Conference on Computer Science and Information Technology ( CoSIT 2017 ), Geneva, Switzerland - March 2017

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Computer Science & Information Technology (CS & IT) 133<br />

on the graph, we let the user calculate the release end date given confidence level and vice versa.<br />

We also let the user set the number of available developers.<br />

Figure 2 describes the flow for predicting the completion time for a set of defects. Each defect in<br />

the given set of unhandled defects passes through the same preprocessing flow as in the learning<br />

process, apart from the handling time calculation stage. Then, using the model, the system returns<br />

an estimation of the handling time of the given defect, as well as a prediction confidence<br />

calculated based on the variance between the answers of the individual regression trees.<br />

To automatically calculate the total time it takes to complete a set of defects by several<br />

developers, one would ideally use the optimal scheduling which minimizes the makespan (i.e. the<br />

total length of the schedule). However, the problem of finding the optimal makespan in this setup,<br />

better known as the Minimum Makespan Scheduling on Identical Machines, is known to be NP-<br />

Hard [12]. We employ a polynomial-time approximation scheme (PTAS) called List Scheduling,<br />

utilizing the Longest Processing Time rule (LPT). We start by sorting the defects by nonincreasing<br />

processing time estimation and then iteratively assign the next defect in the list to a<br />

developer with current smallest load.<br />

An approximation algorithm for a minimization problem is said to have a performance guarantee<br />

p, if it always delivers a solution with objective function value at most p times the optimum<br />

value. A tight bound on the performance guarantee of any PTAS for this problem in the<br />

deterministic case, was proved by Kawaguchi and Kyan, to be<br />

relatively satisfying performance guarantee of 4/3 for LPT [14].<br />

[13]. Graham proved a<br />

Figure 4: A screenshot of the actual system report.<br />

After computing the scheduling scheme, we end up with a list of handling times corresponding to<br />

developers. The desired makespan is the maximum of these times.<br />

Accounting for the codependency between defects is infeasible since it requires quantification of<br />

factors such as developers' expertise levels and current availability. It is also subject to significant<br />

variance even within a given project. In practice, we observed that treating defects as independent<br />

for the sake of this calculation yields very reasonable estimates. By the assumption that defects

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!