20.01.2014 Views

thesis - Faculty of Information and Communication Technologies ...


Chapter 5. Growth Dynamics

abstractions that they need to focus on is well within the limits of their working memory. If this decision-making method is applied for each feature, over time, the asymmetric distribution observed in various metrics is reinforced.

5.5.7 Value of Gini Coefficient

The Gini Coefficient is a single bounded value, and hence offers an easy-to-use trigger for an investigation that would reveal potential causal mechanisms. One alternative to using the Gini Coefficient is to identify and observe the movement in outliers using a range of outlier detection techniques [24]. Another alternative is to use a combination of arithmetic mean, median, skewness and kurtosis to qualitatively deduce the nature of the distribution. Though these techniques may offer some insight, they do not digest the information in a way that takes the entire population size into consideration, and they are limited because there is no easy baseline to compare against. Furthermore, the outlier detection method highlights a set of classes rather than presenting information that permits direct comparisons between versions without additional analysis.
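To make the contrast with mean-based summaries concrete, the following is a minimal sketch of how a Gini Coefficient could be computed for one metric across all classes in a version. It uses the standard rank-based formula for sorted, non-negative values; the function name `gini` and the sample metric values are assumptions made for illustration and are not taken from the thesis.

```python
from statistics import mean


def gini(values):
    """Gini coefficient of non-negative values (0 = even, towards 1 = concentrated)."""
    xs = sorted(values)
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one value")
    if xs[0] < 0:
        raise ValueError("values must be non-negative")
    total = sum(xs)
    if total == 0:
        return 0.0  # every class has the value 0, so the distribution is even
    # Rank-weighted sum over the ascending values (ranks start at 1).
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


# Hypothetical methods-per-class counts with the same arithmetic mean.
even = [5, 5, 5, 5]
skewed = [1, 1, 1, 17]

print(mean(even), mean(skewed))   # identical means
print(gini(even), gini(skewed))   # different Gini Coefficients
```

Both sample distributions have the same mean, yet the second concentrates most of the metric in one class, and only the Gini Coefficient reflects that difference in a single value bounded between 0 and 1.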

Gini Coefficient based analysis offers information on what can be considered a typical range and provides a guideline for investigation, including for detecting machine-generated code and unusual patterns, via a single numerical value. The information provided by Gini Coefficients is helpful for solution architects and developers, especially when they need to take over a software system and maintain it. For example, if a system has machine-generated code, additional knowledge and skills might be needed in order to maintain and enhance the blocks of code that are automatically generated.
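As an illustration of using the Gini Coefficient as a trigger for investigation, the sketch below flags versions whose coefficient moves by more than a chosen amount relative to the previous version. The function `flag_versions`, the threshold of 0.05 and the version data are all hypothetical and chosen only for the example; the text above does not prescribe a particular threshold.

```python
def gini(values):
    """Gini coefficient of non-negative values, using the rank-based formula."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


def flag_versions(metric_by_version, threshold=0.05):
    """Return (version, previous_gini, current_gini) for each large change.

    metric_by_version is an ordered list of (version, values) pairs.
    threshold is a hypothetical cut-off, not a value from the thesis.
    """
    flagged = []
    previous = None
    for version, values in metric_by_version:
        current = gini(values)
        if previous is not None and abs(current - previous) > threshold:
            flagged.append((version, previous, current))
        previous = current
    return flagged


# Hypothetical per-class metric values for three releases.
history = [
    ("1.0", [2, 3, 4, 5, 6]),
    ("1.1", [2, 3, 4, 5, 7]),
    ("2.0", [1, 1, 1, 1, 60]),  # e.g. one very large generated class
]

for version, before, after in flag_versions(history):
    print(f"{version}: Gini moved from {before:.3f} to {after:.3f}")
```

A flagged version is only a prompt for closer inspection; identifying the cause, such as machine-generated code, still requires looking at the classes involved.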

Software Code Audit

The knowledge that Gini Coefficients have strong boundaries can be used to improve software comprehension. We measured the Gini Coefficient during two commercial code audits undertaken as part of a broader

