thesis - Faculty of Information and Communication Technologies ...
thesis - Faculty of Information and Communication Technologies ...
thesis - Faculty of Information and Communication Technologies ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Chapter 5. Growth Dynamics<br />
large <strong>and</strong> complex classes need not directly service a large number <strong>of</strong><br />
other classes.<br />
5.4.2 Metric Data Distributions are not Normal<br />
S<strong>of</strong>tware systems that we analysed contained many hundreds <strong>of</strong> classes.<br />
But how are they distributed? Are they highly skewed, as found by other<br />
researchers? When we analysed this data, we found that our observations<br />
confirm findings from other researchers [21,55,223,270,299], in<br />
that they do not fit a gaussian distributions. Further, we consistently<br />
found positive values for skewness clearly indicating that in all cases<br />
the distributions are skewed to contain a fat tail.<br />
An example typical <strong>of</strong> the metric data in our data set is illustrated in Figure<br />
5.1 <strong>and</strong> it shows the relative frequency distributions, for the metrics<br />
Number <strong>of</strong> Methods <strong>and</strong> Fan-Out Count for release 2.5.3 <strong>of</strong> the Spring<br />
framework (a popular Java/J2EE light-weight application container).<br />
In both cases the distributions, are significantly skewed. However, the<br />
shape <strong>of</strong> distribution is different. This is a pattern that is recurring<br />
<strong>and</strong> common, that is, though the distributions are non-guassian <strong>and</strong><br />
positively skewed with fat tails, they are different for different systems<br />
<strong>and</strong> metrics. A complete list <strong>of</strong> all descriptive statistics <strong>and</strong> the result<br />
from our test for normality is presented in Appendix E.<br />
5.4.3 Evolution <strong>of</strong> Metric Distributions<br />
The upper <strong>and</strong> lower boundaries <strong>of</strong> the metric data distribution is bounded<br />
within a fairly narrow range. Figure 5.5 presents the boundaries <strong>of</strong> the<br />
histograms based on the minimum <strong>and</strong> maximum values <strong>of</strong> Number <strong>of</strong><br />
Branches, In-Degree Count, Number <strong>of</strong> Methods <strong>and</strong> Out-Degree Count<br />
attained across all versions <strong>of</strong> the Spring Framework. The figures show<br />
that relative frequency distributions <strong>of</strong> these measures have a distinct<br />
pr<strong>of</strong>ile that is bounded in a small range. The notable fact is that this<br />
remarkable stability is observable over an evolution period <strong>of</strong> 5 years.<br />
111