20.01.2014 Views

thesis - Faculty of Information and Communication Technologies ...

thesis - Faculty of Information and Communication Technologies ...

thesis - Faculty of Information and Communication Technologies ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 5. Growth Dynamics<br />

large <strong>and</strong> complex classes need not directly service a large number <strong>of</strong><br />

other classes.<br />

5.4.2 Metric Data Distributions are not Normal<br />

S<strong>of</strong>tware systems that we analysed contained many hundreds <strong>of</strong> classes.<br />

But how are they distributed? Are they highly skewed, as found by other<br />

researchers? When we analysed this data, we found that our observations<br />

confirm findings from other researchers [21,55,223,270,299], in<br />

that they do not fit a gaussian distributions. Further, we consistently<br />

found positive values for skewness clearly indicating that in all cases<br />

the distributions are skewed to contain a fat tail.<br />

An example typical <strong>of</strong> the metric data in our data set is illustrated in Figure<br />

5.1 <strong>and</strong> it shows the relative frequency distributions, for the metrics<br />

Number <strong>of</strong> Methods <strong>and</strong> Fan-Out Count for release 2.5.3 <strong>of</strong> the Spring<br />

framework (a popular Java/J2EE light-weight application container).<br />

In both cases the distributions, are significantly skewed. However, the<br />

shape <strong>of</strong> distribution is different. This is a pattern that is recurring<br />

<strong>and</strong> common, that is, though the distributions are non-guassian <strong>and</strong><br />

positively skewed with fat tails, they are different for different systems<br />

<strong>and</strong> metrics. A complete list <strong>of</strong> all descriptive statistics <strong>and</strong> the result<br />

from our test for normality is presented in Appendix E.<br />

5.4.3 Evolution <strong>of</strong> Metric Distributions<br />

The upper <strong>and</strong> lower boundaries <strong>of</strong> the metric data distribution is bounded<br />

within a fairly narrow range. Figure 5.5 presents the boundaries <strong>of</strong> the<br />

histograms based on the minimum <strong>and</strong> maximum values <strong>of</strong> Number <strong>of</strong><br />

Branches, In-Degree Count, Number <strong>of</strong> Methods <strong>and</strong> Out-Degree Count<br />

attained across all versions <strong>of</strong> the Spring Framework. The figures show<br />

that relative frequency distributions <strong>of</strong> these measures have a distinct<br />

pr<strong>of</strong>ile that is bounded in a small range. The notable fact is that this<br />

remarkable stability is observable over an evolution period <strong>of</strong> 5 years.<br />

111

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!