02.04.2013 Views

Download Chapters 3-6 (.PDF) - ODBMS

Download Chapters 3-6 (.PDF) - ODBMS

Download Chapters 3-6 (.PDF) - ODBMS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

For each customer, the following counts were computed:<br />

6.2. DEVIATIONS FROM POWER LAWS 39<br />

Partners: The total number of unique callers and callees associated with every user. Note that<br />

this is essentially the degree of nodes in the (undirected and unweighted) social graph, which<br />

has an edge between two users if either called the other.<br />

Calls: The total number of calls made or received by each user. In graph theoretic terms, this<br />

is the weighted degree in the social graph where the weight of an edge between two users is<br />

equal to the number of calls that involved them both.<br />

Duration: The total duration of calls for each customer in minutes. This is the weighted<br />

degree in the social graph where the weight of the edge between two users is the total duration<br />

of the calls between them.<br />

The dPln Distribution For the intuition and the fitting of the dPln distribution, see the work of<br />

Reed [238]. Here, we mention only the highlights.<br />

If a random variable X follows the four-parameter dPln distribution dPln(α, β, ν, τ), then<br />

the complete distribution is given by:<br />

<br />

fX = αβ<br />

α+β e (αν+α2τ 2 /2) x−α−1 log x−ν−ατ<br />

( 2<br />

τ<br />

xβ−1e (−βτ+β2τ 2 /2) c log x−ν+βτ<br />

( 2 <br />

τ )<br />

) +<br />

, (6.5)<br />

where and c are the CDF and complementary CDF of N(0, 1) (Gaussian with zero mean and<br />

unit variance). Intuitively, the pdf looks like a piece-wise linear curve in log-log scales; the parameter<br />

α and β are the slopes of the (asymptotically) linear parts, and ν and τ determine the knee of the<br />

curve.<br />

One easier way of understanding the double-Pareto nature of X is by observing that X can<br />

be written as X = S0 V1<br />

V2 where S0 is lognormally distributed, and V1 and V2 are Pareto distributions<br />

with parameters α and β. Note that X has a mean that is finite only if α>1 in which case it is given<br />

by<br />

αβ<br />

τ2<br />

eν+ 2 .<br />

(α − 1)(β + 1)<br />

In conclusion, the distinguishing features of the dPln distribution are two linear sub-plots in<br />

the log-log scale and a hyperbolic middle section.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!