08.10.2016 Views

Measuring the internet economy in The Netherlands a big data analysis 2016 | 14

measuring-the-internet-economy

measuring-the-internet-economy

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

To show, by proof of concept, <strong>the</strong> possibilities of new measurement methods for produc<strong>in</strong>g<br />

statistics.<br />

<strong>The</strong> <strong><strong>in</strong>ternet</strong> is one of <strong>the</strong> most promis<strong>in</strong>g avenues for <strong>big</strong> <strong>data</strong> collection and thus for<br />

<strong>in</strong>creas<strong>in</strong>g <strong>the</strong> availability of timely and relevant <strong>data</strong>. This study has made an important step<br />

<strong>in</strong> demonstrat<strong>in</strong>g how <strong>big</strong> <strong>data</strong> from <strong>the</strong> <strong><strong>in</strong>ternet</strong> can be comb<strong>in</strong>ed with exist<strong>in</strong>g standard <strong>data</strong><br />

sources <strong>in</strong> order to fur<strong>the</strong>r capitalise on <strong>the</strong> valuable <strong>in</strong>sights provided by <strong>big</strong> <strong>data</strong> from <strong>the</strong><br />

<strong><strong>in</strong>ternet</strong>. We have developed a complex l<strong>in</strong>k<strong>in</strong>g methodology (based on different l<strong>in</strong>k<strong>in</strong>g key<br />

comb<strong>in</strong>ations) which facilitates an optimised l<strong>in</strong>k between <strong>big</strong> <strong>data</strong> from <strong>the</strong> <strong><strong>in</strong>ternet</strong> and <strong>the</strong><br />

o<strong>the</strong>r standard sources. While <strong>the</strong>re are many challenges to overcome <strong>in</strong> comb<strong>in</strong><strong>in</strong>g <strong>big</strong> and<br />

standard <strong>data</strong> sources (which will be discussed later), this study has shown that mean<strong>in</strong>gful<br />

and novel <strong>in</strong>sights can <strong>in</strong>deed be generated <strong>in</strong> this way. <strong>The</strong> fourth research aim is:<br />

To expla<strong>in</strong> differences from standard statistical concepts/classifications and exist<strong>in</strong>g statistics.<br />

As has already been mentioned, <strong>the</strong> standard method for study<strong>in</strong>g particular sectors of <strong>the</strong><br />

<strong>economy</strong> is to rely on <strong>the</strong> SIC codes. <strong>The</strong> methodology <strong>in</strong> this study beg<strong>in</strong>s not by classify<strong>in</strong>g<br />

bus<strong>in</strong>esses, but by classify<strong>in</strong>g websites. <strong>The</strong> classification of bus<strong>in</strong>esses is made on <strong>the</strong><br />

basis of <strong>the</strong> classification of <strong>the</strong> websites which l<strong>in</strong>k to <strong>the</strong> bus<strong>in</strong>ess. This is a fundamentally<br />

different statistical approach to classify<strong>in</strong>g bus<strong>in</strong>esses.<br />

In section 3, we have discussed def<strong>in</strong>itions which are related to <strong>the</strong> concept of <strong>the</strong> <strong><strong>in</strong>ternet</strong><br />

<strong>economy</strong>. In some cases, <strong>the</strong> def<strong>in</strong>ition used <strong>in</strong> this study (henceforth ‘our def<strong>in</strong>ition’) def<strong>in</strong>es<br />

an <strong><strong>in</strong>ternet</strong> <strong>economy</strong> which can be loosely thought of as ‘subset’ of <strong>the</strong> <strong><strong>in</strong>ternet</strong> <strong>economy</strong><br />

conform<strong>in</strong>g to o<strong>the</strong>r def<strong>in</strong>itions. In o<strong>the</strong>r cases, <strong>the</strong>re is overlap between our def<strong>in</strong>ition and<br />

o<strong>the</strong>r def<strong>in</strong>itions, but nei<strong>the</strong>r can be considered a subset of <strong>the</strong> o<strong>the</strong>r. Our def<strong>in</strong>ition def<strong>in</strong>es<br />

an <strong><strong>in</strong>ternet</strong> <strong>economy</strong> which is a subset of <strong>the</strong> <strong><strong>in</strong>ternet</strong> economies derived by NIESR and<br />

Growth Intelligence (2013) and <strong>the</strong> OECD (2013). In <strong>the</strong> case of NIESR and Growth Intelligence<br />

(2013), our def<strong>in</strong>ition results <strong>in</strong> a subset because it exclude digital activities not related to <strong>the</strong><br />

<strong><strong>in</strong>ternet</strong>. In <strong>the</strong> case of OECD (2013), our def<strong>in</strong>ition results <strong>in</strong> a subset because it excludes <strong>the</strong><br />

social and cultural values of <strong>the</strong> <strong><strong>in</strong>ternet</strong>. <strong>The</strong> def<strong>in</strong>itions employed by CBS (<strong>2016</strong>) of <strong>the</strong> ICT<br />

<strong>economy</strong> and of <strong>the</strong> Boston Consult<strong>in</strong>g Group (2011), def<strong>in</strong>e economies which overlap with<br />

<strong>the</strong> <strong>economy</strong> result<strong>in</strong>g from our def<strong>in</strong>ition. In <strong>the</strong> cases of <strong>the</strong> CBS ICT <strong>economy</strong> def<strong>in</strong>ition,<br />

this is because only <strong>the</strong> <strong><strong>in</strong>ternet</strong> related elements of <strong>the</strong> ICT <strong>economy</strong> are <strong>in</strong>cluded <strong>in</strong> our<br />

def<strong>in</strong>ition. In <strong>the</strong> case of <strong>the</strong> Boston Consult<strong>in</strong>g Group, it is more challeng<strong>in</strong>g to identify <strong>the</strong><br />

source of <strong>the</strong> overlap because <strong>the</strong>ir def<strong>in</strong>ition is a macro-<strong>data</strong> approach <strong>in</strong>stead of <strong>the</strong> micro<strong>data</strong><br />

approach adopted <strong>in</strong> this study. In any case, <strong>the</strong> def<strong>in</strong>itions share some conceptual<br />

similarities. On this basis, we can expect overlap. F<strong>in</strong>ally, <strong>the</strong> central research question is:<br />

To explore <strong>the</strong> possibilities to deepen our understand<strong>in</strong>g of <strong>the</strong> importance of <strong>the</strong> <strong><strong>in</strong>ternet</strong><br />

<strong>economy</strong>.<br />

In this study, we have employed a method which demonstrates how it is possible to ga<strong>in</strong><br />

<strong>in</strong>sights <strong>in</strong>to <strong>the</strong> <strong><strong>in</strong>ternet</strong> <strong>economy</strong> by comb<strong>in</strong><strong>in</strong>g exist<strong>in</strong>g CBS micro-<strong>data</strong> with <strong>big</strong> <strong>data</strong><br />

available from <strong>the</strong> <strong><strong>in</strong>ternet</strong>. To properly explore <strong>the</strong> possibilities, we need to analyse <strong>the</strong><br />

strengths and weaknesses of this method.<br />

CBS | Discussion Paper, <strong>2016</strong> | <strong>14</strong> 42

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!