21.10.2014 Views

Diversity as a potential for surprise

Diversity as a potential for surprise

Diversity as a potential for surprise

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Diversity</strong> <strong>as</strong> a <strong>potential</strong> <strong>for</strong> <strong>surprise</strong><br />

An in<strong>for</strong>mation theoretic me<strong>as</strong>ure<br />

of effective product diversity<br />

Stefan Baumgärtner 1<br />

Interdisciplinary Institute <strong>for</strong> Environmental Economics,<br />

University of Heidelberg, Germany<br />

7 May 2004 (Version 4.2)<br />

Abstract: In the face of uncertainty, diversity causes <strong>surprise</strong>. Taking the view of<br />

a consumer with incomplete knowledge of the choice set, I propose to take the expected<br />

<strong>surprise</strong> when sequentially observing an allocation <strong>as</strong> a me<strong>as</strong>ure of effective<br />

diversity of that allocation. Expected <strong>surprise</strong> can be quantified <strong>as</strong> the expected<br />

in<strong>for</strong>mation content of an observation (entropy). In this approach, effective product<br />

diversity is determined by (i) the pure number of different products and (ii)<br />

the evenness of distribution of their market shares. The concept is illustrated by<br />

an empirical analysis of the allocation of p<strong>as</strong>senger cars in Germany.<br />

JEL-cl<strong>as</strong>sification: D11, L11, L62<br />

Key words: auto industry, entropy, diversity, in<strong>for</strong>mation, market concentration,<br />

product variety, <strong>surprise</strong>, uncertainty<br />

Correspondence:<br />

Dr. Stefan Baumgärtner, Interdisciplinary Institute <strong>for</strong> Environmental Economics,<br />

University of Heidelberg, Bergheimer Str. 20, D-69115 Heidelberg, Germany,<br />

phone: +49.6221.54-8012, fax: +49.6221.54-8020, email: baumgaertner@uni-hd.de,<br />

http://www.stefan-baumgaertner.de<br />

1 I am grateful to Switgard Feuerstein, Ralph Winkler and seminar participants in Heidelberg<br />

and Stockholm (EEA2003) <strong>for</strong> critical discussion and helpful comments on earlier drafts; to Eva<br />

Kiesele <strong>for</strong> research <strong>as</strong>sistance with the data evaluation used in Section 4; to the Energy and<br />

Resources Group at the University of Cali<strong>for</strong>nia/Berkeley, where the first draft of this manuscript<br />

w<strong>as</strong> written, <strong>for</strong> their hospitality in 2001/2002; to the Deutsche Forschungsgemeinschaft (DFG)<br />

<strong>for</strong> financial support; and – l<strong>as</strong>t, not le<strong>as</strong>t – to Clemens Puppe <strong>for</strong> most stimulating discussions<br />

on diversity.


1 Introduction<br />

In the face of uncertainty, diversity causes <strong>surprise</strong>. As an illustration by contr<strong>as</strong>t,<br />

imagine an economy where there exists only one make of car, now and <strong>for</strong>ever. (As<br />

a matter of history, the <strong>for</strong>merly socialist economies of central and e<strong>as</strong>tern Europe<br />

were pretty close to this extreme.) Living in such an economy, you would exactly<br />

know which car you would be driving <strong>for</strong> the rest of your life. You would exactly<br />

know which car your kids would get one day. If you were to win the main prize in<br />

the lottery – a brand new car – you would exactly know what car that would be.<br />

And if you were to die in a traffic accident, guess what car would have caused it.<br />

Without diversity, you really cannot expect much <strong>surprise</strong>.<br />

While the view of diversity <strong>as</strong> a <strong>potential</strong> <strong>for</strong> <strong>surprise</strong> may seem elusive, a<br />

number of more direct economic re<strong>as</strong>ons have been proposed why diversity matters:<br />

• <strong>Diversity</strong> of a portfolio of options provides insurance <strong>for</strong> risk averse agents,<br />

e.g. when investing in financial <strong>as</strong>sets (Eichberger and Harper 1997) or when<br />

devising corporate strategies (Berry 1975, Jacquemin and Berry 1979).<br />

• <strong>Diversity</strong> of current technology and knowledge is a <strong>potential</strong> resource b<strong>as</strong>e<br />

<strong>for</strong> future invention and innovation (Mowery and Rosenberg 1998, Schiller<br />

2001).<br />

• <strong>Diversity</strong> allows a competitive economy to efficiently adapt to un<strong>for</strong>eseen contingencies,<br />

such <strong>as</strong> evolutionary changes in preferences, technology, resource<br />

scarcity and institutions (Dekel et al. 1998, Matutinović 2001).<br />

• <strong>Diversity</strong> is tantamount <strong>for</strong> the freedom of choice of autonomous persons<br />

(Puppe 1996, Sen 1988, Weikard 1999).<br />

All of these arguments share the view that the economic relevance of diversity is<br />

intimately related to uncertainty or even fundamental ignorance about the future.<br />

And <strong>surprise</strong> is just one expression of such uncertainty (Shackle 1955, Faber et al.<br />

1992).<br />

The simplest me<strong>as</strong>ure of diversity – and <strong>for</strong> that sake let’s return to product diversity<br />

– is the number of different products available in an economy. This me<strong>as</strong>ure<br />

is used in the strand of literature on product differentiation under monopolistic<br />

competition and incre<strong>as</strong>ing returns to scale, which goes back to the seminal contributions<br />

of Spence (1976) and Dixit and Stiglitz (1977). It underlies the influential<br />

works on economic growth through product or factor differentiation (Ethier 1982,<br />

Romer 1987, 1990) and on gains from international trade in differentiated products<br />

(Dixit and Norman 1980, Ethier 1982, Helpman and Krugman 1985).<br />

The pure number of different products, however, does not tell anything about<br />

how different they are. In a more fundamental sense, diversity stems from the<br />

dissimilarity between different products in terms of some characteristic properties<br />

or attributes (Lanc<strong>as</strong>ter 1966). The diversity of a set of products may be defined <strong>as</strong><br />

an aggregate me<strong>as</strong>ure of their dissimilarity. Recently, Weitzman (1992, 1998), Gans<br />

2


and Hill (1997), Bernhofen (2001) and Nehring and Puppe (2002) have suggested<br />

various approaches <strong>for</strong> me<strong>as</strong>uring (product) diversity in such a way.<br />

All of these approaches are static and furthermore <strong>as</strong>sume complete knowledge<br />

about the number and properties of the different products available. Yet, the<br />

relevance of diversity stems exactly from not knowing all the available products<br />

and all their attributes. Such incomplete knowledge, I shall argue, should be<br />

constituent <strong>for</strong> the definition and me<strong>as</strong>urement of diversity. I want to suggest<br />

that product diversity can be defined and me<strong>as</strong>ured <strong>as</strong> the expected <strong>surprise</strong> that<br />

an unin<strong>for</strong>med person encounters when sequentially observing which products are<br />

present in the actual allocation of an economy. For simplification, I adopt the<br />

<strong>as</strong>sumption that all the different products are pairwise equally dissimilar. Thus,<br />

diversity becomes a matter merely of the number of different products.<br />

A distinction can then be made between pure product diversity and effective<br />

product diversity. Pure product diversity denotes the number of different products<br />

existent in an economy and is a technological, or objective, me<strong>as</strong>ure of diversity.<br />

Effective product diversity, on the other hand, adopts the consumers’ point of view<br />

and captures an unin<strong>for</strong>med economic agents’ perception of a diverse allocation.<br />

It, thus, is a subjective me<strong>as</strong>ure of product diversity.<br />

Consider, <strong>for</strong> example, an allocation with five different car models. Assume<br />

they have market shares of 50%, 40%, 6%, 3% and 1%. What effective diversity<br />

would you attribute to that allocation? In other words, if you were out there<br />

in the street and would watch the cars come by, one by one, how much <strong>surprise</strong><br />

can you expect? The pure product diversity which, in principle, exists in the<br />

allocation is five different car models. Effectively, however, you should not expect<br />

the same degree of <strong>surprise</strong>, or effective diversity, <strong>as</strong> if there were five different and<br />

equally abundant car models. The two with the highest market share dominate the<br />

allocation, and they also dominate your impression of diversity in that allocation.<br />

Effectively, your perceived diversity will be little more than two car models, <strong>as</strong><br />

you may probably not even see one of the car models with negligible market share<br />

when observing the actual aggregate allocation.<br />

In order to quantify the expected <strong>surprise</strong> of unin<strong>for</strong>med consumers when observing<br />

an allocation, I will employ concepts from in<strong>for</strong>mation theory, in particular<br />

the concepts of an ‘in<strong>for</strong>mation function’ and ‘entropy’ (Shannon 1948, Rényi<br />

1961), to derive a general me<strong>as</strong>ure of effective product diversity (Section 2). It<br />

will turn out that this me<strong>as</strong>ure of effective product diversity is intimately linked<br />

to common me<strong>as</strong>ures of market concentration (Section 3). It is identical to the<br />

cl<strong>as</strong>s of me<strong>as</strong>ures of market concentration suggested by Hannah and Kay (1977),<br />

and is a subcl<strong>as</strong>s of the general and unifying cl<strong>as</strong>s of concentration indices that<br />

h<strong>as</strong> been described in an axiomatic way by Foster and Shneyerov (1999). Thus,<br />

the interesting relation between effective product diversity and market concentration<br />

may also shed new light on the interpretation of different me<strong>as</strong>ures of market<br />

concentration, which have been criticized in the p<strong>as</strong>t <strong>for</strong> their apparent ‘lack of<br />

content’ (Shepherd 1987b: 639). The in<strong>for</strong>mation theoretic approach to me<strong>as</strong>uring<br />

effective product diversity put <strong>for</strong>ward here may provide such an economic ratio-<br />

3


nale <strong>for</strong> the use of certain concentration indices. Finally, I will apply the me<strong>as</strong>ure<br />

of effective product diversity in an empirical analysis of the allocation of p<strong>as</strong>senger<br />

cars in Germany 2003 (Section 4).<br />

2 Expected <strong>surprise</strong> from sequentially observing<br />

a diverse allocation<br />

2.1 Setting and notation<br />

Consider a market <strong>for</strong> the different varieties of a differentiated product, e.g. different<br />

models of p<strong>as</strong>senger cars. Let n ∈ IN be the total number of different product<br />

varieties which are in principle available on the market. Let x ∈ IR n be the actual<br />

aggregate allocation realized in the market and ¯x = ∑ n<br />

i=1 x i the total amount of<br />

output sold, where output is me<strong>as</strong>ured in physical units, i.e. number of individual<br />

cars sold of a particular make and model. Then s i = x i /¯x is the market share of<br />

product variety i (i = 1, . . . , n). Without loss of generality <strong>as</strong>sume that the different<br />

product varieties are numbered in the sequence of decre<strong>as</strong>ing market share,<br />

such that s i ≤ s i+1 <strong>for</strong> all i = 1, . . . , n − 1. For given n ∈ IN the set of possible<br />

market share distributions is D n , with a typical element s = (s 1 , . . . , s n ) with<br />

0 ≤ s i ≤ 1 <strong>for</strong> all i = 1, . . . , n and ∑ n<br />

i=1 s i = 1. The set of all possible market<br />

share distributions is D = ⋃ n∈IN Dn . For any function f : D → IR the restriction<br />

of f on D n is denoted by f n . The n coordinated vector of ones is denoted by<br />

1 n = (1, . . . , 1) ∈ IR n .<br />

Now imagine a person who h<strong>as</strong> the statistical in<strong>for</strong>mation about the aggregate<br />

allocation, i.e. she knows the number of different car models, n ∈ IN, and their<br />

market shares, s ∈ D n . What can we say about the impression of diversity that she<br />

will experience when observing that allocation? What is her degree of uncertainty,<br />

or expected <strong>surprise</strong>, about the actual allocation? Imagine that this person sets out<br />

to actually observe the allocation. For example, she might take a chair, sit down<br />

next to a busy street and just watch the cars come by. That way, her observations<br />

of the allocation are sequential, i.e. she observes the individual cars in the allocation<br />

one by one. Any single car coming by is a new observation. It reveals which event<br />

h<strong>as</strong> taken place, i.e. which car model h<strong>as</strong> actually occurred. If place and time are<br />

suitably chosen all events will be random draws from the population of all cars in<br />

the allocation. The probability <strong>for</strong> the next event to be car model i is then given<br />

by that model’s market share s i (i = 1, . . . , n). Actually, the vector of market<br />

shares s ∈ D n h<strong>as</strong> all the properties of a probability distribution over a complete<br />

set of n mutually exclusive events.<br />

2.2 Expected in<strong>for</strong>mation gain, expected <strong>surprise</strong>, and effective<br />

diversity<br />

In order to construct a me<strong>as</strong>ure of effective product diversity of the allocation<br />

described above (Section 2.1), this section introduces and applies a number of<br />

4


concepts well-known from in<strong>for</strong>mation theory (e.g. Aczél and Daróczy 1975, Theil<br />

1967). The actual me<strong>as</strong>ure of effective product diversity is then constructed in the<br />

following Section 2.3<br />

If it is known that the occurrence of car model i h<strong>as</strong> probability s i and if the<br />

observation reveals that it is indeed model i that h<strong>as</strong> occurred, how large will then<br />

be the observer’s <strong>surprise</strong> about this finding? Well, when s i = 0.99 she should not<br />

be <strong>surprise</strong>d at all to observe model i since it w<strong>as</strong> practically sure that the next<br />

car to come by would be of model i. In that c<strong>as</strong>e the observation h<strong>as</strong> very little<br />

in<strong>for</strong>mation content. If, on the other hand, s i = 0.01 she will be greatly <strong>surprise</strong>d<br />

to observe model i since it w<strong>as</strong> practically sure that this would not happen. In<br />

that c<strong>as</strong>e, the observation h<strong>as</strong> a very large in<strong>for</strong>mation content.<br />

The in<strong>for</strong>mation content of an observation thus depends on the probability that<br />

some event would take place be<strong>for</strong>e the actual observation w<strong>as</strong> made. It can be<br />

me<strong>as</strong>ured by an in<strong>for</strong>mation function which specifies the in<strong>for</strong>mation gain from observing<br />

an event with prior probability p with 0 ≤ p ≤ 1. The in<strong>for</strong>mation content<br />

of an observation also quantifies the observer’s uncertainty prior to actually making<br />

the observation. In this sense, expected in<strong>for</strong>mation and uncertainty are ‘dual<br />

concepts’ (Theil 1967: 25). While uncertainty prevails prior to the observation,<br />

in<strong>for</strong>mation is supplied by the observation, thus resolving the uncertainty. Assuming<br />

that the in<strong>for</strong>mation gain from observing a particular event depends only on<br />

the prior probability of that event, an in<strong>for</strong>mation function is usually defined in<br />

the following axiomatic way (e.g. Theil 1967: 6).<br />

Definition 1. A function h : ]0, 1] → IR with h = h(p) is an in<strong>for</strong>mation function<br />

if and only if it satisfies the following properties:<br />

(h-Continuity) h(p) is a continuous function of p <strong>for</strong> all 0 < p ≤ 1.<br />

(h-Monotonicity) h(p 1 ) > h(p 2 ) if 0 < p 1 < p 2 ≤ 1.<br />

(h-Additivity) h(p 1 p 2 ) = h(p 1 ) + h(p 2 ) if 0 < p 1 , p 2 ≤ 1.<br />

(h-Range) lim p→0 h(p) = +∞ and h(1) = 0.<br />

Continuity states that the gain in in<strong>for</strong>mation from observing an event does not<br />

change dr<strong>as</strong>tically when the prior probability of that event only changes slightly.<br />

Monotonicity states that the higher the prior probability of an event, the lower is<br />

your <strong>surprise</strong> when actually observing it. Additivity states that the in<strong>for</strong>mation<br />

gain from observing two stoch<strong>as</strong>tically independent events happening jointly is<br />

simply the sum of in<strong>for</strong>mation gains from observing these two events separately.<br />

While Continuity, Monotonicity and Additivity are the core of any axiomatic foundation<br />

of an in<strong>for</strong>mation function, there is some arbitrariness in choosing the range<br />

of this function. The Range property serves to put absolute numbers on the quantitative<br />

me<strong>as</strong>ure of <strong>surprise</strong>. It states that your <strong>surprise</strong> should be infinite when<br />

observing an event that w<strong>as</strong> to occur with zero probability. Conversely, you should<br />

not be <strong>surprise</strong>d at all when observing an event that w<strong>as</strong> to occur <strong>for</strong> sure. Together<br />

with Monotonicity the Range property implies that h(p) yields nonnegative<br />

5


values <strong>for</strong> all 0 < p ≤ 1. This can be interpreted <strong>as</strong> saying that the in<strong>for</strong>mation<br />

gain from an observation cannot be negative.<br />

The four properties uniquely (up to a positive multiplicative constant) specify<br />

a functional <strong>for</strong>m of the in<strong>for</strong>mation function.<br />

Proposition 1. A function h : ]0, 1] → IR is an in<strong>for</strong>mation function if and only<br />

if it is given by<br />

h(p) = − log p, (1)<br />

where any positive real number may be chosen <strong>as</strong> the b<strong>as</strong>e of the logarithm.<br />

Proof : It is e<strong>as</strong>y to verify that h(p) = − log p satisfies all the properties of an<br />

in<strong>for</strong>mation function. For the proof that it is the only functional <strong>for</strong>m that does<br />

so, see Khinchin (1957: 9-13).<br />

✷<br />

The logarithm in the functional representation of h is due to the requirement of<br />

Additivity. In in<strong>for</strong>mation theory it h<strong>as</strong> become customary to use the b<strong>as</strong>e 2 <strong>for</strong> the<br />

logarithm since this is consistent with an interpretation of in<strong>for</strong>mation content in<br />

terms of bits, i.e. the number of answers to yes/no-questions. But <strong>as</strong> the definition<br />

is only unique up to a positive multiplicative constant one may choose any positive<br />

real number <strong>as</strong> a b<strong>as</strong>e. I will follow the general tendency in economics when dealing<br />

with logarithms and use the natural logarithm, i.e. the b<strong>as</strong>e e. If we express, <strong>as</strong><br />

we will do in the end, effective diversity not <strong>as</strong> the expected <strong>surprise</strong> in terms of<br />

expected in<strong>for</strong>mation gain, but convert this value into a numbers-equivalent, the<br />

choice of the b<strong>as</strong>e does not matter anyway.<br />

Having now an idea of the ‘in<strong>for</strong>mation content’ of an observation and the<br />

<strong>surprise</strong> of the observer, we can return to the person sitting next to the street and<br />

watching the cars come by. Knowing the probability distribution s, she will <strong>as</strong>sign<br />

an in<strong>for</strong>mation content of h(s i ) = − log s i to the event that a car of model i comes<br />

by be<strong>for</strong>e making the actual observation. As there are n different possible events<br />

with a probability distribution s = (s 1 , . . . , s n ), the expected in<strong>for</strong>mation content<br />

of the observation is:<br />

n∑<br />

n∑<br />

H n (s) = s i h(s i ) = − s i log s i , (2)<br />

i=1<br />

i=1<br />

which is the well-known Shannon (1948) entropy. 2 Me<strong>as</strong>uring the expected in<strong>for</strong>mation<br />

content of an observation, it can also be taken <strong>as</strong> a me<strong>as</strong>ure of expected<br />

<strong>surprise</strong> or uncertainty. Thus, it is ‘a me<strong>as</strong>ure of observational variety or actual<br />

(<strong>as</strong> opposed to logically possible) diversity’ (Krippendorff 1986: 15), taking into<br />

account that there are n different car models in the allocation and that they may<br />

be unevenly distributed.<br />

2 The choice of the name ‘entropy’ is due to its <strong>for</strong>mal resemblance to the entropy expression<br />

from Statistical Thermodynamics. It should be noted, however, that the name entropy w<strong>as</strong><br />

chosen by Shannon b<strong>as</strong>ed on a purely <strong>for</strong>mal analogy and that <strong>for</strong>mula (2) does not have any<br />

substantial relation with the physical entropy concept <strong>as</strong> known from thermodynamics.<br />

6


For s i = 0 the product s i log s i is not defined. It is there<strong>for</strong>e customary to<br />

define<br />

0 log 0 := lim<br />

si →0 s i log s i = 0.<br />

Obviously, H n (s) is non-negative <strong>for</strong> all n ∈ IN and <strong>for</strong> all s ∈ D n . Its minimum<br />

value is zero and is attained, <strong>for</strong> given n, when s i = 1 <strong>for</strong> some i ∈ {1, . . . , n} and<br />

s j = 0 <strong>for</strong> all j ∈ {1, . . . , n}\{i}. In words, when event i is known to occur with<br />

certainty there is no <strong>surprise</strong> to be expected when observing which event actually<br />

h<strong>as</strong> occurred. H n (s) <strong>as</strong>sumes its maximal value, <strong>for</strong> given n, when s = 1 n 1n , i.e.<br />

s i = 1/n <strong>for</strong> all i = 1, . . . , n. 3 In words, when all events are equally likely the<br />

uncertainty prior to the observation is maximal, and so is the expected <strong>surprise</strong> by<br />

the observation. Evaluating H n (s) <strong>as</strong> given by Equation (2) <strong>for</strong> s = 1 n 1n reveals<br />

that this maximum value is log n. For given n the expected <strong>surprise</strong> is there<strong>for</strong>e<br />

bounded by<br />

0 ≤ H n (s) ≤ log n. (3)<br />

The maximum thus incre<strong>as</strong>es with n, the number of different possible events. This<br />

is in accordance with the intuition that the uncertainty prior to the observation,<br />

and the expected <strong>surprise</strong> experienced in the observation, incre<strong>as</strong>es with the number<br />

of possibilities. For given n the value of H n (s) incre<strong>as</strong>es with the uni<strong>for</strong>mity<br />

or evenness of the distribution. The more evenly the probabilities of different<br />

events are distributed, the higher is the uncertainty prior to the observation. For<br />

example, if there are two different car models and their probabilities are (0.50,<br />

0.50), your uncertainty about which model to observe next is higher than if the<br />

probabilities were (0.95, 0.05), in which c<strong>as</strong>e you will be little <strong>surprise</strong>d to observe<br />

the first model.<br />

The expected in<strong>for</strong>mation content of the observation, Equation (2), h<strong>as</strong> been<br />

calculated <strong>as</strong> the expectation value of the in<strong>for</strong>mation functions (1) of the individual<br />

events,<br />

h(s i ) = − log s i <strong>for</strong> 0 < s i ≤ 1.<br />

In analogy to this expression, Equation (2) can be written <strong>as</strong><br />

H n (s) = − log G n (s) <strong>for</strong> s ∈ D n , (4)<br />

where G n may be interpreted <strong>as</strong> a kind of average probability. From Equations (2)<br />

and (4) it follows that G n is given by the geometric mean of the probabilities s i<br />

(i = 1, . . . , n) weighted with the same probabilities <strong>as</strong> weight:<br />

G n (s) =<br />

n∏<br />

i=1<br />

s s i<br />

i <strong>for</strong> s ∈ D n . (5)<br />

It is possible to define more general weighted means G n and thus to obtain<br />

from equation (4) more general entropies, of which Shannon-entropy is then only<br />

3 This is e<strong>as</strong>ily confirmed by maximizing expression (2) over all s ∈ D n <strong>for</strong> given n ∈ N subject<br />

to the constraint that ∑ n<br />

i=1 s i = 1.<br />

7


a special c<strong>as</strong>e (Aczél and Daróczy 1975: Chap. 5). For example, with<br />

( n<br />

)<br />

G n ∑ 1/(α−1)<br />

α(s) = s α i <strong>for</strong> all α > 0, n ∈ IN, s ∈ D n , (6)<br />

i=1<br />

<strong>as</strong> a generalized weighted mean of order α where, again, the weights are given<br />

by the probabilities s i , one obtains the following cl<strong>as</strong>s of one-parameter functions<br />

H α : D → IR, which is due to Rényi (1961):<br />

Definition 2. The following one-parameter functions H α : D → IR are called<br />

entropies of order α, or Rényi-entropies, of the probability distribution s, <strong>for</strong> all<br />

α > 0:<br />

⎧<br />

⎨<br />

Hα(s) n =<br />

⎩<br />

1<br />

log 1−α (∑ n<br />

i=1 s α i ) ; α > 0, α ≠ 1<br />

lim α→1<br />

1<br />

1−α log (∑ n<br />

i=1 s α i ) ; α = 1<br />

<strong>for</strong> all n ∈ IN (7)<br />

For α = 1 one obtains Expression (2) <strong>for</strong> the Shannon-entropy <strong>as</strong> a special<br />

c<strong>as</strong>e. 4 One can show that all the Hα(s) n have b<strong>as</strong>ically the same properties <strong>as</strong><br />

Shannon-entropy H1 n (s) (Rényi 1961, Aczél and Daróczy 1975: Chap. 5). They<br />

may there<strong>for</strong>e legitimately be regarded <strong>as</strong> general me<strong>as</strong>ures of expected in<strong>for</strong>mation.<br />

Proposition 2. The Rényi-entropies, Equation (7), have the following properties<br />

<strong>for</strong> all α > 0, n ∈ IN and s ∈ D n :<br />

(H-Continuity) Hα(s) n is a continuous function of s.<br />

(H-Symmetry) Hα(s) n = Hα(P n s) <strong>for</strong> all permutation matrices P .<br />

(H-Maximum) Hα( n 1 n 1n ) > H n (s) <strong>for</strong> all s ∈ D n \{ 1 n 1n }.<br />

(H-Additivity) Hα mn (r ∗ s) = H m (r) + H n (s) <strong>for</strong> all m ∈ IN, r ∈ D m ,<br />

s ∈ S n where r ∗ s is the probability distribution consisting<br />

of probabilities r i s j <strong>for</strong> all i = 1, . . . , m, j = 1, . . . , n.<br />

Proof : E<strong>as</strong>y to verify.<br />

✷<br />

The Symmetry property states that H n α is symmetric in all its arguments, i.e.<br />

it does not matter <strong>for</strong> the me<strong>as</strong>urement of expected in<strong>for</strong>mation gain in what sequence<br />

the different possible events are numbered. The Maximum property states<br />

4 Using the theorem of Bernoulli and l’Hospital, by which lim α→1 φ(α)/ψ(α) =<br />

lim α→1 φ ′ (α)/ψ ′ (α) if φ(1) = ψ(1) = 0 and the derivatives of φ and ψ exist at α = 1 and<br />

ψ ′ (1) ≠ 0, and noting that ∑ n<br />

i=1 s i = 1, one h<strong>as</strong><br />

lim<br />

α→1<br />

log ( ∑ n<br />

i=1 sα i ) = lim<br />

1 − α<br />

∑ n<br />

i=1 (sα i log s i)<br />

n∑<br />

α→1 (−1) ∑ n = − s i log s i .<br />

i=1 sα i<br />

i=1<br />

8


that H n α reaches its (unique) maximum <strong>for</strong> a completely uni<strong>for</strong>m distribution. Its<br />

minimum value is zero when one event h<strong>as</strong> probability one, and its maximum<br />

value, attained <strong>for</strong> an absolutely uni<strong>for</strong>m probability distribution, is log n. The<br />

parameter α determines how much weight is given to n and how much to the evenness<br />

of the probability distribution s in calculating an overall expected in<strong>for</strong>mation<br />

value b<strong>as</strong>ed on both, number of different events and evenness of their probability<br />

distribution. I will discuss the properties of expression (7) in more detail in the<br />

next section.<br />

Rényi’s generalized entropy (Equation 7) is a general me<strong>as</strong>ure of expected in<strong>for</strong>mation<br />

content of an observation. By that token it is also a me<strong>as</strong>ure of the<br />

expected <strong>surprise</strong> when sequentially observing an allocation that is only known in<br />

statistical terms. As I have suggested to conceptualize effective diversity <strong>as</strong> expected<br />

<strong>surprise</strong>, it seems re<strong>as</strong>onable to take Expression (7) <strong>as</strong> a me<strong>as</strong>ure of effective<br />

diversity.<br />

2.3 Numbers-equivalent me<strong>as</strong>ure of effective diversity<br />

One can establish direct and meaningful comparability of H n α(s) with the pure<br />

diversity n by constructing a numbers-equivalent me<strong>as</strong>ure of effective diversity in<br />

the following way. H n α(s) is a logarithmic me<strong>as</strong>ure; it yields log n <strong>as</strong> its maximal<br />

and 0 <strong>as</strong> its minimal value. In order to compare it directly and in a meaningful<br />

manner to the pure product variety, n, which is defined on a linear scale, we should<br />

essentially take the exponential of H n α(s) to be the numbers-equivalent me<strong>as</strong>ure of<br />

effective product diversity. Formally, this is achieved in the following way.<br />

Definition 3. For all n ∈ IN the numbers equivalent me<strong>as</strong>ure of effective diversity<br />

of an actual aggregate allocation x ∈ IR n with pure diversity n is the equivalent<br />

number ν of different (hypothetical) products which would yield the same expected<br />

<strong>surprise</strong> <strong>as</strong> the actual allocation x when the total output ¯x were evenly distributed<br />

over these ν products.<br />

Formally, taking Rényi’s generalized entropy (Equation 7) <strong>as</strong> a me<strong>as</strong>ure <strong>for</strong> the<br />

expected <strong>surprise</strong>, ν is implicitly defined by<br />

H ν α( 1 ν 1ν ) = H n α(s) <strong>for</strong> all n ∈ IN, s ∈ D n . (8)<br />

The solution of this defining condition is explicitly stated in the following proposition.<br />

Proposition 3. With Rényi’s generalized entropy (Equation 7) <strong>as</strong> a me<strong>as</strong>ure of<br />

the expected <strong>surprise</strong> by an observation, the numbers equivalent me<strong>as</strong>ure of effective<br />

diversity is a function ν : D → IR with:<br />

⎧<br />

⎪⎨ ( ∑ n<br />

να(s) n i=1 s α i ) 1/(1−α) ; α > 0, α ≠ 1<br />

=<br />

⎪⎩ lim a→1 ( ∑ <strong>for</strong> all n ∈ IN. (9)<br />

n<br />

i=1 s α i ) 1/(1−α) ; α = 1.<br />

9


Proof : As the entropy of a distribution 1 ν 1ν of ν different products with equal<br />

probability 1/ν is given from Equation (7) by Hα( ν 1 ν 1ν ) = log ν it follows that<br />

ν = exp Hα(s) n is the number we are looking <strong>for</strong>. With expression (7) <strong>for</strong> Hα(s),<br />

n<br />

expression (9) follows immediately.<br />

✷<br />

We now have the me<strong>as</strong>ure we are looking <strong>for</strong>. While the generalized entropy<br />

H n α (Equation 7) me<strong>as</strong>ures the effective product diversity of an allocation in terms<br />

of the expected in<strong>for</strong>mation gain or expected <strong>surprise</strong> when sequentially observing<br />

the allocation, the numbers-equivalent ν n α (Equation 9) expresses the effective diversity<br />

by a number which can be directly compared to the pure diversity of that<br />

allocation, n. Both me<strong>as</strong>ures are fully equivalent, of course, since ν n α = exp H n α.<br />

3 Properties of effective product diversity<br />

While H n (Equation 7) is <strong>for</strong> all α > 0, n ∈ IN, s ∈ D n bounded by 0 ≤ Hα(s) n ≤<br />

log n, the corresponding numbers-equivalent να n (Equation 9) is bounded by 1 ≤<br />

να<br />

n ≤ n. Both incre<strong>as</strong>e with n and with the evenness of the of the probability<br />

distribution from which they are calculated.<br />

3.1 Effective diversity and market concentration<br />

As ν n α is bounded by 1 ≤ ν n α ≤ n the following identity holds <strong>for</strong> all α > 0, n ∈ IN,<br />

s ∈ D n :<br />

ν n α(s) ≡ η n α(s) · n, (10)<br />

where the fraction<br />

η n α(s) = νn α(s)<br />

n<br />

= νn α(s)<br />

ν n α(s) | max (11)<br />

is an index of evenness of the distribution s with 1/n ≤ ηα(s) n ≤ 1. At the<br />

same time, the ratio η indicates to what fraction of its pure value n diversity gets<br />

effectively reduced due to unevenly distributed relative shares. While both H and<br />

ν are extensive quantities, i.e. their value incre<strong>as</strong>es with n, the ratio η is normalized<br />

to n and thus is an intensive quantity, i.e. its value only depends on the evenness of<br />

s and not on the dimension n of s. If the distribution is absolutely even, s = 1 n 1n ,<br />

the index (11) <strong>as</strong>sumes its maximal value, ηα(s) n = 1, and the effective diversity<br />

of the allocation is given by its pure diversity, να(s) n = n. On the other hand, if<br />

the allocation is maximally uneven, s i = 1 <strong>for</strong> some i (i = 1, . . . , n) and s j = 0<br />

<strong>for</strong> all j ≠ i, then the evenness index (11) <strong>as</strong>sumes its minimal value <strong>for</strong> given n,<br />

s = 1/n, and the effective diversity of the allocation is one, να(s) n = 1.<br />

In a world where every firm produces exactly one product variety the evenness<br />

of the distribution s is equivalent to the market concentration. Identity (10) thus<br />

expresses one key result of the approach developed here: market concentration<br />

effectively reduces the diversity of an allocation. At the same time, identity (10)<br />

provides a simple tool to calculate the effective product diversity from empirical<br />

in<strong>for</strong>mation about the pure product diversity and the market concentration.<br />

10


Hence it becomes apparent that the me<strong>as</strong>ure of effective product diversity proposed<br />

here (Equation 9) is intimately linked to me<strong>as</strong>ures of market concentration.<br />

In fact, it is identical to the cl<strong>as</strong>s of (numbers-equivalent) me<strong>as</strong>ures of market concentration<br />

suggested by Hannah and Kay (1977) and derived from the normative<br />

framework of Blackorby et al. (1982) by Chakravarty (1988). It is a subcl<strong>as</strong>s of<br />

the general and unifying cl<strong>as</strong>s of (numbers-equivalent) concentration indices that<br />

h<strong>as</strong> been described in an axiomatic way by Foster and Shneyerov (1999). This will<br />

become obvious from looking at various special c<strong>as</strong>es in the following.<br />

3.2 The role of α<br />

One can show that H n α(s) is continuous, differentiable and decre<strong>as</strong>ing in the parameter<br />

α <strong>for</strong> all α > 0 (Aczél and Daróczy 1975).<br />

One of the properties of ν n α(s) (Equation 9) is that <strong>for</strong> given n and s the value of<br />

ν n α(s) decre<strong>as</strong>es with α. As the most widely used diversity or concentration indices<br />

(see below) can all be expressed <strong>as</strong> special c<strong>as</strong>es of Equation (9) <strong>for</strong> different values<br />

of α, it becomes evident that the results <strong>for</strong> the effective species number yielded<br />

by these indices are related in the following way:<br />

ν 0 ≥ ν 1 ≥ ν 2 ≥ ν +∞ , (12)<br />

where equality only holds in the c<strong>as</strong>e of equal market shares.<br />

3.3 Special c<strong>as</strong>es<br />

For particular values of α one can recover from Equations (7) and (9) a number<br />

of different special c<strong>as</strong>es which correspond to well-known market concentration<br />

indices. For α = 1 and α = 2 one obtains from expressions (7) and (9) indices<br />

of effective product diversity which correspond to the two indices most widely<br />

used to me<strong>as</strong>ure industry concentration, the Shannon (1948)-entropy index and<br />

the Hirschman (1945, 1961)-Herfindahl (1951) index. In general, <strong>for</strong> 0 < a < +∞<br />

Equations (7) and (9) yield an index of effective diversity which takes into account<br />

both the pure number of different product varieties, n, and the (un)evenness of the<br />

distribution of their shares s. The different H n α(s) and ν n α(s) differ in the extent to<br />

which they include or exclude the relatively less abundant product varieties. The<br />

smaller α, the more are less abundant product varieties included in the me<strong>as</strong>ure<br />

of effective diversity, with α = 0 being the extreme c<strong>as</strong>e in which all varieties<br />

are equally included. The larger α, the more emph<strong>as</strong>is is given to more common<br />

product varieties in the estimate of the effective diversity, with α = +∞ being the<br />

extreme c<strong>as</strong>e in which only the most common one is taken into account.<br />

3.3.1 The total number of firms (α = 0)<br />

Obviously, <strong>for</strong> α = 0 Equation (9) reduces to ν n 0 (s) = n. That is, the zeroth order<br />

effective product diversity is just the pure diversity, n, the number of different<br />

products in the allocation. Correspondingly, H n 0 (s) = log n. This means, to zeroth<br />

11


order the indices (7) and (9) take into account all products equally, irrespective of<br />

their market shares.<br />

3.3.2 The Shannon index (α = 1)<br />

Substituting α = 1 into Equations (7) and (9) yields<br />

H1 n (s) =<br />

n∑<br />

− s i log s i , (13)<br />

i=1<br />

ν n 1 (s) = exp H n 1 (s) = Π n i=1<br />

( 1<br />

s i<br />

) si<br />

(14)<br />

where H1<br />

n is the Shannon-expression <strong>for</strong> entropy (Equation 2) introduced above.<br />

For a given distribution of market shares the Shannon-index (Equation 14)<br />

incre<strong>as</strong>es with n, the total number of different product varieties in the allocation.<br />

It yields its maximal value when all n different varieties have equal market share,<br />

s i = 1/n (i = 1, . . . , n). In that c<strong>as</strong>e, ν1<br />

n = n, which means that the effective<br />

number of different product varieties equals their pure number. With unequal<br />

market shares, the index yields values smaller than n. The index <strong>as</strong>sumes its<br />

minimal value when an allocation is dominated by one single product variety, with<br />

all others having negligible relative share. In that c<strong>as</strong>e, s i ≈ 0 <strong>for</strong> all i = 1, . . . , n<br />

except i = i ∗ , where i ∗ denotes the dominant product variety, s i ∗ ≈ 1. In that<br />

c<strong>as</strong>e, ν1 n ≈ 1, which means that the effective number of different product varieties<br />

is only negligibly larger than one. In general, <strong>for</strong> given value of n the value of<br />

ν1 n (s) can vary from one to n depending on the variation in market shares, s.<br />

Table 1 illustrates the working of the Shannon-index <strong>for</strong> different hypothetical<br />

allocations. In a hypothetical allocation A 1 with four different product varieties<br />

(Table 1, column 2), where all four have equal market share 0.25, the Shannonindex<br />

<strong>as</strong>sumes its maximal value, ν1<br />

4 = 4, and thus equals the total number of<br />

different product varieties in that market, n = 4. Similarly, if the number of<br />

equally abundant product varieties incre<strong>as</strong>es to n = 5, then ν1 5 = n = 5 (Table 1,<br />

column 3). Allocations A 3 and A 4 (Table 1, columns 4 and 5) illustrate that with<br />

n − 1 equally abundant product varieties and one much less abundant variety,<br />

i = 5 in the example, the Shannon index of effective product diversity will be only<br />

slightly greater than n − 1. The smaller s 5 , the closer ν1<br />

5 approaches n − 1. A<br />

comparison of allocations A 2 and A 6 (Table 1, columns 3 and 7) shows that between<br />

two allocations the effective diversity, ν1 n , can decre<strong>as</strong>e although pure diversity, n,<br />

actually incre<strong>as</strong>es This is due to the incre<strong>as</strong>e in concentration outweighing the<br />

incre<strong>as</strong>e in pure diversity.<br />

3.3.3 The Hirschman-Herfindahl index (α = 2)<br />

With α = 2 one obtains from Equation (9) the following index:<br />

n∑<br />

ν2 n (s) = 1/ s 2 i . (15)<br />

i=1<br />

12


product variety i<br />

share s i in allocation<br />

A 1 A 2 A 3 A 4 A 5 A 6<br />

i = 1 0.25 0.20 0.24 0.249 0.50 0.50<br />

i = 2 0.25 0.20 0.24 0.249 0.30 0.30<br />

i = 3 0.25 0.20 0.24 0.249 0.10 0.10<br />

i = 4 0.25 0.20 0.24 0.249 0.07 0.07<br />

i = 5 - 0.20 0.04 0.004 0.03 0.01<br />

i = 6 - - - - - 0.01<br />

i = 7 - - - - - 0.01<br />

α = 0: ν0 n = n 4 5 5 5 5 7<br />

( )<br />

α = 1: ν1 n = Π n si 1<br />

i=1 s i<br />

4.00 5.00 4.48 4.08 3.42 3.53<br />

α = 2: ν2 n = 1/ ( ∑ n<br />

i=1 s 2 i ) 4.00 5.00 4.31 4.03 2.81 2.82<br />

α = +∞: ν+∞ n = 1/s 1 4.00 5.00 4.17 4.02 2.00 2.00<br />

Table 1: Effective product diversity να<br />

n (Equation 9) <strong>for</strong> different values of the<br />

parameter α and <strong>for</strong> different hypothetical allocations A j (j = 1, . . . , 6) which<br />

are characterized by different number n and different relative shares s i of product<br />

varieties.<br />

This is the well known (inverse) market concentration index due to Hirschman<br />

(1945, 1961) and Herfindahl (1951). Its properties are qualitatively similar to<br />

those of the Shannon index, ν1 n (Equation 14). As in the c<strong>as</strong>e of the Shannon index,<br />

higher values of ν2<br />

n represent a greater effective product diversity in the sense of<br />

a combination of a higher pure product diversity, n, and a more homogeneous<br />

distribution of market shares, s. Also like the Shannon index, the Hirschman-<br />

Herfindahl index gives less weight to less abundant product varieties than to more<br />

abundant ones in calculating the effective diversity. Table 1 presents values of<br />

ν2<br />

n <strong>for</strong> different hypothetical allocations, which may be compared directly to the<br />

Shannon index, ν1 n .<br />

The Hirschman-Herfindahl index is strongly weighted towards the most abundant<br />

product variety in the market while being less sensitive to differences in<br />

small market shares and in the pure number of product varieties, <strong>as</strong> can be seen<br />

from comparing allocations A 5 and A 6 in Table 1 (columns 6 and 7). Being a<br />

logarithmic me<strong>as</strong>ure of effective diversity, the Shannon index is more sensitive to<br />

differences in small market shares than the Hirschman-Herfindahl index. On the<br />

other hand, it is less sensitive to small differences in large market shares, where<strong>as</strong><br />

the Hirschman-Herfindahl index responds more substantially to these differences.<br />

3.3.4 The market leader’s concentration ratio (α → +∞)<br />

As α approaches infinity, ν n α(s) goes to 1/s 1 , the inverse market share of the product<br />

variety with the highest market share:<br />

ν n +∞(s) = 1/s 1 . (16)<br />

13


This index, ν n +∞(s), corresponds to the (inverse) concentration ratio CR1 (Shepherd<br />

1987a). It can be interpreted <strong>as</strong> an effective number of product varieties in<br />

the sense that 1/s 1 gives the equivalent number of equally abundant (hypothetical)<br />

product varieties with the same market share <strong>as</strong> the leader. If, <strong>for</strong> example, in an<br />

allocation with n = 5 different product varieties the leading one h<strong>as</strong> a share of<br />

s 1 = 0.5, with the other four having smaller shares, then the effective number of<br />

product varieties in that allocation would be ν 5 +∞ = 1/0.5 = 2 (Table 1, column 6).<br />

ν n +∞ obviously only pays attention to the relative dominance of the most abundant<br />

product variety, neglecting all others.<br />

4 Illustration: P<strong>as</strong>senger cars in Germany<br />

To illustrate the concepts introduced above let us now turn to one particular<br />

example of a concentrated allocation of a differentiated product, namely p<strong>as</strong>senger<br />

cars in Germany. If you were observing the actual aggregate allocation one by<br />

one, you should not be much <strong>surprise</strong>d to observe a Volkswagen Golf/Bora, since<br />

this is the most abundant car model with a relative share of 10.4% of all cars.<br />

Your <strong>surprise</strong> should be much greater if you spotted a Chrysler Le Baron or a<br />

Ferrari F355, since these have relative shares below 0.003%. But how large is your<br />

expected <strong>surprise</strong> overall, and thus the effective diversity of that allocation?<br />

The total stock of p<strong>as</strong>senger cars in Germany on January 01, 2003 is reported<br />

by the Federal Office of Motor Vehicles (Kraftfahrt-Bundesamt 2004) <strong>for</strong> each<br />

make and model. 5 In total, 44.657.303 p<strong>as</strong>senger cars were registered effective<br />

this date. Of these, ¯x = 38.587.685 can be attributed to a particular model<br />

of car. In the following, all calculations are b<strong>as</strong>ed on this smaller number of<br />

explicitly attributable registrations. They comprise a pure product diversity of<br />

n = 395 different models. The respective numbers x i and relative shares s i (i =<br />

1, . . . , 395) <strong>for</strong> each individual model are reported in the appendix. They range<br />

from 4,025,254 registrations <strong>for</strong> the Volkswagen Golf (market share 10.4%) down<br />

to 1,083 registrations (market share 0.0028%) <strong>for</strong> the Ford Mustang Convertible.<br />

The mean and median number of registrations per model are 97,690 and 17,012,<br />

with a standard deviation of 305,002.<br />

Figure 1: Total stock of p<strong>as</strong>senger cars in Germany on January 01, 2003 by model.<br />

x i denotes the total number <strong>for</strong> model i (i = 1, . . . , 395).<br />

Figure 1 shows the total stock of p<strong>as</strong>senger cars by model, where x i denotes the<br />

total number <strong>for</strong> model i (i = 1, . . . , 395). The different models have been numbered<br />

according to their rank from most to le<strong>as</strong>t abundant. The most abundant<br />

one is the Volkswagen Golf/Bora (x 1 = 4, 025, 254, s 1 = 0.1043), second is Opel<br />

Astra (x 2 = 2, 097, 364, s 2 = 0.0544), and third is Opel Corsa (x 3 = 1, 521, 435,<br />

5 A detailed description of this data set is given in the appendix.<br />

14


s 3 = 0.0394). As the figure shows, the market leader h<strong>as</strong> almost twice the market<br />

share of the second placed, the third placed h<strong>as</strong> a relative share only about<br />

one third of the leader’s one, and the number of individual cars of each model<br />

go down very f<strong>as</strong>t with every less abundant model. The leading models together<br />

strongly dominate the whole allocation, with the top 15 models 6 (3.8% of all models)<br />

accounting already <strong>for</strong> more than half of the entire allocation (20,119,044<br />

registrations, or 52.1%). The other half of the allocation (18,468,641 registrations,<br />

or 47.9%) is shared by 381 models (96.2% of all models) with correspondingly<br />

much lower relative shares.<br />

Figure 2: Relative market share (s i ) <strong>for</strong> each model of p<strong>as</strong>senger car (i =<br />

1, . . . , 395) in Germany on January 01, 2003.<br />

The strong dominance of the more abundant models over the less abundant<br />

ones becomes apparent again in Figure 2 which shows the relative share (s i ) <strong>for</strong><br />

each model (i = 1, . . . , 395) on a logarithmic scale, where the different models<br />

have again been numbered according to their rank from most to le<strong>as</strong>t abundant.<br />

The negative slope of the curve indicates how the shares of less abundant models<br />

quickly drop by orders of magnitude. Overall, the curve displays a shape consistent<br />

with a log-normal distribution of shares (or absolute number of registrations) over<br />

rank.<br />

Table 2 reports <strong>for</strong> different values of the parameter α the expected <strong>surprise</strong><br />

H (Equation 7) when observing this allocation. It also reports the corresponding<br />

indices ν of effective diversity (Equation 9) and evenness η (Equation 11). Taking<br />

pure number Shannon Herfindahl CR1<br />

α = 0 α = 1 α = 2 α = +∞<br />

expected <strong>surprise</strong> Hα 395 (s) 5.98 4.44 3.61 2.26<br />

effective diversity να 395 (s) 395 84.94 36.84 9.59<br />

evenness index ηα 395 (s) 1.00 0.22 0.09 0.02<br />

Table 2: Pure and effective diversity of the allocation s of p<strong>as</strong>senger cars in Germany<br />

on January 01, 2001 <strong>for</strong> different values of the parameter α. H n α is the generalized<br />

entropy (Equation 7), ν n α is the corresponding numbers-equivalent me<strong>as</strong>ure<br />

of effective diversity (Equation 9), η n α is the corresponding evenness index (Equation<br />

11).<br />

α = 0 reproduces the pure diversity (n = 395) of the allocation. With α = 1<br />

6 The top 15 models are (in this sequence): Volkswagen Golf/Bora, Opel Astra, Opel Corsa,<br />

Volkswagen P<strong>as</strong>sat, BMW 3-series, Volkswagen Polo, Audi A4/S4, Opel Vectra, Mercedes C-<br />

cl<strong>as</strong>s, Ford Fiesta/Fusion, Mercedes E-cl<strong>as</strong>s, Ford Escort, Ford Mondeo, BMW 5-series, Audi<br />

A6/S6.<br />

15


one obtains the Shannon entropy, H1 395 (s) = 4.44, or the corresponding numbersequivalent,<br />

ν1 395 (s) = 84.94. By this number the effective diversity of the allocation<br />

is less than one quarter (η1 395 (s) = 0.22) of its pure diversity, due to the unevenness<br />

of the distribution of market shares. It is this number that describes the subjective<br />

impression of diversity (in the sense of expected <strong>surprise</strong>) that an observer<br />

experiences when sequentially observing the allocation. Taking α = 2 yields the<br />

Hirschman-Herfindahl index values H2 395 (s) = 3.61 and ν2 395 (s) = 36.84, which indicates<br />

that the effective diversity of the allocation is less than 10% (η2 395 (s) = 0.09)<br />

of its pure diversity, due to the unevenness of the distribution of market shares.<br />

Obviously, a higher value of α places more weight on how uneven distribution<br />

diminishes diversity. As a consequence, the Hirschman-Herfindahl values indicate<br />

a lower effective diversity than the Shannon values. In number terms, going from<br />

α = 1 to α = 2 means a drop in effective diversity index by more than one half.<br />

Figure 3: Effective product diversity να<br />

395 (s) (Equation 9) of the allocation s of<br />

p<strong>as</strong>senger cars in Germany on January 01, 2001 <strong>as</strong> a function of the parameter α.<br />

Figure 3 shows how να<br />

395 (s) depends on the parameter value α. As α approaches<br />

infinity, να<br />

395 (s) approaches the inverse market share of the leading model (Volkswagen<br />

Golf/Bora, 10.4%), ν+∞(s) 342 = 11.7. This index of effective diversity only<br />

takes into account the dominance of the leading product variety over all others.<br />

It can be interpreted <strong>as</strong> the number of equally abundant (hypothetical) product<br />

varieties that would cause the same expected <strong>surprise</strong> than the actual heterogenous<br />

allocation with 395 different product varieties.<br />

5 Conclusion<br />

<strong>Diversity</strong> causes <strong>surprise</strong>. I have proposed to take this everyday experience <strong>as</strong> a<br />

guideline <strong>for</strong> an economically meaningful definition and me<strong>as</strong>urement of effective<br />

diversity. Expected <strong>surprise</strong> may be quantified by in<strong>for</strong>mation theoretic tools <strong>as</strong> the<br />

expected in<strong>for</strong>mation gain from an observation of an actual economic allocation.<br />

I have suggested to use Rényi’s (1961) generalized entropy (Equation 7), or the<br />

corresponding numbers-equivalent (Equation 9), <strong>as</strong> such a me<strong>as</strong>ure of effective<br />

diversity. The effective diversity of an actual aggregate allocation incre<strong>as</strong>es with<br />

the pure number of different product varieties and the evenness of the distribution<br />

of their relative shares in the allocation. In general, when the different product<br />

varieties are unevenly distributed the effective diversity of that allocation will be<br />

smaller than its pure diversity. Thus, market concentration effectively reduces<br />

diversity below its pure value. I have illustrated this concept in an empirical<br />

analysis of the diversity of p<strong>as</strong>senger cars in Germany.<br />

The conclusion from this analysis is that the effective diversity <strong>as</strong> experienced<br />

by an observer may be considerably smaller than the underlying pure diversity.<br />

The exact quantitative amount by which pure and effective diversity differ because<br />

16


of unevenly distributed market shares depends on how much weight is given to<br />

pure diversity and how much to evenness when calculating the index of effective<br />

diversity.<br />

When deriving the entropy me<strong>as</strong>ure of effective diversity I have made the simplifying<br />

<strong>as</strong>sumption that all product varieties are pairwise equally dissimilar. Thus<br />

the pure diversity of an allocation is simply the number of different product varieties.<br />

Effective diversity me<strong>as</strong>ures by how much that number is diminished due to<br />

the uneven distribution of relative shares of product varieties. In future research<br />

one could apply the idea of diversity <strong>as</strong> a <strong>potential</strong> <strong>for</strong> <strong>surprise</strong> to me<strong>as</strong>ures of pure<br />

diversity which are b<strong>as</strong>ed on pairwise dissimilarity between product varieties, such<br />

<strong>as</strong> those of Weitzman (1992, 1998), Gans and Hill (1997), Bernhofen (2001) and<br />

Nehring and Puppe (2002).<br />

Another line of research could address the question of what are the benefits and<br />

costs of effective (<strong>as</strong> opposed to pure) product diversity <strong>for</strong> economic agents who<br />

reach consumption decisions over time and under uncertainty. On the one hand,<br />

higher diversity generally allows consumers to reach a higher utility level <strong>as</strong> the<br />

range of choice is expanded or consumers may have preferences <strong>for</strong> diversity per<br />

se. On the other hand, with higher diversity also the costs of making consumption<br />

decisions are higher. For example, these costs may be search costs due to the time<br />

spent when identifying the different available product options or finding dealers<br />

that offer the preferred variety. The in<strong>for</strong>mation theoretic me<strong>as</strong>ure of effective<br />

diversity presented here can be used to quantify the value of in<strong>for</strong>mation gained<br />

from observing the market allocation and thus the decision costs <strong>as</strong>sociated with<br />

a given level of pure diversity.<br />

Economists have been analyzing product diversity <strong>for</strong> quite some time now.<br />

Most of the conceptions of diversity used so far are static and <strong>as</strong>sume complete<br />

knowledge of all the options at hand. As the economic relevance of diversity<br />

ultimately stems from uncertainty over the future it seems logical to now move<br />

on, and address the role of economic diversity in a dynamic context and under<br />

uncertainty. The in<strong>for</strong>mation theoretic approach presented here may be a fruitful<br />

starting point.<br />

References<br />

[1] Aczél, J. and Z. Daróczy (1975). On Me<strong>as</strong>ures of In<strong>for</strong>mation and their Characterizations.<br />

Academic Press, New York.<br />

[2] Bernhofen, D.M. (2001). ‘Me<strong>as</strong>uring diversity in product space: the missing<br />

axiom’, Economics Letters 70: 69-72.<br />

[3] Berry, C.H. (1975). Corporate Growth and Diversification. Princeton University<br />

Press, Princeton.<br />

17


[4] Blackorby, C., D. Donaldson and J.A. Weymark (1982). ‘A normative approach<br />

to industrial-per<strong>for</strong>mance evaluation and concentration indices’, European<br />

Economic Review 19: 89-121.<br />

[5] Chakravarty, S.R. (1988). ‘On the separable industry per<strong>for</strong>mance evaluation<br />

function’, Economics Letters 27: 179-182.<br />

[6] Chakravarty, S.R. and J.A. Weymark (1988). ‘Axiomatizations of the entropy<br />

numbers equivalent index of industrial concentration’, in: W. Eichhorn (ed.),<br />

Me<strong>as</strong>urement in Economics. Theory and Applications of Economic Indices,<br />

Physica-Verlag, Heidelberg, pp. 383-397.<br />

[7] Dekel, E., B.L. Lipman and A. Rustichini (1998). ‘Recent developments in<br />

modeling of un<strong>for</strong>eseen contingencies’, European Economic Revue 42: 523-<br />

542.<br />

[8] Dixit, A.K. and V. Norman (1980). Theory of International Trade. Cambridge<br />

University Press, Cambridge.<br />

[9] Dixit, A.K. and J.E. Stiglitz (1977). Monopolistic competition and optimum<br />

product diversity. American Economic Review 67(3): 297-308.<br />

[10] Eichberger, J. and I.R. Harper (1997). Financial Economics. Ox<strong>for</strong>d University<br />

Press, Ox<strong>for</strong>d.<br />

[11] Ethier, W. (1982). ‘National and international returns to scale in the modern<br />

theory of international trade’, American Economic Review 72: 389-405<br />

[12] Faber, M., R. Manstetten and J.L.R. Proops (1992). ‘Humankind and the<br />

environment: an anatomy of <strong>surprise</strong> and ignorance’, Environmental Values<br />

1: 217-241.<br />

[13] Foster, J.E. and A.A. Shneyerov (1999). ‘A general cl<strong>as</strong>s of additively decomposable<br />

inequality me<strong>as</strong>ures’, Economic Theory 14: 89-111.<br />

[14] Gans, J. and R. Hill (1997). ‘Me<strong>as</strong>uring product diversity’, Economics Letters<br />

55: 145-150.<br />

[15] Gehrig, W. (1988). ‘On the Shannon-Theil concentration me<strong>as</strong>ure and its<br />

characterizations’, in: W. Eichhorn (ed.), Me<strong>as</strong>urement in Economics. Theory<br />

and Applications of Economic Indices, Physica-Verlag, Heidelberg, pp. 413-<br />

427.<br />

[16] Hannah, L. and J.A. Kay (1977). Concentration in Modern Industry. Theory,<br />

Me<strong>as</strong>urement and the U.K. Experience. Macmillan London.<br />

[17] Helpman, E. and P.R. Krugman (1985). Market Structure and Foreign Trade:<br />

Incre<strong>as</strong>ing Returns, Imperfect Competition, and the International Economy.<br />

Cambridge University Press, Cambridge.<br />

18


[18] Herfindahl, O. (1950). Concentration in the Steel Industry. Unpublished Ph.D.<br />

Dissertation, Columbia University.<br />

[19] Hirschman, A.O. (1945). National Power and the Structure of Foreign Trade.<br />

University of Cali<strong>for</strong>nia Press, Berkeley and Los Angeles.<br />

[20] Hirschman, A.O. (1964). ‘The paternity of an index’, American Economic<br />

Revue 54: 761.<br />

[21] Jacquemin, A.P. and C.H. Berry (1979). ‘Entropy me<strong>as</strong>ure of diversification<br />

and corporate growth’, Journal of Industrial Economics 27: 359-369.<br />

[22] Khinchin, A.I. (1957). Mathematical Foundations of In<strong>for</strong>mation Theory.<br />

Dover, New York.<br />

[23] Kraftfahrt-Bundesamt (2004). ‘Bestand an Personenkraftwagen am 1. Januar<br />

2003 nach Segmenten und Typgruppen’, Press Rele<strong>as</strong>e, January 2004,<br />

Kraftfahrt-Bundesamt (German Federal Office of Motor Vehicles), Flensburg.<br />

[24] Krippendorff, K. (1986). In<strong>for</strong>mation Theory. Structural Models <strong>for</strong> Qualitative<br />

Data. Sage, Beverly Hills.<br />

[25] Lanc<strong>as</strong>ter, K.J. (1966). ‘A new approach to consumer theory’, Journal of<br />

Political Economy 74: 132-157.<br />

[26] Matutinović, I. (2001). ‘The <strong>as</strong>pects and the role of diversity in socio-economic<br />

systeme: an evolutionary perspective’, Ecological Economics 39: 239-256.<br />

[27] Mowery, D.C. and N. Rosenberg (1998). Paths of Innovation. Technological<br />

Change in 20th-Century America. Cambridge University Press, Cambridge.<br />

[28] Nehring, K. and C. Puppe (2002). ‘A theory of diversity’, Econometrica 70:<br />

1155-1198.<br />

[29] Puppe, C. (1996). ‘An axiomatic approach to ”preference <strong>for</strong> freedom of<br />

choice”’, Journal of Economic Theory 68: 174-199.<br />

[30] Rényi, A. (1961). ’On me<strong>as</strong>ures of entropy and in<strong>for</strong>mation’, in: J. Neyman<br />

(ed.), Proceedings of the Fourth Berkeley Symposium on Mathematical<br />

Statistics and Probability, Vol. I, University of Cali<strong>for</strong>nia Press, Berkeley, pp.<br />

547-561.<br />

[31] Romer, P.M. (1987). ‘Growth b<strong>as</strong>ed on incre<strong>as</strong>ing returns due to specialization’,<br />

American Economic Review 77(2): 56-62.<br />

[32] Romer, P.M. (1990). ‘Endogenous Technological Change’, Journal of Political<br />

Economy 98(5, part II): S71-S102.<br />

19


[33] Schiller, J. (2001). Umweltprobleme und Zeit. Bestände als konzeptionelle<br />

Grundlage ökologischer Ökonomik. Ph.D. Dissertation, University of Heidelberg.<br />

[34] Sen, A. (1988). ‘Freedom of choice: concept and content’, European Economic<br />

Review 32: 269-294.<br />

[35] Shackle, G.L.S. (1955). Uncertainty in Economics and Other Reflections.<br />

Cambridge University Press, Cambridge.<br />

[36] Shannon, C.E. (1948). ‘A mathematical theory of communication’, Bell System<br />

Technical Journal 27: 379-423 and 623-656.<br />

[37] Shepherd, W.G. (1987a). ‘Concentration ratios’, in: J.M. Eatwell, M. Milgate<br />

and R. Newman (eds), The New Palgrave. A Dictionary of Economics, Vol.<br />

1, Macmillan, London, pp. 563-564.<br />

[38] Shepherd, W.G. (1987b). ‘Herfindahl index’, in: J.M. Eatwell, M. Milgate<br />

and R. Newman (eds), The New Palgrave. A Dictionary of Economics, Vol.<br />

2, Macmillan, London, p. 639.<br />

[39] Spence, A.M. (1976). ‘Product selection, fixed costs, and monopolistic competition’,<br />

Review of Economic Studies 43: 217-236.<br />

[40] Theil, H. (1967). Economics and In<strong>for</strong>mation Theory. North-Holland, Amsterdam.<br />

[41] Weikard, H.-P. (1999). Wahlfreiheit für zukünftige Generationen. Neue<br />

Grundlagen für eine Ressourcenökonomik. Metropolis, Marburg.<br />

[42] Weitzman, M. (1992). ‘On diversity’, Quarterly Journal of Economics 107:<br />

363-405.<br />

[43] Weitzman, M. (1998). ‘The Noah’s Ark problem’, Econometrica 66: 1279-<br />

1298.<br />

Appendix: Data used in Section 4<br />

The following table reports the data on the stock of p<strong>as</strong>senger cars in Germany<br />

on January 01, 2003 that have been used in Section 4. The data set h<strong>as</strong> been<br />

<strong>as</strong>sembled from an official report published by the German Federal Office of Motor<br />

Vehicles (Kraftfahrt-Bundesamt 2004). The data reported there have been<br />

calculated by an inventory-update-method from the monthly number of new registrations,<br />

transcriptions, and deregistrations since January 01, 1990. The report<br />

explicitly lists only models with a minimum number of 1,000 cars registered at the<br />

end of period date (January 01, 2003). Models with fewer registrations, and registrations<br />

<strong>for</strong> which the vehicle identification number (“FIN”) could not properly be<br />

20


processed, are grouped together <strong>as</strong> “others”. Because of the space constraint only<br />

a part of the entire data set is reproduced here. The full data set can be obtained<br />

from the author upon request.<br />

rank make model registrations market share<br />

i x i s i [%]<br />

1 Volkswagen Golf/Bora 4,025,254 10.4315<br />

2 Opel Astra 2,097,364 5.4353<br />

3 Opel Corsa 1,521,435 3.9428<br />

4 Volkswagen P<strong>as</strong>sat 1,521,385 3.9427<br />

5 BMW 3-series 1,339,196 3.4705<br />

6 Volkswagen Polo 1,325,186 3.3368<br />

7 Audi A4/S4 1,287,576 3.0477<br />

8 Opel Vectra 1,176,037 3.0477<br />

9 Mercedes C-cl<strong>as</strong>s 1,023,111 2.6514<br />

10 Ford Fiesta/Fusion 993,508 2.5747<br />

11 Mercedes E-cl<strong>as</strong>s 937,928 2.4306<br />

12 Ford Escort 831,033 2.1536<br />

13 Ford Mondeo 789,338 2.0456<br />

14 BMW 5-series 682,132 1.7678<br />

15 Audi A6/S6 568,561 1.4734<br />

. .<br />

.<br />

.<br />

.<br />

66 Skoda Fabia 100,533 0.2605<br />

67 Skoda Felicia 98,418 0.2551<br />

68 Ford Scorpio 97,010 0.2514<br />

69 Citroen AX 96,270 0.2495<br />

. .<br />

.<br />

.<br />

.<br />

196 Isuzu Trooper/Monterey 17,402 0.0451<br />

197 Volvo 80 17,138 0.0444<br />

198 Alfa Romeo Spider 17,012 0.0441<br />

199 Suzuki Samurai 16,876 0.0437<br />

200 Toyota Picnic 16,829 0.0436<br />

. .<br />

.<br />

.<br />

.<br />

389 Chrysler Le Baron 1,123 0.0029<br />

390 Lada 110, 111, 112 1,121 0.0029<br />

391 Honda Integra 1,114 0.0029<br />

392 Toyota P<strong>as</strong>eo 1,110 0.0029<br />

393 Citroen Visa 1,108 0.0029<br />

394 Ferrari F355 1,089 0.0028<br />

395 Ford Mustang (convertible) 1,083 0.0028<br />

Subtotal 38,587,685 100.0000<br />

Others Others 6,069,618<br />

Total 44,657,303<br />

21

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!