19.05.2014 Views

Testing Gateway Theory: do cigarette prices affect illicit drug use?

Testing Gateway Theory: do cigarette prices affect illicit drug use?

Testing Gateway Theory: do cigarette prices affect illicit drug use?

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Journal of Health Economics 21 (2002) 679–698<br />

<strong>Testing</strong> <strong>Gateway</strong> <strong>Theory</strong>: <strong>do</strong> <strong>cigarette</strong> <strong>prices</strong> <strong>affect</strong><br />

<strong>illicit</strong> <strong>drug</strong> <strong>use</strong>?<br />

Michael Beenstock a,∗ , Giora Rahav b<br />

a Department of Economics, Hebrew University of Jerusalem, Jerusalem, Israel<br />

b Department of Sociology, Tel Aviv University, Tel Aviv, Israel<br />

Received 18 February 2000; received in revised form 26 July 2001; accepted 7 February 2002<br />

Abstract<br />

We test the causal <strong>Gateway</strong> <strong>Theory</strong> of <strong>drug</strong> <strong>use</strong> dynamics by way of a natural experiment. We<br />

ran<strong>do</strong>mize <strong>cigarette</strong> smoking by birth cohort and <strong>cigarette</strong> <strong>prices</strong>. We <strong>use</strong> data for Israel to show that<br />

while <strong>cigarette</strong> smoking ca<strong>use</strong>s cannabis <strong>use</strong>, the evidence that cannabis <strong>use</strong> ca<strong>use</strong>s hard <strong>drug</strong> <strong>use</strong> is<br />

much weaker. These results are based on various econometric metho<strong>do</strong>logies including two-stage<br />

logit (2SL), bivariate probit, and frailty analysis for survival data. © 2002 Elsevier Science B.V. All<br />

rights reserved.<br />

Keywords: <strong>Gateway</strong> <strong>Theory</strong>; Drug addiction; Treatment effects; Natural experimentation<br />

1. Introduction<br />

The economic analysis of <strong>illicit</strong> <strong>drug</strong>s has quite a long history (e.g. Rottenberg, 1968;<br />

Nisbet and Vakil, 1972; Moore, 1973, and White and Lusketich, 1983). However, following<br />

the pioneering work of Becker and Murphy (1988) economists have become increasingly<br />

interested in addictive consumer behaviour. Much of this interest has focussed upon the determinants<br />

of <strong>cigarette</strong> smoking (e.g. Chaloupka, 1991), alcohol (e.g. Mullahy and Sindelar,<br />

1993), gambling and <strong>illicit</strong> <strong>drug</strong>s (e.g. Grossman and Chaloupka, 1999). Economists have<br />

also contributed to the policy debate concerning whether <strong>drug</strong>s should be legalized or decriminalized<br />

(Prinz, 1997; Frey, 1997), and they have made their contribution to the interface<br />

between <strong>drug</strong>s and crime (e.g. Model, 1991 and Miron and Zweibel, 1991, 1995). In this<br />

paper, we open a new research front for economists, namely econometric investigation<br />

of <strong>Gateway</strong> <strong>Theory</strong> which until now has been monopolized by non-economists, including<br />

epidemiologists, sociologists, psychiatrists and others.<br />

∗ Corresponding author. Tel.: +972-2-5883-120; fax: +972-2-581-6071.<br />

E-mail address: msbin@mscc.huji.ac.il (M. Beenstock).<br />

0167-6296/02/$ – see front matter © 2002 Elsevier Science B.V. All rights reserved.<br />

PII: S0167-6296(02)00009-7


680 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

<strong>Gateway</strong> <strong>Theory</strong> was originally developed in the 1970s by Kandel (1975). She observed<br />

that there is a systematic sequencing in the <strong>use</strong> of psychoactive substances which runs from<br />

alcohol and <strong>cigarette</strong>s, then to cannabis, and finally to “hard” <strong>drug</strong>s such as cocaine, heroin<br />

and LSD. Cigarettes are a “gateway” to cannabis, which in turn is a “gateway” to hard <strong>drug</strong>s.<br />

Of course not all <strong>cigarette</strong> smokers go on to <strong>use</strong> cannabis, and it should be noted that not<br />

all cannabis consumers first smoked <strong>cigarette</strong>s. Nevertheless, <strong>cigarette</strong> smokers are more<br />

likely to <strong>use</strong> cannabis subsequently than non-smokers. A similar juxtaposition applies to<br />

cannabis and hard <strong>drug</strong>s; cannabis <strong>use</strong>rs are more likely to <strong>use</strong> hard <strong>drug</strong>s eventually than<br />

non-<strong>use</strong>rs of cannabis, but not all hard <strong>drug</strong> consumers <strong>use</strong>d cannabis first.<br />

The scientific literature on <strong>Gateway</strong> <strong>Theory</strong> is too vast to be reviewed here. However,<br />

it splits into two broad camps. One camp (e.g. O’Donnell and Clayton, 1982) regards the<br />

gateway effect to be causal or generative. According to this view <strong>cigarette</strong> smoking induces<br />

cannabis <strong>use</strong>, and cannabis <strong>use</strong> induces hard <strong>drug</strong> consumption. This implies that if smoking<br />

were restricted there would be less <strong>use</strong> of cannabis. It also implies that if cannabis were<br />

legalized there would be more <strong>use</strong> of hard <strong>drug</strong>s. It is obvious how this causal interpretation<br />

of <strong>Gateway</strong> <strong>Theory</strong> has been grist to the mill of the anti liberalization lobby. The rival camp<br />

(e.g. Baumrind, 1983) views the gateway effect to be merely predictive or even descriptive.<br />

Economists will recognize this as Granger causality; due to systematic sequencing <strong>cigarette</strong><br />

consumption may help predict cannabis consumption, and cannabis consumption may help<br />

predict hard <strong>drug</strong> consumption. However, Granger causality <strong>do</strong>es not imply causality itself<br />

and has no implications for policy.<br />

The longstanding debate about <strong>Gateway</strong> <strong>Theory</strong> revolves around the identification problem.<br />

Does the fact that <strong>cigarette</strong> smokers are more likely to go onto <strong>use</strong> cannabis result<br />

from unobserved heterogeneity, i.e. people with a greater susceptibility to smoke <strong>cigarette</strong>s<br />

also have a greater susceptibility to consume cannabis, or <strong>do</strong>es it result from a treatment<br />

effect, i.e. exposure to <strong>cigarette</strong>s (the treatment) induces cannabis <strong>use</strong> (the outcome)? The<br />

vast number of empirical papers on <strong>Gateway</strong> <strong>Theory</strong> have not resolved the identification<br />

problem.<br />

One way to resolve this identification problem is to apply the metho<strong>do</strong>logy of natural<br />

experimentation, which in the present context seeks to ran<strong>do</strong>mize <strong>cigarette</strong> smoking so that<br />

its causal effect on subsequent <strong>use</strong> of cannabis is identified. We <strong>use</strong> the data for Israel to<br />

apply the metho<strong>do</strong>logy, and follow Evans and Ringel (1999) who study the effect of smoking<br />

on birth weights by using <strong>cigarette</strong> price data to ran<strong>do</strong>mize smoking behavior. They <strong>use</strong><br />

cross section data and ran<strong>do</strong>mize <strong>cigarette</strong> smoking by exploiting differential tax rates on<br />

<strong>cigarette</strong>s in the US. Pacula (1998) too <strong>use</strong>s cross section data to identify the gateway effect<br />

of alcohol by exploiting differential tax rates on alcohol in the US.<br />

Natural experiments in econometrics usually exploit cross section differences in instrumental<br />

variables. A metho<strong>do</strong>logical innovation in our approach to natural experimentation<br />

consists of exploiting time series data rather than cross section data. We <strong>do</strong> so beca<strong>use</strong><br />

there are no cross sectional or geographical differences in <strong>cigarette</strong> <strong>prices</strong> (or other potential<br />

instruments) in Israel. However, there has been extensive variation in the real price of<br />

<strong>cigarette</strong>s over time. People who grew up when <strong>cigarette</strong>s were cheap are more likely to<br />

smoke than people who grew up when they were expensive. Beca<strong>use</strong> individuals <strong>do</strong> not<br />

chose their year of birth or the price of <strong>cigarette</strong>s and other variables we have the basis of<br />

a natural experiment.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 681<br />

The remainder of the paper is organized as follows. In Section 2, we describe the metho<strong>do</strong>logy<br />

that we <strong>use</strong> for testing <strong>Gateway</strong> <strong>Theory</strong>. In Section 3 we describe the data which come<br />

from nationwide surveys of Jews in Israel conducted in 1989, 1992 and 1995. In Section 4,<br />

we perform two tests of <strong>Gateway</strong> <strong>Theory</strong>. In the first test we focus on sequencing and investigate<br />

whether smoking eventually induces cannabis <strong>use</strong>, and whether the latter eventually<br />

induces <strong>use</strong> of hard <strong>drug</strong>s. In the second, we focus on timing and investigate whether earlier<br />

initiation of smoking induces earlier initiation of cannabis <strong>use</strong>. The latter attaches importance<br />

to timing of the gateway effect, whereas the former <strong>do</strong>es not. Our main conclusions<br />

for Israel are that <strong>cigarette</strong> smoking causally increases the likelihood of subsequently using<br />

cannabis, but cannabis <strong>use</strong> <strong>do</strong>es not causally increase the likelihood of subsequently using<br />

hard <strong>drug</strong>s.<br />

2. Metho<strong>do</strong>logy<br />

2.1. Linear probability<br />

The first hypothesis of interest is <strong>do</strong>es <strong>cigarette</strong> smoking increase the probability of<br />

subsequently using cannabis? Define C n = 1, if individual n smoked <strong>cigarette</strong>s and 0<br />

otherwise, and S n = 1, if individual n subsequently <strong>use</strong>d cannabis and 0 otherwise. We<br />

exclude those who consumed cannabis either prior to smoking <strong>cigarette</strong>s or who did so<br />

without smoking. These exceptions have troubled gateway theorists, but they are not relevant<br />

for testing the first hypothesis, which <strong>do</strong>es not claim that <strong>cigarette</strong>s are a precondition for<br />

cannabis.<br />

Although in Section 4 we <strong>do</strong> not apply the linear probability model, we <strong>use</strong> it here<br />

to illustrate the metho<strong>do</strong>logical issues involved. We <strong>use</strong> time subscripts (t) to indicate<br />

sequencing of <strong>drug</strong> <strong>use</strong>, and to remind us, e.g. that C occurs before S. <strong>Gateway</strong> <strong>Theory</strong><br />

suggests:<br />

S nt = αX nt + βC n(t−1) + γ y D y + u nt (1)<br />

where X is a vector of controls including personal characteristics, D y the birth cohort indicator<br />

for those born in year y, and u an unobserved error term. <strong>Gateway</strong> <strong>Theory</strong> hypothesizes<br />

that β>0. By definition C t−1 is predetermined beca<strong>use</strong> it occurs prior to S t . However,<br />

this <strong>do</strong>es not help identify the treatment effect of C upon S beca<strong>use</strong> C and u are likely to be<br />

positively correlated. This arises from the auxiliary model for smoking:<br />

C n(t−1) = φZ n(t−1) + θ y D y + v n(t−1) (2)<br />

where Z is a vector of controls determining smoking and v denotes the unobserved heterogeneity<br />

in smoking. If people with a natural susceptibility to <strong>cigarette</strong>s are also more<br />

susceptible to cannabis, then E(u t v t−1 )>0, in which case estimates of β will be biased<br />

upwards. If in this case, the estimate of β is positive and statistically significant when in<br />

reality β = 0, there is Granger causality but no genuine or generative causality. Prior knowledge<br />

of C t−1 may be <strong>use</strong>d to predict S t . If in addition the estimate of β is not statistically<br />

significant, there is neither Granger causality nor generative causality.


682 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

The solution to this problem is to instrument C t−1 in Eq. (1). No <strong>do</strong>ubt Z and X largely<br />

overlap. An element of Z is the price of <strong>cigarette</strong>s (and other variables discussed in Section 4)<br />

that prevailed when individual n was growing up (P n(t−1) ), which are hypothesized to<br />

determine the demand for <strong>cigarette</strong>s but not the demand for cannabis. We expect C t−1 to<br />

vary inversely with P t−1 . There is no reason to suspect that P t−1 and v t−1 are correlated<br />

(nor P t−1 and u t ). If S t <strong>do</strong>es not depend upon P t−1 , then P t−1 identifies the effect of C t−1<br />

upon S t in Eq. (1).<br />

The next step in the gateway chain concerns the relationship between cannabis and<br />

hard <strong>drug</strong>s. The hypothesis of interest here is <strong>do</strong>es exposure to cannabis induce a higher<br />

probability of subsequently using hard <strong>drug</strong>s? We denote H n = 1 if individual n <strong>use</strong>d hard<br />

<strong>drug</strong>s after using cannabis and 0 otherwise. We express this as follows:<br />

H n(t+1) = λY n(t+1) + δS nt + ρ y D y + w n(t+1) (3)<br />

where Y is a vector of controls for hard <strong>drug</strong> <strong>use</strong>. <strong>Gateway</strong> <strong>Theory</strong> suggests δ>0. Since<br />

we suspect that E(wu) >0 unobserved heterogeneity is likely to bias upwards estimates<br />

of δ. Ideally, we seek exclusion restrictions for Y and X such as <strong>drug</strong> price data, which are<br />

not available in Israel. Instead, we suggest using as an instrument for S t in Eq. (3) its fitted<br />

value from IV estimation of Eq. (1). We refer to this as “<strong>do</strong>mino” identification beca<strong>use</strong><br />

the natural experiment is indirect. Clearly, such indirect identification weakens the power<br />

of tests on the value of δ. Ran<strong>do</strong>m exposure to <strong>cigarette</strong>s at time t − 1 induces cannabis<br />

consumption at time t, which in turn, induces hard <strong>drug</strong> <strong>use</strong> at time t + 1. If β and δ are<br />

properly identified, and there is a causal gateway effect, we can expect a chain reaction<br />

where raising the price of <strong>cigarette</strong>s will reduce <strong>cigarette</strong> smoking, which will reduce the<br />

subsequent <strong>use</strong> of cannabis, which, in turn, will reduce the subsequent <strong>use</strong> of hard <strong>drug</strong>s.<br />

Note that Eqs. (1)–(3) control for cohort effects. This is possible beca<strong>use</strong> we <strong>use</strong> several<br />

surveys in Section 4 (y = year of survey − age). Had there been only one survey it would<br />

not have been possible to identify the separate effects of P t−1 and birth cohort upon <strong>drug</strong><br />

consumption.<br />

2.2. Recursive bivariate probit and two-stage procedures<br />

If u and v in Eqs. (1) and (2) happen to be bivariate normal with cov(u, v) = ρ then<br />

the model may be estimated by maximum likelihood as a recursive bivariate probit (RBP)<br />

model (Maddala, 1983, pp. 122–123, Greene, 2000, pp. 852–825). Note that in this case C in<br />

Eq. (1) <strong>do</strong>es not have to be instrumented beca<strong>use</strong>, in contrast to the linear probability model,<br />

it is not estimated by least squares. However, Maddala points out that identification requires<br />

that X omit covariates in Z.IfX = Z the model is not identified, even parametrically.<br />

Maddala suggests that an alternative procedure is to specify prob (C ∗ > 0) in Eq. (1)<br />

instead of C, where C ∗ denotes the underlying latent variable that measures the propensity<br />

to smoke <strong>cigarette</strong>s. In this case, it may be shown that a two-stage procedure provides<br />

consistent estimates of the parameters in Eqs. (1) and (2). In the first stage, Eq. (2) is<br />

estimated by probit or logit, and in the second stage the predicted probability of C obtained<br />

from the first stage is <strong>use</strong>d to replace C in Eq. (1).<br />

There are advantages and disadvantages to both procedures. The main disadvantage of<br />

RBP is that it may be sensitive to parametric assumptions about the unobserved heterogeneity.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 683<br />

If u and v <strong>do</strong> not happen to be bivariate normal then estimates of β in Eq. (1) will be biased<br />

and inconsistent (unless ρ = 0). This problem <strong>do</strong>es not arise in the two-stage procedure<br />

beca<strong>use</strong> it <strong>do</strong>es not require estimates of ρ. The disadvantage of the two-stage procedure is<br />

that although it is consistent, it is not efficient. By contrast, RBP estimates are consistent<br />

and efficient provided that u and v happen to be bivariate normal.<br />

In Section 4, we <strong>use</strong> both procedures. 1 However, the main results that we report are for<br />

two-stage logit (2SL), mainly beca<strong>use</strong> it is less parametric, and is therefore less sensitive<br />

to mis-specification error. A disadvantage of the two-stage procedure is that the standard<br />

errors of the parameters are difficult to calculate (Maddala, 1983, p. 247). In Section 4, we<br />

<strong>use</strong> a bootstrap procedure to calculate them.<br />

2.3. Hazard analysis<br />

In this section, we focus on the timing of events rather than their sequencing. The specific<br />

questions we ask are whether earlier initiation of <strong>cigarette</strong> smoking ca<strong>use</strong>s earlier initiation<br />

of cannabis, and whether the latter ca<strong>use</strong>s earlier initiation of hard <strong>drug</strong>s. Elsewhere<br />

(Beenstock and Rahav, 2001), we suggest the <strong>use</strong> of long term survivor models to specify<br />

the initiation hazard since <strong>drug</strong> <strong>use</strong> is a minority activity. This is also the approach <strong>use</strong>d<br />

by Douglas and Hariharan (1994). However, this approach is not practical in the current,<br />

more complicated, context in which our objective is to identify treatment effects. Instead,<br />

we model <strong>drug</strong> <strong>use</strong> initiation using Cox’s proportional hazards model (CPHM), which has<br />

the advantage of not necessarily implying that it is a matter of time before everyone <strong>use</strong>s<br />

<strong>drug</strong>s. 2<br />

The counterpart of Eq. (1) for individual n’s hazard of using cannabis ( s ) is:<br />

λ s (t n ) = λ s0 (t n ) exp − (X n α + βA cn + γ y D yn ) (4)<br />

where A cn denotes the age at which individual n first smoked <strong>cigarette</strong>s, X and D are defined<br />

as in Eq. (1), and s0 is the “baseline” hazard. We denote the age of cannabis initiation by<br />

A s >A c by definition. According to <strong>Gateway</strong> <strong>Theory</strong> β > 0, since people who begin<br />

smoking later will initiate cannabis later. Unobserved heterogeneity is likely to generate<br />

positive covariance between A s and A c , which will bias upwards estimates of β. We suggest<br />

that instead of using A c in Eq. (4) it should be replaced by its expected value as determined<br />

from a CPHM for <strong>cigarette</strong> smoking. The specification of the hazard for <strong>cigarette</strong> smoking<br />

(λ c ) parallels Eq. (2) and is written as follows:<br />

λ c (t n ) = λ c0 (t n ) exp − (Z n φ + θ y D yn ) (5)<br />

where Z is defined as in Eq. (2). Eq. (5) implies that expected age at <strong>cigarette</strong> initiation is:<br />

E(A cn ) = exp[−Λ c (t) exp(Z n φ + θ y D yn )] (6)<br />

where Λ c denotes the integrated hazard function for smoking evaluated at the mean. We<br />

suggest a two-stage procedure in which Eq. (5) is estimated in the first stage and the solution<br />

1 Evans and Ringel (1999) in a similar situation to ours, side-step these metho<strong>do</strong>logical issues.<br />

2 Larson and Dinse (1985) propose a long term survivor model for CPHM which is <strong>use</strong>d in Beenstock and Rahav<br />

(2001).


684 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

to Eq. (6) is <strong>use</strong>d to replace A c in Eq. (4) in the second stage. The parameter standard errors in<br />

the second stage are bootstrapped. Having thus estimated Eq. (4), we may <strong>use</strong> it to calculate<br />

E(A s ) and to replace A s in a CPHM for hard <strong>drug</strong>s.<br />

In the bio-statistical literature this class of dependent hazard model is known as a “frailty”<br />

model (Hougaard, 1995; Klein and Moeschberger, 1997). Frailty models usually consist of<br />

two hazards functions that happen to be stochastically related. If u denotes the stochastic<br />

component of the empirical hazard function for <strong>cigarette</strong> initiation, and v denotes its counterpart<br />

for cannabis initiation, then frailty models estimate the parameter ρ = cov(u, v).<br />

However, they <strong>do</strong> not usually specify A c in the hazard function for cannabis. Here, we<br />

specify the dependence between the two hazard functions directly via β in Eq. (1), as in the<br />

two-stage procedure that we have described. A more general specification would estimate<br />

both ρ as well as β in Eq. (4), i.e. frailty may be transmitted directly via β and indirectly<br />

via ρ. 3<br />

3. The data<br />

3.1. The surveys<br />

The data to be <strong>use</strong>d in the present analysis are from three epidemiological surveys carried<br />

out in Israel by the Israel Anti-Drug Authority in 1989 (N = 5280), 1992 (N = 1816) and<br />

1995 (N = 5044), providing a total of some 12,700 observations. While sample size varied<br />

between surveys, the sampling procedure remained the same. The sample is intended to<br />

represent the Jewish population in Israel aged 18–40 years. It was based on a geographic<br />

sample of points in each of the larger Jewish cities and towns (i.e. with population of<br />

30,000 or more) and a sample of the smaller localities. Starting at each of these geographic<br />

points a cluster of 10 residential apartments was sampled. Experienced interviewers first<br />

checked if there were individuals in the designated age bracket in the ho<strong>use</strong>hold. One<br />

individual was selected from each ho<strong>use</strong>hold by one of two, predetermined methods: either<br />

the first available eligible person was interviewed, or the interviewee was selected by the<br />

Troldahl–Carter method (for more details see Barnea et al. (1990)).<br />

Interviewers were instructed to make at least three attempts to obtain a sampled ho<strong>use</strong>hold,<br />

or person within the ho<strong>use</strong>hold. If there was no response from an apartment there was no<br />

way of determining whether it is residential, commercial, or an unoccupied apartment. Most<br />

probably there is some under-representation of new immigrants, and others who are either<br />

away from home much of their time, or <strong>do</strong> not speak Hebrew. Otherwise, the sample seems<br />

to be representative of Israeli Jews in this age bracket who live at home. It <strong>do</strong>es not cover<br />

at all institutionalized persons, including those in military service away from home and<br />

prisoners, nor members of Kibbutzim (collective farms) who make up about 2% of the<br />

population.<br />

3 In the classic frailty model the survival hazards of parents and children are assumed to be related via ρ. This<br />

has a genetic interpretation; strong parents breed strong children. Additionally, or independently, there may be a<br />

behavioral effect in that the early death of the parent may adversely <strong>affect</strong> the survival of the child. This direct<br />

frailty effect is captured by β. We are unaware of empirical examples where direct and indirect frailty effects are<br />

estimated.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 685<br />

Table 1<br />

Sequencing in ever <strong>use</strong> of substances (%)<br />

From<br />

To<br />

Cigarettes Cannabis Hard <strong>drug</strong>s Stopped<br />

Nothing 1.1 0<br />

Cigarettes 12.2 2.9 24.6<br />

Cannabis 5.2 12.5 58.6<br />

Hard <strong>drug</strong>s 5.5 11.0 62.1<br />

Total 59.0 8.2 1.15<br />

Data on <strong>cigarette</strong>, alcohol and <strong>illicit</strong> <strong>drug</strong> consumption were collected through a series<br />

of questions using the following structure:<br />

1. Have you ever <strong>use</strong>d X?<br />

2. If yes, how old were you when you <strong>use</strong>d X for the first time?<br />

3. If yes, how much X have you <strong>use</strong>d (a) ever, (b) during the last year, (c) during the last<br />

month? Fill in relevant category of <strong>use</strong>.<br />

The data were collected using face-to-face interviews. However, respondents provided<br />

answers to questions on <strong>illicit</strong> <strong>drug</strong> data anonymously at the end of the interview. 4<br />

Individuals who report positive ever-<strong>use</strong> of X but have not <strong>use</strong>d X during the last year are<br />

deemed to have stopped using it (at least for the time being). Table 1 gives a bird’s eye view<br />

of the data and the sequencing of <strong>drug</strong> <strong>use</strong>. The bottom row of the table records the total who<br />

ever <strong>use</strong>d the particular substance. For example, 59% of those who participated in the three<br />

samples reported that they ever smoked, while only 1.15% reported that they ever <strong>use</strong>d hard<br />

<strong>drug</strong>s (cocaine, heroin, LSD). These totals <strong>do</strong> not take account of sequencing. The rows<br />

in Table 1 indicate sequencing. For example, 12.2% of <strong>cigarette</strong> smokers subsequently<br />

<strong>use</strong>d cannabis, and 2.9% subsequently <strong>use</strong>d hard <strong>drug</strong>s without using cannabis first. A<br />

total of 12.5% of cannabis <strong>use</strong>rs subsequently <strong>use</strong>d hard <strong>drug</strong>s. Note that the data are not<br />

entirely consistent with gateway sequencing. A total of 1.1% <strong>use</strong>d cannabis without smoking<br />

<strong>cigarette</strong>s first, i.e. about 5.5% of cannabis <strong>use</strong>rs did not smoke first. A total of 5.2% of<br />

cannabis <strong>use</strong>rs subsequently started smoking <strong>cigarette</strong>s.<br />

Such reverse gateway behavior is also apparent in the consumption of hard <strong>drug</strong>s. 2.9%<br />

of <strong>cigarette</strong> smokers subsequently <strong>use</strong>d hard <strong>drug</strong>s without first using cannabis. 11% of<br />

hard <strong>drug</strong> <strong>use</strong>rs subsequently <strong>use</strong>d cannabis and 5.5% of then subsequently started smoking<br />

<strong>cigarette</strong>s.<br />

It is clear that <strong>cigarette</strong> smoking is neither a necessary nor sufficient precondition for<br />

cannabis, which in turn is neither a necessary nor sufficient precondition for hard <strong>drug</strong>s. Most<br />

investigators of the gateway sequencing hypothesis would tend to accept such deviations as<br />

being within the margin of error. Our view is that the issue is not important. The important<br />

question is not whether it is possible to <strong>use</strong> cannabis prior to <strong>cigarette</strong>s (it evidently is),<br />

4 It is well known that despite attempts to guarantee confidentiality such data may contain reporting errors.<br />

Tsibel (2000) <strong>use</strong>s our data to estimate a true model and a miss-reporting model. She finds that miss-reporting<br />

depends upon age and the number of inconsistencies in replies.


686 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

but whether incidental <strong>use</strong> of <strong>cigarette</strong>s increases the risk of subsequently using cannabis.<br />

Likewise, the question will be whether the incidental <strong>use</strong> of cannabis increases the risk of<br />

subsequently using hard <strong>drug</strong>s.<br />

Table 1 further indicates that the majority of <strong>illicit</strong> <strong>drug</strong> <strong>use</strong>rs stop using them, e.g. 62.1%<br />

of hard <strong>drug</strong> <strong>use</strong>rs had stopped. The table shows that the proportion of <strong>cigarette</strong> smokers<br />

who stop smoking is smaller than the proportion of stoppers among <strong>illicit</strong> <strong>drug</strong> <strong>use</strong>rs. The<br />

data are truncated beca<strong>use</strong> <strong>use</strong>rs may stop after the survey date. We have shown elsewhere<br />

(Beenstock and Rahav, 2000) that the data in the table understate the true rates of stopping,<br />

and that whereas the stopping hazard for cannabis is duration dependent, the stopping hazard<br />

for hard <strong>drug</strong>s is age dependent.<br />

We <strong>do</strong> not attach importance to alcohol as a gateway substance, beca<strong>use</strong> in Israel the<br />

cultural significance of alcohol is quite different from its significance in other western<br />

countries. Apart from the usual demographic variables, such as age, education, country of<br />

origin, and father’s country of origin, the surveys also contain data about religious practice,<br />

frequency of pub visits, and a range of economic variables.<br />

3.2. Instrumental variables<br />

To identify the gateway effect of <strong>cigarette</strong>s on cannabis <strong>use</strong> we seek instruments, which<br />

<strong>affect</strong> the <strong>use</strong> of <strong>cigarette</strong>s without simultaneously <strong>affect</strong>ing the <strong>use</strong> of cannabis. Tobacco<br />

taxes are uniform in Israel (except in Eilat where there is no VAT), hence we cannot implement<br />

Evans and Ringel’s (1999) idea. However, <strong>cigarette</strong> taxes have changed over time,<br />

as has the world price of tobacco. This suggests that the relative price of <strong>cigarette</strong>s may<br />

serve as a possible instrument for smoking. People who grew up when <strong>cigarette</strong>s were<br />

relatively cheap are more likely to smoke, given everything else, than people who grew<br />

up when <strong>cigarette</strong>s were expensive. In this case, ran<strong>do</strong>mization is by birth cohort and the<br />

price of <strong>cigarette</strong>s. The natural experiment consists of investigating whether birth cohorts<br />

who matured when <strong>cigarette</strong>s were relatively cheap were more prone to smoking, and as a<br />

consequence, were more prone to <strong>use</strong> cannabis.<br />

The crucial identifying restriction is that the price of <strong>cigarette</strong>s at the time when individual<br />

n was growing up <strong>affect</strong>s the probability of him smoking them, but it <strong>do</strong>es not <strong>affect</strong> the<br />

probability of using cannabis at a later date. There is no intertemporal substitution between<br />

<strong>cigarette</strong>s and cannabis; e.g. consumers <strong>do</strong> not plan to consume cannabis in the future as a<br />

result of an increase in the current price of <strong>cigarette</strong>s. In this sense, we assume consumers<br />

are myopic. If this were not the case the gateway effect would be inverted; when <strong>cigarette</strong><br />

consumption falls cannabis consumption should increase. We show later that this <strong>do</strong>es<br />

not happen in practice, and that cannabis consumption is not <strong>affect</strong>ed by <strong>cigarette</strong> <strong>prices</strong>,<br />

whereas <strong>cigarette</strong> consumption is so <strong>affect</strong>ed. This <strong>do</strong>es not necessarily mean that there is<br />

no intratemporal substitution between <strong>cigarette</strong>s and cannabis. If an individual eventually<br />

happens to consume both <strong>cigarette</strong>s and cannabis a change in their relative price might<br />

induce him to alter his relative demand for <strong>cigarette</strong>s.<br />

Unfortunately, there are no systematic data for cannabis <strong>prices</strong> in Israel. Had they existed,<br />

however, we would have dated them at time t rather than time t − 1 beca<strong>use</strong>, according to<br />

gateway sequencing, cannabis initiation at time t−1 <strong>do</strong>es not depend upon cannabis <strong>prices</strong> at<br />

a later date (time t) they would have constituted a further source of identifying information.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 687<br />

The approach may be extended to other variables with possibly identifying power. These<br />

include exposure to anti-smoking regulations such as the introduction of mandatory health<br />

warnings. Older birth cohorts, who were young before such regulations were introduced, are<br />

more likely to smoke, given other considerations, than younger birth cohorts. The accident<br />

of birth serves to ran<strong>do</strong>mize exposure to the regulatory environment.<br />

A third possible source of natural experimentation consists of the herd effect; the individual<br />

who happens to grow up when smoking is more prevalent may be more likely to<br />

smoke as a result of demonstration effects. Since the individual has no control over his<br />

environment, aggregate smoking patterns during the years of high risk exposure (say, aged<br />

15–20 years) may serve to ran<strong>do</strong>mize individual smoking behavior.<br />

In Figs. 1 and 2 we <strong>use</strong> CBS data to plot the real price of <strong>cigarette</strong>s (as measured by<br />

the average price per packet deflated by the consumer price index) in Israel, and aggregate<br />

<strong>cigarette</strong> consumption per capita. The data indicate that <strong>cigarette</strong>s were relatively expensive<br />

in the 1960s, while per capita <strong>cigarette</strong> consumption was lower. Subsequently, <strong>cigarette</strong>s<br />

became cheaper and per capita <strong>cigarette</strong> consumption increased. 5 A person born in the<br />

early 1950s would have experienced relatively high <strong>cigarette</strong> <strong>prices</strong> in his teens, and a more<br />

smoke-free environment. By contrast, a person born in the 1970s would have experienced<br />

relatively low <strong>cigarette</strong> <strong>prices</strong> in his teens, and a less smoke-free environment. Controlling<br />

for other factors, the former is less likely to smoke than the latter, if indeed herd effects and<br />

the price of <strong>cigarette</strong>s <strong>affect</strong>s smoking initiation.<br />

We <strong>use</strong> the data in Fig. 1 to calculate a weighted average of <strong>cigarette</strong> <strong>prices</strong> for each<br />

individual during his “impressionable” years. The weights are taken from the hazard function<br />

for smoking initiation (Fig. 3, taken from Beenstock and Rahav (2001)). This gives<br />

a higher weight in the ages 15–20 years, when the risk of smoking initiation is high, and<br />

smaller weights when the risk of smoking initiation is low. We apply a similar procedure<br />

to the data in Fig. 2. In this way, we calculate for each individual the personalized<br />

price of <strong>cigarette</strong>s and smoking environment. These variables are naturally ran<strong>do</strong>mized<br />

by birth year, <strong>cigarette</strong> <strong>prices</strong> and aggregate smoking behavior. This is similar to the<br />

approach taken by Douglas and Hariharan (1994) who <strong>use</strong>d a narrower win<strong>do</strong>w (15–18<br />

years) than ours. These variables are <strong>use</strong>d as instrumental variables beca<strong>use</strong> they are independent<br />

of observed heterogeneity that might influence the propensity to smoke and <strong>use</strong><br />

<strong>drug</strong>s.<br />

4. Results<br />

We report two types of gateway test using the metho<strong>do</strong>logy of natural experimentation.<br />

In the first, we estimate the recursive model as specified in Section 2.2 using a bivariate<br />

probit procedure (RBP), and a two-stage logit (2SL) procedure. The second type of test<br />

refers to the relationship between smoking initiation and cannabis initiation. We apply the<br />

5 Unfortunately, there are no systematic prevalence data. Since 1972, the Health Education Department at the<br />

Ministry of Health has occasionally commissioned surveys on smoking prevalence. These data have been analysed<br />

by Ben-Sira (e.g. Ben-Sira (1993)). They suggest that between 1972 and 1988 smoking prevalence was stable, but<br />

it fell during the 1990s.


688 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 689


690 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

Fig. 3. Survival and hazard functions for age at <strong>cigarette</strong> initiation.<br />

metho<strong>do</strong>logy described in Section 2.3 and estimate a CPHM for the former in which the<br />

personalized relative price of <strong>cigarette</strong>s is hypothesized to influence the hazard of smoking. 6<br />

4.1. Test 1: sequencing<br />

We begin by reporting results for the 2SL procedure, which we compare with results<br />

obtained using the RBP procedure. Table 2 records our chosen logit model for the <strong>use</strong> of<br />

<strong>cigarette</strong>s. All the variables are statistically significant. The “personalized” relative price of<br />

<strong>cigarette</strong>s reduces the probability of smoking (odds ratio = 0.991). Among the demographic<br />

controls we mention that women are less likely to smoke, as are more religious people.<br />

People born in Israel are less likely to smoke, as are people whose fathers were born in<br />

6 Douglas and Hariharan (1994) estimated a split sample hazard model for smoking. In Beenstock and<br />

Rahav (2001), we estimate split sample hazard models for cannabis and hard <strong>drug</strong>s both parametrically and<br />

semi-parametrically. In the present context, split sample estimation is not numerically feasible beca<strong>use</strong> of the need<br />

for bootstrapping. CPHM nevertheless <strong>do</strong>es not require the population survival rate to tend to 0 and in this sense<br />

it mimics split sample models.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 691<br />

Table 2<br />

Logit model for <strong>cigarette</strong> smoking<br />

Variable Coefficient S.E.<br />

Intercept 1.2715 0.1254<br />

Female −0.6689 0.0390<br />

Age 0.0623 0.0072<br />

Israel −0.2634 0.0601<br />

Middle East −0.1677 0.0514<br />

Balkan −0.1926 0.0636<br />

Asia −0.4209 0.1747<br />

Eastern Europe −0.3033 0.0690<br />

Education 4 0.2823 0.0521<br />

Religious (high) −0.8841 0.0528<br />

Religious (middle) −0.2316 0.0453<br />

Survey 1989 0.4450 0.0614<br />

Cigarette Price −0.0094 0.0013<br />

N = 12,455; −2log L = 16,171; P-value for χ 2 = 0.0001.<br />

Asia. Cohort effects are picked up by two variables, age and survey 1989, both of which<br />

imply that earlier cohorts are more likely to smoke.<br />

We <strong>use</strong> this model to calculate the predicted probability of smoking (PPS, denoted in the<br />

last row of Table 3). Table 3 records our chosen model for subsequent cannabis <strong>use</strong>. The PPS<br />

has a large and highly significant positive effect on the probability of subsequent cannabis<br />

<strong>use</strong>. Had we <strong>use</strong>d the dummy variable for prior smoking (C in Eq. (1) and the dependent<br />

variable in Table 2) instead of its instrumented value, it could have been argued that the<br />

observed gateway effect from smoking to cannabis is the result of spurious correlation<br />

induced by unobserved heterogeneity. In fact, it turns out that the dummy variable for<br />

smoking is yet more significant, however, this significance is exaggerated by unobserved<br />

heterogeneity. Since PPS is purged of unobserved heterogeneity, the conclusion from Table 3<br />

must be in favor of a causal gateway effect from <strong>cigarette</strong>s to cannabis.<br />

Two types of parameter standard errors are reported in Table 3, the original standard<br />

errors, and their bootstrapped counterparts. The standard errors have been bootstrapped for<br />

Table 3<br />

Logit model for subsequent cannabis <strong>use</strong><br />

Variable Coefficient S.E. Bootstrap S.E.<br />

Intercept −6.0553 0.3838<br />

Female 0.2294 0.1067 0.1146<br />

Age −0.0207 0.0059 0.0061<br />

Israel 0.5566 0.0898 0.1055<br />

Education 4 −0.7391 0.1105 0.1144<br />

Education 5–7 −0.7072 0.0823 0.0756<br />

Religious 3 −0.4492 0.0818 0.0910<br />

Pub frequency 0.2701 0.0287 0.0271<br />

Cigarette probability (PPS) 6.4529 0.4347 0.5417<br />

N = 12,382; −2log L = 5782; P-value for χ 2 = 0.0001.


692 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

Table 4<br />

Comparison between 2SP and 2SBP<br />

Variable 2SP 2SBP<br />

Cigarette <strong>prices</strong> −0.00553 (7.119) −0.00538 (7.133)<br />

ρ 0 0.57 (27.1)<br />

PPS 3.228 (13.058) 3.402 (14.300)<br />

The t-values are shown in parentheses.<br />

two reasons. First, beca<strong>use</strong> PPS is an estimate of its true value. Secondly, beca<strong>use</strong> in any<br />

case IV estimation requires correction of the standard errors in the second stage. We <strong>use</strong> 200<br />

i.i.d. resamplings to calculate the standard errors. Andrews and Buchinsky (1999) discuss<br />

the number of resamplings required to attain a prescribed degree of accuracy for different<br />

P-values. It turns out that the original (unbootstrapped) standard errors provide a reliable<br />

guide to statistical significance. Also, the original parameter estimates reported in Table 3<br />

turn out to be close to the means of their bootstrapped counterparts, suggesting that the<br />

crude estimates are unbiased.<br />

It is noteworthy that if the relative price of <strong>cigarette</strong>s is omitted from Table 2, PPS ceases<br />

to be statistically significant in Table 3. This suggests that there is identifying information<br />

in <strong>cigarette</strong> <strong>prices</strong>, and that the significance of PPS in Table 3 is not simply the result of<br />

parametric identification.<br />

We experimented with other instruments, including herd effects and exposure to <strong>cigarette</strong><br />

regulations. In the former case we <strong>use</strong>d “personalized” <strong>cigarette</strong> consumption per capita<br />

(based in Fig. 2) as an additional covariate in Table 2, but it was not statistically significant. 7<br />

In the latter case, we defined exposure to be the number of adult years lived prior to the<br />

introduction of mandatory health warnings on <strong>cigarette</strong> packets. We expected that more<br />

exposed individuals are more likely to smoke, holding other variables constant. However,<br />

this did not turn out to be the case. This was true when we <strong>use</strong>d a variety of health warning<br />

trigger dates (1983 Israel, 1971 UK, 1965 US) both individually and collectively. If anything,<br />

we found that more exposed individuals, as defined, were less likely to smoke. However, this<br />

may be the manifestation of some complex cohort effect; early birth cohorts are less likely<br />

to smoke. In summary, the main instrument for smoking turned out to be the personalized<br />

relative price of <strong>cigarette</strong>s.<br />

Other results in Table 3 include a negative cohort effect; older birth cohorts are less likely<br />

to <strong>use</strong> cannabis, Sabras (native Israelis) are more likely to <strong>use</strong> cannabis, the uneducated and<br />

the highly educated are more likely to <strong>use</strong> cannabis. When controlling for other factors,<br />

women and secular Jews are more likely to <strong>use</strong> cannabis.<br />

As noted in Section 2.2, 2SL is consistent but not efficient, beca<strong>use</strong> it restricts ρ = 0.<br />

Inefficiency may induce bias in non-linear estimators. To investigate this we re-estimated the<br />

model using two-stage probit (2SP), solved for PPS, and then iterated PPS using two-stage<br />

bivariate probit (2SBP). Coefficient estimates for the main variables of interest are shown<br />

in Table 4.<br />

7 It is possible that prevalence data would have produced different results. See Footnote 5.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 693<br />

Table 5<br />

Logit model for subsequent hard <strong>drug</strong> <strong>use</strong><br />

Variable Coefficient S.E. Bootstrap S.E.<br />

Intercept −5.0 2.3445<br />

Female −1.0621 0.2611 0.2880<br />

Age 0.0089 0.0151 0.0140<br />

Israel −0.5690 0.2743 0.3345<br />

Middle East −1.1456 0.3123 0.3187<br />

Balkan −1.1456 0.3544 0.4364<br />

Eastern Europe −1.2418 0.3759 0.4513<br />

Education 4 −0.6849 0.2919 0.2909<br />

Education 5–7 −1.1975 0.2969 0.3440<br />

Religious (high) −1.1149 0.4127 0.4388<br />

Religious (middle) −0.7614 0.2968 0.3644<br />

Survey 1989 −0.5022 0.2382 0.2673<br />

Survey 1992 −0.6256 0.3070 0.2989<br />

Pub frequency 0.3001 0.1106 0.1514<br />

PPH 1.0967 2.6563 2.7315<br />

N = 12,382; −2log L = 1261; P-value for χ 2 = 0.0001.<br />

Table 4 shows that the enhanced efficiency of 2SBP makes little difference to the parameter<br />

estimates. We also found that 2SP estimates are similar to the 2SL estimates reported<br />

in Tables 2 and 3. This suggests that while 2SL is not efficient, and may be biased in small<br />

samples, the loss of efficiency is likely to be minimal.<br />

We <strong>use</strong>d the specification in Tables 2 and 3 to estimate by RBP instead of 2SL. To save<br />

space we <strong>do</strong> not report the counterparts of Tables 2 and 3 that were obtained using RBP. On<br />

the whole the results were similar; the signs and significance of the coefficients were the<br />

same as in 2SL. The RBP estimate of β in Eq. (1) is 2.337 with t-value 36.57, which may<br />

be contrasted with its two-stage counterparts reported in the Table 4.<br />

In Table 5, we apply the method of “<strong>do</strong>mino” identification to the effect of cannabis on<br />

hard <strong>drug</strong>s. The parameter of interest is δ in Eq. (3). We present a 2SL model for subsequent<br />

hard <strong>drug</strong> <strong>use</strong> (i.e. after cannabis) in which PPH is the predicted probability of cannabis<br />

<strong>use</strong> that is derived from Table 3. It is positive but not statistically significant, implying<br />

that there is not a causal gateway effect from cannabis to hard <strong>drug</strong> <strong>use</strong>. However, it turns<br />

out that this result depends upon the specification of the model in Table 3. If, e.g. fathers’<br />

country of birth is included as a regressor, then PPH becomes marginally significant. In<br />

summary: becoming a <strong>cigarette</strong> smoker increases the probability of becoming a <strong>use</strong>r of<br />

cannabis, which increases the probability of becoming a <strong>use</strong>r of hard <strong>drug</strong>s. The former link<br />

in the gateway chain is highly statistically significant, whereas the latter link is less statistically<br />

robust. Since the asymptotic standard errors of the parameter estimates vary directly<br />

with the rarity of the event, the standard errors in the model for hard <strong>drug</strong>s will naturally<br />

tend to be larger than their counterparts in the model for cannabis. It is possible therefore<br />

that 12,500 observations may be insufficient to estimate with sufficient accuracy the parameters<br />

in Table 5. Also, “<strong>do</strong>mino” identification naturally weakens the power of the test<br />

beca<strong>use</strong> of its indirectness. It places perhaps too great a burden of identification on <strong>cigarette</strong>s<br />

<strong>prices</strong>.


694 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

So much for the two-stage investigation of the causal effect of cannabis <strong>use</strong> on hard<br />

<strong>drug</strong> <strong>use</strong>. We now <strong>use</strong> RBP to investigate the same issue. Strictly speaking, RBP is not<br />

appropriate beca<strong>use</strong> we have added a third link to the gateway chain. A recursive trivariate<br />

probit (RTP) estimator is required. Since RTP is non-standard we have carried out the<br />

following approximation.<br />

(a) We <strong>use</strong> RBP to model the link between <strong>cigarette</strong>s and cannabis.<br />

(b) We <strong>use</strong> the results to obtain PPS, the predicted probability for smoking.<br />

(c) We <strong>use</strong> RBP to model the link between cannabis and hard <strong>drug</strong>s, where PPS is <strong>use</strong>d in<br />

the probit model for cannabis.<br />

This procedure estimates RTP in two stages. When we <strong>use</strong> the variables specified in<br />

Tables 2, 3 and 5 we obtain that the estimate of δ is 0.178 (t = 0.708), implying that the<br />

causal effect from cannabis to hard <strong>drug</strong>s is positive, but not statistically significant. The<br />

associated estimate of ρ is 0.835 (S.E. = 0.135). This result confirms the one obtained by<br />

2SL as reported in Table 5. The estimate of ρ strongly suggests that the gateway effect from<br />

cannabis to hard <strong>drug</strong>s is the result of correlation between the unobserved heterogeneity; it<br />

is not causal.<br />

However, δ became statistically significant when certain variables were dropped from the<br />

model for hard <strong>drug</strong>s. When we dropped “pub frequency” and “education 4”, we obtained<br />

an estimate of δ = 2.61 (t = 5.53) and ρ =−0.22 (S.E. = 0.213). While these restrictions<br />

fail a likelihood ratio test, they undermine the degree to which we can confidently reject the<br />

hypothesis that there is no causal gateway effect from cannabis to hard <strong>drug</strong>s.<br />

4.2. Test 2: initiation hazard<br />

In Table 6 we present a CPHM for smoking initiation in which the “personalized” price of<br />

<strong>cigarette</strong>s is clearly significant, implying that the age of <strong>cigarette</strong> initiation varies directly<br />

with the price of <strong>cigarette</strong>s, after controlling for demographic variables. The model also<br />

implies that women start smoking later, as <strong>do</strong> religious people and Israeli born.<br />

Table 6<br />

CPHM for smoking initiation<br />

Variable Coefficient S.E.<br />

Female −0.496400 0.02364<br />

Israel 0.135970 0.03793<br />

Middle East 0.111880 0.03215<br />

Balkan −0.075090 0.03909<br />

Asia −0.268128 0.11848<br />

Eastern Europe −0.152799 0.04350<br />

Education 4 0.216162 0.03074<br />

Religious (high) −0.547501 0.03484<br />

Religious (middle) −0.148835 0.02748<br />

Survey 1989 0.106948 0.02992<br />

Cigarette price −0.001895 0.000313<br />

N = 12,455; −2log L = 13,1067; P-value for χ 2 = 0.0001.


M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 695<br />

Table 7<br />

CPHM for cannabis initiation<br />

Variable Coefficient S.E. Bootstrap S.E.<br />

Female 0.4211 0.1192 0.1351<br />

Israel 0.4527 0.0839 0.0946<br />

Balkan −0.2003 0.0947 0.0980<br />

Education 4 −0.7770 0.1083 0.1094<br />

Education 5–7 −0.6260 0.0752 0.0681<br />

Religious (middle) −0.3830 0.0787 0.0843<br />

Pub frequency 0.2682 0.0255 0.0247<br />

PRSS 2.2195 0.1953 0.2137<br />

N = 12,382; −2log L = 15,874; P-value for χ 2 = 0.0001.<br />

In Table 7 we present a CPHM for cannabis initiation in which PRSS = Xβ is specified<br />

as a covariate. Similar results are obtained if Xβ is transformed non-linearly as in the earlier<br />

equation. We carried out 200 i.i.d. resamplings for bootstrapping. PRSS is statistically significant,<br />

implying that <strong>cigarette</strong> initiation has a causal gateway effect on cannabis initiation.<br />

The longer one is predicted to survive without smoking, the longer one survives without<br />

using cannabis. Table 8 further implies that women initiate cannabis later, as <strong>do</strong> religious<br />

people and the children of Balkan fathers. Israeli born initiate earlier despite the fact that<br />

they start smoking later.<br />

Next, we calculate the predicted relative survival for cannabis (PRSC) from Table 7 for<br />

each individual, and we apply the method of “<strong>do</strong>mino” identification to determine whether<br />

age at cannabis initiation has a causal effect upon age at hard <strong>drug</strong> initiation. We specify<br />

PRSC in Table 8 as the indirectly instrumented value of the cannabis initiation hazard.<br />

The non-instrumented alternative would have been to specify the age of cannabis initiation.<br />

However, this would have embodied unobserved heterogeneity, which might have induced a<br />

Table 8<br />

CPHM for hard <strong>drug</strong> initiation<br />

Variable Coefficient S.E. Bootstrap S.E.<br />

Female −0.7609 0.4573 0.4043<br />

Israel −0.6227 0.2894 0.2671<br />

Middle East −1.0511 0.3378 0.3362<br />

Balkan −0.9891 0.4203 0.4572<br />

Eastern Europe −1.0666 0.4070 0.4423<br />

Education 4 −0.4802 0.3201 0.3307<br />

Education 5–7 −0.8362 0.4115 0.3956<br />

Religious (high) −0.6042 0.8012 0.7854<br />

Religious (middle) −0.4497 0.4946 0.4995<br />

Survey 1989 −0.5681 0.2314 0.2041<br />

Survey 1992 −0.5729 0.3025 0.3431<br />

Pub frequency 0.1999 0.1961 0.1984<br />

PRSC 0.3173 0.6052 0.5558<br />

N = 12,382; −2log L = 2128; P-value for χ 2 = 0.0001.


696 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

spurious gateway effect. The bootstrapped standard errors are based upon 200 resamplings.<br />

Table 8 indicates that PRSC is positive but not statistically significant, implying that there<br />

is not a causal gateway effect from cannabis to hard <strong>drug</strong>s.<br />

When the age of cannabis initiation is specified instead of PRSC in Table 8, its coefficient<br />

turned out to be negative and highly statistically significant, implying that age at hard <strong>drug</strong><br />

initiation varies directly with age at cannabis initiation. This result suggests a gateway<br />

effect from cannabis initiation to hard <strong>drug</strong> initiation. However, the test in Table 8 shows<br />

that this effect is not causal since it results from unobserved heterogeneity. People who are<br />

susceptible to <strong>drug</strong>s happen to initiate cannabis sooner, and initiate hard <strong>drug</strong>s sooner. Thus,<br />

the gateway effect is spurious.<br />

In summary, beginning to smoke at an earlier age induces earlier <strong>use</strong> of cannabis, which<br />

induces earlier <strong>use</strong> of hard <strong>drug</strong>s. The former link in the gateway chain is highly statistically<br />

significant, whereas the latter link is not robust. In these respects, the results from experiment<br />

2 complement those obtained from experiment 1.<br />

5. Conclusion<br />

Controlling for cohort and other effects, people who grew up when <strong>cigarette</strong>s were relatively<br />

cheap were more likely to smoke, and to start smoking younger. As a causal consequence<br />

they were more likely to <strong>use</strong> cannabis afterwards and to initiate cannabis sooner. Our<br />

findings from natural experimentation indicate for the first time a causal gateway effect from<br />

<strong>cigarette</strong>s to cannabis, both for sequencing and for timing. On the other hand, we <strong>do</strong> not find<br />

a causal gateway effect from cannabis to hard <strong>drug</strong>s, both for sequencing and for timing.<br />

The latter result has to be qualified by two considerations. Beca<strong>use</strong> hard <strong>drug</strong> <strong>use</strong> is rare,<br />

hypothesis testing is more difficult. Secondly, the identification of the causal gateway effect<br />

from cannabis to hard <strong>drug</strong>s was indirect.<br />

These findings imply that higher <strong>cigarette</strong> taxes will not only reduce <strong>cigarette</strong> smoking,<br />

they will also reduce cannabis consumption. However, they will not reduce the consumption<br />

of hard <strong>drug</strong>s. They further imply that if cannabis is decriminalized, and the consumption of<br />

cannabis increases (Model, 1991), it will not effect the consumption of hard <strong>drug</strong>s. Hence,<br />

if policy makers are reluctant to decriminalize cannabis for gateway reasons our results <strong>do</strong><br />

not support this position. It should be recalled, however, that just as the anti-<strong>drug</strong> legislation<br />

in the early part of the 20th century was politically inspired (Musto, 1973), so scientific<br />

reasoning in the 21st century may not sway the issue.<br />

We admit that our approach has been somewhat ad hoc. This stems from the fact that the<br />

rational addiction model refers to single substances rather than multiple substances. There<br />

is a need to extend the rational addiction model to multiple substances in order to shed<br />

theoretical light upon the gateway phenomenon. Why <strong>do</strong> some people advance to cannabis<br />

from <strong>cigarette</strong>s, or from cannabis to hard <strong>drug</strong>s, while others <strong>do</strong> not? Such a model might<br />

suggest how the <strong>prices</strong> of <strong>cigarette</strong>s, cannabis and hard <strong>drug</strong>s <strong>affect</strong> the consumption of these<br />

substances at different stages in the life cycle. In the absence of price data for <strong>illicit</strong> <strong>drug</strong>s<br />

in Israel, we have been forced to place all the burden of identification upon <strong>cigarette</strong> <strong>prices</strong>.<br />

The availability of <strong>illicit</strong> <strong>drug</strong> price data in the US makes the extension of our approach<br />

there of particular interest.


Acknowledgements<br />

M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 697<br />

We wish to thank Irena Vovk for her excellent research assistance; the Israel Science<br />

Foundation and the Eshkol Institute for financial support; the Israel Anti-Drug Authority<br />

for providing the data; Michael Hvoshnyansky who helped us prepare the final version of<br />

the paper; and three referees whose remarks considerably improved the paper.<br />

References<br />

Andrews, D.W.K., Buchinsky, M., 1999. A three-step method for choosing the number of bootstrap repetitions.<br />

Econometrica 68, 23–51.<br />

Barnea, Z., Teichman, M., Rahav, G., Gil, R., Rozenbloom, Y., 1990., The Use of Drugs and Alcohol Among the<br />

Population of Israel: An Epidemiologic Study. The Anti-Drugs Authority, Jerusalem.<br />

Baumrind, D., 1983. Specious causal attributions in the social sciences: the reformulated stepping stone theory of<br />

heroin <strong>use</strong> as an exemplar. Journal of Personality and Social Psychology 45, 1289–1298.<br />

Becker, G.S., Murphy, K.M., 1988. A theory of rational addiction. Journal of Political Economy 75, 700–765.<br />

Beenstock, M., Rahav, G., 2000. Maturing Out as a Life-Cycle or Duration Dependent Phenomenon in the Natural<br />

History of Drug Use in Israel, mimeo.<br />

Beenstock, M., Rahav, G., 2001. Immunity and Susceptibility in Illicit Drug Initiation in Israel.<br />

Ben-Sira, Z., 1993. Smoking: A Follow-up on Behavior Patterns and Change. The Louis Guttman Israel Institute<br />

of Applied Social Research, Jerusalem (in Hebrew).<br />

Chaloupka, F.J., 1991. Rational addictive behavior and <strong>cigarette</strong> smoking. Journal of Political Economy 99, 722–<br />

742.<br />

Douglas, S., Hariharan, G., 1994. The hazard of starting smoking: estimates from a split population duration<br />

model. Journal of Health Economics 13, 213–230.<br />

Evans, W.N., Ringel, J.S., 1999. Can higher <strong>cigarette</strong> taxes improve birth outcomes? Journal of Public Economics<br />

18, 135–154.<br />

Frey, B.S., 1997. Drugs and Economic Policy, Economic Policy.<br />

Greene, W.H., 2000., Econometric Analysis, 4th Edition. Prentice-Hall, Englewood Cliffs, NJ.<br />

Grossman, M., Chaloupka, F.J., 1999. The demand for cocaine by young adults: a rational addiction approach.<br />

Journal of Health Economics 18, 427–474.<br />

Hougaard, P., 1995. Frailty models for survival data. Lifetime Data Analysis 1, 255–274.<br />

Kandel, D.S., 1975. Stages in a<strong>do</strong>lescent involvement in <strong>drug</strong> <strong>use</strong>. Science 190, 912–924.<br />

Klein, J.P., Moeschberger, M.L., 1997. Survival Analysis. Springer, New York.<br />

Larson, M.G., Dinse, G.E., 1985. A mixture model for the regression analysis of competing risks data. Applied<br />

Statistics 34, 201–211.<br />

Maddala, G.S., 1983. Limited-Dependent and Qualitative Variables in Econometrics. Cambridge University Press,<br />

Cambridge.<br />

Miron, J.A., Zweibel, J., 1991. Alcohol consumption during prohibition. American Economic Review 81, 242–247.<br />

papers and proceedings..<br />

Miron, J.A., Zweibel, J., 1995. The economic case against <strong>drug</strong> prohibition. Journal of Economic Perspectives 9,<br />

175–192.<br />

Model, K., 1991. The effect of marijuana decriminalization on hospital emergency room <strong>drug</strong> episodes. Journal<br />

of the American Statistical Association 88, 737–747.<br />

Moore, M.H., 1973. Policies to achieve discrimination on the effective price of heroin. American Economic Review<br />

63, 270–277.<br />

Mullahy, J., Sindelar, J.L., 1993. Alcoholism, work and income. Journal of Labor Economics 11, 494–520.<br />

Musto, D.F., 1973. The American Disease: Origins of Narcotic Control. Yale University Press, New Haven.<br />

Nisbet, C.T., Vakil, F., 1972. Some estimates of price and expenditure elasticities of demand for marijuana among<br />

UCLA students. Review of Economics and Statistics 54, 473–475.<br />

O’Donnell, J.A., Clayton, R.R., 1982. The stepping stone hypothesis: marijuana, heroin and causality. Chemical<br />

Dependency 4, 229–241.


698 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698<br />

Pacula, R.L., 1998. A<strong>do</strong>lescent Alcohol and Marijuana Consumption: Is there Really a <strong>Gateway</strong> Effect? NBER<br />

working paper 6348.<br />

Prinz, A., 1997. Do European <strong>drug</strong>s policies matter? Economic Policy.<br />

Rottenberg, S., 1968. The clandestine distribution of heroin, its discovery and suppression. Journal of Political<br />

Economy 75, 78–90.<br />

Tsibel, N., 2000. Event history analysis accounting for measurement errors and missing data, Ph.D. dissertation.<br />

The Hebrew University of Jerusalem.<br />

White, M.D., Lusketich, W.A., 1983. Heroin: price elasticity and enforcement strategies. Economic Inquiry 21,<br />

557–564.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!