Testing Gateway Theory: do cigarette prices affect illicit drug use?

More documents

Recommendations

Info

682 M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 The solution to this problem is to instrument C t−1 in Eq. (1). No doubt Z and X largely overlap. An element of Z is the price of cigarettes (and other variables discussed in Section 4) that prevailed when individual n was growing up (P n(t−1) ), which are hypothesized to determine the demand for cigarettes but not the demand for cannabis. We expect C t−1 to vary inversely with P t−1 . There is no reason to suspect that P t−1 and v t−1 are correlated (nor P t−1 and u t ). If S t does not depend upon P t−1 , then P t−1 identifies the effect of C t−1 upon S t in Eq. (1). The next step in the gateway chain concerns the relationship between cannabis and hard drugs. The hypothesis of interest here is does exposure to cannabis induce a higher probability of subsequently using hard drugs? We denote H n = 1 if individual n used hard drugs after using cannabis and 0 otherwise. We express this as follows: H n(t+1) = λY n(t+1) + δS nt + ρ y D y + w n(t+1) (3) where Y is a vector of controls for hard drug use. Gateway Theory suggests δ>0. Since we suspect that E(wu) >0 unobserved heterogeneity is likely to bias upwards estimates of δ. Ideally, we seek exclusion restrictions for Y and X such as drug price data, which are not available in Israel. Instead, we suggest using as an instrument for S t in Eq. (3) its fitted value from IV estimation of Eq. (1). We refer to this as “domino” identification because the natural experiment is indirect. Clearly, such indirect identification weakens the power of tests on the value of δ. Random exposure to cigarettes at time t − 1 induces cannabis consumption at time t, which in turn, induces hard drug use at time t + 1. If β and δ are properly identified, and there is a causal gateway effect, we can expect a chain reaction where raising the price of cigarettes will reduce cigarette smoking, which will reduce the subsequent use of cannabis, which, in turn, will reduce the subsequent use of hard drugs. Note that Eqs. (1)–(3) control for cohort effects. This is possible because we use several surveys in Section 4 (y = year of survey − age). Had there been only one survey it would not have been possible to identify the separate effects of P t−1 and birth cohort upon drug consumption. 2.2. Recursive bivariate probit and two-stage procedures If u and v in Eqs. (1) and (2) happen to be bivariate normal with cov(u, v) = ρ then the model may be estimated by maximum likelihood as a recursive bivariate probit (RBP) model (Maddala, 1983, pp. 122–123, Greene, 2000, pp. 852–825). Note that in this case C in Eq. (1) does not have to be instrumented because, in contrast to the linear probability model, it is not estimated by least squares. However, Maddala points out that identification requires that X omit covariates in Z.IfX = Z the model is not identified, even parametrically. Maddala suggests that an alternative procedure is to specify prob (C ∗ > 0) in Eq. (1) instead of C, where C ∗ denotes the underlying latent variable that measures the propensity to smoke cigarettes. In this case, it may be shown that a two-stage procedure provides consistent estimates of the parameters in Eqs. (1) and (2). In the first stage, Eq. (2) is estimated by probit or logit, and in the second stage the predicted probability of C obtained from the first stage is used to replace C in Eq. (1). There are advantages and disadvantages to both procedures. The main disadvantage of RBP is that it may be sensitive to parametric assumptions about the unobserved heterogeneity.
M. Beenstock, G. Rahav / Journal of Health Economics 21 (2002) 679–698 683 If u and v do not happen to be bivariate normal then estimates of β in Eq. (1) will be biased and inconsistent (unless ρ = 0). This problem does not arise in the two-stage procedure because it does not require estimates of ρ. The disadvantage of the two-stage procedure is that although it is consistent, it is not efficient. By contrast, RBP estimates are consistent and efficient provided that u and v happen to be bivariate normal. In Section 4, we use both procedures. 1 However, the main results that we report are for two-stage logit (2SL), mainly because it is less parametric, and is therefore less sensitive to mis-specification error. A disadvantage of the two-stage procedure is that the standard errors of the parameters are difficult to calculate (Maddala, 1983, p. 247). In Section 4, we use a bootstrap procedure to calculate them. 2.3. Hazard analysis In this section, we focus on the timing of events rather than their sequencing. The specific questions we ask are whether earlier initiation of cigarette smoking causes earlier initiation of cannabis, and whether the latter causes earlier initiation of hard drugs. Elsewhere (Beenstock and Rahav, 2001), we suggest the use of long term survivor models to specify the initiation hazard since drug use is a minority activity. This is also the approach used by Douglas and Hariharan (1994). However, this approach is not practical in the current, more complicated, context in which our objective is to identify treatment effects. Instead, we model drug use initiation using Cox’s proportional hazards model (CPHM), which has the advantage of not necessarily implying that it is a matter of time before everyone uses drugs. 2 The counterpart of Eq. (1) for individual n’s hazard of using cannabis ( s ) is: λ s (t n ) = λ s0 (t n ) exp − (X n α + βA cn + γ y D yn ) (4) where A cn denotes the age at which individual n first smoked cigarettes, X and D are defined as in Eq. (1), and s0 is the “baseline” hazard. We denote the age of cannabis initiation by A s >A c by definition. According to Gateway Theory β > 0, since people who begin smoking later will initiate cannabis later. Unobserved heterogeneity is likely to generate positive covariance between A s and A c , which will bias upwards estimates of β. We suggest that instead of using A c in Eq. (4) it should be replaced by its expected value as determined from a CPHM for cigarette smoking. The specification of the hazard for cigarette smoking (λ c ) parallels Eq. (2) and is written as follows: λ c (t n ) = λ c0 (t n ) exp − (Z n φ + θ y D yn ) (5) where Z is defined as in Eq. (2). Eq. (5) implies that expected age at cigarette initiation is: E(A cn ) = exp[−Λ c (t) exp(Z n φ + θ y D yn )] (6) where Λ c denotes the integrated hazard function for smoking evaluated at the mean. We suggest a two-stage procedure in which Eq. (5) is estimated in the first stage and the solution 1 Evans and Ringel (1999) in a similar situation to ours, side-step these methodological issues. 2 Larson and Dinse (1985) propose a long term survivor model for CPHM which is used in Beenstock and Rahav (2001).
Page 1 and 2: Journal of Health Economics 21 (200
Page 3: M. Beenstock, G. Rahav / Journal of
Page 7 and 8: M. Beenstock, G. Rahav / Journal of
Page 19 and 20: Acknowledgements M. Beenstock, G. R

Testing Gateway Theory: do cigarette prices affect illicit drug use?

Create successful ePaper yourself

Delete template?

Save as template?