Pinar Karaca-Mandic and Kenneth Train, (2003), pp. 400 ... - Feweb

More documents

Recommendations

Info

Standard error correction in two-stage estimation with nested samples 401 an omitted product attribute), then some form of instrumental variables estimation is needed to account for this correlation. The first stage specifies price as a function of instruments using market-level data, and the second stage utilizes the predicted price or the residual in estimation of the customer-level model. Kuksov and Villas-Boas (2001) derive a similar formula for a related set-up. Their samples are not nested; instead, each unit of observation in the second stage appears repeatedly in all of the first-stage observations (i.e. each customer is observed in each of several time periods.) They allow correlation over first-stage observations in the unobservables associated with each second-stage unit (i.e. correlation for the same customer over time). In contrast, our formula uses independence over first-stage observations. We apply our formula to the Petrin–Train application of households’ choice of TV reception. They are concerned that the prices of the TV reception alternatives are correlated with the unobserved attributes of the alternatives. To correct for this endogeneity, they specify a firststage linear regression of prices on exogenous variables and instrumental variables using market level data. The second stage is a mixed logit model. The dependent variable is the TV reception choice of a customer in a given market, and the independent variables are market level prices, market level residuals estimated in the first stage and other observed attributes of the alternatives. 2. FRAMEWORK First stage. Estimated parameters ˆθ1 solve moment conditions ¯h( ˆθ1) = 0 where ¯h(θ1) = 1 N1 m hm(θ1, zm), N1 is the number of observations, and hm(θ1; zm) is a vector of moments for observation m which depends on data zm. Second stage. Estimated parameters ˆθ2 solve moment conditions ¯g( ˆθ2, ˆθ1) = 0 where ¯g(θ2, ˆθ1) = 1 N2 n gn(θ2, ˆθ1; xn), N2 is the number of observations, and gn(θ2, ˆθ1; xn) is the moment for observation n based on its data and the first-stage estimates. Each n in the second stage corresponds to an m in the first stage (e.g. n indexes customers, m indexes markets, and each customer buys in one market). Therefore, there are one or more observations in the second-stage sample from each group in the first-stage sample. Also, xn in the second stage can include one or more elements of zm for the m corresponding to n. Take a Taylor expansion of both the first- and second-stage moment conditions around the true parameters θ ∗ 1 and θ ∗ 2 (assuming that the moment conditions are bounded and differentiable around the true parameters) and evaluate it asymptotically. For the first stage: and for the second stage: where c○ Royal Economic Society 2003 0 a = ¯h(θ ∗ 1 ) − A( ˆθ1 − θ ∗ 1 ), 0 a = ¯g(θ ∗ 2 , θ ∗ 1 ) − B( ˆθ1 − θ ∗ 1 ) − C( ˆθ2 − θ ∗ 2 ), A = −plim∇θ1 ¯h(θ ∗ 1 ) B = −plim∇θ1 ¯g(θ ∗ 2 , θ ∗ 1 ) C = −plim∇θ2 ¯g(θ ∗ 2 , θ ∗ 1 ),
402 Pinar Karaca-Mandic and Kenneth Train and where a = denotes equality up to asymptotically negligible remainder terms. 1 Then and ( ˆθ1 − θ ∗ 1 ) a = A −1 ¯h(θ ∗ 1 1 ) = A−1 N1 n m hm(θ ∗ 1 , zm), ( ˆθ2 − θ ∗ 2 ) a = C −1 ( ¯g(θ ∗ 2 , θ ∗ 1 ) − B( ˆθ1 − θ ∗ 1 )) a = C −1 1 gn(θ N2 ∗ 2 , θ ∗ 1 ; xn) −1 1 − B A hm(θ N1 ∗ 1 , zm) . (1) Given that the N2 second-stage observations can be grouped into N1 first-stage observations, we can re-write N2 N1 N gn(θ2, θ1; xn) = m gℓm(θ2, θ1; xℓm), n=1 m=1 ℓ=1 where N m is the number of observations among the N2 second-stage observations that correspond to observation m of the first stage (e.g. number of customers in market m) and gℓm is the second-stage moment for the ℓth second-stage observation that corresponds to the mth first-stage observation. Then equation (1) becomes ⎡ ⎤ ( ˆθ2 − θ ∗ 2 ) a = C −1 ⎣ 1 N2 N1 N m m=1 ℓ=1 or, expressed more conveniently, ( ˆθ2 − θ ∗ 2 ) a = C −1 ⎡ ⎣ 1 N1 N N1 m N1 N2 m=1 ℓ=1 gℓm(θ ∗ 2 , θ ∗ 1 ; xℓm) −1 1 − B A N1 gℓm(θ ∗ 2 , θ ∗ 1 ; xℓm) −1 1 − B A N1 m N1 hm(θ m=1 ∗ 1 N1 hm(θ m=1 ∗ 1 , zm) ⎦ Define ˜gm = N m N1 ℓ=1 C−1 N2 gℓm(θ ∗ 2 , θ ∗ 1 ; xℓm) and ˜hm = C−1 B A−1hm(θ ∗ 1 , zm). Then: N1( ˆθ2 − θ ∗ 2 ) a = 1 √ N1 Finally, letting Xm = ˜gm − ˜hm, we have N1 m=1 N1( ˆθ2 − θ ∗ 2 ) a = 1 √ N1 ( ˜gm − ˜hm). N1 m=1 Xm. ⎤ , zm) ⎦ . Asymptotically, √ N1( ˆθ2 −θ ∗ 2 ) a a ∼ N(0, Var(Xm)), such that ˆθ2 ∼ N(θ ∗ 2 , Var(Xm)/N1). Var(Xm) is approximated by its sample analog, 1 N1 m ˆXm ˆX ′ m , with ˆXm calculated at the estimated parameters (and A, B, and C replaced by sample analogs). This is a re-expression of the Murphy 1 A and C are square matrices in this notation, such that the parameters are exactly identified by the moment conditions. The notation can of course be generalized. c○ Royal Economic Society 2003
Page 1: Econometrics Journal (2003), volume
Page 5 and 6: 404 Pinar Karaca-Mandic and Kenneth
Page 7: 406 Pinar Karaca-Mandic and Kenneth

Pinar Karaca-Mandic and Kenneth Train, (2003), pp. 400 ... - Feweb

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?