12.01.2015 Views

Econometric Analysis of Cross Section and Panel Data - Free

Econometric Analysis of Cross Section and Panel Data - Free

Econometric Analysis of Cross Section and Panel Data - Free

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Duration <strong>Analysis</strong> 707<br />

with covariates. In addition to allowing us to treat grouped durations, the panel data<br />

approach has at least two additional advantages. First, in a proportional hazard<br />

specification, it leads to easy methods for estimating flexible hazard functions. Second,<br />

because <strong>of</strong> the sequential nature <strong>of</strong> the data, time-varying covariates are easily<br />

introduced.<br />

We assume flow sampling so that we do not have to address the sample selection<br />

problem that arises with stock sampling. We divide the time line into M þ 1 intervals,<br />

½0; a 1 Þ; ½a 1 ; a 2 Þ; ...; ½a M 1 ; a M Þ; ½a M ; yÞ, where the a m are known constants. For<br />

example, we might have a 1 ¼ 1; a 2 ¼ 2; a 3 ¼ 3, <strong>and</strong> so on, but unequally spaced<br />

intervals are allowed. The last interval, ½a M ; yÞ, is chosen so that any duration falling<br />

into it is censored at a M : no observed durations are greater than a M . For a r<strong>and</strong>om<br />

draw from the population, let c m be a binary censoring indicator equal to unity<br />

if the duration is censored in interval m, <strong>and</strong> zero otherwise. Notice that c m ¼ 1<br />

implies c mþ1 ¼ 1: if the duration was censored in interval m, it is still censored in interval<br />

m þ 1. Because durations lasting into the last interval are censored, c Mþ1 1 1.<br />

Similarly, y m is a binary indicator equal to unity if the duration ends in the mth interval<br />

<strong>and</strong> zero otherwise. Thus, y mþ1 ¼ 1ify m ¼ 1. If the duration is censored in<br />

interval m ðc m ¼ 1Þ, we set y m 1 1 by convention.<br />

As in <strong>Section</strong> 20.3, we allow individuals to enter the initial state at di¤erent calendar<br />

times. In order to keep the notation simple, we do not explicitly show the conditioning<br />

on these starting times, as the starting times play no role under flow<br />

sampling when we assume that, conditional on the covariates, the starting times are<br />

independent <strong>of</strong> the duration <strong>and</strong> any unobserved heterogeneity. If necessary, startingtime<br />

dummies can be included in the covariates.<br />

For each person i, we observe ðy i1 ; c i1 Þ; ...; ðy iM ; c iM Þ, which is a balanced panel<br />

data set. To avoid confusion with our notation for a duration (T for the r<strong>and</strong>om<br />

variable, t for a particular outcome on T ), we use m to index the time intervals. The<br />

string <strong>of</strong> binary indicators for any individual is not unrestricted: we must observe a<br />

string <strong>of</strong> zeros followed by a string <strong>of</strong> ones. The important information is the interval<br />

in which y im becomes unity for the first time, <strong>and</strong> whether that represents a true exit<br />

from the initial state or censoring.<br />

20.4.1 Time-Invariant Covariates<br />

With time-invariant covariates, each r<strong>and</strong>om draw from the population consists <strong>of</strong><br />

information on fðy 1 ; c 1 Þ; ...; ðy M ; c M Þ; xg. We assume that a parametric hazard<br />

function is specified as lðt; x; yÞ, where y is the vector <strong>of</strong> unknown parameters. Let T<br />

denote the time until exit from the initial state. While we do not fully observe T,<br />

either we know which interval it falls into, or we know whether it was censored in a

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!