13.07.2015 Views

1. Introduction - Econometrics at Illinois - University of Illinois at ...

1. Introduction - Econometrics at Illinois - University of Illinois at ...

1. Introduction - Econometrics at Illinois - University of Illinois at ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

INFERENCE ON THE QUANTILE REGRESSION PROCESSROGER KOENKER AND ZHIJIE XIAOUNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGNAbstract. Tests based on the quantile regression process can be formul<strong>at</strong>ed likethe classical Kolmogorov-Smirnov and Cramer-von-Mises tests <strong>of</strong> goodness-<strong>of</strong>-temploying the theory <strong>of</strong> Bessel processes as in Kiefer (1959). However, it is frequentlydesirable to formul<strong>at</strong>e hypotheses involving unknown nuisance parameters,thereby jeopardizing the distribution free character <strong>of</strong> these tests. We characterizethis situ<strong>at</strong>ion as \the Durbin problem" since it was posed in Durbin (1973), forparametric empirical processes.In this paper we consider an approach to the Durbin problem involving a martingaletransform<strong>at</strong>ion <strong>of</strong> the parametric empirical process suggested by Khmaladze(1981) and show th<strong>at</strong>itcanbeadaptedtoawidevariety <strong>of</strong> inference problemsinvolving the quantile regression process. In particular, we suggest new tests <strong>of</strong>the loc<strong>at</strong>ion shift and loc<strong>at</strong>ion-scale shift models th<strong>at</strong> underlie much <strong>of</strong> classicaleconometric inference.The methods are illustr<strong>at</strong>ed with a reanalysis <strong>of</strong> d<strong>at</strong>a on unemployment dur<strong>at</strong>ionsfrom the Pennsylvania Reemployment Bonus Experiments. The Pennsylvania experiments,conducted in 1988-89, were designed to test the ecacy <strong>of</strong> cash bonusespaid for early reemployment in shortening the dur<strong>at</strong>ion <strong>of</strong> insured unemploymentspells.<strong>1.</strong> <strong>Introduction</strong>Quantile regression, as introduced by Koenker and Bassett (1978), is graduallyevolving into a comprehensive approach to the st<strong>at</strong>istical analysis <strong>of</strong> linear and nonlinearresponse models for conditional quantile functions. Just as classical linearregression methods based on minimizing sums <strong>of</strong> squared residuals enable one to estim<strong>at</strong>emodels for conditional mean functions, quantile regression methods based onminimizing asymmetrically weighted absolute residuals oer a mechanism for estim<strong>at</strong>ingmodels for the conditional median function, and the full range <strong>of</strong> other conditionalquantile functions. By supplementing least squares estim<strong>at</strong>ion <strong>of</strong> conditional meanVersion: February 6, 200<strong>1.</strong> The authors wish to express their appreci<strong>at</strong>ion to Jushan Baiwhose work provided a vital initial impetus for this research byintroducing them to the Khmaladzeapproach. The authors also wish to thank Steve Portnoy, Est<strong>at</strong>e Khmaladze, Hira Koul, Jim Heckman,Hannu Oja and seminar participants <strong>at</strong> the U. <strong>of</strong> <strong>Illinois</strong>, U. <strong>of</strong> Chicago, CREST-INSEE, andHarvard-MIT and two referees for helpful comments. The research was partially supported by NSFgrant SBR-9911184.1


2 Inference on the Quantile Regression Processfunctions with techniques for estim<strong>at</strong>ing a full family <strong>of</strong> conditional quantile functions,quantile regression is capable <strong>of</strong> providing a much more complete st<strong>at</strong>isticalanalysis <strong>of</strong> the stochastic rel<strong>at</strong>ionships among random variables.There is already a well-developed theory <strong>of</strong> asymptotic inference for many importantaspects <strong>of</strong> quantile regression. Rank-based inference based on the approach<strong>of</strong> Gutenbrunner, Jureckova, Koenker, and Portnoy (1993) appears particularly <strong>at</strong>tractivefor a wide variety <strong>of</strong>quantile regression inference problems including theconstruction <strong>of</strong> condence intervals for individual quantile regression parameter estim<strong>at</strong>es.There has also been considerable <strong>at</strong>tention devoted to various resamplingstr<strong>at</strong>egies. See e.g. Hahn (1995), Horowitz (1998), Bilias, Chen, and Ying (1999), Heand Hu (1999). In Koenker and Machado (1999) some initial steps have beentakentoward a theory <strong>of</strong> inference based on the entire quantile regression process. Thesesteps have claried the close tie to classical Kolmogorov-Smirnov goodness <strong>of</strong> t results,and rel<strong>at</strong>ed liter<strong>at</strong>ure p-sample goodness-<strong>of</strong>-t tests based on Bessel processesiniti<strong>at</strong>ed by Kiefer (1959).This paper describes some further steps in this direction. These new steps dependcrucially on an ingenious suggestion by Khmaladze (1981) for dealing with tests <strong>of</strong>composite null hypotheses based on empirical processes. Khmaladze's results werer<strong>at</strong>her slow to percol<strong>at</strong>e into st<strong>at</strong>istics generally, but the approach has recently playedan important roleinwork on specic<strong>at</strong>ion tests <strong>of</strong> the form <strong>of</strong> the regression functionby Stute, Thies, and Zhu (1998) and Koul and Stute (1999). In econometrics,Bai (1998) was apparently the rst to recognize the potential importance <strong>of</strong> thesemethods, considering tests <strong>of</strong> model specic<strong>at</strong>ion based on the empirical process inparametric time series models.In contrast to the prior liter<strong>at</strong>ure, which has focused on tests based directly onthe empirical process, we consider tests based on the quantile regression process. Wefocus primarily on tests <strong>of</strong> the hypothesis th<strong>at</strong> the regression eect <strong>of</strong> covari<strong>at</strong>es exertseither a pure loc<strong>at</strong>ion shift, or a loc<strong>at</strong>ion-scale shift <strong>of</strong> the conditional distribution <strong>of</strong>the response, versus altern<strong>at</strong>ives under which covari<strong>at</strong>es may alter the shape <strong>of</strong> theconditional distribution as well as its loc<strong>at</strong>ion and scale. These tests may beviewedas semiparametric in the sense th<strong>at</strong> we need not specify a parametric form for theerror distribution.Khmaladze's martingale transform<strong>at</strong>ion provides a general str<strong>at</strong>egy for purging theeect <strong>of</strong> estim<strong>at</strong>ed nuisance parameters from the rst order asymptotic represent<strong>at</strong>ion<strong>of</strong> the empirical process and rel<strong>at</strong>ed processes and thereby restoring the feasibility <strong>of</strong>\asymptotically distribution free" tests. We will argue th<strong>at</strong> the approach is especially<strong>at</strong>tractive in the quantile regression setting and is capable <strong>of</strong> gre<strong>at</strong>ly expanding thescope <strong>of</strong> inferential methods described in earlier work. Although there are close formalconnections between the theory <strong>of</strong> tests based on the quantile regression processand the existing liter<strong>at</strong>ure there are some signicant dierences as well. Some <strong>of</strong> thesedierences are immedi<strong>at</strong>ely apparent from the classical Bahadur-Kiefer theorems and


Roger Koenker and Zhijie Xiao 3the liter<strong>at</strong>ure on condence bands for QQ-plots, e.g. Nair (1982). But the principaldierences emerge from the regression specic<strong>at</strong>ion <strong>of</strong> the conditional quantile functions.In particular, we introduce tests <strong>of</strong> the classical regression specic<strong>at</strong>ions th<strong>at</strong>the covari<strong>at</strong>es aect only the loc<strong>at</strong>ion, or the loc<strong>at</strong>ion and scale, <strong>of</strong> the conditionaldistribution <strong>of</strong> the response variable without requiring a parametric specic<strong>at</strong>ion <strong>of</strong>the innov<strong>at</strong>ion distribution.An altern<strong>at</strong>ive general approach to tests based on empirical processes with estim<strong>at</strong>edparameters entails resampling the test st<strong>at</strong>istic under conditions consistentwith the null hypothesis to obtain critical values. The origins <strong>of</strong> this approach maybe traced to Bickel (1969). Romano (1988) describes implement<strong>at</strong>ions for a variety<strong>of</strong> problems. Andrews (1997) introduces a conditional Kolmogorov-Smirnov test formodel specic<strong>at</strong>ion in parametric settings and uses resampling to obtain critical values,and Abadie (2000) develops rel<strong>at</strong>ed methods for investig<strong>at</strong>ing tre<strong>at</strong>ment eectsin the two sample setting. We hope to explore this approach to quantile regressioninference in subsequent work.The remainder <strong>of</strong> the paper is organized as follows. In the next section, we introducea general paradigm for quantile regression inference focusing initially on thecanonical two-sample tre<strong>at</strong>ment-control model. In Section 3, we briey introduceKhmaladze's approach to handling empirical processes with estim<strong>at</strong>ed nuisance parameters.Section 4 extends this approach to general problems <strong>of</strong> inference basedon the quantile regression process. Section 5 tre<strong>at</strong>s some practical problems <strong>of</strong> implementingthe tests. Section 6 describes an empirical applic<strong>at</strong>ion to the analysis <strong>of</strong>unemployment dur<strong>at</strong>ions. Section 7 contains some concluding remarks.2. The Inference ParadigmTo motiv<strong>at</strong>e our approach it is helpful to begin by reconsidering the classical twosampletre<strong>at</strong>ment-control problem. In the simplest possible setting we can imaginea random sample <strong>of</strong> size, n, drawn from a homogeneous popul<strong>at</strong>ion and randomizedinto n 1 tre<strong>at</strong>ment observ<strong>at</strong>ions, and n 0 control observ<strong>at</strong>ions. We observe a responsevariable, Y i , and are interested in evalu<strong>at</strong>ing the eect <strong>of</strong> the tre<strong>at</strong>ment on this response.In a typical clinical trial applic<strong>at</strong>ion, for example, the tre<strong>at</strong>mentwould be some form<strong>of</strong> medical procedure, and Y i ,might be log survival time. In our applic<strong>at</strong>ion discussedin Section 6, the tre<strong>at</strong>ment is an oer <strong>of</strong> a cash bonus for early exit from a spell <strong>of</strong>unemployment, and Y i is the logarithm <strong>of</strong> individual i's unemployment dur<strong>at</strong>ion. Inthe rst instance we might be s<strong>at</strong>ised to know simply the mean tre<strong>at</strong>ment eect, th<strong>at</strong>is, the dierence in means for the two groups. This we could evalu<strong>at</strong>e by \runningthe regression" <strong>of</strong> the observed y i 's on an indic<strong>at</strong>or variable: x i = 1, if subject iwas tre<strong>at</strong>ed, x i = 0, if subject i was a control. Of course this regression wouldpresume, implicitly, th<strong>at</strong> the variability <strong>of</strong>thetwo subsamples was the same; thisobserv<strong>at</strong>ion opens the door to the possibility th<strong>at</strong> the tre<strong>at</strong>ment alters other fe<strong>at</strong>ures


4 Inference on the Quantile Regression Process<strong>of</strong> the response distribution as well. Although we are accustomed to thinking aboutregression models in which thecovari<strong>at</strong>es aect only the loc<strong>at</strong>ion <strong>of</strong> the conditionaldistribution <strong>of</strong> the response { this is the force <strong>of</strong> the iid error assumption { there isno compelling reason to believe th<strong>at</strong>covari<strong>at</strong>es must only oper<strong>at</strong>e in this restrictivefashion.2.<strong>1.</strong> Quantile Tre<strong>at</strong>ment Eects. Lehmann (1974) introduced the following generalformul<strong>at</strong>ion <strong>of</strong> the two sample tre<strong>at</strong>ment eect,\Suppose the tre<strong>at</strong>ment adds the amount (x) when the response <strong>of</strong> theuntre<strong>at</strong>ed subject would be x. Then the distribution G <strong>of</strong> the tre<strong>at</strong>mentresponses is th<strong>at</strong> <strong>of</strong> the random variable X +(X) where X is distributedaccording to F ."Doksum (1974) provides a detailed axiom<strong>at</strong>ic analysis <strong>of</strong> this formul<strong>at</strong>ion, showingth<strong>at</strong> if we dene(x) as the \horizontal distance" between F and G <strong>at</strong> x, soF (x) =G(x +(x))then (x) is uniquely dened and can be expressed as(x) =G ,1 (F (x)) , x:Changing variables, so = F (x) we obtain wh<strong>at</strong> we will call the quantile tre<strong>at</strong>menteect,() =(F ,1 ()) = G ,1 () , F ,1 ():In the two sample setting this quantity is n<strong>at</strong>urally estimable by^() =G ,1n 1() , F ,1n 0()where G n1 ;F n0 denote the empirical distribution functions <strong>of</strong> the tre<strong>at</strong>ment and controlobserv<strong>at</strong>ions respectively, andF n ,1 = inffxjF n (x) g, as usual. Since wecannot observe subjects in both the tre<strong>at</strong>ed and control st<strong>at</strong>es { and this pl<strong>at</strong>itudemay be regarded as the fundamental \uncertainty principle" underlying the \causaleects" liter<strong>at</strong>ure { it seems reasonable to regard () as a complete description <strong>of</strong>the tre<strong>at</strong>ment eect.Of course, there is no way <strong>of</strong> really knowing whether the tre<strong>at</strong>ment oper<strong>at</strong>es inthe way prescribed by Lehmann. In fact, the tre<strong>at</strong>ment maymake otherwise weaksubjects especially robust, and turn the strong to jello. All we can observe fromthe experimental evidence is the dierence between the two marginal survival distributions,so it is n<strong>at</strong>ural to associ<strong>at</strong>e the tre<strong>at</strong>ment eect with this dierence. Thequantile tre<strong>at</strong>ment eect provides the unexpurg<strong>at</strong>ed version. Of course if there aresystem<strong>at</strong>ic dierences in tre<strong>at</strong>ment response associ<strong>at</strong>ed with observable covari<strong>at</strong>es,then these eects can be estim<strong>at</strong>ed via interactions with the tre<strong>at</strong>ment indic<strong>at</strong>or.


Roger Koenker and Zhijie Xiao 5We can interpret the foregoing discussion in terms <strong>of</strong> the \potential outcomes"<strong>of</strong> the causal eects liter<strong>at</strong>ure. The Lehmann formul<strong>at</strong>ion essentially assumes th<strong>at</strong>the ranks <strong>of</strong> the control observ<strong>at</strong>ions would be preserved were they to be tre<strong>at</strong>ed.Heckman, Smith, and Clements (1997) consider a model which allows a specieddegree <strong>of</strong> \slippage" in the quantile ranks <strong>of</strong> the two distributions in the context <strong>of</strong> abounds analysis. Abadie, Angrist, and Imbens (1999) have suggested an IV estim<strong>at</strong>orfor quantile tre<strong>at</strong>ment eects when the tre<strong>at</strong>ment is endogonous. Oja (1981) considersorderings <strong>of</strong> distributions based on loc<strong>at</strong>ion, scale, skewness, kurtosis, etc., based onthe function (x).Of course, it is possible th<strong>at</strong> the two distributions dier only by a loc<strong>at</strong>ion shift, so() = 0 , or th<strong>at</strong> they dier by a scale shift so () = 1 F ,1 () or th<strong>at</strong> they dierby a loc<strong>at</strong>ion and scale shift so () = 0 + 1 F ,1 (): Indeed, most <strong>of</strong> the regressionliter<strong>at</strong>ure deals with just such models. These hypotheses are all nicely nested withinLehmann's general framework. And yet, as we shall see, testing them against thegeneral altern<strong>at</strong>ives represented by the Lehmann-Doksum quantile tre<strong>at</strong>ment eectposes some challenges. In the simplest context <strong>of</strong> the two sample tre<strong>at</strong>ment-controlmodel our tests may be viewed as quantile version <strong>of</strong> the conventional two-sampleKolmogorov-Smirnov testinwhich the null species th<strong>at</strong> the two distributions dierby the constant, 0 , in the case <strong>of</strong> the loc<strong>at</strong>ion shift, or by 0 + 1 F ,1 () for someunspecied distribution, F , in the case <strong>of</strong> the loc<strong>at</strong>ion-scale shift. The presenceunder the null <strong>of</strong> the unspecied parameters, 0 and 1 , or more precisely the need toestim<strong>at</strong>e them, disrupts the elegant asymptotically distribution free n<strong>at</strong>ure <strong>of</strong> thesetests and leads us to the Khmaladze transform<strong>at</strong>ion.2.2. Inference on the Quantile Regression Process. In the two-sample tre<strong>at</strong>mentcontrolmodel there are a multitude <strong>of</strong> tests designed to answer the question: \Isthe tre<strong>at</strong>ment eect signicant." The most familiar <strong>of</strong> these, like the two-sampleStudent-t and Mann-Whitney-Wilcoxon tests are designed to reveal loc<strong>at</strong>ion shift altern<strong>at</strong>ives.Others are designed for scale shift altern<strong>at</strong>ives. Still others, like the twosample Kolmogorov-Smirnov test, are intended to encompass omnibus non-parametricaltern<strong>at</strong>ives. When the non-parametric null is posed in a form free <strong>of</strong> nuisance parameterswe have an elegant distribution-free theory for a variety <strong>of</strong> tests, includingthe Kolmogorov-Smirnov test th<strong>at</strong> are based on the empirical distribution function.Non-parametric testing in the presence <strong>of</strong> nuisance parameters under the null, however,poses some new problems. Suppose, for example, th<strong>at</strong> we wishtotestthehypothesis th<strong>at</strong> the response distribution under the tre<strong>at</strong>ment, G, diers from thecontrol distribution, F ,by a pure loc<strong>at</strong>ion shift, th<strong>at</strong> is for all 2 [0; 1],G ,1 () =F ,1 ()+ 0for some real 0 ,orth<strong>at</strong>theydierby a loc<strong>at</strong>ion-scale shift, so,G ,1 () = 1 F ,1 ()+ 0 :


6 Inference on the Quantile Regression ProcessIn such cases we can easily estim<strong>at</strong>e the nuisance parameters, 0 ; 1 , but the introduction<strong>of</strong> the estim<strong>at</strong>ed parameters into the asymptotic theory <strong>of</strong> the empirical processdestroys the distribution free character <strong>of</strong> the resulting Kolmogorov-Smirnov test.Analogous problems arise in the theory <strong>of</strong> the one sample Kolmogorov-Smirnov testwhen there are estim<strong>at</strong>ed parameters under the null, and have been considered byDurbin (1973) and others.In the general linear quantile regression model specied as,Q yjx (jx) =x > ()such modelsmay be represented by the linear hypothesis:() = + F ,10 ()for and in R p and F ,10a univari<strong>at</strong>e quantile function. Thus, all p coordin<strong>at</strong>es<strong>of</strong> the quantile regression coecient vector are required to be ane functions <strong>of</strong> thesame univari<strong>at</strong>e quantile function, F ,10.Such models may be viewed as arising fromlinearly heteroscedastic modely i = x > i +(x > i )u iwith the fu i g iid from the df F 0 . They can be estim<strong>at</strong>ed by solving the linear programmingproblem,minb2R pnXi=1 (y i , x > i b)where (u) =u( , I(u


Roger Koenker and Zhijie Xiao 7It is well known th<strong>at</strong> v n (t) converges weakly to a Brownian bridge process, v 0 (t), th<strong>at</strong>is a mean-zero Gaussian process with covariance functionEv 0 (t)v 0 (s) =t ^ s , st;and thus the distribution <strong>of</strong> K n and rel<strong>at</strong>ed functionals follows from the observ<strong>at</strong>ion<strong>of</strong> Doob (1949) and its subsequent renements.3.<strong>1.</strong> The Durbin Problem. It is rare in practice, however, th<strong>at</strong> we are willing tospecify F 0 completely. More commonly, ourhypothesis places F in some parametricfamily F with 2 R p .For example, we may wish to test \normality", claimingth<strong>at</strong> Y has distribution F 0 (y) =((y , 0 )= 0 ), but 0 =( 0 ; 0 ) is unknown. Weare thus led to consider, following Durbin (1973), the parametric empirical process,U n (y) = p n(F n (y) , F^ n(y)):Again changing variables, so y ! F ,1 0(t); we may equivalently consideru n (t) = p n(G n (t) , G^ n(t))where G n (t) =F n (F ,1 0(t)) and G^ n(t) =F^ n(F ,1 0(t)) so G 0 (t) =t. Under mildconditions on the sequence f^ n g we have the linear (Bahadur) represent<strong>at</strong>ion,p 1n(^n , 0 )=Zh 0 (s)dv n (s)+o p (1):0So provided the mapping ! G has a Frechet deriv<strong>at</strong>ive, g = g 0 , th<strong>at</strong> is,sup t jG +h (t) , G (t) , h > g(t)j = o(jjhjj) ash ! 0, see van der Vaart (1998, p.278),we maywrite,and thus obtain, with r n (t) =o p (1);(3.1)G^ n(t) =t +(^ n , 0 ) > g(t)+o p (1);^v n (t) = p n(G n (t) , t , (G^ n(t) , t))Z 1= v n (t) , g(t) > h 0 (s)dv n (s)+r n (t);which converges weakly to the Gaussian process,Z 1u 0 (t) =v 0 (t) , g(t) > h 0 (s)dv 0 (s):R The necessity <strong>of</strong> estim<strong>at</strong>ing 0 introduces the drift component g(t) > 1h 00(s)dv 0 (s):Instead <strong>of</strong> the simple Brownian bridge process, v 0 (t), we obtain a more complic<strong>at</strong>edGaussian process with covariance functionEu 0 (t)u 0 (s) =s ^ t , ts , g(t) > H 0 (s) , g(s) > H 0 (t)+g(s) > J 0 g(t)00


8 Inference on the Quantile Regression ProcessR 1R R twhere H 0 (t) = h 100(s)ds and J 0 = h 0 00(t)h 0 (s) > dtds: When ^ n is the mle, soh 0 (s) =,(Er ) ,1 (F ,1 (s)) with = r log f, thecovariance function simpliesnicely toEu 0 (t)u 0 (s) =s ^ t , ts , g(s) > I 0 g(t)where I 0 denotes Fisher's inform<strong>at</strong>ion m<strong>at</strong>rix. See Durbin (1973) and Shorack andWellner (1986) for further details on this case.The practical consequence <strong>of</strong> the drift term involving the function g(t) istoinvalid<strong>at</strong>ethe distribution-free character <strong>of</strong> the original test. Tests based on the parametricempirical process u n (t) require special consider<strong>at</strong>ion <strong>of</strong> the process u 0 (t) and its dependenceon F in each particular case. Koul (1992) and Shorack and Wellner (1986)discuss several leading examples. Durbin (1973) describes a general numerical approachbased on Fourier inversion, but also expresses doubts about feasibility <strong>of</strong> themethod when the parametric dimension <strong>of</strong> exceeds one. Although the problem <strong>of</strong>nding a viable, general approach to inference based on the parametric empiricalprocess had been addressed by several previous authors, notably Darling (1955), andKac, Kiefer, and Wolfowitz (1955), we will, in the spirit <strong>of</strong> Stigler's (1980) law <strong>of</strong>eponymy, refer to this as \the Durbin problem."3.2. Martingales and the Doob-Meyer Decomposition. Khmaladze's generalapproach to the Durbin problem can be motiv<strong>at</strong>ed as a n<strong>at</strong>ural elabor<strong>at</strong>ion <strong>of</strong> theDoob-Meyer decomposition for the parametric empirical process. Recall th<strong>at</strong> a stochasticprocess x = fx(t) :t 0g th<strong>at</strong> is (i) right continuous with left limits; (ii)integrable sup 0t


Roger Koenker and Zhijie Xiao 9Let X 1 ;::: ;X n be iid from F 0 ,soY i = F 0 (X i ); i =1;::: ;n are iid uniform,U[0; 1]: The empirical distribution functionG n (t) =F n (F ,10(t)) = n ,1 nXi=1I(Y i t):viewed as a process, is a submartingale. We have an associ<strong>at</strong>ed ltr<strong>at</strong>ion F Gn =fF Gnt :0 t 1g and the order st<strong>at</strong>istics Y (1) ;::: ;Y (n) are Markov times withrespect to F Gn ,th<strong>at</strong>isfY (i) tg = fF n (t) i=ng 2F Gnt .The process G n (t) isMarkov; Khmaladze notes th<strong>at</strong> for t 0,with G n (0)=0,thus(3.2)nG n (t) = n[G n (t +t) , G n (t)]This suggests the decompositionBinomial(n(1 , G n (t)); t=(1 , t))E(G n (t)jF GntG n (t) =Z t0)= 1 , G n(t)t:1 , t1 , G n (s)ds + m n (t):1 , sTh<strong>at</strong> m n (t) is a martingale then follows from the fact th<strong>at</strong>, from (3.2),E(m n (t)jF Gns )=m n (s)and integrability <strong>of</strong>m n (t) follows from the inequalityZ t01 , G n (s)ds ,log(1 , Y (n) );1 , swhich implies a nite mean for the compens<strong>at</strong>or, or predictable component. Substitutingfor G n (t) in(3.2)wehave the classical Doob-Meyer decomposition <strong>of</strong> theempirical process v nZ tv n (s)v n (t) =w n (t) ,01 , s dswhere v n (t) = p n(G n (t) , t) and the normalized process w n (t) = p nm n (t) convergesweakly to a standard Brownian motion process, w 0 (t); by the argument <strong>of</strong>Khmaladze(1981, x2.6).3.3. The Parametric Empirical Process. To extend this approach to the generalparametric empirical process, we nowletg(t) =(t; g(t) > ) > =(t; g 1 (t);:::g m (t)) >be a (m +1)-vector <strong>of</strong> real-valued functions on [0; 1]. Suppose th<strong>at</strong> the functions_g(t) =dg(t)=dt are linearly independent in a neighborhood <strong>of</strong> 1 soC(t) Z 1t_g(s)_g(s) > ds


10 Inference on the Quantile Regression Processis non-singular, and consider the transform<strong>at</strong>ion(3.3)w n (t) =v n (t) ,Z t0_g(s) > C ,1 (s)Z 1s_g(r)dv n (r)ds:Here, w n (t) clearly depends upon the choice <strong>of</strong> g, and therefore diers from w n (t)dened above. But the abuse <strong>of</strong> not<strong>at</strong>ion R maybe justied by noting th<strong>at</strong> in the special1case g(t) =t,wehave C(s) =1,s,and _gdv s n(r) =,v n (s) yielding the Doob-Meyerdecomposition (3.2) as a special case. In the general case, the transform<strong>at</strong>ionQ g '(t) ='(t) ,Z t0_g(s) > C ,1 (s)Z 1s_g(r)d'(r)dsmay be recognized as the residual from the prediction <strong>of</strong> '(t) based on the recursiveleast squares estim<strong>at</strong>e using inform<strong>at</strong>ion from (t; 1]: For functions in the span <strong>of</strong> g,the prediction is exact, th<strong>at</strong> is, Q g g =0:Now returning to the represent<strong>at</strong>ion <strong>of</strong> the parametric empirical process, ^v n (t),given in (3.1), using Khamaladze (1981, x4.2), we have,~v n (t) =Q g^v n (t)Z 1= Q g (v n (t) , g(t) > h 0 (s)dv n (s)+r n (t))= Q g (v n (t)+r n (t))= w 0 (t)+o p (1):The transform<strong>at</strong>ion <strong>of</strong> the parametric empirical process annihil<strong>at</strong>es the g component<strong>of</strong> the represent<strong>at</strong>ion and in so doing restores the feasibility <strong>of</strong> asymptotically distributionfree tests based on the transformed process ~v n (t).3.4. The Parametric Empirical Quantile Process. Wh<strong>at</strong> can be done for testsbased on the parametric empirical process can also be adapted for tests based on theparametric empirical quantile process. In some ways the quantile domain is actuallymore convenient. Suppose fy 1 ;::: ;y n g constitute a random sample on Y withdistribution function F Y : Consider testing the hypothesis, F Y (y) =F 0 ((y , 0 )= 0 );so,() F ,1 () =Y 0 + 0 F ,1 (): 0Given the empirical quantile process^() = inffa 2 RjnXi=10 (y i , a) = min!gand known parameters 0 =( 0 ; 0 ) tests may be based onv n () = p n' 0 ()(^() , ())= 0 ) v 0 ()where ' 0 () =f 0 (F ,10()) and v 0 () is the Brownian bridge process.


Roger Koenker and Zhijie Xiao 11To test our hypothesis when is unknown, set (t) =(1;F ,10(t)) > and for anestim<strong>at</strong>or ^ n s<strong>at</strong>isfying,p n(^n , 0 )=set ~(t) =^ +^ 0 F ,10(t) =^ > n(t). Then(3.4)(3.5)Z 1^v n (t) = p n' 0 (t)(^(t) , ~(t)= 00h 0 (s)dv n (s)+o p (1)= p n' 0 (t)(^(t) , (t) , (~(t) , (t)))= 0= v n (t) , p n' 0 (t)(^ , 0 ) > (t)= 0Z 1= v n (t) , ' 0 (t)(t) > h 0 (s)dv n (s)+o p (1)Thus, if we take g(t) =(t; (t) > ' 0 (t)) > ,we obtain,_g(t) =(1; _ f=f;1+F ,10(t) _ f=f) >where _ f=f is evalu<strong>at</strong>ed <strong>at</strong> F ,10(t), so for example in the Gaussian case,_g(t) =(1; , ,1 (t); 1 , ,1 (t) 2 ) > :Given the represent<strong>at</strong>ion (3.4) and the fact th<strong>at</strong> (t) lies in the linear span <strong>of</strong> g, wemay again apply Khmaladze's martingale transform<strong>at</strong>ion to obtain,0~v n (t) =Q g^v n (t);which can then be shown to converge to the standard Brownian motion process. Aswe have suggested in Section 2, we would like to consider a two sampleversion <strong>of</strong> theforegoing problem in which weleave the precise functional form <strong>of</strong> the distributionF , and therefore the form <strong>of</strong> the function, g, unspecied under the null. As we havealready noted, this is a special case <strong>of</strong> the general quantile regression tests introducedin the next section.4. Quantile Regression InferenceThe classical linear regression model asserts th<strong>at</strong> the conditional mean <strong>of</strong> the response,y i , given covari<strong>at</strong>es, x i ,may be expressed as a linear function <strong>of</strong> the covari<strong>at</strong>es.Th<strong>at</strong> is, there exists a 2 R P such th<strong>at</strong>,E(y i jx i )=x > i :The linear quantile regression model asserts, analogously, th<strong>at</strong> the conditional quantilefunctions <strong>of</strong> y i given x i are linear in covari<strong>at</strong>es,(4.1)F ,1y i jx i(jx i )=x > i ()for in some index set T [0; 1]: The model (4.1) will be taken to be our basicmaintained hypothesis. For convenience we will restrict <strong>at</strong>tention to the case th<strong>at</strong>


12 Inference on the Quantile Regression ProcessT =[; 1 , ] for some 2 (0; 1=2), and to facilit<strong>at</strong>e asymptotic local power analysiswe will consider sequences <strong>of</strong> models for which () = n () depends explicitly onthe sample size, n.A leading special case is the loc<strong>at</strong>ion-scale shift model,(4.2)F ,1y i jx i(jx i )=x > i + x > i F ,10():where F ,10() denotes a univari<strong>at</strong>e quantile function. Covari<strong>at</strong>es aect both theloc<strong>at</strong>ion and scale <strong>of</strong> the conditional distribution <strong>of</strong> y i given x i in this model, but thecovari<strong>at</strong>es have no eect on the shape <strong>of</strong> the conditional distribution. Typically, thevectors fx i g \contain an intercept" so e.g., x i =(1;z > i ) > and (4.2) may be seen asarising from the linear modely i = x > i +(x> i )u iwhere the \errors" fu i g are iid with distribution function F 0 .Further specializing themodel, may write,x > i = 0 + z > i 1 ;and the restriction, 1 = 0, then implies th<strong>at</strong> the covari<strong>at</strong>es aect only the loc<strong>at</strong>ion<strong>of</strong> the y i 's. We will call this model(4.3)F ,1y i jx i(jx i )=x > i + 0 F ,10()the loc<strong>at</strong>ion shift model. Although this model underlies much <strong>of</strong> classical econometricinference, it posits a very narrowly circumscribed role for the x i . In the remainder <strong>of</strong>this section we explore ways to test the hypotheses th<strong>at</strong> the general linear quantileregression model takes either the loc<strong>at</strong>ion shift or loc<strong>at</strong>ion-scale shift form.We will consider a linear hypothesis <strong>of</strong> the general form,(4.4)R() , r = (); 2T;where R denotes a q p m<strong>at</strong>rix, q p; r 2 R q ,and () denotes a known function:T !R q .In the two sample model described in Section 2.1,F ,1y i jDi (jD i)= 0 ()+ 1 ()D iwe might like to test th<strong>at</strong>, the tre<strong>at</strong>ment and control distributions dier by an anetransform<strong>at</strong>ion or, even more simply, th<strong>at</strong> they dier by a loc<strong>at</strong>ion shift. Here, D idenotes the indic<strong>at</strong>or variable corresponding to the tre<strong>at</strong>ment. In these cases we maytake () 0;r = 0 ;R =(1; , 1 ) in the former case, and R =(1; ,1) in thel<strong>at</strong>ter case. Of course, we could also expand these two-sample hypotheses to considerfully specied parametric models with an explicit choice <strong>of</strong> (), however, the semiparametricform <strong>of</strong> the hypotheses expressed above seems more plausible for mosteconometric applic<strong>at</strong>ions.


Roger Koenker and Zhijie Xiao 13We will consider tests based on the quantile regression process,^() = argmin b2R pnXi=1 (y i , x > i b)where (u) =u( , I(u ) ,1=2 (R ^() , r , ()) ) v 0 ();so tests th<strong>at</strong> are asymptotically distribution free can be readily constructed. Indeed,Koenker and Machado (1999) consider tests <strong>of</strong> this type when R constitutes an exclusionrestriction so e.g., R =[0.I q ];r=0,and () = 0. In such cases it is also shownth<strong>at</strong> the nuisance parameters ' 0 () and can be replaced by consistent estim<strong>at</strong>eswithout jeopardizing the distribution free character <strong>of</strong> the tests.To formalize the foregoing discussion we introduce the following conditions, whichclosely resemble the conditions employed in Koenker and Machado. We will assumeth<strong>at</strong> the fy i g's are conditional on x i , independent with linear conditional quantilefunctions given by (4.1) and local, in a sense specied in A.3, to the loc<strong>at</strong>ion-scaleshift model (4.2).A. <strong>1.</strong> The distribution function F 0 ,in(4.2) hasacontinuous Lebesgue density, f 0 ,with f 0 (u) > 0 on fu :0 n X n ! J 0 ,apositive denite m<strong>at</strong>rix.(iii): H n = n ,1 X > n , ,1n X n ! H 0 ,apositive denite m<strong>at</strong>rix where , n = diag( > x i ).(iv): max i=1;::: ;n k x i k= O(n 1=4 log n)A. 3. There exists a xed, continuous function () :[0; 1] ! R q such th<strong>at</strong> for samples<strong>of</strong> size n,R n () , r , () =()= p n:As noted in Koenker and Machado (1999), conditions A.1 and A.2 are quite standardin the quantile regression liter<strong>at</strong>ure. Somewh<strong>at</strong> weaker conditions are employedby Gutenbrunner, Jureckova, Koenker, and Portnoy (1993) in an eort to extend thetheory further into the tails. But this isn't required for our present purposes, so wehave reverted to conditions closer to those <strong>of</strong> Gutenbrunner and Jureckova (1992).


14 Inference on the Quantile Regression ProcessCondition A.3 enables us to explore local asymptotic power <strong>of</strong> the proposed testsemploying a r<strong>at</strong>her general form for the local altern<strong>at</strong>ives.We can now st<strong>at</strong>e our rst result. Pro<strong>of</strong>s <strong>of</strong> all results appear in the appendix.Theorem <strong>1.</strong> Let T denote the closed interval ["; 1,"], for some " 2 (0; 1=2). Underconditions A.1-3v n () ) v 0 ()+() for 2Twhere v 0 () denotes a q-vari<strong>at</strong>e standard Brownian bridge process and() =' 0 ()(RR > ) ,1=2 ():Under the null hypothesis, () =0, the test st<strong>at</strong>isticsup2Tk v n () k) sup2Tk v 0 () k :Typically, even if the hypothesis is fully specied, it is necessary to estim<strong>at</strong>e them<strong>at</strong>rix and the function ' 0 (t) f 0 (F ,10(t)). Fortun<strong>at</strong>ely, these quantities can bereplaced by estim<strong>at</strong>es s<strong>at</strong>isfying the following condition.A. 4. There exist estim<strong>at</strong>ors ' n () and n s<strong>at</strong>isfyingi.: sup 2T j' n () , ' 0 ()j = o p (1),ii.: jj n , jj = o p (1).Corollary <strong>1.</strong> The conclusions <strong>of</strong> Theorem 1 remain valid if ' 0 () and are replacedby estim<strong>at</strong>es s<strong>at</strong>isfying condition A.4.Remark. An important class <strong>of</strong> applic<strong>at</strong>ions <strong>of</strong> Theorem 1 involves partial orderings<strong>of</strong> conditional distributions using stochastic dominance. In the simplest case <strong>of</strong> thetwo-sample tre<strong>at</strong>ment control model this involves testing the hypothesis th<strong>at</strong> thetre<strong>at</strong>ment distribution stochastically domin<strong>at</strong>es the control distribution, so 1 () > 0for 2T. This can be accomplished using the one sided KS st<strong>at</strong>istic. Similarly,tests <strong>of</strong> second order stochastic dominance can be based on the indenite integralprocess <strong>of</strong> 1 (). These two sample tests extend nicely to general quantile regressionsettings and thus complement the important work <strong>of</strong> McFadden (1989) on testing forstochastic dominance, and Abadie (2000) on rel<strong>at</strong>ed test for tre<strong>at</strong>ment eects.Theorem 1 extends slightly the results <strong>of</strong> Koenker and Machado (1999), but it stillcan not be used to test the loc<strong>at</strong>ion shift or loc<strong>at</strong>ion-scale shift hypothesis. It fails toanswer our main question: how to deal with unknown nuisance parameters in R andr? To begin to address this question, we introduce an additional condition.A. 5. There exist estim<strong>at</strong>ors R n and r n s<strong>at</strong>isfying p n(R n , R) =O p (1) andpn(rn , r) =O p (1):And we now consider the parametric quantile regression process,^v n () = p n' 0 ()[R n R > n ],1=2 (R n ^() , rn , ()):


Roger Koenker and Zhijie Xiao 15The next result establishes a represent<strong>at</strong>ion for ^v n () analogous th<strong>at</strong> provided in (2.2)for the univari<strong>at</strong>e empirical quantile process.Theorem 2. Under conditions A.1-5, we have^v n () , Z > n () ) v 0 ()+()where () =' 0 ()(1;F ,10()) > , and Z n = O p (1), withv 0 () and () as specied inTheorem <strong>1.</strong>Corollary 2. The conclusions <strong>of</strong> Theorem 2 remain valid if ' 0 () and are replacedby estim<strong>at</strong>es s<strong>at</strong>isfying condition A.4.As in the univari<strong>at</strong>e case we are faced with two options. We can accept the presence<strong>of</strong> the Z n term, and abandon the asymptotically distribution free n<strong>at</strong>ure <strong>of</strong> tests basedupon ^v n (): This would presumably require some resampling str<strong>at</strong>egy to determinecritical values. Or we can, following Khmaladze, try to nd a transform<strong>at</strong>ion <strong>of</strong> ^v n ()th<strong>at</strong> annihil<strong>at</strong>es the Z n contribution, and thus restores the asymptotically distributionfree n<strong>at</strong>ure <strong>of</strong> inference. We adopt the l<strong>at</strong>ter approach.Let g(t) =(t; (t) > ) > so _g(t) =(1; (t); (t)F ,1 (t)) > with (t) =( f=f)(F _ ,1 (t)).We will assume th<strong>at</strong> g(t) s<strong>at</strong>ises the following condition.A. 6. The function g(t) s<strong>at</strong>ises:i: R k _g(t) k 2 dt < 1,ii: f _g i (t) :i =1;::: ;mg are linearly independent in a neighborhood <strong>of</strong><strong>1.</strong>We may note th<strong>at</strong> Khmaladze (1981, x3.3) shows th<strong>at</strong> A.6.ii implies C ,1 () existsfor all Q g^v n () > =^v n () > ,Z 0_g(s) > C ,1 (s)Z 1s_g(r)d^v n (r) > ds;where the recursive least squares transform<strong>at</strong>ion should now beinterpreted as oper<strong>at</strong>ingcoordin<strong>at</strong>e by coordin<strong>at</strong>e on the ^v n process.Theorem 3. Under conditions A.1 - 6, we have~v n () ) w 0 ()+~()where w 0 () denotes a q-vari<strong>at</strong>e standard Brownian motion, and ~() > = Q g () > .Under the null hypothesis, () =0,sup2Tk ~v n () k) sup2Tk w 0 () k :Typically, in applic<strong>at</strong>ions, the function g(t) will not be specied under the nullhypothesis, but will also need to be estim<strong>at</strong>ed. Fortun<strong>at</strong>ely, only one r<strong>at</strong>her mildfurther condition is needed to enable us to replace g by an estim<strong>at</strong>e.A. 7. There exists an estim<strong>at</strong>or, g n (), s<strong>at</strong>isfying sup 2T k _g n () , _g() k= o p (1):


16 Inference on the Quantile Regression ProcessBy a similar argument as Bai (1998), we have the following Corollary.Corollary 3. The conclusions <strong>of</strong> Theorem 3 remain valid if ' 0 (); , andg are replacedby estim<strong>at</strong>es s<strong>at</strong>isfying conditions A.4 and A.7.In some applic<strong>at</strong>ions, R is known and only r contains nuisance parameters th<strong>at</strong>need to be estim<strong>at</strong>ed. In this case, we have reduced dimensionality in(t), Z n , g(t),and _g(t). In particular, the asymptotic result <strong>of</strong> Theorem 2 reduces to^v n () , ' 0 ()[RR > ] ,1=2p n(r n , r) ) v 0 ()+();and the corresponding _g() (org() ) functions in transform<strong>at</strong>ion (4.6) are_g(t) =(1; (t)) > and g(t) =(t; ' 0 (t)) > :An important example <strong>of</strong> this case is the heteroskedasticity test described in Section4.2.The proposed tests have non-trivial power against any local devi<strong>at</strong>ion () suchth<strong>at</strong> Q g () 6= 0. Consider the following linear oper<strong>at</strong>or K:Z 1Kx(t)= _g(t) > C(t) ,1 _g(r)x(r)drand denote its eigen-space with eigenvalue 1, fx(t) :x(t) =Kx(t)g, asS. Our testswill have power as long as =2S: K is a Fredholm integral oper<strong>at</strong>or dened on thespace L 2 [0; 1], and we can write,with Volterra kernelKx(t) =Z 10tK(t; s)x(s)dsK(t; s) = _g(t) > C(t) ,1 _g(s)1 [t;1] (s):The set S is the set <strong>of</strong> solutions to the Volterra equ<strong>at</strong>ion,x(t) =Z 10K(t; s)x(s)ds;and by Khmaladze (1981, p. 252), S = fx : x(t) = _g(t) > g. Thus, the proposed testshave non-trivial power against any local devi<strong>at</strong>ion process () such th<strong>at</strong> _ is not inthe linear space spanned by the elements <strong>of</strong> _g().In the above analysis, for simplicity and without loss <strong>of</strong> generality we focus onst<strong>at</strong>istics <strong>of</strong> the classical Kolmogorov-Smirnov form. The results <strong>of</strong> the paper applymore generally to st<strong>at</strong>istics <strong>of</strong> the form h(~v n ()) for continuous functionals h: Besidesthe \sup" function, one may use other measures <strong>of</strong> the discrepancy between ~v n () and0; depending on the altern<strong>at</strong>ives <strong>of</strong> interest. For instance, we can construct a CramervonMises type test st<strong>at</strong>istic R T ~v n() > ~v n ()d; based on ~v n (): Or, more generally,RT ~v n() > W ()~v n ()d for a suitably chosen weight m<strong>at</strong>rix function W ().


Roger Koenker and Zhijie Xiao 17The foregoing results provide some basic machinery for a broad class <strong>of</strong> tests basedon the quantile regression process. In the next section we provide further detailson the implement<strong>at</strong>ion <strong>of</strong> these tests focusing most <strong>of</strong> our <strong>at</strong>tention on tests <strong>of</strong> theloc<strong>at</strong>ion shift and loc<strong>at</strong>ion-scale shift models.5. Implement<strong>at</strong>ion <strong>of</strong> the TestsGiven a general framework for inference based on the quantile regression process,we cannow elabor<strong>at</strong>e some missing details. We will begin by considering tests <strong>of</strong> theloc<strong>at</strong>ion scale shift hypothesis against a general quantile regression altern<strong>at</strong>ive. Tests<strong>of</strong> the loc<strong>at</strong>ion shift hypothesis and several variants <strong>of</strong> tests for heteroscedasticitywillthen be considered. Problems associ<strong>at</strong>ed with estim<strong>at</strong>ion <strong>of</strong> nuisance parameters aretre<strong>at</strong>ed in the nal subsection.5.<strong>1.</strong> The loc<strong>at</strong>ion-scale shift hypothesis. We would like to testF ,1y i jx i(jx i )=x > i + x > i F ,10()against the sequence <strong>of</strong> linear quantile regression altern<strong>at</strong>ivesF ,1y i jx i(jx i )= x > i n():In the simplest case the univari<strong>at</strong>e quantile function is known and we can formul<strong>at</strong>ethe hypothesis in the (4.4) not<strong>at</strong>ion,R() , r = ()by setting r i = i = i ; R = diag( ,1i), and () = p,1F ,10(): Obviously, there issome diculty if there are i equal to zero. In such cases, we can take i =1,andset the corresponding elements r i = i and i() 0. How should we go aboutestim<strong>at</strong>ing the parameters and ? Under the null hypothesis, i () = i + i F ,10() i =1;::: ;pso it is n<strong>at</strong>ural to consider linear regression. Since ^ i () is piecewise constant withjumps <strong>at</strong> points f 1 ;::: ; J ); it suces to consider p bivari<strong>at</strong>e linear regressions <strong>of</strong>^ i ( j )onf(1;F ,10( j )) : j =1;::: ;Jg: Each <strong>of</strong> these regressions has a known (asymptotic)Gaussian covariance structure th<strong>at</strong> could be used to construct a weightedleast squares estim<strong>at</strong>or, but pragm<strong>at</strong>ism might lead us to opt for the simpler unweightedestim<strong>at</strong>or. In either case we have our required O(n ,1=2 ) estim<strong>at</strong>ors ^ n and^ n .When F ,10() is(hypothetically) known the Khmaladz<strong>at</strong>ion process is rel<strong>at</strong>ivelypainless comput<strong>at</strong>ionally. The function _g(t) =(1; 0 (t); 0 (t)F ,10(t)) > is known andthe transform<strong>at</strong>ion (3.3) can be carried out by recursive least squares. Again, the discretiz<strong>at</strong>ionis based on the jumps f 1 ;::: ; J g <strong>of</strong> the piecewise constant ^() process.


18 Inference on the Quantile Regression ProcessTests st<strong>at</strong>istics based on the transformed process, ~v n (), can then be easily computed.The simplest <strong>of</strong> these is probably the Kolmogorov-Smirnov sup-type st<strong>at</strong>isticK n = sup2Tk ~v n () kwhere T is typically <strong>of</strong> the form ["; 1,"]with" 2 (0; 1=2): The choice <strong>of</strong> the norm jjjjis also an issue. Euclidean norm is obviously n<strong>at</strong>ural, but has the possibly undesirableeect <strong>of</strong> accentu<strong>at</strong>ing extreme behavior in a few coordin<strong>at</strong>es. Instead, we will employthe `1 norm in the simul<strong>at</strong>ions and the empirical applic<strong>at</strong>ion below.In practice, F ,10(t), is generally unknown under the null and it is convenient tochoose one coordin<strong>at</strong>e, typically the intercept coecient, to play the role <strong>of</strong> numeraire.If the covari<strong>at</strong>es are centered then the intercept component <strong>of</strong> the quantile regressionprocess can be interpreted as an estim<strong>at</strong>e <strong>of</strong> the conditional quantile function <strong>of</strong> theresponse <strong>at</strong> the mean covari<strong>at</strong>e vector. From (4.4) we can write(5.1) i () = i + i 1 () i =2;::: ;pwhere i = i , 1 i = 1 and i = i = 1 , or in m<strong>at</strong>rix not<strong>at</strong>ion asR() =rwhere () 0; R =[. , I p,1] andr = ,. Estim<strong>at</strong>es <strong>of</strong> the vectors and are again obtainable by regression <strong>of</strong> ^ i () i =2;::: ;p on the intercept coordin<strong>at</strong>e^ 1 ():5.2. Estim<strong>at</strong>ion <strong>of</strong> Nuisance Parameters. Our proposed tests depend cruciallyon estim<strong>at</strong>es <strong>of</strong> the quantile density and quantile score functions. Fortun<strong>at</strong>ely, thereis a large rel<strong>at</strong>ed liter<strong>at</strong>ure on estim<strong>at</strong>ing ' 0 (); including e.g. Siddiqui (1960), Hajek(1962), She<strong>at</strong>her and Maritz (1983), and Welsh (1988). Following Siddiqui, since,dF ,10(t)=dt =(' 0 (t)) ,1 , it is n<strong>at</strong>ural to use the estim<strong>at</strong>or,(5.2)' n (t) =2h nF n ,1 (t + h n ) , F n ,1 (t , h n ) ;where F ,1n (s) is an estim<strong>at</strong>e <strong>of</strong> F ,10(s) andh n is a bandwidth which tends to zero asn !1: See Koenker and Machado (1999) for further details including discussion <strong>of</strong>the estim<strong>at</strong>ion <strong>of</strong> the m<strong>at</strong>rix appearing in the standardiz<strong>at</strong>ion <strong>of</strong> the process.Finally, wemust face the problem <strong>of</strong> estim<strong>at</strong>ing the function _g. Fortun<strong>at</strong>ely, thereis already a large liter<strong>at</strong>ure on estim<strong>at</strong>ion <strong>of</strong> score functions. For our purposes it isconvenient to employ the adaptive kernel method described in Portnoy and Koenker(1989). An <strong>at</strong>tractive altern<strong>at</strong>ive to this approach has been developed by Cox (1985)and Ng (1994) based on smoothing spline methods. Given a uniformly consistentestim<strong>at</strong>or, _g n , s<strong>at</strong>isfying condition A.7, see Portnoy and Koenker (1989, Lemma 3.2),Corollary 3 implies th<strong>at</strong> under the null hypothesis~v n (t) Q gn^v n (t) ) w 0 (t)


Roger Koenker and Zhijie Xiao 19and therefore tests can be based on K n as before. Note th<strong>at</strong> in this case estim<strong>at</strong>ion<strong>of</strong> _g provides as a byproduct an estim<strong>at</strong>or <strong>of</strong> the function ' 0 (t) which is needed tocompute the process ^v n (t).In applic<strong>at</strong>ions it will usually be desirable to restrict <strong>at</strong>tention to a closed interval[ 0 ; 1 ] (0; 1). This is easily accommod<strong>at</strong>ed, following Koul and Stute (1999),Remark 2.3, by considering the modied test st<strong>at</strong>istic,K n =sup jj~v() , ~v( 0 )jj= p 1 , 0 ;2[ 0 ; 1 ]which converges weakly, just as in the unrestricted case, to sup [0;1]jjw 0 ()jj. Thisrenormaliz<strong>at</strong>ion is useful in our empirical applic<strong>at</strong>ion, for example, since we are restricted<strong>at</strong> the outset to estim<strong>at</strong>ing the ^v process on the subinterval [ 0 ; 1 ]=[:2;:8].Indeed, it may be fruitful to consider other forms <strong>of</strong> standardiz<strong>at</strong>ion as well.5.3. The loc<strong>at</strong>ion shift hypothesis. An important special case <strong>of</strong> the loc<strong>at</strong>ionscaleshift model is, <strong>of</strong> course, the pure loc<strong>at</strong>ion shift model,F ,1y i jx i(jx i )=x > i + 0F ,10 ()This is just the classical homoscedastic linear regression model,y i = x > i + 0 u iwhere the fu i g are iid with distribution function F 0 . This model underlies much <strong>of</strong>classical econometric theory and practice. If it is found to be appropri<strong>at</strong>e then it isobviously sensible to consider estim<strong>at</strong>ion by altern<strong>at</strong>ive methods. For F 0 Gaussian,least squares would <strong>of</strong> course be optimal. For F 0 unknown one might consider theHuber M-estim<strong>at</strong>or, or its L-estim<strong>at</strong>or counterpart,Z 1,^ =(1, 2) ,1 ^()d;see Koenker and Portnoy (1987). In the loc<strong>at</strong>ion shift model it is also well-knownfrom Bickel (1982), th<strong>at</strong> the slope parameters, ( 2 ;:::; p ), are adaptively estimableprovided F 0 has nite Fisher inform<strong>at</strong>ion for the loc<strong>at</strong>ion parameter. Thus, it wouldbe reasonable to consider M-estim<strong>at</strong>ors like those described in Hsieh and Manski(1987) or the adaptive L-estim<strong>at</strong>ors described in Portnoy and Koenker (1989).The loc<strong>at</strong>ion-shift hypothesis can be expressed in standard form,R() =r;by setting R =[0.I p,1]; r =( 2 ;::: ; p ) > . It asserts simply th<strong>at</strong> the quantile regressionslopes are constant, independent <strong>of</strong>. Again, the unknown parameters infR; rg are easily estim<strong>at</strong>ed so the process ^v n () is easily constructed. The transform<strong>at</strong>ionis obviously somewh<strong>at</strong> simpler in this case since g(t) =(t; ' 0 (t)) has one fewercoordin<strong>at</strong>e than in the previous case.


20 Inference on the Quantile Regression ProcessWe can continue to view tests <strong>of</strong> the loc<strong>at</strong>ion-shift hypothesis as tests against ageneral quantile regression altern<strong>at</strong>ive represented in (A.3), or we can also consider thebehavior <strong>of</strong> the tests against a more specialized class <strong>of</strong> loc<strong>at</strong>ion-scale shift altern<strong>at</strong>ivesfor which() = 0 F ,10()for some xed vector 0 2 R p,1 . In the l<strong>at</strong>ter setting we have a test for parametricheteroscedasticity andwe can compare the performance <strong>of</strong> our very general class <strong>of</strong>tests against altern<strong>at</strong>ive tests designed to be more narrowly focused on heteroscedasticaltern<strong>at</strong>ives.An optimal (invariant) test in the parametric setting may be based on optimalL-estim<strong>at</strong>or <strong>of</strong> scale with weight function,(5.3)!() = ddx (x f=f)j _x=F,1 ;0 ( )see e.g. Sering (1980). Thus, for example, in the Gaussian model, F 0 =,wewouldhave, !() = ,1 (); so our estim<strong>at</strong>or <strong>of</strong> 0 would be,^ n =Z 10 ,1 () ^()d;and a test for heteroscedasticity could be based on the last p , 1 coordin<strong>at</strong>es <strong>of</strong> ^ n .One way tointerpret such tests is to view them as smoothly weighted linear combin<strong>at</strong>ions<strong>of</strong> the interquantile range tests for heteroscedasticity introduced in Koenkerand Bassett (1982). Clearly, in the case <strong>of</strong> the Gaussian weight function, extremeinterquantile ranges get considerable weight, so it may be prudent to consider Huberizedversions <strong>of</strong> these tests th<strong>at</strong> trim the inuence <strong>of</strong> the tails. Altern<strong>at</strong>ively, onecould consider weight functions explicitly designed for more heavy tailed distributionslike the Cauchy,!() =2sin(2)(cos(2) , 1):5.4. Local Asymptotic Power Comparison. In this section we compare the heteroscedasticitytests proposed above in an eort to evalu<strong>at</strong>e the cost <strong>of</strong> considering amuch more general class <strong>of</strong> semiparametric altern<strong>at</strong>ives instead <strong>of</strong> the strictly parametricaltern<strong>at</strong>ives represented by the loc<strong>at</strong>ion scale shift model.We consider the local altern<strong>at</strong>ive with() = 0 F ,10() in the loc<strong>at</strong>ion-shift model,and denote this hypothesis as H n as in assumption A.3. This corresponds to thelinear model with asymptotically vanishing heteroskedasticity studied by Koenkerand Bassett (1982). In the loc<strong>at</strong>ion-shift model, R =[0.I p,1 ]; and thus RR > = x ;where x is the lower (p , 1) (p , 1) corner <strong>of</strong> : Under H n ;p ^v() = ' 0 () ,1=2x nR ^() , r np= v 0 () , ' 0 () ,1=2x n (rn , r)+ 0 ,1=2x' 0 ()F ,1 ()+o 0 p(1)


and the transformed process isRoger Koenker and Zhijie Xiao 21~v() =w 0 ()+ ~ () 0 ,1=2x + o p (1)where the noncentrality process () ~ is simply the martingale transform<strong>at</strong>ion <strong>of</strong>() =' 0 ()F ,10(). Provided th<strong>at</strong> () _ is not in the space spanned by the functionsf1; _' 0 ()g; it is clear th<strong>at</strong> () ~ 6= 0, so the proposed tests have non-trivial power. Figure1 depicts the noncentrality function () its \predicted" version, and its (residual)transformed version () ~ for the normal case. For any given 0 ; the asymptotic localpower <strong>of</strong> the proposed Kolmogorov-Smirnov test is given as followsP (c) =Prsup2Tw 0 ()+ ~ () 0 ,1=2x >c where c is the asymptotic critical value <strong>at</strong> signicance level :Example. It is <strong>of</strong> obvious interest to consider altern<strong>at</strong>ives for which the function 0 (t) =' 0 (t)F ,10(t) is annihil<strong>at</strong>ed by the martingale transform<strong>at</strong>ion. This entailsth<strong>at</strong> _ 0 (t) lie in the linear span <strong>of</strong> f1; _' 0 (t)g, and is evidently s<strong>at</strong>ised for F 0 uniform.More surprisingly, it is also s<strong>at</strong>ised for triangular densities like, f 0 =(1,jxj)I(jxj1). These cases may be regarded as somewh<strong>at</strong> p<strong>at</strong>hological as they have inniteFisher inform<strong>at</strong>ion for both loc<strong>at</strong>ion and scale. Thus, it may not be regarded as toodisturbing th<strong>at</strong> the transform<strong>at</strong>ion performs poorly under such circumstances.5.5. Extensions. The proposed methods can be easily modied to accommod<strong>at</strong>emany other inference problems, including tests for parametric conditional distributionsand tests for conditional symmetry. The theory wedevelop also carries over withminor changes to some analytic nonlinear restrictions in place <strong>of</strong> (4.4) and associ<strong>at</strong>edchanges in the transform<strong>at</strong>ion (4.6). For example, we may modify our hypothesis tothe form(5.4)R() , r = (;):If were known, the previous tests would apply. More generally, if is unknown butmay be estim<strong>at</strong>ed by a p n consistent estim<strong>at</strong>or, n ; then, under regularity conditions(so th<strong>at</strong> admits a Taylor expansion to the second order), the standardized empiricalprocess R n ^() , rn , (; n ) has the following approxim<strong>at</strong>ion^v n () = p n' 0 () R n R > n ,1=2R n ^() , rn , (; n )= v n ()+' 0 ()Z 1 + ' 0 ()F ,10()Z 2 + ' 0 ()s() > Z 3 + o p (1);where s() =@ (; 0 )=@; and Z j = O p (1), j =1; 2; 3: Denoting the set <strong>of</strong> linearlyindependent components <strong>of</strong> functions(;' 0 ();' 0 ()F ,10();' 0 ()s() > );


22 Inference on the Quantile Regression Processas g() and assuming th<strong>at</strong> _g() s<strong>at</strong>ises Assumption A.6, then the transformed process~v n () dened by (4.6) can be constructed and similar results to Theorem 3 can beobtained.The above process can be applied to testing the hypothesis th<strong>at</strong> the conditionaldistribution <strong>of</strong> y i ; conditional on x i , has a continuous cdf F (;), which depends onsome parameters . Thishypothesis can be expressed in the form <strong>of</strong> (5.4) by settingR =(1; 0; ::::::; 0); r= 1 ,and (; 0 )=F ,1 (; 0 ). In this case, g() corresponds tothe set <strong>of</strong> linearly independent components <strong>of</strong> functions (;' 0 ();' 0 ()s() > ). In thecase <strong>of</strong> the Gaussian hypothesis, the distribution is fully determined by the loc<strong>at</strong>ionand scale parameters which can be accommod<strong>at</strong>ed into R and r by re-standardiz<strong>at</strong>ion,reducing to the hypothesis to the simple formul<strong>at</strong>ion (4.4).6. A Reappraisal <strong>of</strong> the Pennsylvania Reemployment BonusExperimentsA common concern about unemployment insurance (UI) systems has been thesuggestion th<strong>at</strong> the insurance benet acts as a disincentive for job-seekers and thusprolongs the dur<strong>at</strong>ion <strong>of</strong> unemployment spells. During the 1980's several controlledexperiments were conducted in the U.S. to test the incentive eects altern<strong>at</strong>ive compens<strong>at</strong>ionschemes for UI. In these experiments, UI claimants were oered a cashbonus if they found a job within some specied period <strong>of</strong> time and if the job wasretained for a specied dur<strong>at</strong>ion. The question addressed by the experiments was:would the promise <strong>of</strong> such a monetary lump-sum benet provide a signicant inducementfor more intensive job-seeking and thus reduce the dur<strong>at</strong>ion <strong>of</strong> unemployment?In the rst experiments conducted in <strong>Illinois</strong> a random sample <strong>of</strong> new UI claimantswere told th<strong>at</strong> they would receiveabonus <strong>of</strong> $500 if they found full-time employmentwithin 11 weeks after ling their initial claim, and if they retained their new job for<strong>at</strong> least 4 months. These \tre<strong>at</strong>ment claimants" were then compared with a controlgroup <strong>of</strong> claimants who followed the usual rules <strong>of</strong> the <strong>Illinois</strong> UI system. The <strong>Illinois</strong>experiment provided very encouraging initial indic<strong>at</strong>ion <strong>of</strong> the incentive eects <strong>of</strong>such policies. They showed th<strong>at</strong> bonus oers resulted in a signicant reduction in thedur<strong>at</strong>ion <strong>of</strong> unemployment spells and consequent reduction <strong>of</strong> the regular amountspaid by the st<strong>at</strong>e to UI beneciaries. This nding led to further \bonus experiments"in the st<strong>at</strong>es <strong>of</strong> New Jersey, Pennsylvania and Washington with a variety <strong>of</strong> newtre<strong>at</strong>ment options. An excellent review <strong>of</strong> the experiments, some general conclusionsabout their ecacy and a critique <strong>of</strong> their policy relevance can be found in Meyer(1995), and Meyer (1996). In this section we will focus more narrowly on a reanalysis<strong>of</strong> d<strong>at</strong>a from the Pennsylvania Reemployment Bonus Demonstr<strong>at</strong>ion described indetail in Corson, Decker, Dunstan, and Keransky (1992).The Pennsylvania experiments were conducted by the U.S. Department <strong>of</strong>Laborbetween July 1988 and October 1989. During the enrollment period, claimants whobecame unemployed and registered for unemployment benets in one <strong>of</strong> the 12 selected


Roger Koenker and Zhijie Xiao 23local oces throughout the st<strong>at</strong>e were randomly assigned either to a control groupor one <strong>of</strong> six experimental tre<strong>at</strong>ment groups. In the control group the existing rules<strong>of</strong> the unemployment insurance system applied. Individuals in the tre<strong>at</strong>ment groupswere oered a cash bonus if they became reemployed in a full-time job, workingmore than 32 hours per week, within a specied qualic<strong>at</strong>ion period. Two bonuslevels and two qualic<strong>at</strong>ion periods were tested, but we will restrict <strong>at</strong>tention to thehigh bonus, long qualic<strong>at</strong>ion period tre<strong>at</strong>ment which oered a cash <strong>of</strong> bonus <strong>of</strong> sixtimes the weekly benet for claimants establishing reemployment within 12 weeks.This restricted the sample size to 6384 observ<strong>at</strong>ions. A detailed description <strong>of</strong> thecharacteristics <strong>of</strong> claimants under study is presented in Koenker and Bilias (2001)which has inform<strong>at</strong>ion on age, race, gender, number <strong>of</strong> dependents, loc<strong>at</strong>ion in thest<strong>at</strong>e, existence <strong>of</strong> recall expect<strong>at</strong>ions, and type <strong>of</strong> occup<strong>at</strong>ion.Since a large portion <strong>of</strong> spells end in either the rst week or the twenty seventhweek, it should be stressed th<strong>at</strong> the denition <strong>of</strong> the rst spell <strong>of</strong> UI in the Pennsylvaniastudy includes a waiting week and th<strong>at</strong> the maximum number <strong>of</strong> uninterruptedfull weekly benets is 26. This implies th<strong>at</strong> many subjects did not receiveanyweeklybenet and th<strong>at</strong> many other claimants received continuously their full, entitled unemploymentbenet. Again, Koenker and Bilias (2001) contains further details.6.<strong>1.</strong> The Model. Our basic model for analyzing the Pennsylvania experiment presumesth<strong>at</strong> the logarithm <strong>of</strong> the dur<strong>at</strong>ion (in weeks) <strong>of</strong> subjects' spells <strong>of</strong> UI benetshave linear conditional quantile functions <strong>of</strong> the formQ log(T ) (jx) =x > ():The choice <strong>of</strong> the log transform<strong>at</strong>ion is dict<strong>at</strong>ed primary by the desire to achievelinearity <strong>of</strong> the parametric specic<strong>at</strong>ion and by its ease <strong>of</strong> interpret<strong>at</strong>ion. Multiplic<strong>at</strong>ivecovari<strong>at</strong>e eects are widely employed throughout survival analysis, and they arecertainly more plausible in the present applic<strong>at</strong>ion than the assumption <strong>of</strong> additiveeects. It is perhaps worth reiter<strong>at</strong>ing th<strong>at</strong> the role <strong>of</strong> the transform<strong>at</strong>ion is completelytransparent in the quantile regression setting, where Q h(T ) (jx) = x > ()implies Q T (jx) =h ,1 (x > ()). In contrast, the role <strong>of</strong> transform<strong>at</strong>ions in models <strong>of</strong>the conditional mean function are r<strong>at</strong>her complic<strong>at</strong>ed since the transform<strong>at</strong>ion aectsnot only loc<strong>at</strong>ion, but scale and shape <strong>of</strong> the conditional distribution <strong>of</strong> the response.Our (provisional) model includes the following eects: Indic<strong>at</strong>or for the tre<strong>at</strong>ment group. Indic<strong>at</strong>ors for female, black and hispanic respondents. Number <strong>of</strong> dependents, with 2 indic<strong>at</strong>ing two or more dependents. Indic<strong>at</strong>ors for the 5 quarters <strong>of</strong> entry to the experiment. Indic<strong>at</strong>or for whether the claimant \expected to be recalled" to a previous job. Indic<strong>at</strong>ors for whether the respondent was \young" { less than 35, or \old" {indic<strong>at</strong>ing age gre<strong>at</strong>er than 54. Indic<strong>at</strong>or for whether claimant was employed in the durable goods industry.


24 Inference on the Quantile Regression Process Indic<strong>at</strong>or for whether the claimant was registered in one <strong>of</strong> the low unemploymentshort unemployment dur<strong>at</strong>ion districts: Co<strong>at</strong>esville, Reading, or Lancaster.In Figure 6.1 we present a concise visual represent<strong>at</strong>ion <strong>of</strong> the results from theestim<strong>at</strong>ion <strong>of</strong> this model. Each <strong>of</strong> the panels <strong>of</strong> the Figure illustr<strong>at</strong>e one coordin<strong>at</strong>e <strong>of</strong>the vector-valued function, ^(), viewed as a function <strong>of</strong> 2 [ 0 ; 1 ]. Here we choose 0 = :20 and 1 = :80, eectively neglecting the proportion <strong>of</strong> the sample th<strong>at</strong> areimmedi<strong>at</strong>ely reemployed in week one and those whose unemployment spell exceedsth<strong>at</strong> insured limit <strong>of</strong> 26 weeks. The lightly shaded region in each panel <strong>of</strong> the gurerepresents a 90 percent condence band. We omit the plots for the 5 quarter <strong>of</strong> entryindic<strong>at</strong>ors, and for the for the low unemployment district variable to conserve space.Before turning to interpret<strong>at</strong>ion <strong>of</strong> specic coecients, we will try to oer somebrief general remarks on how tointerpret these gures. The simplest case is the pureloc<strong>at</strong>ion shift model for which wewould have the classical acceler<strong>at</strong>ed failure time(AFT) model,log T i = x > i + u iwith fu i g's iid from some F . For F <strong>of</strong> the form F (u) =1, exp(, exp(u)), this isthe well known Cox proportional hazard model with Weibull baseline hazard. In thiscase we would expect to see slope coecients ^ j () th<strong>at</strong> oscill<strong>at</strong>e around a constantvalue indic<strong>at</strong>ing th<strong>at</strong> the shift in the response due to a change in the covari<strong>at</strong>e isconstant over the entire estim<strong>at</strong>ed range <strong>of</strong> the distribution. The conditional meaneect estim<strong>at</strong>ed by least squares is asymptotically equivalent in this case to integr<strong>at</strong>ingthe estim<strong>at</strong>ed coecients over the unit interval.Another conventional model with linear quantile functions is the linear loc<strong>at</strong>ionscalemodel,log T i = x > i +(x > i )u iwhere again, u i is taken to be iid. Nowthecovari<strong>at</strong>es are allowed to inuence the scaleas well as the loc<strong>at</strong>ion <strong>of</strong> the conditional distribution <strong>of</strong> dur<strong>at</strong>ions. In this case theplots <strong>of</strong> the \slope" coecients ^ j () should look just like the \intercept" coecientup to a loc<strong>at</strong>ion and scale shift. The intercept coecient estim<strong>at</strong>es a normalizedversion <strong>of</strong> the quantile function <strong>of</strong> the u i 's and all the other coecients are simplyloc<strong>at</strong>ion and scale shifts <strong>of</strong> this function.6.2. Interpret<strong>at</strong>ion <strong>of</strong> the Estim<strong>at</strong>ed Eects. No tre<strong>at</strong>ment eect is observed ineither tail implying th<strong>at</strong> the tre<strong>at</strong>ment had no eect in changing the probability <strong>of</strong>immedi<strong>at</strong>e reemployment (inweek one), or in eecting the probability <strong>of</strong> dur<strong>at</strong>ionsbeyond the 26 week maximum. The high bonus and long qualic<strong>at</strong>ion period tre<strong>at</strong>ment,yielded roughly a 15% reduction in median dur<strong>at</strong>ion. This eect is considerablystronger st<strong>at</strong>istical signicance than th<strong>at</strong> seen in the other tre<strong>at</strong>ments.The randomiz<strong>at</strong>ion <strong>of</strong> the experiment was quite eective in rendering the potentiallyconfounding eects <strong>of</strong> other covari<strong>at</strong>es orthogonal to the tre<strong>at</strong>ment indic<strong>at</strong>or.


Roger Koenker and Zhijie Xiao 25Figure 6.<strong>1.</strong> Quantile Regression Process for Log Dur<strong>at</strong>ion ModelIntercept0.5 <strong>1.</strong>5 2.5Tre<strong>at</strong>ment-0.25 -0.10 0.00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>0Female0.0 0.1 0.2Black-<strong>1.</strong>2 -0.8 -0.4 0.00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>0Hispanic-<strong>1.</strong>0 -0.6 -0.2N-Dependents0.0 0.05 0.150.0 0.2 0.4 0.6 0.8 <strong>1.</strong>00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>0Recall Effect0.0 0.5 <strong>1.</strong>0Young Effect-0.4 -0.2 0.00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>0Old Effect0.0 0.1 0.2 0.3 0.4Durable Effect0.0 0.1 0.2 0.30.0 0.2 0.4 0.6 0.8 <strong>1.</strong>00.0 0.2 0.4 0.6 0.8 <strong>1.</strong>0


26 Inference on the Quantile Regression ProcessNevertheless, it is <strong>of</strong> some interest to explore the eect <strong>of</strong> other covari<strong>at</strong>es in an eortto better understand determinants <strong>of</strong> the dur<strong>at</strong>ion <strong>of</strong> unemployment.Women are 5 to 15% slower than men to exit unemployment. Blacks and Hispanicsappear much quicker than whites to become reemployed. This eect is particularlystriking in the case <strong>of</strong> blacks for whom median dur<strong>at</strong>ion is roughly half ( e ,:75 ) th<strong>at</strong><strong>of</strong> whites, and only 30% as long as controls <strong>at</strong> = :33: The number <strong>of</strong> dependentsappears to exert a r<strong>at</strong>her weak positive eect on unemployment dur<strong>at</strong>ions. Thequarter-<strong>of</strong>-entry variables are inherently not very interesting, but it appears th<strong>at</strong> l<strong>at</strong>eentry into the experiment improved one's chances for early reemployment. The recallindic<strong>at</strong>or is considerably more interesting; anticip<strong>at</strong>ed recall to one's prior job hasavery strong and very precisely estim<strong>at</strong>ed detrimental eect over the entire lowertail <strong>of</strong> the distribution. However, beyond quantile = :6; which corresponds toabout 20 weeks dur<strong>at</strong>ion for white, male controls, the anticip<strong>at</strong>ed recall appears tobe abandoned and beyond this point expected recall becomes a signicant force forearly reemployment in the upper tail <strong>of</strong> the distribution.Not surprisingly the young (those under 34) tend to nd reemployment earlierthan their middle aged counterparts, while the old (those over 54) do signicantlyworse. In both cases the eects are highly signicant throughout the entire range<strong>of</strong> quantiles we have estim<strong>at</strong>ed. Prior employment in durable manufacturing has aweakly disadvantageous eect on reemployment, but residing in a low unemploymentdistrict is, not surprisingly, helpful in facilit<strong>at</strong>ing more rapid reemployment.The tre<strong>at</strong>ment eect <strong>of</strong> the bonus oer clearly does not conform well to the loc<strong>at</strong>ionshift paradigm <strong>of</strong> the conventional models. After the log transform<strong>at</strong>ion <strong>of</strong> dur<strong>at</strong>ions,a loc<strong>at</strong>ion shift would imply th<strong>at</strong> the tre<strong>at</strong>ment exerts a constant percentage changein all dur<strong>at</strong>ions. In the present instance this implic<strong>at</strong>ion is particularly unpal<strong>at</strong>ablesince the entire point <strong>of</strong> the experiment was to alter the shape <strong>of</strong> the conditional dur<strong>at</strong>iondistribution, concentr<strong>at</strong>ing mass within the qualic<strong>at</strong>ion period, and reducingit beyond this period. In the tre<strong>at</strong>ment panel <strong>of</strong> Figure 6.1 we have seen th<strong>at</strong> thebonus eect gradually reduces dur<strong>at</strong>ions from a null eect in the lower tail to a maximumreduction <strong>of</strong> 15% <strong>at</strong> the median, and then gradually again returning to a nulleect in the upper tail. This nding accords perfectly with the timing imposed bythe qualic<strong>at</strong>ion period <strong>of</strong> the experiment. It mightbethought th<strong>at</strong> the bonus shouldnot aect dur<strong>at</strong>ions <strong>at</strong> all beyond the qualic<strong>at</strong>ion period, but further consider<strong>at</strong>ionsuggests th<strong>at</strong> acceler<strong>at</strong>ed search in an eort to meet the qualic<strong>at</strong>ion period deadlinecould easily yield \successes" th<strong>at</strong> extended beyond the qualic<strong>at</strong>ion period due todecision delay by potential employers, or other factors.Taken together, the results presented in Figure 6.1 do not seem to lend muchsupport to either the loc<strong>at</strong>ion shift, or to the loc<strong>at</strong>ion-scale shift, hypotheses <strong>of</strong> theconventional regression model. In the former case we would expect to see plots th<strong>at</strong>appeared essentially constantin while in the l<strong>at</strong>ter, we expect to see plots th<strong>at</strong> mimicthe shape <strong>of</strong> the intercept plot. Neither <strong>of</strong> these expect<strong>at</strong>ions are fullled. However,


Roger Koenker and Zhijie Xiao 27as we have emphasized earlier, it is crucial to be able evalu<strong>at</strong>e these impressions bymore formal st<strong>at</strong>istical methods, a task th<strong>at</strong> is undertaken in the next subsection.6.3. Inference from the Quantile Regression Process. To illustr<strong>at</strong>e our proposedinference str<strong>at</strong>egy we have decomposed the test <strong>of</strong> the loc<strong>at</strong>ion scale shift hypothesisbased on the full model represented in Figure 6.1, into several intermedi<strong>at</strong>esteps. In each <strong>of</strong> these steps we present results for only a subset <strong>of</strong> eight selectedcovari<strong>at</strong>e eects in an eort to conserve space, but all 15 covari<strong>at</strong>e eects are handledin an identical fashion. In Figure 6.2 we present, for each <strong>of</strong> our selected covari<strong>at</strong>es,the prediction <strong>of</strong> the process ^ i () based on the regression onto the estim<strong>at</strong>ed \interceptprocess", ^ 1 () as indic<strong>at</strong>ed by (5.1). Each <strong>of</strong> the tted curves is based on leastsquares estim<strong>at</strong>ion using the 301 estim<strong>at</strong>ed points <strong>of</strong> the quantile regression processfor each coordin<strong>at</strong>e. The solid lines in these panels are the same as those appearingin the previous gure; the dotted lines represents the tted curve. With the possibleexception <strong>of</strong> the recall eect, none <strong>of</strong> these ts look very compelling, but <strong>at</strong> this stagewe are already deeply mired in the Durbin problem and so it is dicult to judge thesignicance <strong>of</strong> departures from the tted rel<strong>at</strong>ionships.Taking the residuals from the panels <strong>of</strong> Figure 6.2, and standardizing by theCholesky decomposition <strong>of</strong> their (inverse) covariance m<strong>at</strong>rix yields the parametricquantile regression process, ^v n (). It is misleading, <strong>of</strong> course, to associ<strong>at</strong>e the coordin<strong>at</strong>es<strong>of</strong> this process with the original labeling <strong>of</strong> the coordin<strong>at</strong>es <strong>of</strong> ^(), sincethe m<strong>at</strong>rix transform<strong>at</strong>ion <strong>of</strong> the process mixes the coordin<strong>at</strong>es thoroughly. Had wespecied hypothetical values for the coecients r<strong>at</strong>her than estim<strong>at</strong>ing them for Figure6.2, we could <strong>of</strong> course tre<strong>at</strong> the resulting process as a vector <strong>of</strong> independentBrownian bridges under the null. However, the eect <strong>of</strong> the estim<strong>at</strong>ion is to distortthe variability <strong>of</strong> the process, as we have seen in Section 3. At this point we estim<strong>at</strong>ethe function _g and perform the martingale transform<strong>at</strong>ion on each slope coordin<strong>at</strong>e.The transform<strong>at</strong>ion is applied on the restricted subinterval, [ 0 ; 1 ], as described <strong>at</strong>the end <strong>of</strong> Section 4.1, yielding the new process, ~v n (), The transformed coordin<strong>at</strong>es<strong>of</strong> this process are, under the null hypothesis, asymptotically, independent Brownianmotions. We consider the test st<strong>at</strong>istic,K n =supjj~v n () , ~v n ( 0 )jj= p 1 , 02Twhichtakes the value 112.23. Here T =[:25;:75] so there is an additional .05 trimmingto mollify the extreme behavior <strong>of</strong> the transform<strong>at</strong>ion in the tails. The critical valuefor this test is 16.00, employing the `1 norm, so the loc<strong>at</strong>ion-scale-shift hypothesis isdecisively rejected.It is <strong>of</strong> some independentinterest to investig<strong>at</strong>e which <strong>of</strong> the coordin<strong>at</strong>es contributemost to the joint signicance <strong>of</strong> our K n st<strong>at</strong>istic. This inquiry is fraught with all theusual objections since the coordin<strong>at</strong>es are not independent, but we plunge ahead, nevertheless.In place <strong>of</strong> the joint hypothesis we can consider univari<strong>at</strong>e sub-hypotheses


28 Inference on the Quantile Regression ProcessFigure 6.2. Quantile Regression Process for Log Dur<strong>at</strong>ion ModelTre<strong>at</strong>ment-0.15 -0.05 0.0Female0.0 0.05 0.10 0.150.2 0.3 0.4 0.5 0.6 0.7 0.80.2 0.3 0.4 0.5 0.6 0.7 0.8Black-<strong>1.</strong>0 -0.6 -0.2Hispanic-0.8 -0.6 -0.4 -0.20.2 0.3 0.4 0.5 0.6 0.7 0.80.2 0.3 0.4 0.5 0.6 0.7 0.8N-Dependents0.0 0.04 0.08Recall Effect0.0 0.5 <strong>1.</strong>00.2 0.3 0.4 0.5 0.6 0.7 0.80.2 0.3 0.4 0.5 0.6 0.7 0.8Young Effect-0.35 -0.25 -0.15 -0.05Old Effect0.05 0.15 0.250.2 0.3 0.4 0.5 0.6 0.7 0.80.2 0.3 0.4 0.5 0.6 0.7 0.8<strong>of</strong> the form, i () = i + i 1 ()


Roger Koenker and Zhijie Xiao 29for each \slope" coecient. In eect, this approach replaces the m<strong>at</strong>rix standardiz<strong>at</strong>ionused for the joint test by a scalar standardiz<strong>at</strong>ion. The martingale transform<strong>at</strong>ionis then applied just as in the previous case. Now, because there is no m<strong>at</strong>rix standardiz<strong>at</strong>ionthe original labeling <strong>of</strong> the coordin<strong>at</strong>es is meaningful. In Table 6.1 wereport the test st<strong>at</strong>istics,K ni =supj~v ni () , ~v ni ( 0 )j= p 1 , 02Tfor each <strong>of</strong> the covari<strong>at</strong>es. Eects for the 5 quarters <strong>of</strong> entry are not reported. Thecritical values for these coordin<strong>at</strong>ewise tests are <strong>1.</strong>923 <strong>at</strong> .05, and 2.420 <strong>at</strong> .01, as givenin Appendix B, so the tre<strong>at</strong>ment, race, gender and age eects are highly signicant.Variable Loc<strong>at</strong>ion Scale Shift Loc<strong>at</strong>ion ShiftTre<strong>at</strong>ment 5.41 5.48Female 4.47 4.42Black 5.77 22.00Hispanic 2.74 2.00N-Dependents 2.47 2.83Recall Eect 4.45 16.84Young Eect 3.42 3.90Old Eect 6.81 7.52Durable Eect 3.07 2.83Lusd Eect 3.09 3.05Joint Eect 112.23 449.83Table 6.<strong>1.</strong> Tests <strong>of</strong> the Loc<strong>at</strong>ion-Scale-Shift HypothesisAlso reported in Table 6.1 are the corresponding test st<strong>at</strong>istics for the pure loc<strong>at</strong>ionshifthypothesis. Not surprisingly, we nd th<strong>at</strong> the more restrictive hypothesis <strong>of</strong>constant i () eects is considerably less plausible than the loc<strong>at</strong>ion scale hypothesis.The joint test st<strong>at</strong>istic is now, 449.83, with .01 critical value <strong>of</strong> 16.00, and all <strong>of</strong>the reported covari<strong>at</strong>es eects are signicant <strong>at</strong>level .05, with the exception <strong>of</strong> thehispanic eect.7. ConclusionWh<strong>at</strong> should we conclude from this exercise? The linear loc<strong>at</strong>ion shift and loc<strong>at</strong>ionscaleshift models are very elegant and convenient abstractions for many st<strong>at</strong>isticalpurposes. However, they also clearly place very stringent restrictions on the way th<strong>at</strong>covari<strong>at</strong>es are permitted to inuence the conditional distribution <strong>of</strong> the response variable.In our unemployment dur<strong>at</strong>ion applic<strong>at</strong>ion the loc<strong>at</strong>ion-scale shift hypothesismay be viewed as a generalized form <strong>of</strong> the familiar acceler<strong>at</strong>ed failure time model inwhich the scale <strong>of</strong> the response distribution responds linearly to the covari<strong>at</strong>es. This


30 Inference on the Quantile Regression Processspecic<strong>at</strong>ion is decisively rejected by the d<strong>at</strong>a from the Pennsylvania experiments.Not only the tre<strong>at</strong>ment eect <strong>of</strong> the bonus payment, but many other <strong>of</strong> the covari<strong>at</strong>esappear to aect the conditional distribution <strong>of</strong> unemployment dur<strong>at</strong>ion in ways th<strong>at</strong>are poorly approxim<strong>at</strong>ed either by pure loc<strong>at</strong>ion and/or scale shifts. One consequence<strong>of</strong> the proposed methods <strong>of</strong> inference, it maybehoped,would be a gre<strong>at</strong>er willingnessto explore more exible models for covari<strong>at</strong>e eects in a wide variety <strong>of</strong> econometricmodels.Pro<strong>of</strong> <strong>of</strong> Theorem 1 Note th<strong>at</strong>R ^() , r , () =RAppendix A. Pro<strong>of</strong>sh ^() , ()i+ R() , r , ():Under Assumption A.3, R() , r , () =()= p n; thusR ^()i, r , () =Rh^() , () + ()= p n:Under Assumptions A.1 and A.2, by Theorem 1 <strong>of</strong> Gutenbrunner and Jureckova(1992), we have,uniformly for 2T;p h in ^() , () ) 1'0() H,1 0 J1=2 0 v0()where v0() is a standardized p-dimensional Brownian bridge process, and '0() =f0(F ,1 0 ()).Thusv n () = p n'0()[RR > ] ,1=2 [R ^() , r , ()]= '0()[RR > ] ,1=2 R p h in ^() , () + '0()[RR > ] ,1=2 ()) v0()+():Pro<strong>of</strong> <strong>of</strong> Corollary 1 We have,v n () = p n' n ()[R n R > ] ,1=2 [R ^() , r , ()]Following Bai (1998), we observe th<strong>at</strong>= p n'0()[RR > ] ,1=2 [R ^() , r , ()]+[' n () , '0()] [R n R > ] ,1=2p n[R ^() , r , ()]h+'0() [R n R > ] ,1=2 , [RR > ] ,1=2i p n[R ^() , r , ()]:[R n R > ] ,1=2 , [RR > ] ,1=2 =[R n R > ] ,1=2 n [RR > ] 1=2 , [R n R > ] 1=2o [RR > ] ,1=2 ;and [R n R > ] 1=2 = R ^H,1 0 J1=2 0 ;[RR > ] 1=2 , [R n R > ] 1=2 = R[H ,1 0 , ^H,1 0 ]J1=2 = R ^H,1 0 [ ^H0 , H0]H ,1 0 J1=2 :Under Assumption A.4,[' n () , '0()] [R n R > ] ,1=2p n[R ^() , r , ()] = o p (1);h'0() [R n R > ] ,1=2 , [RR > ] ,1=2i p n[R ^() , r , ()] = op (1);


Roger Koenker and Zhijie Xiao 31thusv n () = p n' n ()[R n R > ] ,1=2 [R ^() , r , ()]= p n'0()[RR > ] ,1=2 [R ^() , r , ()] + o p (1)) v0()+():Pro<strong>of</strong> <strong>of</strong> Theorem 2 We may write,^v n () = p n'0()[R n R > n ] ,1=2 [R n ^() , rn , ()]= '0()[R n R > n ] ,1=2p n[R ^() , r , ()]+'0()[R n R > n ] ,1=2p n[r n , r]+'0()[R n R > n ] ,1=2p n[R n , R] ^()= '0()[RR > ] ,1=2p n[R ^() , r , ()]+'0()[RR > ] ,1=2p n[r n , r]+'0()[RR > ] ,1=2p n[R n , R]()+o p (1)Since () = + F ,1 ();^v n () = '0()[RR > ] ,1=2p n[R ^() , r , ()]n+'0() [RR > ] ,1=2p n[r n , r]+[RR > ] ,1=2p on[R n , R]+'0()F ,1 ()[RR > ] ,1=2p n[R n , R]+o p (1)= v n ()+Z > n ()+o p(1)where () =('0(); '0()F ,1 ()) > andZ n = [RR > ] ,1=2p n[r n , r]+[RR > ] ,1=2p n[R n , R]; [RR > ] ,1=2p n[R n , R] > = O p (1):And thus by Theorem 1,^v n () , Z > n () ) v0()+():Pro<strong>of</strong> <strong>of</strong> Corollary 2 Similar to th<strong>at</strong> <strong>of</strong> Corollary <strong>1.</strong>Pro<strong>of</strong> <strong>of</strong> Theorem 3 By Theorem 2,Denote the transform<strong>at</strong>ion based on _g as^v n () =v0()+Z > n ()+()+o p (1):Q g (h()) = h() ,Since Q g is a linear oper<strong>at</strong>or, we haveZ 0_g(s) > C(s) ,1 Z 1s_g(r)dh(r) ds;ev n () > = Q g^v n () > = Q g v0() > + Q g () > Z n + Q g () > + o p (1):By construction, Q g (() > ) = 0, and by Khmaladze (1981), Q g v0() > ) w0() > ; where w0 is a q-vari<strong>at</strong>e standard Brownian motion. ThusUnder the null hypothesis,sup2Tev n () ) w0()+e():kev n ()k )sup kw0()k :2T


32 Inference on the Quantile Regression ProcessPro<strong>of</strong> <strong>of</strong> Corollary 3 The pro<strong>of</strong> <strong>of</strong> this Corollary follows Bai (1998). Denote the transform<strong>at</strong>ionbased on _g n asZ Q gn (^v n () > )=^v n () > , _g n (s) > C n (s) ,1 Z 1_g n (r)d^v n () > ds:0sNoticing th<strong>at</strong>^v n () = p n' n ()[R n n R > n ] ,1=2 [R n ^() , rn , ()] = v n ()+Z > n n ()+o p (1)where Z n is an O p (1) quantity independent <strong>of</strong>, andby construction, Q gn (g n ) = 0. Thus we haveZ ^v n () > ,_g n (s) > C n (s) ,1 Z 1_g n (r)d^v n (r) > ds0sZ = v n () > ,_g n (s) > C n (s) ,1 Z 1_g n (r)dv n (r) > ds + o p (1):0sBecause _g n (r) is a consistent estim<strong>at</strong>or <strong>of</strong> _g(r) uniformly on r 2T =["; 1 , "]; we have, for all s 2T C(s),1 Z 1= _g(v)_g(v) dv,1 Z 1 > _g(v)_g(v) dv,1 (A.1)> < 1;and(A.2)s Cn (s) ,1 Z 1 ==sZ 1Z 11,"1,"1," ,1 _g n (v)_g n (v) > dv ,1 _g n (v)_g n (v) > dv_g(v)_g(v) > dv ,1 + o p (1) < 1:By assumption A.7, (A.1), and (A.2), we haveZ Z _g n (s) > C n (s) ,1 1[_g n (r) , _g(r)] dv n (r) > ds = o p (1);0sZ _gn (s) > , _g(s) > Z C(s) ,1 1_g(r)dv n (r) > ds = o p (1):0sAlso notice th<strong>at</strong>, under Assumption A.7, for all s 2T ,Z 1 (A.3)C(s) , C n (s) = _g(v)_g(v) > , _g n (v)_g n (v) > dv = o p (1);sthus, by (A.3), (A.1), and (A.2),Z _g n (s) > C n (s) ,1 , C(s) ,1 Z 1_g(r)dv n (r) > ds0sZ = _g n (s) > C n (s) ,1 [C(s) , C n (s)] C(s) ,1 Z 1_g(r)dv n (r) > ds0s= o p (1):


ThusandZ _g n (s) > C n (s) ,1 Z 1Roger Koenker and Zhijie Xiao 33 Z _g n (r)dv n (r) ds , _g(s) > C(s) ,1 Z 1_g(r)dv n (r) > ds0s0sZ = _g n (s) > C n (s) ,1 Z 1[_g n (r) , _g(r)] dv n (r) > ds0sZ + _g n (s) > C n (s) ,1 , C(s) ,1 Z 1_g(r)dv n (r) > ds0sZ + _gn (s) > , _g(s) > C(s) ,1 Z 1_g(r)dv n (r) > ds0s= o p (1);v n () > ,and the result follows.Z 0= v n () > ,_g n (s) > C n (s) ,1 Z 1_g n (r)dv n (r) > dsZ 0sg(s) > C(s) ,1 Z 1g(r)dv n (r) > ds + o p (1);sAppendix B. Asymptotic Critical ValuesLike many other Kolmogorov-Smirnovtype tests (see, e.g. Andrews (1993), the limiting distributionsup 2T kw0()k is dependent on the norm jjjj, the pre-specied T and the dimension parameterq. Notice th<strong>at</strong> the transform<strong>at</strong>ion is generally unstable in the extreme right tails, and the uniformconvergence <strong>of</strong> existing estim<strong>at</strong>ors <strong>of</strong> the density and score (f(F ,1 (s)) and f 0 =f(F ,1 (s))) usuallyrequires th<strong>at</strong> T be bounded away from zero and one, we consider a subset <strong>of</strong> [0; 1] whose closure liesin (0; 1):We calcul<strong>at</strong>ed the 1%, 5%, and 10% critical values for the test st<strong>at</strong>istic sup 2T kev n ()k basedon simul<strong>at</strong>ions where the Brownian motion was approxim<strong>at</strong>ed by a Gaussian random walk, usinga sample size n = 2000 and 20; 000 replic<strong>at</strong>ions. For the norm kk, we use the `1 norm for a q-dimensional vector x; kxk = P qj=1 jx jj. Table 1 covers T =["; 1,"] for " =0:05; 0:1; 0:15, 0:2, 0:25,0:3; and q =1; 2;::::::; 20. Although conventionally we consider symmetric intervals T =["; 1 , "]for some small numbers ", amuch wider range <strong>of</strong> intervals T may be considered for the proposedtests. Critical values based other choices <strong>of</strong> the interval T and the dimension parameter q can besimilarly calcul<strong>at</strong>ed. Gauss programs are available from the authors upon request.


34 Inference on the Quantile Regression ProcessAsymptotic Critical Values" =0:05 " =0:1 " =0:151% 5% 10% 1% 5% 10% 1% 5% 10%p =1 2.721 2.140 <strong>1.</strong>872 2.640 2.102 <strong>1.</strong>833 2.573 2.048 <strong>1.</strong>772p =2 4.119 3.393 3.011 4.034 3.287 2.946 3.908 3.199 2.866p =3 5.350 4.523 4.091 5.267 4.384 3.984 5.074 4.269 3.871p =4 6.548 5.560 5.104 6.340 5.430 4.971 6.148 5.284 4.838p =5 7.644 6.642 6.089 7.421 6.465 5.931 7.247 6.264 5.758p =6 8.736 7.624 7.047 8.559 7.412 6.852 8.355 7.197 6.673p =7 9.876 8.578 7.950 9.573 8.368 7.770 9.335 8.125 7.536p =8 10.79 9.552 8.890 10.53 9.287 8.662 10.35 9.044 8.412p =9 1<strong>1.</strong>81 10.53 9.820 1<strong>1.</strong>55 10.26 9.571 1<strong>1.</strong>22 9.963 9.303p =10 12.91 1<strong>1.</strong>46 10.72 12.54 1<strong>1.</strong>17 10.43 12.19 10.85 10.14p =11 14.03 12.41 1<strong>1.</strong>59 13.58 12.10 1<strong>1.</strong>29 13.27 1<strong>1.</strong>77 10.98p =12 15.00 13.34 12.52 14.65 13.00 12.20 14.26 12.61 1<strong>1.</strong>86p =13 15.93 14.32 13.37 15.59 13.90 13.03 15.22 13.48 12.69p =14 16.92 15.14 14.28 16.52 14.73 13.89 16.12 14.34 13.48p =15 17.93 16.11 15.19 17.53 15.67 14.76 17.01 15.24 14.36p =16 18.85 16.98 16.06 18.46 16.56 15.65 17.88 16.06 15.22p =17 19.68 17.90 16.97 19.24 17.44 16.53 18.78 16.93 16.02p =18 20.63 18.83 17.84 20.21 18.32 17.38 19.70 17.80 16.86p =19 2<strong>1.</strong>59 19.72 18.73 2<strong>1.</strong>06 19.24 18.24 20.53 18.68 17.70p =20 22.54 20.58 19.62 22.02 20.11 19.11 2<strong>1.</strong>42 19.52 18.52" =0:2 " =0:25 " =0:31% 5% 10% 1% 5% 10% 1% 5% 10%p =1 2.483 <strong>1.</strong>986 <strong>1.</strong>730 2.420 <strong>1.</strong>923 <strong>1.</strong>664 2.320 <strong>1.</strong>849 <strong>1.</strong>602p =2 3.742 3.100 2.781 3.633 3.000 2.693 3.529 2.904 2.602p =3 4.893 4.133 3.749 4.737 4.018 3.632 4.599 3.883 3.529p =4 6.023 5.091 4.684 5.818 4.948 4.525 5.599 4.807 4.365p =5 6.985 6.070 5.594 6.791 5.853 5.406 6.577 5.654 5.217p =6 8.147 6.985 6.464 7.922 6.760 6.241 7.579 6.539 6.024p =7 9.094 7.887 7.299 8.856 7.611 7.064 8.542 7.357 6.832p =8 10.03 8.775 8.169 9.685 8.510 7.894 9.413 8.211 7.633p =9 10.90 9.672 9.018 10.61 9.346 8.737 10.27 9.007 8.400p =10 1<strong>1.</strong>89 10.52 9.843 1<strong>1.</strong>48 10.17 9.517 1<strong>1.</strong>15 9.832 9.192p =11 12.85 1<strong>1.</strong>35 10.66 12.48 10.99 10.28 12.06 10.62 9.929p =12 13.95 12.22 1<strong>1.</strong>48 13.54 1<strong>1.</strong>82 1<strong>1.</strong>11 12.96 1<strong>1.</strong>43 10.74p =13 14.86 13.09 12.31 14.34 12.66 1<strong>1.</strong>93 13.82 12.24 1<strong>1.</strong>51p =14 15.69 13.92 13.11 15.26 13.46 12.67 14.64 13.03 12.28p =15 16.55 14.77 13.91 16.00 14.33 13.47 15.46 13.85 13.05p =16 17.41 15.58 14.74 16.81 15.09 14.26 16.25 14.61 13.78p =17 18.19 16.43 15.58 17.59 15.95 15.06 17.04 15.39 14.54p =18 19.05 17.30 16.37 18.49 16.78 15.83 17.85 16.14 15.30p =19 19.96 18.09 17.17 19.40 17.50 16.64 18.78 16.94 16.05p =20 20.81 18.95 17.97 20.14 18.30 17.38 19.48 17.74 16.79


Roger Koenker and Zhijie Xiao 35Appendix A. Monte Carlo ResultsWe have conducted some limited Monte Carlo experiments to examine the nite sample performance<strong>of</strong> the proposed tests. In particular, we examine the eectiveness <strong>of</strong> the martingale transform<strong>at</strong>ionbased on the size and power properties <strong>of</strong> the tests. The following sample sizes wereconsidered in our experiment: n =100; 200; 300; 400; 500: These sample sizes were chosen becausethey represent the most relevant range <strong>of</strong> sample sizes in empirical analyses.First <strong>of</strong> all, to investig<strong>at</strong>e the eectiveness <strong>of</strong> the martingale transform<strong>at</strong>ion on quantile regressioninference, we examine the size and power properties <strong>of</strong> the infeasible version tests where the truedensity and score functions are used in the standardiz<strong>at</strong>ion and the martingale transform<strong>at</strong>ion. Westart with the heteroscedasticity test. The d<strong>at</strong>a were gener<strong>at</strong>ed from(A.1)y i = + x i + (x i )u i ;where x i and u i are iid N (0; 1) random vari<strong>at</strong>es and are mutually independent, =0; and =<strong>1.</strong>(x i )=0 + 1x i , 0 =<strong>1.</strong> We examined the empirical rejection r<strong>at</strong>es <strong>of</strong> the test for dierentchoices <strong>of</strong> sample sizes and 1 values, <strong>at</strong> 5% level <strong>of</strong> signicance. In constructing the test, we usedthe OLS estim<strong>at</strong>or for , b and the trunc<strong>at</strong>ion parameter value = 0:05 (i.e. T =[0:05; 0:95]).Since x i is a scalar, the limiting null distribution <strong>of</strong> the test st<strong>at</strong>istic is sup0:050:95 jW ()j : The5% level critical value is 2.14. For the choices <strong>of</strong> the heteroscedasticity parameter 1; we consider1 =0; 0:1; 0:2; 0:3; 0:5; 1; 2; 5: When 1 =0; the model is homoscedastic and the rejection r<strong>at</strong>esgive the empirical sizes. When 1 6= 0; the model is heteroscedastic and the rejection r<strong>at</strong>es deliverthe empirical powers. Table 1 reports the empirical rejection r<strong>at</strong>es for dierent values <strong>of</strong> 1 and n:Other values <strong>of</strong> the trunc<strong>at</strong>ion parameter were also tried and quantit<strong>at</strong>ively similar results wereobtained. These Monte Carlo results indic<strong>at</strong>e th<strong>at</strong>, given inform<strong>at</strong>ion on the density and score, themartingale transform<strong>at</strong>ion brings pretty good size and power to the proposed testing procedure innite sample.The remaining Monte-Carlo experiments are based on the even simpler two sample model, y 1i = 1 + 1u i ;i=1; :::::;n1;(A.2)y2i = 2 + 2v i ;i=1; :::::; n2;In particular, we considered the following two sets <strong>of</strong> parameter values(A.3)(A.4)Loc<strong>at</strong>ion Shift: 1 =1;2 =0;1 = 2 =1;Loc<strong>at</strong>ion-Scale Shift: 1 =1;2 =0;1 =2;2 =1;where u i ;v i are iid N (0; 1) random vari<strong>at</strong>es. When the parameters take the rst set <strong>of</strong> values, (A.2)represents a pure loc<strong>at</strong>ion shift model. The null hypothesis <strong>of</strong> a shift model can be tested by theprocedure given in Section 4.2. When the d<strong>at</strong>a is gener<strong>at</strong>ed from the second set parameters, (A.2)is a loc<strong>at</strong>ion-scale shift model. The loc<strong>at</strong>ion-scale hypothesis can be tested by the procedure givenin Section 4.<strong>1.</strong> Table 2 reports the empirical size <strong>of</strong> these tests for dierent combin<strong>at</strong>ions <strong>of</strong> n1 andn2. We can see th<strong>at</strong> the test has good size properties in nite samples. These Monte Carlo results,together with the results on the heteroscedasticity testinTable 1, conrm the eectiveness <strong>of</strong> themartingale transform<strong>at</strong>ion in quantile regression inference.The above Monte Carlo experiments use the true density and score. It is obviously also importantto evalu<strong>at</strong>e the eect <strong>of</strong> nonparametric nuisance parameter estim<strong>at</strong>ion on the performance <strong>of</strong> theproposed tests. In our next Monte Carlo experiments, we estim<strong>at</strong>ed F ,1 (s) and'0(s) using theapproach described in the text. For the score function _g, we employ the adaptive kernel estim<strong>at</strong>or<strong>of</strong> Portnoy and Koenker (1989).The kernel estim<strong>at</strong>ion procedures for these nuisance functions are nonparametric and thereforeobviously entail choices <strong>of</strong> bandwidth values. Unsuitable bandwidth selection can produce poor


36 Inference on the Quantile Regression Processestim<strong>at</strong>es. However, under broad conditions on the convergence r<strong>at</strong>e <strong>of</strong> the bandwidth parameters,the nonparametric estim<strong>at</strong>es are consistent and testing procedures using dierent bandwidth choicesare (rst order) asymptotically equivalent, although the nite sample performance <strong>of</strong> these testscan vary considerably with bandwidth choice. Extensive simul<strong>at</strong>ions have been conducted in theliter<strong>at</strong>ure to show the importance <strong>of</strong> bandwidth choice on estim<strong>at</strong>ion and testing procedure th<strong>at</strong> usenonparametric estim<strong>at</strong>es.It was anticip<strong>at</strong>ed th<strong>at</strong> the estim<strong>at</strong>ion <strong>of</strong> '0(s)would exert important inuence on the nite sampleperformance <strong>of</strong> our tests. This is conrmed in the simul<strong>at</strong>ions. For this reason, we pay particular<strong>at</strong>tention to the bandwidth choice in density estim<strong>at</strong>ion. Hall and She<strong>at</strong>her (1988) suggested abandwidth rule based on Edgeworth expansion for studentized quantiles. This bandwidth is <strong>of</strong> ordern ,1=3 and we denote it as h HS . Another bandwidth selection has been proposed by Bonger (1975)is <strong>of</strong> order n ,1=5 . We denote it by h B . Wehave considered both <strong>of</strong> these bandwidth choices forour tests. In addition, notice th<strong>at</strong> the Bonger bandwidth is eventually much larger than the Halland She<strong>at</strong>her bandwidth, we have also considered the following bandwidth choice which takes valuesbetween h HS and h B ; it is denoted as h , h = h B , where h B is the Bonger bandwidth and isa scalar. We report the results for the case =0:6 here. The score function was estim<strong>at</strong>ed by themethod <strong>of</strong> Portnoy and Koenker (1989) and we simply choose the Silverman (1986) bandwidth.Tables 3a, 3b, 3c report the Monte Carlo results for the heteroscedasticity test with dierentbandwidth selections and Tables 4a, 4b, 4c give the result <strong>of</strong> the loc<strong>at</strong>ion-scale test. The Monte Carloevidence indic<strong>at</strong>es th<strong>at</strong> the bandwidth choice does have animportant inuence on the nite sampleperformance <strong>of</strong> these tests. It also shows th<strong>at</strong>, by choosing appropri<strong>at</strong>e bandwidth, the proposedtests have reasonable size and power properties. In general, we found over-rejection when the Hall-She<strong>at</strong>her bandwidth was used. For the other two bandwidth, h and h B , the rel<strong>at</strong>ive performancedepends on which test we consider. For the heteroscedasticity test,we found under-rejection whenthe Bonger bandwidth was used. In this test, <strong>at</strong> least for the model and the nonparametric methodsused here, the bandwidth choice h provides pretty good nite sample performance. However, for theloc<strong>at</strong>ion-scale test, h tends to over-reject and h B seems to be a rel<strong>at</strong>ively better bandwidth choice.To focus our <strong>at</strong>tention on the eect <strong>of</strong> ' n (s), we have also conducted Monte Carlo experimentswhere only the density function is estim<strong>at</strong>ed (and use the true score function), the Monte Carloresults reconrmed our ndings on the three bandwidth choices.TABLE 1: Size and Power <strong>of</strong> the Heteroskedasticity Test (Trunc<strong>at</strong>ed, =0:05)Size Powern 1 =0 1 =0:1 1 =0:2 1 =0:3 1 =0:5 1 =1 1 =2 1 =5100 0.006 0.134 0.377 0.729 0.974 0.981 0.990 0.999200 0.054 0.269 0.77 0.977 0.999 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000300 0.052 0.383 0.931 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000400 0.052 0.549 0.989 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000500 0.052 0.616 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000TABLE 2: Applic<strong>at</strong>ion to The Two-Sample ModelsCase 1: Loc<strong>at</strong>ion ShiftCase 2: Loc<strong>at</strong>ion-Scale Shift1 =1;2 =0;1 = 2 =1 1 =1;2 =0;1 =2;2 =1n1 n2 size n1 n2 size n1 n2 size n1 n2 size100 100 0.074 100 200 0.060 100 100 0.153 100 200 0.179150 150 0.080 100 300 0.086 150 150 0.158 100 300 0.196200 200 0.064 150 300 0.055 200 200 0.169 150 300 0.175250 250 0.054 200 300 0.056 250 250 0.172 200 300 0.183


Roger Koenker and Zhijie Xiao 37TABLE 3(The Heteroskedasticity Test. Bandwidth in Density Estim<strong>at</strong>ion:Kernel Estim<strong>at</strong>ion <strong>of</strong> Score with Several Bandwidths)h HS h B h Size Power Size Power Size Powern 1 =0 :2 :5 1 1 =0 :2 :5 1 1 =0 :2 :5 1100 0.45 0.723 0.99 <strong>1.</strong>000 0.009 0.053 0.197 0.545 0.035 0.211 0.755 0.820200 0.21 0.877 <strong>1.</strong>000 <strong>1.</strong>000 0.013 0.109 0.772 0.949 0.041 0.406 0.990 0.989300 0.195 0.952 <strong>1.</strong>000 <strong>1.</strong>000 0.019 0.229 0.985 0.992 0.043 0.665 <strong>1.</strong>000 <strong>1.</strong>000400 0.186 0.995 <strong>1.</strong>000 <strong>1.</strong>000 0.023 0.412 0.997 0.998 0.043 0.809 <strong>1.</strong>000 <strong>1.</strong>000500 0.173 <strong>1.</strong>000 <strong>1.</strong>000 <strong>1.</strong>000 0.029 0.565 <strong>1.</strong>000 <strong>1.</strong>000 0.045 0.911 <strong>1.</strong>000 <strong>1.</strong>000TABLE 4(Loc<strong>at</strong>ion-Scale Test. Bandwidth in Density Estim<strong>at</strong>ion:Kernel Estim<strong>at</strong>ion <strong>of</strong> Score with Several Bandwidths)SizeSizen1 n2 h HS h B h n1 n2 h HS h B h 100 100 0.589 0.038 0.098 50 50 0.616 0.028 0.063150 150 0.538 0.079 0.112 75 75 0.603 0.033 0.086200 200 0.511 0.079 0.123 250 250 0.507 0.065 0.126500 500 0.406 0.105 0.145 300 300 0.456 0.078 0.135ReferencesAbadie, A. (2000): \Bootstrap test for the eect <strong>of</strong> a tre<strong>at</strong>ment on the Distribution <strong>of</strong> an outcomevariable," NBER Technical Report 26<strong>1.</strong>Abadie, A., J. Angrist, and G. Imbens (1999): \Instrumental Variables Estim<strong>at</strong>ion <strong>of</strong> QuantileTre<strong>at</strong>ment Eects," preprint.Andrews, D. W. K. (1993): \Tests for Parameter Instability and Structural Change With UnknownChange Point," Econometrica, 61, 821{856.Andrews,D.W.K.(1997): \A Conditional Kolmogorov Test," Econometrica, 65, 1097{1128.Bai, J. (1998): \Testing parametric conditional distributions <strong>of</strong> dynamic models," preprint.Bickel, P. J. (1969): \A Distribution Free Version <strong>of</strong> the SmirnovTwo Sample Test in the p-vari<strong>at</strong>eCase," The Annals <strong>of</strong> M<strong>at</strong>hem<strong>at</strong>ical St<strong>at</strong>istics, 40, 1{23.(1982): \On Adaptive Estim<strong>at</strong>ion," Ann. <strong>of</strong> St<strong>at</strong>ist., 10, 647{67<strong>1.</strong>Bilias, Y., S. Chen, and Z. Ying (1999): \Simple resampling methods for censored regressionquantiles," J. <strong>Econometrics</strong>, forthcoming.B<strong>of</strong>inger, E. (1975): \Estim<strong>at</strong>ion <strong>of</strong> a density function using order st<strong>at</strong>istics," Australian J. <strong>of</strong>St<strong>at</strong>istics, 17, 1{7.Corson, W., P. Decker, S. Dunstan, and S. Keransky (1992): \Pennsylvania `reemploymentbonus' demonstr<strong>at</strong>ion: Final Report," Unemployment Insurance Occasional Paper, 92.Cox, D. D. (1985): \A Penalty Method for Nonparametric Estim<strong>at</strong>ion <strong>of</strong> the Logarithmic Deriv<strong>at</strong>ive<strong>of</strong> a Density Function," Annals <strong>of</strong> Institute <strong>of</strong> M<strong>at</strong>h. St<strong>at</strong>., 37, 271{288.Darling, D. (1955): \The Cramer-Smirnov test in the parametric case," Ann. <strong>of</strong> St<strong>at</strong>ist., 26, 1{20.Doksum, K. (1974): \Empirical Probability Plots and St<strong>at</strong>istical Inference for Nonlinear Modelsin the Two-sample Case," Ann. <strong>of</strong> St<strong>at</strong>ist., 2,267{277.


38 Inference on the Quantile Regression ProcessDoob, J. (1949): \A Heuristic Approach to the Kolmogorov-Smirnov theorems," Annals <strong>of</strong> M<strong>at</strong>h.St<strong>at</strong>., 20, 393{403.Durbin, J. (1973): Distribution theory for tests based on the sample distribution function. SIAM,Philadelphia.Fleming, T., and D. Harrington (1991): Counting Processes and Survival Analysis. Wiley.Gutenbrunner, C., and J. Jureckova (1992): \Regression quantile and regression rank scoreprocess in the linear model and derived st<strong>at</strong>istics," Ann. St<strong>at</strong>ist., 20, 305{330.Gutenbrunner, C., J. Jureckova, R. Koenker, and S. Portnoy (1993): \Tests <strong>of</strong> linearhypotheses based on regression rank scores," J. <strong>of</strong> Nonparametric St<strong>at</strong>istics, 2,307{33.Hahn, J. (1995): \Bootstrapping quantile regression models," Econometric Theory, 11, 105{12<strong>1.</strong>Hajek, J. (1962): \Asymptotically Most Powerful Rank-order Tests," The Annals <strong>of</strong> M<strong>at</strong>hem<strong>at</strong>icalSt<strong>at</strong>istics, 33, 1124{1147.Hall, P., and S. She<strong>at</strong>her (1988): \On the distribution <strong>of</strong> a studentized quantile," J. RoyalSt<strong>at</strong>. Soc. (B), 50, 381{39<strong>1.</strong>He, X., and F. Hu (1999): \Markov chain marginal bootstrap," preprint.Heckman, J., J. Smith, and N. Clements (1997): \Making the most our <strong>of</strong> Programme evalu<strong>at</strong>ionand social experiments: Accounting for heterogeneity in programme impacts," Review <strong>of</strong>Economic Studies, 64, 487{535.Horowitz, J. (1998): \Bootstrap methods for median regression models," Econometrica, 66, 1327{1352.Hsieh, D. A., and C. F. Manski (1987): \Monte Carlo Evidence on Adaptive Maximum LikelihoodEstim<strong>at</strong>ion <strong>of</strong> a Regression," Ann. <strong>of</strong> St<strong>at</strong>ist., 15, 541{55<strong>1.</strong>Kac, M., . Kiefer, and J. Wolfowitz (1955): \On tests <strong>of</strong> normality and other tests <strong>of</strong> goodness<strong>of</strong> t based on distance methods," Ann.<strong>of</strong>St<strong>at</strong>ist., 26, 189{21<strong>1.</strong>Khmaladze, E. V. (1981): \Martingale Approach in the Theory <strong>of</strong> Goodness-<strong>of</strong>-t Tests," Theory<strong>of</strong> Prob. and its Apps, 26, 240{257.Kiefer, J. (1959): \K-sample analogues <strong>of</strong> the Kolmogorov-Smirnov and Cramer-von Mises tests,"Ann. M<strong>at</strong>h. St<strong>at</strong>ist, 30, 420{447.Koenker, R., and G. Bassett (1978): \Regression Quantiles," Econometrica, 46, 33{50.(1982): \Robust tests for heteroscedasticity based on regression quantiles," Econometrica,50, 43{6<strong>1.</strong>Koenker, R., and Y. Bilias (2001): \Quantile Regression for Dur<strong>at</strong>ion D<strong>at</strong>a: A Reappraisal <strong>of</strong>the Pennsylvania Reemployment Bonus Experiments," Empirical Economics, forthcoming.Koenker, R., and J. Machado (1999): \Goodness <strong>of</strong> t and rel<strong>at</strong>ed inference processes forquantile regression," J. <strong>of</strong> Am. St<strong>at</strong>. Assoc., 94, 1296{1310.Koenker, R., and S. Portnoy (1987): \L-estim<strong>at</strong>ion for Linear Models," J. <strong>of</strong> Am. St<strong>at</strong>. Assoc.,82, 851{857.Koul, H., and W. Stute (1999): \Nonparametric Model Checks for Time Series," Ann. <strong>of</strong>St<strong>at</strong>ist., 27, 204{236.Koul, H. L. (1992): Weighted Empiricals and Linear Models. Institute <strong>of</strong> M<strong>at</strong>hem<strong>at</strong>ical St<strong>at</strong>istics,Hayward, California.Lehmann, E. (1974): Nonparametrics: St<strong>at</strong>istical Methods based onRanks. Holden-Day, SanFrancisco.McFadden, D. (1989): \Testing for stochastic dominance," in Studies in the Economics <strong>of</strong> Uncertaintyin honor <strong>of</strong> Josef Hadar, ed. by T.Fomby, and T. Seo. Springer: New York.Meyer, B. (1995): \Lessons from the U.S. unemployment insurance experiments," Journal <strong>of</strong>Economic Liter<strong>at</strong>ure, 33, 191{13<strong>1.</strong>


Roger Koenker and Zhijie Xiao 39(1996): \Wh<strong>at</strong> have we learned from the <strong>Illinois</strong> reemploymentbonus experiment," Journal<strong>of</strong> Labor Economics, 14, 26{5<strong>1.</strong>Nair, V. N. (1982): \Q-Q Plots With Condence Bands for Comparing Several Popul<strong>at</strong>ions,"Scandinavian Journal <strong>of</strong> St<strong>at</strong>istics, 9,193{200.Ng, P. T. (1994): \Smoothing Spline Score Estim<strong>at</strong>ion," SIAM J. Scientic and St<strong>at</strong>istical Computing,15, 1003{1025.Oja, H. (1981): \On Loc<strong>at</strong>ion, Scale, Skewness and Kurtosis <strong>of</strong> Univari<strong>at</strong>e Distributions," Scand.J. <strong>of</strong> St<strong>at</strong>., 8,154{168.Portnoy, S., and R. Koenker (1989): \Adaptive L-estim<strong>at</strong>ion <strong>of</strong> linear models," Ann. St<strong>at</strong>ist,,17, 362{38<strong>1.</strong>Prakasa Rao, B. (1999): Semimartingales and their St<strong>at</strong>istical Inference. Chapman-Hall, London.Romano, J. P. (1988): \A Bootstrap Revival <strong>of</strong> Some Nonparametric Distance Tests," Journal <strong>of</strong>the American St<strong>at</strong>istical Associ<strong>at</strong>ion, 83, 698{708.Serfling, R. (1980): Approxim<strong>at</strong>ion Theorems <strong>of</strong> M<strong>at</strong>hem<strong>at</strong>ical St<strong>at</strong>istics. New York: Wiley.She<strong>at</strong>her, S., and J. Maritz (1983): \An estim<strong>at</strong>e <strong>of</strong> the asymptotic standard error <strong>of</strong> thesample medi an," Australian J. <strong>of</strong> St<strong>at</strong>istics, 25, 109{122.Shorack, G., and J. Wellner (1986): Empirical Processes with Applic<strong>at</strong>ions to St<strong>at</strong>istics. Wiley,New York.Siddiqui, M. (1960): \Distribution <strong>of</strong> Quantiles from a Bivari<strong>at</strong>e Popul<strong>at</strong>ion," Journal <strong>of</strong> Research<strong>of</strong> the N<strong>at</strong>ional Bureau <strong>of</strong> Standards, 64B, 145{150.Silverman, B. (1986): Density Estim<strong>at</strong>ion for St<strong>at</strong>istics and D<strong>at</strong>a Analysis. Chapman-Hall: NewYork.Stigler, S. (1980): \Stigler's Law <strong>of</strong>Eponymy," Transactions <strong>of</strong> the New York Academy <strong>of</strong> Sciences,39, 147{157.Stute, W., H. Thies, and L. X. Zhu (1998): \Model Checks for Regression: An Innov<strong>at</strong>ionProcess Approach," Ann. <strong>of</strong> St<strong>at</strong>ist., 26, 1916{1934.van der Vaart, A. W. (1998): Asymptotic St<strong>at</strong>istics. Cambridge U. Press.Welsh, A. H. (1988): \Asymptotically Ecient Estim<strong>at</strong>ion <strong>of</strong> the Sparsity Function At aPoint,"St<strong>at</strong>. and Prob. Letters, 6, 427{432.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!