Queue length estimation from probe vehicle ... - ResearchGate

ARTICLE IN PRESS2 G. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxxtwo examples for an isolated intersection with fixed signal timing.The arrivals are assumed to follow a discrete distribution such asPoisson whereas the vehicles are assumed to queue vertically forsimplicity.It should be noted that there is a vast body of literature onqueues at signalized intersections. Since fixed cycle traffic light allowsa detailed analytical analysis, it has been studied by manyresearchers. One of the earliest studies is by Webster (1958) whogenerated relationships for the number of stops and delays by simulatingtraffic flow on a one-lane approach to an isolated signalizedintersection. In particular, the curve he fitted to the simulation resultshas been fundamental to traffic signal setting proceduressince its development. Miller (1963) found an approximation tothe average overflow queue for any arrival and departure distributions.Later, Newell (1965) derived an analytical approximation tothe mean queue length for general arrivals. McNeil (1968) deriveda formula for the expected delay and approximate mean of theoverflow queue length for general arrival process and constantdeparture rate.Other than the studies on the estimation of mean, there havebeen some attempts to obtain the probability function for thequeue length. Some researchers obtained the conditional probabilitydistribution of the overflow queue at the end of one periodgiven the queue length at the preceding period assuming homogenousPoisson arrival process (Haight, 1959; Heidemann, 1994;Mung et al., 1996). Yet some others derived the probability generatingfunction (pgf) of the stationary overflow queue (Darroch,1964) in the hope of obtaining the probability functions for delaysand queue lengths. Obtaining a probability function from a pgf involvesinverting the pgf function (Abate and Whitt, 1992), and thisinversion process is quite complicated and entails finding complexroots and numerical evaluations of parameters (Van Leeuwaarden,2006). Therefore, the applicability of this procedure is very limitedand complicated. Following similar methods presented in Haight(1959); Olszewski (1990) used Markov chains to obtain the probabilitydistribution of overflow queue, and developed a computerprogram that estimates the mean queue length and its varianceunder different conditions such as stationary and non-stationaryarrival processes, and variable service rates. In a recent paper,Van Zuylen and Viti (2007) also used Markov chains to modelthe dynamics of the probability distributions of the total queueand overflow queue.Lastly, it is interesting to note that the general idea of usingprobes to estimate the system state or system performance is notunique to vehicular traffic on roadways. In computer communicationnetworks, ‘‘probe packets” are sent from a source to one ormore receiver nodes in the network in order to deduce the qualityof service or performance (e.g., loss rate, delay) at the internalnodes or links (e.g., routers). In this technique, performance ofthe internal links/nodes is estimated by exploiting the correlationpresent in end-to-end (origin to destination) measurements obtainedfrom probe packets (e.g., Xi et al., 2006; Duffield, 2006).This article is organized as follows: Section 2 introduces theproblem statement and the notation. Section 3 describes the analyticalformulation for the queue length estimation problem. Theapplication of the formulation is presented first for a fixed-cycletraffic light without considering the overflow queue in Section 4.Section 5 addresses the application of the formulation when overflowqueue is also considered. Lastly, Section 6 summarizes thefindings and results.vehicles. The main objective is to estimate the total queue length,N, if the locations of probe vehicles in the queue are known. Technically,the goal is to determine the conditional expected value of Ngiven the probe information, and to evaluate the error when thisconditional expected value is used as the predictor of the actualN, where N is the total number of vehicles accumulated in thequeue at the end of red interval.It is assumed that the distance of probe vehicles from the stopbar can be measured by location tracking technologies. For example,today’s differential GPS systems can provide one to three meteraccuracy (USDOT, 2008). For the purpose of locating vehicles ona link, this accuracy would be sufficient. Once the position of avehicle waiting on the link is known (i.e., distance from the stopbar), the number of vehicles ahead of the subject vehicle can beestimated by assuming an average length/spacing per vehicle. Eventhough all vehicles are assumed to be identical in this paper, it ispossible to capture the impact of multiple vehicle classes by selectingthe average vehicle length to be representative of differentvehicle types. (The concept of using an average or effective vehiclelength is common in estimating density and speeds based on occupancy(i.e., percentage of time the detector is occupied/activated)measured by inductive loop detectors (e.g., Hellinga, 2002)). Anotherpossibility to model multiple vehicle classes is through specifyingarrival rates for each vehicle type. Even though incorporatingmultiple vehicle classes (e.g., busses, trucks) into the formulationpresented below is possible, this is avoided to simplify notationand presentation of the main ideas. Furthermore, this paper doesnot discuss information flow architecture and the exact nature ofdata processing needed to obtain the location of probes in thequeue. Albeit important, such details as determining whether aprobe vehicle is in the queue or not (i.e., waiting or moving), whichcan be determined based on its speed, acceleration and selectedthresholds, are not dealt with in this paper.For simplicity, the location of vehicles in queue will be measuredin terms of the number of vehicles. For example, in Fig. 1,there are three probe vehicles and the location of the last probe,L p , is 6. The value of N is estimated in real-time at the end of eachcycle based on the location of probe vehicles. Even though themodels are developed for vertical queues, this figure is includedto clarify the notation. In vertical queuing models, it is assumedthat vehicles do not occupy space and can accelerate and decelerateinstantaneously.The models developed in this paper are based on the assumptionthat only probe vehicle locations are observable. From a technologicalpoint of view, a system that can collect and use othertypes of data (e.g., arrival times of probes or joining back-of-queueevents) might also be envisioned. However, the scope of this paperis limited to those situations where only location data are used. Asit is demonstrated in the next section, the location of the last probein the queue L p is sufficient for estimating the queue length;neither locations of other probe vehicles nor the total number ofprobes vehicles in the queue N p are needed. It is assumedthat the percentage of probe vehicles in the traffic stream p isknown.N=72. Problem statement and notationFig. 1 shows a snapshot of traffic at a signalized intersection approachat the end of a red phase. Solid rectangles represent probeL p= 6Fig. 1. Snapshot of an intersection right before the red interval terminates.Please cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

ARTICLE IN PRESSG. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxx 3In order to estimate N, the conditional probability distributionof N given the probe locations is needed. The next section presentsthe general formulation whereas the following sections providetwo examples and some analyses. Even though the formulationin the next section can be generalized, the two example applicationsgiven in this paper are for a signal with fixed timing. Theapplication to actuated signals is a topic of future research.3. Analytical formulationAssuming that the proportion of probe vehicles p is known, therelationship between the total number of probe vehicles N p and thetotal number of all vehicles in the queue N can be written asfollows:N p ¼ XNi¼1y i where y i 2f0; 1g and Pðy i ¼ 1Þ ¼p: ð1ÞThe equation above asserts that every vehicle has equal probabilityof being a probe vehicle. It is assumed that the random variablesy 1 ,y 2 ,... are independent and independent of N. Both N and N pare two discrete random variables. The formulation here is for ageneral probability mass function (pmf) for N, which is denotedby P(N = n). Given N = n, it follows that the number of probe vehiclesin the queue has the following binomial distribution.PðN p ¼ n p jN ¼ nÞ ¼n p np ð1 pÞ n np ; 0 6 p 6 1;n pn p ¼ 0; 1; 2; ...; n:Given N = n and N p = n p , the probability distribution for the locationof last probe (L p ) can be derived by considering the number of possiblecombinations. It can be verified that the conditional probabilityof L p given n and n p is l p 1n p 1PðL p ¼ l p jN ¼ n; N p ¼ n p Þ¼ ;nn pfor l p ¼ n p ; n p þ 1; ...; n; n p P 1: ð3ÞThis equation can be explained through an example case illustratedin Fig. 1. From Fig. 1, N =7,N p = 3, and L p = 6. The sample space forthe ‘‘experiment” is the possible combinations of choosing n p probe7vehicles from n vehicles, which is equal to . The number of elementsin the event space is equal to the number of possible place-3ments of the remaining probe vehicles, other than the one atposition l p , into the previous slots since l p is fixed. In this example,5 slots are available to place 2 probes. Thus, the event probability isthe number of elements in the event space divided by the numberof elementsin the sample space, which is PðL p ¼ 6jn ¼ 7;n p ¼ 3Þ ¼ 5 7.2 3As mentioned before, the main purpose is to find the conditionaldistribution of the queue length given the location of lastprobe and the total number of probes, i.e., P(N = njL p = l p , N p = n p ).This conditional distribution can be written by using the Bayes’rule as follows:PðN ¼ njL p ¼ l p ; N p ¼ n p Þ¼ PðL p ¼ l p jN p ¼ n p ; N ¼ nÞPðN p ¼ n p jN ¼ nÞPðN ¼ nÞ: ð4ÞPðL p ¼ l p ; N p ¼ n p ÞBy the Total Probability LawPðL p ¼ l p ; N p ¼ n p Þ¼ X1k¼l pPðL p ¼ l p jN ¼ k; N p ¼ n p ÞPðN p¼ n p jN ¼ kÞPðN ¼ kÞ:ð2Þð5ÞThen (4) becomesPðN ¼ njL p ¼ l p ; N p ¼ n p Þ¼PðL p ¼ l p jN p ¼ n p ; N ¼ nÞPðN p ¼ n p jN ¼ nÞPðN ¼ nÞP 1k¼l pPðL p ¼ l p jN p ¼ n p ; N ¼ kÞPðN p ¼ n p jN ¼ kÞPðN ¼ kÞ : ð6ÞSubstituting the respective equations and after some simplifications,the following conditional distribution function for the queuelength is obtainedPðN ¼ njL p ¼ l p ; N p ¼ n p Þ¼ ð1 pÞn PðN ¼ nÞP 1k¼l pð1 pÞ k PðN ¼ kÞ ;for l p P 1:As it turns out, this probability only depends on the location of lastprobe l p and does not depend on the total number of probes in thequeue n p . It should be noted that Eq. (7) is valid for any probabilitydistribution of N. Even though the numerical examples presented inthe next sections are based on Poisson and Negative Binomial distributions,the formulation can be applied for any given P(N = n).Based on the above conditional probability distribution, the expectedqueue length can be easily computed as follows:EðNjL p ¼ l p Þ¼ X1¼ X1n¼l pnn¼l pnPðN ¼ njL p ¼ l p Þð7Þ½ð1 pÞŠ n PðN ¼ nÞP 1k¼l p½ð1 pÞŠ k PðN ¼ kÞ ; for l p P 1: ð8ÞThe equation above gives the mean of the random variable NjLp = lpwhich can be used as a predictor of the queue length given theprobe location information. For real-time applications, this equationcan be used to estimate the queue length, provided that P(N = n) isknown.In order to assess how accuracy of this estimated queue lengthchanges by the proportion of probe vehicles p, some additional resultsare obtained. First, the conditional variance of the queuelength (given l p ) can be written as follows:VARðNjL p ¼ l p Þ¼ X1n 2 ½ð1 pÞŠ n PðN ¼ nÞP 1n¼l p k¼l p½ð1 pÞŠ k PðN ¼ kÞ½EðNjL p ¼ l p ÞŠ 2 ; for l p P 1: ð9ÞSince the above variance depends on l p , a more general measurethat does not depend on l p is needed to determine the effects of pon accuracy. The error in the estimates, denoted by D, can be treatedas a random variable that is the difference between the actualqueue length N and its estimate:D ¼ N EðNjL p ¼ l p Þ: ð10ÞIf the expectation of both sides of this equation is taken, it can beseen that the expected error is zero. (Since E[E(NjL p = l p )] = E(N).)On the other hand, the variance of error is as follows:VarðDÞ ¼Var½N EðNjL p ¼ l p ÞŠ ¼ E½VarðNjL p ¼ l p ÞŠ: ð11ÞThe expected value on the right hand of this equation can be calculatedas follows:EðVarðNjL p ¼ l p ÞÞ ¼ X1l p¼0PðL p ¼ l p ÞVARðNjL p ¼ l p Þ:ð12ÞThis expected value in essence represents the weighted varianceover all possible values of l p . To calculate this weighted variance,the marginal probability distribution of L p is needed. This marginaldistribution can be readily written in terms of the conditional distributionsobtained thus farPlease cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

ARTICLE IN PRESS4 G. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxxPðL p ¼ l p Þ¼ XlpX 1PðL p ¼ l p jN ¼ n; N p ¼ n p ÞPðN p ¼ n p jNn p¼0 n¼l p¼ nÞPðN ¼ nÞ:Then, the marginal distribution of L p becomesPðL p ¼ l p Þ¼ Xlpn p¼1ð13ÞX 1 l p 1p np ð1 pÞ n np PðN ¼ nÞ; for l p P 1;n¼l pn p 1PðL p ¼ 0Þ ¼ X1ð1 pÞ n PðN ¼ nÞ; for l p ¼ 0:n¼0ð14ÞThe formulation presented so far can be utilized to assess the relationshipbetween the percentage of probes p and the accuracy of theestimated queue lengths. The main input required is a probabilityfunction for the total number of vehicles in the queue, P(N = n).The next two sections provide two examples to illustrate the applicationof this formulation.4. Analysis of arrivals on red (without overflow queue)At signalized intersections, if the overflow queue is ignored andthe number of arrivals per unit time is assumed to be Poisson withparameter k then, N is also a Poisson with parameter kR, where R isthe duration of the red interval. It should be noted that at isolatedintersections the use of Poisson distribution is common fordescribing the arrival process (Rouphail et al., 2001). Without lossof generality it is assumed that R = 1, then the probability functionfor N can be written asPðN ¼ nÞ ¼ e k k n; k > 0; n ¼ 0; 1; 2 ... ð15Þn!For illustration purposes an example is constructed where k is set to10. From Eq. (8), the expected queue length for this example can beobtained as follows:the percentage of probes, and q is the parameter of the geometricdistribution.As it can be observed from Fig. 2, at smaller p and l p values (lessthan 10 or so) the expected value is close to k, as the most informationon-hand is based on the arrival rate. On the other hand, whenp is large then the expected value tends to the location of the lastprobe. Obviously, at the extreme when p = 1.0, the location of thelast probe is the same as the queue length.Fig. 3 shows the variance of the conditional queue length versusthe location of last probe for different p values. The results are inagreement with the intuition that variance decreases with higherp values. Variance is larger when l p is small; and as l p becomes largervariance tends to zero even for small p values. This implies thatqueue length can be estimated with more accuracy when l p islarge.The previous two figures illustrate how expected queue lengthand its variance change by the location of last probe in the queuebut they do not provide any information about the likelihood ofoccurrence of each l p value. This information is needed to assessthe overall or weighted accuracy at a given p value. Fig. 4 showsthe probability mass function of L p for different p values calculatedfrom Eq. (14). Asp value increases the bell-shaped distribution becomesmore symmetrical around k = 1 and the probability of l pbeing zero (or the probability of having no probe = exp( kp)) getscloser to zero.Using the probability mass functions in Fig. 4, one can calculatethe weighted variance (see Eq. (12)) at a given p value to determinethe overall expected deviations of the estimates from the truequeue lengths. Weighted variances are calculated at various p levelsand plotted in Fig. 5. As anticipated, the variance reduces as pincreases and becomes zero as p approaches to 1. This figure certainlyprovides valuable information needed in trading off the levelEðNjL p ¼ l p Þ¼ X1n¼l pn½ð1 pÞkŠnn! P 1k¼l p½ð1pÞkŠ kk!; for l p P 1: ð16ÞFig. 2 shows the relationship between the expected queue lengthversus the location of the last probe l p for five different probe vehicleproportions p. It can be observed that, for all p values, the expectedqueue length converges to the location of last probe as l pincreases. Therefore, one can write lim lp!1ðEðNjL p ¼ l p Þ L p Þ¼0.This limit is numerically verified for Poisson distribution P(N = n).However, it should be noted that this result depends on the probabilitydistribution of N. For example, if P(N = n) is geometric, thenlim lp!1ðEðNjL p ¼ l p Þ L p Þ¼a=ð1 aÞ, where a =(1 p)(1 q), p isFig. 3. Variance of the conditional queue length versus L p .Fig. 2. Expected value of queue length given the location of the last probe vehicle inthe queue.Fig. 4. Marginal probability mass function (pmf) of L p .Please cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

ARTICLE IN PRESSG. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxx 5Fig. 5. Variance of error versus p.of accuracy and the cost of data collection (i.e., percentage of probevehicles).Even though Fig. 5 provides some guidance on trading off accuracyand p, confidence intervals and confidence limits are moreinformative for making such decisions. In order to construct confidenceintervals for the overall error at a given p value, one needs toknow the pmf of the error shown in Eq. (10). However, derivingthis pmf seems to be prohibitive due to the complexity ofthe expressions involved (e.g., the probability distribution ofE(NjL p = l p )). Instead, one- and two-sigma deviations (i.e., standarddeviations) for expected queue length are computed and shown inFig. 6. If one can assume that a normal approximation is valid then,a 95% confidence interval would nearly coincide with the two-sigmacurve in Fig. 6.The previous figures are constructed for a fixed arrival rate kwhich essentially determines the average queue length. In orderto see how the error behaves with different arrival rates, Fig. 7 isconstructed. It shows the error that is normalized by the arrivalrate k in percentage as a function of p at various k levels. At a givenFig. 6. Deviations from the expected total queue length versus p.Var(D)11010090807060504030201000 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5Proportion of probes,pp level, the relative or normalized error is smaller for larger k valueswhich implies that the relative error gets smaller as averagequeue length increases.4.1. Analysis with different arrival processesThe formulation presented in Section 3 is applicable to anyprobability distribution of N,P(N = n). To explore the behavior of errorswith different P(N = n), the results for Negative Binomial andPoisson arrivals are plotted in Fig. 8. For illustration purposes, NegativeBinomial is selected to represent P(N = n). It should be notedthat the overflow queue is not considered in these examples;therefore, probability distribution of N is the same thing as theprobability distribution of total arrivals is a given time period.The probability mass function for Negative Binomial isPðN ¼ nÞ ¼ n þ r 1nq r ð1PoissonGeometricNeg.Binomial(r=5)Neg.Binomial(r=10)Neg.Binomial(r=35)Fig. 8. Variance of errors for Poisson and Negative Binomial Distributions.qÞ n ; n ¼ 0; 1; ..., where n is consideredhere as the total number of arrivals or total number of vehiclesin queue. The expected value for average queue length isselected to be 10 vehicles per 45 seconds of red duration in orderto be consistent with the example above. The mean for a NegativeBinomial random number can be found by r(1 q)/q. The errorsare calculated for different r values as shown in Fig. 8 where q isselected such that the mean is always equal to 10. The tail of thePoisson distribution is shorter than that of the Negative Binomialat low r values. Therefore, the number of events with larger arrivalswill be fewer if a Poisson distribution is used than if a NegativeBinomial distribution is used when both models have the same expectedvalue (Banks et al., 2001). Fig. 8 shows that the model givesconsistent results with this observation. It is known that as theparameter r increases the shape of the Negative Binomial resemblesto the Poisson, and consequently the Negative Binomial linesin Fig. 8 gets closer to that of the Poisson as r increases.E(N) ± σ and 2σ252015105Deviations from E(N)E(N)1− σ2− σ00 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1Proportion of Probes, pFig. 7. Percent error for different average queue lengths at various p levels.Fig. 9. Deviations from the expected total queue length versus p.Please cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

ARTICLE IN PRESS6 G. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxxTable 1Two-sigma deviations at various p levelsp 0.0001 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.9999# of vehicle (2r) 8.5 6.6 5.1 4.1 3.3 2.6 2.1 1.5 1.1 0.7 0.0Dev. % (2r) 75 58 45 36 29 23 18 13 10 6 05. Analysis with the overflow queueIn the previous section, the overflow queue (cycle to cycle overflow)or the residual queue at the end of green period is ignored. Inthis section, the total queue length estimation that also includesthe overflow queue due to randomness is presented. It is still assumedthat average demand is less than the capacity – undersaturatedconditions. Let N o be the overflow queue length and N r be thequeue length that occurs from new arrivals during the red period.The total queue length to be estimated, N, is the summation of N oand N r . At any given cycle, probe vehicles may or may not be presentin one or both of N o and N r , leading to four possible scenarios.Irrespective of where a probe vehicle is in the queue, the generalformulation and equations presented before are applicable as longas the marginal probability function, P(N = n), is known. Therefore,the analysis can be carried out without dealing with these scenariosindividually.The main input needed is the marginal probability of the totalqueue length, P(N = n). For a traffic light with fixed timing, someresearchers developed models that allow calculation of P(N = n)numerically. These models involve finding the complex roots ofthe pgf of overflow queue in equilibrium (Abate and Whitt,1992). The application of these models also requires numericalevaluation and approximations for pgf inversion (Van Leeuwaarden,2006). The second alternative to find P(N = n) is through simulation.In this paper, a simulation model is developed todetermine P(N = n).In this example, arrivals are assumed to be Poisson with a rateof k, and the vehicles are assumed to depart from the stopbar at aconstant rate during the green, which is assumed to be 2 secondsper vehicle. Therefore, the service rate l is equal to the green durationdivided by 2. The total queue length is defined as N = N o + N r inequilibrium conditions. The distribution of total queue length,P(N = n), is determined by running the simulation multiple times.The traffic light parameters are chosen as follows: cycle length,C = 90 seconds; red period, r = 45 seconds; and green period,g = 45 seconds. The arrival rate, k, is set to 20 vehicles per cycle.These selected parameters correspond to a v/c ratio of 0.89. Simulationis run for 180,000 seconds and replicated 1000 times. Thefirst 200-cycles in each replication is considered as the warm-upperiod and not included in calculating the equilibrium distributionof P(N = n).Once the P(N = n) is obtained, an analysis similar to the one presentedin the previous section can be performed to determine theTable 2Results for various q valuesp = 0.5q = k/l k Mean = E(N) r 2 = E(VAR(Njl p )) 2r Dev. (%)0.50 11.25 5.630 1.295 2.3 400.60 13.50 6.757 1.395 2.4 350.70 15.75 7.936 1.472 2.4 310.80 18.00 9.338 1.553 2.5 270.88 20.00 11.394 1.654 2.6 230.90 20.25 11.816 1.672 2.6 220.95 21.38 14.938 1.766 2.7 180.99 22.00 23.580 1.876 2.7 12100* σ/E(N)70605040302010λ/μ=0.6λ/μ=0.7λ/μ=0.8λ/μ=0.9λ/μ=0.9900 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1Proportion of Probes, pFig. 10. Normalized errors versus proportion of probes at different q values.expected total queue length and variance. For brevity, only the resultsthat show how error changes by the proportion of probe vehiclesare presented here. Fig. 9 shows the expected total queuelength and one- and two-sigma deviations from the mean versusthe proportion of probes, p. Compared to Fig. 6, the mean is shiftedupwards due to the effect of the overflow queue, and the deviationsshow similar behavior.The results presented in Fig. 9 are summarized in Table 1 to easilysee the improvement in errors when p gets larger. The 2 rdeviations are presented in two formats: as absolute values (row2) and as percentage of the mean (row 3). The values in row 3can also be interpreted as the coefficient of variance (cv) multipliedby two. It is useful to look at both absolute deviations in terms ofthe number of vehicles and the percent deviations since the targeterror criterion can be expressed in both formats. For example, for80% probe vehicles, the expected error is about 10% and the absoluteerror is about ±1 vehicle. If the error is assumed to follow anormal distribution then, these error rates roughly correspond toa 95% confidence interval.Thus far, the results presented are for a constant arrival ratewith a v/c ratio or q equal to 0.89. To understand the impact of qon accuracy, the arrival rate is varied to generate the scenariosshown in Table 2 where the percent of probe vehicles p is kept constantat 50%. For each one of these scenarios, the simulation programis run separately to obtain P(N = n) and the equationsdescribed above are utilized to estimate the means and variances.The last two columns of Table 2 convey the same type of informationpresented in Table 1. As it can be observed, absolute deviations(2r) are not increasing significantly as v/c or q increases. On theother hand, percent deviations from the mean are decreasing substantiallywith increasing q.In order to depict a more comprehensive picture, Fig. 10 is createdto analyze the errors both as a function of probe proportions pand q = k/l values. The normalized percent error (or percent coefficientof variation) decreases with increasing proportion of probesas expected. However, the errors at different q values show a complexbehavior as a function of p. For very small p values (less than5%) the relative error gets very large when q approaches 1. This canbe attributed to the increase in the size and variance of the initialqueue. For larger p values, the relative error is smaller for larger qvalues as also indicated in the results of Table 2.Please cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

ARTICLE IN PRESSG. Comert, M. Cetin / European Journal of Operational Research xxx (2008) xxx–xxx 76. ConclusionsThis paper presents a statistical method for real-time estimationof queue length at a signalized intersection approach fromprobe vehicle data under the assumption that the marginal probabilitydistribution of the queue length P(N = n) is known. Analyticalexpressions are derived for queue length estimation and for erroranalysis where the queue length is estimated based on the positionor location of probe vehicles in the queue. It is found that, in termsof real-time data, only location of the last probe vehicle l p in thequeue is sufficient for the estimation (i.e., location of other probesor the total number of probe vehicles in the queue are redundant).Therefore, from the technological perspective, tracking only thelocation of the last probe in the queue is sufficient. Obviously, thisconclusion is valid for the conditions and assumptions consideredin the paper (e.g., steady-state conditions, known arrival rate). Forexample, any information on the arrival times of probe vehicles isnot considered here, which can potentially provide additional benefitsto the queue length estimation.Exact expressions for the conditional mean and variance of thequeue length for a given l p are derived. Two example applicationsare presented for a traffic light with fixed timing where arrivals areassumed to follow a Poisson distribution. In the first example,overflow queue is not considered and therefore the probability distributionof the queue length becomes Poisson. In the secondexample, overflow queue is considered and P(N = n) is derived fromsimulated data since there is no readily available analytical modelto determine P(N = n). For both of these examples, the relationshipbetween the percentage of probe vehicles p and the accuracy of theestimates is analyzed. Obviously, as p increases error diminishes.The results can be used to select an appropriate p for given confidencelevel and error level. In addition, it is found that as volumeto capacity or q increases, the absolute error in the queue lengthestimates does not increase significantly whereas the percentageerror relative to the mean decreases significantly for p values largerthan 10%. This implies that queue lengths can be estimated moreeffectively at relatively high volumes. It should be noted that theseresults are obtained for undersaturated conditions (q < 1.0).This paper presents what appears to be the first attempt to formulatethe problem of estimating the queue length from probevehicle data. The presented formulation relies on a priori knowledgeof the marginal distribution of N. Therefore, specifying anaccurate distribution for N is an important issue. There are numerousopportunities for future research. Extensions that deal withtime dependent demand, multiple intersections, actuated signals,and oversaturated conditions are some obvious ones. In addition,the use of other types of probe vehicle data, beyond the probe vehiclelocation, should also be investigated.AcknowledgementsThe authors wish to thank the University of South Carolina forsupporting this research. The authors are also grateful to the anonymousreviewers who provided very detailed and insightful commentsthat helped us improve the quality of this manuscript.ReferencesAbate, J., Whitt, W., 1992. Numerical inversion of probability generating functions.Operations Research Letters 12, 245–251.Banks, J., Carson II, J.S., Nelson, B.L., Nicol, D.M., 2001. Discrete-Event SystemSimulation. Prentice-Hall, New Jersey.Boyce, D.E., Hicks, J., Sen, A., 1991. In-vehicle navigation requirements formonitoring link travel times in a dynamic route guidance system. OperationsReview 8 (1), 17–24.Cetin, M., List, G.F., Zhou, Y., 2005. Factors affecting the minimum number of probesrequired for reliable travel time estimation. Transportation Research Record1917, 37–44.Chen, M., Chien, S.I.J., 2000. Determining the number of probe vehicles for freewaytravel time estimation by microscopic simulation. Transportation ResearchRecord 1719, 61–68.Cheu, R.L., Xie, C., Lee, D., 2002. Probe vehicle population and sample size for arterialspeed estimation. Computer-Aided Civil and Infrastructure Engineering 17 (1),53–60.Darroch, J.N., 1964. On the traffic-light queue. The Annals of MathematicalStatistics, 380–388.Duffield, N.G., 2006. Network tomography of binary network performancecharacteristics. IEEE Transactions on Information Theory 52, 5373–5388.Ferman, M.A., Blumenfeld, D.E., Dai, X., 2005. An analytical evaluation of a real-timetraffic information system using probe vehicles. Journal of IntelligentTransportation Systems: Technology, Planning, and Operations 9 (1), 23–34.Gartner, N.H., Pooran, F.J., Andrews, C.M., 2002. Optimized policies for adaptivecontrol systems: Implementation and field testing. Transportation ResearchRecord 1811, 148–156.Haight, F.A., 1959. Overflow at a traffic flow. Biometrika 46 (3-4), 420–424.Heidemann, D., 1994. Queue length and delay distributions at signalizedintersections. Transportation Research Part B 28 (5), 624–635.Hellinga, B.R., 2002. Improving freeway speed estimates from single-loop detectors.Journal of Transportation Engineering 128 (1), 58–67.Lin, I., Rong, H., Kornhauser, A.L., 2008. Estimating nationwide link speeddistribution using probe position data. Journal of Intelligent TransportationSystems 12 (1), 29–37.McNeil, D.R., 1968. A solution to the fixed cycle traffic light problem for compoundPoisson arrivals. Journal of Applied Probability 5, 624–635.Miller, A.J., 1963. Settings for fixed-cycle traffic signals. Operations ResearchQuarterly 14, 373–386.Mung, K.S.G., Poon, A.C.K., Lam, W.H.K., 1996. Distributions of queue lengths at fixedtime traffic signals. Transportation Research Part B 30 (6), 421–439.Newell, G.F., 1965. Approximation methods for queues with application to thefixed-cycle traffic light. SIAM Review 7, 223–240.Olszewski, P., 1990. Modeling of queue probability distribution at traffic signals. In:Koshi, M. (Ed.), Transportation and Traffic Theory. Elsevier, Amsterdam, pp.569–588.Rouphail, N., Tarko, A., Li, J., 2001. Traffic flows at signalized intersections. TrafficFlow Theory Monograph.Srinivasan, K.K., Jovanis, P.P., 1996. Determination of number of probe vehiclesrequired for reliable travel time measurement in urban network. TransportationResearch Record 1537, 5–22.Turner, S., Holdener, D., 1995. Probe vehicle sample sizes for real-time information:The Houston experience. In: Proceedings of the Vehicle Navigation &Information Systems Conference, Seattle, WA, United States, pp. 3–10.USDOT, 2008. Nationwide Differential Global Positioning System Program FactSheet, Turner-Fairbank Highway Research Center. , visited on February 26.Van Leeuwaarden, V.J.S.H., 2006. Delay analysis for the fixed-cycle traffic-lightqueue. Transportation Science 40 (2), 189–199.Van Zuylen, H.J., Viti, F., 2007. A probabilistic model for queues, delays and waitingtime at controlled intersections. In: 86th Annual Meeting of the TransportationResearch Board CD-ROM.Webster, F.V., 1958. Traffic signal settings. Road Research Laboratory TechnicalPaper No. 39, HMSO, London.Xi, B., Michalidis, G., Nair, V.N., 2006. Estimating network loss rates using activetomography. Journal of the American Statistical Association 101 (476), 1430–1448.Please cite this article in press as: Comert, G., Cetin, M., Queue length estimation from probe vehicle location and the impacts of samplesize, European Journal of Operational Research (2008), doi:10.1016/j.ejor.2008.06.024

Queue length estimation from probe vehicle ... - ResearchGate

Create successful ePaper yourself

Delete template?

Save as template?