LNCS 0558 - Job Scheduling Strategies for Parallel Processing
SRPT Scheduling for Web Servers

Mor Harchol-Balter, Nikhil Bansal, Bianca Schroeder, and Mukesh Agrawal
School of Computer Science, Carnegie Mellon University
Pittsburgh, PA 15213
{harchol, nikhil, bianca, mukesh}@cs.cmu.edu
Abstract. This note briefly summarizes some results from two papers,
[4] and [23]. These papers pose the following question:

Is it possible to reduce the expected response time of every request
at a web server, simply by changing the order in which we
schedule the requests?

In [4] we approach this question analytically via an M/G/1 queue. In [23]
we approach the same question via an implementation involving an Apache
web server running on Linux.
1 Introduction

Motivation and Goals
A client accessing a busy web server can expect a long wait. This delay
comprises several components: the propagation delay and transmission delay on
the path between the client and the server; delays due to queueing at routers;
delays caused by TCP due to loss, congestion, and slow start; and finally the
delay at the server itself. The aggregate of these delays, i.e., the time from when
the client makes a request until the entire file arrives, is defined to be the response
time of the request.
We focus on what we can do to improve the delay at the server. Research has
shown that in situations where the server is receiving a high rate of requests,
the delays at the server make up a significant portion of the response time [6],
[5], [32].
Our work will focus only on static requests of the form "Get me a file." Measurements
[31] suggest that the request stream at most web servers is dominated
by static requests. The question of how to service static requests quickly is the
focus of many companies (e.g., Akamai Technologies) and of much ongoing research.
Our Idea
Our idea is simple. For static requests, the size of the request (i.e., the time required
to service the request) is well-approximated by the size of the file, which
is well-known to the server. Thus far, no companies or researchers have made
use of this information. Traditionally, requests at a web server are scheduled
D.G. Feitelson and L. Rudolph (Eds.): JSSPP 2001, LNCS 2221, pp. 11–20, 2001.
© Springer-Verlag Berlin Heidelberg 2001
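The size-based ordering described in the "Our Idea" paragraph can be sketched in a few lines. This is a hypothetical illustration, not Apache's API: `pick_next` and the connection names are invented, and the key point is only that the bytes remaining to send, known to the server from the file size, serve as the scheduling priority.

```python
def pick_next(pending):
    """pending maps a connection id to the bytes still to be sent;
    return the connection with the least remaining work (SRPT order)."""
    # min() over the dict's keys, compared by their remaining-byte values
    return min(pending, key=pending.get)

# Illustrative snapshot of three in-progress responses (names invented).
pending = {"conn-a": 512_000, "conn-b": 3_000, "conn-c": 90_000}
assert pick_next(pending) == "conn-b"  # the shortest remaining file goes first
```

A traditional server would instead serve these connections in arrival order, regardless of how much work each has left.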