Is Parallel Programming Hard, And, If So, What Can You Do About It?


CHAPTER 8. DEFERRED PROCESSING

period”. The asynchronous update-side primitive, call_rcu(), invokes a specified function with a specified argument after a subsequent grace period. For example, call_rcu(p,f); will result in the “RCU callback” f(p) being invoked after a subsequent grace period. There are situations, such as when unloading a Linux-kernel module that uses call_rcu(), when it is necessary to wait for all outstanding RCU callbacks to complete [McK07e]. The rcu_barrier() primitive does this job. Note that the more recent hierarchical RCU [McK08a] implementation described in Sections D.2 and D.3 also adheres to “RCU Classic” semantics.

Finally, RCU may be used to provide type-safe memory [GC96], as described in Section 8.3.2.6. In the context of RCU, type-safe memory guarantees that a given data element will not change type during any RCU read-side critical section that accesses it. To make use of RCU-based type-safe memory, pass SLAB_DESTROY_BY_RCU to kmem_cache_create(). It is important to note that SLAB_DESTROY_BY_RCU will in no way prevent kmem_cache_alloc() from immediately reallocating memory that was just now freed via kmem_cache_free()! In fact, the SLAB_DESTROY_BY_RCU-protected data structure just returned by rcu_dereference() might be freed and reallocated an arbitrarily large number of times, even when under the protection of rcu_read_lock(). Instead, SLAB_DESTROY_BY_RCU operates by preventing kmem_cache_free() from returning a completely freed-up slab of data structures to the system until after an RCU grace period elapses.
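The consequence for readers can be sketched in ordinary user-space C. The following is a minimal, single-threaded illustration only, not the kernel's slab allocator: the names struct conn, conn_alloc(), conn_free(), conn_read_validated(), and type_safe_demo() are hypothetical, and a static array stands in for a slab that is never handed back to the system. It shows that a stale pointer may end up referring to a recycled element, so the reader must revalidate the element's identity, while the element's type stays fixed. Kernel code would normally perform this recheck while holding a per-element lock or reference, which this sketch omits.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical element type: every slot in the pool is always a struct conn. */
struct conn {
    int in_use;   /* 1 while allocated, 0 while free */
    int key;      /* identity that a reader must revalidate */
    int payload;
};

#define POOL_SIZE 4

/* Stands in for a slab whose memory is never returned to the system. */
static struct conn pool[POOL_SIZE];

/* Allocate a slot; a slot that was just freed may be handed out again at once. */
static struct conn *conn_alloc(int key, int payload)
{
    for (size_t i = 0; i < POOL_SIZE; i++) {
        if (!pool[i].in_use) {
            pool[i].in_use = 1;
            pool[i].key = key;
            pool[i].payload = payload;
            return &pool[i];
        }
    }
    return NULL;  /* pool exhausted */
}

/* Free a slot: the memory remains a struct conn, only its contents go stale. */
static void conn_free(struct conn *c)
{
    c->in_use = 0;
}

/*
 * Reader-side revalidation: dereferencing c is safe because the slot never
 * changes type, but the payload is trusted only if the identity still matches.
 */
static int conn_read_validated(const struct conn *c, int expected_key,
                               int *payload_out)
{
    if (!c->in_use || c->key != expected_key)
        return 0;
    *payload_out = c->payload;
    return 1;
}

/* Walks through free-then-reallocate while a reader holds a stale pointer. */
int type_safe_demo(void)
{
    int payload = 0;
    struct conn *stale = conn_alloc(1, 100);  /* reader picks up this pointer */

    assert(stale != NULL);
    conn_free(stale);                         /* updater frees the element */

    struct conn *reused = conn_alloc(2, 200); /* slot is recycled immediately */

    assert(reused == stale);                  /* same memory, same type */
    assert(!conn_read_validated(stale, 1, &payload));  /* identity changed */
    assert(conn_read_validated(reused, 2, &payload) && payload == 200);
    return 0;
}
```

Calling type_safe_demo() once exercises the whole sequence. The point of the design is that the identity check, not the allocator, is what protects the reader from acting on a recycled element.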
In short, although the data element might be freed and reallocated arbitrarily often, at least its type will remain the same.

Quick Quiz 8.23: How do you prevent a huge number of RCU read-side critical sections from indefinitely blocking a synchronize_rcu() invocation?

Quick Quiz 8.24: The synchronize_rcu() API waits for all pre-existing interrupt handlers to complete, right?

In the “RCU BH” column, rcu_read_lock_bh() and rcu_read_unlock_bh() delimit RCU read-side critical sections, and call_rcu_bh() invokes the specified function with the specified argument after a subsequent grace period. Note that RCU BH does not have a synchronous synchronize_rcu_bh() interface, though one could easily be added if required.

Quick Quiz 8.25: What happens if you mix and match? For example, suppose you use rcu_read_lock() and rcu_read_unlock() to delimit RCU read-side critical sections, but then use call_rcu_bh() to post an RCU callback?

Quick Quiz 8.26: Hardware interrupt handlers can be thought of as being under the protection of an implicit rcu_read_lock_bh(), right?

In the “RCU Sched” column, anything that disables preemption acts as an RCU read-side critical section, and synchronize_sched() waits for the corresponding RCU grace period. This RCU API family was added in the 2.6.12 kernel, which split the old synchronize_kernel() API into the current synchronize_rcu() (for RCU Classic) and synchronize_sched() (for RCU Sched). Note that RCU Sched did not originally have an asynchronous call_rcu_sched() interface, but one was added in 2.6.26. In accordance with the quasi-minimalist philosophy of the Linux community, APIs are added on an as-needed basis.

Quick Quiz 8.27: What happens if you mix and match RCU Classic and RCU Sched?

Quick Quiz 8.28: In general, you cannot rely on synchronize_sched() to wait for all pre-existing interrupt handlers, right?

The “Realtime RCU” column has the same API as does RCU Classic, the only difference being that RCU read-side critical sections may be preempted and may block while acquiring spinlocks.
The design of Realtime RCU is described elsewhere [McK07a].

Quick Quiz 8.29: Why do both SRCU and QRCU lack asynchronous call_srcu() or call_qrcu() interfaces?

The “SRCU” column in Table 8.5 displays a specialized RCU API that permits general sleeping in RCU read-side critical sections (see Appendix D.1 for more details). Of course, use of synchronize_srcu() in an SRCU read-side critical section can result in self-deadlock, so should be avoided. SRCU differs from earlier RCU implementations in that the caller allocates an srcu_struct for each distinct SRCU usage. This approach prevents SRCU read-side critical sections from blocking unrelated synchronize_srcu() invocations. In addition, in this variant of RCU, srcu_read_lock() returns a value that must be passed into the corresponding srcu_read_unlock().

The “QRCU” column presents an RCU implementation with the same API structure as SRCU, but optimized for extremely low-latency grace periods in the absence of readers, as described elsewhere [McK07f]. As with SRCU, use of synchronize_qrcu() in a QRCU read-side critical section can result in self-deadlock, so should be avoided. Although QRCU has not yet been accepted into the Linux kernel, it is worth mentioning given that it is the only kernel-level RCU implementation that can boast deep sub-microsecond grace-period latencies.

Quick Quiz 8.30: Under what conditions can synchronize_srcu() be safely used within an SRCU read-side critical section?

The Linux kernel currently has a surprising number of RCU APIs and implementations. There is some hope of reducing this number, evidenced by the fact that a given build of the Linux kernel currently has at most three implementations behind four APIs (given that RCU Classic and Realtime RCU share the same API). However, careful inspection and analysis will be required, just as would be required in order to eliminate one of the many locking APIs.

The various RCU APIs are distinguished by the forward-progress guarantees that their RCU read-side critical sections must provide, and also by their scope, as follows:

1. RCU BH: read-side critical sections must guarantee forward progress against everything except for NMI and IRQ handlers, but not including softirq handlers. RCU BH is global in scope.

2. RCU Sched: read-side critical sections must guarantee forward progress against everything except for NMI and IRQ handlers, including softirq handlers. RCU Sched is global in scope.

3. RCU (both classic and real-time): read-side critical sections must guarantee forward progress against everything except for NMI handlers, IRQ handlers, softirq handlers, and (in the real-time case) higher-priority real-time tasks. RCU is global in scope.

4. SRCU and QRCU: read-side critical sections need not guarantee forward progress unless some other task is waiting for the corresponding grace period to complete, in which case these read-side critical sections should complete in no more than a few seconds (and preferably much more quickly).¹ SRCU’s and QRCU’s scope is defined by the use of the corresponding srcu_struct or qrcu_struct, respectively.

In other words, SRCU and QRCU compensate for their extremely weak forward-progress guarantees by permitting the developer to restrict their scope.

¹ Thanks to James Bottomley for urging me to this formulation, as opposed to simply saying that there are no forward-progress guarantees.

8.3.3.2 RCU has Publish-Subscribe and Version-Maintenance APIs

Fortunately, the RCU publish-subscribe and version-maintenance primitives shown in the following table apply to all of the variants of RCU discussed above. This commonality can in some cases allow more code to be shared, which certainly reduces the API proliferation that would otherwise occur. The original purpose of the RCU publish-subscribe APIs was to bury memory barriers into these APIs, so that Linux kernel programmers could use RCU without needing to become expert on the memory-ordering models of each of the 20+ CPU families that Linux supports [Spr01].

The first pair of categories operate on Linux struct list_head lists, which are circular, doubly-linked lists. The list_for_each_entry_rcu() primitive traverses an RCU-protected list in a type-safe manner, while also enforcing memory ordering for situations where a new list element is inserted into the list concurrently with traversal. On non-Alpha platforms, this primitive incurs little or no performance penalty compared to list_for_each_entry(). The list_add_rcu(), list_add_tail_rcu(), and list_replace_rcu() primitives are analogous to their non-RCU counterparts, but incur the overhead of an additional memory barrier on weakly-ordered machines. The list_del_rcu() primitive is also analogous to its non-RCU counterpart, but oddly enough is very slightly faster due to the fact that it poisons only the prev pointer rather than both the prev and next pointers as list_del() must do.
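The idea of burying the memory barrier inside the insertion and traversal primitives can be sketched in user space with C11 atomics. This is only an illustration under stated assumptions, not the kernel's implementation: struct node, publish_front(), sum_list(), and publish_demo() are hypothetical names, the list is singly linked rather than a struct list_head, a single updater is assumed, and no grace-period or reclamation logic is included. The reader uses acquire loads, which are stronger (and on some weakly ordered machines more expensive) than the dependency ordering that the kernel primitives rely on.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical list element; next is atomic so that readers may race with insertion. */
struct node {
    int value;
    struct node *_Atomic next;
};

/* List head; static atomic objects are zero-initialized to a valid (NULL) state. */
static struct node *_Atomic head;

/*
 * Publish: initialize the element completely, then make it reachable with a
 * release store, so that a reader that sees the pointer also sees the contents.
 * Assumes a single updater (or an update-side lock held by the caller).
 */
static void publish_front(struct node *n, int value)
{
    n->value = value;
    atomic_init(&n->next, atomic_load_explicit(&head, memory_order_relaxed));
    atomic_store_explicit(&head, n, memory_order_release);
}

/*
 * Subscribe: each pointer is fetched with an acquire load that pairs with the
 * release store above, so the element's fields are initialized when read.
 */
static int sum_list(void)
{
    int sum = 0;

    for (struct node *p = atomic_load_explicit(&head, memory_order_acquire);
         p != NULL;
         p = atomic_load_explicit(&p->next, memory_order_acquire))
        sum += p->value;
    return sum;
}

/* Inserts two elements and traverses them; intended to be called once. */
int publish_demo(void)
{
    static struct node a, b;  /* static so the elements outlive this call */

    publish_front(&a, 1);
    publish_front(&b, 2);
    assert(sum_list() == 3);
    return 0;
}
```

Callers of publish_front() and sum_list() never write an explicit barrier, which is the ease-of-use property that the publish-subscribe primitives were designed to provide.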
Finally, the list_splice_init_rcu() primitive is similar to its non-RCU counterpart, but incurs a full grace-period latency. The purpose of this grace period is to allow RCU readers to finish their traversal of the source list before completely disconnecting it from the list header; failure to do this could prevent such readers from ever terminating their traversal.

Quick Quiz 8.31: Why doesn’t list_del_rcu() poison both the next and prev pointers?

The second pair of categories operate on Linux’s struct hlist_head, which is a linear linked list. One advantage of struct hlist_head over struct list_head is that the former requires only a single-pointer list header, which can save significant memory in large hash tables. The struct hlist_head primitives in the table relate to their non-RCU counterparts in much the same way as do the struct list_head primitives.

The final pair of categories operate directly on pointers, and are useful for creating RCU-protected