Design of CMU Common Lisp.pdf - Common Lisp.net

More documents

Recommendations

Info

Chapter 16 Control optimization In this phase we annotate blocks with drop-throughs. This controls how code generation linearizes code so that drop-throughs are used most effectively. We totally linearize the code here, allowing code generation to scan the blocks in the emit order. There are basically two aspects to this optimization: 1. Dynamically reducing the number of branches taken v.s. branches not taken under the assumption that branches not taken are cheaper. 2. Statically minimizing the number of unconditional branches, saving space and presumably time. These two goals can conflict, but if they do it seems pretty clear that the dynamic optimization should get preference. The main dynamic optimization is changing the sense of a conditional test so that the more commonly taken branch is the fall-through case. The problem is determining which branch is more commonly taken. The most clear-cut case is where one branch leads out of a loop and the other is within. In this case, clearly the branch within the loop should be preferred. The only added complication is that at some point in the loop there has to be a backward branch, and it is preferable for this branch to be conditional, since an unconditional branch is just a waste of time. In the absence of such good information, we can attempt to guess which branch is more popular on the basis of difference in the cost between the two cases. Min-max strategy suggests that we should choose the cheaper alternative, since the percentagewise improvement is greater when the branch overhead is significant with respect to the cost of the code branched to. A tractable approximation of this is to compare only the costs of the two blocks immediately branched to, since this would avoid having to do any hairy graph walking to find all the code for the consequent and the alternative. It might be worthwhile discriminating against ultra-expensive functions such as ERROR. For this to work, we have to detect when one of the options is empty. In this case, the next for one branch is a successor of the other branch, making the comparison meaningless. We use dominator information to detect this situation. When a branch is empty, one of the predecessors of the first block in the empty branch will be dominated by the first block in the other branch. In such a case we favor the empty branch, since that’s about as cheap as you can get. Statically minimizing branches is really a much more tractable problem, but what literature there is makes it look hard. Clearly the thing to do is to use a non-optimal heuristic algorithm. A good possibility is to use an algorithm based on the depth first ordering. We can modify the basic DFO algorithm so that it chooses an ordering which favors any drop-thrus that we may choose for dynamic reasons. When we are walking the graph, we walk the desired drop-thru arc last, which will place it immediately after us in the DFO unless the arc is a retreating arc. We scan through the DFO and whenever we find a block that hasn’t been done yet, we build a straight-line segment by setting the drop-thru to the unreached successor block which has the lowest DFN greater than that for the block. We move to the drop-thru block and repeat the process until there is no such block. We then go back to our original scan through the DFO, looking for the head of another straight-line segment. This process will automagically implement all of the dynamic optimizations described above as long as we favor the appropriate IF branch when creating the DFO. Using the DFO will prevent us from making the 40
ack branch in a loop the drop-thru, but we need to be clever about favoring IF branches within loops while computing the DFO. The IF join will be favored without any special effort, since we follow through the most favored path until we reach the end. This needs some knowledge about the target machine, since on most machines non-tail-recursive calls will use some sort of call instruction. In this case, the call actually wants to drop through to the return point, rather than dropping through to the beginning of the called function. 41
Page 1 and 2: Design of CMU Common Lisp Robert A.
Page 3 and 4: Contents I System Architecture 2 1
Page 5 and 6: 32.2 Calls . . . . . . . . . . . .
Page 7 and 8: Part I System Architecture 5
Page 9 and 10: lisp Originally, this package had a
Page 11 and 12: 2.3 Building Core Images Both the k
Page 13 and 14: Chapter 3 Compiler Overview The str
Page 15 and 16: Chapter 4 The Implicit Continuation
Page 17 and 18: What is the relationship between th
Page 19 and 20: We actually convert each form in th
Page 21 and 22: Actually, I guess the overhead for
Page 23 and 24: The current node type annotations s
Page 25 and 26: Chapter 7 Find components This is a
Page 27 and 28: The big reason for doing this trans
Page 29 and 30: • Eliminate IFs with predicates k
Page 31 and 32: If there is no intersection between
Page 33 and 34: could punt on set variables...] A t
Page 35 and 36: Chapter 12 Environment analysis A r
Page 37 and 38: Chapter 13 Virtual Machine Represen
Page 39 and 40: If the tail-set type is for a fixed
Page 41: can use the node-derived-type to se
Page 45 and 46: The primitive type T is special-cas
Page 47 and 48: But the problem doesn’t happen he
Page 49 and 50: Chapter 19 Representation selection
Page 51 and 52: This phase makes use of the results
Page 53 and 54: Chapter 21 Packing File: pack #—
Page 55 and 56: Subject to resource constraints: -
Page 57 and 58: Chapter 22 Code generation This is
Page 59 and 60: Chapter 24 Dumping So far as input
Page 61 and 62: Chapter 25 User Interface of the Co
Page 63 and 64: Chapter 26 Retargeting the compiler
Page 65 and 66: Chapter 27 Storage bases and classe
Page 67 and 68: Chapter 28 Type system parameteriza
Page 69 and 70: ead in any order as long no TN is w
Page 71 and 72: Chapter 31 Writing Assembly Code VO
Page 73 and 74: 32.1 Function Call 32.1.1 Registers
Page 75 and 76: Register usage at the time of the r
Page 77 and 78: when (current-nfp-tn VOP-self-point
Page 79 and 80: Chapter 33 Standard Primitives 77
Page 81 and 82: it would seem pretty losing to sepa
Page 83 and 84: Chapter 35 The Type System 81
Page 85 and 86: Chapter 37 The IR1 Interpreter May
Page 87 and 88: editor-based mouse-driven interface
Page 89 and 90: 254 => read next two bytes for inte
Page 91 and 92: location name, type, etc. location-
Page 93 and 94:
Then the debug-info holds a simple-
Page 95 and 96:
Chapter 39 Object Format 39.1 Taggi
Page 97 and 98:
39.5 Other-immediates As for fixnum
Page 99 and 100:
structure-header-type 182 fdefn-typ
Page 101 and 102:
Available Elements: This is a fixnu
Page 103 and 104:
| Pointer to next function (other-p
Page 105 and 106:
NIL eval::*top-of-stack* lisp::*cur
Page 107 and 108:
Chapter 41 Interface to C and Assem
Page 109 and 110:
Chapter 43 Core File Format 107
Page 111 and 112:
language. The body of necessity beg
Page 113 and 114:
15: FOP-LIST length(1) ⇒ stack Th
Page 115 and 116:
59: FOP-SMALL-CODE nitems(1) size(2
Page 117 and 118:
149-199: Unused 200: FOP-RPLACA tab
Page 119 and 120:
closed continuation A closed contin
Page 121 and 122:
Data-block A data-block is a dual-w
show all

Design of CMU Common Lisp.pdf - Common Lisp.net

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?