2006 Scheme and Functional Programming Papers, University of

2. Explaining Macros

Macro expansion takes place during parsing. As the parser traverses the concrete syntax,¹ it creates abstract syntax nodes for primitive syntactic forms, but stops when it recognizes the use of a macro. At that point, it hands over the (sub)phrases to the macro, which, roughly speaking, acts as a rewriting rule.

In Lisp and in some Scheme implementations, a macro is expressed as a plain function; in R5RS Scheme [19], macros are expressed in a sub-language of rewriting rules based on patterns. Also in Lisp, concrete syntax is just S-expressions; Lisp macro programming is thus typically first-order functional programming² over pairs and symbols. The most widely used Scheme implementations, however, represent concrete syntax with structures that carry additional information: lexical binding information, original source location, code security annotations, and others. Scheme macro programming is therefore functional programming with a rich algebraic datatype.

Given appropriate inputs, a Lisp macro can go wrong in two ways. First, the macro transformer itself may raise a run-time exception. This case is naturally in the domain of run-time debuggers; after all, it is just a matter of traditional functional programming. Second, a Lisp macro may create a new term that misuses a syntactic form, which might be a primitive form or another macro. This kind of error is not detected when the macro is executing, but only afterwards when the parser-expander reaches the misused term.

Modern Scheme macros might go wrong in yet another way. The additional information in a syntax object interacts with other macros and primitive special forms. For example, macro-introduced identifiers carry a mark that identifies the point of their introduction, and binding forms interpret identifiers with different marks as distinct names. Scheme macros must not only compute a correct replacement tree but also equip it with the proper additional properties.
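The two Lisp-style failure modes described above can be reproduced by modeling a macro transformer as an ordinary function over S-expressions. The following is a minimal sketch in present-day Racket, not code from the paper; the transformer my-unless-transformer and the form my-unless are hypothetical names chosen for illustration.

```racket
#lang racket/base

;; Hypothetical Lisp-style transformer: an ordinary function from one
;; S-expression to another. The input has the shape (my-unless test body ...).
(define (my-unless-transformer form)
  (let ([test (cadr form)]
        [body (cddr form)])
    `(if ,test (void) (begin ,@body))))

;; Normal use: the result is a plain list built from pairs and symbols.
(define expanded
  (my-unless-transformer '(my-unless (zero? n) (display n) n)))

;; Failure mode 1: the transformer itself raises a run-time exception,
;; here because cadr is applied to a one-element list.
(define transformer-failed?
  (with-handlers ([exn:fail:contract? (lambda (e) #t)])
    (my-unless-transformer '(my-unless))
    #f))

;; Failure mode 2: the transformer succeeds but builds a term that misuses
;; another form. With no body, the result contains (begin), which an
;; expander would reject only later, when it reaches that subterm in an
;; expression position.
(define misuse (my-unless-transformer '(my-unless (zero? n))))
```

In the second case the transformer returns normally, so a run-time debugger attached to it has nothing to report; the mistake becomes visible only in the term it produced.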
Even in Lisp, which has supported macros for almost 50 years now, macros have always had impoverished debugging environments. A typical Lisp environment supports just two procedures/tools for this purpose: expand and expand-once (or macroexpand and macroexpand-1 [28]). All Scheme implementations with macros have adapted these procedures.

When applied to a term, expand completely parses and expands it; in particular, it does not show the intermediate steps of the rewriting process. As a result, expand distracts the programmer with too many irrelevant details. For example, Scheme has three conditional expressions: if, cond, and case. Most Scheme implementations implement only if as a primitive form and define cond and case as macros. Whether a special form is a primitive form or a macro is irrelevant to a programmer, except that macro expansion reveals the difference. It is thus impossible to study the effects of a single macro or a group of related macros in an expansion, because expand processes all macros and displays the entire abstract syntax tree.

The task of showing individual expansion steps is left to the second tool: expand-once. It consumes a macro application, applies the matching macro transformer, and returns the result. However, when an error shows up due to complex macro interactions, expand-once becomes difficult to use because the offending or interesting pieces are often hidden under a large pile of syntax. Worse, iterated calls to expand-once lose information between expansion steps, because lexical scope and other information depend on the context of the expansion call. This problem renders expand-once unfit for serious macro debugging.

¹ We consider the result of (read) as syntax.
² Both Lisp and Scheme macro programmers occasionally use side-effects, but aside from gensym this is rare.

Implementing a better set of debugging tools than expand and expand-once is surprisingly difficult.
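To make the contrast between the two traditional tools concrete, the following sketch calls both on the same cond expression in present-day Racket. It is an illustration under assumptions, not code from the paper: the sample term is made up, and the exact shape of the printed expansions depends on the Racket version, so no output is shown.

```racket
#lang racket/base

;; A fresh namespace in which cond has its usual racket/base meaning.
(define ns (make-base-namespace))

;; Sample term, chosen only for illustration.
(define term '(cond [(zero? 0) 'a] [else 'b]))

;; expand-once applies only the outermost macro transformer.
(define one-step
  (parameterize ([current-namespace ns])
    (syntax->datum (expand-once term))))

;; expand runs to completion, leaving only primitive forms.
(define fully-expanded
  (parameterize ([current-namespace ns])
    (syntax->datum (expand term))))

(printf "one step:~n  ~s~n" one-step)
(printf "full expansion:~n  ~s~n" fully-expanded)
```

Note that syntax->datum strips the lexical information from the result, so feeding a printed step back into expand-once is a lossy way to iterate by hand.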
It is apparently impossible to adapt the techniques known from run-time debugging. For example, any attempt to pre-process the syntax and attach debugging information or insert debugging statements fails for two reasons: first, until parsing and macro expansion happen, the syntactic structure of the tree is unknown; second, because macros inspect their arguments, annotations or modifications are likely to change the result of the expansion process [31].

While these reasons explain the dearth of macro debugging tools and steppers, they don’t reduce the need for them. What we present in this paper is a mechanism for instrumenting the macro expander and for displaying the expansion events and intermediate stages in a useful manner. Eventually we also hope to derive a well-founded model of macros from this work.

3. The Macro Debugger at Work

The core of our macro debugging tool is a stepper for macro expansion in PLT Scheme. Our macro debugger shows the macro expansion process as a reduction sequence, where the redexes are macro applications and the contexts are primitive syntactic forms, i.e., nodes in the final abstract syntax tree. The debugger also includes a syntax display and browser that helps programmers visualize properties of syntax objects.

The macro stepper is parameterized over a set of “opaque” syntactic forms. Typically this set includes those macros imported from libraries or other modules. The macro programmers are in charge, however, and may designate macros as opaque as needed. When the debugger encounters an opaque macro, it deals with the macro as if it were a primitive syntactic form. That is, it creates an abstract syntax node that hides the actual expansion of the macro. Naturally, it does show the expansion of the subexpressions of the macro form. The parameterization of primitive forms thus allows programmers to work at the abstraction level of their choice.
We have found this feature of the debugger critical for dealing with any nontrivial programs.

The rest of this section is a brief illustrative demonstration of the debugger. We have picked three problems with macros from recent discussions on PLT Scheme mailing lists, though we have distilled them into a shape that is suitably simple for a technical paper.

3.1 Plain Macros

For our first example we consider a debugging scenario where the macro writer gets the form of the result wrong. Here are three different versions of a sample macro that consumes a list of identifiers and produces a list of trivial definitions for these identifiers:

1. in Lisp, the macro writer uses plain list-processing functions to create the result term:

     (define-macro (def-false . names)
       (map (lambda (a) `(define ,a #f)) names))

2. in R5RS the same macro is expressed with a rewriting rule notation like this:

     (define-syntax def-false
       (syntax-rules ()
         [(def-false a ...) ((define a #f) ...)]))

3. in major alternative Scheme macro systems, the rule specification is slightly different:

     (define-syntax (def-false stx)
       (syntax-case stx ()
         [(_ a ...) (syntax ((define a #f) ...))]))

Scheme and Functional Programming, 2006
The macro definition is a function that consumes a syntax tree, named stx. The syntax-case construct de-structures the tree and binds pattern variables to its components. The syntax constructor produces a new syntax tree by replacing the pattern variables in its template with their values.

Using the macro like this:

     (def-false x y z)

immediately exposes a problem. The macro expander fails with an error explaining that definitions can’t occur in an expression context. Of course, the problem is that the macro produces a list of terms, which the macro expander interprets as an application and which, in turn, may not contain any definitions.

Our macro stepper shows the sequence of macro expansion steps, one at a time:

[Screenshot of the macro stepper]

Here we can see both the original macro form and the output of the macro application. The original appears at the top, the output of the first step at the bottom. The highlighted subterms on the top and bottom are the redex and contractum, respectively. The separator explains that this is a macro expansion step. At this point, an experienced Lisp or Scheme programmer recognizes the problem. A novice may need to see another step:

[Screenshot of the macro stepper]

Here the macro expander has explicitly tagged the term as an application. The third step then shows the syntax error, highlighting the term and the context in which it occurred.

The macro debugger actually expands the entire term before it displays the individual steps. This allows programmers to skip to the very end of a macro expansion and to work backwards. The stepper supports this approach with a graphical user interface that permits programmers to go back and forth in an expansion and also to skip to the very end and the very beginning. The ideas for this interface have been borrowed from Clements’s algebraic run-time stepper for PLT Scheme [3]; prior to that, similar ideas appeared in Lieberman’s stepper [22] and Tolmach’s SML debugger [29].

3.2 Syntax properties

Nearly all hygienic macro papers use the or macro to illustrate the problem of inadvertent variable capture:

     (define-syntax (or stx)
       (syntax-case stx ()
         [(or e1 e2)
          (syntax (let ([tmp e1]) (if tmp tmp e2)))]))

In Scheme, the purpose of (or a b) is to evaluate a and to produce its value, unless it is false; if it is false, the form evaluates b and produces its value as the result. In order to keep or from evaluating its first argument more than once, the macro introduces a new variable for the first result. In Lisp-style macro expanders (or Scheme prior to 1986), the new tmp binding captures any free references to tmp in e2, thus interfering with the semantics of the macro and the program. Consequently, the macro breaks abstraction barriers.

In Scheme, the new tmp identifier carries a mark or timestamp, introduced by the macro expander, that prevents it from binding anything but the two occurrences of tmp in the body of the macro-generated let [21]. This mark is vital to Scheme’s macro expansion process, but no interface exists for inspecting the marks and the marking process directly.

Our macro debugger visually displays this scope information at every step. The display indicates with different text colors³ from which macro expansion step every subterm originated. Furthermore, the programmer can select a particular subterm and see how the other subterms are related to it. Finally, the macro stepper can display a properties panel to show more detailed information such as identifier bindings and source locations.

The following example shows a programmer’s attempt to create a macro called if-it, a variant of if that tries to bind the variable it to the result of the test expression for the two branches:

     (define-syntax (if-it1 stx) ;; WARNING: INCORRECT
       (syntax-case stx ()
         [(if-it1 test then else)
          (syntax (let ([it test]) (if it then else)))]))

The same mechanism that prevents the inadvertent capture in the or example prevents the intentional capture here, too. With our macro debugger, the puzzled macro writer immediately recognizes why the macro doesn’t work:

[Screenshot of the macro stepper]

When the programmer selects an identifier, that identifier and all others with compatible binding properties are highlighted in the same color. Thus, in the screenshot above, the occurrence of it from the original program is not highlighted while the two macro-introduced occurrences are.

³ Or numeric suffixes when there are no more easily distinguishable colors.
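Both sides of the marking mechanism discussed in this subsection can be observed in a few lines of present-day Racket. The sketch below is an illustration under assumptions rather than code from the paper: my-or is a renamed copy of the or macro (to avoid shadowing the built-in), and if-it2 is a hypothetical repair of if-it1 that uses the common datum->syntax idiom to opt out of hygiene for one identifier. It is not necessarily the corrected version the authors give elsewhere.

```racket
#lang racket/base
(require (for-syntax racket/base))

;; Hygienic two-argument or, renamed my-or to avoid shadowing the built-in.
(define-syntax (my-or stx)
  (syntax-case stx ()
    [(_ e1 e2)
     (syntax (let ([tmp e1]) (if tmp tmp e2)))]))

;; The user's tmp is not captured by the macro-introduced tmp,
;; because the two identifiers carry different marks.
(define or-result
  (let ([tmp 5])
    (my-or #f tmp)))

;; Intentional capture (hypothetical repair of if-it1): build the
;; identifier it with the lexical context of the macro use (stx), so that
;; it binds the occurrences of it written by the macro's user.
(define-syntax (if-it2 stx)
  (syntax-case stx ()
    [(_ test then else)
     (with-syntax ([it (datum->syntax stx 'it)])
       (syntax (let ([it test]) (if it then else))))]))

(define if-it-result
  (if-it2 (memq 'b '(a b c))
          (length it)
          0))
```

Here or-result is 5, the value of the user's own tmp, and if-it-result is 2, the length of the list (b c) that the test produced and that it refers to inside the then branch.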