9 years ago · 0b8ca5bfee
--- a/book.tex
+++ b/book.tex
@@ -3145,6 +3145,7 @@ that does what its name suggests. The program then applies
 
															 \marginpar{\scriptsize to do: interpreter for $R_4$. \\ --Jeremy}
														
 
															 \section{Functions in x86}
														
 
															+\label{sec:fun-x86}
														
 
															 The x86 architecture provides a few features to support the
														
 
															 implementation of functions. We have already seen that x86 provides
														
@@ -3252,7 +3253,7 @@ $8n-8$\key{(\%rsp)} & $8n+8$(\key{\%rbp})& argument $n$ \\
 
															 \end{figure}
														
 
															-\section{Planning the compilation of functions}
														
 
															+\section{The compilation of functions}
														
 
															 Now that we have a good understanding of functions as they appear in
														
 
															 $R_4$ and the support for functions in x86-64, we need to plan the
														
@@ -3269,10 +3270,10 @@ variable; we need to use \code{leaq} to move the function name to a
 
															 register. Similarly, the application of a function is going to require
														
 
															 a complex sequence of instructions, unlike the primitive
														
 
															 operations. Thus, it is a good idea to create a new pass that changes
														
 
															-references to function names from just a symbol $f$ to
														
 
															-\code{(function-ref $f$)} and that changes function application from
														
 
															-\code{($e_0$ $e_1$ $\ldots$ $e_n$)} to the explicitly tagged
														
 
															-\code{(app $e_0$ $e_1$ $\ldots$ $e_n$)}. A good name for this pass is
														
 
															+function references from just a symbol $f$ to \code{(function-ref
														
 
															+  $f$)} and that changes function application from \code{($e_0$ $e_1$
														
 
															+  $\ldots$ $e_n$)} to the explicitly tagged AST \code{(app $e_0$ $e_1$
														
 
															+  $\ldots$ $e_n$)}. A good name for this pass is
														
 
															 \code{reveal-functions}. Placing this pass after \code{uniquify} is a
														
 
															 good idea, because it will make sure that there are no local variables
														
 
															 and functions that share the same name. On the other hand,
														
@@ -3280,48 +3281,205 @@ and functions that share the same name. On the other hand,
 
															 because \code{flatten} will help us compile \code{function-ref}.
														
 
															 Because each \code{function-ref} needs to eventually become an
														
 
															-\code{leaq} instruction, it needs to become an assignment
														
 
															-statement. This can be handled easily in the \code{flatten} pass by
														
 
															-categorizing \code{function-ref} as a complex expression.
														
 
															+\code{leaq} instruction, it first needs to become an assignment
														
 
															+statement so there is a left-hand side in which to put the
														
 
															+result. This can be handled easily in the \code{flatten} pass by
														
 
															+categorizing \code{function-ref} as a complex expression.  Then, in
														
 
															+the \code{select-instructions} pass, an assignment of
														
 
															+\code{function-ref} becomes a \code{leaq} instruction as follows: \\
														
 
															+\begin{tabular}{lll}
														
 
															+\begin{minipage}{0.45\textwidth}
														
 
															+\begin{lstlisting}
														
 
															+  (assign |$\itm{lhs}$| (function-ref |$f$|))
														
 
															+\end{lstlisting}
														
 
															+\end{minipage}
														
 
															+&
														
 
															+$\Rightarrow$
														
 
															+&
														
 
															+\begin{minipage}{0.4\textwidth}
														
 
															+\begin{lstlisting}
														
 
															+(leaq (function-ref |$f$|) |$\itm{lhs}$|)
														
 
															+\end{lstlisting}
														
 
															+\end{minipage}
														
 
															+\end{tabular} 
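The \code{reveal-functions} pass described at the start of this section could be sketched as follows. This is a minimal Python sketch, not the compiler's actual code: the list-based encoding of expressions, the (incomplete) set of keyword and primitive names, and the helper's name are all assumptions made for illustration.

```python
# Hypothetical encoding (not from the book): an R4 expression is an int,
# a bool, a str naming a variable, or a list whose head is a keyword,
# a primitive operator name, or a callee expression.
KEYWORDS = {"if", "+", "-", "read", "not", "eq?"}  # assumed, incomplete list


def reveal_functions(expr, fun_names):
    """Tag function references and applications inside one expression."""
    if isinstance(expr, str):
        # uniquify ran earlier, so a name in fun_names can only be a function.
        return ["function-ref", expr] if expr in fun_names else expr
    if not isinstance(expr, list):
        return expr  # integer or boolean literal
    head, *rest = expr
    if head == "let":
        # Assumed shape: ["let", [[x, rhs]], body]
        [[var, rhs]], body = rest
        return ["let", [[var, reveal_functions(rhs, fun_names)]],
                reveal_functions(body, fun_names)]
    if isinstance(head, str) and head in KEYWORDS:
        return [head] + [reveal_functions(e, fun_names) for e in rest]
    # Anything else is a function application: tag it, then recurse into
    # the callee as well as the arguments.
    return ["app"] + [reveal_functions(e, fun_names) for e in expr]
```

Under these assumptions, the expression \code{(add 40 2)} with \code{add} as a known function name becomes \code{(app (function-ref add) 40 2)}.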
														
 
															+Next we consider compiling function definitions.  The \code{flatten}
														
 
															+pass should handle function definitions a lot like a \code{program}
														
 
															+node; after all, the \code{program} node represents the \code{main}
														
 
															+function. So the \code{flatten} pass, in addition to flattening the
														
 
															+body of the function into a sequence of statements, should record the
														
 
															+local variables in the $\Var^{*}$ field as shown below.
														
 
															+\begin{lstlisting}
														
 
															+   (define (|$f$| [|\itm{xs}| : |\itm{ts}|]|$^{*}$|) : |\itm{rt}| (|$\Var^{*}$|) |$\Stmt^{+}$|)
														
 
															+\end{lstlisting}
														
 
															+In the \code{select-instructions} pass, we need to encode the
														
 
															+parameter passing in terms of the conventions discussed in
														
 
															+Section~\ref{sec:fun-x86}. So depending on the length of the parameter
														
 
															+list \itm{xs}, some of them may be in registers and some of them may
														
 
															+be on the stack. I recommend generating \code{movq} instructions to
														
 
															+move the parameters from their registers and stack locations into the
														
 
															+variables \itm{xs}, then let register allocation handle the assignment
														
 
															+of those variables to homes. After this pass, the \itm{xs} can be
														
 
															+added to the list of local variables. As mentioned in
														
 
															+Section~\ref{sec:fun-x86}, we need to find out how far to move the
														
 
															+stack pointer to ensure we have enough space for stack arguments in
														
 
															+all the calls inside the body of this function. This pass is a good
														
 
															+place to do this and store the result in the \itm{maxStack} field of
														
 
															+the output \code{define} shown below.
														
 
															+\begin{lstlisting}
														
 
															+  (define (|$f$|) |\itm{numParams}| (|$\Var^{*}$| |\itm{maxStack}|) |$\Instr^{+}$|)
														
 
															+\end{lstlisting}
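The parameter moves recommended above might be generated along the following lines. This is a hedged Python sketch: the list encoding of instructions and the helper name are hypothetical, and the \code{(stack $k$)} operand, read here as a location $k$ bytes above the frame pointer, is an assumption. The register order is the standard System V one, and the offset $8n+8$ for stack argument $n$ follows the table in Section~\ref{sec:fun-x86}.

```python
# System V argument-passing registers, in order.
ARG_REGISTERS = ["rdi", "rsi", "rdx", "rcx", "r8", "r9"]


def param_moves(params):
    """Emit one movq per parameter, copying it into a variable of the same name."""
    instrs = []
    for i, x in enumerate(params):
        if i < len(ARG_REGISTERS):
            src = ["reg", ARG_REGISTERS[i]]
        else:
            # n is the 1-based position among the stack arguments; the callee
            # sees it at 8n+8 bytes above rbp. ["stack", k] is an assumed form.
            n = i - len(ARG_REGISTERS) + 1
            src = ["stack", 8 * n + 8]
        instrs.append(["movq", src, ["var", x]])
    return instrs
```

For the two parameters \code{x.1} and \code{y.1} this yields moves from \code{rdi} and \code{rsi}, after which register allocation is free to pick homes for the two variables.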
														
 
															+
														
 
															+Next, consider the compilation of function applications, which have
														
 
															+the following form at the start of \code{select-instructions}.
														
 
															+\begin{lstlisting}
														
 
															+  (assign |\itm{lhs}| (app |\itm{fun}| |\itm{args}| |$\ldots$|))
														
 
															+\end{lstlisting}
														
 
															+In the mirror image of handling the parameters of function
														
 
															+definitions, some of the arguments \itm{args} need to be moved to the
														
 
															+argument passing registers and the rest should be moved to the
														
 
															+appropriate stack locations, as discussed in
														
 
															+Section~\ref{sec:fun-x86}. You might want to introduce a new kind of
														
 
															+AST node for stack arguments, \code{(stack-arg $i$)} where $i$ is the
														
 
															+index of this argument with respect to the other stack arguments. As
														
 
															+you generate this code for parameter passing, take note of how many
														
 
															+stack arguments are needed for purposes of computing the
														
 
															+\itm{maxStack} discussed above.
														
 
															+
														
 
															+Once the instructions for parameter passing have been generated, the
														
 
															+function call itself can be performed with an indirect function call,
														
 
															+for which I recommend creating the new instruction
														
 
															+\code{indirect-callq}. Of course, the return value from the function
														
 
															+is stored in \code{rax}, so it needs to be moved into the \itm{lhs}.
														
 
															+\begin{lstlisting}
														
 
															+  (indirect-callq |\itm{fun}|)
														
 
															+  (movq (reg rax) |\itm{lhs}|)
														
 
															+\end{lstlisting}
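Putting the argument moves, the indirect call, and the move out of \code{rax} together, the translation of one application might look like the sketch below. It is written in Python with a hypothetical list encoding and helper name. Treating the index in \code{stack-arg} as 0-based, and returning the number of stack arguments so the caller can take the maximum over all calls, are assumptions about details the text leaves open.

```python
# System V argument-passing registers, in order.
ARG_REGISTERS = ["rdi", "rsi", "rdx", "rcx", "r8", "r9"]


def select_app(lhs, fun, args):
    """Translate (assign lhs (app fun args ...)).

    Returns the instruction list and the number of stack arguments used,
    which feeds into the per-function maxStack computation.
    """
    instrs = []
    # The first six arguments travel in registers (zip stops at the shorter list).
    for reg, arg in zip(ARG_REGISTERS, args):
        instrs.append(["movq", arg, ["reg", reg]])
    # Remaining arguments go to stack slots; the 0-based index is an assumption.
    stack_args = args[len(ARG_REGISTERS):]
    for i, arg in enumerate(stack_args):
        instrs.append(["movq", arg, ["stack-arg", i]])
    instrs.append(["indirect-callq", fun])
    # The callee leaves its result in rax.
    instrs.append(["movq", ["reg", "rax"], lhs])
    return instrs, len(stack_args)
```

The maximum of the second component over every call in a function body gives the number of stack-argument slots that function must reserve.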
														
 
															+
														
 
															+The rest of the passes need only minor modifications to handle the new
														
 
															+kinds of AST nodes: \code{function-ref}, \code{indirect-callq}, and
														
 
															+\code{leaq}. Inside \code{uncover-live}, when computing the $W$ set
														
 
															+(written variables) for an \code{indirect-callq} instruction, I
														
 
															+recommend including all the caller save registers, which will have the
														
 
															+effect of making sure that no caller save register actually needs to be
														
 
															+saved. In \code{patch-instructions}, you should deal with the x86
														
 
															+idiosyncrasy that the destination argument of \code{leaq} must be a
														
 
															+register.
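The two adjustments just described can be sketched as follows, again in Python with a hypothetical list encoding and hypothetical helper names. The caller save register set is the System V one. The use of \code{rax} as the scratch register for \code{leaq} is an assumption, and the write-set helper deliberately covers only \code{indirect-callq} and two-operand instructions.

```python
# Caller save registers under the System V calling convention.
CALLER_SAVE = {"rax", "rcx", "rdx", "rsi", "rdi", "r8", "r9", "r10", "r11"}


def write_set(instr):
    """W set for indirect-callq and for two-operand instructions only."""
    if instr[0] == "indirect-callq":
        # Pretend the call writes every caller save register, so no variable
        # that is live across the call gets assigned to one of them.
        return set(CALLER_SAVE)
    dest = instr[-1]
    return {dest[1]} if dest[0] in ("var", "reg") else set()


def patch_leaq(instr):
    """Route a leaq with a non-register destination through a scratch register."""
    if instr[0] == "leaq" and instr[2][0] != "reg":
        scratch = ["reg", "rax"]  # assumed scratch register
        return [["leaq", instr[1], scratch], ["movq", scratch, instr[2]]]
    return [instr]
```

A \code{leaq} whose destination is already a register passes through unchanged, so the extra \code{movq} only appears when the allocator placed the left-hand side on the stack.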
														
 
															+
														
 
															+For the \code{print-x86} pass, I recommend the following translations:
														
 
															+\begin{lstlisting}
														
 
															+  (function-ref |\itm{label}|) |$\Rightarrow$| |\itm{label}|(%rip)
														
 
															+  (indirect-callq |\itm{arg}|) |$\Rightarrow$| callq *|\itm{arg}|
														
 
															+  (stack-arg |$i$|) |$\Rightarrow$| |$i$|(%rsp)
														
 
															+\end{lstlisting}
														
 
															+For function definitions, the \code{print-x86} pass should add the
														
 
															+code for saving and restoring the callee save registers, if you
														
 
															+haven't already done that.
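The three \code{print-x86} translations above could be realized roughly as in the following Python sketch. The list encoding and helper names are hypothetical, only a few operand kinds are handled, and the label mangling (replacing a period with an underscore) is an assumption inferred from the \code{add\_1} label in Figure~\ref{fig:add-fun}.

```python
def print_operand(arg):
    """Render one operand in AT&T syntax (only the forms needed here)."""
    tag, value = arg
    if tag == "function-ref":
        # Assumed mangling: add.1 becomes add_1; addressed relative to rip.
        return f"{value.replace('.', '_')}(%rip)"
    if tag == "stack-arg":
        return f"{value}(%rsp)"
    if tag == "reg":
        return f"%{value}"
    if tag == "int":
        return f"${value}"
    raise ValueError(f"unhandled operand: {arg!r}")


def print_instr(instr):
    """Render one instruction; indirect-callq becomes callq with a star."""
    op, *operands = instr
    if op == "indirect-callq":
        return f"callq\t*{print_operand(operands[0])}"
    return f"{op}\t" + ", ".join(print_operand(a) for a in operands)
```

The star in the \code{callq} output is what marks the operand as holding the address to call, as opposed to being the call target itself.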
														
 
															+\section{An Example Translation}
														
 
															+
														
 
															+Figure~\ref{fig:add-fun} shows an example translation of a simple
														
 
															+function in $R_4$ to x86-64. The figure includes the results of the
														
 
															+\code{flatten} and \code{select-instructions} passes.  Can you see any
														
 
															+obvious ways to improve the translation?
														
 
															+
														
 
															+\begin{figure}[tbp]
														
 
															+\begin{tabular}{lll}
														
 
															+\begin{minipage}{0.5\textwidth}
														
 
															+\begin{lstlisting}
														
 
															+(program
														
 
															+ (define (add [x : Integer] 
														
 
															+                [y : Integer]) 
														
 
															+    : Integer (+ x y))
														
 
															+ (add 40 2))
														
 
															+\end{lstlisting}
														
 
															+$\Downarrow$
														
 
															+\begin{lstlisting}
														
 
															+(program (t.1 t.2)
														
 
															+  ((define (add.1 [x.1 : Integer] 
														
 
															+                    [y.1 : Integer])
														
 
															+     : Integer (t.3)
														
 
															+     (assign t.3 (+ x.1 y.1))
														
 
															+     (return t.3)))
														
 
															+  (assign t.1 (function-ref add.1))
														
 
															+  (assign t.2 (app t.1 40 2))
														
 
															+  (return t.2))
														
 
															+\end{lstlisting}
														
 
															+$\Downarrow$
														
 
															+\begin{lstlisting}
														
 
															+(program ((t.1 t.2) 0)
														
 
															+  ((define (add.1) 2 ((x.1 y.1 t.3) 0)
														
 
															+     (movq (reg rdi) (var x.1))
														
 
															+     (movq (reg rsi) (var y.1))
														
 
															+     (movq (var x.1) (var t.3))
														
 
															+     (addq (var y.1) (var t.3))
														
 
															+     (movq (var t.3) (reg rax))))
														
 
															+  (leaq (function-ref add.1) (var t.1))
														
 
															+  (movq (int 40) (reg rdi))
														
 
															+  (movq (int 2) (reg rsi))
														
 
															+  (indirect-callq (var t.1))
														
 
															+  (movq (reg rax) (var t.2))
														
 
															+  (movq (var t.2) (reg rax)))
														
 
															+\end{lstlisting}
														
 
															+\end{minipage}
														
 
															+&
														
 
															+\begin{minipage}{0.4\textwidth}
														
 
															+$\Downarrow$
														
 
															+\begin{lstlisting}
														
 
															+	.globl add_1
														
 
															+add_1:
														
 
															+	pushq	%rbp
														
 
															+	movq	%rsp, %rbp
														
 
															+	pushq	%r15
														
 
															+	pushq	%r14
														
 
															+	pushq	%r13
														
 
															+	pushq	%r12
														
 
															+	pushq	%rbx
														
 
															+	subq	$16, %rsp
														
 
															+	movq	%rdi, %rbx
														
 
															+	movq	%rsi, %rcx
														
 
															+	addq	%rcx, %rbx
														
 
															+	movq	%rbx, %rax
														
 
															+	addq	$16, %rsp
														
 
															+	popq	%rbx
														
 
															+	popq	%r12
														
 
															+	popq	%r13
														
 
															+	popq	%r14
														
 
															+	popq	%r15
														
 
															+	popq	%rbp
														
 
															+	retq
														
 
															+
														
 
															+	.globl _main
														
 
															+_main:
														
 
															+	pushq	%rbp
														
 
															+	movq	%rsp, %rbp
														
 
															+	subq	$16, %rsp
														
 
															+	leaq	add_1(%rip), %rbx
														
 
															+	movq	$40, %rdi
														
 
															+	movq	$2, %rsi
														
 
															+	callq	*%rbx
														
 
															+	movq	%rax, %rbx
														
 
															+	movq	%rbx, %rax
														
 
															+	addq	$16, %rsp
														
 
															+	popq	%rbp
														
 
															+	retq
														
 
															+\end{lstlisting}
														
 
															+\end{minipage}
														
 
															+\end{tabular} 
														
 
															+\caption{Example compilation of a simple function to x86-64.}
														
 
															+\label{fig:add-fun}
														
 
															+\end{figure}
														
 
															+
														
 
															+
														
 
															+
														
 
															+\begin{exercise}\normalfont
														
 
															+Expand your compiler to handle $R_4$ as outlined in this section.
														
 
															+Create 5 new programs that use functions, including examples that pass
														
 
															+functions and return functions from other functions, and test your
														
 
															+compiler on these new programs and all of your previously created test
														
 
															+programs.
														
 
															+\end{exercise}
														
 
															-% flatten
														
 
															-%   * function-ref not simple, why? have to use the leaq instruction
														
 
															-%       to put the function label in to a register.
														
 
															-%
														
 
															-% select-instructions
														
 
															-%   * function defs. deal with parameters
														
 
															-%   * (assign lhs (function-ref f)) => (leaq (function-ref f) lhs)
														
 
															-%   * (assign lhs (app f es ...))
														
 
															-%     - pass some args in registers, rest on the stack (stack-arg)
														
 
															-%     - need to keep track of how large the stack needs to grow across
														
 
															-%       all the function calls in the body of a function
														
 
															-%     - indirect-callq f; movq rax lhs
														
 
															-%
														
 
															-% uncover-live
														
 
															-%   * free-vars: function-ref, stack-arg
														
 
															-%   * read-vars: leaq, indirect-callq
														
 
															-%   * write-vars: leaq, indirect-callq (all caller save!)
														
 
															-%   * uncover-live: treat functions like the main program.
														
 
															-%
														
 
															-% build interferece:
														
 
															-%   * treat functions like the main function
														
 
															-%
														
 
															-% assign-homes
														
 
															-%   * add cases for: stack, stack-arg, indirect-callq, function-ref
														
 
															-%
														
 
															-% allocate-registers
														
 
															-%   * treat functions like the main function
														
 
															-%
														
 
															-% patch-instructions
														
 
															-%   * add cases for: function defs, indirect-callq, leaq (target must be reg.)
														
 
															-%
														
 
															-% print-x86
														
 
															-%   * function-ref uses rip
														
 
															-%   * indirect-callq => callq *
														
 
															-%   * stack-arg  => rsp
														
 
															-%   * function defs: save and restore callee-save registers
														
 
															 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
														
 
															 \chapter{Lexically Scoped Functions}