9 tahun lalu · a5d5a3b56f
--- a/book.tex
+++ b/book.tex
@@ -738,7 +738,7 @@ test whether it produces programs that get the same result as the
 
															 input program. That is, we can test whether it satisfies Diagram
														
 
															 \eqref{eq:compile-correct}. The following code runs the partial
														
 
															 evaluator on several examples and tests the output program.  The
														
 
															-\texttt{assert} function is defined in Appendix~\ref{sec:utilities}.
														
 
															+\texttt{assert} function is defined in Appendix~\ref{appendix:utilities}.
														
 
															 \begin{lstlisting}
														
 
															 (define (test-pe pe p)
														
 
															   (assert "testing pe-arith"
														
@@ -915,7 +915,9 @@ refer to integer constants (called \emph{immediate values}), variables
 
															 called \emph{registers}, and instructions may load and store values
														
 
															 into \emph{memory}.  Memory is a mapping of 64-bit addresses to 64-bit
														
 
															 values. Figure~\ref{fig:x86-a} defines the syntax for the subset of
														
 
															-the x86-64 assembly language needed for this chapter.
														
 
															+the x86-64 assembly language needed for this chapter.  (We use the
														
 
															+AT\&T syntax that is expected by \key{gcc}, or rather, the GNU
														
 
															+assembler inside \key{gcc}.)
														
 
															 An immediate value is written using the notation \key{\$}$n$ where $n$
														
 
															 is an integer. 
														
@@ -966,7 +968,7 @@ specified by the label, which we shall use to implement
 
															 \]
														
 
															 \end{minipage}
														
 
															 }
														
 
															-\caption{A subset of the x86-64 assembly language.}
														
 
															+\caption{A subset of the x86-64 assembly language (AT\&T syntax).}
														
 
															 \label{fig:x86-a}
														
 
															 \end{figure}
														
@@ -1085,10 +1087,7 @@ The compiler will need a convenient representation for manipulating
 
															 x86 programs, so we define an abstract syntax for x86 in
														
 
															 Figure~\ref{fig:x86-ast-a}. The \itm{info} field of the \key{program}
														
 
															 AST node is for storing auxilliary information that needs to be
														
 
															-communicated from one step of the compiler to the next. The function
														
 
															-\key{print-x86} provided in the supplemental code converts an x86
														
 
															-abstract syntax tree into the text representation for x86
														
 
															-(Figure~\ref{fig:x86-a}).
														
 
															+communicated from one step of the compiler to the next. 
														
 
															 \begin{figure}[tbp]
														
 
															 \fbox{
														
@@ -1221,21 +1220,21 @@ To get from $C_0$ to x86-64 assembly requires three more steps, which
 
															 we discuss below.
														
 
															 \[\large
														
 
															 \xymatrix@=50pt{
														
 
															-  C_0 \ar@/^/[r]^-{\key{select\_instr.}}
														
 
															-  & \text{x86}^{*} \ar@/^/[r]^-{\key{assign\_homes}} 
														
 
															-  & \text{x86}^{*} \ar@/^/[r]^-{\key{patch\_instr.}}
														
 
															+  C_0 \ar@/^/[r]^-{\key{select-instr.}}
														
 
															+  & \text{x86}^{*} \ar@/^/[r]^-{\key{assign-homes}} 
														
 
															+  & \text{x86}^{*} \ar@/^/[r]^-{\key{patch-instr.}}
														
 
															   & \text{x86}
														
 
															 }
														
 
															 \]
														
 
															 We handle difference \#1, concerning the format of arithmetic
														
 
															-instructions, in the \key{select\_instructions} pass.  The result
														
 
															+instructions, in the \key{select-instructions} pass.  The result
														
 
															 of this pass produces programs consisting of x86-64 instructions that
														
 
															 use variables.
														
 
															 %
														
 
															 As there are only 16 registers, we cannot always map variables to
														
 
															 registers (difference \#3). Fortunately, the stack can grow quite
														
 
															 large, so we can map variables to locations on the stack. This is
														
 
															-handled in the \key{assign\_homes} pass. The topic of
														
 
															+handled in the \key{assign-homes} pass. The topic of
														
 
															 Chapter~\ref{ch:register-allocation} is implementing a smarter
														
 
															 approach in which we make a best-effort to map variables to registers,
														
 
															 resorting to the stack only when necessary.
														
@@ -1244,7 +1243,7 @@ The final pass in our journey to x86 handles an indiosycracy of x86
 
															 assembly. Many x86 instructions have two arguments but only one of the
														
 
															 arguments may be a memory reference. Because we are mapping variables
														
 
															 to stack locations, many of our generated instructions will violate
														
 
															-this restriction. The purpose of the \key{patch\_instructions} pass
														
 
															+this restriction. The purpose of the \key{patch-instructions} pass
														
 
															 is to fix this problem by replacing every violating instruction with a
														
 
															 short sequence of instructions that use the \key{rax} register.
														
@@ -1313,7 +1312,7 @@ that overshadow eachother.  The three programs should be in a
 
															 subdirectory named \key{tests} and they shoul have the same file name
														
 
															 except for a different integer at the end of the name, followed by the
														
 
															 ending \key{.scm}.  Use the \key{interp-tests} function
														
 
															-(Appendix~\ref{sec:utilities}) from \key{utilities.rkt} to test your
														
 
															+(Appendix~\ref{appendix:utilities}) from \key{utilities.rkt} to test your
														
 
															 \key{uniquify} pass on the example programs.
														
 
															 %% You can use the interpreter \key{interpret-S0} defined in the
														
@@ -1380,7 +1379,7 @@ Implement the \key{flatten} pass and test it on all of the example
 
															 programs that you created to test the \key{uniquify} pass and create
														
 
															 three new example programs that are designed to exercise all of the
														
 
															 interesting code in the \key{flatten} pass. Use the \key{interp-tests}
														
 
															-function (Appendix~\ref{sec:utilities}) from \key{utilities.rkt} to
														
 
															+function (Appendix~\ref{appendix:utilities}) from \key{utilities.rkt} to
														
 
															 test your passes on the example programs.
														
 
															 \end{exercise}
														
@@ -1388,11 +1387,11 @@ test your passes on the example programs.
 
															 \section{Select Instructions}
														
 
															 \label{sec:select-s0}
														
 
															-In the \key{select\_instructions} pass we begin the work of
														
 
															+In the \key{select-instructions} pass we begin the work of
														
 
															 translating from $C_0$ to x86. The target language of this pass is a
														
 
															 pseudo-x86 language that still uses variables, so we add an AST node
														
 
															 of the form $\VAR{\itm{var}}$ to the x86 abstract syntax.  The
														
 
															-\key{select\_instructions} pass deals with the differing format of
														
 
															+\key{select-instructions} pass deals with the differing format of
														
 
															 arithmetic operations. For example, in $C_0$ an addition operation
														
 
															 could take the following form:
														
 
															 \[
														
@@ -1424,9 +1423,9 @@ procedure.
 
															 \label{sec:assign-s0}
														
 
															 As discussed in Section~\ref{sec:plan-s0-x86}, the
														
 
															-\key{assign\_homes} pass places all of the variables on the stack.
														
 
															+\key{assign-homes} pass places all of the variables on the stack.
														
 
															 Consider again the example $S_0$ program $\BINOP{+}{52}{ \UNIOP{-}{10} }$,
														
 
															-which after \key{select\_instructions} looks like the following.
														
 
															+which after \key{select-instructions} looks like the following.
														
 
															 \[
														
 
															 \begin{array}{l}
														
 
															 (\key{movq}\;\INT{10}\; \VAR{x})\\
														
@@ -1436,7 +1435,7 @@ which after \key{select\_instructions} looks like the following.
 
															 \end{array}
														
 
															 \]
														
 
															 The one and only variable $x$ is assigned to stack location
														
 
															-\key{-8(\%rbp)}, so the \key{assign\_homes} pass translates the
														
 
															+\key{-8(\%rbp)}, so the \key{assign-homes} pass translates the
														
 
															 above to
														
 
															 \[
														
 
															 \begin{array}{l}
														
@@ -1463,7 +1462,7 @@ Consider again the following example.
 
															 \[
														
 
															 \LET{a}{42}{ \LET{b}{a}{ b }}
														
 
															 \]
														
 
															-After \key{assign\_homes} pass, the above has been translated to
														
 
															+After \key{assign-homes} pass, the above has been translated to
														
 
															 \[
														
 
															 \begin{array}{l}
														
 
															 (\key{movq} \;\INT{42}\; \STACKLOC{{-}8})\\
														
@@ -1490,7 +1489,16 @@ argument must be a register.
 
															 \section{Print x86}
														
 
															 \label{sec:print-x86}
														
 
															-[To do: talk about printing the AST to x86.]
														
 
															+The last step of the compiler from $S_0$ to x86-64 is to convert the
														
 
															+x86-64 AST (defined in Figure~\ref{fig:x86-ast-a}) to the string
														
 
															+representation (defined in Figure~\ref{fig:x86-a}). The Racket
														
 
															+\key{format} and \key{string-append} functions are useful in this
														
 
															+regard. The main work that this step needs to perform is to create the
														
 
															+\key{\_main} function and the standard instructions for its prelude
														
 
															+and conclusion, as described in Section~\ref{sec:x86-64}. You need to
														
 
															+know the number of stack-allocated variables, which is convenient to
														
 
															+compute in the \key{assign-homes} pass (Section~\ref{sec:assign-s0})
														
 
															+and then store in the $\itm{info}$ field of the \key{program}.
														
 
															 %% \section{Testing with Interpreters}
														
@@ -1903,11 +1911,11 @@ shown in Figure~\ref{fig:reg-alloc-passes}.
 
															 \begin{figure}[tbp]
														
 
															 \[
														
 
															 \xymatrix{
														
 
															-  C_0 \ar@/^/[r]^-{\key{select\_instr.}}
														
 
															-    & \text{x86}^{*} \ar[d]^-{\key{uncover\_live}} \\
														
 
															-    & \text{x86}^{*} \ar[d]^-{\key{build\_interference}} \\
														
 
															-    & \text{x86}^{*} \ar[d]_-{\key{allocate\_register}} \\
														
 
															-    & \text{x86}^{*} \ar@/^/[r]^-{\key{patch\_instr.}} 
														
 
															+  C_0 \ar@/^/[r]^-{\key{select-instr.}}
														
 
															+    & \text{x86}^{*} \ar[d]^-{\key{uncover-live}} \\
														
 
															+    & \text{x86}^{*} \ar[d]^-{\key{build-interference}} \\
														
 
															+    & \text{x86}^{*} \ar[d]_-{\key{allocate-registers}} \\
														
 
															+    & \text{x86}^{*} \ar@/^/[r]^-{\key{patch-instr.}} 
														
 
															     & \text{x86} 
														
 
															 }
														
 
															 \]
														
@@ -2116,8 +2124,19 @@ $S_1$.
 
															 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
														
 
															 \chapter{Appendix}
														
 
															+\section{Interpreters}
														
 
															+\label{appendix:interp}
														
 
															+
														
 
															+We provide several interpreters in the \key{interp.rkt} file.  The
														
 
															+\key{interp-scheme} function takes an AST in one of the Racket-like
														
 
															+languages considered in this book ($S_0, S_1, \ldots$) and interprets
														
 
															+the program, returning the result value.  The \key{interp-C} function
														
 
															+interprets an AST for a program in one of the C-like languages ($C_0,
														
 
															+C_1, \ldots$), and the \key{interp-x86} function interprets an AST for
														
 
															+an x86 program.
														
 
															+
														
 
															 \section{Utility Functions}
														
 
															-\label{sec:utilities}
														
 
															+\label{appendix:utilities}
														
 
															 The utility function described in this section can be found in the
														
 
															 \key{utilities.rkt} file.
														
@@ -2128,19 +2147,21 @@ Boolean \key{bool} is false.
 
															 (define (assert msg bool) ...)
														
 
															 \end{lstlisting}
														
 
															-The interp-tests function takes a compiler name (a string) a
														
 
															+The \key{interp-tests} function takes a compiler name (a string) a
														
 
															 description of the passes a test family name (a string), and a list of
														
 
															 test numbers, and runs the compiler passes and the interpreters to
														
 
															 check whether the passes correct. The description of the passes is a
														
 
															 list with one entry per pass.  An entry is a list with three things: a
														
 
															 string giving the name of the pass, the function that implements the
														
 
															 pass (a translator from AST to AST), and a function that implements
														
 
															-the interpreter (a function from AST to result value).  This function
														
 
															-assumes that the subdirectory \key{tests} has a bunch of Scheme
														
 
															-programs whose names all start with the family name, followed by an
														
 
															-underscore and then the test number, ending in \key{.scm}. Also, for
														
 
															-each Scheme program there is a file with the same number except that
														
 
															-it ends with \key{.in} that provides the input for the Scheme program.
														
 
															+the interpreter (a function from AST to result value).  The
														
 
															+interpreters from Appendix~\ref{appendix:interp} make a good choice.
														
 
															+The \key{interp-tests} function assumes that the subdirectory
														
 
															+\key{tests} has a bunch of Scheme programs whose names all start with
														
 
															+the family name, followed by an underscore and then the test number,
														
 
															+ending in \key{.scm}. Also, for each Scheme program there is a file
														
 
															+with the same number except that it ends with \key{.in} that provides
														
 
															+the input for the Scheme program.
														
 
															 \begin{lstlisting}
														
 
															 (define (interp-tests name passes test-family test-nums) ...
														
 
															 \end{lstlisting}