hace 4 años · 2229f55d7c
--- a/book.tex
+++ b/book.tex
@@ -134,7 +134,7 @@ showstringspaces=false
 
				 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
			
 
				 
			
 
				 \title{\Huge \textbf{Essentials of Compilation} \\
			
 
				-  \huge An Incremental Approach}
			
 
				+  \huge The Incremental, Nano-Pass Approach}
			
 
				 
			
 
				 \author{\textsc{Jeremy G. Siek} \\
			
 
				 %\thanks{\url{http://homes.soic.indiana.edu/jsiek/}} \\
			
@@ -188,15 +188,19 @@ The compiler course evolved to incorporate novel pedagogical ideas
 
				 while also including elements of effective real-world compilers.  One
			
 
				 of Friedman's ideas was to split the compiler into many small
			
 
				 ``passes'' so that the code for each pass would be easy to understood
			
 
				-in isolation.  (In contrast, most compilers of the time were organized
			
 
				+in isolation.  In contrast, most compilers of the time were organized
			
 
				 into only a few monolithic passes for reasons of compile-time
			
 
				-efficiency.)  Dybvig, with later help from his students Dipanwita
			
 
				-Sarkar and Andrew Keep, developed infrastructure to support this
			
 
				-approach and evolved the course, first to use smaller micro-passes and
			
 
				-then into even smaller
			
 
				-nano-passes~\citep{Sarkar:2004fk,Keep:2012aa}. I was a student in this
			
 
				-compiler course in the early 2000's as part of his Ph.D. studies at
			
 
				-Indiana University. Needless to say, I enjoyed the course immensely!
			
 
				+efficiency. Another idea, called ``the game'', was to test the code
			
 
				+generated by each pass on interpreters for each intermediate language,
			
 
				+thereby helping to pinpoint errors in individual passes.
			
 
				+%
			
 
				+Dybvig, with later help from his students Dipanwita Sarkar and Andrew
			
 
				+Keep, developed infrastructure to support this approach and evolved
			
 
				+the course, first to use smaller micro-passes and then into even
			
 
				+smaller nano-passes~\citep{Sarkar:2004fk,Keep:2012aa}. I was a student
			
 
				+in this compiler course in the early 2000's as part of my
			
 
				+Ph.D. studies at Indiana University. Needless to say, I enjoyed the
			
 
				+course immensely!
			
 
				 
			
 
				 During that time, another graduate student named Abdulaziz Ghuloum
			
 
				 observed that the front-to-back organization of the course made it
			
@@ -219,30 +223,28 @@ same content from the Indiana compiler course. I very much enjoyed
 
				 teaching the course organized in this way, and even better, many of
			
 
				 the students learned a lot and got excited about compilers.
			
 
				 
			
 
				-I returned to teach at Indiana University in 2013.  In my absence the
			
 
				-compiler course had switched from the front-to-back organization to a
			
 
				-back-to-front organization. Seeing how well the incremental approach
			
 
				-worked at Colorado, I started porting and adapting the structure of
			
 
				-the Colorado course back into the land of Scheme. In the meantime
			
 
				-Indiana University had moved on from Scheme to Racket, so the course
			
 
				-is now about compiling a subset of Racket (and Typed Racket) to the
			
 
				-x86 assembly language. The compiler is implemented in
			
 
				-Racket~\citep{plt-tr}.
			
 
				+I returned to Indiana University in 2013.  In my absence the compiler
			
 
				+course had switched from the front-to-back organization to a
			
 
				+back-to-front organization~\cite{Dybvig:2010aa}. Seeing how well the
			
 
				+incremental approach worked at Colorado, I started porting and
			
 
				+adapting the structure of the Colorado course back into the land of
			
 
				+Scheme. In the meantime Indiana University had moved on from Scheme to
			
 
				+Racket~\citep{plt-tr}, so the course is now about compiling a subset
			
 
				+of Racket (and Typed Racket) to the x86 assembly language.
			
 
				 
			
 
				 This is the textbook for the incremental version of the compiler
			
 
				-course at Indiana University (Spring 2016 - present) and it is the
			
 
				-first open textbook for an Indiana compiler course.  With this book I
			
 
				-hope to make the Indiana compiler course available to people that have
			
 
				-not had the chance to study compilers at Indiana University.  Many of
			
 
				-the compiler design decisions in this book are drawn from the
			
 
				-assignment descriptions of \cite{Dybvig:2010aa}. I have captured what
			
 
				-I think are the most important topics from \cite{Dybvig:2010aa} but
			
 
				-have omitted topics that are less interesting conceptually. I have
			
 
				-also made simplifications to reduce complexity.  In this way, this
			
 
				-book leans more towards pedagogy than towards the efficiency of the
			
 
				-generated code. Also, the book differs in places where we I the
			
 
				-opportunity to make the topics more fun, such as in relating register
			
 
				-allocation to Sudoku (Chapter~\ref{ch:register-allocation-r1}).
			
 
				+course at Indiana University (Spring 2016 - present).  With this book
			
 
				+I hope to make the Indiana compiler course available to people that
			
 
				+have not had the chance to study compilers at Indiana University.
			
 
				+
			
 
				+%% I have captured what
			
 
				+%% I think are the most important topics from \cite{Dybvig:2010aa} but
			
 
				+%% have omitted topics that are less interesting conceptually. I have
			
 
				+%% also made simplifications to reduce complexity.  In this way, this
			
 
				+%% book leans more towards pedagogy than towards the efficiency of the
			
 
				+%% generated code. Also, the book differs in places where we I the
			
 
				+%% opportunity to make the topics more fun, such as in relating register
			
 
				+%% allocation to Sudoku (Chapter~\ref{ch:register-allocation-r1}).
			
 
				 
			
 
				 \section*{Prerequisites}
			
 
				 
			
@@ -289,11 +291,12 @@ is to run a virtual machine with Linux as the guest operating system.
 
				 \section*{Acknowledgments}
			
 
				 
			
 
				 Many people have contributed to the ideas, techniques, and
			
 
				-organization of this book and have taught courses based on it. We
			
 
				-especially thank John Clements, Bor-Yuh Evan Chang, Kent Dybvig,
			
 
				-Daniel P. Friedman, Ronald Garcia, Abdulaziz Ghuloum, Andrew Keep, Jay
			
 
				-McCarthy, Nate Nystrom, Dipanwita Sarkar, Oscar Waddell, and Michael
			
 
				-Wollowski.
			
 
				+organization of this book and have taught courses based on it.  Many
			
 
				+of the compiler design decisions in this book are drawn from the
			
 
				+assignment descriptions of \cite{Dybvig:2010aa}.  We also would like
			
 
				+to thank John Clements, Bor-Yuh Evan Chang, Daniel P. Friedman, Ronald
			
 
				+Garcia, Abdulaziz Ghuloum, Jay McCarthy, Nate Nystrom, Dipanwita
			
 
				+Sarkar, Oscar Waddell, and Michael Wollowski.
			
 
				 
			
 
				 \mbox{}\\
			
 
				 \noindent Jeremy G. Siek \\
			
@@ -313,44 +316,37 @@ called \emph{concrete syntax}. We use concrete syntax to concisely
 
				 write down and talk about programs. Inside the compiler, we use
			
 
				 \emph{abstract syntax trees} (ASTs) to represent programs in a way
			
 
				 that efficiently supports the operations that the compiler needs to
			
 
				-perform.
			
 
				-\index{concrete syntax}
			
 
				-\index{abstract syntax}
			
 
				-\index{abstract syntax tree}
			
 
				-\index{AST}
			
 
				-\index{program}
			
 
				-\index{parse}
			
 
				-%
			
 
				-The translation from concrete syntax to abstract syntax is a process
			
 
				-called \emph{parsing}~\cite{Aho:1986qf}. We do not cover the theory
			
 
				-and implementation of parsing in this book. A parser is provided in
			
 
				-the supporting materials for translating from concrete syntax to
			
 
				-abstract syntax for the languages used in this book.
			
 
				+perform.\index{concrete syntax}\index{abstract syntax}\index{abstract
			
 
				+  syntax tree}\index{AST}\index{program}\index{parse} The translation
			
 
				+from concrete syntax to abstract syntax is a process called
			
 
				+\emph{parsing}~\citep{Aho:1986qf}. We do not cover the theory and
			
 
				+implementation of parsing in this book. A parser is provided in the
			
 
				+supporting materials for translating from concrete to abstract syntax.
			
 
				 
			
 
				 ASTs can be represented in many different ways inside the compiler,
			
 
				 depending on the programming language used to write the compiler.
			
 
				 %
			
 
				-We use Racket's \href{https://docs.racket-lang.org/guide/define-struct.html}{\code{struct}}
			
 
				+We use Racket's
			
 
				+\href{https://docs.racket-lang.org/guide/define-struct.html}{\code{struct}}
			
 
				 feature to represent ASTs (Section~\ref{sec:ast}). We use grammars to
			
 
				-define the abstract syntax of programming languages (Section~\ref{sec:grammar})
			
 
				-and pattern matching to inspect individual nodes in an AST
			
 
				-(Section~\ref{sec:pattern-matching}).  We use recursion to construct
			
 
				-and deconstruct entire ASTs (Section~\ref{sec:recursion}).  This
			
 
				-chapter provides an brief introduction to these ideas.
			
 
				-\index{struct}
			
 
				+define the abstract syntax of programming languages
			
 
				+(Section~\ref{sec:grammar}) and pattern matching to inspect individual
			
 
				+nodes in an AST (Section~\ref{sec:pattern-matching}).  We use
			
 
				+recursive functions to construct and deconstruct entire ASTs
			
 
				+(Section~\ref{sec:recursion}).  This chapter provides an brief
			
 
				+introduction to these ideas.  \index{struct}
			
 
				 
			
 
				 \section{Abstract Syntax Trees and Racket Structures}
			
 
				 \label{sec:ast}
			
 
				 
			
 
				-Compilers use abstract syntax trees to represent programs because
			
 
				-compilers often need to ask questions like: for a given part of a
			
 
				-program, what kind of language feature is it? What are the sub-parts
			
 
				-of this part of the program? Consider the program on the left and its
			
 
				-AST on the right. This program is an addition and it has two
			
 
				-sub-parts, a read operation and a negation. The negation has another
			
 
				-sub-part, the integer constant \code{8}. By using a tree to represent
			
 
				-the program, we can easily follow the links to go from one part of a
			
 
				-program to its sub-parts.
			
 
				+Compilers use abstract syntax trees to represent programs because they
			
 
				+often need to ask questions like: for a given part of a program, what
			
 
				+kind of language feature is it? What are its sub-parts? Consider the
			
 
				+program on the left and its AST on the right. This program is an
			
 
				+addition and it has two sub-parts, a read operation and a
			
 
				+negation. The negation has another sub-part, the integer constant
			
 
				+\code{8}. By using a tree to represent the program, we can easily
			
 
				+follow the links to go from one part of a program to its sub-parts.
			
 
				 \begin{center}
			
 
				 \begin{minipage}{0.4\textwidth}
			
 
				 \begin{lstlisting}
			
@@ -452,10 +448,10 @@ We say that the value created by \code{(Int 8)} is an
 
				 
			
 
				 The following is the \code{struct} definition for primitives operations.
			
 
				 \begin{lstlisting}
			
 
				-(struct Prim (op arg*))
			
 
				+(struct Prim (op args))
			
 
				 \end{lstlisting}
			
 
				 A primitive operation node includes an operator symbol \code{op}
			
 
				-and a list of children \code{arg*}. For example, to create
			
 
				+and a list of children \code{args}. For example, to create
			
 
				 an AST that negates the number $8$, we write \code{(Prim '- (list eight))}.
			
 
				 \begin{lstlisting}
			
 
				 (define neg-eight (Prim '- (list eight)))
			
@@ -521,13 +517,12 @@ instance of the \code{Int} structure is an expression:
 
				 \end{equation}
			
 
				 %
			
 
				 Each rule has a left-hand-side and a right-hand-side. The way to read
			
 
				-a rule is that if you have all the program parts on the
			
 
				-right-hand-side, then you can create an AST node and categorize it
			
 
				-according to the left-hand-side.
			
 
				+a rule is that if you have an AST node that matches the
			
 
				+right-hand-side, then you can categorize it according to the
			
 
				+left-hand-side.
			
 
				 %
			
 
				-A name such as $\Exp$ that is
			
 
				-defined by the grammar rules is a \emph{non-terminal}.
			
 
				-\index{non-terminal}
			
 
				+A name such as $\Exp$ that is defined by the grammar rules is a
			
 
				+\emph{non-terminal}.  \index{non-terminal}
			
 
				 %
			
 
				 The name $\Int$ is a also a non-terminal, but instead of defining it
			
 
				 with a grammar rule, we define it with the following explanation.  We
			
@@ -557,10 +552,9 @@ Symbols in typewriter font such as \key{-} and \key{read} are
 
				 the rule to be applicable.
			
 
				 \index{terminal}
			
 
				 
			
 
				-We can apply the rules to build ASTs in the $R_0$
			
 
				-language. For example, by rule \eqref{eq:arith-int}, \texttt{(Int 8)} is an
			
 
				-$\Exp$, then by rule \eqref{eq:arith-neg}, the following AST is
			
 
				-an $\Exp$.
			
 
				+We can apply these rules to build ASTs in the $R_0$ language. By rule
			
 
				+\eqref{eq:arith-int}, \texttt{(Int 8)} is an $\Exp$, then by rule
			
 
				+\eqref{eq:arith-neg}, the following AST is an $\Exp$.
			
 
				 \begin{center}
			
 
				 \begin{minipage}{0.4\textwidth}
			
 
				 \begin{lstlisting}
			
@@ -586,8 +580,8 @@ The next grammar rule defines addition expressions:
 
				 \end{equation}
			
 
				 We can now justify that the AST \eqref{eq:arith-prog} is an $\Exp$ in
			
 
				 $R_0$.  We know that \lstinline{(Prim 'read '())} is an $\Exp$ by rule
			
 
				-\eqref{eq:arith-read} and we have already shown that \code{(Prim '-
			
 
				-  (list (Int 8)))} is an $\Exp$, so we apply rule \eqref{eq:arith-add}
			
 
				+\eqref{eq:arith-read} and we have already categorized \code{(Prim '-
			
 
				+  (list (Int 8)))} as an $\Exp$, so we apply rule \eqref{eq:arith-add}
			
 
				 to show that
			
 
				 \begin{lstlisting}
			
 
				 (Prim '+ (list (Prim 'read '()) (Prim '- (list (Int 8)))))
			
@@ -636,7 +630,7 @@ Appendix~\ref{appendix:utilities} for more details.
 
				 \[
			
 
				 \begin{array}{rcl}
			
 
				 \begin{array}{rcl}
			
 
				-  \Exp &::=& \Int \mid (\key{read}) \mid (\key{-}\;\Exp) \mid (\key{+} \; \Exp\;\Exp)\\
			
 
				+  \Exp &::=& \Int \mid \LP\key{read}\RP \mid \LP\key{-}\;\Exp\RP \mid \LP\key{+} \; \Exp\;\Exp\RP\\
			
 
				   R_0 &::=& \Exp
			
 
				 \end{array}
			
 
				 \end{array}
			
@@ -704,14 +698,15 @@ Reference\footnote{\url{https://docs.racket-lang.org/reference/match.html}}
 
				 for a complete description of \code{match}.)
			
 
				 %
			
 
				 The body of a match clause may contain arbitrary Racket code.  The
			
 
				-pattern variables can be used in the scope of the body.
			
 
				+pattern variables can be used in the scope of the body, such as
			
 
				+\code{op} in \code{(print op)}.
			
 
				 
			
 
				 A \code{match} form may contain several clauses, as in the following
			
 
				-function \code{leaf?} that recognizes when an $R_0$ node is
			
 
				-a leaf. The \code{match} proceeds through the clauses in order,
			
 
				-checking whether the pattern can match the input AST. The
			
 
				-body of the first clause that matches is executed. The output of
			
 
				-\code{leaf?} for several ASTs is shown on the right.
			
 
				+function \code{leaf?} that recognizes when an $R_0$ node is a leaf in
			
 
				+the AST. The \code{match} proceeds through the clauses in order,
			
 
				+checking whether the pattern can match the input AST. The body of the
			
 
				+first clause that matches is executed. The output of \code{leaf?} for
			
 
				+several ASTs is shown on the right.
			
 
				 \begin{center}
			
 
				 \begin{minipage}{0.6\textwidth}
			
 
				 \begin{lstlisting}
			
@@ -719,8 +714,8 @@ body of the first clause that matches is executed. The output of
 
				   (match arith
			
 
				     [(Int n) #t]
			
 
				     [(Prim 'read '()) #t]
			
 
				-    [(Prim '- (list c1)) #f]
			
 
				-    [(Prim '+ (list c1 c2)) #f]))
			
 
				+    [(Prim '- (list e1)) #f]
			
 
				+    [(Prim '+ (list e1 e2)) #f]))
			
 
				 
			
 
				 (leaf? (Prim 'read '()))
			
 
				 (leaf? (Prim '- (list (Int 8))))
			
@@ -753,13 +748,13 @@ in the \code{leaf?} function, we refer to the grammar for $R_0$ in
 
				 Figure~\ref{fig:r0-syntax}. The $\Exp$ non-terminal has 4
			
 
				 alternatives, so the \code{match} has 4 clauses.  The pattern in each
			
 
				 clause corresponds to the right-hand side of a grammar rule. For
			
 
				-example, the pattern \code{(Prim '+ (list c1 c2))} corresponds to the
			
 
				+example, the pattern \code{(Prim '+ (list e1 e2))} corresponds to the
			
 
				 right-hand side $\ADD{\Exp}{\Exp}$. When translating from grammars to
			
 
				 patterns, replace non-terminals such as $\Exp$ with pattern variables
			
 
				-of your choice (e.g. \code{c1} and \code{c2}).
			
 
				+of your choice (e.g. \code{e1} and \code{e2}).
			
 
				 
			
 
				 
			
 
				-\section{Recursion}
			
 
				+\section{Recursive Functions}
			
 
				 \label{sec:recursion}
			
 
				 \index{recursive function}
			
 
				 
			
@@ -770,17 +765,16 @@ such a recursive function, we define \texttt{exp?} below, which takes
 
				 an arbitrary value and determines whether or not it is an $R_0$
			
 
				 expression.
			
 
				 %
			
 
				-When a recursive function is defined using a sequence of match clauses
			
 
				-that correspond to a grammar, and the body of each clause makes a
			
 
				-recursive call on each child node, then we say the function is defined
			
 
				-by \emph{structural recursion}\footnote{This principle of structuring
			
 
				-  code according to the data definition is advocated in the book
			
 
				-  \emph{How to Design Programs}
			
 
				-  \url{http://www.ccs.neu.edu/home/matthias/HtDP2e/}.}. Below we also
			
 
				-define a second function, named \code{R0?}, that determines whether a
			
 
				-value is an $R_0$ program.  In general we can expect to write one
			
 
				-recursive function to handle each non-terminal in a grammar.
			
 
				-\index{structural recursion}
			
 
				+We say that a function is defined by \emph{structural recursion} when
			
 
				+it is defined using a sequence of match clauses that correspond to a
			
 
				+grammar, and the body of each clause makes a recursive call on each
			
 
				+child node.\footnote{This principle of structuring code according to
			
 
				+  the data definition is advocated in the book \emph{How to Design
			
 
				+    Programs}\url{http://www.ccs.neu.edu/home/matthias/HtDP2e/}.}.
			
 
				+Below we also define a second function, named \code{R0?}, that
			
 
				+determines whether an AST is an $R_0$ program.  In general we can
			
 
				+expect to write one recursive function to handle each non-terminal in
			
 
				+a grammar.\index{structural recursion}
			
 
				 %
			
 
				 \begin{center}
			
 
				 \begin{minipage}{0.7\textwidth}
			
@@ -850,27 +844,23 @@ it comes to the \code{Program} wrapper.  Yet this style is generally
 
				 %
			
 
				 For example, the above function is subtly wrong:
			
 
				 \lstinline{(R0? (Program '() (Program '() (Int 3))))}
			
 
				-will return true, when it should return false.
			
 
				-
			
 
				-%% NOTE FIXME - must check for consistency on this issue throughout.
			
 
				+would return true, when it should return false.
			
 
				 
			
 
				 
			
 
				 \section{Interpreters}
			
 
				 \label{sec:interp-R0}
			
 
				 \index{interpreter}
			
 
				 
			
 
				-The meaning, or semantics, of a program is typically defined in the
			
 
				+In general, the intended behavior of a program is defined by the
			
 
				 specification of the language. For example, the Scheme language is
			
 
				 defined in the report by \cite{SPERBER:2009aa}. The Racket language is
			
 
				-defined in its reference manual~\citep{plt-tr}. In this book we use an
			
 
				-interpreter to define the meaning of each language that we consider,
			
 
				-following Reynolds' advice~\citep{reynolds72:_def_interp}. An
			
 
				-interpreter that is designated (by some people) as the definition of a
			
 
				-language is called a \emph{definitional interpreter}.
			
 
				-\index{definitional interpreter}
			
 
				-We warm up by creating a definitional interpreter for the $R_0$ language, which
			
 
				-serves as a second example of structural recursion. The
			
 
				-\texttt{interp-R0} function is defined in
			
 
				+defined in its reference manual~\citep{plt-tr}. In this book we use
			
 
				+interpreters to specify each language that we consider. An interpreter
			
 
				+that is designated as the definition of a language is called a
			
 
				+\emph{definitional interpreter}~\citep{reynolds72:_def_interp}.
			
 
				+\index{definitional interpreter} We warm up by creating a definitional
			
 
				+interpreter for the $R_0$ language, which serves as a second example
			
 
				+of structural recursion. The \texttt{interp-R0} function is defined in
			
 
				 Figure~\ref{fig:interp-R0}. The body of the function is a match on the
			
 
				 input program followed by a call to the \lstinline{interp-exp} helper
			
 
				 function, which in turn has one match clause per grammar rule for
			
@@ -1197,7 +1187,7 @@ interpreters. We want to write down those common parts just once
 
				 instead of many times. A naive approach would be to have, for example,
			
 
				 the interpreter for $R_2$ handle all of the new features in that
			
 
				 language and then have a default case that dispatches to the
			
 
				-interpreter for $R_1$. The follow code sketches this idea.
			
 
				+interpreter for $R_1$. The following code sketches this idea.
			
 
				 \begin{center}
			
 
				   \begin{minipage}{0.45\textwidth}
			
 
				 \begin{lstlisting}
			
@@ -1242,7 +1232,7 @@ To make our intepreters extensible we need something called \emph{open
 
				 always invoke the ``top'' interpreter, even if the recursive call is
			
 
				 made from interpreters that are lower down.  Object-oriented languages
			
 
				 provide open recursion in the form of method overriding\index{method
			
 
				-  overriding}. The follow code sketches this idea for interpreting
			
 
				+  overriding}. The following code sketches this idea for interpreting
			
 
				 $R_1$ and $R_2$ using the
			
 
				 \href{https://docs.racket-lang.org/guide/classes.html}{\code{class}}
			
 
				 \index{class} feature of Racket.  We define one class for each
			
@@ -1297,12 +1287,13 @@ expression by creating an object of the $R_2$ class and sending it the
 
				 \begin{lstlisting}
			
 
				 (send (new interp-R2-class) interp-exp e0)
			
 
				 \end{lstlisting}
			
 
				-This will again hit the default case and dispatch to the
			
 
				-\code{interp-exp} method for $R_1$, which will handle the \code{-}
			
 
				-operator. But then for the recursive method call, it will dispatch
			
 
				-back to \code{interp-exp} for $R_2$, where the \code{If} will be
			
 
				-correctly handled. Thus, method overriding gives us the open recursion
			
 
				-that we need to implement our interpreters in an extensible way.
			
 
				+This will again hit the default case of \code{interp-exp} in $R_2$ and
			
 
				+dispatch to the \code{interp-exp} method for $R_1$, which will handle
			
 
				+the \code{-} operator. But then for the recursive method call, it will
			
 
				+dispatch back to \code{interp-exp} for $R_2$, where the \code{If} will
			
 
				+be correctly handled. Thus, method overriding gives us the open
			
 
				+recursion that we need to implement our interpreters in an extensible
			
 
				+way.
			
 
				 
			
 
				 \newpage
			
 
				 
			
@@ -3840,9 +3831,9 @@ shown in Figure~\ref{fig:reg-alloc-passes}.
 
				 \path[->,bend left=15] (R1-2) edge [above] node {\ttfamily\footnotesize remove-complex.} (R1-3);
			
 
				 \path[->,bend left=15] (R1-3) edge [right] node {\ttfamily\footnotesize explicate-control} (C0-1);
			
 
				 \path[->,bend right=15] (C0-1) edge [left] node {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				-\path[->,bend left=15] (x86-2) edge [right] node {\ttfamily\footnotesize\color{red} uncover-live} (x86-2-1);
			
 
				-\path[->,bend right=15] (x86-2-1) edge [below] node {\ttfamily\footnotesize\color{red} build-inter.} (x86-2-2);
			
 
				-\path[->,bend right=15] (x86-2-2) edge [right] node {\ttfamily\footnotesize\color{red} allocate-reg.} (x86-3);
			
 
				+\path[->,bend left=15] (x86-2) edge [right] node {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				+\path[->,bend right=15] (x86-2-1) edge [below] node {\ttfamily\footnotesize build-inter.} (x86-2-2);
			
 
				+\path[->,bend right=15] (x86-2-2) edge [right] node {\ttfamily\footnotesize allocate-reg.} (x86-3);
			
 
				 \path[->,bend left=15] (x86-3) edge [above] node {\ttfamily\footnotesize patch-instr.} (x86-4);
			
 
				 \path[->,bend left=15] (x86-4) edge [right] node {\ttfamily\footnotesize print-x86} (x86-5);
			
 
				 \end{tikzpicture}
			
@@ -5518,17 +5509,17 @@ Test your compiler using your previously created programs on the
 
				 \node (x86-2-1) at (3,-4)  {\large $\text{x86}^{*}_1$};
			
 
				 \node (x86-2-2) at (6,-4)  {\large $\text{x86}^{*}_1$};
			
 
				 
			
 
				-\path[->,bend left=15] (R2) edge [above] node {\ttfamily\footnotesize\color{red} type-check} (R2-2);
			
 
				-\path[->,bend left=15] (R2-2) edge [above] node {\ttfamily\footnotesize\color{red} shrink} (R2-3);
			
 
				+\path[->,bend left=15] (R2) edge [above] node {\ttfamily\footnotesize type-check} (R2-2);
			
 
				+\path[->,bend left=15] (R2-2) edge [above] node {\ttfamily\footnotesize shrink} (R2-3);
			
 
				 \path[->,bend left=15] (R2-3) edge [above] node {\ttfamily\footnotesize uniquify} (R2-4);
			
 
				 \path[->,bend left=15] (R2-4) edge [above] node {\ttfamily\footnotesize remove-complex.} (R2-5);
			
 
				-\path[->,bend left=15] (R2-5) edge [left] node {\ttfamily\footnotesize\color{red} explicate-control} (C1-1);
			
 
				-\path[->,bend right=15] (C1-1) edge [left] node {\ttfamily\footnotesize\color{red} select-instructions} (x86-2);
			
 
				-\path[->,bend left=15] (x86-2) edge [right] node {\ttfamily\footnotesize\color{red} uncover-live} (x86-2-1);
			
 
				+\path[->,bend left=15] (R2-5) edge [left] node {\ttfamily\footnotesize explicate-control} (C1-1);
			
 
				+\path[->,bend right=15] (C1-1) edge [left] node {\ttfamily\footnotesize select-instructions} (x86-2);
			
 
				+\path[->,bend left=15] (x86-2) edge [right] node {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				 \path[->,bend right=15] (x86-2-1) edge [below] node {\ttfamily\footnotesize build-inter.} (x86-2-2);
			
 
				 \path[->,bend right=15] (x86-2-2) edge [right] node {\ttfamily\footnotesize allocate-reg.} (x86-3);
			
 
				-\path[->,bend left=15] (x86-3) edge [above] node {\ttfamily\footnotesize\color{red} patch-instr.} (x86-4);
			
 
				-\path[->,bend left=15] (x86-4) edge [right] node {\ttfamily\footnotesize\color{red} print-x86 } (x86-5);
			
 
				+\path[->,bend left=15] (x86-3) edge [above] node {\ttfamily\footnotesize patch-instr.} (x86-4);
			
 
				+\path[->,bend left=15] (x86-4) edge [right] node {\ttfamily\footnotesize print-x86 } (x86-5);
			
 
				 \end{tikzpicture}
			
 
				 \caption{Diagram of the passes for $R_2$, a language with conditionals.}
			
 
				  \label{fig:R2-passes}
			
@@ -7029,18 +7020,18 @@ conclusion:
 
				 \node (x86-2-1) at (3,-4)  {\large $\text{x86}^{*}_2$};
			
 
				 \node (x86-2-2) at (6,-4)  {\large $\text{x86}^{*}_2$};
			
 
				 
			
 
				-%\path[->,bend left=15] (R3) edge [above] node {\ttfamily\footnotesize\color{red} type-check} (R3-2);
			
 
				+%\path[->,bend left=15] (R3) edge [above] node {\ttfamily\footnotesize type-check} (R3-2);
			
 
				 \path[->,bend left=15] (R3) edge [above] node {\ttfamily\footnotesize shrink} (R3-2);
			
 
				 \path[->,bend left=15] (R3-2) edge [above] node {\ttfamily\footnotesize uniquify} (R3-3);
			
 
				-\path[->,bend left=15] (R3-3) edge [above] node {\ttfamily\footnotesize\color{red} expose-alloc.} (R3-4);
			
 
				+\path[->,bend left=15] (R3-3) edge [above] node {\ttfamily\footnotesize expose-alloc.} (R3-4);
			
 
				 \path[->,bend left=15] (R3-4) edge [above] node {\ttfamily\footnotesize remove-complex.} (R3-5);
			
 
				 \path[->,bend left=20] (R3-5) edge [left] node {\ttfamily\footnotesize explicate-control} (C2-4);
			
 
				-\path[->,bend left=15] (C2-4) edge [right] node {\ttfamily\footnotesize\color{red} select-instr.} (x86-2);
			
 
				+\path[->,bend left=15] (C2-4) edge [right] node {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				 \path[->,bend right=15] (x86-2) edge [left] node {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				-\path[->,bend right=15] (x86-2-1) edge [below] node {\ttfamily\footnotesize\color{red} build-inter.} (x86-2-2);
			
 
				-\path[->,bend right=15] (x86-2-2) edge [right] node {\ttfamily\footnotesize\color{red} allocate-reg.} (x86-3);
			
 
				+\path[->,bend right=15] (x86-2-1) edge [below] node {\ttfamily\footnotesize build-inter.} (x86-2-2);
			
 
				+\path[->,bend right=15] (x86-2-2) edge [right] node {\ttfamily\footnotesize allocate-reg.} (x86-3);
			
 
				 \path[->,bend left=15] (x86-3) edge [above] node {\ttfamily\footnotesize patch-instr.} (x86-4);
			
 
				-\path[->,bend left=15] (x86-4) edge [right] node {\ttfamily\footnotesize\color{red} print-x86} (x86-5);
			
 
				+\path[->,bend left=15] (x86-4) edge [right] node {\ttfamily\footnotesize print-x86} (x86-5);
			
 
				 \end{tikzpicture}
			
 
				 \caption{Diagram of the passes for $R_3$, a language with tuples.}
			
 
				 \label{fig:R3-passes}
			
@@ -8253,8 +8244,8 @@ previously created test programs.
 
				 \begin{figure}[tbp]
			
 
				 \begin{tikzpicture}[baseline=(current  bounding  box.center)]
			
 
				 \node (R4) at (0,2)  {\large $R_4$};
			
 
				-\node (R4-2) at (3,2)  {\large $R_4$};
			
 
				-%\node (R4-3) at (6,2)  {\large $R_4$};
			
 
				+\node (R4-1) at (3,2)  {\large $R_4$};
			
 
				+\node (R4-2) at (6,2)  {\large $R_4$};
			
 
				 \node (F1-1) at (12,0)  {\large $F_1$};
			
 
				 \node (F1-2) at (9,0)  {\large $F_1$};
			
 
				 \node (F1-3) at (6,0)  {\large $F_1$};
			
@@ -8269,31 +8260,31 @@ previously created test programs.
 
				 \node (x86-2-1) at (3,-6)  {\large $\text{x86}^{*}_3$};
			
 
				 \node (x86-2-2) at (6,-6)  {\large $\text{x86}^{*}_3$};
			
 
				 
			
 
				-%\path[->,bend left=15] (R4) edge [above] node
			
 
				-%     {\ttfamily\footnotesize\color{red} type-check} (R4-2);
			
 
				 \path[->,bend left=15] (R4) edge [above] node
			
 
				+     {\ttfamily\footnotesize shrink} (R4-1);
			
 
				+\path[->,bend left=15] (R4-1) edge [above] node
			
 
				      {\ttfamily\footnotesize uniquify} (R4-2);
			
 
				 \path[->,bend left=15] (R4-2) edge [right] node
			
 
				-     {\ttfamily\footnotesize\color{red} reveal-functions} (F1-1);
			
 
				+     {\ttfamily\footnotesize ~~reveal-functions} (F1-1);
			
 
				 \path[->,bend left=15] (F1-1) edge [below] node
			
 
				-     {\ttfamily\footnotesize\color{red} limit-functions} (F1-2);
			
 
				+     {\ttfamily\footnotesize limit-functions} (F1-2);
			
 
				 \path[->,bend right=15] (F1-2) edge [above] node
			
 
				      {\ttfamily\footnotesize expose-alloc.} (F1-3);
			
 
				 \path[->,bend right=15] (F1-3) edge [above] node
			
 
				-     {\ttfamily\footnotesize\color{red} remove-complex.} (F1-4);
			
 
				+     {\ttfamily\footnotesize remove-complex.} (F1-4);
			
 
				 \path[->,bend left=15] (F1-4) edge [right] node
			
 
				-     {\ttfamily\footnotesize\color{red} explicate-control} (C3-2);
			
 
				+     {\ttfamily\footnotesize explicate-control} (C3-2);
			
 
				 \path[->,bend right=15] (C3-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} select-instr.} (x86-2);
			
 
				+     {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				 \path[->,bend left=15] (x86-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} uncover-live} (x86-2-1);
			
 
				+     {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				 \path[->,bend right=15] (x86-2-1) edge [below] node 
			
 
				-     {\ttfamily\footnotesize \color{red}build-inter.} (x86-2-2);
			
 
				+     {\ttfamily\footnotesize build-inter.} (x86-2-2);
			
 
				 \path[->,bend right=15] (x86-2-2) edge [left] node
			
 
				      {\ttfamily\footnotesize allocate-reg.} (x86-3);
			
 
				 \path[->,bend left=15] (x86-3) edge [above] node
			
 
				-     {\ttfamily\footnotesize\color{red} patch-instr.} (x86-4);
			
 
				-\path[->,bend right=15] (x86-4) edge [left] node {\ttfamily\footnotesize\color{red} print-x86} (x86-5);
			
 
				+     {\ttfamily\footnotesize patch-instr.} (x86-4);
			
 
				+\path[->,bend right=15] (x86-4) edge [left] node {\ttfamily\footnotesize print-x86} (x86-5);
			
 
				 \end{tikzpicture}
			
 
				 \caption{Diagram of the passes for $R_4$, a language with functions.}
			
 
				 \label{fig:R4-passes}
			
@@ -8949,19 +8940,19 @@ shift it by $57$ bits to the right.
 
				 \path[->,bend left=15] (R4-2) edge [above] node
			
 
				      {\ttfamily\footnotesize uniquify} (R4-3);
			
 
				 \path[->,bend left=15] (R4-3) edge [right] node
			
 
				-     {\ttfamily\footnotesize\color{red} reveal-functions} (F1-1);
			
 
				+     {\ttfamily\footnotesize reveal-functions} (F1-1);
			
 
				 \path[->,bend left=15] (F1-1) edge [below] node
			
 
				-     {\ttfamily\footnotesize\color{red} convert-to-clos.} (F1-2);
			
 
				+     {\ttfamily\footnotesize convert-to-clos.} (F1-2);
			
 
				 \path[->,bend right=15] (F1-2) edge [above] node
			
 
				      {\ttfamily\footnotesize limit-fun.} (F1-3);
			
 
				 \path[->,bend right=15] (F1-3) edge [above] node
			
 
				-     {\ttfamily\footnotesize\color{red} expose-alloc.} (F1-4);
			
 
				+     {\ttfamily\footnotesize expose-alloc.} (F1-4);
			
 
				 \path[->,bend right=15] (F1-4) edge [above] node
			
 
				      {\ttfamily\footnotesize remove-complex.} (F1-5);
			
 
				 \path[->,bend right=15] (F1-5) edge [right] node
			
 
				      {\ttfamily\footnotesize explicate-control} (C3-2);
			
 
				 \path[->,bend left=15] (C3-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} select-instr.} (x86-2);
			
 
				+     {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				 \path[->,bend right=15] (x86-2) edge [left] node
			
 
				      {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				 \path[->,bend right=15] (x86-2-1) edge [below] node 
			
@@ -9998,7 +9989,7 @@ The \code{analyze-dataflow} function has four parameters.
 
				 \node (x86-2-2) at (6,-6)  {\large $\text{x86}^{*}_3$};
			
 
				 
			
 
				 %% \path[->,bend left=15] (R4) edge [above] node
			
 
				-%%      {\ttfamily\footnotesize\color{red} type-check} (R4-2);
			
 
				+%%      {\ttfamily\footnotesize type-check} (R4-2);
			
 
				 \path[->,bend left=15] (R4) edge [above] node
			
 
				      {\ttfamily\footnotesize shrink} (R4-2);
			
 
				 \path[->,bend left=15] (R4-2) edge [above] node
			
@@ -10006,7 +9997,7 @@ The \code{analyze-dataflow} function has four parameters.
 
				 \path[->,bend left=15] (R4-3) edge [above] node
			
 
				      {\ttfamily\footnotesize reveal-functions} (R4-4);
			
 
				 \path[->,bend left=15] (R4-4) edge [right] node
			
 
				-     {\ttfamily\footnotesize\color{red} convert-assignments} (F1-1);
			
 
				+     {\ttfamily\footnotesize convert-assignments} (F1-1);
			
 
				 \path[->,bend left=15] (F1-1) edge [below] node
			
 
				      {\ttfamily\footnotesize convert-to-clos.} (F1-2);
			
 
				 \path[->,bend right=15] (F1-2) edge [above] node
			
@@ -10014,13 +10005,13 @@ The \code{analyze-dataflow} function has four parameters.
 
				 \path[->,bend right=15] (F1-3) edge [above] node
			
 
				      {\ttfamily\footnotesize expose-alloc.} (F1-4);
			
 
				 \path[->,bend right=15] (F1-4) edge [above] node
			
 
				-     {\ttfamily\footnotesize\color{red} remove-complex.} (F1-5);
			
 
				+     {\ttfamily\footnotesize remove-complex.} (F1-5);
			
 
				 \path[->,bend right=15] (F1-5) edge [right] node
			
 
				-     {\ttfamily\footnotesize\color{red} explicate-control} (C3-2);
			
 
				+     {\ttfamily\footnotesize explicate-control} (C3-2);
			
 
				 \path[->,bend left=15] (C3-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} select-instr.} (x86-2);
			
 
				+     {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				 \path[->,bend right=15] (x86-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} uncover-live} (x86-2-1);
			
 
				+     {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				 \path[->,bend right=15] (x86-2-1) edge [below] node 
			
 
				      {\ttfamily\footnotesize build-inter.} (x86-2-2);
			
 
				 \path[->,bend right=15] (x86-2-2) edge [left] node
			
@@ -11051,11 +11042,11 @@ completion without error.
 
				 \path[->,bend left=15] (R4-3) edge [above] node
			
 
				      {\ttfamily\footnotesize reveal-functions} (R4-4);
			
 
				 \path[->,bend right=15] (R4-4) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} cast-insert} (R4-5);
			
 
				+     {\ttfamily\footnotesize cast-insert} (R4-5);
			
 
				 \path[->,bend left=15] (R4-5) edge [above] node
			
 
				-     {\ttfamily\footnotesize\color{red} check-bounds} (R4-6);
			
 
				+     {\ttfamily\footnotesize check-bounds} (R4-6);
			
 
				 \path[->,bend left=15] (R4-6) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} reveal-casts} (R4-7);
			
 
				+     {\ttfamily\footnotesize reveal-casts} (R4-7);
			
 
				      
			
 
				 \path[->,bend left=15] (R4-7) edge [below] node
			
 
				      {\ttfamily\footnotesize convert-to-clos.} (F1-2);
			
@@ -11068,7 +11059,7 @@ completion without error.
 
				 \path[->,bend right=15] (F1-5) edge [right] node
			
 
				      {\ttfamily\footnotesize explicate-control} (C3-2);
			
 
				 \path[->,bend left=15] (C3-2) edge [left] node
			
 
				-     {\ttfamily\footnotesize\color{red} select-instr.} (x86-2);
			
 
				+     {\ttfamily\footnotesize select-instr.} (x86-2);
			
 
				 \path[->,bend right=15] (x86-2) edge [left] node
			
 
				      {\ttfamily\footnotesize uncover-live} (x86-2-1);
			
 
				 \path[->,bend right=15] (x86-2-1) edge [below] node