9 éve · ea77f167c6
--- a/book.tex
+++ b/book.tex
@@ -1,4 +1,4 @@
 
				-\documentclass[10pt]{book}
			
 
				+\documentclass[12pt]{book}
			
 
				 \usepackage[T1]{fontenc}
			
 
				 \usepackage[utf8]{inputenc}
			
 
				 \usepackage{lmodern}
			
@@ -52,7 +52,6 @@ basicstyle=\ttfamily%
 
				 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
			
 
				 
			
 
				 \newcommand{\itm}[1]{\ensuremath{\mathit{#1}}}
			
 
				-\newcommand{\Atom}{\itm{atom}}
			
 
				 \newcommand{\Stmt}{\itm{stmt}}
			
 
				 \newcommand{\Exp}{\itm{exp}}
			
 
				 \newcommand{\Ins}{\itm{instr}}
			
@@ -70,7 +69,7 @@ basicstyle=\ttfamily%
 
				 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
			
 
				 
			
 
				 \title{\Huge \textbf{Essentials of Compilation} \\ 
			
 
				-  \huge From Scheme to x86 Assembly}
			
 
				+  \huge An Incremental Approach}
			
 
				 
			
 
				 \author{\textsc{Jeremy G. Siek}
			
 
				    \thanks{\url{http://homes.soic.indiana.edu/jsiek/}}
			
@@ -209,9 +208,9 @@ the following produces $42$ (and not $-42$).
 
				 \[
			
 
				 \LET{x}{\READ}{ \LET{y}{\READ}{ \BINOP{-}{x}{y} } }
			
 
				 \]
			
 
				-The initializing expression of a \key{let} is always evaluated before
			
 
				-the body of the \key{let}, so in the above, the \key{read} for $x$ is
			
 
				-performed before the \key{read} for $y$.
			
 
				+The initializing expression is always evaluated before the body of the
			
 
				+\key{let}, so in the above, the \key{read} for $x$ is performed before
			
 
				+the \key{read} for $y$.
			
 
				 %
			
 
				 The behavior of the following program is somewhat subtle because
			
 
				 Scheme does not specify an evaluation order for arguments of an
			
@@ -238,38 +237,40 @@ language to compile $S_0$.
 
				 \section{x86-64 Assembly}
			
 
				 
			
 
				 An x86-64 program is a sequence of instructions. The instructions
			
 
				-manipulate a fixed number of variables called \emph{registers} and can
			
 
				-load and store values into \emph{memory}. Memory is a mapping of
			
 
				-64-bit addresses to 64-bit values. The syntax $n(r)$ is used to read
			
 
				-the address $a$ stored in register $r$ and then offset it by $n$,
			
 
				-producing the address $a + n$. The arithmetic instructions, such as
			
 
				-$\key{addq}\,s\,d$, read from the source $s$ and destination argument
			
 
				-$d$, apply the arithmetic operation, then stores the result in the
			
 
				-destination $d$. In this case, computing $d \gets d + s$.  The move
			
 
				-instruction, $\key{movq}\,s\,d$ reads from $s$ and stores the result
			
 
				-in $d$. The $\key{callq}\,\mathit{label}$ instruction executes the
			
 
				-function specified by the label, which we shall use to implement
			
 
				-\key{read}. Figure~\ref{fig:x86-a} defines the syntax for this
			
 
				-subset of the x86-64 assembly language.
			
 
				+manipulate 16 variables called \emph{registers} and can also load and
			
 
				+store values into \emph{memory}. Memory is a mapping of 64-bit
			
 
				+addresses to 64-bit values. The syntax $n(r)$ is used to read the
			
 
				+address $a$ stored in register $r$ and then offset it by $n$ bytes (8
			
 
				+bits), producing the address $a + n$. The arithmetic instructions,
			
 
				+such as $\key{addq}\,s\,d$, read from the source $s$ and destination
			
 
				+argument $d$, apply the arithmetic operation, then stores the result
			
 
				+in the destination $d$. In this case, computing $d \gets d + s$.  The
			
 
				+move instruction, $\key{movq}\,s\,d$ reads from $s$ and stores the
			
 
				+result in $d$. The $\key{callq}\,\mathit{label}$ instruction executes
			
 
				+the procedure specified by the label, which we shall use to implement
			
 
				+\key{read}. Figure~\ref{fig:x86-a} defines the syntax for this subset
			
 
				+of the x86-64 assembly language.
			
 
				 
			
 
				 \begin{figure}[tbp]
			
 
				 \fbox{
			
 
				 \begin{minipage}{0.96\textwidth}
			
 
				 \[
			
 
				 \begin{array}{lcl}
			
 
				-\itm{register} &::=& \key{rax} \mid \key{rbx} \mid \key{rcx}
			
 
				+\itm{register} &::=& \key{rsp} \mid \key{rbp} \mid \key{rax} \mid \key{rbx} \mid \key{rcx}
			
 
				               \mid \key{rdx} \mid \key{rsi} \mid \key{rdi} \mid \\
			
 
				               && \key{r8} \mid \key{r9} \mid \key{r10}
			
 
				               \mid \key{r11} \mid \key{r12} \mid \key{r13}
			
 
				               \mid \key{r14} \mid \key{r15} \\
			
 
				-\Arg &::=&  \Int \mid \key{\%}\itm{register} \mid \Int(\key{\%}\itm{register}) \\ 
			
 
				+\Arg &::=&  \key{\$}\Int \mid \key{\%}\itm{register} \mid \Int(\key{\%}\itm{register}) \\ 
			
 
				 \Ins &::=& \key{addq} \; \Arg \; \Arg \mid 
			
 
				       \key{subq} \; \Arg \; \Arg \mid 
			
 
				       \key{imulq} \; \Arg \; \Arg \mid 
			
 
				       \key{negq} \; \Arg \mid \\
			
 
				   && \key{movq} \; \Arg \; \Arg \mid 
			
 
				-      \key{callq} \; \mathit{label} \\
			
 
				-\Prog &::= & \Ins^{*}
			
 
				+      \key{callq} \; \mathit{label} \mid
			
 
				+      \key{pushq}\;\Arg \mid \key{popq};\Arg \mid \key{retq} \\
			
 
				+\Prog &::= & \key{.globl \_main}\\
			
 
				+      &    & \key{\_main:} \; \Ins^{+} 
			
 
				 \end{array}
			
 
				 \]
			
 
				 \end{minipage}
			
@@ -278,7 +279,21 @@ subset of the x86-64 assembly language.
 
				 \label{fig:x86-a}
			
 
				 \end{figure}
			
 
				 
			
 
				-\begin{figure}[tbp]
			
 
				+Figure~\ref{fig:p0-x86} depicts an x86-64 program that is equivalent
			
 
				+to $\BINOP{+}{10}{32}$. The \key{globl} directive says that the
			
 
				+\key{\_main} procedure is externally visible, which is necessary so
			
 
				+that the operating system can call it. The label \key{\_main:}
			
 
				+indicates the beginning of the \key{\_main} procedure.  The
			
 
				+instruction $\key{movq}\,\$10, \%\key{rax}$ puts $10$ into the
			
 
				+register \key{rax}. The following instruction $\key{addq}\,\key{\$}32,
			
 
				+\key{\%rax}$ adds $32$ to the $10$ in \key{rax} and puts the result,
			
 
				+$42$, back into \key{rax}. The instruction \key{retq} finishes the
			
 
				+\key{\_main} function by returning the integer in the \key{rax}
			
 
				+register to the operating system.
			
 
				+
			
 
				+\begin{figure}[htbp]
			
 
				+\centering
			
 
				+\begin{minipage}{0.6\textwidth}
			
 
				 \begin{lstlisting}
			
 
				 	.globl _main
			
 
				 _main:
			
@@ -286,16 +301,132 @@ _main:
 
				 	addq	$32, %rax
			
 
				 	retq
			
 
				 \end{lstlisting}
			
 
				-\caption{A simple x86-64 program equivalent to $(+ \; 10 \; 32)$.}
			
 
				+\end{minipage}
			
 
				+\caption{A simple x86-64 program equivalent to $\BINOP{+}{10}{32}$.}
			
 
				 \label{fig:p0-x86}
			
 
				 \end{figure}
			
 
				 
			
 
				+The next example exhibits the use of memory.  Figure~\ref{fig:p1-x86}
			
 
				+lists an x86-64 program that is equivalent to $\BINOP{+}{52}{
			
 
				+  \UNIOP{-}{10} }$. To understand how this x86-64 program uses memory,
			
 
				+we need to explain a region of memory called called the
			
 
				+\emph{procedure call stack} (\emph{stack} for short). The stack
			
 
				+consists of a separate \emph{frame} for each procedure call. The
			
 
				+memory layout for an individual frame is shown in
			
 
				+Figure~\ref{fig:frame}.  The register \key{rsp} is called the
			
 
				+\emph{stack pointer} and points to the item at the top of the
			
 
				+stack. The stack grows downward in memory, so we increase the size of
			
 
				+the stack by subtracting from the stack pointer. The frame size is
			
 
				+required to be a multiple of 16 bytes. The register \key{rbp} is the
			
 
				+\emph{base pointer} which serves two purposes: 1) it saves the
			
 
				+location of the stack pointer for the procedure that called the
			
 
				+current one and 2) it is used to access variables associated with the
			
 
				+current procedure. We number the variables from $1$ to $n$. Variable
			
 
				+$1$ is stored at address $-8\key{(\%rbp)}$, variable $2$ at
			
 
				+$-16\key{(\%rbp)}$, etc.
			
 
				+
			
 
				+\begin{figure}
			
 
				+\centering
			
 
				+\begin{minipage}{0.6\textwidth}
			
 
				+\begin{lstlisting}
			
 
				+	.globl _main
			
 
				+_main:
			
 
				+	pushq	%rbp
			
 
				+	movq	%rsp, %rbp
			
 
				+	subq	$16, %rsp
			
 
				+
			
 
				+	movq	$10, -8(%rbp)
			
 
				+	negq	-8(%rbp)
			
 
				+	movq	$52, %rax
			
 
				+	addq	-8(%rbp), %rax
			
 
				+
			
 
				+	addq	$16, %rsp
			
 
				+	popq	%rbp
			
 
				+	retq
			
 
				+\end{lstlisting}
			
 
				+\end{minipage}
			
 
				+\caption{An x86-64 program equivalent to $\BINOP{+}{52}{\UNIOP{-}{10} }$.}
			
 
				+\label{fig:p1-x86}
			
 
				+\end{figure}
			
 
				+
			
 
				+
			
 
				+\begin{figure}
			
 
				+\centering
			
 
				+\begin{tabular}{|r|l|} \hline
			
 
				+Position & Contents \\ \hline
			
 
				+8(\key{\%rbp}) & return address \\
			
 
				+0(\key{\%rbp}) & old \key{rbp} \\
			
 
				+-8(\key{\%rbp}) & variable $1$ \\
			
 
				+-16(\key{\%rbp}) & variable $2$ \\
			
 
				+ \ldots  & \ldots \\
			
 
				+0(\key{\%rsp}) & variable $n$\\ \hline
			
 
				+\end{tabular}
			
 
				+
			
 
				+\caption{Memory layout of a frame.}
			
 
				+\label{fig:frame}
			
 
				+\end{figure}
			
 
				+
			
 
				+Getting back to the program in Figure~\ref{fig:p1-x86}, the first
			
 
				+three instructions are the typical prelude for a procedure.  The
			
 
				+instruction \key{pushq \%rbp} saves the base pointer for the procedure
			
 
				+that called the current one onto the stack and subtracts $8$ from the
			
 
				+stack pointer. The second instruction \key{movq \%rsp, \%rbp} changes
			
 
				+the base pointer to the top of the stack. The instruction \key{subq
			
 
				+  \$16, \%rsp} moves the stack pointer down to make enough room for
			
 
				+storing variables.  This program just needs one variable ($8$ bytes)
			
 
				+but because the frame size is required to be a multiple of 16 bytes,
			
 
				+it rounds to 16 bytes.
			
 
				+
			
 
				+The next four instructions carry out the work of computing
			
 
				+$\BINOP{+}{52}{\UNIOP{-}{10} }$. The first instruction \key{movq \$10,
			
 
				+  -8(\%rbp)} stores $10$ in variable $1$. The instruction \key{negq
			
 
				+  -8(\%rbp)} changes variable $1$ to $-10$. The \key{movq \$52, \%rax}
			
 
				+places $52$ in the register \key{rax} and \key{addq -8(\%rbp), \%rax}
			
 
				+adds the contents of variable $1$ to \key{rax}, at which point
			
 
				+\key{rax} contains $42$.
			
 
				+
			
 
				+The last three instructions are the typical conclusion of a procedure.
			
 
				+The \key{addq \$16, \%rsp} instruction moves the stack pointer back to
			
 
				+point at the old base pointer. The amount added here needs to match
			
 
				+the amount that was subtracted in the prelude of the procedure.  Then
			
 
				+\key{popq \%rbp} returns the old base pointer to \key{rbp} and adds
			
 
				+$8$ to the stack pointer.  The \key{retq} instruction jumps back to
			
 
				+the procedure that called this one and subtracts 8 from the stack
			
 
				+pointer.
			
 
				+
			
 
				+\section{Planning the route from $S_0$ to x86-64}
			
 
				+
			
 
				+To compile one language to another it helps to focus on the
			
 
				+differences between the two languages. It is these differences that
			
 
				+the compiler will need to bridge. What are the differences between
			
 
				+$S_0$ and x86-64 assembly? Here we list some of the most important the
			
 
				+differences.
			
 
				+
			
 
				+\begin{enumerate}
			
 
				+\item Variables in $S_0$ can overshadow other variables with the same
			
 
				+  name. The registers and memory locations of x86-64 all have unique
			
 
				+  names.
			
 
				+
			
 
				+\item An argument to an $S_0$ operator can be any expression, whereas
			
 
				+  x86-64 instructions restrict their arguments to integers, registers,
			
 
				+  and memory locations.
			
 
				+
			
 
				+\item x86-64 arithmetic instructions typically take two arguments and
			
 
				+  update the second argument in place. In contrast, $S_0$ arithmetic
			
 
				+  operations only read their arguments and produce a new value.
			
 
				+
			
 
				+\item An $S_0$ program can have any number of variables whereas x86-64
			
 
				+  has only 16 registers.
			
 
				+\end{enumerate}
			
 
				+
			
 
				+
			
 
				+
			
 
				 \section{An intermediate C-like language}
			
 
				 
			
 
				 \[
			
 
				 \begin{array}{lcl}
			
 
				-\Atom &::=& \Int \mid \Var \\
			
 
				-\Exp &::=& \Atom \mid (\Op \; \Atom^{*})\\
			
 
				+\Arg &::=& \Int \mid \Var \\
			
 
				+\Exp &::=& \Arg \mid (\Op \; \Arg^{*})\\
			
 
				 \Stmt &::=& (\key{assign} \; \Var \; \Exp) \mid (\key{return}\; \Exp)
			
 
				 \end{array}
			
 
				 \]