6 vuotta sitten · e188a12bf9
--- a/book.tex
+++ b/book.tex
@@ -1163,20 +1163,14 @@ the operating system starts executing this program.  The instruction
 
				 \lstinline{movq $10, %rax} puts $10$ into register \key{rax}. The
			
 
				 following instruction \lstinline{addq $32, %rax} adds $32$ to the
			
 
				 $10$ in \key{rax} and puts the result, $42$, back into
			
 
				-\key{rax}. Finally, the instruction \lstinline{movq %rax, %rdi} moves the value
			
 
				-in \key{rax} into another register, \key{rdi}, and
			
 
				-\lstinline{callq print_int} calls the external function \code{print\_int}, which
			
 
				-prints the value in \key{rdi}.
			
 
				-
			
 
				-The last two instructions---\lstinline{movq $0, %rax} and \key{retq}---finish
			
 
				-the \key{main} function by returning the integer in \key{rax} to the
			
 
				-operating system. The operating system interprets this integer as the program's
			
 
				-exit code. By convention, an exit code of 0 indicates the program was
			
 
				-successful, and all other exit codes indicate various errors. To ensure that
			
 
				-we successfully communicate with the operating system, we explicitly move 0
			
 
				-into \key{rax}, lest the previous value in \key{rax} be misinterpreted as an
			
 
				-error code.
			
 
				+  \key{rax}.
			
 
				 
			
 
				+The last instruction, \key{retq}, finishes the \key{main} function by
			
 
				+returning the integer in \key{rax} to the operating system. The
			
 
				+operating system interprets this integer as the program's exit
			
 
				+code. By convention, an exit code of 0 indicates the program was
			
 
				+successful, and all other exit codes indicate various errors.
			
 
				+Nevertheless, we return the result of the program as the exit code.
			
 
				 
			
 
				 %\begin{wrapfigure}{r}{2.25in}
			
 
				 \begin{figure}[tbp]
			
@@ -1185,24 +1179,18 @@ error code.
 
				 main:
			
 
				 	movq	$10, %rax
			
 
				 	addq	$32, %rax
			
 
				-	movq	%rax, %rdi
			
 
				-	callq	print_int
			
 
				-	movq    $0, %rax
			
 
				 	retq
			
 
				 \end{lstlisting}
			
 
				 \caption{An x86 program equivalent to $\BINOP{+}{10}{32}$.}
			
 
				 \label{fig:p0-x86}
			
 
				 %\end{wrapfigure}
			
 
				 \end{figure}
			
 
				-%% \margincomment{Consider using italics for the texts in these figures.
			
 
				-%%   It can get confusing to differentiate them from the main text.}
			
 
				-%% It looks pretty ugly in italics.-Jeremy
			
 
				 
			
 
				-Unfortunately, x86 varies in a couple ways depending on what operating system it
			
 
				-is assembled in. The code examples shown here are correct on Linux and most
			
 
				-Unix-like platforms, but when assembled on Mac OS X, labels like \key{main} must
			
 
				-be prefixed with an underscore.  So the correct output for the above program on
			
 
				-Mac would begin with:
			
 
				+Unfortunately, x86 varies in a couple ways depending on what operating
			
 
				+system it is assembled in. The code examples shown here are correct on
			
 
				+Linux and most Unix-like platforms, but when assembled on Mac OS X,
			
 
				+labels like \key{main} must be prefixed with an underscore.  So the
			
 
				+correct output for the above program on Mac would begin with:
			
 
				 \begin{lstlisting}
			
 
				 	.globl _main
			
 
				 _main:
			
@@ -1243,11 +1231,8 @@ main:
 
				 	negq	-8(%rbp)
			
 
				 	movq	$52, %rax
			
 
				 	addq	-8(%rbp), %rax
			
 
				-	movq	%rax, %rdi
			
 
				-	callq	print_int
			
 
				 
			
 
				 	addq	$16, %rsp
			
 
				-	movq    $0, %rax
			
 
				 	popq	%rbp
			
 
				 	retq
			
 
				 \end{lstlisting}
			
@@ -1352,23 +1337,26 @@ $R_1$ and x86 assembly? Here we list some of the most important the
 
				 differences.
			
 
				 
			
 
				 \begin{enumerate}
			
 
				-\item[(a)] x86 arithmetic instructions typically take two arguments and
			
 
				-  update the second argument in place. In contrast, $R_1$ arithmetic
			
 
				-  operations take two arguments and produce a new value.
			
 
				-
			
 
				-\item[(b)] An argument to an $R_1$ operator can be any expression, whereas
			
 
				-  x86 instructions restrict their arguments to integers, registers,
			
 
				-  and memory locations.
			
 
				-
			
 
				-\item[(c)] The order of execution in x86 is explicit in the syntax: a
			
 
				+\item[(a)] x86 arithmetic instructions typically take two arguments
			
 
				+  and update the second argument in place. In contrast, $R_1$
			
 
				+  arithmetic operations take two arguments and produce a new value.
			
 
				+  An x86 instruction may have at most one memory-accessing argument
			
 
				+  and some instructions place further restrictions on the kinds of
			
 
				+  their arguments.
			
 
				+
			
 
				+\item[(b)] An argument to an $R_1$ operator can be any expression,
			
 
				+  whereas x86 instructions restrict their arguments to simple things
			
 
				+  like integers, registers, and memory locations.
			
 
				+
			
 
				+\item[(d)] The order of execution in x86 is explicit in the syntax: a
			
 
				   sequence of instructions, whereas in $R_1$ it is a left-to-right
			
 
				   depth-first traversal of the abstract syntax tree.
			
 
				 
			
 
				-\item[(d)] An $R_1$ program can have any number of variables whereas x86
			
 
				-  has only 16 registers.
			
 
				+\item[(e)] An $R_1$ program can have any number of variables whereas
			
 
				+  x86 has only 16 registers.
			
 
				 
			
 
				-\item[(e)] Variables in $R_1$ can overshadow other variables with the same
			
 
				-  name. The registers and memory locations of x86 all have unique
			
 
				+\item[(f)] Variables in $R_1$ can overshadow other variables with the
			
 
				+  same name. The registers and memory locations of x86 all have unique
			
 
				   names.
			
 
				 \end{enumerate}
			
 
				 
			
@@ -1382,31 +1370,45 @@ orders will be better so often some trial-and-error is
 
				 involved. However, we can try to plan ahead and choose the orderings
			
 
				 based on this planning.
			
 
				 
			
 
				+% (b) -> (e)
			
 
				 For example, to handle difference (b) (nested expressions), we shall
			
 
				-introduce temporary variables to hold the intermediate results
			
 
				-of each subexpression.  To deal with difference (d) we
			
 
				-will be replacing variables with registers and/or stack
			
 
				-locations. Thus, it makes sense to deal with (b) before (d) so that
			
 
				-(d) can replace both the original variables and the new ones. Next,
			
 
				-consider where (a) should fit in. Because it has to do with the format
			
 
				-of x86 instructions, it makes more sense after we have removed the
			
 
				-nested expressions (b). What about (c), order of execution?
			
 
				+introduce temporary variables to hold the intermediate results of each
			
 
				+subexpression.  To deal with difference (e) we will be replacing
			
 
				+variables with registers and/or stack locations. Thus, it makes sense
			
 
				+to deal with (b) before (e) so that (e) can replace both the original
			
 
				+variables and the new ones.
			
 
				+%
			
 
				+% (b) -> (a) ??
			
 
				+Next, consider where (a) should fit in. Because it has to do with the
			
 
				+format of x86 instructions, it makes more sense after we have removed
			
 
				+the nested expressions (b).
			
 
				+
			
 
				+What about (c), order of execution?
			
 
				 
			
 
				 UNDER CONSTRUCTION
			
 
				 
			
 
				+% (e) -> (b)
			
 
				 Finally, when should we deal with (e) (variable overshadowing)?  We
			
 
				 shall solve this problem by renaming variables to make sure they have
			
 
				-unique names. Recall that our plan for (b) involves moving nested
			
 
				-expressions, which could be problematic if it changes the shadowing of
			
 
				+unique names. Recall that our plan for (b) involves moving 
			
 
				+expressions, which could be problematic if a move changes the shadowing of
			
 
				 variables. However, if we deal with (e) first, then it will not be an
			
 
				-issue.  Thus, we arrive at the following ordering.
			
 
				+issue. Of course, this means that during (b), when we insert temporary
			
 
				+variables, we need to make sure that they are unique.
			
 
				+%
			
 
				+
			
 
				+What about the ordering of (a) (instr. sel) and (e) (register allocation)?
			
 
				+
			
 
				+Thus, we arrive at the following ordering.
			
 
				+
			
 
				+[ordering of reg. alloc versus instr. sel? -jeremy]
			
 
				 \[
			
 
				 \begin{tikzpicture}[baseline=(current  bounding  box.center)]
			
 
				-\foreach \i/\p in {1/1,2/2,3/3,4/4,5/5}
			
 
				+\foreach \i/\p in {1/1,2/2,3/3,4/4,5/5,6/6}
			
 
				 {
			
 
				   \node (\i) at (\p*1.5,0) {$\bullet$};
			
 
				 }
			
 
				-\foreach \x/\y/\lbl in {1/2/a,2/3/b,3/4/c,4/5/d}
			
 
				+\foreach \x/\y/\lbl in {1/2/e,2/3/b,3/4/c,4/5/a,5/6/d}
			
 
				 {
			
 
				   \path[->,bend left=15] (\x) edge [above] node {\small\lbl} (\y);
			
 
				 }
			
@@ -1470,7 +1472,7 @@ C_0 & ::= & (\key{program}\;(\Var^{*})\;\Stmt^{+})
 
				 \end{figure}
			
 
				 
			
 
				 To get from $C_0$ to x86 assembly, it remains for us to handle
			
 
				-difference \#1 (the format of instructions) and difference (d)
			
 
				+difference (a) (the format of instructions) and difference (d)
			
 
				 (variables versus stack locations and registers). These two
			
 
				 differences are intertwined, creating a bit of a Gordian Knot. To
			
 
				 handle difference (d), we need to map some variables to registers