3 년 전 · 8e8192cb07
--- a/book.tex
+++ b/book.tex
@@ -3996,14 +3996,26 @@ all, fast code is useless if it produces incorrect results!
 
				 
			
 
				 \index{subject}{register allocation}
			
 
				 
			
 
				-In Chapter~\ref{ch:Lvar} we learned how to store variables on the
			
 
				-stack. In this chapter we learn how to improve the performance of the
			
 
				-generated code by assigning some variables to registers.  The CPU can
			
 
				-access a register in a single cycle, whereas accessing the stack can
			
 
				-take 10s to 100s of cycles.  The program in Figure~\ref{fig:reg-eg}
			
 
				-serves as a running example. The source program is on the left and the
			
 
				-output of instruction selection is on the right. The program is almost
			
 
				-in the x86 assembly language but it still uses variables.
			
 
				+In Chapter~\ref{ch:Lvar} we compiled \LangVar{} to x86, storing
			
 
				+variables on the procedure call stack.  It can take 10s to 100s of
			
 
				+cycles for the CPU to access locations on the stack whereas accessing
			
 
				+a register takes only a single cycle. In this chapter we improve the
			
 
				+efficiency of our generated code by storing some variables in
			
 
				+registers. The goal of register allocation is to fit as many variables
			
 
				+into registers as possible. Some programs have more variables than
			
 
				+registers so we cannot always map each variable to a different
			
 
				+register. Fortunately, it is common for different variables to be
			
 
				+needed during different periods of time during program execution, and
			
 
				+in those cases we can map multiple variables to the same register.
			
 
				+
			
 
				+The program in Figure~\ref{fig:reg-eg} serves as a running
			
 
				+example. The source program is on the left and the output of
			
 
				+instruction selection is on the right. The program is almost in the
			
 
				+x86 assembly language but it still uses variables.  Consider variables
			
 
				+\code{x} and \code{z}.  After the variable \code{x} is moved to
			
 
				+\code{z} it is no longer needed.  Variable \code{z}, on the other
			
 
				+hand, is used only after this point, so \code{x} and \code{z} could
			
 
				+share the same register.
			
 
				 
			
 
				 \begin{figure}
			
 
				 \begin{minipage}{0.45\textwidth}
			
@@ -4075,30 +4087,22 @@ callq print_int
 
				 \label{fig:reg-eg}
			
 
				 \end{figure}
			
 
				 
			
 
				-The goal of register allocation is to fit as many variables into
			
 
				-registers as possible. Some programs have more variables than
			
 
				-registers so we cannot always map each variable to a different
			
 
				-register. Fortunately, it is common for different variables to be
			
 
				-needed during different periods of time during program execution, and
			
 
				-in such cases several variables can be mapped to the same register.
			
 
				-Consider variables \code{x} and \code{z} in Figure~\ref{fig:reg-eg}.
			
 
				-After the variable \code{x} is moved to \code{z} it is no longer
			
 
				-needed.  Variable \code{z}, on the other hand, is used only after this
			
 
				-point, so \code{x} and \code{z} could share the same register. The
			
 
				-topic of Section~\ref{sec:liveness-analysis-Lvar} is how to compute
			
 
				-where a variable is needed.  Once we have that information, we compute
			
 
				-which variables are needed at the same time, i.e., which ones
			
 
				-\emph{interfere} with each other, and represent this relation as an
			
 
				-undirected graph whose vertices are variables and edges indicate when
			
 
				-two variables interfere (Section~\ref{sec:build-interference}). We
			
 
				-then model register allocation as a graph coloring problem
			
 
				+The topic of Section~\ref{sec:liveness-analysis-Lvar} is how to
			
 
				+compute where a variable is needed.  Once we have that information, we
			
 
				+compute which variables are needed at the same time, i.e., which ones
			
 
				+\emph{interfere}\index{subject}{interfere} with each other, and
			
 
				+represent this relation as an undirected graph whose vertices are
			
 
				+variables and edges indicate when two variables interfere
			
 
				+(Section~\ref{sec:build-interference}). We then model register
			
 
				+allocation as a graph coloring problem
			
 
				 (Section~\ref{sec:graph-coloring}).
			
 
				 
			
 
				 If we run out of registers despite these efforts, we place the
			
 
				 remaining variables on the stack, similar to what we did in
			
 
				-Chapter~\ref{ch:Lvar}. It is common to use the verb \emph{spill} for
			
 
				-assigning a variable to a stack location. The decision to spill a
			
 
				-variable is handled as part of the graph coloring process.
			
 
				+Chapter~\ref{ch:Lvar}. It is common to use the verb
			
 
				+\emph{spill}\index{subject}{spill} for assigning a variable to a stack
			
 
				+location. The decision to spill a variable is handled as part of the
			
 
				+graph coloring process.
			
 
				 
			
 
				 We make the simplifying assumption that each variable is assigned to
			
 
				 one location (a register or stack address). A more sophisticated
			
@@ -4108,7 +4112,7 @@ in short sequence and then only used again after many other
 
				 instructions, it could be more efficient to assign the variable to a
			
 
				 register during the initial sequence and then move it to the stack for
			
 
				 the rest of its lifetime. We refer the interested reader to
			
 
				-\citet{Cooper:2011aa} Chapter 13 for more information about that
			
 
				+\citet{Cooper:2011aa} (Chapter 13) for more information about that
			
 
				 approach.
			
 
				 
			
 
				 % discuss prioritizing variables based on how much they are used.
			
@@ -4117,9 +4121,9 @@ approach.
 
				 \label{sec:calling-conventions}
			
 
				 \index{subject}{calling conventions}
			
 
				 
			
 
				-As we perform register allocation, we need to be aware of the
			
 
				-\emph{calling conventions} \index{subject}{calling conventions} that govern how
			
 
				-functions calls are performed in x86.
			
 
				+As we perform register allocation, we must be aware of the
			
 
				+\emph{calling conventions} \index{subject}{calling conventions} that
			
 
				+govern how functions calls are performed in x86.
			
 
				 %
			
 
				 Even though \LangVar{} does not include programmer-defined functions,
			
 
				 our generated code includes a \code{main} function that is called by
			
@@ -4154,8 +4158,7 @@ view, the caller view and the callee view:
 
				 \item The caller should assume that all the caller-saved registers get
			
 
				   overwritten with arbitrary values by the callee.  On the other hand,
			
 
				   the caller can safely assume that all the callee-saved registers
			
 
				-  contain the same values after the call that they did before the
			
 
				-  call.
			
 
				+  retain their original values.
			
 
				 \item The callee can freely use any of the caller-saved registers.
			
 
				   However, if the callee wants to use a callee-saved register, the
			
 
				   callee must arrange to put the original value back in the register
			
@@ -4165,8 +4168,10 @@ view, the caller view and the callee view:
 
				 \end{itemize}
			
 
				 
			
 
				 In x86, registers are also used for passing arguments to a function
			
 
				-and for the return value.  In particular, the first six arguments to a
			
 
				+and for the return value.  In particular, the first six arguments of a
			
 
				 function are passed in the following six registers, in this order.
			
 
				+\index{subject}{argument-passing registers}
			
 
				+\index{subject}{parameter-passing registers}
			
 
				 \begin{lstlisting}
			
 
				 rdi rsi rdx rcx r8 r9
			
 
				 \end{lstlisting}
			
@@ -4187,38 +4192,40 @@ The next question is how these calling conventions impact register
 
				 allocation. Consider the \LangVar{} program in
			
 
				 Figure~\ref{fig:example-calling-conventions}.  We first analyze this
			
 
				 example from the caller point of view and then from the callee point
			
 
				-of view.
			
 
				-
			
 
				-The program makes two calls to \READOP{}.  Also, the variable \code{x}
			
 
				-is in use during the second call to \READOP{}, so we need to make sure
			
 
				-that the value in \code{x} does not get accidentally wiped out by the
			
 
				-call to \READOP{}.  One obvious approach is to save all the values in
			
 
				-caller-saved registers to the stack prior to each function call, and
			
 
				-restore them after each call. That way, if the register allocator
			
 
				-chooses to assign \code{x} to a caller-saved register, its value will
			
 
				-be preserved across the call to \READOP{}.  However, saving and
			
 
				-restoring to the stack is relatively slow. If \code{x} is not used
			
 
				-many times, it may be better to assign \code{x} to a stack location in
			
 
				-the first place. Or better yet, if we can arrange for \code{x} to be
			
 
				-placed in a callee-saved register, then it won't need to be saved and
			
 
				-restored during function calls.
			
 
				-
			
 
				-The approach that we recommend for variables that are in use during a
			
 
				-function call is to either assign them to callee-saved registers or to
			
 
				-spill them to the stack. On the other hand, for variables that are not
			
 
				-in use during a function call, we try the following alternatives in
			
 
				-order 1) look for an available caller-saved register (to leave room
			
 
				-for other variables in the callee-saved register), 2) look for a
			
 
				-callee-saved register, and 3) spill the variable to the stack.
			
 
				+of view. We refer to a variable that is needed during a function call
			
 
				+as being a \emph{call-live variable}\index{subject}{call-live
			
 
				+  variable}.
			
 
				+
			
 
				+The program makes two calls to \READOP{}.  The variable \code{x} is
			
 
				+call-live because it is needed during the second call to \READOP{}; we
			
 
				+must ensure that the value in \code{x} does not get overwritten during
			
 
				+the call to \READOP{}.  One obvious approach is to save all the values
			
 
				+that reside in caller-saved registers to the stack prior to each
			
 
				+function call, and restore them after each call. That way, if the
			
 
				+register allocator chooses to assign \code{x} to a caller-saved
			
 
				+register, its value will be preserved across the call to \READOP{}.
			
 
				+However, saving and restoring to the stack is relatively slow. If
			
 
				+\code{x} is not used many times, it may be better to assign \code{x}
			
 
				+to a stack location in the first place. Or better yet, if we can
			
 
				+arrange for \code{x} to be placed in a callee-saved register, then it
			
 
				+won't need to be saved and restored during function calls.
			
 
				+
			
 
				+The approach that we recommend for call-live variables is to either
			
 
				+assign them to callee-saved registers or to spill them to the
			
 
				+stack. On the other hand, for variables that are not call-live, we try
			
 
				+the following alternatives in order 1) look for an available
			
 
				+caller-saved register (to leave room for other variables in the
			
 
				+callee-saved register), 2) look for a callee-saved register, and 3)
			
 
				+spill the variable to the stack.
			
 
				 
			
 
				 It is straightforward to implement this approach in a graph coloring
			
 
				-register allocator. First, we know which variables are in use during
			
 
				-every function call because we compute that information for every
			
 
				+register allocator. First, we know which variables are call-live
			
 
				+because we already need to compute which variables are needed at every
			
 
				 instruction (Section~\ref{sec:liveness-analysis-Lvar}). Second, when
			
 
				 we build the interference graph
			
 
				 (Section~\ref{sec:build-interference}), we can place an edge between
			
 
				-each of these call-live variables and the caller-saved registers in
			
 
				-the interference graph. This will prevent the graph coloring algorithm
			
 
				+each of the call-live variables and the caller-saved registers in the
			
 
				+interference graph. This will prevent the graph coloring algorithm
			
 
				 from assigning them to caller-saved registers.
			
 
				 
			
 
				 Returning to the example in
			
@@ -4227,8 +4234,8 @@ generated x86 code on the right-hand side. Notice that variable
 
				 \code{x} is assigned to \code{rbx}, a callee-saved register. Thus, it
			
 
				 is already in a safe place during the second call to
			
 
				 \code{read\_int}. Next, notice that variable \code{y} is assigned to
			
 
				-\code{rcx}, a caller-saved register, because \code{y} is not in the
			
 
				-live-after set of a \code{callq} instruction.
			
 
				+\code{rcx}, a caller-saved register, because \code{y} is not a
			
 
				+call-live variable.
			
 
				 
			
 
				 Next we analyze the example from the callee point of view, focusing on
			
 
				 the prelude and conclusion of the \code{main} function. As usual the