2 years ago · c39fffb38a
--- a/book.tex
+++ b/book.tex
@@ -4949,20 +4949,20 @@ state $0$. Then repeat the following, looking at the next input token.
 
															   symbols in the right-hand side of the rule being reduced. Jump to
														
 
															   the state at the top of the stack and then follow the goto edge for
														
 
															   the nonterminal that matches the left-hand side of the rule that we
														
 
															-  reducing by. Push the edge's target state and the nonterminal on the
														
 
															+  are reducing by. Push the edge's target state and the nonterminal on the
														
 
															   stack.
														
 
															 \end{itemize}
														
 
															-Notice that in state 6 of Figure~\ref{fig:shift-reduce} there is both
														
 
															+Notice that in state 6 of figure~\ref{fig:shift-reduce} there is both
														
 
															 a shift and a reduce action for the token \lstinline{PLUS}, so the
														
 
															 algorithm does not know which action to take in this case. When a
														
 
															 state has both a shift and a reduce action for the same token, we say
														
 
															 there is a \emph{shift/reduce conflict}.  In this case, the conflict
														
 
															-will arise, for example, when trying to parse the input
														
 
															-\lstinline{print 1 + 2 + 3}. After having consumed \lstinline{print 1 + 2}
														
 
															-the parser will be in state 6, and it will not know whether to
														
 
															-reduce to form an \code{exp} of \lstinline{1 + 2}, or whether it
														
 
															-should proceed by shifting the next \lstinline{+} from the input.
														
 
															+will arise, for example, in trying to parse the input
														
 
															+\lstinline{print 1 + 2 + 3}. After having consumed \lstinline{print 1 + 2},
														
 
															+the parser will be in state 6 and will not know whether to
														
 
															+reduce to form an \code{exp} of \lstinline{1 + 2} or 
														
 
															+to proceed by shifting the next \lstinline{+} from the input.
														
 
															 A similar kind of problem, known as a \emph{reduce/reduce} conflict,
														
 
															 arises when there are two reduce actions in a state for the same
														
@@ -4973,32 +4973,32 @@ generated from the grammar, which we discuss next.
 
															 The parse table is generated one state at a time. State 0 represents
														
 
															 the start of the parser. We add the grammar rule for the start symbol
														
 
															 to this state with a period at the beginning of the right-hand side,
														
 
															-similar to the initialization phase of the Earley parser.  If the
														
 
															+similarly to the initialization phase of the Earley parser.  If the
														
 
															 period appears immediately before another nonterminal, we add all the
														
 
															 rules with that nonterminal on the left-hand side. Again, we place a
														
 
															-period at the beginning of the right-hand side of each the new
														
 
															-rules. This process, called \emph{state closure}, is continued
														
 
															-until there are no more rules to add (similar to the prediction
														
 
															+period at the beginning of the right-hand side of each new
														
 
															+rule. This process, called \emph{state closure}, is continued
														
 
															+until there are no more rules to add (similarly to the prediction
														
 
															 actions of an Earley parser). We then examine each dotted rule in the
														
 
															-current state $I$. Suppose a dotted rule has the form $A ::=
														
 
															-s_1.\,X s_2$, where $A$ and $X$ are symbols and $s_1$ and $s_2$
														
 
															-are sequences of symbols. We create a new state, call it $J$.  If $X$
														
 
															-is a terminal, we create a shift edge from $I$ to $J$ (analogous to
														
 
															+current state $I$. Suppose that a dotted rule has the form $A ::=
														
 
															+s_1.\,X \,s_2$, where $A$ and $X$ are symbols and $s_1$ and $s_2$
														
 
															+are sequences of symbols. We create a new state and call it $J$.  If $X$
														
 
															+is a terminal, we create a shift edge from $I$ to $J$ (analogously to
														
 
															 scanning in Earley), whereas if $X$ is a nonterminal, we create a
														
 
															 goto edge from $I$ to $J$.  We then need to add some dotted rules to
														
 
															 state $J$. We start by adding all dotted rules from state $I$ that
														
 
															-have the form $B ::= s_1.\,Xs_2$ (where $B$ is any nonterminal and
														
 
															-$s_1$ and $s_2$ are arbitrary sequences of symbols), but with
														
 
															+have the form $B ::= s_1.\,X\,s_2$ (where $B$ is any nonterminal and
														
 
															+$s_1$ and $s_2$ are arbitrary sequences of symbols), with
														
 
															 the period moved past the $X$.  (This is analogous to completion in
														
 
															 the Earley algorithm.)  We then perform state closure on $J$.  This
														
 
															 process repeats until there are no more states or edges to add.
														
 
															 We then mark states as accepting states if they have a dotted rule
														
 
															 that is the start rule with a period at the end.  Also, to add
														
 
															-in the reduce actions, we look for any state containing a dotted rule
														
 
															+the reduce actions, we look for any state containing a dotted rule
														
 
															 with a period at the end. Let $n$ be the rule number for this dotted
														
 
															 rule. We then put a reduce $n$ action into that state for every token
														
 
															-$Y$. For example, in Figure~\ref{fig:shift-reduce} state 4 has an
														
 
															+$Y$. For example, in figure~\ref{fig:shift-reduce} state 4 has a
														
 
															 dotted rule with a period at the end. We therefore put a reduce by
														
 
															 rule 3 action into state 4 for every
														
 
															 token.
														
@@ -5011,9 +5011,10 @@ the parse table.
 
															 \begin{exercise}
														
 
															   \normalfont\normalsize
														
 
															 %
														
 
															-On a piece of paper, walk through the parse table generation process
														
 
															-for the grammar at the top of figure~\ref{fig:shift-reduce} and check
														
 
															-your results against parse table in figure~\ref{fig:shift-reduce}.
														
 
															+Working on paper, walk through the parse table generation process for
														
 
															+the grammar at the top of figure~\ref{fig:shift-reduce}, and check
														
 
															+your results against the parse table shown in
														
 
															+figure~\ref{fig:shift-reduce}.
														
 
															 \end{exercise}
														
@@ -5021,7 +5022,7 @@ your results against parse table in figure~\ref{fig:shift-reduce}.
 
															   \normalfont\normalsize
														
 
															 %
														
 
															   Change the parser in your compiler for \LangVar{} to set the
														
 
															-  \code{parser} option of Lark to \code{'lalr'}. Test your compiler on
														
 
															+  \code{parser} option of Lark to \lstinline{'lalr'}. Test your compiler on
														
 
															   all the \LangVar{} programs that you have created. In doing so, Lark
														
 
															   may signal an error due to shift/reduce or reduce/reduce conflicts
														
 
															   in your grammar. If so, change your Lark grammar for \LangVar{} to
														
@@ -5034,16 +5035,16 @@ your results against parse table in figure~\ref{fig:shift-reduce}.
 
															 In this chapter we have just scratched the surface of the field of
														
 
															 parsing, with the study of a very general but less efficient algorithm
														
 
															 (Earley) and with a more limited but highly efficient algorithm
														
 
															-(LALR). There are many more algorithms, and classes of grammars, that
														
 
															-fall between these two ends of the spectrum. We recommend the reader
														
 
															-to \citet{Aho:2006wb} for a thorough treatment of parsing.
														
 
															-
														
 
															-Regarding lexical analysis, we described the specification language,
														
 
															-the regular expressions, but not the algorithms for recognizing them.
														
 
															-In short, regular expressions can be translated to nondeterministic
														
 
															-finite automata, which in turn are translated to finite automata.  We
														
 
															-refer the reader again to \citet{Aho:2006wb} for all the details on
														
 
															-lexical analysis.
														
 
															+(LALR). There are many more algorithms and classes of grammars that
														
 
															+fall between these two ends of the spectrum. We recommend to the reader
														
 
															+\citet{Aho:2006wb} for a thorough treatment of parsing.
														
 
															+
														
 
															+Regarding lexical analysis, we have described the specification
														
 
															+language, which are the regular expressions, but not the algorithms
														
 
															+for recognizing them. In short, regular expressions can be translated
														
 
															+to nondeterministic finite automata, which in turn are translated to
														
 
															+finite automata.  We refer the reader again to \citet{Aho:2006wb} for
														
 
															+all the details on lexical analysis.
														
 
															 \fi}