3 years ago · 0ef137071a
--- a/book.tex
+++ b/book.tex
@@ -4858,14 +4858,14 @@ rules. This process called \emph{state closure} is continued
 
															 until there are no more rules to add (similar to the prediction
														
 
															 actions of an Earley parser). We then examine each dotted rule in the
														
 
															 current state $I$. Suppose a dotted rule has the form $A ::=
														
 
															-\alpha.X\beta$, where $A$ and $X$ are symbols and $\alpha$ and $\beta$
														
 
															+s_1.\,X s_2$, where $A$ and $X$ are symbols and $s_1$ and $s_2$
														
 
															 are sequences of symbols. We create a new state, call it $J$.  If $X$
														
 
															 is a terminal, we create a shift edge from $I$ to $J$ (analogous to
														
 
															 scanning in Earley), whereas if $X$ is a nonterminal, we create a
														
 
															 goto edge from $I$ to $J$.  We then need to add some dotted rules to
														
 
															 state $J$. We start by adding all dotted rules from state $I$ that
														
 
															-have the form $B ::= \gamma.X\kappa$ (where $B$ is any nonterminal and
														
 
															-$\gamma$ and $\kappa$ are arbitrary sequences of symbols), but with
														
 
															+have the form $B ::= s_1.\,Xs_2$ (where $B$ is any nonterminal and
														
 
															+$s_1$ and $s_2$ are arbitrary sequences of symbols), but with
														
 
															 the period moved past the $X$.  (This is analogous to completion in
														
 
															 the Earley algorithm.)  We then perform state closure on $J$.  This
														
 
															 process repeats until there are no more states or edges to add.
														
@@ -4878,40 +4878,55 @@ rule. We then put a reduce $n$ action into that state for every token
 
															 $Y$. For example, in Figure~\ref{fig:shift-reduce} state 4 has an
														
 
															 dotted rule with a period at the end. We therefore put a reduce by
														
 
															 rule 3 action into state 4 for every
														
 
															-token. (Figure~\ref{fig:shift-reduce} does not show a reduce rule for
														
 
															-\code{INT} in state 4 because this grammar does not allow two
														
 
															-consecutive \code{INT} tokens in the input. We will not go into how
														
 
															-this can be figured out, but in any event it does no harm to have a
														
 
															-reduce rule for \code{INT} in state 4; it just means the input will be
														
 
															-rejected at a later point in the parsing process.)
														
 
															+token.
														
 
															+%% (Figure~\ref{fig:shift-reduce} does not show a reduce rule for
														
 
															+%% \code{INT} in state 4 because this grammar does not allow two
														
 
															+%% consecutive \code{INT} tokens in the input. We will not go into how
														
 
															+%% this can be figured out, but in any event it does no harm to have a
														
 
															+%% reduce rule for \code{INT} in state 4; it just means the input will be
														
 
															+%% rejected at a later point in the parsing process.)
														
 
															+
														
 
															+When inserting reduce actions, take care to spot any shift/reduce or
														
 
															+reduce/reduce conflicts. If there are any, abort the construction of
														
 
															+the parse table.
														
 
															+
														
 
															 \begin{exercise}
														
 
															-On a piece of paper, walk through the parse table generation 
														
 
															-process for the grammar in Figure~\ref{fig:parser1} and check
														
 
															-your results against Figure~\ref{fig:shift-reduce}. 
														
 
															+  \normalfont\normalsize
														
 
															+%
														
 
															+On a piece of paper, walk through the parse table generation process
														
 
															+for the grammar at the top of figure~\ref{fig:shift-reduce} and check
														
 
															+your results against parse table in figure~\ref{fig:shift-reduce}.
														
 
															 \end{exercise}
														
 
															 \begin{exercise}
														
 
															+  \normalfont\normalsize
														
 
															+%
														
 
															   Change the parser in your compiler for \LangVar{} to set the
														
 
															   \code{parser} option of Lark to \code{'lalr'}. Test your compiler on
														
 
															   all the \LangVar{} programs that you have created. In doing so, Lark
														
 
															   may signal an error due to shift/reduce or reduce/reduce conflicts
														
 
															   in your grammar. If so, change your Lark grammar for \LangVar{} to
														
 
															   remove those conflicts.
														
 
															-
														
 
															 \end{exercise}
														
 
															 \section{Further Reading}
														
 
															-UNDER CONSTRUCTION
														
 
															-
														
 
															-finite automata
														
 
															-
														
 
															-
														
 
															-
														
 
															-
														
 
															+In this chapter we have just scratched the surface of the field of
														
 
															+parsing, with the study of a very general put less efficient algorithm
														
 
															+(Earley) and with a more limited but highly efficient algorithm
														
 
															+(LALR). There are many more algorithms, and classes of grammars, that
														
 
															+fall between these two. We recommend the reader to \citet{Aho:2006wb}
														
 
															+for a thorough treatment of parsing.
														
 
															+
														
 
															+Regarding lexical analysis, we described the specification language,
														
 
															+the regular expressions, but not the algorithms for recognizing them.
														
 
															+In short, regular expressions can be translated to nondeterministic
														
 
															+finite automata, which in turn are translated to finite automata.  We
														
 
															+refer the reader again to \citet{Aho:2006wb} for all the details of
														
 
															+lexical analysis.
														
 
															 \fi}