Solving a query

Recall that a goal or query Q is a possibly empty conjunction of $\cal {R}$ ( $\cal {L}$ )-atoms and a $\cal {L}$ -constraint, written as:

$\begin{displaymath} \prologq \mbox{\it p}_{1} \dots \mbox{\it p}_{n}, \phi. \end{displaymath}$

The purpose of a proof procedure is to show whether, for a given set of definite clauses $\cal {S}$ , there is a satisfiable constraint $\psi$ , such that $\psi$ $\rightarrow$ Q is valid. The proof procedure is a refutation procedure, which shows that the denial of some goal is inconsistent with the assumptions $\cal {S}$ . The building blocks of proof procedures are inference rules. I will assume a top-down inference strategy. The initial goal is replaced by other goals via inference rules. If the empty goal is obtained, then a refutation has been discovered (where an empty goal is a goal without any atoms). The constraint associated with the empty goal is the desired answer.

In later chapters I will define proof procedures in which the inference rules are applied bottom-up. In such a proof procedure new assertions are derived from the assumptions $\cal {S}$ until an assertion is derived that explicitly contradicts the denial of the goal.

The top-down refutation procedure is best introduced using an example. Consider the set of clauses in figure 2.5 which define the state of a hypothetical library.

**Figure 2.5:** A definite clause specification of the state of an hypothetical library.
$\begin{figure} \prn\pred \head{\mbox{\it to\_lend}(\mbox{\rm B}) \mbox{\tt :-}} ... ... :-}} \body{ \mbox{\rm B}~\mbox{\it status} \doteq in.} \epred\eprn \end{figure}$

$\query{ to\_lend(\mbox{\rm Book}), } \qitem{ \mbox{\rm Book}~\mbox{\it author} \doteq \mbox{\rm greene}.} \equery$
which should be read as `for which Book, Book's author is greene and Book can be lent'. However, in the refutation procedure such a goal will be denied, i.e. read as `for all Book it is not the case that Book's author is greene and Book can be lent'. The inference rule in a top-down refutation procedure is a generalization of `modus tollens'. From the fact that `owning' plus `availability' implies `lendability', and from the goal ` Book cannot be lent' we can infer that it can neither be the case that Book is both owned and available, i.e. our original goal is replaced by:

$\query{\mbox{\it own}(\mbox{\rm Book}),} \qitem{\mbox{\it available}(\mbox{\rm B... ...,} \qitem{ \mbox{\rm Book}~\mbox{\it author} \doteq \mbox{\rm greene}.} \equery$
Next we can argue as follows. If it is the case that Book is not both owned and available, and we know that a book with the title `the captain and the enemy', and status `in' is owned, then we infer that such a book is not available:

$\query{available(\mbox{\rm Book}),} \qitem{ \mbox{\rm Book}~\mbox{\it author} \d... ...my'},} \qitem{ \mbox{\rm Book}~\mbox{\it status} \doteq \mbox{\rm in}.} \equery$
Finally, from this goal (reading: a book with title `the captain and the enemy', author `greene' and status `in' is not available) we can infer the empty clause because it is stated in the database that all things with status `in' in fact are available:

$\query{ \mbox{\rm Book}~\mbox{\it author} = \mbox{\rm greene},} \qitem{ \mbox{\r... ...my'},} \qitem{ \mbox{\rm Book}~\mbox{\it status} \doteq \mbox{\rm in}.} \equery$
Hence we obtain a contradiction, thus the negation of the original goal turns out not to be true. Therefore the original goal is shown to be true. Furthermore we obtain a constraint on Book that can be viewed as a counter example to the negated goal and hence constitutes an answer to the goal.

The proof procedure sketched here will now be defined as follows. Note that I use the standard terminology as for example presented in [50]. First I define the inference rule more precisely. Then I will define the computation rule which tells us on which atom the inference rule must be applied. Finally I define the search rule which tells us against which clause the inference rule has to be applied.

The inference rule I will define is called goal-reduction. The inference rule will select an atom from the goal and will replace this atom with the body of a clause defining this atom. Furthermore, the constraint associated with the new goal will be the conjunction of the constraints $\phi$ and $\psi$ , where $\phi$ is the constraint associated with the old goal and $\psi$ is the constraint associated with the body of the selected clause. Summarizing, for a goal p₁...p_n, $\phi$ goal reduction will replace one of the atoms p_i in the goal by the atoms in the body of a clause defining p_i as follows:

$\begin{displaymath} \mbox{\it p}_{1} \dots \mbox{\it p}_{{i-1}} \mbox{\it p}_{i}... ...q}_{n} , \mbox{\it p}_{i+1} \dots \mbox{\it p}_{n}, \phi, \psi \end{displaymath}$
where

$\begin{displaymath}\mbox{\it p}_{i}(\mbox{\rm X}_{1} \dots \mbox{\rm X}_{n}) \mbox{\tt :-} \mbox{\it q}_{1} \dots \mbox{\it q}_{n}, \psi \end{displaymath}$
is a variant of a clause in $\cal {S}$ , such that the variables in the clause do not occur in the original goal, except for the variables explicitly mentioned. As usual, a variant of a clause is obtained by consistently renaming its variables. For example, given the clauses in the library example above, consider the goal:

$\query{ \mbox{\it own}(\mbox{\rm X}_{3}),} \qitem{ \mbox{\it available}(\mbox{\r... ...\qitem{ \mbox{\rm X}_{3}~\mbox{\it author} \doteq \mbox{\rm melville}.} \equery$
Suppose we want to try to apply the inference rule on the atom own(X₃) against the third clause defining the predicate `own/1'. In order for own(X₃) and own(B) to match, we first rename the variables in the clause, obtaining the variant:

$\prn\pred \head{\mbox{\it own}(\mbox{\rm X}_{3}) \mbox{\tt :-}} \body{ \mbox{\rm... ...,} \body{ \mbox{\rm X}_{3}~\mbox{\it status} \doteq \mbox{\rm in}.} \epred\eprn$
and next the body of this clause replaces the selected atom in the goal, obtaining:

$\query{ \mbox{\it available}(\mbox{\rm X}_{3}),} \qitem{ \mbox{\rm X}_{3}~\mbox{... ...k'},} \qitem{ \mbox{\rm X}_{3}~\mbox{\it status} \doteq \mbox{\rm in}.} \equery$

The inference rule is applied only in case the resulting constraint $\phi$ , $\psi$ is satisfiable, because clearly if this constraint is not satisfiable then we will not be able to produce a correct answer; a constraint remains unsatisfiable whatever information is added to it (i.e. whatever constraints we conjoin it with). Therefore each time the inference rule is applied, it is tested whether the resulting constraint is still satisfiable. This satisfiability test will use the procedure described in the previous section. Therefore, the result of an inference rule will either be a goal in which the $\cal {L}$ -constraint is solved, or either be `failure' if the current part of the search space does not contain any solutions. The operation to test whether the conjunction of two constraints is satisfiable relates to the `unification' operation in Prolog and PATR II, or to `constraint-solving' in other formalisms.

Note that it is not strictly necessary to compute after each reduction step whether the resulting constraint is satisfiable (as long as proposed answers are checked to be satisfiable). This observation gives rise to inference procedures where satisfiability checks are `delayed' or performed less frequently. This may be useful if the check costs a lot of overhead in comparison to the size of the search space that could be avoided. For an application of this idea in a linguistic setting, consider the work on Constraint Logic Grammars [14].

The computation rule of the proof procedure determines which atom in a given goal is to be reduced. As in Prolog, I assume that the leftmost atom of a goal is always reduced; i.e. the computation rule selects the leftmost atom in a goal. In later chapters I will argue that for parsing and generation a computation rule which selects the head may be much more suitable. ^2.3

Finally, the search rule of a proof procedure determines in which order the search space is traversed. For a given selected atom several clauses may be available to reduce the atom with. Each of the possible reductions may lead to the desired result; hence this defines a search space which has the shape of a tree. For our toy example above, this tree looks as in figure 2.6, assuming the leftmost selection rule.

**Figure 2.6:** The search tree for the query whether it is possible to lend a book by Greene. A local tree corresponds to all possible ways in which the leftmost atom of the mother node can be reduced.
$\begin{figure} \begin{center} \leavevmode \unitlength1pt \beginpicture \setplot... ...hbox{\rlrl \rrr }} [Bl] at 239.81 73.00 \endpicture \end{center}\par\end{figure}$

**Figure 2.7:** The search tree of the query `which book by Greene can be lent'. In this case, each local tree represents all possible ways in which the rightmost atom of the mother node can be reduced. As the tree does not branch, the search proceeds deterministically.
$\begin{figure} \begin{center} \leavevmode \unitlength1pt \beginpicture \setplot... ...nd{picture}}\tbf }} [Bl] at 63.53 18.00 \endpicture \end{center}\par\end{figure}$

I assume, as in Prolog, that search trees are traversed in a depth-first left-to-right order. Backtracking occurs if branches are encountered which contain no refutation. Note that the search-rule is quite independent of both the direction of the proof procedure (bottom-up, top-down) and the computation rule. Apart from the usual depth-first strategy I adopt here it is often possible to augment proof procedures with well-formed and ill-formed subgoal tables. In section 4.6.5 I will come back to this subject.

At this point it may be useful to introduce another type of tree, which we may call `parse tree'. In such a tree a node represents an atom in a goal; the daughters of a node are the atoms with which this atom is reduced. A leaf in the parse tree thus represents a goal of which the reduction yielded the empty goal. ^2.4 A search tree thus represents all possible refutations; a parse-tree represents only one refutation. In examples of parse trees I usually replace the variables in the atom with the constraints imposed on this variable (in matrix representation); also I leave out the predicate symbol in the case of the predicate symbol sign/1. For example, for the goal ``which books by `greene' can be lent?'' we have the following parse tree, leaving out the constraints on the variable Book.

Example 11 (context free grammars) As an example of a correct refutation according to the refutation procedure defined above, consider the encoding of a simple context free grammar in figure 2.8 as a grammar of $\cal {R}$ ( $\cal {L}$ ).

**Figure 2.8:** This grammar of $\cal {R}$ ( $\cal {L}$ ) encodes a simple context-free grammar, by a difference list implementation of concatenation.
$\begin{figure} \prn\pred \head{\mbox{\it sign}(\avm{ \mbox{\it cat}: \mbox{\rm s... ...\rm wrote}\vert\mbox{\rm T}\rangle - \mbox{\rm T} }).} \epred\eprn \end{figure}$

The following goal is solved as in figure 2.9; the corresponding parse tree is presented in figure 2.10. Note that in the trace of the refutation I use the up-arrow to indicate that the constraints on the variable are identical to the constraints on that variable in the preceding goal.

$\begin{displaymath} \prologq \mbox{\it sign}(\avm[{\mbox{\rm X}_{0}}]{ \mbox{\it... ... \mbox{\rm 21}, \mbox{\rm stories}\rangle - \langle \rangle}). \end{displaymath}$

**Figure 2.9:** A trace of the refutation of the claim that ``greene wrote 21 stories'' is *not* a string in the grammar. That is, a trace of the proof that this string in fact is recognized by the grammar.
$\begin{figure} \eq & \prologq & \mbox{\it sign}(\mbox{\rm X}_{0}), \\ && \avm[{... ...Uparrow, \mbox{\rm X}_{3} \Uparrow, \mbox{\rm X}_{4} \Uparrow. \eeq \end{figure}$

**Figure 2.10:** Parse tree of the string ``greene wrote 21 stories''
$\begin{figure} \begin{center} \leavevmode \unitlength1pt \beginpicture \setplot... ...t{\hbox{\vierbc }} [Bl] at 256.63 18.00 \endpicture \end{center}\par\end{figure}$