2.5.3 Planning as Satisfiability

Another interesting approach is to convert the planning problem into an enormous Boolean satisfiability problem. This means that the planning problem of Formulation 2.4 can be solved by determining whether some assignment of truth values to the variables of a Boolean expression makes it evaluate to TRUE. Generic methods for determining satisfiability can be directly applied to the Boolean expression that encodes the planning problem. The Davis-Putnam procedure is one of the most widely known algorithms for satisfiability. It performs a depth-first search by iteratively trying assignments for variables and backtracking when assignments fail. During the search, large parts of the expression can be eliminated due to the current assignments. The algorithm is complete (it finds a satisfying assignment whenever one exists) and reasonably efficient. Its use in solving planning problems is surveyed in [382]. In practice, stochastic local search methods provide a reasonable alternative to the Davis-Putnam procedure [459].
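The backtracking search described above can be sketched in a few lines. This is only a minimal illustration in the style often called DPLL, not the procedure as presented in the cited references. It assumes the expression is already in conjunctive normal form, with each clause given as a list of nonzero integers in which `-v` means the negation of variable `v`. The names `simplify` and `dpll` are hypothetical.

```python
from typing import Dict, List, Optional

Clause = List[int]            # disjunction of literals; -v means NOT v
Assignment = Dict[int, bool]


def simplify(clauses: List[Clause], var: int, value: bool) -> Optional[List[Clause]]:
    """Apply var=value: drop satisfied clauses and shrink the others.

    Returns None if some clause loses all of its literals (a conflict).
    """
    true_lit = var if value else -var
    result = []
    for clause in clauses:
        if true_lit in clause:
            continue                          # clause is already satisfied
        reduced = [lit for lit in clause if lit != -true_lit]
        if not reduced:
            return None                       # every literal in it is false
        result.append(reduced)
    return result


def dpll(clauses: List[Clause], assignment: Optional[Assignment] = None) -> Optional[Assignment]:
    """Return a satisfying (possibly partial) assignment, or None if none exists."""
    assignment = dict(assignment or {})
    if not clauses:
        return assignment                     # nothing left to satisfy
    # A unit clause forces its literal; otherwise branch on the first literal seen.
    unit = next((c for c in clauses if len(c) == 1), None)
    lit = unit[0] if unit else clauses[0][0]
    var = abs(lit)
    choices = [lit > 0] if unit else [lit > 0, lit < 0]
    for value in choices:
        reduced = simplify(clauses, var, value)
        if reduced is not None:
            found = dpll(reduced, {**assignment, var: value})
            if found is not None:
                return found
    return None                               # all choices failed: backtrack
```

The call to `simplify` is where large parts of the expression are eliminated: every clause satisfied by the current assignment disappears from the subproblem handed to the recursive call.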

Suppose a planning problem has been given in terms of Formulation 2.4. All literals and operators will be tagged with a stage index. For example, a literal that appears in two different stages will be considered distinct. This kind of tagging is similar to situation calculus [378]; however, in that case, variables are allowed for the tags. To obtain a finite Boolean expression, the total number of stages must be declared in advance. As usual, the first stage is $k = 1$, and the final stage is one greater than the number of stages at which operators can be applied. Setting a stage limit is a significant drawback of the approach because the required number of stages is usually not known before the problem is solved. A planning algorithm can assume a small value for the limit and then gradually increase it each time the resulting Boolean expression turns out to be unsatisfiable. If the problem is not solvable, however, this approach iterates forever.
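The strategy of gradually raising the stage limit can be sketched as a simple loop. The callables `encode` and `solve` are placeholders for whatever encoder and satisfiability routine are used, and the cap `max_final_stage` is an addition of this sketch that is not part of the method described above. It is there only so that the loop terminates on unsolvable problems.

```python
from typing import Callable, Optional, Tuple, TypeVar

Formula = TypeVar("Formula")
Model = TypeVar("Model")


def plan_with_growing_limit(
    encode: Callable[[int], Formula],
    solve: Callable[[Formula], Optional[Model]],
    max_final_stage: int,
) -> Optional[Tuple[int, Model]]:
    """Try final stages 1, 2, ... until the encoding becomes satisfiable.

    encode(final_stage) builds the Boolean expression for that stage limit;
    solve(formula) returns a satisfying assignment or None. Both are
    hypothetical interfaces supplied by the caller.
    """
    for final_stage in range(1, max_final_stage + 1):
        model = solve(encode(final_stage))
        if model is not None:
            return final_stage, model
    # Inconclusive: no plan within the cap, but longer plans are not ruled out.
    return None
```

A return value of `None` says only that no plan was found within the cap. It does not show that the problem is unsolvable.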

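Let $\vee$ denote logical OR, and let $\wedge$ denote logical AND. The Boolean expression is written as a conjunction of many terms, which arise from five different sources: the initial state, the goal state, the operator encodings, the frame axioms, and the complete exclusion axiom.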

Example 2.9 (The Flashlight Problem as a Boolean Expression) A Boolean expression will be constructed for Example 2.6. Each of the expressions given below is joined into one large expression by connecting them with $\wedge$'s.

The expression for the initial state is

$\displaystyle O(C,F,1) \wedge \neg I(B1,F,1) \wedge \neg I(B2,F,1) ,$

(2.36)

which uses the abbreviated names, and the stage tag has been added as an argument to the predicates. The expression for the goal state is

$\displaystyle O(C,F,5) \wedge I(B1,F,5) \wedge I(B2,F,5) ,$

(2.37)

which indicates that the goal must be achieved at stage $k = 5$. This value was determined because we already know the solution plan from (2.24). The method will also work correctly for a larger value of $k$. The expressions for the operators are

$\displaystyle \begin{aligned} \neg PC_k & \vee (\neg O(C,F,k) \wedge O(C,F,k+1 \ldots \\ & \ldots\; O(C,F,k) \wedge \neg I(B2,F,k) \wedge I(B2,F,k+1)) \end{aligned}$

(2.38)

for each $k$ from $1$ to $4$. The frame axioms yield the expressions

$\displaystyle \begin{aligned} (O(C,F,k) \vee \neg O(C,F,k+1)) & \vee (PC_k) \\ \ldots\; & \vee (I2_k) \\ (\neg I(B2,F,k) \vee I(B2,F,k+1)) & , \end{aligned}$

(2.39)

for each $k$ from $1$ to $4$. No operators remove batteries from the flashlight. Hence, two of the expressions list no operators.

Finally, the complete exclusion axiom yields the expressions

$\displaystyle \neg RC_k \vee \neg PC_k \qquad \neg RC_k \vee \neg I1_k \qquad \neg RC_k \vee \neg I2_k$
$\displaystyle \neg PC_k \vee \neg I1_k \qquad \neg PC_k \vee \neg I2_k \qquad \neg I1_k \vee \neg I2_k ,$

(2.40)

for each $k$ from $1$ to $4$. The full problem is encoded by combining all of the given expressions into an enormous conjunction. The expression is satisfied by assigning TRUE values to $RC_1$, $I1_2$, $I2_3$, and $PC_4$. An alternative solution is $RC_1$, $I2_2$, $I1_3$, and $PC_4$. The stage index tags indicate the order in which the actions are applied in the recovered plan. $\blacksquare$
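The construction in the example can be checked mechanically. The sketch below builds the five kinds of expressions as clauses and enumerates the satisfying assignments with a small backtracking search. The operator preconditions and effects in `OPS` are assumptions, since Example 2.6 is not reproduced in this section: the cap must be on to remove it and off to place it, and a battery can be inserted only when the cap is off and that battery is not yet in. The names `O`, `B1`, and `B2` abbreviate $O(C,F)$, $I(B1,F)$, and $I(B2,F)$, and all function names are hypothetical.

```python
from itertools import combinations

# Assumed operators: name -> (preconditions at stage k, effects at stage k+1).
OPS = {
    "RC": ({"O": True}, {"O": False}),                # remove cap
    "PC": ({"O": False}, {"O": True}),                # place cap
    "I1": ({"O": False, "B1": False}, {"B1": True}),  # insert battery 1
    "I2": ({"O": False, "B2": False}, {"B2": True}),  # insert battery 2
}
INITIAL = {"O": True, "B1": False, "B2": False}
GOAL = {"O": True, "B1": True, "B2": True}


def encode(final_stage):
    """Build clauses; a literal is ((name, stage), required_truth_value)."""
    clauses = [[((p, 1), v)] for p, v in INITIAL.items()]            # initial state
    clauses += [[((p, final_stage), v)] for p, v in GOAL.items()]    # goal state
    for k in range(1, final_stage):
        for op, (pre, eff) in OPS.items():
            # NOT op_k OR (pre AND eff) distributes into one clause per literal.
            for p, v in pre.items():
                clauses.append([((op, k), False), ((p, k), v)])
            for p, v in eff.items():
                clauses.append([((op, k), False), ((p, k + 1), v)])
        for p in INITIAL:
            for v in (True, False):
                # Frame axiom: p can switch to value v only if an operator
                # with that effect is applied at stage k.
                causes = [((op, k), True) for op, (_, eff) in OPS.items() if eff.get(p) == v]
                clauses.append([((p, k), v), ((p, k + 1), not v)] + causes)
        for a, b in combinations(OPS, 2):                             # complete exclusion
            clauses.append([((a, k), False), ((b, k), False)])
    return clauses


def models(clauses, assignment):
    """Depth-first search that yields every satisfying assignment found."""
    if not clauses:
        yield assignment
        return
    unit = next((c for c in clauses if len(c) == 1), None)
    var, preferred = (unit or clauses[0])[0]
    for value in ([preferred] if unit else [True, False]):
        reduced = []
        for clause in clauses:
            if (var, value) in clause:
                continue                      # clause satisfied, drop it
            rest = [lit for lit in clause if lit[0] != var]
            if not rest:
                break                         # conflict, abandon this value
            reduced.append(rest)
        else:
            yield from models(reduced, {**assignment, var: value})


def plans(final_stage):
    """Collect the operator variables set to TRUE, ordered by stage index."""
    found = set()
    for model in models(encode(final_stage), {}):
        steps = sorted((k, name) for (name, k), val in model.items() if val and name in OPS)
        found.add(tuple(f"{name}_{k}" for k, name in steps))
    return sorted(found)


print(plans(5))
```

Under these assumed operator definitions, `plans(5)` returns the two four-step plans given in the example, and `plans(4)` returns an empty list because three operator stages are too few.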