In order to consider symbolic computing applications on the domain of the predicate calculus (first-order logic), we need a grammar for the textual representation and a compatible Hasekll representation.
Consider the following grammar rules defining a textual representation for formulas of the predicate calculus. Overall, we have eight types of formula.
Predicate expressions such as \(p(X, f(Y, Z))\) consist of a named predicate applied to terms involving functions and variables.
In this syntax, the number of terms in a or may be zero, in which case the parentheses are omitted. A predicate expression having zero terms is essentially a proposition. A function expression having zero terms is essentially a constant.
One convention for distinguishing predicate and function names from variables is to use symbols beginning with a capital letter for variables, and symbols beginning with a lower case letter for predicate or function names. (Prolog implementations often follow this convention.)
predicate-name = [a-z][A-Za-z_0-9]+ function-name = [a-z][a-z_0-9]+ variable = [A-Z][A-Za-z_0-9]+
For simplicity of parsing, the structure of the compound formulas can use the operator keywords with a prefix-parenthesized notation as follows.
::= "not" "(" ")" ::= "and" "(" [ <"," >] ")" ::= "or" "(" [ <"," >] ")" ::= "implies" "(" ")" ::= "equiv" "(" ")" ::= "forall "(" ")" ::= "exists "(" ")"
How does this make parsing simpler?
Given the textual representation of of formulas of the predicate calculus as defined by the grammar above, it is straightforward to define suitable Haskell data representations.
data Formula = Predicate Name [Term] | Negation Formula | Conj [Formula] | Disj [Formula] | Implies Formula Formula | Equiv Formula Formula | ForAll Variable Formula | Exists Variable Formula data Term = Function Name [Term] type Name = [Char] type Variable = [Char]
Conjunctive Normal Form is a canonical representation of formulas that is very simple to work with in theorem proving. It is a very flat structure consisting of logical connectives in 4 layers.
The CNF representation can then be converted to a clause-form equivalent representation reflected by the following Haskell data types.
type ClauseFormEquivalent = [Clause] type Clause = [Literal] data Literal = Predicate Name [Term] | NegatedPred Name [Term]
The process of conversion to canonical form involves the following steps.
The first step in conversion to CNF is to apply the following reduction rules to eliminate implications and equivalences.
Let us write a Haskell function to do this. A standard inside-out approach does the job.
eliminate1 :: Formula -> Formula eliminate1 f@(Predicate n terms) = f eliminate1 (Negation f) = (Negation (eliminate1 f)) eliminate1 (Conj formulas) = Conj (map eliminate1 formulas) eliminate1 (Disj formulas) = Disj (map eliminate1 formulas) eliminate1 (ForAll v f) = ForAll v (eliminate1 f) eliminate1 (Exists v f) = Exists v (eliminate1 f) eliminate1 (Implies f1 f2) = Disj [Negation (eliminate1 f1), eliminate1 f2] eliminate1 (Equiv f1 f2) = let (e1, e2) = (eliminate1 f1, eliminate1 f2) in Conj [Disj [Negation e1, e2], Disj [Negation e2, e1]]
In this step, the goal is to transform a formula so that any negations remaining in the formula are innermost, i.e., they apply to predicate expressions.
The following reduction rules apply.
One way to do this would be to use a standard inside-out approach, with a simplify and a make_negation routine. But if we have simplified subexpressions and then see an outer negation, our various negation rules will create more negations that have to be propagated inwards.
A simpler, more-efficient approach is perform the task outside-in.
propagate :: Formula -> Formula negatef :: Formula -> Formula propagate f@(Predicate n terms) = f propagate (Negation f) = negate f propagate (Conj formulas) = Conj (map propagate formulas) propagate (Disj formulas) = Disj (map propagate formulas) propagate (ForAll v f) = ForAll v (propagate f) propagate (Exists v f) = Exists v (propagate f) negatef f@(Predicate n terms) = (Negation f) negatef (Negation f) = propagate f negatef (Conj formulas) = Disj (map negatef formulas) negatef (Disj formulas) = Conj (map negatef formulas) negatef (ForAll v f) = Exists v (negatef f) negatef (Exists v f) = ForAll v (negatef f)
The next step prepares for the elimination of quantifiers by renaming variables to be globally unique.
The next step in conversion to CNF is to eliminate existential quantifiers by introducing Skolem functions.
Consider the variable \(Z\) in: \(\forall X \forall Y \exists Z p(X, Y, Z)))\)
If this is to be true, there must exist some \(Z\) (possibly dependent on \(X\) and \(Y\)) that makes it true.
We introduce a Skolem function \(g(X, Y)\) to represent a suitable value of \(Z\).
The formula can then be translated to \(\forall X \forall Y p(X, Y, g(X, Y)))\).
Of course, each existentially quantified variable requires a separate Skolem function.
Next we can propagate universal quantifiers outwards, so that the formula is in prenex form, i.e., all the quantifiers are at the outermost levels, and apply to a single quantifier-free subformula called the matrix.
We then discard the outer quantifiers and use the matrix to represent the overall formula, with all variables implicitly universally quantified.
The final steps produce conjunctive normal form by flattening propagating conjunctions outwards and flattening conjunctions and disjunctions.
The CNF representation is converted to a clause-form equivalent by eliminating explicit conjunction and disjunction operators.
type ClauseFormEquivalent = [Clause] type Clause = [Literal] data Literal = Predicate Name [Term] | NegatedPred Name [Term]