Chapter 7
Herbrand Method
In Propositional Logic, it is possible to determine satisfiability and logical entailment by looking at the set of all interpretations for the logical constants of that language, i.e. its truth table. Although the truth table for a propositional language can be large, it is always finite; and so there is a simple procedure for checking whether a sentence is satisfiable or valid or whether a sentence logically entails another.
In Relational Logic, we also have the notion of an interpretation; and this raises the question of whether there is an approach to checking satisfiability and logical entailment for Relational Logic that is analogous to the truth table method for Propositional Logic.
Consider a set of sentences in Relational Logic with two object constants a and b and one unary relation constant r. The table below shows all of the interpretations using a universe of discourse consisting of just the two numbers 1 and 2. In this case, the set of all possible interpretations is finite.
| ∀ | a | b | p |
| {1, 2} | 1 | 1 | {} |
| {1, 2} | 1 | 1 | {1} |
| {1, 2} | 1 | 1 | {2} |
| {1, 2} | 1 | 1 | {1,2} |
| {1, 2} | 1 | 2 | {} |
| {1, 2} | 1 | 2 | {1} |
| {1, 2} | 1 | 2 | {2} |
| {1, 2} | 1 | 2 | {1,2} |
| {1, 2} | 2 | 1 | {} |
| {1, 2} | 2 | 1 | {1} |
| {1, 2} | 2 | 1 | {2} |
| {1, 2} | 2 | 1 | {1,2} |
| {1, 2} | 2 | 2 | {} |
| {1, 2} | 2 | 2 | {1} |
| {1, 2} | 2 | 2 | {2} |
| {1, 2} | 2 | 2 | {1,2} |
Unfortunately, using just this table to check for satisfiability and logical entailment is not adequate, since it covers just one possible universe of discourse. There are many other possibilities. The problem with Relational Logic is that, in general, the set of possible interpretations is necessarily infinite. Moreover, there is not even a systematic way of enumerating all of the possibilities.
All is not lost. At least in some cases. The trick is to use a set of special interpretations, called Herbrand interpretations, to check satisfiability and logical entailment. It can be shown that in some cases checking these models alone is sufficient. Furthermore, in some cases, there are only finitely many Herbrand models; and so the process of checking for these conditions is algorithmic.
Sadly, this trick does not always work. Sometimes, Herbrand interpretations alone do not suffice. And, sometimes, even though Herbrand interpretations suffice, they are infinitely large; and so checking satisfaction takes forever.
In order to understand when the trick works and when it does not, we look at the method on increasingly complex subsets of Relational Logic. First, we examine Basic Logic, i.e. Relational Logic without variables, quantifiers, or function constants. We then extend this to Universal Logic by allowing variables. After looking at Universal Logic, we switch to Existential Logic (Universal Logic with quantifiers) but still avoid functions. Then, we look at Functional Logic but avoid quantifiers.
Basic logic is Relational Logic without variables, quantifiers, or function constants. All ground sentences are included and all logical sentences that can be formed from ground sentences.
The Herbrand universe for a set of sentences in Basic Logic is the set of all object constants used in the sentences. For example, for sentences with just two object constants a and b, the Herbrand universe is the set {a,b}.
A Herbrand interpretation for Basic Logic is an interpretation in which (1) the universe of discourse is the Herbrand universe for the language and (2) each object constant maps into itself.
For example, the interpretation hshown below is a Herbrand interpretation for a set of sentences with objects constants a and b and unary relation constant p.
| ∀h | = | {a, b} |
| ah | = | a |
| bh | = | b |
| ph | = | {a, b} |
Note that the object constants are interpreted as themselves, as required by the definition of Herbrand interpretation. The interpretation of the relation constant p is arbitrary; any other unary relation on the Herbrand universe would be acceptable.
One interesting thing to note about Herbrand interpretations for basic logic is that there are only finitely many of them. Given a finite set of constants, the universe of discourse is finite. The interpretation of the object constants is fixed; and there are only finitely many relations that can be formed from a finite universe of discourse.
The Herbrand table for a relational language is a table illustrating all possible Herbrand interpretations for the language. As with a truth table, the columns represent the interpretations of symbols and the rows represent possible interpretations. The following is the Herbrand table for our little language.
| ∀ | a | b | p |
| {a, b} | a | b | {} |
| {a, b} | a | b | {a} |
| {a, b} | a | b | {b} |
| {a, b} | a | b | {a, b} |
The interesting thing about Herbrand interpretations for basic logic is that they alone are sufficient for checking satisfiability and logical entailment, because of the Basic Herbrand Theorem.
Basic Herbrand Theorem: A set of sentences in Basic Logic has a model if and only if it has a Herbrand model.Given a model i for a set of sentences in Basic Logic, we can construct a corresponding Herbrand model h as follows. The Herbrand interpretation for each object constant is, of course, itself. The Herbrand interpretation h for each relation constant ρ is the set of all tuples of object constants &tau1, ... ,&taun such that interpretation i satisfies the sentence ρ(&tau1, ... ,&taun). From this construction, it is easy to see that h satisfies the same ground atomic sentences as i, and their agreement on logical sentences is assured by the semantics of the logical operators.
As an example of this construction, consider a set of sentences with object constants a and b and a single unary relation p. Now, consider the interpretation i shown below. There are just two elements in the universe of discourse. The constant a maps to 1; b maps to 2; and p is true of 1 but not 2.
| ∀i | = | {1, 2} |
| ai | = | 1 |
| bi | = | 2 |
| pi | = | {1} |
The corresponding Herbrand interpretation is shown below. The constants a and b map to themselves. In order to get the interpretation of p, we consider each relational sentence that can be formed from a, b, and p . For each, we check whether or not it is satisfied by i. If so, we include the corresponding constant in our interpretation. If not, we do not include it. In this case, this leads to the interpretation shown.
| ∀h | = | {a, b} |
| ah | = | a |
| bh | = | b |
| ph | = | {a} |
As a more interesting example, consider the following interpretation for the same language. In this case, both object constants refer to the same object, viz. 1; and one of the elements in the universe of discourse, viz. 2, has no corresponding object constant.
| ∀i | = | {1, 2} |
| ai | = | 1 |
| bi | = | 1 |
| pi | = | {1} |
The corresponding Herbrand interpretation is shown below. The constants a and b map to themselves. In order to get the interpretation of p, we consider each relational sentence involving p and the constants a and b. For each such sentence, we check whether it is satisfied by i. If so, we include the corresponding constants in our interpretation. If not, we do not include them. In this case, we get both a and b.
| ∀h | = | {a, b} |
| ah | = | a |
| bh | = | b |
| ph | = | {a, b} |
In the first example, the structure of the interpretation and its corresponding Herbrand interpretation are very similar. In this second example, the structure is very different. In the second interpretation, both object constants refer to the same object whereas in the corresponding Herbrand interpretation, they refer to different objects. In the Herbrand interpretation, there is an object constant for each element in the universe of discourse, whereas in the ordinary interpretation, there is an object (2) with no corresponding object constant. These differences might lead one to think that the two interpretations do not satisfy the same sentences. The Herbrand theorem assures us that, despite these differences, they satisfy exactly the same sentences, provided that we restrict ourselves to Basic Logic. The reader is encouraged to try out the interpretations on various sentences to confirm that this is indeed correct.
The importance of the Herbrand theorem is that it gives us a way of checking logical entailment in a finite amount of time. This is called the Herbrand Method. Starting with a set of premises and a possible conclusion, we loop over Herbrand interpretations, crossing out each interpretation that does not satisfy the premises. Then, we check whether each remaining interpretation satisfies the conclusion. If so, the premises logically entail the conclusion. Otherwise, logical entailment does not hold.
Suppose, for example, we were asked to determine whether the sentence (p(a) ⇒ q(b) and the sentence (p(a) ∨ q(b)) together logically entail q(b). We start by writing out the set of all Herbrand interpretations and crossing out those that do not satisfy the premises. The first table below on the left is the complete set of Herbrand interpretations for this language. The second table results from crossing out those interpretations that do not satisfy (p(a) ⇒ q(b). The third table results from crossing out the interpretations that do not satisfy (p(a) ∨ q(b)). The fourth table is the result of crossing out all rows that do not satisfy both premises.
|
|
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Now, we check that each remaining interpretation satisfies the desired conclusion. In this case, the check is successful. Consequently, we can conclude that the premises in this case logically entail the conclusion.
Although this method is simple, it is not necessarily very efficient. The Herbrand model for a set of sentences in Basic Logic can be very large. If there are n object constants, then the Herbrand universe is of size n. This, in itself, is not so bad. What hurts is the number of interpretations of the relation constants. With n constants, one can form nk k-tuples, leading to 2nk possible k-ary relations. With m k-ary relation constants, this leads to (2nk)m possible Herbrand interpretations.
To make this concrete, consider a language with ten object constants and three binary relation constants. In this case, the Herbrand universe has size 10. There are 100 2-tuples, leading to 2100 binary relations, leading to 2300 possible Herbrand interpretations. For problems of this size, the Herbrand method is impractical.
Universal Logic differs from Basic Logic in that we are permitted to use variables. Function constants are still forbidden. Explicit quantifiers are excluded as well. (Free variables have implicit universal quantification, and there are no embedded universal quantifiers and no existential quantifiers.)
As with Basic Logic, the Herbrand universe for Universal Logic is the set of all object constants. If there are no object constants, then we include a single, arbitrary object constant, say a.
As with Basic Logic, we can form the Herbrand interpretation corresponding to any given interpretation. The process is exactly the same. Every object constant is mapped into itself. For every relation constant ρ, we take the set of tuples of object constants &tau1, ... ,&taun for which ρ(&tau1, ... ,&taun) is true under the given interpretation.
The apparent bad news is that this sometimes produces a Herbrand interpretation that, for some universal sentences, does not agree with the interpretation on which it is based. Consider the interpretation shown below.
| ∀i | = | {1, 2, 3} |
| ai | = | 1 |
| bi | = | 2 |
| pi | = | {1, 2} |
The following interpretation is the corresponding Herbrand interpretation.
| ∀h | = | {a, b} |
| ah | = | a |
| bh | = | b |
| ph | = | {a, b} |
Now, consider the sentence p(x), which asserts that p holds of everything. This is clearly true in the Herbrand interpretation, but it is not true in the interpretation on which this Herbrand interpretation is based.
In the case of Basic Logic, the Herbrand interpretation corresponding to an ordinary interpretation satisfies exactly the same sentences as the ordinary interpretation. While this relationship does not hold for Universal Logic, a weaker version does hold.
The Skolemization of a sentence &phi' obtained by replacing all variables in φ with new constants. The importance of Skolemization stems from the fact that a sentence in Universal Logic is satisfiable if and only if its Skolemization is satisfiable.
The upshot of this is that a version of the Herbrand method can still be used in Universal Logic. It is essentially the same as the Herbrand Method for Basic Logic. The only difference is that any variables that occur in the desired conclusion must first be replaced by new object constants. If a sentence can be proved with arbitrary new object constants substituted for variables, then it must be true for all objects in the universe of discourse.
Checking whether Δ logically entails ψ is a two-step process. We first create the Skolemization φ' by replacing all variables in φ with new constants. We then loop over Herbrand interpretations for the (possibly enlarged) language in the usual fashion. If every Herbrand interpretation that satisfies the premises also satisfies the Skolemized conclusion, then the premises logically entail the conclusion.
Suppose, for example, we were asked to determine whether (p(x) ⇒ q(x)) and (q(x) ⇒ r(x))logically entail (p(x⇒ r(x)). Remember that, in Relational Logic, free variables are interpreted as universally quantified variables; hence the implication here must be true for all objects in the universe of discourse.
We start by Skolemizing the desired conclusion. We introduce a new object constant a and then create the Skolemized conclusion (p(a) ⇒ r(a)). Next, we write out the set of all Herbrand interpretations. We then crossout those that do not satisfy the premises. The first table below on the left is the complete set of Herbrand interpretations for this language. The second table results from crossing out those interpretations that do not satisfy (p(x) ⇒ q(x)). The third table results from crossing out the interpretations that do not satisfy (q(x) ⇒ r(x)). The fourth table is the result of crossing out all rows that do not satisfy both premises.
|
|
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Now, we verify that the remaining interpretations satisfy the Skolemized conclusion (p(a) ⇒ r(a)). Consequently, we can conclude that the premises in this case logically entail the conclusion.
Existential Logic is the same as universal logic except that it includes quantified sentences. It is called existential logic because the problems that arise are due to the presence of existential quantifiers. Universal quantifiers can also cause problems but only when they are equivalent to existential quantifiers, i.e. when used in a negative context such as the target of a negation or the antecedent of an implication or reduction.
The bad news is that no version of the Herbrand Theorem holds for Existential Logic. If a set of sentences in an Existential Logic has a model, then it does not necessarily have a Herbrand model.
To see that this is true, consider the following example. Suppose we have a language with two object constants a and b and a single unary relation constant p. Now, consider the following sentences in existential logic.
The interpretation shown below is a model of these sentences. There are three objects in the universe of discourse. The object constants a and b map to two of these objects, and the relation constant p maps to a relation that is true of the third object only.
| ∀i | = | {1, 2, 3} |
| ai | = | 1 |
| bi | = | 2 |
| pi | = | {3} |
The corresponding Herbrand interpretation in this case is shown below. We have a universe of discourse with just two objects.
| ∀h | = | {a, b} |
| ah | = | a |
| bh | = | b |
| ph | = | {} |
Unhappily, this interpretation is not a model of the sentences above. The third sentence requires that there be an object that satisfies p, and this interpretation does not contain any such object.
What's more, it is easy to see that there can be no model built on the Herbrand universe. Because of the first two sentences, the relation corresponding to r must not be true of these constants, and there is no other object in the universe of discourse to satisfy the third sentence.
The upshot of this is that the Herbrand method does not work for Existential Logic. It can be used to show that a set of sentences is satisfiable. However, the failure to satisfy all Herbrand interpretations does not mean that there are no interpretations that satisfy the sentences, a this example demonstrates.
Functional Logic is the same as Universal Logic except that it includes functional terms. We are still not permitted to use quantifiers, though as with Universal Logic free variables are implicitly universally quantified.
For Universal Logic, the Herbrand universe is the set of all object constants. For Functional Logic, we must also include complex terms that can be formed by applying function constants to ground terms. If there are no object constants, then we include an arbitrary object constant, say a.
As an example, consider a set of sentences with a single constant a and no function constants. In this case, the Herbrand universe consists of just this single constant.
Now, let us add two unary function constants, say f and g. In this case, we must add additional terms, as shown below. Note that the Herbrand universe is infinite in this case, as it is for all functional languages.
A Herbrand interpretation for functional logic is an interpretation in which (1) the universe of discourse is the Herbrand Universe for the language, (2) each object constant maps into itself, and (3) each function constant denotes a function that maps ground terms to themselves.
For example, the interpretation shown below is a Herbrand interpretation with objects constant a, unary function constants f and g, and unary relation constant p. The object constants maps into itself, and each function constant is interpreted as a function that maps each ground term into a new term comprised of that function constant applied to that term.
| ∀h | = | {a, f(a), g(a), f(f(a)), f(g(a)), g(f(a)), g(g(a)), ...} |
| ah | = | a |
| fh | = | {a → f(a), f(a) → f(f(a)), g(a) → f(g(a)), ...} |
| gh | = | {a → g(a), f(a) → g(f(a)), g(a) → g(g(a)), ...} |
| ph | = | {a, g(a), f(f(a)), g(g(g(a)))} |
The interpretation shown below is also a Herbrand interpretation for this language. The object constants and function constants are interpreted as before, as required by the definition of Herbrand interpretation. The only thing different, the only thing that can be different is the interpretation of the relation constant p.
| ∀h | = | {a, f(a), g(a), f(f(a)), f(g(a)), g(f(a)), g(g(a)), ...} |
| ah | = | a |
| fh | = | {a → f(a), f(a) → f(f(a)), g(a) → f(g(a)), ...} |
| gh | = | {a → g(a), f(a) → g(f(a)), g(a) → g(g(a)), ...} |
| ph | = | {a, f(a), f(f(a)), f(f(f(a))), ...} |
We define a Herbrand variable assignment as a variable assignment in which the universe is the Herbrand Universe for the language. The only difference is that the universe of discourse now contains more objects.
| xh | = | a |
| yh | = | a |
| zh | = | g(f(a)) |
The good news about Functional Logic is that the Herbrand Method works, provided we use the algorithm used for Universal Logic. Many people are surprised by this result. It is possible to have a finite model for a functional language; the universe of discourse of the corresponding Herbrand interpretation is infinitely large; yet the two interpretations satisfy the same sentences.
Unfortunately, in the case of Functional Logic, the result is not as useful as it is for Universal Logic. The size of the Herbrand universe for Functional Logic is infinite. Consequently, checking the Herbrand interpretations for a language to determine satisfiability and logical entailment is not feasible.
1. Herbrand Tables. Consider a language with two object constants a and b and a single binary relation constant r. Write out the Herbrand table for this language. Show answer
2. Corresponding Herbrand Interpretations. Consider the interpretation shown below.
| ∀i | = | {1, 2, 3} |
| ai | = | 1 |
| bi | = | 2 |
| ri | = | {[1,1], [2,2], [1,3]} |
Write out the Herbrand interpretation corresponding to this interpretation. Show answer