Induction
The proofs we've seen so far are examples of one type of proof, the direct proof. We now introduce another type of proof, the indirect proof. Indirect proofs come in several variants. One such variant is the proof-by-contradiction.
Proof-by-contradiction
In a proof-by-contradiction, we start by assuming that the opposite of what we are trying to prove (the negation) is true. Then, following this assumption, we take logical deductions until we arrive at a contradiction (a proposition that states "false equals true"). If we ever reach a false-equals-true, then that means that what we assumed from the very beginning was wrong. In other words, the opposite (the negation) cannot be true. And since the negation cannot be true, the original proposition must be true. More formally:
To prove a proposition is true, we assume that is false ( is true). From this hypothesis ( is false), we prove that a falsehood is true ( is false implies is true). In doing so, we prove that it must be the case that is true.
This works because if we can prove that then the only way for to be true is if is false. And if is false, then is true.
Example: Prove that is irrational. First, let's be clear about what our terms mean. An irrational number is a number that cannot be written as the ratio of two integers. In contrast, a rational number is a number that can be written as the ratio of two integers.
When we write a proof by contradiction, we always start by indicating that the proof is a proof by contradiction.
Assume for the purpose of contradiction that is a rational number. If is rational, then where and are integers, and is a fraction expressed in lowest terms (a fraction with no common divisors).
Squaring both sides: Multiplying both sides by we obtain This implies that is even, since can be written as given that is an integer. Because is even, then it must be true that is even (since the square of an even number is even) — ("two divides ").
Given that is even, is a multiple of 4 — This means that Dividing each side by 2,
Now, if 2 divides by then it must be the case that is even. We've arrived at a contradiction. If is even and is even, then is not a fraction in lowest terms. This means that cannot be rational, so it must be irrational.
The proof above was first discovered by a religious society of ancient Greece called the Pythagoreans. In ancient Greece, the practice of mathematics was a religious exercise. The ancient Greeks believed that mathematics was ruled by two gods — Apeiron and Peras. Apeiron was viewed as a malevolent god: the embodiment of primal chaos, the god of the infinite. In contrast, Peras was viewed as the benevolent one: the embodiment of order, the god of the finite.
One of the axioms of these mathematical mystics was that there were no irrational numbers — they simply did not exist. The Greeks denied their existence because they were evil; they were infinite. In contrast, the mystics affirmed the existence of the rational numbers because they were finite.
This denial of the infinite extended to other spheres. Every line, according to the Pythagoreans, was finite. From this axiom, they derived what we now know as the Pythagorean Theorem. If there was a triangle with legs of length 1, then the length of the hypotenuse was Because they denied the existence of the irrationals, the Pythagoreans constructed a theorem that was rational. Eventually, the Pythagoreans discovered the proof above.
Needless to say, this caused mayhem. The Pythagoreans were now confronted with the fact that their axioms were inconsistent. Every theorem they had proven was premised on the non-existence of irrationals, and they now had just found an irrational. The solution: Cover it up. According to legend, the Pythagoreans not only denied the result, they went so far as to murder the member who discovered the proof.
The Well-Ordering Principle
The Well-ordering Principle states:
definition. Every nonempty set of nonnegative integers has a smallest element.
This short statement contains numerous implications. First, it's false for the empty set since the empty set contains no elements. Second, it's false for the negative integers. And since it's false for the negative integers, it's false for the set of all integers. I.e., is not well ordered. And since it's false for it's false for the set of all rational numbers, And since it's false for it's false for the set of all real numbers, These all imply that there's something very special about the nonnegative integers, a.k.a. the natural numbers 1
The well-ordering principle allows us to construct numerous proofs-by-contradiction, and it is a cornerstone of proof-by-induction. For example, recall the proof that is irrational. In the previous proof, we used a little number theory and algebraic manipulation to construct a proof-by-contradiction. In that proof, however, we relied on a particular postulate:
Postulate. Let Then can be written in lowest terms. In other words, cannot be written as where and are positive integers with no common prime factors.
How do we know that this postulate is true? Well, we can prove it with WOP:
Proof (By Contradiction). Let Suppose cannot be written in lowest terms. Further suppose that is the set of all positive integers that are numerators of In other words, Because is nonempty.
Since is nonempty and then by the well-ordering principle, contains a smallest integer Hence, again by WOP there exists an integer such that:
cannot be written in lowest terms.
Because cannot be written in lowest terms, and must have a common prime factor, Thus:
This implies that cannot be written in lowest terms either. Because is the set of all positive integers that are numerators of the numerator is a member of But This contradicts the fact that is the smallest element of
Since the assumption that leads to a contradiction, it follows that must be empty. And because must be empty, for all positive integers, the fraction that can be written in lowest terms.
Proof-by-induction
We now turn to another variety of proofs — proof-by-induction. The concept of induction is actually a predicate itself:
Axiom. Let be a predicate. If is true, and is true, then is true.
Another way of expressing the axiom above: If is true, and if and if all the way up to then is true for any natural number As an axiom, induction is relatively straightforward. If know that is true, and we know that and all the way up to then it makes sense for to be true for all We can think of this as akin to dominoes falling.
Of note, mathematical induction is not a form of inductive reasoning. In inductive reasoning, we reason that because of some previously observed phenomenon, we can make inferences about the future. This is entirely unrelated to mathematical induction. Mathematical induction is, in fact, a form of deductive reasoning, not inductive reasoning. In other words, mathematical induction is to induction what Rocky Mountain oysters are to oysters. Similar names but very, very different things. When mathematicians use the word "induction" alone, they are referring to the mathematical, rather than epistemic, context.
Let's see an example. Prove the following:
Note that the ellipses, tells us that we must complete the pattern. In this case, there is a pattern to In this case, each element in the sequence is an increment-by-one of the element before it. Because this can be very vague, we have several ways to express this idea more explicitly.
In each of the notations above, we are expressing the clause, "The sum of from to Thus, the proposition we are asked to prove can be more succinctly expressed as:
The notation above is called a summation. There are a number of reasons why use the summation notation. To begin with, if then the expression is ambiguous and confusing. With the summation notation, we can simply say the when then since the sum of 1 to 1 is simply 1. Similarly, if then Why? Because there are no numbers to sum. We call these two cases, and the edge cases.
All this in mind, let us write the proof. The first thing we do is to state that the proof will be done by induction. Next step: What is our predicate? In other words, what is the hypothesis? Here, it is the proposition we are trying to prove: That
Proof (by induction).
Let be the proposition that
Now we begin thinking about the proof. The first thing to analyze is the base case. Here, we must establish that is true. We saw this earlier. The sum of to 0 of is 0. The proposition is also true by algebra:
Having proved the base case is true, we now must prove the second part, the inductive case. In this case, we must show: For is true.
Now, recall basic logic. How do we show that an implication is true? There is only one situation where the implication is false, and that is when is true and is false. Thus, to prove this implication, we must assume that is true, then determine whether is true. So, putting together what we have so far:
Let be the proposition that When then This is the base case. We now prove the inductive case: For
Assume is true for purposes of induction. Following this assumption, we must prove that
The last sentence above is arguably the most confusing for non-mathematician readers. It seems as if in trying to prove we are assuming Isn't this circular? Admittedly, it appears to teeter very close. In actuality, it is not at all circular. We are assuming to prove the necessary condition of the proposition Remember, in mathematics, we write a proof one step at a time, from one deduction to the next.
We should also be very clear about what we mean when we say something like "Assume " We are not saying that is, in fact, true. We are saying, "Just pretend for a moment—for the sake of a hypothetical—that is true." The only purpose of an "Assume " statement is to obtain a non-hypothetical statement of the form For example, remember how we wrote a proof-by-contradiction? The same idea is at work here. We want to reach some non-hypothetical conclusion: "When we assume this is what happens. And since this is what happens, it must be true that ..."
Returning to the proof, we are now trying to prove that We can rewrite this as Since we assumed that is true, we know that:
Thus, we have:
Putting this all together:
Let be the proposition that When then This is the base case. We now prove the inductive case: For
Assume is true for purposes of induction. Following this assumption, we must prove that Since is true, we can rewrite the expression as Rewriting this expression, we have which is Factoring this expression, we have
Thus, we have proved that is true. And since is true when is true, the implication cannot be false. Therefore, the implication must be true. Ergo, is true.
Let's do another example. Prove the following:
Let's first test the predicate:
3 does in fact divide 120.
First, what is our induction hypothesis? It is: Let Next, we should establish that our base case is. Our base case is when Checking:
Our base case is true; 3 does in fact divide 0. Now we turn to the inductive step: For is true. More formally:
To prove that this is true, we first assume that is true. In other words, we assume that is true. Now we examine the necessary condition, The proposition is Exampding this:
Looking at this result, it doesn't really look like is a multiple of 3. We need to wrtie this a bit more. We want to use the fact that is a multiple of three (remember, we assumed it). Remember, whenever we perform a proof-by-induction, we always want to make use of the sufficient condition. So, to make use of our sufficient condition, we want to get a inside the expression. So, we need to rewrite the expression so as to incorporate a minus sign:
Now it's very clear that is true:
We need not always have a base case start at 0. We could have a base case and then the induction step
Consider the following problem. We have a grid, and we want to tile it with an L-shaped tile, such that the entirety of the grid is filled with one square open:
It looks like we can from the diagram above. Now we want to prove it for all Here is the proposition:
there exists a way to tile a region with a center square missing.
First, we establish our induction hypothesis: a way to tile a region with a center square missing. Second, we establish the base case, Where we only have one square, since That is the center square missing, so is true where is true.
Now we consider the inductive case: For assume is true to verify Now we must prove that is true.
Let's consider It turns out it's very difficult to prove when a center square is missing (try it). It is actually much easier if we make a stronger assumption — any square is missing. With this assumption, we can place the missing square anywhere. In doing so, we can think of any region as smaller parts of And since is true, we can place the missing square anywhere in a region
Footnotes
-
The well-ordering principle is often shortened to WOP, not to be confused with Belcalis Marlenis Almánzar and Megan Jovon Ruth Pete's popular hit, WAP. ↩