OK, fine—we can't actually tell you how to prove everything. But we can give you some advice that will help you when you try to prove anything.
Writing a proof is more art than science. Although there may be a “correct” way to prove something (or several correct ways), there is still a wide range of styles, formats, and logical implications that follow the same basic argument. (Similarly, you and your friend might write very different essays on “How I Spent My Summer Vacation,” even if you did the same things during your vacations.)
Writing a proof is very much like arguing a case in court. (Or at least it's like how it looks on TV.) Like a courtroom argument, a proof should contain a beginning, a middle, and an end.
Courtroom Claim: Dr. Evil is guilty of stealing pencils from Prof. Plum's desk.
Courtroom Argument: Consider the man sitting before you, Dr. Evil. You already know that security camera video from the night of October 6 shows Dr. Evil entering Prof. Plum's office building. Let the security video tape from that night be labeled as “Exhibit A.” Today I will convince you, beyond a reasonable doubt, that Dr. Evil stole pencils from Prof. Plum's desk on that night. I will do this by providing physical evidence placing Dr. Evil in Prof. Plum's office and demonstrating that Dr. Evil had, indeed, touched pencils recently.
See how similar their structures are?
Courtroom Argument: Exhibit A shows Dr. Evil entering Prof. Plum's office building at 11:37 PM. At approximately 11:45 PM, graduate students saw a secretive figure attempting to pick the lock on Prof. Plum's office door. Dr. Evil's fingerprints were found on the door, and in Prof. Plum's office, on the following day (October 7), and since Prof. Plum's office had been steam‐cleaned the day before, the fingerprints must have been left on the night of October 6. Moreover, graphite stains on Dr. Evil's shirt match the precise composition of graphite contained in the pencils that Prof. Plum keeps in his desk.
Note that, in the proof, we provided justification for the steps that were not immediately obvious but omitted justification for the easy algebraic step. What counts as “easy” or “obvious” is, of course, a subjective matter. In general, a good rule of thumb to use is that, if your fellow students were reading your proof, they should be able to follow each step without having to look up any facts or write down any additional derivations.
Courtroom Argument: Ladies and gentlemen of the jury, the preponderance of evidence demonstrates that Dr. Evil has committed this heinous crime. You have no choice but to find him guilty.
Just like the lawyer's argument, the proof uses words—not just math—to lead the reader on the path from assumptions to conclusions. Of course, in a legal trial, facts and implications are subject to interpretation—that's why we have judges and juries. In a mathematical proof, however, all facts and logical implications should be incontrovertible.
Here is a (very nonexhaustive) list of the kinds of statements you may be asked to prove in this book or at some other point in your proof‐writing career:
This is the simplest kind of statement (though that does not mean it will require the simplest proof). It simply asks you to prove that two mathematical objects are equal.
Notice that in this example, there is a qualifying statement to set up the statement you are asked to prove; this is fairly typical.
The symbols p and q stand here not for variables but for statements. The symbol is interpreted as “p implies q,” and it is the same as saying “if p then q.”
Here p is “ ” and q is equation B.2).
The symbol means “if and only if” (sometimes abbreviated “iff”). The claim indicates that either both statements are true or both are false. Another description for this kind of statement is that q is a necessary and sufficient condition for p—in order for p to be true, q must be true, and the truth of q is sufficient to ensure the truth of p.
To prove an iff statement, you must prove both directions of the implication—that is, you must prove that p implies q and that q implies p. Sometimes you can do this all at once using a string of iff implications:
In other cases, though, the proof needs to be divided into two parts. In the first part, you prove one implication (e.g., ), and in the second part, you prove the reverse implication ( , or its logical equivalent, ).
Here you are asked to prove that for all x that satisfy a certain [condition], some [statement] is true.
To prove a “ ” claim, you take the [condition] as given and prove that the [statement] is true. This actually feels a lot like a “ ” claim, and in fact they are often logically equivalent.
This time you need to prove that there exists (at least one) x that satisfies the [statement]. Sometimes there are qualifying conditions on the type of x that are allowed.
(In this case, there are many x that will do the trick, but you are asked only to prove the existence of one of them.)
In other words, you are being asked to disprove the statement p.
In general, it suffices to find a single example for which p is not true. In the example above, you just need to find two convex functions whose product is not convex.
However, if p is of the form “ such that [statement],” then to disprove p you must prove that the [statement] is false for all x. This may be easy or hard. Here's a hard example:
(This is Fermat's Last Theorem, which went unproved for over 350 years until it was finally proved in 1995.)
This section will give you a quick overview of several types of proofs—strategies for proving theorems. (Of course, there are others that this list does not include.) These are tools in your proof‐building toolbox. It's your job to figure out which tool(s) to use for each job.
This is the most common kind of proof—you simply prove the claim directly, through a series of logical implications.
Suppose you are trying to prove that . In a proof by contradiction, you assume p, as usual, but then you assume that q is not true and then prove that a contradiction occurs. In particular, you show that if q is false, then so is p. And since you have assumed that p is true, you have now proven that p is both true and false—an impossibility. Therefore, the assumption of must have been false—in other words, q must be true.
Actually, to highlight the structure of a proof by contradiction, let's rewrite the theorem in “ ” form, even though it's a little more awkward:
Note the parenthetical phrase “for a contradiction.” This is not strictly necessary, but it does help the reader by letting him or her know that the assumption you're about to make is not one that you actually believe—you are making it solely for the purpose of proving a contradiction later.
The contrapositive of the statement is , and the two are logically equivalent; therefore, you can prove by proving . This feels a lot like a proof by contradiction—we assume and prove . The difference is that in a proof by contradiction, we also assume p and we use it to derive the contradiction. For example, we used the fact that N is the number of primes to build B, and then we used B to contradict the fact that N is the number of primes. In a proof by contrapositive, we don't need to assume p—we simply assume and prove .
Mathematical induction is useful when you need to prove something about all the integers (or all the members of some other countable set). The idea is to prove that if the claim is true for (an arbitrary) n, then it must also be true for . If we can prove this implication, then it holds for any n—that is, the truth of the claim for n implies the claim for , and this in turn implies the claim for , and then for , and so on. Therefore, this general implication (truth for n truth for ) is powerful enough to prove that the claim is true for all integers greater than or equal to n. Of course, we also have to get the process started, by proving that the claim is true for .
A proof by induction generally has two parts: In the first (often called the base case), we prove that the claim is true for , and in the second (called the induction step), we prove that, if the claim holds for n, then it also holds for . In the induction step, we are allowed to assume that the claim holds for n—this is called the induction hypothesis.
Note the phrase “by induction on n” at the start of the proof. This is not strictly necessary, but it helps the reader by telling him or her how we're going to prove the theorem.
In this method, the universe of possibilities is divided into cases, and the claim is proved for each case separately. We don't know which case applies—typically, all of the cases are possible. But since we've proved the claim for every case, it doesn't matter which case holds.
For example, suppose you want to say:
Let be the order times, let T be the set of order times, let , and let
Then you may wish to help out your reader a bit by also saying:
That is, S is the set of order times for which the inventory level (just before ordering) is positive, and is the earliest such time.
=
and ==
in C/C++, Java, and other programming languages. It's also the difference between
and
in the pseudocode in this book.) For example, consider the following proof fragment:
This fragment is confusing. Is v a new symbol that is being defined as ? Or do we already know that ? Did the first step prove that ? Or do we already know that ? Or is it another new symbol?
True, a smart reader might be able to figure all this out. But the reader's life would be a lot easer if the proof‐writer instead wrote:
We know that . Let . Since (by Theorem 4.3), we have
This bullet is really a special case of the next.
For example, consider the following proof fragment:
The math would be a lot easier to follow if the proof‐writer had written the fractions the way they were intended to be written:
In general, thoughtfully typeset math will make your proof easier to read.