Proof Writing Guide
A comprehensive reference for writing proofs in the long-form mathematics style. Covers the three-phase proof pattern, templates for every major proof technique, in-line commentary, the almost-correct proof pattern, common mistakes, and a technique selection flowchart.
Table of Contents
- The Three-Phase Proof Pattern
- Proof Templates by Technique
- In-Line Commentary Patterns
- The Almost-Correct Proof Pattern
- Common Proof Mistakes
- Technique Selection Flowchart
1. The Three-Phase Proof Pattern
Every nontrivial proof should follow a three-phase structure: Strategy, Proof, and Analysis. This pattern makes the invisible thinking process visible and gives the reader understanding, not just verification.
Phase 1: Pre-Proof Strategy
Before the formal proof, show the reader how the proof was discovered. This is labeled Scratch Work (Cummings) or Proof Strategy (Chartrand).
What to include:
- Goal identification. State clearly what must be shown. Write the conclusion you need to reach.
- Working backward. Start from the desired conclusion and manipulate it toward known facts or assumptions.
- Technique selection. Explain why you are choosing a particular proof method (direct, contradiction, etc.). If a direct approach runs into trouble, say so explicitly.
- Handling difficulties. If the path is not straightforward, show where the obstacles are and how you navigate them.
- The key insight. If the proof depends on a clever trick (adding and subtracting a term, choosing a specific epsilon, partitioning into cases), reveal it here with motivation.
For long proofs, also provide a Proof Idea: A plain-English summary of the high-level strategy, in one or two sentences, so the reader has a map before entering the technical details.
Full Example: The Three-Phase Pattern in Action
Theorem. If is a sequence of real numbers such that and , then .
Proof Idea. We show that is smaller than every positive number, which forces . The triangle inequality provides the key bridge between the two convergence assumptions.
Scratch Work. We want to show , or equivalently . A natural approach is to show for every , since the only nonnegative number less than every positive number is zero.
Given , we know:
- Since , there exists such that implies .
- Since , there exists such that implies .
If we take and pick any , the triangle inequality gives:
That works. The key trick is splitting into two halves — one for each convergence assumption.
Proof. Let . Since , there exists a positive integer such that for every . Since , there exists a positive integer such that for every .
Let and let . By the triangle inequality,
Since for every , and , we conclude that , and therefore .
Moral. This proof illustrates a fundamental technique in analysis: to show two quantities are equal, show their difference is smaller than every positive number. The triangle inequality is the workhorse that connects separate estimates into a single bound.
Alternative approach. The result can also be proved by choosing (assuming ) and showing that the -neighborhoods of and are disjoint, creating a contradiction when must eventually lie in both. This geometric argument is more visual but slightly longer.
Phase 2: The Proof Itself
The proof is the polished, forward-directed argument. It should read like a finished narrative, not scratch work.
Template structure:
Proof.
[1] State what is assumed. ("Let n be an odd integer.")
[2] Unpack definitions. ("Then n = 2k + 1 for some integer k.")
[3] Perform computation, with each step justified.
("Therefore, n^2 = (2k+1)^2 = 4k^2 + 4k + 1 = 2(2k^2 + 2k) + 1.")
[4] Identify the key expression. ("Since 2k^2 + 2k is an integer, ...")
[5] Conclude by stating what has been shown.
("...we conclude that n^2 is odd.")
Rules for the proof body:
- State assumptions first, before doing anything else.
- Introduce every new variable with its type: “where is an integer”, not just "".
- At each step, explain why it is taken, not just what happens.
- Use words between equations (see Section 3).
- Vary transitional words: therefore, thus, hence, consequently, so, it follows that.
- Conclude by explicitly restating what has been proved.
Phase 3: Post-Proof Analysis
After the proof, step back and reflect. This is labeled Proof Analysis (Chartrand) or Moral (Cummings).
What to include:
- Could this be simpler? Is there a shorter or more elegant route?
- The moral. What does this result teach us, in plain English? What is the conceptual takeaway?
- Alternative proofs. If a different technique would illuminate a different aspect, sketch it or point to it.
- Reverse-engineering insight. Show how the polished proof was constructed by working backward from the goal. This answers: “How would I ever think of this?”
- Extensions and connections. Does this result generalize? Does it connect to a bigger picture?
2. Proof Templates by Technique
2.1 Direct Proof
When to use. The hypothesis directly provides enough information to derive the conclusion. This is the default — try it first.
Template:
Proof. Assume that [hypothesis] holds for an arbitrary [element type].
Then [unpack the definition]: [variable] = [expression], where [variable] is [type].
Therefore, [computation with justification].
Since [key expression] is [type], we conclude that [conclusion]. ▢
Example. The sum of two even integers is even.
Proof. Let and be even integers. Then and for some integers and . Therefore,
Since is an integer, we conclude that is even.
Common mistake. Writing and . This forces . Use distinct variables for independent quantities.
2.2 Proof by Contrapositive
When to use. The hypothesis gives a complicated expression that is hard to work with directly, but the negation of the conclusion gives a simpler starting point. If proving , instead prove .
Decision heuristic. If starting from the hypothesis leads to an expression like where you cannot tell whether it is even an integer, try the contrapositive — you may get a cleaner starting expression.
Template:
Proof. We prove the contrapositive: if [not Q], then [not P].
Assume that [not Q] for an arbitrary [element type].
Then [unpack negated conclusion], so [computation].
Therefore [negated hypothesis holds], which completes the proof. ▢
Example. If is even, then is odd.
Scratch Work. A direct proof would assume , giving . It is not clear this is an integer, let alone odd. The contrapositive — assume is even — gives us , which is much easier to work with.
Proof. We prove the contrapositive: if is even, then is odd. Assume that is even, so for some integer . Then
Since is an integer, is odd.
2.3 Proof by Contradiction
When to use. The result is “negative-sounding” — it asserts that something does not exist, cannot happen, or is impossible. Also useful when neither direct proof nor contrapositive yields a clean path.
Template:
Proof. Assume, to the contrary, that [negation of statement].
Then [derive consequences from this assumption].
[Arrive at a statement that contradicts a known fact, an axiom, or the hypothesis.]
This is a contradiction. Therefore, [original statement] holds. ▢
Example. There is no rational number whose square is 2.
Proof. Assume, to the contrary, that there exists a rational number such that . Write where and are integers with and . Then , so is even. Since is even, itself is even (the square of an odd number is odd). Write for some integer . Then , which gives , so . By the same reasoning, is even. But then both and are even, contradicting .
Common mistake. Arriving at a “contradiction” that is not actually contradictory — for example, deriving that a variable is both positive and nonnegative (which is consistent, not contradictory).
2.4 Proof by Cases
When to use. The domain partitions naturally into cases (even/odd, positive/negative/zero, vs. ), and no single argument handles all cases at once.
Requirements:
- Cases must be exhaustive (cover every possibility).
- Cases should be disjoint when possible (to avoid redundancy).
Template:
Proof. We consider [number] cases.
Case 1: [condition]. [Proof for this case.]
Case 2: [condition]. [Proof for this case.]
Since these cases are exhaustive, the result follows. ▢
The WLOG pattern. When two cases are symmetric (e.g., “either or ”), prove one and state: “Without loss of generality, assume .” But you must explain why there is no loss of generality — the cases must truly be interchangeable by symmetry.
Example. For every integer , the integer is even.
Proof. We consider two cases based on the parity of .
Case 1: is even. Then for some integer , so , which is even.
Case 2: is odd. Then for some integer , so , which is even.
Since every integer is either even or odd, we conclude that is even for every integer .
2.5 Existence Proofs
Constructive Existence
When to use. The statement asserts “there exists an such that…” and you can explicitly exhibit such an .
Template:
Proof. We claim that [explicit object] satisfies the required property.
Indeed, [verify the property by direct computation]. ▢
Example. There exists an integer such that .
Proof. We claim that satisfies this property. Indeed, .
Non-Constructive Existence
When to use. You can prove that an object must exist (via contradiction, the pigeonhole principle, or another theorem) without explicitly constructing it.
Template:
Proof. Assume, to the contrary, that no [object] satisfies [property].
[Derive a contradiction from this assumption.]
Therefore, such an [object] must exist. ▢
2.6 Uniqueness Proofs
When to use. The statement asserts that exactly one object has a certain property. Uniqueness proofs typically combine an existence proof with a uniqueness argument.
Template for the uniqueness part:
To show uniqueness, suppose that both x and x' satisfy [property].
Then [derive from the property that x and x' must be equal].
Therefore x = x'. ▢
Example. The additive identity in a ring is unique.
Proof. Let and both be additive identities in a ring . Since is an additive identity, . Since is an additive identity, . Therefore .
2.7 Mathematical Induction
When to use. The statement involves a natural number parameter and the truth of each case builds on the previous one. Typical pattern: the claim holds for all where is some base case.
Template:
Proof. We proceed by induction on n.
Base case (n = n_0): [Verify the statement directly for the smallest value.]
Inductive step: Assume that [statement holds for n = k] for some integer k >= n_0.
We must show that [statement holds for n = k + 1].
[Derive the k+1 case, using the inductive hypothesis explicitly.]
By the principle of mathematical induction, [statement] holds for all n >= n_0. ▢
Example. For every positive integer , we have .
Proof. We proceed by induction on .
Base case (): The left side is . The right side is . The base case holds.
Inductive step: Assume that for some positive integer . We must show that . Starting from the left side and applying the inductive hypothesis:
This completes the inductive step. By the principle of mathematical induction, the formula holds for all positive integers .
Common mistake. Using the inductive hypothesis without labeling it. Always write “by the inductive hypothesis” when you invoke the assumption for .
2.8 Epsilon-Delta Proofs (The Two-Phase Pattern)
When to use. Proving that a limit exists, that a sequence converges, or that a function is continuous — any statement involving and (or and ).
The two-phase approach (from Chartrand):
Phase 1 — Scratch Work (backward). Work backward from the desired inequality.
- Write out what needs to be shown: .
- Algebraically manipulate to isolate (or for sequences).
- Impose a restriction on (typically ) to bound other factors.
- Compute the resulting bound , so that .
- Set .
Phase 2 — Proof (forward). Present the argument cleanly from hypotheses to conclusion.
Proof. Let epsilon > 0 be given. Choose delta = min(1, epsilon/K).
Let x be a real number such that 0 < |x - a| < delta.
Since delta <= 1, [bound other factors using the restriction].
Since delta <= epsilon/K, [derive the final inequality].
Therefore |f(x) - L| < epsilon. ▢
Example. Show that .
Scratch Work. We need , i.e., . Factor: . We need to bound . If , then , so , which gives . Hence . We need , so . Choose .
Proof. Let be given. Choose , and let be a real number such that .
Since , we have , so , and therefore .
Since , we have . Therefore,
This shows that .
The pattern. The choice simultaneously ensures that the bounding argument works () and that the final inequality is satisfied (). This is a signature move in epsilon-delta proofs.
3. In-Line Commentary Patterns
The Principle
Every proof step should explain why it is taken, not just what happens. The reader should never face a chain of bare equations without accompanying words.
Words Between Equations
Bad (bare equation chain):
x + (-x) = x + x'
-x + (x + (-x)) = -x + (x + x')
(-x + x) + (-x) = (-x + x) + x'
0 + (-x) = 0 + x'
-x = x'
Good (with explanatory words):
Since , it follows, by adding to both sides, that . By associativity, . Hence , and therefore .
Connective Phrases
Use these phrases to link steps and explain reasoning:
Causal connectives (explaining why):
- “Since is even, we have for some integer .”
- “Because is continuous on , the extreme value theorem guarantees…”
- “By the triangle inequality, .”
- “By hypothesis, .”
Consequential connectives (stating what follows):
- “Therefore, is even.”
- “Thus, for some .”
- “Hence, the sequence converges.”
- “Consequently, .”
- “So must be even.”
- “It follows that is bounded.”
- “This implies that is injective.”
IMPORTANT: Vary your transitions. Do not write “Therefore” three times in a row. Rotate through the options above.
Attribution phrases (citing a result):
- “By Theorem 3.5, …”
- “By the definition of convergence, …”
- “Applying the Bolzano-Weierstrass theorem, …”
- “By the inductive hypothesis, …”
The “Since/If” Distinction
This is a subtle but important rule:
- “Since” is for facts already established or assumed. “Since is odd” (we already know this).
- “If” is for hypothetical conditions. “If were even, then…”
Wrong: “If is odd” when was assumed odd in the hypothesis. Right: “Since is odd” when was assumed odd in the hypothesis.
Commentary at Key Steps
At critical junctures in a proof — the clever trick, the key substitution, the moment where the hypothesis is used — add a parenthetical or a sentence explaining the motivation:
- “We add and subtract (a standard trick that separates the two limits)…”
- “This is where we use the completeness of the reals: the supremum exists because is bounded above.”
- “Note that without the continuity assumption, this step fails (see Section 6.3).“
4. The Almost-Correct Proof Pattern
One of the most powerful pedagogical tools: show a proof attempt that almost works, diagnose exactly where and why it fails, then fix it. This builds the reader’s ability to evaluate proofs critically.
Structure
- State the theorem and a candidate proof attempt.
- Present the attempt as if it were correct, without flagging the error in advance.
- Pause and examine. Ask: “Does this work?” or “Where does this argument break down?”
- Diagnose the failure. Pinpoint the exact step where the reasoning is invalid and explain why.
- Fix the proof. Either repair the flawed step or use a different technique entirely.
- State the moral. What does the failure teach us about the concept?
Example: Pointwise vs. Uniform Convergence
Claim. If pointwise and each is continuous, then is continuous.
Proof attempt. Let and let be a point in the domain. Since pointwise, there exists such that for all . Since is continuous at , there exists such that implies . Then by the triangle inequality…
Wait. Read the second sentence again. We wrote: “there exists such that for all .” But pointwise convergence only guarantees that for each fixed , there exists (depending on ) such that the approximation holds. The we need might be different for each point .
Diagnosis. The proof attempt secretly used uniform convergence (a single works for all simultaneously) when only pointwise convergence was given. This is the crucial distinction:
- Pointwise: For each , for each , there exists such that…
- Uniform: For each , there exists such that for all …
The order of quantifiers matters. In pointwise convergence, depends on . In uniform convergence, is independent of .
The fix. The claim is actually false under pointwise convergence. A counterexample: let on . Each is continuous, and pointwise where for and . The limit is discontinuous at .
The correct theorem requires uniform convergence: if uniformly and each is continuous, then is continuous.
Moral. Pointwise convergence preserves almost nothing about the individual functions. Uniform convergence is the stronger condition needed to transfer properties like continuity from the approximating functions to the limit. Whenever you find yourself needing “the same for all ,” you are implicitly invoking uniform convergence.
When to Use This Pattern
- When a subtle hypothesis is easy to overlook (uniform vs. pointwise, open vs. closed, bounded vs. unbounded).
- When a common student mistake reveals a genuine conceptual distinction.
- When the failure is more instructive than a correct proof would be on its own.
- When the fix illuminates why a hypothesis exists in a theorem.
5. Common Proof Mistakes
5.1 Reusing Variables for Independent Quantities
The mistake: When two objects satisfy the same property, using the same variable for both.
Bad:
Let and be even integers. Then and for some integer .
This forces , which was not assumed. The proof now only handles the case where both integers are equal.
Good:
Let and be even integers. Then and for some integers and .
Rule. Every existential unpacking introduces a fresh variable. If two objects independently satisfy a property, they get independent witnesses.
5.2 Proof by Example
The mistake: Verifying a universal claim for specific values and concluding it holds in general.
Bad:
Claim: The sum of two odd integers is even. “Proof”: , which is even. , which is even. Therefore the sum of two odd integers is even.
This proves nothing about, say, , let alone all pairs. Examples can illustrate, motivate, and build intuition, but they cannot replace a general argument.
When examples ARE proofs. Existence proofs (“there exists an such that…”) can be proved by a single example. Disproving universal claims (“not every satisfies…”) requires only one counterexample.
5.3 Assuming What You Are Proving
The mistake: Using the conclusion as part of the argument for the conclusion.
Bad (proving is irrational):
Assume in lowest terms. Then , so is even. Write . Then , so , so is even. Since both are even, is irrational.
The last sentence is a non sequitur. The correct conclusion is: “Since both and are even, this contradicts our assumption that . Therefore no such rational number exists.” The proof must explicitly close the loop of the contradiction.
A subtler form: In a proof that , writing ” is true” in the middle and then using to derive further consequences, rather than deriving from .
5.4 Missing WLOG Justification
The mistake: Writing “Without loss of generality, assume ” without explaining why there is no loss of generality.
Bad:
Without loss of generality, let .
If the claim treats , , symmetrically (e.g., “the sum is even”), then WLOG is justified because relabeling does not change the sum. But if the claim is not symmetric in , , , then this assumption does lose generality.
Rule. Every use of WLOG must be accompanied by a brief justification: “Without loss of generality, assume (since the roles of and are symmetric in the hypothesis and conclusion).“
5.5 Using “Clearly” or “Obviously” to Skip Steps
The mistake: Disguising missing reasoning behind authority words.
Bad:
Clearly, is continuous on , and obviously the maximum is attained.
Good:
Since is a polynomial, it is continuous on . By the extreme value theorem, attains its maximum on .
Rule. If a claim is truly immediate (one mental step), you can omit the justification entirely. If it requires even a small argument, provide it. Never use “clearly” as a substitute for a proof step you do not want to write.
5.6 Bare Equation Chains Without Words
The mistake: Presenting a sequence of equations with no connective text.
Bad:
n^2 + n = n(n+1)
= n(n+1)/2 * 2
QED
Good:
We factor as . Since one of and is even (consecutive integers have opposite parity), their product is even.
Rule. Every equation should be part of a sentence. The reader should never have to guess why one line follows from the previous one.
5.7 Failing to Introduce Variable Types
The mistake: Writing without saying what is.
Bad:
Since is odd, .
Good:
Since is odd, for some integer .
If has not appeared before, the reader does not know whether it is an integer, a real number, or something else. State the type.
5.8 Confusing the Direction of an Implication
The mistake: Proving when the claim is (the converse error).
This is especially common in biconditional proofs, where both directions must be proved separately. Always label which direction you are proving and verify you have not mixed them up.
6. Technique Selection Flowchart
When facing a statement to prove, use this decision tree to select a technique. Start at the top and follow the first matching branch.
START: What kind of statement are you proving?
│
├─ Is it "P if and only if Q"?
│ └─ YES → Prove both directions separately (P⇒Q and Q⇒P).
│ Each direction may use a different technique.
│
├─ Does it assert "there exists" something?
│ ├─ Can you exhibit an explicit object? → Constructive existence proof.
│ └─ Cannot construct? → Non-constructive (contradiction, pigeonhole, etc.).
│
├─ Does it assert uniqueness ("there exists exactly one")?
│ └─ YES → Existence proof + assume two exist, show they are equal.
│
├─ Does it have a natural number parameter (holds "for all n ≥ n₀")?
│ └─ YES → Try mathematical induction.
│
├─ Is the conclusion "negative-sounding"?
│ │ ("no such X exists", "it is impossible that", "X is irrational")
│ └─ YES → Proof by contradiction. Assume the negation and derive
│ a contradiction.
│
├─ Does the hypothesis give a complicated expression?
│ │ (e.g., "if 5x - 7 is even" gives x = (2a+7)/5)
│ └─ YES → Try proof by contrapositive. Starting from ¬Q may give
│ a simpler expression to work with.
│
├─ Does the domain partition naturally into cases?
│ │ (even/odd, positive/negative/zero, n < k vs. n ≥ k)
│ └─ YES → Proof by cases. Check that cases are exhaustive.
│ Use WLOG if cases are symmetric.
│
├─ Does it involve limits, continuity, or convergence?
│ └─ YES → Epsilon-delta (or epsilon-N) proof.
│ Use the two-phase pattern (scratch work backward,
│ proof forward).
│
└─ DEFAULT → Try a direct proof. Assume the hypothesis,
unpack definitions, compute toward the conclusion.
Worked Decision Examples
Statement: “If is even, then is even.”
- Not biconditional. Not existence. Not induction. Not negative-sounding.
- The hypothesis gives , so — complicated.
- Try contrapositive: Assume is odd. Then , so , which is odd. Clean and direct.
Statement: “There is no largest prime number.”
- Negative-sounding (“there is no”).
- Use contradiction: Assume there are finitely many primes . Consider . Then is not divisible by any , contradicting the assumption that every integer greater than 1 has a prime factor among .
Statement: “For all , .”
- Natural number parameter with a base case.
- Use induction. Base case: . Inductive step: for .
Statement: ” if and only if .”
- Biconditional (“if and only if”).
- Prove both directions. () Assume and show by double containment. () Assume and show directly.
Statement: “The sequence converges to 0.”
- Involves convergence.
- Epsilon-N proof. Scratch work: we need , i.e., . Choose to be any integer greater than (the Archimedean property guarantees such exists). The proof follows immediately.
Quick Reference: Proof Opening Lines
| Technique | Opening Line |
|---|---|
| Direct | ”Let be an [type] satisfying [hypothesis].” |
| Contrapositive | ”We prove the contrapositive. Assume that [not Q].” |
| Contradiction | ”Assume, to the contrary, that [negation of claim].” |
| Cases | ”We consider two cases.” |
| Existence (constructive) | “We claim that [explicit object] satisfies the property.” |
| Uniqueness | ”Suppose that both and satisfy [property].” |
| Induction | ”We proceed by induction on . Base case (): …” |
| Epsilon-delta | ”Let be given. Choose .” |
Quick Reference: Post-Proof Closing Lines
| Situation | Closing Line |
|---|---|
| Direct / contrapositive | ”Therefore [restate conclusion]. “ |
| Contradiction | ”This contradicts [specific fact]. Therefore [original claim]. “ |
| Cases | ”Since these cases are exhaustive, [conclusion]. “ |
| Induction | ”By the principle of mathematical induction, [claim] holds for all . “ |
| Epsilon-delta | ”This shows that . ” |