Rolfe Schmidt

I’m learning. Slowly.

Proof I: Language and Logic

with 8 comments

Series Outline

  1. Proof — Series introduction
  2. Proof I: Language and Logic — this post, introduce sentential grammar
  3. Proof II: Bringing Meaning to Language –truth tables and syntax trees

In my introduction to this mini-series I talked about when and why someone whould learn about proof. Now it is time to start talking about what a proof is.

So let’s try to define “proof”. Here’s a pretty good first stab: A proof is an incontrovertible argument. Let’s accept this. We immediately run into a couple of issues.

  1. Now we need to define “incontrovertible” and “argument”
  2. “Argument” seems to have something to do with language. To check whether something “is an argument” we’ll just need to check the grammar, or syntax
  3. “Incontrovertible” has something to do with the meaning of language, or semantics.

So we’ve just scratched the surface here and it’s pretty clear that talking about proof means talking about language, not Math! Many people get confused when trying to prove things in school because they think proof is all about the “idea”. Don’t get me wrong — ideas are what make Math worth studying. But an idea is not a proof. A proof must use language to communicate the idea to another person, and put the idea in a form that we can validate.

Once you get comfortable with this separation of “idea” and “proof”, you may start to notice that some of your thinking is “visual”, and other thinking is “linguistic”. Could there be a better way to prove things using this visual thinking? Perhaps, but if you want to tell me about it you’ll have to tell me with language. So if your visual proof system is really better than our standard linguistic one, that extra power will be something we can’t talk about!

So studying proof is really studying (a certain type of) language. Now when kids learn to talk, do they start by improvising in iambic pentameter? Or do they stumble around with basic vocabulary and grammar? When you study another language, isn’t the first step learning vocabulary, idioms, and grammar? You need those before you can speak with meaning or elegance. (I’m not saying you should teach people by saying inane things — you should expose learners to meaning and elegance from the start. But I digress.)

Learning about proof is very similar to learning human languages. You need to learn the grammar and the semantics before you can translate your ideas. But here we have an important difference — in logic we must be strict about our grammar or we won’t be able to validate our proofs later. A proof with a grammar error is not a proof. So if you want to know what a proof is, you need to know the grammar.

I’m sorry for sounding like an old schoolmarm here. Later we’ll see that reality is much more fun. But this formal style of proof is critical, because it is always an option, our arguments can always be made formal and validated.

So let’s face the grammar.

I will introduce the grammar in two layers. Proof is all about figuring out when a sentence is true or false, so we will only worry about propositions: sentences that are either true or false. Grammar is all about building sentences. The first layer of our grammar — the grammar of sentential logic — tells us how we can build new propositions from old ones.

I’ll need to get a little mathy here and start using symbols to represent propositions. I’ll use boldface capital letters as “sentence symbols”. When you see a sentence symbol, you can replace it with a grammatically correct sentence to get a new grammatically correct sentence. The rest of this post will be about making these ideas concrete.

First let me say a little more about the sentence symbols. In my examples here, I’ll just use the symbols A, B, C, and D. Whenever you see these symbols, I encourage you to substitute an English sentence in its place. I’ll do this for you every once in a while, and here are the translations I’ll use

Symbol

Possible English translation

A

Janos is tall

B

Janos plays basketball

C

You can consume all you want as long as you buy carbon offsets

D

Global warming is not the only environmental problem we’re facing

Now none of these English sentences is very precise. Our second layer of grammar (coming later), will help us write better propositions. But at a common sense level these are sentences that seem like they must be true or false. They are propositions. One of the neat things we’ll see is that we can say a lot about propositions just by looking at the structure of the sentence.

Sow how can we take our propositions symbols A, B, C, and D and build new ones? Here are four useful ways:

symbol

meaning

example

translation

\neg

not

(\neg \textbf{A})

A is not true

\wedge

and

(\bf{A}\wedge\bf{B})

A and B

\vee

or

(\bf{A}\vee\bf{B})

A or B

\rightarrow

if … then

(\bf{A}\rightarrow\bf{B})

if A then B

Thse are called sentential connectives. Yes there are other logical connectives, and you can see notation for them in a table here. But the neat thing is that these are the only ones we need. Sometimes we’ll throw in \leftrightarrow to mean “if and only if”, but we don’t really need it — we could just use two “if” statements instead. All propositions involving any (finite) number of variables can be built with these four connectives. We could get even more parsimonious and get down to just one connective: “not and”. It is neat to know that we only need one logical operation say everything, but that makes for pretty tough reading.

Each one of these connectives gives us a sentence-building operation — a way to make a new sentence from one or two old ones. To do this we had to introduce some new symbols to our language: (, ), \wedge, \vee, \rightarrow. The parentheses are critical. Every sentence must have parentheses around it or we will never understand how to break a big sentence down into its pieces. Here is an example: what does the sentence \bf{A}\wedge\bf{B}\vee\neg\bf{A} mean? Using my example translations, it reads something like “Janos is tall and Janos plays basketball or Janos is not tall”. This is an ambiguous sentence, think about it a bit. But we could write (\bf{A}\wedge\bf{B})\vee\neg\bf{A} to mean, basically if Janos is tall, he must play basketball. This is not ambiguous.

Now we’re in a position to state all of the rules of our grammar. To keep close to standard terminology, we’ll say that an expression is any finite sequence of our symbols. A “grammatically correct” expression is called a well formed formula of wff (pronounced “woof”). An expression is a wff if and only if

  1. it is a sentence symbol A, B, C, D, …
  2. Or it is built from one or more wffs using one of our four sentence-building operations listed above.

That’s it. This is the whole of the grammar. But we can use it to build pretty complicated propositions. Here’s an example that’s really not too bad:

((\bf{A}\vee\bf{B})\wedge(\bf{D}\vee(\bf{A}\rightarrow\bf{B}))\rightarrow\neg\bf{C})

Here are some simpler examples of sentences I can build up from these symbols. Try to translate them into English. Use my translations or make up your own!

(\bf{D} \rightarrow (\neg \bf{C}))

((\bf{A} \wedge \bf{B}) \rightarrow \bf{A})

(\bf{A} \rightarrow (\neg \bf{A}))

When you translate these to English, you’ll notice that sentences can be true regardless of the meaning of the symbols. ((\bf{A} \wedge \bf{B}) \rightarrow \bf{A}) is an example. These are called tautologies. Some are false no matter what: (\bf{A} \rightarrow (\neg \bf{A})). Finally, some statements are “interesting”: (\bf{D} \rightarrow (\neg \bf{C})), or with my translations “if global warming is not the only environmental problem we are facing, it is not true that you can consume all you want as long as you buy carbon offsets”.

This “sentence-building” view of our grammar has a nice side-effect. It allows us to write a computer program that will write out every possible wff. This sort of observation will be crucial down the road when we talk about Gödel’s incompleteness theorem and the relationship between truth and provability.

After introducing my definition of proof, this post has really dwelt on grammar or syntax. Basically, we’ve been starting to talk about how an argument must be written. In the next post in this series (not sure when it will arrive), I plan to talk about the semantics — what makes a sentence incontrovertible. Then we’ll follow this up with that promised “second-layer” of grammar, first-order logic.

If you want to dive deeper into the subject (I’m making no attempt at a rigorous presentation here), there are plenty of books on the subject. I happen to have Enderton’s A Mathematical Introduction to Logic, and I like it but I haven’t read it through. I learned most of this in a course with a great teacher and from bits and pieces I’ve put together since.

[This is still a draft. I will edit it. If you don't like something, tell me and I may change it. If you do like something, watch out it may disappear.]

Written by Rolfe Schmidt

June 10, 2007 at 10:12 pm

Posted in Math Education

8 Responses

Subscribe to comments with RSS.

  1. The most popular intro texts are Copi and Hurley (I use the latter). Hurley uses the multiplication dot for “and” and several people use the tilde (~) for not.

    What constitutes proof is a big deal. I say “proof is that which convinces” and leave it to the context to provide guidance about what it actually takes. Slippery, but then the notion of proof is itself.

    I did one intro course, btw, and then picked up bits and pieces. I studied a bit more on my own in preparation for teaching it as a senior (high school) elective.

    Jonathan

    jd2718

    June 10, 2007 at 10:23 pm

  2. Thanks for letting me know about those texts, I’ll have a look at them. It might be nice to put together a little guide to the literature sometime. You are right about the notation, there is quite a bit of variation out there. I just chose to stick with Enderton’s notation when writing, even though I use different notation when I’m scribbling.

    In practice it is often the case that a proof is “that which convinces”, but if the result is important and the proof is wrong it will likely get caught. And I’d say that even if it convinced people for a while, it is still not a proof. I plan to talk about a couple of famous examples in a later post: Euclid’s first proof in the elements relies on an assumption about circles that doesn’t show up in his axioms, so it is not a proof. Leslie Lamport has some interesting things to say here too. The beauty of the formal proof is that it once you validate it, you can only challenge it by saying either (1) the axioms weren’t good, or (2) the validator was wrong. So if you really take the care to prove something formally, it isn’t too slippery. You just need to know where to look for the footholds!

    That is nice that your school offers a logic elective, I wish that were more common!

    Rolfe Schmidt

    June 11, 2007 at 8:24 am

  3. Working logic in a language laced with pejoratives sounds like a hard chore to eliminate unwarranted assumptions : and I’ve never studied that !

    opit

    June 11, 2007 at 1:50 pm

  4. Hi opit: I didn’t quite follow about the language laced with pejoratives — could you clarify?

    The punchline of this whole study is that you can never get rid of those assumptions. At some point you have pick some axioms that you justify on physical or aesthetic grounds. The only way that logic can “justify” these axioms is by finding “interesting” or “beautiful” or “natural” results and failing to find absurdities.

    Outside of math, our unwarranted assumptions are usually very poorly tested and logic can be handy here. You pin down a few assumptions you’d like to make, use logic to find an absurdity, then refine your assumptions so that your absurd argument fails. This is the big lesson I get out of Plato. I also note that in Plato’s dialogues, Socrates almost never finds assumptions he can accept. He even undermines his own ‘forms’. So tread carefully, and don’t take yourself too seriously.

    Rolfe Schmidt

    June 11, 2007 at 2:05 pm

  5. Perhaps a love of ‘British understatement’ has complicated things. In deed, the point of what I was trying to say was that our language does, in fact, carry a lot of judgmental baggage and implied meaning which distorts any attempt at impartial statement.
    The point that it is a language and not a mathematical methodology was made in an odd way when I tried to get a rise out of a high school English instructor by enquiring ; oh -so-innocently you understand ; that if you went into double negatives and even more, if it was proper to count them to see if the residual meaning was ‘positive’ or ‘negative’.
    I was quite positively informed that any use of the negative took precedence.
    I think somehow the quip “Don’t use no double negatives nohow” had been in my mind and I thought I’d throw out bait.

    opit

    June 13, 2007 at 7:28 pm

  6. [...] it is kind of funny that as I was writing up my last post about the importance of language for mathematical proof, G came up to me and called my bluff a [...]

  7. Now I get it. I had the same double negative argument with at least one of my teachers. Natural languages like English have some big problems when you try to say something unambiguously. Different people understand the same words in wildly different ways. Double negatives are a great example: when you use a double negative two people listening to you can hear the same words but take away opposite meanings.

    So you can’t never say you “proved” nothing in English unless you know how to translate your argument into some unambiguous form.

    Rolfe Schmidt

    June 16, 2007 at 6:47 am

  8. [...] Proof I: Language and Logic –  introduce sentential grammar [...]


Leave a Reply