1The language of mathematics

The introduction of $\texttt{ChatGPT}$ on November 30, 2022 took the world by storm. The use of generative AI or large language models is encouraged throughout this course. It is my hope that you will learn mathematics on a deeper level by communicating with the machine using careful prompt engineering. Good prompts seem to resemble clearly written text.

First we need to introduce some formal notation to be able to talk clearly about mathematics and also in a reasonable way formulate mathematical models of the real world. In modern mathematics the concept of a set is crucial. It is a bit tricky to define this precisely. We will go on and define the basic concepts of what is called naive set theory.

1.1 Black box warnings

Modern mathematics is perhaps not like anything you have encountered so far. It calls for a lot of focus and precision, especially when writing down solutions to problems. It is a bit like programming a computer. There is no room for imprecision and half-baked sentences.

This course amounts to $10$ ECTS or approximately $280$ hours. Suppose that you spend a week studying for the exam, say $40$ hours. Lectures, exercise classes and MatLab amount to $14\cdot (4 + 2 + 3)$ hours $= 126$ hours. This leaves around $114$ hours for your own study and immersion. Put in other terms, you are supposed to work around $8$ hours per week outside classes for this course. With classes, each week calls for $17$ hours of work. There is a very close relationship between the amount of hours you log each week and your result at the exam. To state the obvious: numbers don't lie. If you put in the time, you will almost certainly do well. Try to allocate time for IMO in your weekly schedule and please (ab)use all the help that is provided.

Also, computers are exceptionally fun, but be careful! Nothing really beats a clear thinking human mind. To wit, I asked WolframAlpha to solve a certain optimization problem and it came up with the answer

What is strange about this output?

Compute

$x = -\sqrt{\frac{1}{2} \left(1 - \sqrt{2} + \sqrt{3 - 2\sqrt{2}}\right)}$ on a pocket calculator (or similar) and see what you get using approximate decimal numbers. Explain your (stepwise) computation of $x$ using approximate decimal numbers.

$3 - 2 \sqrt{2} = (\sqrt{2} - 1)^2.$

1.2 Computer algebra

We will use the computer algebra system Sage in exploring and experimenting with mathematics. This means that you will have to write small commands and code snippets. Sage is built on top of the very wide spread language python and you can in fact enter Python codeOne may also enter code in several other languages in the Sage input windows in this text. Below is an example of a basic graphics command in Sage. Push the Compute button to evaluate.

You can install Sage on your own computer following the instructions on https://www.sagemath.org/.

Did you notice that you can edit and enter new commands in the Sage window? Do the following problems using Sage based on the Sage guided tour.

Consider $f(x) = x \sin(1/x)$ . Plot the graph of $f$ from $0$ to $0.1$ . Computing $f(0)$ does not make sense. Do you see a way of assigning a natural value to $f(0)$ using the graph?
Find an approximate solution with four decimals to the equation $\cos(x) = x$ .

This is an example of an equation, that can only be solved numerically. Try first plotting the graph of $f(x) = x - \cos(x)$ from $0$ to $1$ . Then use a suitable function from the Sage guide.
Compute $\pi$ with $100$ decimals.

Compute the sum

$\frac{1}{\sqrt{1} + \sqrt{2}} + \frac{1}{\sqrt{2} + \sqrt{3}} + \frac{1}{\sqrt{3} + \sqrt{4}}$ without using a computer.

PS: If you have a subscription with WolframAlpha PRO, you can unlock the Step-by-step solution to get the answer to this (and the bonus question below). Try to do it yourself! Nothing beats the warm feeling of coming up with a killer idea after struggling for some time.

Bonus question

Generalize your answer/method to computing the sum

$\frac{1}{\sqrt{1} + \sqrt{2}} + \frac{1}{\sqrt{2} + \sqrt{3}} + \frac{1}{\sqrt{3} + \sqrt{4}} + \cdots + \frac{1}{\sqrt{N-1} + \sqrt{N}},$ for $N = 5, 6, 7, \dots$ .

1.3 Objects or elements and the symbols $=$ and $\neq$

Mathematics can be broadly viewed as handling objects precisely according to a specific system of rules. The first element of precision is in distinguishing the objects and deciding when they are the same. This calls for notation. If two objects $x$ and $y$ are the same, we write $x = y$ . If they are different we write $x\neq y$ .

You may laugh here, but identifying objects is really one of the fundamental tasks of mathematics. It is not always that easy. Even though objects appear different they are the same as in, for example

$\frac{105}{189} = \frac{35}{63}\qquad\text{and}\qquad \sin\left(\frac{\pi}{2}\right) = 1.$ The first example above is an identity of fractions (rational numbers). The second is an identity, which calls for knowledge of the sine function and real numbers. Each of these identities calls for some rather advanced mathematics.

Use the Sage window above to reason about equality in the quiz below. In each case describe the objects i.e., are they numbers, symbols, etc.? Also, please check your computations by hand with the old fashioned paper and pencil, especially $(a+b)(a-b)$ .

Click on the right equalities below.

$a + b - 2 b = a - b$

$(a+b)^2 = a^2 + b^2$

$(a + b)(a - b) = a^2 - b^2$

$(a + b)^2 = a^2 + 2 a b + b^2$

$(a+b)^3 = a^3 + 2 a^2 b + 2 a b^2 + b^3$

$\frac{3}{8} = \frac{5}{13}$

$\pi = \frac{22}{7}$

$\cos^2(\pi) + \sin^2(\pi) = 1$

You know that $(a+ b)^2 = a^2 + 2 a b + b^2$ . Use Sage to find a similar identity for $(a + b)^4$ .

Go back and look at (the beginning of) Exercise 1.4.

1.4 Sets

A set is (informally) a collection of distinct objects or elements. A set is an object as described in section 1.3 and it makes sense to ask when two sets are equal.

Two sets $A$ and $B$ are equal i.e., $A = B$ if they contain the same elements.

An example of a set could be the set $\{1,2,3\}$ of natural numbers between $0$ and $4$ . Notice that we use the symbol " $\{$ " to start the listing of elements in a set and the symbol " $\}$ " to denote the end of the listing. Notice also that (by our definition of equality between sets), the order of the elements in the listing does not matter i.e.,

$\{1, 2, 3\} = \{2, 3, 1\}.$ We are also not allowing duplicates like for example in the listing $\{1, 2, 2, 3, 3, 3\}$ (such a thing is called a multiset).

An example of a set not involving numbers could be the set of letters

$S=\{A, n, e, x, a, m, p, l, c, o, u, d, b, t, h, s, r, i\}$ used in this sentence. The number of elements in a set $S$ is called the cardinality of the set. We will denote it by $|S|$ .

To convince someone beyond a doubt (we will talk about this formally later in this chapter) that two sets $A$ and $B$ are equal, one needs to argue that if $x$ is an element of $A$ , then $x$ is an element of $B$ and the other way round, if $y$ is an element of $B$ , then $y$ is an element of $A$ . If this is true, then $A$ and $B$ must contain the same elements.

Give a precise reason as to why the two sets $\{1, 2, 3\}$ and $\{1, 2, 4\}$ are not equal. Is it possible for a set with $5$ elements to be equal to a set with $7$ elements?

Sets may be explored using (only) python. This is illustrated in the snippet below.

Come up with three lines of Sage code that verifies $\{1, 2, 3\} \neq \{1, 2, 4\}$ . Try it out.

1.4.1 The empty set

There is a unique set containing no or zero elements. This set is called the empty set and is denoted $\emptyset$ i.e.,

$\emptyset = \{\}\qquad\text{and}\qquad |\emptyset| = 0.$ Not surprisingly the empty set is reflected as the empty list in Sage. The empty list has zero elements.

For some reason (perhaps a good one) python does not accept $\{\}$ as input for the empty set. Why is this? Evaluate the python snippet below and explain.

1.4.2 Sets of numbers

A set could also be the natural numbers (yes, I want $0$ as a natural number: $0$ is very natural, although it came late historically)

$\mathbb{N} = \{0, 1, 2, 3, \dots\},$ or the set of integers

$\mathbb{Z} = \{\dots, -3, -2, -1, 0, 1, 2, 3, \dots\}.$ These sets are called infinite, since they contain infinitely many elements. Even though the natural numbers seem as easy as one, two three, they contain wonderful and deep mathematical mysteries, such as the nature and distribution of the prime numbers $2, 3, 5, 7, 11, 13, 17, \dots$ . Also please respect, that the negative numbers like $-3, -1\in \mathbb{Z}$ have caused confusion for centuries.

We also have the set $\mathbb{Q}$ of rational numbers (fractions) and the set $\mathbb{R}$ of real numbers. The real numbers contains all the possible numbers that we encounter in this course.

We will not define the arithmetic operations (like addition and multiplication) on $\mathbb{Z}, \mathbb{Q}$ and $\mathbb{R}$ formally. I will assume that you know how to add and multiply fractions, and that you do not make mistakes like

$\color{red} \frac{1}{2} + \frac{2}{3} = \frac{1+2}{2+3}=\frac{3}{5}.$ Similarly, I will assume that you know that a rational number stays the same, when the numerator and denominator is multiplied by the same non-zero integer. For example,

$\frac{1}{2} = \frac{3}{6}\qquad\text{and}\qquad \frac{2}{3} = \frac{4}{6}.$ In fact,

$\frac{1}{2} + \frac{2}{3} = \frac{3}{6} + \frac{4}{6} = \frac{3 + 4}{6} = \frac{7}{6}.$ The computation above says that it is straightforward to add pizza slices of the same size (one sixth), but that you need to think a bit when adding one half pizza slice and two pizza slices of size one third.

Click on the right equalities below. Do not use Sage (or any computer)!

$\frac{1}{5} + \frac{1}{7} = \frac{1}{35}$

$\frac{3}{7} + \frac{4}{7} = 1$

$\frac{2}{3} + \frac{3}{2} - 2 = \frac{1}{6}$

$\frac{1}{3} + 2 = \frac{8}{3}.$

1.4.3 Notation and rules for arithmetic operations

Please do not use the symbol for multiplication coming from your favorite computer algebra system. Nothing is worse than looking at notation like

$a*b + c*(b+d)\qquad \text{and}\qquad 12*14 \tag{1.1}$ in a written assignment. In the language of mathematics (1.1) is written

$a b + c (b+d)\qquad \text{and}\qquad 12\cdot 14. \tag{1.2}$ When using variables multiplication is simply an elegant small space and $\cdot$ is used with numbers. Also recall the important distributive law for handling expressions formally. It says that

$a (b + c) = a b + a c. \tag{1.3}$ Notice that you can read the distributive law from right to left i.e., you may write $a (b + c)$ instead of $a b + a c$ .

Verify that (1.3) is true for some specific non-zero numbers. Also convince yourself that WolframAlpha actually accepts space (between numbers and variables) as multiplication.

Suppose that $x, y, z\in \mathbb{Q}$ and $w = x y + x z$ . It seems that computing $w$ involves two multiplications and one addition. Multiplications are expensive operations on a computer. Is there a way of computing $w$ with only one multiplication and one addition?

1.4.4 The symbols $\in$ and $\notin$

The symbol $\in$ is ubiquitous in set theory (and mathematics). It means belongs to or is an element of as in $x\in A$ , where $x$ is an element and $A$ is a set. The symbol $\notin$ means is not an element of as in $x\notin A$ meaning $x$ is not an element of $A$ .

$\in$

, but

$\not\in$

. This exercise actually has

possible correct solutions if $\{1, 2, 3\}$ is in the second empty box and $\{4, 5, 6\}$ in the fourth empty box.

$\{1, 2, 3\}$

$\{4, 5, 6\}$

$0$

$1$

$3$

$6$

$7$

Belongs to ( $\in$ ) is straightforward in python:

1.4.5 Subsets and the symbols $\subseteq$ and $\not\subseteq$

If $A$ and $B$ are sets, then $A\subseteq B$ At times, the symbol $\subset$ is used instead of $\subseteq$ . In our context these two symbols mean the same. However, the notation $A\subsetneq B$ means that $A\subseteq B$ and $A\neq B$ . For example, $\{1, 2, 3\} \subseteq \{1, 2, 3\}$ and $\{1, 2, 3\} \subset \{1, 2, 3\}$ . means that every element of $A$ is also an element of $B$ . In this case we say that $A$ is a subset of $B$ . We also use the notation $A\subsetneq B$ to indicate that $A\subseteq B$ and $A\neq B$ . In this case we say that $A$ is a strict subset of $B$ .

Mentimeter

Quiz on number of subsets

We have for example that

$\mathbb{N} \subseteq \mathbb{Z}.$ What does $A\not\subseteq B$ mean? Here we have to be a little careful. We want this notation to mean that $A$ is not a subset of $B$ . In order for $A\subseteq B$ to be false, there must exist $x\in A$ , such that $x\notin B$ . This is the meaning of $A\not\subseteq B$ . For example,

$\mathbb{Z}\not\subseteq \mathbb{N},$ since $-1\in \mathbb{Z}$ and $-1\notin \mathbb{N}$ .

The set

is not a subset of $A=$

, simply because

does not belong to $A$ . This exercise actually has

possible correct solutions.

$\{1, 2, 3\}$

$\{-1, 1, 2, 3, 4\}$

$\{-1, 0, 1, 2, 4\}$

$3$

$-1$

$5$

$6$

$0$

Below Sage (not python) will list all subsets of the set $\{1, 2, 3\}$ . Before pressing the Compute button, try to write them down on your own.

List all the subsets of a set with five elements. In general, how many subsets does a set with $n$ elements have?

The empty set has

elements. A set with

elements has

subsets. In general a set with $n$ elements has

subsets.

$1$

$0$

$5$

$25$

$32$

$n^2$

$2^n$

It turns out that the empty set $\emptyset$ is a subset of any set.

Does this make sense? We will explain this later talking about the logical relation $\implies$ .

1.4.6 Intersections, unions and the symbols $\cap,\,\, \cup$ and $\setminus$

Suppose that we have two sets $A$ and $B$ . Then the intersection $A\cap B$ is the set consisting of the elements in both $A$ and $B$ . This is illustrated in the socalled Venn diagram below.

The union $A\cup B$ is the set consisting of the elements in $A$ or $B$ . To be more precise, an element is in $A\cup B$ if it is in $A$ or in $B$ (or in both of them):

Lastly, the difference $A\setminus B$ (between $A$ and $B$ ) consists of the elements in $A$ not contained in $B$ :

You should experiment using the python window below to get a feeling for these three operations.

Let $A = \{1, 2, 3\}$ , $B = \{3, 4, 5\}$ and $C = \{0, 1, 5\}$ . Verify by hand (no computer) that

$A\cup B = \{1, 2, 3, 4, 5\}$ .
$A\cap B = \{3\}$ .
$A\cap (B\cap C) = \emptyset$ .
$B\setminus A = \{4, 5\}$ .
$A \cap (B\cup C) = (A\cap B) \cup (A \cap C).$

Given two sets $A$ and $B$ , is it true that $A \cap B = B \cap A$ and $A\cup B = B\cup A$ ?

What about $A\setminus B = B\setminus A$ ?

Suppose that $A$ and $B$ are two finite sets. Is it true that

$|A\setminus B| = |A| - |B|?$ What about

$|A\cup B| = |A| + |B|?$ Seriously, both formulas are wrong. Can you come up with the correct version of the formula for $|A \cup B|$ ?

Use your correct formula to find a formula for

$|A\cup B \cup C|$ viewing $A\cup B$ as the first set and $C$ as the second set. Here you need the formula

$(A\cup B)\cap C = (A\cap C) \cup (B\cap C).$ Why is this formula true? Finally, explain why

$C \setminus (A\cap B) = (C\setminus A) \cup (C\setminus B).$ Hint

If you are attacking the last part of this exercise using Exercise 1.43, you may find it useful to notice that two sets $S_1, S_2$ are equal i.e, $S_1 = S_2$ if and only if

$x\in S_1 \iff x\in S_2.$ Also,

$\begin{aligned} x &\in S_1 \cup S_2 \iff x\in S_1 \lor x\in S_2\\ x &\in S_1 \cap S_2 \iff x\in S_1 \land x\in S_2\\ x &\in S_1 \setminus S_2 \iff x\in S_1 \land x\not\in S_2\\ x &\not\in S_1 \iff \neg (x\in S_1). \end{aligned}$

There is one more operation called the symmetric difference between two sets $A$ and $B$ . It is denoted $A\, \Delta\, B$ . Experiment in the python window below to find out exactly what it does. Is it true that $A\, \Delta\, B = B\, \Delta\, A$ ?

The following is an excerpt from the infamous Beredskabsprøve Datalogi.

Let $X$ and $Y$ denote sets. Which of the following are true?

$X \cup X = X$

$X\cap X = X$

$X\setminus X = X$

$X\subseteq X\cap X$

$\emptyset \subseteq X$

For some sets $X$ and $Y$ we can have

$X\cap Y = X\cup Y.$

Mentimeter

Quiz on set operations

1.4.7 Pairs, triples and tuples

Given two sets $A$ and $B$ we can form the new set $A\times B$ , which is the set of pairs $(a, b)$ , where $a\in A$ and $b\in B$ . For example,

$\{1, 2\}\times \{1, 2, 3\} = \{(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3)\}.$ The set $A\times B$ is also called the Cartesian product of $A$ and $B$ .

Mentimeter

Quiz on $\emptyset\times \{1, 2, 3\}$

Consider two pairs $(a, b)$ and $(c, d)$ . What is a natural way of defining equality between these pairs i.e., $(a, b) = (c, d)$ ?

The Cartesian product can be computed in python as shown below.

There is no need to restrict ourselves to pairs. We might as well consider triples $A\times B\times C$ i.e., the set of all $(a, b, c)$ , where $A$ , $B$ and $C$ are sets, or for that matter general tuples

$(a_1, a_2, \dots, a_n)\in A_1\times A_2\times \cdots \times A_n$ of any length $n\in \mathbb{N}$ , where $a_1\in A_1, a_2\in A_2, \dots, a_n\in A_n$ . Based on the above example with tuples we have,

$\begin{aligned} &\{0\}\times\{1, 2\}\times \{1, 2, 3\} = \\ &\{(0, 1, 1), (0, 1, 2), (0, 1, 3), (0, 2, 1), (0, 2, 2), (0, 2, 3)\}. \end{aligned}$

You may check this using the python snippet below.

For a given set $A$ and $n\in \mathbb{N}$ we define the $n$ -fold cartesian product of $A$ as

$A^n = \underbrace{A\times A\times \cdots \times A}_{n\text{ times}}.$

Have you seen $\mathbb{R}^2$ before? Perhaps plotting points or graphs? In the same way, is there a geometric way of thinking of $\mathbb{R}^3$ ? Does $\mathbb{R}^4$ exist in the real world?

Let $A$ and $B$ be two sets. Is $A\times B = B \times A$ ?

Let $X$ be any set. What is $\emptyset \times X$ ?

Let $A, B, C$ and $D$ be four sets. Is

$(A\times B)\cap (C\times D) = (A\cap C)\times (B\cap D)?$

$(A\times B)\setminus (C\times D) = (A\setminus C)\times (B\setminus D)?$

See Exercise 1.25.

Use python to solve Exercise 1.24 by playing with (and extending) the code below.

1.5 Ordering numbers

Let us be a little rigorous and introduce the (usual) ordering on our numbers with addition and multiplication using almost full blown mathematical formalities. First the formal definition for two integers $x, y\in \mathbb{Z}$ :

$x \leq y\qquad \text{ means that }\qquad y - x\in \mathbb{N} \tag{1.4}$

Notice that $x = y$ implies that $x\leq y$ (and $y\leq x$ ). Along this line we also define $x < y$ if $x \leq y$ and $x\neq y$ .

Assume that $x, y, z\in \mathbb{Z}$ and that $x \leq y$ . Then drag and drop the elements from the left to the right below to explain that $x + z \leq y + z$ .

By assumption $x\leq y$ .

This means that $z - x + y\in \mathbb{N}$

This means that $y - x\in \mathbb{N}$

To show that $x + z \leq y + z$ , we need to show that $(y + z) - (x + z) \in \mathbb{N}$ .

But $(y + z) - (x + z) = y + z - x + z$ . Therefore,

But $(y + z) - (x + z) = y + z - x - z = y - x$ . Therefore,

$(y + z) - (x + z)\in \mathbb{N}$ , since

$y - x \in \mathbb{N}$

Suppose thatAs an example, this could be assuming $1 \leq 2$ and $2 \leq 5$ and then arguing that $1\leq 5$ .

$x \leq y\qquad\text{and}\qquad y\leq z$ for three integers $x, y, z\in \mathbb{Z}$ . Argue from the definition in (1.4) that $x\leq z$ by using the definitions of $x\leq y$ and $y\leq z$ to conclude that $x\leq z$ .

The definition of $x\leq z$ is $z-x\in \mathbb{N}$ . The definition of $x\leq y$ is $y - x\in \mathbb{N}$ . The definition of $y\leq z$ is $z-y\in \mathbb{N}$ . How do you get from the assumptions

$y - x\in \mathbb{N}\qquad\text{and}\qquad z - y\in \mathbb{N}$ to the conclusion

$z - x\in \mathbb{N}?$

If $a\in \mathbb{N}$ and $b\in \mathbb{N}$ , then $a + b\in \mathbb{N}$ .

Suppose that $x\leq y$ and $a\in \mathbb{N}$ , where $x, y\in \mathbb{Z}$ . Conclude that

$a x \leq a y.$ What if we only assume that $a\in \mathbb{Z}$ ? You are welcome to experiment with some concrete numbers like $x = 1, y = 2$ and $a = -3$ . In the end you should be able to come up with an argument using the variables $x, y, a$ with the given assumptions.

$a x \leq a y$ means that $a y - a x \in \mathbb{N}$ , but

$a y - a x = a ( y - x)$ and $x \leq y$ means that $y - x\in \mathbb{N}$ .

If $a\in \mathbb{N}$ and $b\in \mathbb{N}$ , then $a b\in \mathbb{N}$ .

You can see that this definition agrees with our preconception that

$\cdots < -3 < -2 < -1 < 0 < 1 < 2 < \cdots \tag{1.5}$

To be precise, writing $\cdots < -3 < -2 < -1 < 0 < 1 < 2 < \cdots$ is nonsense, since $\leq$ is only defined for two integers in (1.4).

How is one supposed to interpret $0 < 1 < 2$ for example? Go ahead and formulate (1.5) correctly comparing only two integers at a time. How does Python/Sage interpret $-3 < -2 < -1< 0 < 1 < 2$ ? Find out using the Sage snippet below.

What about $1 < 5 > 3 < 4$ ? What about $0 < 1 > 2$ ?

Notice that the set of integers has huge holes. Given two integers $a, b\in \mathbb{Z}$ , such that $a < b$ , we cannot always find an integer $c\in \mathbb{Z}$ in between $a$ and $b$ :

$a < c < b.$

The set of rational numbers has the property that they do not have holes. We can always find an in between number such as $c$ above. But we need a precise way of comparing rational numbers. A way to explain precisely why for example

$\frac{2}{3}\,\, \leq \,\, \frac{5}{7}.$ Of course, you can enter the two numbers on a computer and see that $\frac{2}{3}$ is approximately $0.67$ and $\frac{5}{7}$ is approximately $0.71$ , but we aim for the mathematical precise definition.

A rational number $\frac{a}{b}$ is represented by a numerator $a\in \mathbb{Z}$ and a denominator $b\in \mathbb{Z}$ with $b > 0$ . We already know the criterion for two rational numbers $\frac{a}{b}$ and $\frac{c}{d}$ to be equalTechnically speaking we are defining a socalled equivalence relation identifying the infinitely many ways of writing a rational number into one.:

$\frac{a}{b}\, =\, \frac{c}{d}\qquad \text{ means that }\qquad a d = b c\qquad (\text{in }\mathbb{Z}).$

We wish to compare the two rational numbers $\frac{a}{b}$ and $\frac{c}{d}$ deciding precisely how they are ordered:

$\frac{a}{b}\, \leq\, \frac{c}{d}\qquad \text{ means that }\qquad a d \leq b c\qquad (\text{in }\mathbb{Z}). \tag{1.6}$

How does one come up with the definition in (1.6)? Why not $a^3 b^2 + b^7 \leq a b^8 - a$ instead of $a d \leq b c$ ? The reason is that we want the order on $\mathbb{Q}$ to be related to the one we already defined on the subset $\mathbb{Z}\subseteq \mathbb{Q}$ . It should respect multiplication by positive numbers just as in Exercise 1.28. If we want this to hold, we are forced to the definition in (1.6), since

$(b d)\,\frac{a}{b} = a d\qquad\text{and}\qquad (b d)\, \frac{c}{d} = b c.$

Mentimeter

Quiz on ordering rationals

As for the integers, we also define $x < y$ if $x \leq y$ and $x\neq y$ for two rational numbers $x$ and $y$ .

Using this definition, you can check that $\frac{2}{3} < \frac{5}{7}$ , since

$2\cdot 7 < 3 \cdot 5.$

An easy, but surprising, way of finding a rational number strictly between these two is adding their numerators and denominators:

$\frac{2}{3} < \frac{2 + 5}{3 + 7} < \frac{5}{7}.$

We wil try to explain the first inequality in mathematical general terms going through a rather formal proof consisting of five steps. These steps are given in the quiz below. Your task is to drag from the left and drop them to the right in an order, so that the proof makes sense.

After that you are supposed, on your own, to write down a precise proof of the second inequality.

Order the arguments below so that they constitute a coherent explanation of the statement that if

$\frac{a}{b} < \frac{c}{d},$ then

$\frac{a}{b} < \frac{a + c}{b + d}$

By definition this means that $a d < b c$ .

We are assuming that $\frac{a}{b} < \frac{c}{d}$ .

For integers $x, y, z$ we know that the rule $x ( y + z) = x y + x z$ holds. Therefore

we need to show that $a b + c d < b a + b c$ .

Since $a b = b a$ and $a b + a d < a b + b c$ is a consequence of $a d < b c$ , we are done if we know this is true.

However, this is a consequence of our assumption $\frac{a}{b} < \frac{c}{d}$ .

To show that $\frac{a}{b} < \frac{a+c}{b + d},$ we need to argue that $a (b + d) < b (a + c)$ .

we need to show that $a b + a d < b a + b c$ .

Similarly to the quiz above, assume that

$\frac{a}{b} < \frac{c}{d}.$ Write down a precise argument showing that

$\frac{a + c}{b + d} < \frac{c}{d}.$ You may seek inspiration in Video 1.51 for how to mix math and words (even though it is further ahead).

On Twitter, Raman Gupta posted the note below

For a natural number $m\in \mathbb{N}$ ,

$m! = m (m-1) (m-2)\cdot \dots \cdot 2\cdot 1.$ For example, $3! = 6$ and $5! = 120$ . What is the answer for the question in the note?

Experiment a bit with Sage: define a function $f(n)$ , which computes

$2^{n!} - 2^n!$

Then look at

$f(1), f(2), f(3), f(4), f(5), \dots$

The exercise below shows that our trick for finding rational numbers in between two given rational numbers can be made into a machine for generating all positive rational numbers!

Can you spot the system in the fractions in the diagram below?

Once you see the system, extend the diagram with the next level downwards. Is every positive fraction present in this diagram if one keeps adding levels?

Suppose that

$\frac{p}{q} < \frac{r}{s}$ and $q r - s p = 1$ . Then for

$\frac{p}{q} < \frac{p+r}{q+s} < \frac{r}{s},$ we have $q (p+r) - (q+s) p = 1$ and $(q+s) r - (p + r) s = 1$ . If $\frac{a}{b}$ is a positive fraction, such that

$\frac{p}{q} < \frac{a}{b} < \frac{r}{s},$ show that

$a + b = (r+s)(q a - b p)+(p+q)(b r - a s)\geq p+q+r+s.$

1.5.1 Subsets of numbers and first elements

In a set equipped with an order, it is intuitively clear what a first element should be. For example, the natural numbers $\mathbb{N}$ has $0$ as its first element. On the other hand the set $\mathbb{Z}$ of integers does not have a first element (it is "infinite to the left").

In fact every non-empty subset $S\subseteq \mathbb{N}$ has a first element. This follows from a rather special property of $\mathbb{N}$ : there can be only finitely many natural numbers smaller than a given one. This is not true for $\mathbb{Z}$ . Here there are infinitely many integers smaller than any integer.

If $A$ is a set with an order $\leq$ and $B\subseteq A$ is a subset, then formally $x\in B$ is a first element of $B$ if $x \leq y$ for every element $y\in B$ .

Mentimeter

Quiz on first element in a subset

Give an example of a subset of $\mathbb{Z}$ that does not have a first element. Does the subset of even numbers have a first element? Does the empty subset have a first element?

Can a non-empty subset of a set with an order have two different first elements? I need to be precise here. I am assuming that the order $\leq$ (naturally) satisfies: if $x$ and $y$ are two elements from the subset and $x\leq y$ and $y \leq x$ both hold, then $x = y$ .

Do the orders we defined on $\mathbb{Z}$ and $\mathbb{Q}$ satisfy the above property?

If $x \leq y$ , where $x, y\in \mathbb{Z}$ , then $y - x\in \mathbb{N}$ . If $y \leq x$ , then $x - y\in \mathbb{N}$ . But

$y - x = - (x - y).$ In other words, $z = y - x$ is an integer that satisfies $z\in \mathbb{N}$ and $-z\in \mathbb{N}$ . What integer is $z$ ?

Consider the subset $S$ of $\mathbb{Q}$ consisting of positive fractions i.e., rational numbers $>0$ . Does this subset have a first element?

1.6 Propositional logic

We have seen quite a few mathematical statements that ended up being true or false. Such statements are called propositions. Here are two examples of propositions usings sets (in python):

What exactly are the two propositions in the above python window written up in mathematical terminology? Notice that the symbol == is a programming construct. It is not used in mathematics notation.

Propositions can be combined into new (compound) propositions. Take for example the propositions

$\begin{aligned} &p: \text{it rains}\\ &q: \text{it is cloudy}. \end{aligned}$

Then ( $p$ and $q$ ) is a perfectly good new proposition reading it rains and it is cloudy. The same goes for (if $p$ then $q$ ), which reads if it rains then it is cloudy. The proposition (if $q$ then $p$ ) reads if it is cloudy then it rains. This proposition is (clearly) false.

We need some notation to describe these compound propositions:

$\begin{array}{ll} p \land q\qquad\qquad & \qquad\qquad p \text{ and } q\\ \\ p \lor q\qquad\qquad & \qquad\qquad p \text{ or } q\\ \\ p\implies q\qquad\qquad & \qquad\qquad \text{if } p \text{ then } q\\ \\ \neg p\qquad\qquad & \qquad\qquad \text{not } p \end{array}$

The compound propositions are either true( $t$ ) or false ( $f$ ) depending on $p$ and $q$ . The dependencies are displayed in the truth tables below.

$\def\arraystretch{1.2} \begin{array}{c|c|c} p & q & p\land q \\ \hline t & t & t \\ t & f & f\\ f & t & f\\ f & f & f \end{array}\qquad \begin{array}{c|c|c} p & q & p\lor q \\ \hline t & t & t \\ t & f & t\\ f & t & t\\ f & f & f \end{array} \qquad \begin{array}{c|c|c} p & q & p\implies q \\ \hline t & t & t \\ t & f & f\\ f & t & t\\ f & f & t \end{array}\qquad \begin{array}{c|c} p & \neg p \\ \hline t & f\\ f & t \end{array}$

The tables for the compound propositions $p\land q, p\lor q$ and also $\neg p$ are not too hard to grasp. The table for $p\implies q$ raises a few more questions. Why is $f\implies t$ true? I will not go into this, but just point out that there are many explanations available online and, perhaps more importantly, refer you to Exercise 1.40 and the remark below.

Here is a statement about real numbers

$x > 0 \qquad \implies \qquad x^2 > 0. \tag{1.7}$ This statement reads: no matter which real number $x$ you pick, if $x > 0$ , then $x^2 > 0$ . We definitely want this to be true. Being true means that (1.7) must hold for all numbers $x$ , also $x = -7$ , which reads

$- 7 > 0 \qquad \implies \qquad (-7)^2 = 49 > 0.$ The above statement is an example of a false implies true statement, which we want to be true: even though $-7$ is negative, its square is positive.

In general terms, in proving the statement that $p(x) \implies q(x)$ holds for every $x$ in some set $S$ , we are really only interested in $x\in S$ for which $p(x)$ is true, since $p(x)$ is our assumption. We still need $p(x)\implies q(x)$ to be true for $x\in S$ for which $p(x)$ is false. This is assured by the truth table for $\implies$ , since $f \implies t$ and $f\implies f$ are both true.

We may also check what Sage outputs for the truth table of $p\implies q$ (or for that matter any formulaNotice that Sage uses -> for $\implies$ , \& for $\land$ , | for $\lor$ and ~ for $\neg$ .. combining the logical operations above):

Mentimeter

Courtroom quiz

Suppose that we are presented with four cards

$\boxed{3}\qquad \fcolorbox{black}{red}{\phantom{3}} \qquad\boxed{4}\qquad \fcolorbox{black}{blue}{\phantom{4}} \tag{1.8}$ with a (natural) number on the front and the color blue or red on the back. In (1.8), the first and third cards are shown with their fronts facing up and the second and fourth cards are shown with their backs facing up.

A claim (proposition) is made that if a card has an even number on the front, then it must have the color blue on the back.

Your task is to verify this for the cards above. Of course you can do this by turning all four cards, but is there a way of checking this by turning less than four cards?

What if we add the claim, that if a card has the color blue on the back, then it must have an even number on the front?

Find two propositions $p$ and $q$ so that the claim reads $p\implies q$ .

Explain why Python/Sage thinks that the valueThanks to Gerth Brodal for pointing this out to me of

$1 < 0 < 1/0$ is False! Notice that you are dividing one by zero in the last "integer" above.

In the exercise below you will see for example that $p\implies q$ is the same as $\neg q \implies \neg p$ .

Two propositions are considered the same ( $=$ ) if they have the same truth table. Verify, by filling out and comparing truth tables, that

$\neg (p \land q) = (\neg p) \lor (\neg q)$
$\neg (p \lor q) = (\neg p) \land (\neg q)$
$p \implies q = (\neg q) \implies (\neg p)$
$p \implies q = (\neg p)\lor q$

Can you use the setup up in Exercise 1.42 to verify that

$p\land (q \lor r) = (p\land q) \lor (p \land r)$ for three propositions $p, q$ and $r$ ? Verify an analogous identity for $p\lor (q \land r).$

The notation $p \iff q$ is used frequently. It means that both $p\implies q$ and $q\implies p$ are true i.e.,

$(p\implies q) \land (q\implies p).$

Mentimeter

Truth table

1.6.1 The symbols $\exists, \forall$ and propositions with variables (predicates)

In mathematics one usually reasons with propositions with variables, such as $x > 0$ . A proposition with variables is usually called a predicate.

In order to evaluate a predicate with a variable $x$ , one must first specify to which set $S$ the variable belongs. For example, the predicate $p(x)$ given by

$x^2 > 0,$ does not make sense if $x$ is taken from the set of letters in the English alphabet (not unless you give an interpretation of $x^2$ , $>$ and $0$ in this set). However, if $x\in \mathbb{Z}$ , then $p(x)$ certainly makes sense. Whether $p(x)$ is true depends on $x$ .

For example, $p(0)$ is false, whereas $p(-1)$ is true. This leads us to the existential and universal quantifiers $\exists$ and $\forall$ . The former reads there exists and the latter for every.

For example, the predicate

$\exists x\in \mathbb{Z}: \neg p(x)$ is true and so is

$\forall x\in \mathbb{Z}\setminus\{0\}: p(x).$ The symbol ":" above means "such that"Therefore $\exists x\in \mathbb{Z}: \neg p(x)$ reads "there exists $x$ in $\mathbb{Z}$ , such that $\neg p(x)$ is true".. Also, $\forall x\in \mathbb{Z}: p(x)$ is false, because $\exists x\in \mathbb{Z}: \neg p(x)$ is true. Linking ":" with $\implies$ one may say that

$\forall x\in S: q(x)\quad\text{is the same as}\quad x\in S\implies q(x)\quad\text{ is true for every }x\in S$

for a general predicate $q(x)$ . This statement can only be false if there exists $x\in S$ , such that $q(x)$ is false (see the truth table for $\implies$ above). In particular, $\forall x\in \emptyset: x = 17$ is a true statement!

In general,

$\exists x\in S: q(x)\quad\text{ is the same as }\quad \neg \left(\forall x\in S: \neg q(x)\right).$

So we do not really need the quantifier $\exists$ , when we have $\neg$ and $\forall$ , but $\exists$ is convenient and used all the time. Notice that $\exists x\in \emptyset: x = 17$ is a false statement!

The quantifiers are important to learn and apply when expressing mathematical ideas. So is the use of predicates in writing up subsets: if $S$ is a set, $x$ a variable taking values in $S$ and $p(x)$ a predicate (making sense in $S$ ), then

$\{x\in S \mid q(x)\}\quad\text{is the subset of the elements in } x\in S, \text{such that } q(x) \text{ is true.}$

For example, if $q(x) = x^2 > 0$ , then

$\{x\in \mathbb{Z} \mid q(x)\} = \mathbb{Z}\setminus \{0\}.$

Suppose that $p_1(x), \dots, p_n(x)$ are predicates with a variable $x$ taking values in $S$ , then we often use the notation (using $,$ instead of $\land$ )

$\{x\in S \mid p_1(x), \dots, p_n(x)\}\quad\text{for}\quad \{x\in S \mid p_1(x)\land \dots\land p_n(x)\}.$

It also makes sense to have predicates with more than one variable. With variables $x, y$ in $\mathbb{N}$ , such a proposition $q(x, y)$ could be

$x\text{ and }y\text{ are prime numbers and } x + y \text{ is a prime number}.$

Consider the predicate $q(x, y)$ from Remark 1.44. Write down the elements in

$\{(x, y)\in \mathbb{N}\times \mathbb{N} \mid q(x, y) \land x + y \leq 10\}.$ Is

$\{(x, y) \in \mathbb{N}\times \mathbb{N}\mid q(x, y)\}$ an infinite set?

Explore the fascinating world of prime numbers and learn about twin primes.

List the elements in the following subsets.

$\{x\in \mathbb{Z} \mid x^2 < 10\}.$
$\{(x, y)\in \mathbb{Z}^2 \mid x^2 + y^2 < 5\}.$

You have previously encountered systems of linear equations like

$\begin{aligned} x + y &= 3\\ 3x - y &= 5. \end{aligned}\tag{1.9}$ The solutions to (1.9) can be identified with a subset of $\mathbb{R}^2$ . Define this subset precisely i.e., write the subset as

$\{(x, y)\in \mathbb{R}^2 \mid p(x, y)\},$ where $p(x, y)$ is a predicate in the variables $x, y\in \mathbb{R}$ .

Suppose that $X = \mathbb{R}$ and

$Y = \{x\in \mathbb{R} \mid x > 0,\, x < 2\}.$ Then write down precisely what $X\setminus Y$ is i.e., find suitable predicates $q_1, q_2$ in the variable $x$ , such that $q(x) = q_1(x) \lor q_2(x)$ and

$X\setminus Y = \{x\in \mathbb{R} \mid q(x)\}.$

Consider the subset $S$ of $\mathbb{R}^2$ pictured in the drawing below

Express $S$ as

$S = \{(x, y)\in \mathbb{R}^2 \mid p_1(x, y), p_2(x, y), p_3(x, y)\},$ where $p_1, p_2, p_3$ are predicates in the variables $x, y$ .

Hint

A predicate in the variables $x, y$ could be something like

$x -y \geq 17.$

Express $\mathbb{R}^2\setminus S$ as

$\{(x, y)\in \mathbb{R}^2 \mid p(x, y)\},$ where

$p(x, y) = q_1(x, y) \lor q_2(x, y) \lor q_3(x, y)$ and $q_1, q_2$ and $q_2$ are suitable predicates in the variables $x, y$ .

The following is yet another excerpt from the infamous Beredskabsprøve Datalogi.

Which of the following are true?

$\forall x\in \mathbb{N}: x > 2$

$\exists x\in \mathbb{N}: x > 2$

$\forall x\in \emptyset: x = 7$

$\exists x\in \emptyset: x = 7$

1.6.2 The use of implication ( $\implies$ ) and bi-implication ( $\iff$ )

Usually $\implies$ and $\iff$ are applied to link propositions in a logical argument. An example is

$x \leq y \iff x + z \leq y + z$ for integers $x, y, z$ . To be completely precise, I should here write

$\forall x, y, z\in \mathbb{Z}: x \leq y \iff x + z \leq y + z, \tag{1.10}$ but one often writes $\forall$ with words as for example for integers $x, y, z$ .

To be one hundred percent precise (1.10) means formally, that

$\forall x\in \mathbb{Z}: ( \forall y\in \mathbb{Z}: (\forall z\in \mathbb{Z}: x \leq y\iff x + z \leq y + z))).$

Here $x \leq y\implies x + z \leq y + z$ is true and similarly $x + z \leq y + z \implies x \leq y$ (by using the definition (see (1.4)) of $\leq$ in $\mathbb{Z}$ ). So the use of $\iff$ is valid. A somewhat simpler example is

$x + 1 = 2 \iff x = 1\qquad\text{or}\qquad \forall x\in \mathbb{Z}: x + 1 = 2 \iff x = 1.$

However, for $x \geq 0 \implies x^2 \geq 0$ we cannot link the two propositions by $\iff$ , simply because $x^2 \geq 0 \implies x \geq 0$ is false (for $x= -1$ ).

1.7 What is a mathematical proof?

Most professional mathematicians rarely think about the precise definition of a proof. During many years of training they have assimilated knowledge by experience. Therefore many proofs seem born out of witchcraft containing several magical devices.

However, many proofs appearing in respected mathematical journals, submitted by respected mathematicians, have turned out to contain errors. Recent developments in automated proof systems like Coq and LEAN show promise in checking proofs like for example the famous four color theorem.

Informally a proof of a proposition $q$ , consists in arguing that an implication $p\implies q$ is true by first assuming $p$ . Usually this is done not only through one implication $p\implies q$ , but through a series of intermediate implications

$p\implies q_1 \implies q_2 \implies q_3 \implies \cdots \implies q_N,$ where the last proposition $q_N$ is $q$ . If $p$ is true, this will constitute a proof that $q_N = q$ is true. Just like in (1.5), there is an imprecision here. Can you tell what it is?

In this section we will illustrate a simple mathematical proof of the proposition:

$\forall n\in \mathbb{N}: p(n)\implies p(n^2),$ where $p(n) = (n\text{ is odd})$ i.e., the square of an odd natural number has to be odd. This seems true for a first selection of examples: $3^2=9, 5^2=25, \dots$ .

First we need to know what $p(n)$ means. What does it mean exactly for a number to be odd? This means that it is not divisible by $2$ or that there exists another natural number $a$ , such that $n = 2 a + 1$ . So

$p(n) = \exists a\in \mathbb{N}: n = 2 a + 1.$ Therefore we need to show that

$\left(\exists a\in \mathbb{N}: n = 2 a + 1\right) \implies \left(\exists b\in \mathbb{N}: n^2 = 2 b + 1\right).$ Notice that I had to change $a$ into $b$ in the second proposition above. The two variables are not the same: $a$ is associated with $n$ and $b$ is associated with $n^2$ .

Let us assume that $n = 2 a + 1$ . Now we need to argue that $n^2 = 2 b + 1$ for some $b\in \mathbb{N}$ . You stare at this for a while and notice that we should use the assumption $n=2 a + 1$ in computing $n^2$ :

$n^2 = (2 a + 1)^2 = (2 a)^2 + 2 (2 a) + 1^2 = 4 a^2 + 4 a + 1 = 2(2 a^2 + 2 a) + 1.$ Thus, using our assumption we may conclude that if $n = 2 a + 1$ , then

$n^2 = 2 b + 1,$ where $b=2a^2+ 2 a$ . This completes the proof.

The beauty here is that we have verified for all odd natural numbers that their square is odd. Not just a finite selection like $3, 7, 11, 13$ .

Below I have given a very detailed walk through of the proof above. It exemplifies how to write up the proof mixing words and mathematics. In many ways a proof is like a detailed argument in a court case, except that the rules of mathematics are universal. You need the absolute truth.

1.7.1 Proof by contradiction

A proposition $p$ is either true or false. This seemingly obvious statement goes by the name of the law of excluded middle and dates back to the writings of Aristotle.

An irrational number is a (real) number that is not rational. It is a startling fact that such numbers exist, but they do! The square root $\sqrt{2}$ of two is an example. We will prove that there exists two irrational numbers $\alpha, \beta$ , such that $\alpha^\beta$ is rational.

Consider the proposition $p$ given by

$\gamma = \sqrt{2}^{\sqrt{2}}\text{is rational.}$ Either $p$ is true or false. If $p$ is true we are done putting $\alpha = \beta = \sqrt{2}$ . If not, then $p$ must be false and $\gamma$ is irrational. But then

$\gamma^{\sqrt{2}} = \left(\sqrt{2}^{\sqrt{2}}\right)^{\sqrt{2}} = (\sqrt{2})^{\sqrt{2}\cdot \sqrt{2}} = \sqrt{2}^2 = 2$ and we are done putting $\alpha = \gamma$ and $\beta = \sqrt{2}$ .

We know that $\sqrt{2}^2$ is $\sqrt{2}$ multiplied by itself, but can you define $\sqrt{2}^{\sqrt{2}}$ in a similar precise way?

Here you need help from the special functions $e^x$ and $\log(x)$ .

So which one is it? Is

$\sqrt{2}^{\sqrt{2}}$ rational or irrational?

This is advanced mathematics! Try to make sense of the famous Gelfond-Schneider theorem.

The law of excluded middle can be turned into a powerful proof technique called proof by contradiction.

Suppose we wish to establish that $p$ is true. Then we turn things upside down by assuming that $p$ is false i.e., that $\neg p$ is true. If we then by logical deduction can show that

$\neg p \implies q,$ for some proposition $q$ , which is demonstrably false, then $\neg p$ cannot be true (since true $\implies$ false is false). Therefore $\neg p$ must be false and $p$ must be true by the law of the excluded middle. This technique is used all the time!

Perhaps the two most famous proofs by contradiction in all of mathematics are due to Euclid. The first one is about the infinitude of the prime numbers. Here one assumes to begin with that there are finitely many prime numbers and follows this through to a contradiction. See Exercise 1.56.

Mentimeter

Primes

The second one (perhaps even attributable to Artistotle) is about the irrationality of $\sqrt{2}$ . Here one assumes to begin with that $\sqrt{2}$ is rational and follows this through reaching a contradiction. See Exercise 1.57.

We will give an example of a proof by contradiction using a previous exercise: let $p$ be the proposition that the subset

$S = \{x\in \mathbb{Q} \mid x > 0\}$ of $\mathbb{Q}$ does not have a first element. Recall the definition of a first element in the context of $S$ : $x_0\in S$ is a first element if

$\forall x\in S: x_0 \leq x.$ So if $x_0$ is a first element in $S$ , there cannot exist $x_1\in S$ , such that $x_1 < x_0$ .

The proof by contradiction in this case, runs as follows. Assume $\neg p$ i.e., that $S$ has a first element say

$x_0 = \cfrac{a}{b}.$ Then using $x_0$ we can form

$x_1 = \frac{a}{b+1},$ and you can checkCheck that $a b < a(b+1)$ . that $x_1\in S$ and $x_1 < x_0$ i.e., $x_0$ is not a first element. So our assumption that $S$ has a first element immediately leads to the conclusion that $S$ does not have a first element. In fact, we have proved

$\neg p \implies p.$ Therefore $\neg p$ (the wrong assumption) has to be false and thus $p$ must be true.

Consider the first $n$ prime numbers

$p_1 = 2, p_2 = 3, p_3 = 5, \dots, p_n.$ Check that

$\begin{aligned} &p_1\\ &p_1\, p_2 + 1\\ &p_1\, p_2\, p_3 + 1\\ &p_1\, p_2\, p_3\, p_4 + 1 \end{aligned}$ are prime numbers by using the Sage window below (factor gives the prime factorization of a natural number).

Is it true in general that

$p_1\, p_2\, \cdots \, p_n + 1$ is a prime number?

Assume that we know that every natural number must be divisible by a prime number. Show how the assumption that there are only finitely many prime numbers say

$p_1, p_2, \dots, p_n$ leads to a contradiction by using that the natural number

$p_1\, p_2\, \dots\, p_n + 1$ must be divisible by a prime number.

Suppose that $q(n) = (n \text{ is even})$ . Prove that

$\forall n\in \mathbb{N}: q(n^2) \implies q(n).$ Suppose that

$\sqrt{2} = \frac{m}{n}.$ Show that this implies $2 n^2 = m^2$ and that $m$ and $n$ are even numbers.

Given the above, write up a precise proof that $\sqrt{2}\not\in \mathbb{Q}$ using proof by contradiction.

You may wonder what is so special about rational numbers. Which property does $\sqrt{2}$ break? You can explore this by looking at the decimal expansion of some fractions below.

However, $\sqrt{2}$ is an algebraic number being a root in the polynomial $x^2 - 2$ . In general an algebraic number is a number, which is a root in a polynomial with coefficients in $\mathbb{Z}$ .

1.7.2 Proof by induction

A precocious GaussSee the article Gauss's Day of Reckoning for some history of this anecdote. proved the formula

$1 + 2 + \cdots + n = \frac{n(n+1)}{2} \tag{1.11}$ at the age of seven displaying remarkable ingenuity for his age. Lesser mortals usually use induction to prove this formula. Gauss was asked along with his classmates to compute the sum of all natural numbers $1, 2, \dots, 100$ . Using his formula he quickly came up with the correct answer $5050$ . His classmates had to work for the entire lesson.

Suppose that the formula in (1.11) is viewed as a proposition $p(n)$ . To prove the formula we need to prove it for all natural numbers (you can easily see that $p(1)$ and $p(2)$ are true) i.e., we need to prove

$\forall n\in \mathbb{N}\setminus\{0\}: p(n).$ An induction proof is a way of proving this statement by showing two things:

$p(1)$
$\forall n\in \mathbb{N}\setminus\{0\}: p(n)\implies p(n+1)$

These two statements ensure that $p(1) \implies p(2)$ . Therefore $p(2)$ must be true, since we assumed $p(1)$ true from the beginning. Similarly $p(2)\implies p(3)$ ensures that $p(3)$ is true and so on. In fact we have proved $p(n)$ for every $n\in \mathbb{N}$ using this technique. One can prove this using proof by contradiction and that every non-empty subset of $\mathbb{N}$ has a first element (see subsection 1.5.1 and below).

Suppose that $p(n)$ are infinitely many propositions given by $n\in \mathbb{N}\setminus\{0\}$ . Then

$\forall n\in \mathbb{N}\setminus\{0\}: p(n)$ is true if

$p(1)$ is true.
$\left(\forall n\in \mathbb{N}\setminus\{0\}: p(n)\implies p(n+1)\right)$ is true.

Suppose by contradiction that there exists $n\in \mathbb{N}\setminus\{0\}$ , such that $p(n)$ is false. Then the subset

$S = \{n\in \mathbb{N} \mid \neg p(n)\}\subseteq \mathbb{N}$ is non-empty. Therefore it has a first element $n_0\in S$ . Here $n_0 > 1$ , since $p(1)$ is assumed to be true. So we know that $p(n_0-1)$ is true and that $p(n_0-1)\implies p(n_0)$ is true. But the latter implication is a contradiction, since true implies false is false.

What happens if $\mathbb{N}\setminus\{0\}$ is replaced by $\mathbb{N}$ and $p(1)$ by $p(0)$ in Theorem 1.58?

Let us see how an induction proof plays out in the above example with the statement $p(n)$ that

$1 + 2 + \cdots + n = \frac{n(n+1)}{2}. \tag{1.12}$ Clearly $p(1)$ is true. We need to prove $p(n)\implies p(n+1)$ , so we assume that $p(n)$ holds i.e., that (1.12) is true. Then we may add $n+1$ to both sides of (1.12) to get

$1 + 2 + \cdots + n + (n+1) = \frac{n(n+1)}{2} + (n+1).$ Here the right hand side can be rewritten as

$\frac{n(n+1) + 2(n+1)}{2} = \frac{(n+1)(n+2)}{2},$ which is exactly what we want. This is the conjectured formula for the sum of the numbers $1, 2, \dots, n, n+1$ . Therefore we have proved that $p(n)\implies p(n+1)$ and the induction proof is complete.

Mentimeter

Induction

For a real number $r\neq 1$ , the extremely useful formula

$1 + r + \cdots + r^n = \frac{1 - r^{n+1}}{1-r} \tag{1.13}$ holds. Let us prove this formula by induction. For $n=1$ this amounts to the identity

$1 + r = \frac{1-r^2}{1-r},$ which is true since $1-r^2 = (1+r)(1-r)$ . We let $p(n)$ denote the identity in (1.13). We have seen that $p(1)$ is true. The induction step consists in proving $p(n)\implies p(n+1)$ . We can prove this by adding $r^{n+1}$ to the right hand side in (1.13):

$\frac{1 - r^{n+1}}{1-r} + r^{n+1} = \frac{1 - r^{n+1} + (1-r) r^{n+1}}{1-r} = \frac{1 - r^{n+2}}{1-r}. \tag{1.14}$ Real life application

In order to pay for a house you borrow $P$ DKK at an interest of $r$ per year. You want to pay off your debt over $N$ years by paying a fixed amount each year. How much is the fixed yearly amount you need to pay?

Let us analyze the setup: suppose that the fixed yearly amount is $Y$ . We will find an equation giving us $Y$ in terms of $P, N$ and $r$ . Put $q = 1+ r$ .

After one year you owe

$q P - Y.$ After two years you owe

$q(q P - Y) - Y.$ After three years you owe

$q ( q ( q P - Y) - Y) - Y.$ In general after $n$ years you owe

$q^n P - Y (1 + q + \cdots + q^{n-1}).$ Since we want to be debt free after $N$ years, the yearly payment will have to satisfy

$q^N P = Y ( 1 + q + \cdots + q^{N-1}).$ By the formula (1.13), we get

$q^N P = Y \frac{1-q^N}{1-q}.$ Here $Y$ can be isolated giving the formula

$Y = \frac{r P}{1 - \left(\frac{1}{1+r}\right)^N}.$ With the current (August 2023) interest rate around five percent, you pay a fixed monthly amount of around 5420 DKK (up from 3200 DKK in 2021, when the interest rate was one percent) for borrowing one million DKK over $30$ years.

Verify the computation (induction step) in (1.14) i.e., explain the operations used to go from the left to the right of the two equalities.

Locate the mistake in the following fake induction proof of the curious fact that $2^n = 2$ for every $n\in \mathbb{N}\setminus\{0\}$ .

Let $p(n)$ be the proposition $2^n = 2$ . Then $p(1)$ is true.

We wish to prove that $p(n) \implies p(n+1)$ assuming that $p(1), \dots, p(n)$ are true:

$\begin{aligned} 2^{n+1} &= 2^n \cdot 2\\ &= 2^n \cdot \frac{2^n}{2^{n-1}}\\ &= 2 \cdot \frac{2}{2}\,\,\text{(by }p(n)\text{ and }p(n-1)\text{)}\\ &= 2. \end{aligned}$ This shows that $p(n) \implies p(n+1)$ and therefore that $2^n = 2$ for every $n\in \mathbb{N}\setminus\{0\}$ .

Prove by induction that the sum of the first $n$ odd numbers is given by the formula

$1 + 3 + \cdots + (2 n - 1) = n^2,$ i.e., for $n=5$ we have

$1 + 3 + 5 + 7 + 9 = 25.$

Prove by induction that

$1^2 + 2^2 + 3^2 + \cdots + n^2 = \frac{n(n+1)(2n + 1)}{6}.$ ,

Prove using the idea of induction that

$2^n < n!$ for $n\geq 4$ .

The last exercise related to induction concerns the famous pigeonhole principle. The statement itself looks innocent, well almost ridiculous, but it is very powerful. Even the go-to website mathoverflow for research mathematicians has a quite nice thread about this.

Prove the following by induction on $m$ : if $n$ items are put into $m$ containers and $n > m$ , then at least one container must contain more than one item.

1.8 The concept of a function

A function is a crucial concept in mathematics. In Sage (actually python here) a simple function can be programmed like

The code above seems to take a number and returns the number plus one. This (f) is in fact a function taking as input a number and returning as output the number plus one. Notice that we do not even know which numbers we are talking about here. In mathematics we need to have a more precise notion of a function.

Mathematically a function $f$ takes values from a set $S$ and returns values in a set $T$ . In details, it is denoted $f: S\rightarrow T$ and the value associated with $s\in S$ is denoted $f(s)\in T$ .

The above python function could more formally be denoted as $f: \mathbb{Z}\rightarrow \mathbb{Z}$ with $f(n) = n+1$ if we are dealing with the integers, but we cannot tell from the code.

Well, to be fair ...

To be completely fair, it is possible from Python 3.5 to add type annotations to functions, so that we could write

def f(n: int) -> int: return(n+1)

in the Python code to state that the function should take values in the integers and return integers.

If you want the super precise mathematical definition of a function, I will give it here. A function $f: S\rightarrow T$ is a subset $f\subseteq S\times T$ , such that $(s, t_1)\in f \land (s, t_2)\in f \implies t_1 = t_2$ . In words it states that a function $f: S\rightarrow T$ is a subset $f$ of $S\times T$ , containing pairs having only one second coordinate for every first coordinate.

The everyday working definition of a function is more intuitive: a machine taking input from some set $S$ and giving output in some set $T$ . The uniqueness of the output is encoded in the mathematical definition of a function.

Please notice that a function is a very, very general concept. It is not just something that you draw as a graph on a piece of paper. Of course, you can draw a function $f:\mathbb{R}\rightarrow \mathbb{R}$ like $f(x) = x^2$ :

Generally, a function $f: S\rightarrow T$ is given by a machine, formula or algorithm that computes $f(x)\in T$ for every $x\in S$ . Nothing more, nothing less. It really has nothing to do with a graph (even though graphs can sometimes be useful for visualizing certain functions like $f(x) = x^2$ ).

Good examples of functions can be found in the cryptographic hash functions. They are examples of complicated functions $f:S \rightarrow T$ , where $S$ is infinite and $T$ finite. Here $S$ could be data like plain text files and $T$ could be a $256$ bit number. This is the setup for the widely used sha-256 cryptographic hash function. The whole point of a cryptographic hash function is that it must be humanly impossible to compute $y$ with $f(y) = f(x)$ given $f(x)$ A pair $x\neq y$ with $f(x) = f(y)$ is called a collision. In fact, sha-256 is used in the Bitcoin block chain. The precise definition of sha-256 can be found in FIPS PUB 180-4 approved by the Secretary of Commerce.

Other interesting functions output a bounded size digital footprint (checksum) of a file (like md5). This is very useful for checking data integrity of downloads over the internet. The md5 hash is a $128$ bit number.

Instead of listing $256$ or $128$ bits for the hash value one uses hexadecimal notation with digits in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 , a, b, c, d, e, f. A pair of hexadecimal digits then represents a byte or $8$ bits. Output from sha-256 and md5 consist of $64$ and $32$ hexadecimal digits respectively. You are welcome to experiment with these two hash functions in the Sage window below.

Another hashing function is NeuralHash (see also GitHub) constructed using deep learning. It is used in Apple's Child Sexual Abuse Material (CSAM) technology.

Mentimeter

Sha-256 and md-5

What is the sha-256 hash of your name? Change a few letters and recompute. Do you see any system? What about the md5 hash function? Can you find two different strings with the same md5 hash using your computer?

I have not answered the last question myself, but I am told that it is possible to find a collision for md5 using a garden variety home computer. Browsing the internet, it seems that the two strings $s_1$ and $s_2$ given in hexadecimal notationThis notation represents a sequence of bytes given by pairs of hexadecimal digits by

d131dd02c5e6eec4693d9a0698aff95c2fcab58712467eab4004583eb8fb7f89 
55ad340609f4b30283e488832571415a085125e8f7cdc99fd91dbdf280373c5b 
d8823e3156348f5bae6dacd436c919c6dd53e2b487da03fd02396306d248cda0 
e99f33420f577ee8ce54b67080a80d1ec69821bcb6a8839396f9652b6ff72a70

and

d131dd02c5e6eec4693d9a0698aff95c2fcab50712467eab4004583eb8fb7f89 
55ad340609f4b30283e4888325f1415a085125e8f7cdc99fd91dbd7280373c5b 
d8823e3156348f5bae6dacd436c919c6dd53e23487da03fd02396306d248cda0 
e99f33420f577ee8ce54b67080280d1ec69821bcb6a8839396f965ab6ff72a70

give a collision for md5. Verify that $s_1\neq s_2$ and that they give the same md5 hash. If you find a collision for sha-256 you will become world famous.

1.8.1 Notations for defining a function

If $f: S\rightarrow T$ is a function and $S$ is a finite set, then you can define $f$ using a simple table. This is best illustrated using an example. Suppose that $S = \{1, 2, 3\}, T = \mathbb{R}$ and

$\begin{aligned} f(1) &= \sqrt{2}\\ f(2) &= \pi\\ f(3) &= -1. \end{aligned}$ Then $f$ is expressed in table form as

$\def\arraystretch{1.5} \begin{array}{c|ccccccc} x & 1 & 2 & 3\\ \hline f(x) & \sqrt{2} & \pi & -1 \end{array}$

Very often the bracket (or Tuborg in Danish) notation is used. It is similar to if-then-else statements in programming:

$f(x) = \begin{cases} 0 &\text{if } x\leq 0\\ x^2 &\text{if } x > 0 \end{cases} \tag{1.15}$ defines the function $f:\mathbb{R} \rightarrow \mathbb{R}$ that outputs $0$ if the input $x\leq 0$ and $x^2$ if $x>0$ . In python we may express this as

What is $f(-17)$ and $f(17)$ for the function defined in (1.15). Draw the graph of $f$ . Come up with a function $f:S\rightarrow T$ , where it does not make sense to draw a graph.

We now define three very important notions related to functions.

Let $f: S\rightarrow T$ be a function. Then $f$ is called

injective, if $f(x) = f(y) \implies x = y$ for every $x, y\in S$ .
surjective, if for every $y\in T$ , there exists $x\in S$ , such that $f(x) = y$ .
bijective, if it is both injective and surjective.

Is a cryptographic hash-function as defined in Example 1.68 injective?

Suppose that

$S = \{1, 2, 3\}\qquad\text{and}\qquad T = \{1, 2, 3, 4\}.$ We define a function $f: S\rightarrow T$ by the table

$\def\arraystretch{1.5} \begin{array}{c|ccccccc} x & 1 & 2 & 3\\ \hline f(x) & 1 & 2 & 4 \end{array}$ Is $f$ injective? Is it surjective? Is it possible to adjust the table so that $f$ becomes injective? Is it possible to adjust the table so that $f$ becomes surjective?

Consider the function $f:S \rightarrow T$ given by

$f(x) = x^2,$ where $S = T = \mathbb{R}$ . Is $f$ injective? Is $f$ surjective? Suggest how to change $S$ and $T$ so that $f:S\rightarrow T$ becomes bijective.

Consider the function $f:\mathbb{Z} \rightarrow \mathbb{Z}$ given by

$f(x) = x + 1$ Show that $f$ is bijective.

Write down precisely how the truth table for $p\implies q$ may be expressed in terms of a function $f: S\rightarrow T$ . What are the sets $S$ and $T$ in this case?

1.8.2 Composition of functions

Given two functions $f: S\rightarrow T$ and $g: U\rightarrow V$ , where $V\subseteq S$ , we define a new function $f\circ g: U \rightarrow T$ by

$(f\circ g)(u) = f(g(u)).$ This notion calls for some reflection. We have a total of four sets in this definition: $U, V, S$ and $T$ and, not to forget, the condition that $V\subseteq S$ . If this last condition was not satisfied it would be meaningless to apply the function $f$ to $g(u)$ . I hope the diagram below helps the understanding.

The concept of a function is powerful and underlies functional programming in computer science: every computation can be realized as applying a composition of functions to an argument. This is exemplified in the computer language Haskell.

Suppose that

$U = \{1, 2, 3\},\qquad S = \{1, 2, 3, 4\}\qquad\text{and}\qquad T = \{7, 8, 9\}$ and that $g: U \rightarrow S$ and $f: S\rightarrow T$ are given by the tables

$\def\arraystretch{1.5} \begin{array}{c|ccccccc} x & 1 & 2 & 3\\ \hline g(x) & 1 & 3 & 4 \end{array}\qquad\text{and}\qquad \begin{array}{c|ccccccc} x & 1 & 2 & 3 & 4\\ \hline f(x) & 7 & 8 & 9 & 7 \end{array}$ Compute the table for $(f\circ g): U\rightarrow T$ . Show that $f\circ g$ is not injective. Adjust the table for $f$ so that $f\circ g$ becomes bijective.

Consider $f: \mathbb{R}\rightarrow \mathbb{R}^2$ and $g: \mathbb{R}^2\rightarrow \mathbb{R}$ given by

$\begin{aligned} f(t) &= (t^2, t^3)\\ g((x, y)) &= \cos(x y) + x \sin(x + y). \end{aligned}$ What is $(g\circ f)(t)$ as a function from $\mathbb{R}$ to $\mathbb{R}$ in terms of $t$ ?

1.8.3 The inverse function

If $f:S\rightarrow T$ is bijective, then we may define a function $g: T\rightarrow S$ , so that $(f\circ g)(y) = y$ for every $y\in T$ and $(g\circ f)(x)$ for every $x\in S$ . This function is denoted $f^{-1}$ .

How do we define $f^{-1}(y)$ for $y\in T$ ? Well, since $f$ is surjective, we may find $x\in S$ so that $y = f(x)$ . Now, we simply define

$f^{-1}(y) = x. \tag{1.16}$ We cannot have $x_1 \neq x_2$ in $S$ with $f(x_1) = f(x_2) = y$ , since $f$ is injective. We only have one choice for $x$ in (1.16). Therefore (1.16) really is a good and sound definition.

Let $f: S\rightarrow S$ , where $S = \{1, 2, 3\}$ be given by

$\def\arraystretch{1.5} \begin{array}{c|ccccccc} x & 1 & 2 & 3\\ \hline f(x) & 3 & 1 & 2 \end{array}.$ Compute $f^{-1}$ .

What if the definition of $f$ is changed to

$\def\arraystretch{1.5} \begin{array}{c|ccccccc} x & 1 & 2 & 3\\ \hline f(x) & 3 & 2 & 2 \end{array}.$ Does $f^{-1}$ make sense here?

What is the inverse function of $f:\mathbb{Z}\rightarrow \mathbb{Z}$ given by $f(x) = x + 1$ ? What is the inverse function of $g: S \rightarrow S$ , where $g(x) = \sqrt{x}$ and $S = \{x\in \mathbb{R}\mid x\geq 0\}$ ?

1.8.4 Neural networks

Having defined functions and composition of functions, we can deflate the term (deep) neural network, which is often clouded in magic and mystery.

A neural network is a special case of a function

$f: A\rightarrow B, \tag{1.17}$ where $A\subseteq \mathbb{R}^m$ and $B\subseteq \mathbb{R}^n$ . Neural networks are often compositions of many intermediate functions called (hidden) layers.

A function such as (1.17) can be written

$f(x_1, \dots, x_m) = \left( f_1(x_1, \dots, x_m), \dots, f_n(x_1, \dots, x_m)\right), \tag{1.18}$ where $f_1, \dots, f_n$ are functions $A\rightarrow \mathbb{R}$ .

In a neural network the functions $f_1, f_2, \dots, f_n$ are viewed as neuronsTo be precise, the functions should be viewed as synapses. Depending on their input they either fire or do not fire a signal. Classically this is modelled by the perceptron, which is a function $p:\mathbb{R}^n\rightarrow \mathbb{R}$ of the form

$p(x_1, \dots, x_n) = \begin{cases} 1 &\text{if } w_1 x_1 + \cdots + w_n x_n > b\\ 0 &\text{if } w_1 x_1 + \cdots + w_n x_n \leq b \end{cases} \tag{1.19}$ for fixed numbers $w_1, \dots, w_n$ (called weights) and a number $b$ (called the threshold). If the weighted sum $w_1 x_1 + \cdots + w_n x_n$ is above the threshold, the neuron fires (returns the value $1$ ). If not it does not fire (returns the value $0$ ).

Consider the three perceptrons $p_1, p_2, p_3: \mathbb{R}^2\rightarrow \mathbb{R}$ , where

$p_1(x, y) = \begin{cases} 1 &\text{if } -x-y > -\frac{3}{2}\\ 0 &\text{if } -x-y \leq -\frac{3}{2} \end{cases}, \qquad p_2(x, y) = \begin{cases} 1 &\text{if } x + y > \frac{1}{2}\\ 0 &\text{if } x + y \leq \frac{1}{2} \end{cases},$ and

$p_3(x, y) = \begin{cases} 1 &\text{if } x + y > \frac{3}{2}\\ 0 &\text{if } x + y \leq \frac{3}{2} \end{cases}.$ Let $f(x, y) = p_3 (p_1(x, y), p_2(x, y))$ . Then $f$ is a composite function $f = g\circ h$ of two functions $h: \mathbb{R}^2\rightarrow \mathbb{R}^2$ and $g: \mathbb{R}^2\rightarrow \mathbb{R}$ . Write down these functions.

Hint

Have a closer look at (1.18) in order to understand how functions from $\mathbb{R}^2$ to $\mathbb{R}^2$ are expressed. Notice that our notation is a bit inconsistent when it comes to types. For example, the function $p_1:\mathbb{R}^2 \rightarrow \mathbb{R}$ should really be denoted $p_1((x, y))$ instead of $p_1(x, y)$ , since it takes input from $\mathbb{R}^2 = \mathbb{R}\times \mathbb{R}$ . This is remedied in the (hopefully easy to understand) python code below.

Compute $f(0, 0), f(1, 0), f(0, 1)$ and $f(1, 1)$ .

Relate the perceptrons $p_1$ and $p_2$ to the illustration below. What do you think the red and blue line illustrate? What does it mean that a dot is solid compared to hollow? What is special about points between the red and blue lines? Try to relate $f(0,0), f(1,0), f(0,1)$ and $f(1,1)$ to the illustration.

(Illustration courtesy of William Heyman Krill).

Give weights $w_1, w_2$ and a threshold $b$ for a perceptron $p:\mathbb{R}^2\rightarrow \mathbb{R}$ that computes the logical and function $\land$ i.e, $p$ must satisfy

$\begin{aligned} p(0,0) &= 0\\ p(1, 0) &= 0\\ p(0,1) &= 0\\ p(1, 1) &= 1. \end{aligned}$ Do the same for the logical or function $\lor$ .

The output of one neuron can be used as input for other neurons in a potentially extremely complicated network:

The diagram above represents a neural network, which is a function $\mathbb{R}^8\rightarrow \mathbb{R}^4$ . This function is actually a composition (represented by the hidden layers $1$ , $2$ , $3$ and the output layer):

$\mathbb{R}^8\rightarrow \mathbb{R}^9 \rightarrow \mathbb{R}^9 \rightarrow \mathbb{R}^9 \rightarrow \mathbb{R}^4.$ All of the nodes above, except the ones in the input layer, represent perceptrons.

Is it possible to find a perceptron $p:\mathbb{R}^2\rightarrow \mathbb{R}$ , such that

$\begin{aligned} p(0,0) &= 0\\ p(1, 0) &= 1\\ p(0,1) &= 1\\ p(1, 1) &= 0? \end{aligned}$ What if you are allowed to use a neural network composed as $\mathbb{R}^2\rightarrow \mathbb{R}^2\rightarrow \mathbb{R}$ (one hidden layer)

Mathematically there is no reason to use special functions such as perceptrons in each node. One also uses a (smooth) version of the perceptron employing the sigmoid function. With the notation above, this function is given as

$\sigma(x_1, \dots, x_n) = \frac{1}{1 + e^{-(w_1 x_1 + \cdots + w_n x_n) - b}}.$ However, around 2011 it was observed that the perceptron activation function (ReLU) as defined in (1.19) led to better training of deep neural networks. It is today, the most popular activation function.

1The language of mathematics

1.1 Black box warnings

1.2 Computer algebra

1.3 Objects or elements and the symbols = and \neq

1.4 Sets

1.4.1 The empty set

1.4.2 Sets of numbers

1.4.3 Notation and rules for arithmetic operations

1.4.4 The symbols \in and \notin

1.4.5 Subsets and the symbols \subseteq and \not\subseteq

1.4.6 Intersections, unions and the symbols \cap,\,\, \cup and \setminus

1.4.7 Pairs, triples and tuples

1.5 Ordering numbers

1.5.1 Subsets of numbers and first elements

1.6 Propositional logic

1.6.1 The symbols \exists, \forall and propositions with variables (predicates)

1.6.2 The use of implication (\implies) and bi-implication (\iff)

1.7 What is a mathematical proof?

1.7.1 Proof by contradiction

1.7.2 Proof by induction

1.8 The concept of a function

1.8.1 Notations for defining a function

1.8.2 Composition of functions

1.8.3 The inverse function

1.8.4 Neural networks

1.3 Objects or elements and the symbols $=$ and $\neq$

1.4.4 The symbols $\in$ and $\notin$

1.4.5 Subsets and the symbols $\subseteq$ and $\not\subseteq$

1.4.6 Intersections, unions and the symbols $\cap,\,\, \cup$ and $\setminus$

1.6.1 The symbols $\exists, \forall$ and propositions with variables (predicates)

1.6.2 The use of implication ( $\implies$ ) and bi-implication ( $\iff$ )