Constructions of Familiar Number Systems

One of the things you learn like five times in the first half of your undergraduate math career is how to construct the reals from the rationals, the rationals from the integers, the integers from the naturals, the naturals from nothing. It’s an entertaining process and is a good way to fill up a lecture, and it might as well be reproduced here.

First, we begin life with \{\}, the empty set, and we define 0 = \{\}. (I do not typically endorse 0 \in \mathbb{N}, but it fits in this construction so we’ll change conventions for this post.) To get at the rest of the natural numbers, we construct a successor function such that \mathrm{Succ}(X) = \{X\} \cup X. Thus, 1 = \{0\}, 2 = \{0, 1\}, 3 = \{0, 1, 2\}, and so on. We induct, and we define the set given to us by induction to be \mathbb{N}, the set of natural numbers. We get a couple operators immediately; for instance, the minimum and maximum of a set of naturals is their intersection and union respectively. As shorthand for applying the successor function multiple times, we write n + m = \mathrm{Succ}^m(n), which gives us the usual definition of addition as an operation +:\mathbb{N} \times \mathbb{N} \to \mathbb{N}. Multiplication of n\cdot m, as you learned in grade school, is just adding n to itself m-fold times.

For those who have had any training in algebra before, + smells like a group operation, but falls slightly short. It’s associative, the set is closed under it, we have an identity element, but we’re missing inverses. To this end, we consider the set of pairs of natural numbers, written \mathbb{N}^2. We define addition as component-wise and multiplication by the formula (a, a') \cdot (b, b') = (ab+a'b', a'b+ab'). Then, finally, we construct the equivalence relation (a, a') \sim (b, b') if and only if a + b' = b + a', and define the quotient space to be \mathbb{Z} = \mathbb{N}^2/\sim. It’s up to the reader to check that we can carry +, \cdot over from \mathbb{N} and have them be well-defined. To recover the familiar notation for the integers, for any pair (n, m) there must exist either a pair (n', 0) or (0, m') in its equivalence class, and so to denote the equivalence class we write either n' or -m' respectively.

Now that we have the structure of addition on the integers, we’ve constructed a ring, (\mathbb{Z}, +, \cdot). We are missing the multiplicative inverses necessary to form a field, thus motivating the construction of the rationals. Let’s begin with an arbitrary commutative ring A. We can induce a ring structure on A \times (A \setminus \{0\}) by defining multiplication component-wise (i.e. (a, a') \cdot (b, b') = (ab, a'b')) and defining addition through the following formula: (a, a') + (b, b') = (ab'+a'b, a'b'). Finally, we need another equivalence relation \sim_A such that (a, a') \sim_A (b, b') if and only if ab' = a'b, and we construct the final ring as A \times (A \setminus \{0\}) / \sim_A, for which we write \mathrm{Quot}(A) as shorthand. \mathrm{Quot}(A) is in fact a field, if A has no zero divisors. Applying this construction to \mathbb{Z}, we get \mathrm{Quot}(\mathbb{Z}) = \mathbb{Q}, the set of rationals.

\mathbb{Q} has most of the algebraic properties that we need for a basic structure to work in, but it’s missing an important analytic and topological one: completion. To this end, we need to switch perspective and think about metrics. We define a pre-metric on a set X (which we will start calling a metric space, or just a space) to be a function d : X \times X \to \mathbb{Q} such that the following properties hold:

  • Nonnegativity: \forall (x, y \in X) . d(x, y) \ge 0
  • Indiscernibles: \forall (x, y \in X) . d(x, y) = 0 \Longleftrightarrow x = y
  • Symmetry: \forall (x, y \in X) . d(x, y) = d(y, x)
  • Triangle Inequality: \forall (x, y, z \in X) . d(x, y) + d(y, z) \ge d(x, z)

Some notes: A metric is a pre-metric whose codomain is \mathbb{R}. The reason we’re working with \mathbb{Q} here is because we haven’t yet constructed the reals. Also, that last property is also sometimes called ’subadditivity’, a word that I personally have seen come up a lot more frequently in measure theory. Now, \mathbb{Q} is naturally a metric space with metric d(x, y) = |x - y|, where |\cdot | denotes absolute value. We typically call this metric the Euclidean metric, since there are lots of viable metrics on \mathbb{Q} that behave radically differently.

We haven’t talked about analysis yet because we haven’t had to use much of it, but I have been using the idea of sequences and convergence without ever defining them. To correct that, a sequence in X is a function a: \mathbb{N} \to X, and we typically write a_i to mean the ith term of the sequence, or a_n to designate the sequence as a whole. We say that a sequence a_n in a metric space X converges to a point a \in X (called a limit point) if \forall (\epsilon > 0) \exists (N \in \mathbb{N}) such that \forall (n > N) . d(a_n, a) < \epsilon. The idea is that if we want to make the points in the sequence arbitrarily close to the limit point, then we just need to go past some finite point in the sequence, and then everything on that tail is within the desired distance. Now, we say a sequence is Cauchy if \forall (\epsilon > 0) \exists (N \in \mathbb{N}) such that \forall (n, m > N) . d(a_n, a_m) < \epsilon. The idea expressed in these quantifiers is that as we get farther out in the sequence, the individual terms of the sequence start to cluster up next to each other (but not necessarily around some particular point). All convergent sequences are Cauchy, but Cauchy sequences are not necessarily convergent in all metric spaces. For spaces where that reverse implication is true, we say that the space is ‘complete’.

There are a lot of definitions involving continuity and extension of uniformly continuous functions to complete spaces and topological density and thus at least point-set topology with its open balls and lots of other concepts. We can go back and revisit the particulars of the construction of \mathbb{R} at a later date, but suffice it to say that \mathbb{R} is the unique complete metric space such that every ball B_r(a) = \{x \in \mathbb{R} | d(a, x) < r \} has nonempty intersection with \mathbb{Q}. A useful corollary of the completeness of \mathbb{R} is that every set in \mathbb{R} with an upper bound has a least upper bound. For instance, the set (0, \sqrt{2}) did not have an upper bound in \mathbb{Q}, but in \mathbb{R} that supremum exists and is called \sqrt{2}.

We have one last leap to make, one that I’ve already detailed before if you go back 30 or so posts. \mathbb{R} is not algebraically closed — that is, there exist polynomials with coefficients in \mathbb{R} with roots not in \mathbb{R}. In order to generate an algebraically closed field, we use the field extension \mathbb{R}[i]/(i^2+1) = \mathbb{C}, and we’re done.

Update: I was reading an introduction to K-theory last night and found that the construction of the integers from the naturals was reused in the context of k-rank vector bundles over a topological space.  Turns out math is interrelated.

Leave a Reply