Real Analysis/The real numbers

From testwiki
Jump to navigation Jump to search

Template:Header

Why we need the real numbers

This is a good juncture to justify the subject of real analysis, which essentially reduces to justifying the necessity of studying . So, what is missing? Why do we need anything beyond the rationals?

The first sign of trouble is square roots. Famously, 2 is not rational – in other words, there is no rational number which squares to 2 (see the exercises). This fact has a curious consequence – consider the following function:

f: ; x{0:x2<21:x2>2

Clearly this function has a dramatic jump in it around the rational 1.4 , where it suddenly changes from being equal to zero and starts being equal to one. However, it's difficult (or even impossible) to pin down exactly where this jump happens. Any specific rational number is safely on one side or the other, and, indeed, in the standard Topology on , this function is continuous (don't worry if that makes no sense to you).

It is this flaw which the real numbers are designed to repair. We will define the real numbers so that no matter how clever we try to be, if a function has a 'jump' in the way that f does, then we will always be able to find a specific number at which it jumps.

The following sections describe the properties of which make this possible.

Different perspectives

In order to prove anything about the real numbers, we need to know what their properties are. There are two different approaches to describing these properties – axiomatic and constructive.

An axiomatic approach

When we take an axiomatic approach, we simply make a series of assertions regarding , and assume that they hold.

The assertions that we make are called axioms – in a mathematical context this term means roughly 'basic assumption'.

The advantage of this approach is that it is then clear exactly what has been assumed, before proceeding to deduce results which rely only on those assumptions.

The disadvantage of this approach is that it might not be immediately clear that any object satisfying the properties we desire even exists!

With a constructive approach, we are not happy simply to assume exactly what we want, but rather we try to construct from something simpler, and then prove that it has the properties we want. In this way, what could have been axioms become theorems. There are several different ways to do this, starting from and using some method to 'fill up the gaps between the rationals'.

All of these methods are fairly complex and will be put off until the next section.

The axioms

So, what are these axioms which we will need? The short version is to say that is a complete ordered field. This is in fact saying a great many things:

  • That is a totally ordered field.
  • That is complete in this ordering (Note that the meaning of completeness here is not quite the same as the common meaning in the study of partially ordered sets).
  • That the algebraic operations (addition and multiplication) described by the field axioms interact with the ordering in the expected manner.

In more detail, we assert the following:

  1. is a field. For this, we require binary operations addition (denoted +) and multiplication (denoted ×) defined on , and distinct elements 0 and 1 satisfying:
    1. (,+,0) is a commutative group, meaning:
      1. x,y,z:(x+y)+z=x+(y+z) (associativity)
      2. x,y:x+y=y+x (commutativity)
      3. x:x+0=x (identity)
      4. x:y:x+y=0 (inverse)
    2. ({0},×,1) is a commutative group, meaning:
      1. x,y,z{0}:(x×y)×z=x×(y×z) (associativity)
      2. x,y{0}:x×y=y×x (commutativity)
      3. x{0}:x×1=x (identity)
      4. x{0}:y{0}:x×y=1 (inverse)
    3. x,y,z:x×(y+z)=(x×y)+(x×z) (distributivity)
  2. is a totally ordered set. For this we require a relation (denoted by ) satisfying:
    1. x:xx (reflexivity)
    2. x,y,z:(xy and yz)xz (transitivity)
    3. x,y:(xy and yx)x=y (anti-symmetry)
    4. x,y:either xy or yx (totality)
  3. is complete in this order (see below for details).
  4. The field operations and order interact in the expected manner, meaning:
    1. x,y,z:xy(x+z)(y+z)
    2. x,y,z:(xy and 0z)(x×z)(y×z)

This is a substantial list, and if you are not used to axiomatic mathematics (or even if you are!) it may seem somewhat daunting, especially since we have yet to give details of what completeness means. This is amongst the longest list of axioms in any region of mathematics, but if you examine each in turn, you will find that they all state things which you have probably taken for granted as 'the way numbers behave' without a second thought.

These axioms are so exacting that there is a sense in which they specify the real numbers precisely. In other words is the only complete ordered field.

Further notation

Having defined these operations and relations on , we need to introduce more notation to aid in talking about them. Hopefully all these conventions should be familiar to you, but it is important to formally present them all to avoid confusion following from misunderstanding of notation:

  • Rather than writing × for multiplication, we may simply denote it by juxtaposition. In other words, we write xy to denote x×y.
  • Since both multiplication and addition are associative, we omit unnecessary bracketing when several numbers are added or multiplied. In other words, rather than writing (x+y)+z or x+(y+z), which are equal, we simply write x+y+z to denote their common value.
  • To further save writing of brackets, by convention, multiplication has a higher precedence than addition. So, for example, the expression x+yz should be interpreted as x+(yz), not as (x+y)z.
  • The number x+y is called the sum of x and y.
  • The number xy is called the product of x and y.
  • The additive inverse of x is written x, and called the negative or negation of x. So, x+(x)=0.
  • The multiplicative inverse of x is written x1, and called the reciprocal, or simply the inverse of x. So, x(x1)=1.
  • We define the binary operation of subtraction as follows: For x,y, we set xy=x+(y). The number xy is called the difference of x and y.
  • Subtraction has the same precedence as addition (less than that of multiplication), and when the two operations are mixed without bracketing, left-associativity is implied. For example, a+bcd+e should be interpreted as (((a+b)c)d)+e.
  • We define the binary operation of division as follows: For x,y, with y=0, we set x/y=x(y1). The number x/y is called the quotient of x and y, and is also denoted xy.
  • Division has a higher precedence than that of addition or subtraction, but there is no simple convention as to how to handle mixed multiplication and division. Using the xy notation, rather than the x/y notation helps to avoid confusion.
  • We define the binary operation of exponentation as follows: For x and n0 we define xn recursively by x0=1 and xn+1=(xn)x. Then for n, with n<0, we define xn=(x1)n.
  • Exponentation has a higher precedence than any of division, multiplication, addition and subtraction. For example, ab2+d3 should be interpreted as (a(b2))+(d3).
  • We write xy to mean yx.
  • We write x<y to mean xy and x=y.
  • We write x>y to mean y<x.
  • To abbreviate a collection of equalities or inequalities, they may be strung together. For example, the expression ab=c=d<e should be interpreted as ab and b=c and c=d and d<e.
  • To say x is positive means x>0.
  • To say x is negative means x<0.
  • To say x is non-positive means x0.
  • To say x is non-negative means x0.
  • We also introduce notation for several common varieties of subsets of . All of these subsets are called intervals:
    • [a,b]={x:axb} (called the closed interval from a to b)
    • (a,b)={x:a<x<b} (called the open interval from a to b)
    • [a,b)={x:ax<b}
    • (a,b]={x:a<xb}
    • In all these cases, a is called the lower limit of the interval, and b is called the upper limit.
    • An excluded lower limit (as in the second and fourth cases) may be replaced by to indicate that there is no lower restriction. For example (,b]={x:xb}.
    • Similarly, an excluded upper limit (as in the second and third cases) may be replaced by . For example, (,)=.
    • Some specific intervals which appear frequently are the closed unit interval, or just unit interval, which is [0,1], and +=(0,), the positive real numbers.

Completeness

The rational numbers satisfy all of the axioms above which have been explained in detail, and so if we are to escape the problem which we described above then we clearly need something more. This 'something more' is completeness. There are several equivalent ways of describing completeness, but most of them require us to know about Sequences, which we do not introduce until the next chapter, so for the moment we can only give one definition.

Upper bounds

Let A. We say b is an upper bound for A if

sA:sb

For example, 3 is an upper bound for [0,1], as is 1, but 12 is not, because 1[0,1] and 1>12. A set with an upper bound b is said to be bounded above by b.

Least Upper Bound

We say s is a least upper bound or supremum for A if s is an upper bound for A, and b is any upper bound for A then sb. More formally:

(aA:as) and (b:((aA:ab)(sb)))

Similarly, we say b is a lower bound for A if

aA:ab

and we say i is a greatest lower bound or infimum for A if:

(aA:ai) and (b:((aA:ab)(ib)))

The supremum and infimum of a set A are denoted supA and infA respectively.

The Least upper bound axiom

Now we are finally ready to state the last axiom:

  • If S is non-empty and has an upper bound, then S  has a least upper bound in .

This is the axiom of the real numbers that finally satisfies what was lacking in the rationals: completeness. It is worth noting at this point, to avoid possible confusion, that in the study of general partially ordered sets, the definition of completeness is that every subset has a least upper bound, and there is no condition that they be non-empty or bounded above. Nevertheless, we really do wish to impose these two conditions in this case.

Other completeness axioms

There are other equivalent ways to state the completeness axiom, but they involve sequences, so we shall delay them until after the discussion of that topic. Because of the existence of these other forms, this axiom is sometimes called the least upper bound axiom.