Chapter 3

Even in ancient times, people were fascinated by the idea of divisibility. This book is interested primarily in positive integers, just as Euclid was.

Definition 3.1. A positive integer a is said to be a divisor of another positive integer b if b=ac for some positive integer c. (If a is a divisor of b, we write a|b and say "a divides b," or "b is a multiple of a").
Definition 3.2. If an integer p>1 has only 1 and p as divisors, it is called prime (for example 2, 3, 5, 7, and 11 are primes).

Two elementary facts help illuminate the place of primes among the integers. First, if p₁, p₂, ..., p_n are primes, then N = 1 + p₁p₂...p_n is not divisible by any of p₁, p₂, ..., p_n (the remainder when N is divided by any p_i is 1). Thus either N is prime or N has a prime divisor other than the p_i. Thus, no finite list of primes can contain every prime; that is, there are infinitely many primes.

Also, if n > 1 is not prime (such an integer is called composite), n must have a prime divisor ≤ √n, since it must have a divisor d with 1 < d ≤ √n (n = ab implies a ≤ √n or b ≤ √n), and that divisor is either prime or has a prime divisor ≤ √n. Thus, one can determine if a number n is prime by eliminating all multiples of the primes ≤ √n. This method is known as the Sieve of Eratosthenes (Figure 3.1).

Figure 3.1. Animated Sieve of Eratosthenes. Primes are sifted out that are less than 169 = 13². Thus, only multiples of 2, 3, 5, 7, and 11 need be eliminated. The animation shows the successive elimination of these multiples to reveal the set of all primes less than 169.
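The sieve can be sketched in a few lines of Python. This is only an illustration; the function name sieve_of_eratosthenes and its limit parameter are choices made here, not part of the text. For the limit 169 it eliminates multiples of 2, 3, 5, 7, and 11 only.

```python
from math import isqrt

def sieve_of_eratosthenes(limit):
    """Return all primes p with 2 <= p < limit."""
    if limit < 3:
        return []
    is_prime = [True] * limit
    is_prime[0] = is_prime[1] = False
    # Only primes up to the square root of the largest candidate are needed.
    for p in range(2, isqrt(limit - 1) + 1):
        if is_prime[p]:
            # Smaller multiples of p were already removed by smaller primes.
            for multiple in range(p * p, limit, p):
                is_prime[multiple] = False
    return [n for n, flag in enumerate(is_prime) if flag]

primes_below_169 = sieve_of_eratosthenes(169)
print(primes_below_169)
```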

The primes are the building blocks of the integers, since every positive integer can be written uniquely as a product of primes (Fundamental Theorem of Arithmetic). This uniqueness extends only to the primes involved and how many times they are involved; for example, the fact that 12 = 2·2·3 =2·3·2 = 3·2·2 does not violate uniqueness. By convention, we say that 1 is a product of no primes. It is the desire for unique factorization that led to the avoidance of 1 as a prime; obviously, 1 can be included in a factorization of a number as often as desired.
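As a small illustration of the Fundamental Theorem of Arithmetic, the following Python sketch factors a positive integer by trial division. The name prime_factorization is hypothetical, and trial division is chosen only for simplicity. Recording the result as a mapping from primes to exponents reflects the sense of uniqueness described above: the order of the factors does not matter, and 1 yields the empty product.

```python
def prime_factorization(n):
    """Return the prime factorization of n >= 1 as a dict {prime: exponent}."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    factors = {}
    p = 2
    while p * p <= n:
        # Divide out p as often as possible; any p that divides n here is prime,
        # because all smaller prime factors have already been removed.
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        # Whatever remains is a prime larger than the square root of the original cofactor.
        factors[n] = factors.get(n, 0) + 1
    return factors

print(prime_factorization(12))
```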

Take a side trip to an application that employs a strategy that is abstractly similar to the method of Eratosthenes. It does so, however, in the urban setting of downtown Ann Arbor, Michigan.

Almost as important as knowing what divisors an integer has is knowing what divisors it shares with other integers, for this allows the consideration of equations relating one integer to another.

Definition 3.4. Let a, b be two positive integers. The greatest common divisor (a, b) of these two integers is the largest positive integer d such that d|a and d|b.

If the prime factorizations of a and b are readily available, it is easy to find (a, b).

Suppose a and b are positive integers, a > b. Then there are integers q (quotient) and r (remainder) such that a = bq + r and 0 ≤ r < b (these were first encountered when learning long division in the fourth grade).

Mini-Theorem 3.6. Suppose a, b, r are positive integers with a = bq + r, 0 < r < b < a. Then (a,b) = (b, r).
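Proof: It is only necessary to show that the pair a, b and the pair b, r share exactly the same common divisors, since then they certainly have the same GREATEST common divisor. If d | a and d | b, then d | (a - bq) = r. Conversely, if d | b and d | r, then d | (bq + r) = a.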

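Euclidean Algorithm 3.8. Suppose a, b are positive integers with a > b. Suppose further that repeated division gives
a = bq₁ + r₁, 0 < r₁ < b,
b = r₁q₂ + r₂, 0 < r₂ < r₁,
r₁ = r₂q₃ + r₃, 0 < r₃ < r₂,
...
r_{n-1} = r_n q_{n+1} + r_{n+1}, 0 < r_{n+1} < r_n,
r_n = r_{n+1} q_{n+2}.
Then (a, b) = r_{n+1}, the last nonzero remainder.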

Proof: First note that the process defined above is a finite one, since each remainder is smaller than the previous one, with the result that eventually, the integer remainder must become 0. (Computer scientists might be interested in Lamé's Theorem, which proves that this process can take at most 5n steps, where n is the number of decimal digits in b.)

By the mini-fact, (r_n, r_{n+1}) = r_{n+1}, since r_{n+1} divides r_n. But, by the mini-theorem, (a, b) = (b, r₁) = (r₁, r₂) = ... = (r_{n-1}, r_n) = (r_n, r_{n+1}). Therefore, (a, b) = r_{n+1}.
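The repeated division can be sketched in Python as follows. The function name euclid_gcd and the choice to return the list of remainders are illustrative assumptions, and the inputs 1071 and 462 are sample values chosen here.

```python
def euclid_gcd(a, b):
    """Return ((a, b), remainders) for positive integers a and b.

    The list holds every remainder produced, ending with the final 0.
    """
    remainders = []
    while b != 0:
        # Replace the pair (a, b) by (b, r), which has the same gcd.
        a, b = b, a % b
        remainders.append(b)
    return a, remainders

d, remainders = euclid_gcd(1071, 462)
print(d, remainders)
```

Here 1071 = 2(462) + 147, 462 = 3(147) + 21, and 147 = 7(21), so the last nonzero remainder, 21, is the greatest common divisor.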

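That is, one can work backwards through the equations until (a, b) is expressed in terms of a and b. Done this way, the computations appear quite difficult, and it appears that all the quotients and remainders are needed. The computation can be done more easily if each remainder is expressed in terms of a and b as the Euclidean Algorithm is carried out. Note that the first remainder, r₁ = a - bq₁ (where q₁ is the first quotient), is already such a combination, and each later remainder is obtained from the two before it by subtracting a multiple of one from the other, so it is again a combination of a and b.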

In fact, the a and b can be left understood and only their coefficients kept track of. A detailed example follows.

Figure 3.2. Animation of Euclidean Algorithm application.
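The bookkeeping in which only the coefficients of a and b are tracked might be sketched as follows in Python. The name extended_gcd and the variable names are assumptions made for this illustration. Each remainder is stored together with a pair (x, y) meaning remainder = a·x + b·y.

```python
def extended_gcd(a, b):
    """Return (d, x, y) with d = (a, b) = a*x + b*y for positive integers a, b."""
    # Invariant: r_prev = a*x_prev + b*y_prev and r_curr = a*x_curr + b*y_curr.
    r_prev, x_prev, y_prev = a, 1, 0
    r_curr, x_curr, y_curr = b, 0, 1
    while r_curr != 0:
        q = r_prev // r_curr
        # The new remainder is r_prev - q*r_curr; its coefficients follow the same rule.
        r_prev, r_curr = r_curr, r_prev - q * r_curr
        x_prev, x_curr = x_curr, x_prev - q * x_curr
        y_prev, y_curr = y_curr, y_prev - q * y_curr
    return r_prev, x_prev, y_prev

d, x, y = extended_gcd(1071, 462)
print(d, x, y, 1071 * x + 462 * y)
```

No back-substitution is needed: when the remainder reaches 0, the previous row already expresses (a, b) in terms of a and b.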

The procedure of this section will be of great significance in solving congruences in the next section.

Number-theoretic congruence
One of the most powerful tools of number theory is the notion of congruence, which partitions the integers into finitely many classes of related integers.

Definition 3.9. Two integers x and y are said to be congruent modulo m (a positive integer) if m is a divisor of x - y. This is written x ≡ y (mod m).

The most common place the everyday world uses congruence, albeit not consciously, is in telling time. Any time y that is a multiple of 12 hours after a time x is called by the same name as x.

For example, 60 hours after 3 o'clock, it is once more 3 o'clock, since 60 + 3 ≡ 3 (mod 12).

Indeed, it is quite common to use a set of small representatives of all the possibilities modulo m. In telling time, 1, 2, ..., 12 are used for all possibilities. Military time uses 0, 1, ..., 23 for the hours in a day modulo 24. Mathematicians usually use 0, 1, ..., m-1 when they want representatives for the classes of numbers which are congruent modulo m, for the following reason.

Proof: If a = mb + r with 0 ≤ r < m, then a - r = mb, so m is a divisor of a - r; that is, a ≡ r (mod m). Thus the natural representative of a number modulo m is its remainder when divided by m.

In fact, any set of m integers, none of which is congruent to any of the others modulo m, is called a complete residue system modulo m. No doubt this name comes from thinking of remainders as residues (what's left over) after division by m. It is easy to create an addition and multiplication for these residues.

Theorem 3.11. Suppose x ≡ y (mod m) and s ≡ t (mod m). Then x + s ≡ y + t (mod m) and xs ≡ yt (mod m).

Proof: Suppose x - y = am and s - t = bm. Then (x + s) - (y + t) = (x - y) + (s - t) = am + bm = (a + b)m. So x + s ≡ y + t (mod m). Also xs - yt = (xs - xt) + (xt - yt) = x(s - t) + (x - y)t = xbm + amt = m(xb + at). So xs ≡ yt (mod m).

Telling time is an obvious example of the addition part of this theorem. For instance, 5 hours after 9 o'clock comes 2 o'clock, since 9 + 5 ≡ 2 (mod 12).
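The definition of congruence and the two parts of Theorem 3.11 can be checked numerically. The sketch below is only a spot check, not a proof, and the helper names is_congruent and theorem_3_11_holds are invented for this illustration. For a positive modulus m, Python's % operator returns the representative in 0, 1, ..., m-1.

```python
def is_congruent(x, y, m):
    """True when m divides x - y, that is, x ≡ y (mod m)."""
    return (x - y) % m == 0

def theorem_3_11_holds(x, y, s, t, m):
    """Check the sum and product rules for one choice of congruent pairs."""
    if not (is_congruent(x, y, m) and is_congruent(s, t, m)):
        raise ValueError("the hypotheses of the theorem are not satisfied")
    return is_congruent(x + s, y + t, m) and is_congruent(x * s, y * t, m)

print(is_congruent(60 + 3, 3, 12))    # 60 hours after 3 o'clock
print((9 + 5) % 12)                   # 5 hours after 9 o'clock
print(theorem_3_11_holds(17, 5, 26, 2, 12))
```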

Linear congruences
The first type of equation to investigate is the linear congruence ax ≡ b (mod m). One needed fact is an easy consequence of the Euclidean Algorithm.

Proof: Suppose a | bc and (a, b) = 1, say bc = ax. Because (a, b) = 1, there are integers m, n such that 1 = ma + nb. Then c = cma + cnb = cma + nax = a(cm + nx). Thus a | c.

This allows proof of a limited cancellation theorem. In ordinary algebra, if ax = ay and a ≠ 0, then x = y. Such a general theorem is impossible for congruences: for example, 2(1) ≡ 2(4) (mod 6), but 1 and 4 are not congruent modulo 6.

Proof: Since ax ≡ ay (mod m), a(x - y) ≡ 0 (mod m). Thus m | a(x - y). But, since (a, m) = 1, Lemma 3.12 assures m | (x - y). Therefore, x ≡ y (mod m).

Proof: If ax ≡ b (mod m), then ax - b = my for some integer y. Thus b = ax - my. But since d | a and d | m, this means d | b, contrary to the theorem's hypothesis. (Note: a = ds, m = dt means b = dsx - dty = d(sx - ty).)

Theorem 3.16. Suppose d = (a, m) | b. Then ax ≡ b (mod m) has a unique solution modulo m/d and d solutions modulo m.

Proof: Let b = dr, a = ds, m = dt. Then ax ≡ b (mod m) if and only if ax - b = my for some integer y. Substituting for a, b, m gives dsx - dr = dty. Canceling d gives sx - r = ty, or sx ≡ r (mod t). Since (s, t) = 1, Theorem 3.14 shows that this has a unique solution modulo t = m/d. Call this solution A (so sA ≡ r (mod t)). Then A, A + t, ..., A + (d-1)t are the d solutions (modulo m). Note that these are all congruent modulo t but incongruent modulo m, and notice that since sA - r = tB for some B, a(A + kt) - b = d(sA - r) + dskt = m(B + sk), so each A + kt does satisfy the congruence.
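The proof translates directly into a procedure. The following Python sketch is one possible rendering, assuming positive a and m; the name solve_linear_congruence is hypothetical, and the three-argument pow with exponent -1 (available in Python 3.8 and later) is used here in place of the Euclidean Algorithm to find the inverse of s modulo t.

```python
from math import gcd

def solve_linear_congruence(a, b, m):
    """Return all x in 0..m-1 with a*x ≡ b (mod m), in increasing order."""
    d = gcd(a, m)
    if b % d != 0:
        return []                      # no solution when d does not divide b
    s, r, t = a // d, b // d, m // d
    # Solve s*x ≡ r (mod t); the inverse of s exists because (s, t) = 1.
    A = (r * pow(s, -1, t)) % t if t > 1 else 0
    # The d solutions modulo m are A, A + t, ..., A + (d-1)t.
    return [A + k * t for k in range(d)]

print(solve_linear_congruence(6, 4, 10))
print(solve_linear_congruence(2, 1, 4))
```

In the first call d = 2, so the congruence 6x ≡ 4 (mod 10) reduces to 3x ≡ 2 (mod 5); in the second, d = 2 does not divide 1 and there is no solution.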

The ability to solve congruences leads to some interesting results.

Theorem 3.17 (Fermat's Little Theorem). Suppose p is a prime, (a, p) = 1. Then a^(p-1) ≡ 1 (mod p).

Proof: Consider the integers a, 2a, 3a, ..., (p-1)a. These p-1 integers are all incongruent mod p, since by Theorem 3.13 sa ≡ ta (mod p) implies s ≡ t (mod p). Also, none of them is divisible by p (by Lemma 3.12, if p | ka, then p | k, which is false). Thus these p - 1 integers are congruent to the integers 1, 2, ..., p-1 in some order. Therefore, a(2a)...(p-1)a ≡ 1(2)...(p-1) (mod p). By Theorem 3.13, cancellation of 2(3)...(p-1) gives a^(p-1) ≡ 1 (mod p).

Fermat's Little Theorem can be used to perform some amazing computations. For example, it can be used to compute the remainder when 3¹⁴⁶⁰ is divided by 113, certainly a result unlikely to be found by direct computation. But Theorem 3.17 assures 3¹¹² ≡ 1 (mod 113). Simple division gives 1460 = 112(13) + 4. Therefore 3¹⁴⁶⁰ = 3^(112(13)+4) = (3¹¹²)¹³ 3⁴ ≡ 1(3⁴) ≡ 81 (mod 113). So the remainder when 3¹⁴⁶⁰ is divided by 113 is 81.
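The idea of reducing the exponent modulo p - 1 before doing any multiplication can be sketched as follows. The function name power_mod_prime is an assumption of this illustration, and the caller is trusted to supply a prime p. With p = 113 and base 3, an exponent of the form 112(13) + 4 reduces to 4.

```python
def power_mod_prime(a, e, p):
    """Compute a**e mod p for a prime p not dividing a, with e >= 0.

    By Fermat's Little Theorem a**(p-1) ≡ 1 (mod p), so only e mod (p-1) matters.
    """
    if a % p == 0:
        raise ValueError("a must not be divisible by p")
    reduced_exponent = e % (p - 1)
    result = 1
    for _ in range(reduced_exponent):   # at most p - 2 multiplications
        result = (result * a) % p
    return result

print(power_mod_prime(3, 112, 113))
print(power_mod_prime(3, 112 * 13 + 4, 113))
```

The result can be compared with Python's built-in pow(a, e, p), which computes the same remainder by repeated squaring.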

The situation of Theorem 3.17 can be generalized.

Definition 3.18. Let m be a positive integer. A complete reduced residue system modulo m is the set of numbers obtained from a complete residue system by removing all the integers which have a factor in common with m.

For example, the integers 1, 2, ..., p-1 are a reduced residue system modulo a prime p. Also, the integers 1, 3, 7, 9, 11, 13, 17, 19 form a reduced residue system modulo 20.

Definition 3.19. The Euler φ-function (phi-function) φ(m) (or, phi(m) in subscripts or superscripts) is the number of integers in a complete reduced residue system modulo m.

For example φ(p) = p-1, φ(20) = 8.
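Definitions 3.18 and 3.19 can be turned into a direct, if slow, computation. In this Python sketch the names reduced_residue_system and phi are illustrative choices; the complete residue system 1, 2, ..., m is used as the starting point, and counting is done by brute force rather than by any formula.

```python
from math import gcd

def reduced_residue_system(m):
    """Return the integers in 1..m that have no factor in common with m."""
    return [k for k in range(1, m + 1) if gcd(k, m) == 1]

def phi(m):
    """Euler phi-function: the size of a reduced residue system modulo m."""
    return len(reduced_residue_system(m))

print(reduced_residue_system(20))
print(phi(20), phi(13))
```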

The Euler φ-function has been studied extensively. The interested reader should consult any of the numerous number theory texts listed in the references. Fermat's Little Theorem can be generalized as follows:

Theorem 3.20 (Euler's Theorem). Suppose (a, m) = 1. Then a^phi(m) ≡ 1 (mod m).

Proof: Let 1, x₂, x₃, ..., x_phi(m) be a reduced residue system modulo m. As in Theorem 3.17, a, ax₂, ..., ax_phi(m) are all incongruent modulo m; also, since each has no factor in common with m, they form a complete reduced residue system and are congruent to 1, x₂, ..., x_phi(m) in some order. Therefore a(ax₂)...(ax_phi(m)) ≡ x₂x₃...x_phi(m) (mod m), so by cancellation a^phi(m) ≡ 1 (mod m).

Again, by way of example, it is easy to compute the remainder when 13⁸⁰⁵⁹ is divided by 20. By Euler's Theorem, 13^phi(20) = 13⁸ ≡ 1 (mod 20). Since 8059 = 8(1007) + 3, it follows that 13⁸⁰⁵⁹ = (13⁸)¹⁰⁰⁷ 13³ ≡ 13³ (mod 20). But 13² = 169 ≡ 9 (mod 20). Therefore 13³ = 13²(13) ≡ 9(13) = 117 ≡ 17 (mod 20). Thus the remainder is 17.
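The same computation can be sketched in Python. The helper names phi and power_mod_euler are assumptions of this illustration; phi is computed by brute-force counting, and the final small power is left to the built-in pow.

```python
from math import gcd

def phi(m):
    """Count the integers in 1..m that are relatively prime to m."""
    return sum(1 for k in range(1, m + 1) if gcd(k, m) == 1)

def power_mod_euler(a, e, m):
    """Compute a**e mod m when (a, m) = 1 by first reducing e modulo phi(m)."""
    if gcd(a, m) != 1:
        raise ValueError("Euler's Theorem requires (a, m) = 1")
    reduced_exponent = e % phi(m)
    return pow(a, reduced_exponent, m)

print(8059 % phi(20))
print(power_mod_euler(13, 8059, 20))
```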

Wilson's Theorem and Quadratic Residues

Definition 3.21. Let m be a positive integer, (a, m) = 1; a is said to be a quadratic residue modulo m if the congruence x² ≡ a (mod m) has a solution.

A great deal of discussion about quadratic residues centers on prime moduli. The case p = 2 is not very interesting, since a is a quadratic residue if and only if a ≡ 1 (mod 2). So odd primes are of the most interest.

Theorem 3.22. Let p be an odd prime and (a, p) = 1. Then the congruence x² ≡ a (mod p) has either 0 or 2 solutions modulo p.

Proof: If s² ≡ a (mod p), then (-s)² ≡ a (mod p), and since p is odd, s and -s are incongruent (mod p). (s ≡ -s (mod p) implies 2s ≡ 0 (mod p), which is impossible since p divides neither 2 nor s.) Thus, if there is one solution, there are two solutions. If t is another solution, then t² ≡ a ≡ s² (mod p). Then t² - s² ≡ 0 (mod p), so (t - s)(t + s) ≡ 0 (mod p). Thus p is a divisor of either t - s or t + s, so either t ≡ s (mod p) or t ≡ -s (mod p). So there are no other solutions.

A surprising consequence of this simple fact is

Theorem 3.23 (Wilson's Theorem). If p is a prime, then (p - 1)! ≡ -1 (mod p).

Proof: If p = 2, this just says 1 ≡ -1 (mod 2), which is true. If p is an odd prime, the congruence x² ≡ 1 (mod p) has the two solutions x ≡ 1 (mod p) and x ≡ -1 ≡ p - 1 (mod p). If a is an integer between 2 and p - 2, then the congruence ax ≡ 1 (mod p) has a unique solution by Theorem 3.14, and that solution is not congruent to a modulo p, or a would satisfy a² ≡ 1 (mod p), not possible by Theorem 3.22. Thus the integers 2, ..., (p - 2) can be grouped into (p-3)/2 pairs (a_i, a_i^-1), 1 ≤ i ≤ (p-3)/2, such that for each i, a_i a_i^-1 ≡ 1 (mod p). Thus the product (p-1)! ≡ 1(p-1)(a_1 a_1^-1)...(a_k a_k^-1) ≡ p - 1 ≡ -1 (mod p), where k = (p-3)/2.
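The pairing used in the proof can be made concrete. In the Python sketch below, the names wilson_holds and inverse_pairs are hypothetical, and inverses are found with pow(a, -1, p), which assumes Python 3.8 or later. The check is a numerical illustration for small primes only.

```python
from math import factorial

def wilson_holds(p):
    """True when (p - 1)! ≡ -1 (mod p)."""
    return factorial(p - 1) % p == p - 1

def inverse_pairs(p):
    """Group 2, ..., p-2 into pairs (a, inverse of a) modulo the odd prime p."""
    pairs, seen = [], set()
    for a in range(2, p - 1):
        if a not in seen:
            inverse = pow(a, -1, p)      # the unique solution of a*x ≡ 1 (mod p)
            pairs.append((a, inverse))
            seen.update((a, inverse))
    return pairs

print(inverse_pairs(7))
print([p for p in (2, 3, 5, 7, 11, 13) if wilson_holds(p)])
```

For p = 7 the pairs are (2, 4) and (3, 5), so 6! ≡ 1(6)(2·4)(3·5) ≡ 6 ≡ -1 (mod 7).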

Quadratic Residues and Euler's Criterion

Suppose p is an odd prime. Clearly 1², 2², ..., ((p-1)/2)² are all quadratic residues. Further, if 1 ≤ x, y ≤ (p-1)/2 and x ≠ y, then x² and y² are incongruent modulo p, since x² ≡ y² (mod p) implies (x-y)(x+y) ≡ 0 (mod p); but then either p | (x-y) and x = y, or p | (x+y), which is impossible since 2 ≤ x+y ≤ p-1. So there are (p-1)/2 quadratic residues accounted for. But (-1)², (-2)², ..., (-(p-1)/2)² give the same numbers as 1², 2², ..., ((p-1)/2)²; so since -(p-1)/2, ..., -1, 1, ..., (p-1)/2 is a complete reduced residue system modulo p, there are exactly (p-1)/2 quadratic residues mod p and hence there are (p-1)/2 quadratic non-residues modulo p.

Definition 3.24 (Legendre symbol). The symbol (a/p), called the Legendre symbol, is defined by

(a/p) = 1 if a is a quadratic residue mod p,
(a/p) = -1 if a is not a quadratic residue mod p.

Some number theorists write (0/p) = 0. Clearly 0² ≡ 0 (mod p), but quadratic residues are defined to be relatively prime to the modulus.

Now suppose a is a quadratic residue. Then there is an x with x² ≡ a (mod p). But by Fermat's Little Theorem x^(p-1) ≡ 1 (mod p), so a^((p-1)/2) ≡ x^(p-1) ≡ 1 (mod p). On the other hand, suppose a is not a quadratic residue. Then, if (r, p) = 1, the congruence rx ≡ a (mod p) has a unique solution s (mod p) by Theorem 3.14, and s is not congruent to r (mod p) (otherwise a would be a quadratic residue). Thus the elements of the set 1, 2, ..., p-1 can be grouped into (p-1)/2 pairs, where the product of each pair is congruent to a (mod p). Thus a^((p-1)/2) ≡ 1(2)...(p-1) ≡ -1 (mod p) by Wilson's Theorem. Thus we have proved

Theorem 3.25 (Euler's Criterion). (a/p) ≡ a^((p-1)/2) (mod p).
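Euler's Criterion gives a practical way to evaluate the Legendre symbol without searching for square roots. The following Python sketch assumes p is an odd prime and (a, p) = 1; the function name legendre is a choice made for this illustration.

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p with (a, p) = 1, by Euler's Criterion."""
    if a % p == 0:
        raise ValueError("a must be relatively prime to p")
    # pow returns a representative in 0..p-1, so the value is either 1 or p - 1.
    value = pow(a, (p - 1) // 2, p)
    return 1 if value == 1 else -1

print([a for a in range(1, 7) if legendre(a, 7) == 1])
print(legendre(-1, 7), legendre(-1, 13))
```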

This enables one to prove some interesting facts about quadratic residues.

Theorem 3.26

a) (a²/p) = 1
b) (a/p) = (b/p) if a ≡ b (mod p)
c) ((ab)/p) = (a/p)(b/p)
d) (-1/p) = (-1)^((p-1)/2)

Proof:
a) and b) are obvious, d) is a restatement of Euler's Criterion for a = -1, and c) follows since (ab)^((p-1)/2) = a^((p-1)/2) b^((p-1)/2).

So -1 is a quadratic residue of primes p ≡ 1 (mod 4), since then (p-1)/2 is even, but not of primes p ≡ 3 (mod 4), since then (p-1)/2 is odd. The product of quadratic residues is a quadratic residue, but so is the product of quadratic non-residues. The product of a residue and a non-residue is a non-residue.

For example, if p = 7, the quadratic residues are congruent to 1², 2², 3² (mod 7); that is, congruent to 1, 2, 4. Note that -1 is not a residue. Since 2 and 4 are residues, so is 2(4) ≡ 1 (mod 7). Since 3 and 6 are non-residues, 3(6) ≡ 4 (mod 7) is a residue. But 4(6) ≡ 3 (mod 7), the product of a residue and a non-residue, is a non-residue.

Similarly, for p = 13 the residues are 1, 3, 4, 9, 10, 12 and the non-residues are 2, 5, 6, 7, 8, 11; -1 ≡ 12 (mod 13) is a residue, 4(9) ≡ 10 (mod 13) is a residue, 5(6) ≡ 4 (mod 13) is a residue, and 4(7) ≡ 2 (mod 13) is a non-residue.

For p=11, the residues are 1, 3, 4, 5, 9; the non-residues 2, 6, 7, 8, 10.
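Lists like these can be reproduced by squaring 1, 2, ..., (p-1)/2, as described at the start of this section. The Python sketch below does that and also spot-checks the product rules for a given odd prime; the names quadratic_residues and product_rule_holds are invented for this illustration.

```python
def quadratic_residues(p):
    """Return the quadratic residues modulo the odd prime p, in increasing order."""
    return sorted({(x * x) % p for x in range(1, (p - 1) // 2 + 1)})

def product_rule_holds(p):
    """Check that a*b is a residue exactly when a and b are of the same kind."""
    residues = set(quadratic_residues(p))
    for a in range(1, p):
        for b in range(1, p):
            product_is_residue = (a * b) % p in residues
            same_kind = (a in residues) == (b in residues)
            if product_is_residue != same_kind:
                return False
    return True

for p in (7, 11, 13):
    print(p, quadratic_residues(p), product_rule_holds(p))
```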

Quadratic Reciprocity

The most famous theorem relating quadratic residues, the Law of Quadratic Reciprocity, was first proved by Gauss.

Theorem 3.27. Gauss's Law of Quadratic Reciprocity.
If p, q are distinct odd primes, then (p/q)(q/p) = (-1)^{((p-1)/2)((q-1)/2)} = (-1)^{((p-1)(q-1))/4}.

This theorem has been proved many times in many ways, and many proofs are available on the Internet. Take a look at the linked paper to see one contemporary strategy. Compare the proof of Wilson's Theorem in the linked material to the proof of Wilson's Theorem above in order to gain insight into the strategy in the linked paper. Then consider the example below, which illustrates the power of the theorem, before reading the proof of the Quadratic Reciprocity law that appears in that link.

Let p be a prime greater than 3. Then
(p/3)(3/p) = (-1)^((p-1)(3-1)/4) = (-1)^((p-1)/2) = (-1/p)
Therefore, (p/3)(3/p)(3/p) = (-1/p)(3/p) = (-3/p)
Therefore (p/3) = (-3/p), since (3/p)(3/p) = 1.
But (p/3) = (1/3) = 1 if p ≡ 1 (mod 3) and (p/3) = (2/3) = -1 if p ≡ 2 (mod 3). Thus -3 is a quadratic residue of the primes p ≡ 1 (mod 3) but not of the primes p ≡ 2 (mod 3).
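Both the reciprocity law and the conclusion about -3 can be spot-checked numerically for small primes. This Python sketch is a check, not a proof; the names legendre and reciprocity_holds are assumptions of the illustration, and the Legendre symbol is evaluated by Euler's Criterion.

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p with (a, p) = 1, by Euler's Criterion."""
    if a % p == 0:
        raise ValueError("a must be relatively prime to p")
    return 1 if pow(a, (p - 1) // 2, p) == 1 else -1

def reciprocity_holds(p, q):
    """Check (p/q)(q/p) = (-1)^(((p-1)/2)((q-1)/2)) for distinct odd primes p, q."""
    left = legendre(p, q) * legendre(q, p)
    right = (-1) ** (((p - 1) // 2) * ((q - 1) // 2))
    return left == right

odd_primes = [3, 5, 7, 11, 13, 17, 19, 23, 29, 31]
print(all(reciprocity_holds(p, q) for p in odd_primes for q in odd_primes if p != q))

# -3 should be a residue exactly for the primes p ≡ 1 (mod 3).
for p in odd_primes[1:]:
    print(p, p % 3, legendre(-3, p))
```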