Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Foreword..............................................................................................7
I. Arithmetic in integral domains ....................................................10
I.1 Divisibility...................................................................................10
I.2 Euclidian domains .......................................................................22
I.3 Euclidian rings of quadratic integers...........................................26
I.4 Principal ideal domains ...............................................................31
I.5 Unique factorization domains .....................................................36
I.6 Polynomial ring arithmetic..........................................................43
Exercises ...........................................................................................50
II. Modules.........................................................................................58
II.1 Modules, submodules, homomorphisms....................................58
Exercises ...........................................................................................72
II.2 Factor modules and the isomorphism theorems.........................74
Exercises ...........................................................................................80
II.3 Direct sums and products. Exact sequences ..............................81
Exercises .........................................................................................102
II.4 Free modules ............................................................................104
Exercises .........................................................................................116
III. Finitely generated modules over principal ideal domains ....118
III.1 The submodules of a free module...........................................118
Exercises .........................................................................................128
III.2 Finitely generated modules over a principal ideal domain.....130
Exercises .........................................................................................138
III.3 Indecomposable finitely generated modules ..........................140
Exercises .........................................................................................149
6
7
8
think that the Categories are particularly useful for a better understan-
ding and for unifying many algebraic concepts and proofs.
The chapters on Modules (II, III) can be read independently of the
rest of the book. The section VI.1, Ruler and compass constructions, is
not a prerequisite for the other sections in chapter VI.
I.1 Divisibility
10
I.1 Divisibility 11
d ≠ 1 and d is not divisible by the square of any integer greater than 1),
the subring of C generated by 1 and d , denoted by " [ d ] , is a do-
main. One easily checks that " [ d ] consists of the complex numbers
of the form a + b d , with a, b ∈ Z. The ring " [ −1 ] is called the
ring of Gauss integers.1
c) If R is a domain and n ∈ N*, then the polynomial ring in n
indeterminates with coefficients in R, R[X1,..., Xn], is a domain.
In a domain, one can simplify the nonzero factors:
1
Carl Friedrich Gauss (1777 − 1855), famous German mathematician.
I.1 Divisibility 13
1.10 Remarks. a) Given a1, ..., an ∈ R, if there exists a GCD for a1,
..., an, say d ∈ R, then d is uniquely determined up to association in
divisibility: if e is also a GCD of a1, ..., an, then e ∼ d. Moreover, if
e ∼ d, then e is a GCD of a1, ..., an.
The same remark applies to the LCM.
b) When the domain R in which we work is not clear from the con-
text, we use occasionally a subscript, like in the notation (a1, ..., an)R.
For a given domain R and given x, y ∈ R, a GCD(x, y) may not ex-
ist (see the Exercises for some examples). A domain R with the prop-
erty that any two elements x, y ∈ R possess a GCD is called a GCD
domain. For instance, Z is a GCD domain.
Writing d = (a1, ..., an) means that d is associated with a GCD of
a1, ..., an. This can lead to some oddities: in Z, we can write 1 = (1, 2)
= −1, but this does not imply that 1 = −1 (of course, it implies that 1 ∼
−1).
c) Note that a1, a2 are coprime if and only if all their common divi-
sors are units in R.
d) For any domain R and any a ∈ R, there exists GCD(a, 0) = a. If u
is a unit, then there exists GCD(a, u) = u. What can you say about the
LCM in these cases?
If d ≠ 0 and d|a, a/d denotes the unique element x ∈ R with a = dx.
1.11 Proposition. Let R be a domain and let a1, ..., an, r ∈ R \{0}.
a) If there exists d = (a1, ..., an), then a1/d,..., an/d have a GCD,
equal to 1.
b) If there exists (a1, ..., an) =: d and exists (ra1, ..., ran) =: e, then
e = rd. Thus:
(ra1, ..., ran) = r(a1, ..., an).
c) If there exists [a1, ..., an] = m and exists [ra1, ..., ran] =: µ, then
µ = rm. Thus:
[ra1, ..., ran] = r[a1, ..., an].
16 I. Arithmetic in integral domains
LCM implies the existence of the GCD, but not conversely, in gen-
eral).
2
The converse is false in general. For a counterexample, see the section
Principal Ideal domains.
I.1 Divisibility 21
degree function deg : K[X] \ {0} → N). These rings are also the most
important examples of Euclidian domains.
3
We hope that the algorithm is clear to the reader. We do not want to present a
rigorous “pseudo-programming language” or use strict syntax rules from a particular
language. Also, this algorithm is intended to serve a theoretical purpose; for
instance, “finding” the elements q, r at step 1 does not imply a description of a
concrete procedure (such procedures can be given for particular rings as Z, Q[X],
…) and merely uses the fact that these elements exist. Moreover, implementing this
algorithm must take into account computer representations of the elements of R,
decision algorithms of the equality of two elements in R, addition and multiplication
algorithms in R etc. These important issues are not discussed here.
I.2 Euclidian domains 25
4
A polynomial is called monic if the coefficient of its monomial of maximum
degree is 1.
5
Because every algebraic integer over Z which is rational (in Q) must be in Z.
Prove!
28 I. Arithmetic in integral domains
Tr(α) := α + α = 2a
for any α = a + b d ∈ Q[ d ] (a, b ∈ Q).
The norm N is multiplicative and the trace Tr is additive: for any
α, β ∈ Q[ d ],
N(αβ) = N(α)N(β),
Tr(α + β) = Tr(α) + Tr(β).
One can prove that, for any x ∈ Q[ d ] : x is integral over Z (x is a
quadratic integer) ⇔ Tr(x) ∈ Z and N(x) ∈ Z.
The quadratic integers in Q[ d ] form a ring, called the ring of
integers of Q[ d ] . This ring is traditionally called a ring of quadratic
integers (imaginary if d < 0, respectively real if d > 0). We have the
following (for the proof, see the Exercises):
d ≡ 2 or 3 (mod 4) d ≡ 1 (mod 4)
6
For any z ∈ C and any ε ∈ R, ε > 0, there exists x ∈ Q[ d ] such that |z − x| < ε.
7
This result was obtained in 1923 by L. E. Dickson.
I.4 Principal ideal domains 31
We obtain that the following rings are Euclidian with respect to |N|:
Z[i], " [i 2 ] , " ⎡⎣(1 + i 3 ) 2 ⎤⎦ , " ⎡⎣(1 + i 7 ) 2 ⎤⎦ , " ⎡⎣(1 + i 11 ) 2 ⎤⎦ .
One can prove that these are all the imaginary Euclidian rings of
quadratic integers (not necessarily with respect to |N|). The real case
d > 0 has no geometric interpretation and is considerably more diffi-
cult.
8
Sometimes the fields are not considered principal ideal domains by definition.
32 I. Arithmetic in integral domains
pal, so R[X] is not a PID. In particular, the rings Z[X], K[X, Y], with K
a field, are not PID's.
Indeed, suppose that for some f ∈ R[X], we have ( f ) = (r, X). Then
f | r. We obtain deg f = 0, so f ∈ R. Since f | X, there exists g ∈ R[X]
with X = fg, so f is invertible in R. Thus, the GCD of r and X is 1, so
f = 1. But the ideal generated by r and X does not contain 1: if h, q ∈
R[X] are such that 1 = hr + qX, setting X = 0 in this equality of
polynomials, it follows that 1 = h(0)·r, which means r is invertible, a
contradiction.
Using Proposition 1.19 and the fact that any PID is a GCD-domain,
we obtain:
9
The products may contain a single factor (i.e., the element itself is a prime).
I.4 Principal ideal domains 35
10
Emmy Noether (1882–1935), German.
36 I. Arithmetic in integral domains
11
Such a product is also called a prime decomposition of the element. Products
may have a single factor.
I.5 Unique factorization domains 37
In this sum, aibj is not divisible by p and the other terms are divisi-
ble by p (as products of two factors, at least one of which is divisible
12
This is called “Gauss' Lemma”.
42 I. Arithmetic in integral domains
i+j
by p). So, the coefficient of X is not divisible by p and neither is the
polynomial fg .
b) If fg is not primitive, there exists p ∈ R, prime, such that p | fg.
The previous paragraph implies p | f or p | g, contradiction.
c) Let f = c{ f }·f1, g = c(g)·g1, where f1 and g1 are primitive. Then
fg = c{ f }c(g)·f1·g1,
with f1g1 primitive by b). It is clear now that c{ fg} = c{ f }c(g). !
6.1 Remark. Given f ∈ R[X], the following simple facts are worth
remembering:
- if deg f = 0, then f ∈ R. In this case, f is irreducible in R[X] if and
only if it is irreducible in R. If R = K (R is a field), then f is invertible
and thus reducible.
- if deg f = n > 0, then f is irreducible in R[X] if and only if f has no
non invertible divisors of degree 0 and there are no decompositions
f = gh, with g, h ∈ R[X] and 1 ≤ deg g, deg h < n.
The fact that f has no non-invertible divisors of degree 0 amounts to
saying that the GCD of the coefficients of f exists and is 1. In practice,
when R is a UFD, this condition reads “f is primitive”. Recall that, in
this case, f is irreducible in R[X] ⇔ f is primitive and f is irreducible in
K[X].
If R is a domain and not a field, R[X] is not principal (and certainly
not Euclidian), so the theorem of division with remainder does not
hold in R[X]. However, if f, g ∈ R[X], and g has as leading coefficient
a unit, the argument of the proof of the division with remainder theo-
rem for K[X] (K a field) still holds (see Example2.5.b)). The proof of
the following result is left to the reader:
a) a is a root of f.
b) the polynomial X − a divides f in R[X].
Proof. There exist q, r ∈ R[X] such that f = (X − a)q + r, where
deg r = 0 or r = 0. Note that X − a divides f if and only if r = 0. But
f (a) = (a − a)q(a) + r(a) = r, so f (a) = 0 is equivalent to r = 0. !
This theorem is the basis of the notion of multiple root:
g = f ( X + 1) =
X +1−1
=∑ i X
i =1
()
Remark that the Eisenstein criterion can be applied to g, since p di-
vides all the binomial coefficients ( ) , with 1 ≤ i < p. Thus, g is
p
i
Exercises
1
Although the volume Algèbre linéaire (1961) of the famous Bourbaki series
“Eléments de Mathématique” begins with the definition of the… module.
58
II.1 Modules, submodules, homomorphisms 59
2
Also called “multiplication of elements in M with scalars in R”.
60 II. Modules
n
r(x1, …, xn) = (rx1, …, rxn), ∀r ∈ R, ∀(x1, …, xn) ∈ R .
e) If R is a ring and m, n ∈ N*, the set Mm,n(R) of m×n matrices with
entries in R is an Abelian group endowed with usual matrix multipli-
cation and becomes an R-module by defining the “multiplication of
matrices with scalars”: for r ∈ R, A = (aij) ∈ Mm, n(R), r(aij) := (raij)
(multiply every entry of the matrix with r).
f) Let R := M2(Z) (the ring of 2×2 matrices with entries in Z) and
M := M2, 1(Z) (the Abelian group of 2×1 matrices with entries in Z). M
has a natural structure of left R-module: ∀A ∈ M2(Z) = R,
∀U ∈ M2, 1(Z) = M, AU ∈ M is the usual matrix product. Checking the
module axioms is straightforward and it boils down to the known
properties of matrix operations. Can you generalize this example? Can
M be endowed with a “natural” structure of a right R-module ?
g) Let ϕ : R → S be a unitary ring homomorphism. If M is a left
S-module, then M has a structure of left R-module by “restriction of
scalars”: ∀r ∈ R, ∀x ∈ M, rx := ϕ(r)x. In particular, S becomes a left
R-module (and also a right R-module). This example generalizes a
situation often encountered in field extensions: any field S is a vector
space over any subfield R.
3
Sometimes this ring is called the ring of left endomorphisms of M, emphasizing
that the functions are written at the left of the argument, like u(x); this forces the
definition of the composition of functions in the “usual” manner defined above. But
62 II. Modules
if one writes (x)u for the value of u at x, then the composition of u and v is defined as
(x)(uv) = ((x)u)v. With this multiplication, End(M) is called the ring of right
endomorphisms of M and it is the opposite of the ring of left endomorphisms of M.
II.1 Modules, submodules, homomorphisms 63
4
Note that a “family (ei)i ∈ I of elements of M” is in fact a function f : I → M
(denoting f (i) = ei, ∀i ∈ I ).
5
An ordered set A is called a complete lattice if, for any B ⊆ A, there exists
sup B (the smallest upper bound of B) and inf B (the largest lower bound of B) in A.
II.1 Modules, submodules, homomorphisms 67
6
Here is a (singular) situation when the union of a family of submodules is a
submodule.
68 II. Modules
7
Wolfgang Adolf Ludwig Helmuth Krull (1899-1971), German mathematician.
8
A geometric terminology, used mainly for vector space homomorphisms.
II.1 Modules, submodules, homomorphisms 69
Exercises
2
6. Determine all submodules of the R-module R .
7. Give an example of a module M and A, B, C ≤ M such that
A∩(B + C) ≠ A∩B + A∩C (i.e., LR(M ) is not distributive). (Hint. Try a
vector space.)
8. Let (G, +) be an Abelian group and n ∈ N* such that na = 0,
∀a ∈ G. Then G has a canonical structure of Zn-module. Is the con-
verse true? Generalization.
2
9. Identify the Euclidian plane with R , seen as an R-vector space.
Which of the following transformations of the plane is a linear
2 2
transformation (an R-module homomorphism) from R to R ?
a) The rotation of angle α around (0, 0).
b) The rotation of angle α around (0, 1).
c) The translation by the vector v = (x, y).
d) The symmetry with respect to a line.
e) The projection on a line.
S
10. Let R be a ring, let S be a set and R := {ϕ : S → R}. Define a left
S
R-module structure on R . More generally, let M be a left R-module
S
and let S be a set. Define a left R-module structure on M
= {ϕ : S → M}.
11. Let R be a ring. Show that, for any R M, R M',
HomR(M, M') ⊆ HomZ(M, M'). Give examples of R, M, M' such that
the inclusion is strict.
12. Let u : M → M' be an R-module homomorphism, A, B ≤ R M and
A', B' ≤ R M'. Study the validity of the statements:
a) u(A + B) = u(A) + u(B).
b) u(A ∩ B) = u(A) ∩ u(B).
−1 −1 −1
c) u (A' + B') = u (A') + u (B').
−1 −1 −1
d) u (A' ∩ B') = u (A') ∩ u (B').
13. Let R be a commutative ring. Show that EndR(R) ≅ R. More gener-
ally, for any R M, HomR(R, M) ≅ M.
74 II. Modules
The method used to construct the factor ring (given a ring and an
ideal in the ring) can be applied to modules, with minor modifications.
Let M be a left R-module and let L be a submodule in M. Consider-
ing only the Abelian group structure on M, L is a subgroup in M, so
we can construct the factor group M/L, which is also an Abelian
group. The Abelian group M/L can be endowed with a natural
R-module structure, inherited from the R-module structure on M.
II.2 Factor modules and the isomorphism theorems 75
2.1 Definition. The R-module M/L defined above is called the fac-
tor module of M with respect to L. The map π : M → M/L,
π(x) = x + L, ∀x ∈ M, is a surjective module homomorphism, called
the canonical homomorphism or the canonical surjection.
76 II. Modules
Kerϕ = {x ∈ F | x + E = 0 + E} = {x ∈ F | x ∈ E} = E ∩ F. Apply
the fundamental isomorphism theorem, we get F/(E ∩ F) ≅ (E + F)/E.!
The following result is often used in module theory arguments:
Exercises
9
So, v1 factorizes through π1 and v2 factorizes through π2.
II.3 Direct sums and products. Exact sequences 83
10
This is exactly the universality property of the direct product.
II.3 Direct sums and products. Exact sequences 85
(U), apply (U) for P with the homomorphisms (pi)i ∈ I, there exists a
unique homomorphism ϕ : P → Q, such that pi = qi◦ϕ. We show that ϕ
and ψ are inverse one to each other. Indeed, ϕ ◦ψ : Q → Q satisfies
qi◦(ϕ ◦ψ) = (qi◦ϕ)◦ψ = pi◦ψ = qi, ∀i ∈ I; but the homomorphism idQ
: Q → Q has the same property: qi◦idQ = qi, ∀i ∈ I. By uniqueness,
guaranteed by (U), we get ϕ ◦ψ = idQ. Similarly, ψ ◦ϕ = idP. !
vi v
E
86 II. Modules
The module (i∈I Mi is often denoted ⊕i∈I Mi. For a finite family
n
M1, …, Mn, the notation is M1 ⊕…⊕ Mn or ⊕ M i . If the modules in
i =1
the family (Mi)i∈I are all equal to the same module M, ⊕i∈I Mi is de-
(I)
noted by M . In order to avoid the confusion with the notion of inter-
nal direct sum of submodules (see 3.11), (i∈I Mi is sometimes called
external direct sum.
As in the case of the direct product, the universality property
characterizes the direct sum of a family of modules up to an isomor-
phism. The reader is invited to formulate this precisely and prove it, as
in the case of the direct product.
3.8 Remark. If the set of indices I is finite, the module direct sum
⊕i∈I Mi = (i∈I Mi coincides with the module direct product ∏i∈I Mi in
3.2. Nevertheless, the direct sum is a couple (⊕i∈I Mi, (σi)i∈I) and is not
the same with the direct product {∏i∈I Mi, (πi)i∈I}.
88 II. Modules
⎛ ⎞
b)⇒c) Let xj ∈ Lj ∩ ⎜⎜ ∑ Li ⎟⎟ . Then there exists a family of finite
⎝ i∈I \{ j} ⎠
support, (xi)i∈I \ {j}, xi ∈ Li, ∀i ∈ I \ {j}, such that xj = ∑i∈I \ {j} xi. We
obtain that 0 is the sum of the finite support family (yi)i∈I, where
yi = xi, ∀i ≠ j and yj = −xj. Since 0 has a unique writing as a sum of a
finite support family (evidently, 0 is the sum of the family (0)i∈I), we
obtain xi = 0, ∀i ∈ I. So, xj = 0.
c)⇒d) Let (xi)i∈I, xi ∈ Li, ∀i ∈ I, be such that ∑i∈I xi = 0. Let j ∈ I.
⎛ ⎞
Then xj ∈ Lj; since xj = ∑i∈I \ {j}(−xi), we get xj ∈ ⎜⎜ ∑ Li ⎟⎟ . So, xj = 0.
⎝ i∈I \{ j} ⎠
d)⇒a) We show that (L, (ηi)i∈I) satisfies the definition 3.5. Let RE
and let a family of homomorphisms (vi)i∈I, vi : Mi → E. If x ∈ L, there
exists a family of finite support (xi)i∈I (xi ∈ Li, ∀i ∈ I), such that
x = ∑i∈I xi. From d), we obtain that this family is unique: if
∑i∈I xi = ∑i∈I yi, with yi ∈ Li, ∀i ∈ I, then ∑i∈I (xi − yi) = 0, so xi = yi,
∀i ∈ I.
Define the homomorphism ϕ : L → E by: ∀x ∈ L, put
ϕ(x) := ∑i∈I vi(xi), where (xi)i∈I is the unique family of finite support
with xi ∈ Li, ∀i ∈ I, and x = ∑i∈I xi. The map ϕ thus defined is an
R-module homomorphism: if x, y ∈ L, and (xi)i∈I, (yi)i∈I are the unique
families of finite support with xi, yi ∈ Li, ∀i ∈ I and such that
x = ∑i∈I xi, y = ∑i∈I yi, then x + y = ∑i∈I xi + ∑i∈I yi = ∑i∈I(xi + yi),
where (xi + yi)i∈I has finite support and xi + yi ∈ Li, ∀i ∈ I. So,
ϕ(x + y) = ∑i∈I vi(xi + yi) = ∑i∈I vi(xi) + ∑i∈I vi(yi) = ϕ(x) + ϕ(y).
Similarly one sees that ϕ(rx) = rϕ(x), ∀r ∈ R, ∀x ∈ L. If j ∈ I and
xj ∈ Lj, then (ϕ ◦ηj)(xj) = ϕ(xj) = vj(xj), which shows that ϕ ◦ηj = vj,
∀j ∈ I. We must show that ϕ is unique with this property. Let
90 II. Modules
ρ
∏ N i ⎯⎯→
⎯j N j
v : ∏ Mi → ∏ Ni such that ρj◦v = vj◦πj, ∀j ∈ I:
The homomorphism v is called the direct product of the family of
homomorphisms (vi)i∈I and is usually denoted by ∏i∈I vi or ×i∈I vi. If
∏i=1 vi
n
I = {1, …, n}, the notations are or v1 ×…× vn.
For any x = (xi)i∈I ∈ ∏ Mi, {∏i∈I vi}(x) = (vi(xi))i∈I ∈ ∏ Ni.
Similarly one defines the direct sum of the family of homomor-
phisms (vi)i∈I. Let (⊕ Mi, (σi)i∈I) (respectively (⊕Ni, (τi)i∈I)) be the di-
rect sum of the family (Mi)i∈I (respectively (Ni)i∈I). For any j ∈ I,
τj◦vj : Mj → ⊕Ni is a homomorphism. The universality property of the
II.3 Direct sums and products. Exact sequences 93
vj w
jτ
N j ⎯⎯→ ⊕ Ni
The homomorphism w is called the direct sum of the family (vi)i∈I
and is denoted by (i∈I vi or ⊕i∈I vi. If I = {1, …, n}, w is denoted by
⊕in=1 vi or v1⊕…⊕vn.
If x = ∑i∈I xi ∈ ⊕ Mi, where xi ∈ Mi and the family (xi)i∈I has finite
support11, then (⊕i∈I vi)(x) = ∑i∈I vi(xi) ∈ ⊕ Ni.
Let R-Mod be the category of the left R-modules and Ab the cate-
gory of Abelian groups.
3.19 Definition. (The Hom functors) For any A ∈ R-Mod, define
A
the (covariant) functor h : R-Mod → Ab:
A
∀ B∈ R-Mod, h (B) := HomR(A, B),
(note that HomR(A, B) is an Abelian group with respect to
homomorphism addition);
11
We identify xi with its image through the canonical injection σi(xi).
94 II. Modules
A
∀u : B → B' morphism in R-Mod, h (u) : HomR(A, B) → HomR(A, B')
is defined as
A
h (u)(g) := u◦g, ∀g ∈ HomR(A, B).
A A
It is immediate that h (u) is a morphism in Ab, that h (1B) = 1hA(B)
A A A
and that h (v◦u) = h (v)◦h (u), for any R-module B and any R-module
A
homomorphisms u : B → B' and ∀v : B' → B". So, h is a functor, also
denoted by HomR(A, -).
In a similar manner one defines the contravariant functor
hA : R-Mod → Ab. For any B ∈ R-Mod, hA(B) := HomR(B, A); for any
u : B → B' in R-Mod, hA(u) : HomR(B', A) → HomR(B, A) is given by
hA(u)(g) := g◦u, ∀g ∈ HomR(B', A). The functor hA is denoted also by
HomR(-, A).
We study the behavior of HomR(A, -) and HomR(-, A) with respect
to direct products and direct sums.
12
Called in the sequel sequence of modules.
II.3 Direct sums and products. Exact sequences 97
ι π
0 → A⎯
⎯→ B⎯
⎯→ B A→0
is exact.
13
The technique used is called “diagram chasing” and it is used extensively in
arguments involving diagrams of module homomorphisms.
II.3 Direct sums and products. Exact sequences 99
summand in B).
v
An epimorphism B ⎯ ⎯→ C → 0 is called split if the short exact se-
quence 0 → Ker v → B ⎯ ⎯→ C → 0 splits (⇔ Ker v is a direct sum-
v
mand in B).
II.3 Direct sums and products. Exact sequences 101
Exercises
4.2. Remarks. a) If there exists i ≠ j with xi = xj, then the family x1,
…, xn is linearly dependent: the linear combination xi − xj is 0. Thus,
in studying linear independence we may suppose that x1, …, xn are
distinct. On the other hand, the notion of linear independence does not
depend on the indexing of x1, …, xn. This is the reason we can speak
about a linearly dependent (finite) subset of M.
b) The set {x} (containing a single element x ∈ M ) is linear
independent if and only if ∀r ∈ R, rx = 0 implies r = 0. This suggest
the following definition: the annihilator of x in R is
14
The reference to the ring R is often omitted.
II.4 Free modules 105
4.5. Examples. a) The set {1}, containing only the unity of the ring
R (seen as a left R-module) is linearly independent. More generally,
∀r ∈ R, {r} is linearly dependent ⇔ r is a right zero divisor in R
(∃s ∈ R, s ≠ 0, such that sr = 0).
106 II. Modules
4.8. Examples. a) {∅} is a basis (the only one!) of the module {0}.
b) {1} is a basis of R R. More generally, {r} is a basis of R R if and
only if r is right invertible in R.
II.4 Free modules 107
(I)
c) If I is a set, the R-module R (the direct sum of | I | copies of R,
(I)
with canonical injections σi : R → R ) is free, a basis being {ei}i∈I,
(I) (I)
where ei = σi(1). This basis is called the canonical basis of R ; R is
also called the free R-module on the set I (or the free R-module of ba-
n
sis I ). If I = {1, …, n}, the canonical basis of the free R-module R
are e1 = (1, 0,…, 0), e2 = (0, 1,…, 0), …, en = (0, 0,…, 1).
A module homomorphism is determined by its values on a generat-
ing set. But, for an arbitrary generating set, there may be no
homomorphism that takes prescribed values on the elements of the
generating set. In the privileged case of free modules, for any choice
of the values on the elements of a basis, a unique homomorphism
takes the respective values on the elements of the basis:
v(x) := ∑i∈I riyi, for any x = ∑i∈I riei with (ri)i∈I family of elements in
R, having a finite support,
defines a module homomorphism. Since, for any x ∈ L there exists a
unique family (ri)i∈I of elements in R such that x = ∑i∈I riei, v is well
defined. Let x = ∑i∈I riei and y = ∑i∈I siei be elements in L, with (ri)i∈I,
(si)i∈I families of elements in R. For any a, b ∈ R, we have:
v(ax + by) =
v{∑i∈I ariei + ∑i∈I bsiei} = v{∑i∈I (ari + bsi)ei} = ∑i∈I (ari + bsi)yi =
= ∑i∈I ariyi + ∑i∈I bsiyi = av(x) + bv(y).
(I)
We prove a): Ker v = {∑i∈I riei | (ri)i∈I ∈ R , ∑i∈I riyi = 0}. It is
clear that Ker v = 0 ⇔ (yi)i∈I is linearly independent.
The rest of the proof is left to the reader. !
(I)
The R-modules of the type R are “all” free R-modules:
)u1
0 → Ker u1 → L1 ⎯i⎯ u0
→ L0 ⎯⎯→ M →0
since Ker u0 = Im i◦u1. One continues by induction on n such that ex-
ists an exact sequence of the form:
0 → Kn → Ln → … → L1 → L0 → M → 0,
with Li free, 1 ≤ i ≤ n.
In a vector space, any two bases have the same cardinal. In the case
of a free module over an arbitrary ring, this fact is not guaranteed.
Nevertheless, the free modules over a commutative ring have this
property; in fact, the proof of this result reduces the problem to the
case of vector spaces.
suppose that the ring R is commutative, although some results hold for
any ring with identity.
15
We consider the bases as being totally ordered (the place of the element in the
basis matters).
II.4 Free modules 113
t
where ϕ(e) = (ϕ(e1), …, ϕ(en)). In the same way, if B = Me', f'(ϕ), then
−1
ϕ(e') = B·f'. Also we have e' = S·e and f' = T·f, or f = T ·f'. Since ϕ is
an R-module homomorphism, we have ϕ(S·e) = S·ϕ(e) (prove this!).
Thus:
−1 −1
ϕ(e') = ϕ(S·e) = S·ϕ(e) = S·(A·f) = (SA)·f = (SA)·(T ·f') = (SAT )·f',
−1
which says that the matrix of ϕ in the bases e' and f' is SAT . !
Exercises
118
III.1 The submodules of a free module 119
1
What about the case of vector spaces?
III.1 The submodules of a free module 121
−1
invertible matrices U and V such that U AV = D = (dij) ∈ Mm, n(R) is a
diagonal matrix (i ≠ j implies dij = 0), with the additional condition
d11|d22|…|dmm.
We reduced the statement to a problem concerning matrices with
elements in R. The following definitions are helpful:
1.4 Theorem. Let R be a PID and let m, n ∈ N*. Then any matrix
A ∈ Mm, n(R) is arithmetically equivalent to a matrix in Smith normal
form. Moreover, the Smith normal form of A is unique in the following
sense: if D = diag(d1,…, dr) and D' = diag(d'1,…, d'r) are in Smith nor-
122 III. Finitely generated modules over principal ideal domains
i j i
1 ! ! 0 1 0 ! 0
* ! ! 0 1 ! 0
i ++ 0 + 1 * !
Di(u) =
Pij = ! * ! i +++ u
j ++ 1 + 0 *
* 0 0 + 0 1
0 1
PijPij = I;
−1
Dij(u)Dij(v) = Dij(uv), ∀u, v ∈ U(R); so Dij(u)Dij(u ) = Dij(1) = I.
In other words, the inverse of an elementary matrix exists and is
also an elementary matrix (of the same type).
Looking at the definition of the relation of arithmetic equivalence
between matrices, we obtain:
1.6 Proposition. For any A ∈ Mm, n(R), any matrix obtained from A
by elementary transformations of rows and/or columns is arithmeti-
cally equivalent to A. !
We also need:
⎛ m ⎞⎛ n ⎞
chosen rows and columns. There are ⎜ ⎟⎜ ⎟ minors of order k in an
⎝ k ⎠⎝ k ⎠
m×n matrix).
Note that, if U ∈ Mm(R), then ∆k(A)|∆k(UA). Indeed, the rows of UA
are linear combinations (with coefficients in R) of the rows of A. Thus,
the rows of a minor of order k of UA (corresponding to the choice of
columns i1, …, ik of UA) are linear combinations of the rows of A
(truncated to contain only the entries on the columns i1, …, ik). Apply-
ing the fact that the determinant of a matrix is a multilinear function of
the rows of the matrix2, it follows that a minor of order k of UA is a
linear combination of minors of order k of A. The claim now follows.
Similarly, if V ∈ Mn(R), then ∆k(A)|∆k(AV). So, if A ∼ B, then
∆k(A)|∆k(B) and, by symmetry, ∆k(B)|∆k(A), i.e. ∆k(A) ∼ ∆k(B). If
D = diag(d1, …, dr) is in Smith normal form, an easy check show that
∆k(D) ∼ d1…dk. So, if A ∼ D, d1, …, dr are determined (up to associa-
tion in divisibility) by ∆1(A), …, ∆r(A) and
d1 ∼ ∆1(D)∼ ∆1(A), dk ∼ ∆k(A)/∆k−1(A) , for k ≥ 2.
These relations indicate another method to compute effectively d1,
…, dr (although the amount of computation is prohibitive if m, n are
not small). !
2
By denoting (l1, …, lk) the matrix having the rows l1, …, lk, the following
relation holds: det(al1 + bl'1, …, lk) = adet(l1, …, lk) + bdet(l'1, …, lk), ∀a, b ∈ R
(similarly for any row li).
128 III. Finitely generated modules over principal ideal domains
Exercises
⎡ a 0⎤
2. Let a, b ∈ R. Show that the Smith normal form of ⎢ ⎥ is
⎣ 0 b⎦
⎡d 0 ⎤
⎢ 0 m ⎥ , where d = GCD(a, b), m = LCM(a, b). (Hint. Use the invari-
⎣ ⎦
ants ∆k.)
3. Find the Smith normal form of a diagonal matrix diag (a1, …, an)
∈ Mn(R).
4. Find the Smith normal form of a row matrix (a1, …, an) ∈ M1, n(R).
5. Determine all subgroups of (Z×Z, +).
*
6. Let n ∈ N , x1, …, xn ∈ R and d = GCD(x1, …, xn). Show that there
exists V ∈ GL(n, R) such that (x1, …, xn)V = (d, 0, …, 0). (Ind. Con-
sider the Smith normal form of the row matrix (x1, …, xn)).
*
7. Let n ∈ N and let a1, …, an ∈ R. Show that: there exists
V ∈ GL(n, R) such that the first row of V is (a1, …, an) if and only
if GCD(a1, …, an) = 1.
8. Let K be a field and let A ∈ Mm, n(K). Then the Smith normal form
of A is diag(1,…,1, 0,…, 0), where 1 appears r times (r is the rank of
the matrix A).
* n m
9. Let m, n ∈ N and let ϕ : R → R be an R-homomorphism whose
matrix is A ∈ Mm, n(R) (in the canonical bases). Let U ∈ GL(m, R) and
V ∈ GL(n, R) such that UAV is in Smith normal form, namely
diag(d1,…, dr, 0,…, 0), with r ≤ min(m, n) and d1,…, dr nonzero. Show
that a basis in Kerϕ is (vr+1, …, vn), where vi is the column i of the ma-
n
trix V (vi is seen as an element in R ).
10. Suppose L is a free R-module of rank n, (e1, …, en) is a basis in L
and {f1, …, fm} ⊆ L. Show that a basis in F = < f1, …, fm > can be ob-
tained as follows:
130 III. Finitely generated modules over principal ideal domains
2.2 Lemma. Let R be a ring and let M be a left R-module such that
M is the direct sum of a family of submodules (Mi)i∈I, M = ⊕I Mi. If
Ni ≤ R Mi, ∀i ∈ I, then the sum of the submodules (Ni)i∈I is direct and
we have a canonical isomorphism :
⊕ Mi M
i∈I
≅⊕ i .
⊕ N i i∈I N i
i∈I
Proof. Let πj : ⊕I Mi → Mj be the canonical surjections. Define
M
ϕ : M → ⊕ i by ϕ(x) = (πi(x) + Ni)i∈I, ∀x ∈ M. One easily checks
i∈I N
i
that ϕ is a surjective homomorphism (in fact, ϕ is the direct sum of the
family of homomorphisms ηi◦πi : M → Mi/Ni, where ηi : Mi → Mi/Ni
is the canonical surjection). We have Kerϕ = {x ∈ M | πi(x) ∈ Ni,
∀i ∈ I}. Since x = ∑i∈I πi(x), it follows that Kerϕ = ∑i∈I Ni = ⊕I Mi.
Apply now the isomorphism theorem. !
We state now the following important theorem, which determines
the structure of finitely generated modules over a principal ideal do-
main. Recall that R° is the set of nonzero noninvertible elements of R.
k k
c) Let d ∈ R and k ∈ N such that p ||d. Then tp(R/Rd) ≅ R/Rp . In
k
particular, if x ∈ M and o(x) = d, then tp(Rx) ≅ R/Rp ; if p-d, then
tp(Rx) = 0.
Proof. a) and b) have similar proofs with 2.5 and are proposed as
an exercise.
k
c) Let b ∈ R such that d = p b. We claim that tp(R/Rd) = Rb/Rd. In-
s
deed, let r + Rd ∈ tp(R/Rd). There exist s ∈ N such that p r ∈ Rd, i.e.
s k s
p r = dc = p bc, with c ∈ R. So b| p r and (b, p) = 1, which imply b|r.
So, r ∈ Rb and tp(R/Rd) ⊆ Rb/Rd. The other inclusion is obvious.
k
We have R/Rp ≅ Rb/Rd by the isomorphism theorem applied to
ϕ : R → Rb/Rd, ϕ(r) = rb + Rd, ∀r ∈ R. !
α j +1 −α j α m −α j
≅ R Rp ⊕ … ⊕ R Rp
We have used the proprieties (whose proof is immediate): if
α β
M = ⊕i∈I Mi and r ∈ R, then rM = ⊕i∈I rMi ; p {R/Rp } ≅ 0 if β ≤ α;
α β β −α
p {R/Rp } ≅ R/Rp if β > α.
From (**), with a similar argument, we have :
( )
p j t p ( M ) ≅ p j R Rp β1 ⊕ … ⊕ p j R Rp βm
α α α
( )
β j −α j β m −α j
≅ R Rp ⊕ … ⊕ R Rp
Therefore,
α −α α −α β −α β −α
R Rp j +1 j ⊕ … ⊕ R Rp m j ≅ R Rp j j ⊕ … ⊕ R Rp m j
These are decompositions of the type (*), as one easily sees. Let k
be the number of indices i for which αi > αj (evidently, 0 ≤ k ≤ m − j).
In the left hand side we have k nonzero terms and in the right hand
side there are m − j + 1 nonzero terms. The first part of the proof
shows that k = m − j + 1, contradicting k ≤ m − j. Thus, we must have
αi = βi, 1 ≤ i ≤ m. !
Exercises
field) is cyclic. (Hint. If G has more than one invariant factor and d is
the greatest invariant factor, then any element of G is a root of a
d
X − 1 and d < |G|).
14. Determine the finitely generated Abelian groups with the property
that their lattice of subgroups is a chain ( = totally ordered with respect
to inclusion).
3
The ideals Ii şi Ij are called in this case comaximal. For example, the ideals Za
and Zb of Z are comaximal if and only if a and b are coprime.
142 III. Finitely generated modules over principal ideal domains
R R R R
= ≅ ×… × ,
I1 ⋅ … ⋅ I n I1 ∩ … ∩ I n I1 In
η(r + I1 ∩…∩ In) = (r + I1, …, r + In), ∀r ∈ R.
R R
b) Conversely, if the homomorphism ϕ : R → ×… × ,
I1 In
ϕ(r) = (r + I1, …, r + In), ∀r ∈ R is surjective (inducing an isomor-
R R R
phism ≅ ×… × as above), then Ii and Ij are comaxi-
I1 ∩ … ∩ I n I1 In
mal ideals, for any i ≠ j.
Proof. a) We prove by induction on n that I1·…·In = I1 ∩…∩ In and
that η is an isomorphism. For n = 2, I1 + I2 = R implies the existence
of x ∈ I1, y ∈ I2 such that x + y = 1. Let z ∈ I1 ∩ I2. Then
z = z·1 = zx + zy, with zx, zy ∈ I1·I2, so I1 ∩ I2 ⊆ I1I2. Thus, I1 ∩ I2 = I1I2.
R R
Let ϕ : R → × , ϕ(r) = (r + I1, r + I2), ∀r ∈ R. ϕ is a ring (and
I1 I 2
an R-module) homomorphism (it is the direct product of the canonical
surjections R → R/Ij). We have:
Kerϕ = {r ∈ R| (r + I1, r + I2) = (0 + I1, 0 + I2)} = I1 ∩ I2
The isomorphism theorem implies that R/I1 ∩ I2 ≅ Imϕ and the
isomorphism is precisely η. Let us prove that ϕ is surjective (which
R R
will finish the proof). Let (r1 + I1, r2 + I2) ∈ × . We must exhibit
I1 I 2
r ∈ R with r − r1 ∈ I1, r − r2 ∈ I2. Such an element is r = r1 y + r2 x. In-
deed,
r − r1 = r1y + r2x − r1x − r1y = (r2 − r1)x ∈ I1.
4
Recall that the product IJ of two ideals I and J is the ideal generated by all the
products ij, i ∈ I, j ∈ J. The product of ideals is associative and always IJ ⊆ I ∩ J.
III.3 Indecomposable finitely generated modules 143
Similarly, r − r2 ∈ I2.
Suppose now that for any k < n and any ideals I1,…, Ik, pairwise co-
maximal , we have I1·…·Ik = I1 ∩…∩ Ik and η is an isomorphism. Take
n pairwise comaximal ideals I1,…, In. Since Ij + In = R, 1 ≤ j ≤ n − 1,
there exist aj ∈ Ij, bj ∈ In, such that aj + bj = 1. Multiply these n − 1
equalities:
n −1
∏ (a j + b j ) = a1·…·an−1 + b = 1, where b ∈ In, a1·…·an−1 ∈ I1·…·In−1.
j =1
R R R
≅ ×… ×
Ra1 … an Ra1 Ran
r + Ra1…an & (r + Ra1, …, r + Ran), ∀r ∈ R.
b) If M is a cyclic R-module M = Rx (x ∈ M), with
o(x) = d = a1·…·an ∈ R° and (ai, aj) = 1, ∀i ≠ j, then there exist xi ∈ M,
1 ≤ i ≤ m, such that o(xi) = ai and
M = Rx = Rx1⊕…⊕Rxm
c) Any cyclic R-module can be written as a direct sum of
indecomposable submodules.
Proof. a) If a, b ∈ R, then (a, b) = 1 if and only if Ra and Rb are co-
maximal. Indeed, the ideal generated by GCD(a, b) is Ra + Rb. So, (a,
b) = 1 ⇔ Ra + Rb = R. Apply now the Chinese remainder theorem for
Ra1, …, Ran.
b) We have M ≅ R/Rd. By a), we have R/Ra1 ×…× R/Ran ≅ R/Rd.
So, there is an isomorphism ϕ : R/Ra1 ×…× R/Ran → M. Let
yi := (0 + Ra1,…, 1 + Rai, …, 0 + Ran) and xi := ϕ(yi). Obviously,
R/Ra1 ×…× R/Ran = Ry1⊕…⊕Ryn, so, (applying ϕ) M = Rx1⊕…⊕Rxn.
Also, o(xi) = o(yi) = ai, 1 ≤ i ≤ n.
c) This follows from a): let M = Rx, with x ∈ M and let d = o(x). If
d = 0, then M ≅ R is indecomposable. If d ≠ 0, let d = p1k1 … ptkt the
prime factor decomposition of d (where p1, …, pt are distinct primes in
k
R). Clearly, piki and p j j are coprime if i ≠ j; applying b), there exist
xi ∈ M such that M = Rx1⊕…⊕Rxt, with o(xi) = piki . So Rxi is
indecomposable, being isomorphic to R Rpiki . !
k
R/Rp , where p ∈ R is prime and k ∈ N* (in this case M is a torsion
module).
Proof. The proof of the “if” part is easy: we saw that the modules
k
R/Rp are indecomposable.
Let M be indecomposable finitely generated. There exists a
decomposition (D), as in theorem 2.3. Keeping the notations in 2.3,
we see that M is indecomposable only if m = n = 1 or m = 0 and n = 1.
If m = 0, n = 1,then M is free of rank 1, and thus isomorphic to R.
If m = n = 1, then M = Rx, with o(x) = d. So, M ≅ R/Rd. Let
d = p1k1 … ptkt be the prime decomposition of d (p1, …, pt are distinct
k
primes in R). We claim that t = 1. If t > 1, piki and p j j are coprime if
i ≠ j, so R/Rd ≅ R Rp1k1 × … × R Rptkt , which is clearly decomposable. !
5
We avoid the term “set”, because in a set all elements are distinct, while the
elementary divisors can occur more than once.
6
The class of all groups isomorphic to a given group G is called the type of
isomorphism of G. (This definition can be generalized to any type of algebraic
148 III. Finitely generated modules over principal ideal domains
2
with 60 elements: Z60 ≅ Z4⊕Z3⊕Z5 (the elementary divisors are (2 , 3,
5)) and Z2⊕Z30 ≅ Z2⊕Z2⊕Z3⊕Z5 (the elementary divisors are (2, 2, 3,
5)).
c) If the invariant factors (d1, …, dm) are given, the elementary divi-
sors can be obtained by decomposing in a product of powers of primes
every di and writing down all the powers that occur, as many times
that they arise.
Conversely, if the family of the elementary divisors is given, the
invariant factors are obtained as follows: write a product containing
(only once) all primes in the family of the elementary divisors, at the
greatest power. The product obtained is dm (the “largest” invariant fac-
tor – divisibility-wise). Erase from the family of the elementary divi-
sors the powers written in the product and repeat the procedure with
what is left. Continue until the elementary divisors are exhausted.
For example, if the family of the elementary divisors of a Z-module
2 3
is (2, 2, 2 , 3, 3 , 5), following the procedure above we obtain succes-
2 3
sively: 2 ·3 ·5, 2·3, 2, which are the invariant factors of the Z-module.
Which is this Z-module?
structures: Abelian groups, rings, modules, fields, ordered sets…). For a given type
of algebraic structure, the description of all types of isomorphism of the structure is
a most important (and hard to attain, in general) objective, called classification. For
instance, the theorem of invariant factors yields a classification of finitely generated
Abelian groups. The classification of the finite simple groups (having no normal
proper subgroups) is one of the great successes of group theory, accomplished in the
1980's.
III.3 Indecomposable finitely generated modules 149
Exercises
4.12 Remark. With the notations in 4.9, µu is o(vm), the highest de-
gree invariant factor of u. If f ∈ K[X] is monic, the following proper-
ties are equivalent:
a) f = µu.
b) f (u) = 0 and ∀g ∈ K[X], g(u) = 0 implies f |g.
c) f (u) = 0 and ∀g ∈ K[X], g ≠ 0, g(u) = 0, implies deg f ≤ deg g.
The proof is easy, using the definitions. Note that, unlike the mini-
mal polynomial of an algebraic element in a field extension, the mini-
mal polynomial of an endomorphism is not necessarily irreducible.
The next result translates in matrix language the fact that V is a di-
rect sum of u-invariant subspaces.
4.13 Proposition. a) Let V = V1⊕…⊕Vm, where V1, …, Vm are
u-invariant subspaces. If vi is a basis in Vi, 1 ≤ i ≤ m, then
v1∪…∪vm =: v is a basis7 in V. If Ai is the matrix of the restriction of u
to Vi, in the basis vi, 1 ≤ i ≤ m, then the matrix of u in the basis v is
(written on blocks):
⎡ A1 0⎤
⎢ ⎥
Mv(u) = ⎢ * ⎥,
⎢ ⎥
⎣0 Am ⎦
b) Conversely, if the matrix of u in a basis v is of the form above,
then the rows of the block Ai correspond to a set of vectors in v that
generate an u-invariant subspace Vi (1 ≤ i ≤ m) and V = V1⊕…⊕Vm.
Proof. a) It is clear that v is a basis in V (see also II.4.11). To keep
notations manageable, suppose m = 2 and v1 = (e1, …, ep), v2 = ( f1,
…, fq), p + q = n = dimV. Then v = (e1, …, ep, f1, …, fq). Since V1, V2
are u-invariant, u(ei) is a linear combination of e1, …, ep, and u( fj) is a
linear combination of f1, …, fq. Writing the matrix of u in the basis v,
7
We totally order the vectors in the basis v, by sequencing the elements of the
bases v1, …, vm, in this order.
156 III. Finitely generated modules over principal ideal domains
⎡ A1 0 ⎤
Mv(u) = ⎢ ⎥.
⎢⎣ 0 A2 ⎥⎦
b) The task of detailing the proof is left to the reader. !
We want to find a basis v of V such that Mv(u) has as “simple” a
form as possible. Since Vu is a direct sum of indecomposable submod-
ules (theorem 4.9) the previous result allows us to study the restriction
of u to each of the u-invariant subspaces in the direct sum. It is thus
natural to study first the case in which Vu is indecomposable:
k
Vu = K[X]v, for some v ∈ V, o(v) = p , p irreducible in K[X], k ∈ N*.
r r −1
4.14 Definition. If p ∈ K[X], p = X − ar −1 X −… − a1 X − a0,
define the r×r matrices with entries in K:
⎡ 0 1 0 … 0 ⎤ ⎡0 0 0 … 0⎤
⎢ 0 0 1 … 0 ⎥ ⎢0 0 0 … 0⎥
⎢ ⎥ ⎢ ⎥
Cp = ⎢ ! * ⎥ , N = ⎢… ⎥
⎢ 0 0 0 … 1 ⎥ ⎢0 0 0 … 0⎥
⎢ ⎥ ⎢ ⎥
⎢⎣ a0 a1 a2 … ar −1 ⎥⎦ ⎢⎣ 1 0 0 … 0 ⎥⎦
The matrix Cp is called the matrix companion of the polynomial p.
Define the rk×rk matrix (written in block form):
Cp N 0 … 0 0
0 Cp N … 0 0
k * *
J{ p } = ∈ Mrk(K)
* *
0 0 0 … Cp N
0 0 0 … 0 Cp
k k
J{ p } is called the Jordan cell 8 associated to the polynomial p .
8
Camille Jordan (1838-1922), French mathematician.
III.4 The endomorphisms of a finite dimensional vector space 157
9
In other texts this matrix is called a rational canonical matrix, the name Jordan
canonical matrix being given only if pi are polynomials of degree 1.
158 III. Finitely generated modules over principal ideal domains
r−1
e0 = v; e1 = X·v = u(e0); … ; er−1 = X ·v = u(er−2);
r−1
er = p·v; er+1 = Xp·v = u(er); … ; e2r−1 = X p·v = u(e2r−2);
…
k −1 r−1 k −1
e(k −1)r = p ·v; e(k −1)r+1 = Xp·v = u(e(k −1)r); …; ekr−1 = X p ·v = u(ekr−2).
The next lemma proves that e = (e0, …, ekr−1) is a basis:
Lemma. If u ∈ EndK(V) is an endomorphism such that Vu = K[X]v
for some v ∈ V, and f = o(v), deg f = n, then, for any g0,
…, gn−1 ∈ K[X], with deg gi = i, 1 ≤ i ≤ n, the vectors
g0·v, …, gn−1·v
form a basis of V.
Proof of the lemma. Vu ≅ K[X]/(f ) (K[X]-module isomorphism, so
also a K-vector space isomorphism), hence dimKV = dimK K[X]/(f )
= deg f = n. The vectors g0·v, …, gn−1·v are linearly independent: if
a0g0·v + … + an−1gn−1·v = 0, with ai ∈ K, then h·v = 0, where h = a0g0 +
… + an−1gn−1. Since o(v) = f, and deg h < n, we have f |h, hence h = 0.
But the polynomials g0, …, gn−1 are linearly independent in the
K-vector space K[X], being of distinct degrees, so a0 = … = an−1 = 0.
The n elements g0·v, …, gn−1·v are thus linearly independent in V,
whose dimension is n, which means they are a basis.
k
We get back to proving that Me(u) = J{ p }. If 1 ≤ i < k, we have:
r−1 i−1 r i−1 r−1 i−1
u(eir−1) = X(X p ·v) = X p ·v = (p + a0 + a1X + … + ar−1X )p ·v
i i−1 i−1 r−1 i−1
= p ·v + a0p ·v + a1Xp ·v + … + ar−1X p ·v
= eir + a0e(i −1)r + a1e(i −1)r+1 + … + ar−1e(i −1)r+r−1
If i = k, u(ekr−1) = a0e(k −1)r + a1e(k −1)r+1 + … + ar−1e(k−1)r+r−1, since
k
p ·v = 0.
These equalities, together with the relations (0), …, (k − 1), prove
the claim.
b) Decompose Vu as a direct sum of u-indecomposable u-invariant
subspaces (see 4.9). By a), each such subspace has a basis in which the
k k
restriction of u has the matrix of the form J{ p }, with p elementary
divisor of u. Apply now Prop. 4.13. !
III.4 The endomorphisms of a finite dimensional vector space 159
If f = (f1, f2,…, fn), then these relations can be written f = (XI − A)e.
Lemma. f =: ( f1, f2,…, fn) is a basis in F = Ker ϕ.
Proof of the lemma. For any 1 ≤ i ≤ n,
u(vi) = ai1v1 + ai2v2 + … + ainvn
Thus,
ϕ(fi) = Xvi − (ai1v1 + ai2v2 + … + ainvn)
= u(vi) − (ai1v1 + ai2v2 + … + ainvn) = 0.
This shows that fi ∈ Kerϕ = F. Let us prove that f is a system of
generators for F. Note that:
Xei = fi + ai1e1 + ai2e2 + … + ainen, 1 ≤ i ≤ n. (*)
2
So, X ei = Xfi + ai1 Xe1 + ai2 Xe2 + … + ain Xen. Using (*), we obtain:
X ei = ∑j qj fj + ∑i rjej, for some qj ∈ K[X], rj ∈ K, 1 ≤ j ≤ n. By induc-
2
m
tion, one easily sees that, ∀m ∈ N*, X ei is expressed as:
X ei = ∑j qj fj + ∑i rjej, for some qj ∈ K[X], rj ∈ K, 1 ≤ j ≤ n. (**)
m
This means that the monic polynomials d1·…·dm and fA differ by the
factor detS·detT ∈ K*, which shows that they are equal. On the other
hand, it is clear that the product of the elementary divisors equals the
product of the invariant factors. !
10
Arthur Cayley (1821-1895), Sir William Rowan Hamilton (1805-1865), British
mathematicians.
11
Ferdinand Georg Frobenius (1849-1917), German mathematician.
164 III. Finitely generated modules over principal ideal domains
Exercises
This chapter contains the basic concepts and results from the theory
of field extensions. Standard facts about rings and vector spaces are a
prerequisite: polynomial rings, factor rings, ring isomorphism theo-
rems, bases and dimension in vector spaces, prime and maximal ide-
als. Knowledge of polynomial ring arithmetic is recommended (as
provided in the chapter “Arithmetic in integral domains”). Some
elementary properties of cardinals and Zorn's Lemma are used in the
proof of existence of the algebraic closure of a field. Most of these
facts can be found in the Appendices; a more detailed treatment is
found in most Abstract (Modern) Algebra introductory texts.
169
170 IV. Field extensions
1.4 Definition. The fields that have no proper subfields are called
prime fields.
We determine now all prime fields.
Recall the notion of characteristic of a ring R with identity. Let 1
be the identity element of R and let n ∈ N; n ·1 denotes the multiple
1 + … + 1 (n terms); the characteristic of R, denoted char R, is defined
as follows:
- if, for any n ∈ N*, n ·1 ≠ 0, then char R = 0;
39
Evariste Galois (1811-1832), French mathematician.
40
One can define the degree of an extension of skew fields (division rings): if L
is a skew field and K is a subfield (skew) of L, then L is naturally a K-vector space
and one defines the degree [L : K] as dimKL.
IV.1. Algebraic extensions 173
K(S). One also says that K(S) is obtained by adjoining to K the ele-
ments in S.
We denote by K[S] the subring of L generated by K ∪ S (the
intersection of all subring of L that include K ∪ S).41
It is easy to see that K(S) (respectively K[S]) is the smallest subfield
(respectively subring) of L that includes K ∪ S. Obviously, the ring
K[S] is a domain (it is a subring of the field L) and K[S] ⊆ K(S). The
field of fractions of the domain K[S] is canonically isomorphic to K(S)
(see the universality property of the field of fractions).
If S = {x1,…, xn}, then K(S) is denoted by K(x1,…, xn) and K[S] by
K[x1,…, xn]. An extension K ⊆ L with the property there exists a finite
subset S of L such that L = K(S) is called a finitely generated exten-
sion. Do not confuse with the concept of finite extension (which means
that its degree is finite)!
If there exists α ∈ L such that L = K(α), the extension K ⊆ L is
called a simple extension, and α is called a primitive element. A primi-
tive element need not be unique: for instance, K(α) = K(α + 1).
41
This notation is used also for ring extensions: if R is a subring of the ring A,
and S is a subset of A, R[S] is the subring generated by R ∪ S in A (R[S] coincides
with the R-subalgebra of A generated by S).
176 IV. Field extensions
K[S] =
⎪⎧ ⎪⎫
⎨ ∑ ai … i x1 … xn n ∈ '* , x1 ,… , xn ∈ S , ai … i ∈ K , ∀ ( i1 , … , in ) ∈ ' n ⎬
' i i 1 n
42
An ordered set (S, ≤) is called a lattice (resp. a complete lattice) if any subset
with two elements (resp. any subset) of S has a least upper bound and a greatest
lower bound in L.
IV.1. Algebraic extensions 177
b) Since K[S] ⊆ K(S) and K(S) is a field, any element of the form
αβ −1, with α, β ∈ K[S], β ≠ 0, is in K(S). But the set of all these ele-
ments is a subfield in L (standard check).
c), d) Exercise. !
In the particular case S = {x1, …, xn}, with x1, …, xn ∈ L, :
K[x1, …, xn] = { f ( x1 ,… xn ) f ∈ K [ X 1 ,… , X n ]}
⎧ f ( x1 ,… xn ) ⎫
K(x1, …, xn) = ⎨ f , g ∈ K [ X 1 ,… , X n ], g ( x1 ,… xn ) ≠ 0⎬ .
⎩ g ( x1 ,… xn ) ⎭
If S = {a} ∈ L:
K[a] = {f (a) | f ∈ K[X]} and K(a) = {f (a)/g(a) | f, g ∈ K[X], g(a) ≠ 0}.
Thus, K[a] = Im eva, where eva : K[X] → L is the unique K-algebra
homomorphism with the property that eva(X) = a; eva is called the
“homomorphism of evaluation in a”. If f ∈ K[X], f = b0 + b1 X + … +
n n
bn X , then eva( f ) = b0 + b1a + … + bna ∈ L. The usual notation for
eva( f ) is f (a), called the “value of f in a”. If f (a) = 0, we say „a is a
root of f”.
The following notion is central in all the theory we describe.
1 = ( 3 2 − 1)(1 + 3 2 + 3 4 ).
43
This result, whose proof is simple for a modern mathematician, was known
intuitively for a long time and often tacitly accepted in 17th to 19th centuries’
mathematical arguments (of course, only polynomials with numerical coefficients
were considered). Around 1629, Albert Girard, states –without proof– that an
equation of degree n has n roots, that can be complex numbers or “other similar
numbers”. In 1792, Pierre Simon de Laplace gives an elegant proof of the
“Fundamental Theorem of Algebra” – any nonconstant polynomial with complex
coefficients has a complex root – admiting though that the roots exist “somewhere”.
IV.1. Algebraic extensions 183
⎛ ⎞
If ∑ aij xi y j = 0 , with aij ∈ K, then ∑
⎜⎜ ∑ aij xi ⎟⎟ y j = 0 ,
1≤i ≤ m , 1≤ j ≤ n 1≤i ≤m ⎝ 1≤ j ≤n ⎠
where ∑ aij xi ∈ L. The L-linear independence of {y1, …, yn} implies
1≤ j ≤ n
that ∑ aij xi = 0, for any i. Hence aij = 0 for any i and j, because {x1,
1≤ j ≤ n
1.27 Remarks. a) The theorem is also true for division rings, since
the proof does not use the commutativity of multiplication.
b) More generally, if the extensions are not necessarily finite, sup-
pose that (xi)i∈I is a K-basis of L and (yj)j∈J is an L-basis of M. Then
(xiyj)(i,j)∈I×J is a K-basis of M.
3
Eisenstein's criterion (or observing that X − 2 has no rational roots).
Thus, the degree of the extension is 3 and a Q-basis of Q (3 2 ) is
{1, 3 2 , 3 4 } . This means that any element of Q (3 2 ) is written
uniquely as a + b3 2 + c3 4 , with a, b, c ∈ Q. The same argument
shows that for any n ∈ N , the extension Q ⊆ Q (n 2 ) has degree n and
*
{
a Q-basis is 1, n 2 ,…, n 2n −1 . }
b) The extension Q ⊆ Q( 2 , 3 ) has degree 4 and a Q-basis is
{1, 2 , 3, 6 }. for the proof, consider the tower of extensions
Q ⊆ Q ( 2 ) ⊆ Q( 2 )( 3 ) = Q( 2 , 3 ) . The first extension has de-
2
gree 2 (since Irr ( 2 , Q) = X − 2) and a basis is {1, 2 }. The exten-
sion Q ( 2 ) ⊆ Q( 2 )( 3 ) has also degree 2, because Irr ( 3,Q( 2 ))
= X − 3. Indeed, X 2 − 3 is irreducible over Q ( 2 ) , because
2
We have deg h = max (deg f, deg g). This is clear if deg f ≠ deg g; if
deg f = deg g, then the leading coefficient of h is ub − a, where a,
respectively b, are the leading coefficients of f, respectively g. We
−1
have ub − a ≠ 0, because otherwise u = ab ∈ K, contradicting the as-
sumption u ∉ K.
We have to prove that h = g(X)u − f (X) ∈ K(u)[X] is irreducible.
First, note that u is transcendental over K. Indeed, K ⊆ K(T) is infinite
(T is transcendental over K) so K ⊆ K(u) must also be infinite
(otherwise, considering the tower of extensions K ⊆ K(u) ⊆ K(T), the
theorem of transitivity of finite extensions would imply that K ⊆ K(T)
is finite). Thus, there is a K-isomorphism K[Y] ≅ K[u] (where Y is an
indeterminate). The irreducibility of g(X)u − f (X) ∈ K(u)[X] is thus
equivalent to the irreducibility of r = g(X)Y − f(X) ∈ K(Y)[X]. Since
K[Y] is a UFD and K(Y) is its field of quotients, this is equivalent to r
being irreducible in K[Y][X] ≅ K[Y, X]. Suppose r = pq, with p, q ∈
K[Y, X]. Let degY r be the degree of r, seen as a polynomial in Y with
coefficients in K[X]. We have 1 = degY r = degY p + degY q, so we may
suppose degY p = 1 and degY q = 0. So, q ∈ K[X] and p = cY + d, for
some c, d ∈ K[X] and r = g(X)Y − f(X) = (cY + d )q. Identifying the
coefficients of the powers of Y, we obtain that q|g and q| f in K[X],
implying that q ∈ K*, since ( f, g) = 1. Thus, h is irreducible in
K[Y][X].
The polynomial h is irreducible and vanishes in T, so it is associ-
ated to Irr(T, K(u)). Proposition 1.20 says that [K(T) : K(u)]
= deg Irr(T, K(u)) = deg h = max (deg f, deg g).
It is interesting to remark that any intermediate field K ⊆ L ⊆ K(T)
with K ≠ L is of the form L = K(u), for some u ∈ K(T), u ∉ K. This fact
is known as “Lüroth’s Theorem” and is of significance in Algebraic
Geometry (see MORANDI [1996], WALKER [1950]).
({ })
1.33 Example. The extension Q ⊆ L = Q n 2 n ∈ N* is algebraic
and infinite. Q ⊆ L is algebraic because L is the union of its intermedi-
ate fields of the form Q ({n 2 n ≤ m}) , m ∈ N*, and each of these is
algebraic (they are even finite). But Q ⊆ L is not finite: if, by
contradiction, [L : Q] = n ∈ N*, then each intermediate field would
have the degree a divisor of n. But Q ⊆ Q (n +1 2 ) has degree n + 1.
Exercises
⎧⎪aa11xx1 ++ … + a1nxn = b1
… + a2nxn = b2
(S): ⎨
21 1
.
…
⎩⎪am1x1 + … + amnxn = bm
Prove that: the system (S) has a solution in E ⇔ (S) has a solution
in K. Can you prove other similar properties of (S)?
4. Let K ⊆ L be an extension and let p ∈ L[X] \ K[X], deg p = n. Prove
that | p(K) ∩ K | ≤ n. (Hint. Suppose p(αi) = βi, 1 ≤ i ≤ n + 1, with αi,
βi ∈ K. Interpret this as a system of n + 1 equations, the unknowns be-
ing the coefficients of the polynomial.)
5. Let K ⊆ L be an extension and let α ∈ L, algebraic over K. If
deg Irr(α, K) = n, then deg Irr(β, K) divides n, for any β ∈ K(α).
6. Let K ⊆ L be an extension and let x ∈ L. Prove that x is
transcendental over K if and only if K[x] is K-isomorphic to the
K-algebra K[X] of polynomials in the indeterminate X.
7. Let K be a field and let (Ki)i ∈ I be a chain of subfields of K (∀i,
j ∈ I, Ki ⊆ Kj or Kj ⊆ Ki). Prove that ∪i ∈ I Ki is a subfield in K.
−1
8. Find a basis of the extension Q ⊆ Q[ 3 ]. Express (1 + 3 ) in
this basis. Find Irr(1 + 3 , Q). The same problem for Q ⊆ Q(α) and
3
the element 1 − α, where α is a root of X + X + 1.
IV.1. Algebraic extensions 191
EL
L E
L∩E
K
a) If K ⊆ L is algebraic, then E ⊆ EL is algebraic.
b) If S is a subset of L and K < S > = L (S generates the K-vector
space L), then E < S > = EL. (Hint. Reduce to the case when S is finite.)
c) If [L : K] is finite, then [EL : E] ≤ [L : K].
d) If the degrees [L : K] and [E : K] are finite and coprime, then
[EL : E] = [L : K] and [EL : K] = [E : K]·[L : K].
e) If [EL : K] = [E : K]·[L : K], then K = L ∩ E.
f) If [E : K] = 2 and K = L ∩ E, then [EL : K] = [E : K]·[L : K].
g) Give an example such that [E : K] = [L : K] = 3, K = L ∩ E, but
[EL : K] < 9.
27. Let K ⊆ L be an extension of degree n and let g ∈ K[X], deg g = p,
with p prime, (p, n) = 1. If g is irreducible in K[X], then g is irreducible
in L[X].
5
28. Show that X − 2 is irreducible in Q(ω)[X], where
ω = cos(2π/5) + i·sin(2π/5).
2.6 Proposition. Let K be a field and let f ∈ K[X]. Then f has multi-
ple roots (in some extension of K) if and only if f and f ' are not rela-
tively prime.
Proof. Let Ω be an extension of K in which f splits (IV.1.25). If f
has multiple roots, we saw that f and f ' are not relatively prime. Con-
versely, if g = GCD(f, f ') has degree ≥ 1, then g has a root α in Ω (the
roots of g are among the roots of f ) and α is a multiple root of f since
f(α) = f '(α) = 0. !
Without knowing the roots of a polynomial, the criterion above al-
lows to decide if it has multiple roots.
44
In honor of François Viète, 1540-1603, French mathematician.
IV.2 Roots of polynomials. Algebraically closed fields 197
In particular,
an(x1 + … + xn) = – an − 1
an(x1 x2 + x1 x3 + … + xn −1 xn) = an − 2
…
n
an x1… xn = (–1) a0.
Consequently, any root in R of f divides a0.
Proof. The polynomial g = an(X − x1)…(X − xn) divides f, by prop.
2.2. The polynomials g and f have the same degree and g | f, so they
are associated in divisibility in K[X] (K is the field of fractions of R).
Since g and f have the same leading coefficient, g = f. The other
equalities result by identifying the coefficients of g and f. !
n −1 n
ψ(a0 + a1X + … + an − 1X + X ) = (a0, …, an − 1), is injective. On the
other hand, |K| = |K| if K is infinite, so |P| = ∑n≥1 |Pn| ≤ ∑n≥1 |K| =
n
45
This name (kept for historic reasons) expresses the concept that Algebra
studies mainly complex numbers. This is no longer true since the 19th century,
although complex numbers continue to play an important role in mathematics.
200 IV. Field extensions
|K × N| = |K|. If K is finite, then |K| is finite and |P| ≤ ∑n≥1 |K| = |N|
n n
46
You guessed, Zorn's Lemma will be used. All proofs of the existence of the
algebraic closure use some form of the Axiom of Choice.
47
This choice for | M | is suggested by the previous lemma. We want to be sure
that M includes a "copy" of any algebraic extension of K.
48
In fact, we need just an injective function α : K → M, but we assume K ⊆ M to
simplify notations.
IV.2 Roots of polynomials. Algebraically closed fields 201
3
c) The splitting field of X − 2 over Q is Q( 3 2 ,ω ), where ω ∈ C is
2 3
a root of X + X + 1. Indeed, the roots of X − 2 are 3 2 ,ω 3 2 ,ω 2 3 2
and Q (3 2 , ω ) = Q (3 2 , ω 3 2 , ω 2 3 2 ) .
d) If f ∈ K[X] is a polynomial of degree n, and L is its splitting field
over K, then [L : K] ≤ n!.
Indeed, if x1, …, xn ∈ L are the roots of f, then
[K(x1) : K] = deg(Irr(x1, K)) ≤ deg f = n. Note that L is a splitting field
over K(x1) of g := f/(X − x1) ∈ K(x1)[X]. Since deg g = n − 1, apply an
induction to obtain that [L : K(x1)] ≤ (n − 1)! and so [L : K] ≤ n!.
Proving the uniqueness (up to a K-isomorphism) of the splitting
field of a family of polynomials over K requires some results on the
extension of field homomorphisms. These results have also other
important applications.
We shall use frequently the following elementary fact: if σ : K → L
is a field homomorphism, then σ has a unique extension to a ring
n
homomorphism τ : K[X] → L[X], namely τ(a0 + a1X + … + anX )
n
= σ(a0) + σ(a1)X + … + σ(an)X . This is the unique ring homomorph-
ism τ : K[X] → L[X] satisfying τ|K = σ and τ(X) = X. The existence and
uniqueness of τ are a consequence of the universality property of the
polynomial ring K[X]. By a harmless abuse of notation, the extension
to K[X] of the homomorphism σ is denoted also by σ.
The following property is very simple, but has deep implications;
in particular, it is instrumental in the determination of the Galois
group of an extension.
n n
0 = ϕ(a0 + a1α + … + anα ) = a0 + a1ϕ(α) + … + anϕ(α) = f(ϕ(α)).!
E is ordered by " ≤ ", defined by: (F, ϕ) ≤ (F', ϕ') if and only if
F ⊆ F' and ϕ'|F = ϕ. A straightforward proof shows that " ≤ " is indeed
an order relation. Moreover, if {(Fi, ϕi)}i∈I is a chain in E, it is
bounded above by { ∪i∈I Fi, ϕ}, where ϕ : ∪i∈I Fi → Ω is defined by
ϕ(x) = ϕi(x) if x ∈ Fi (this definition is independent of the choice of
i ∈ I such that x ∈ Fi). Thus, E is inductively ordered and has, by
Zorn's Lemma, a maximal element (F, ϕ).
Let us prove that F = E, which finishes the proof. If F ≠ E, pick
x ∈ E \ F. The element x is algebraic over F; let f = Irr(x, F). Apply
Prop. 2.18 to the following situation: ϕ : F → ϕ(F) field isomorphism,
f ∈ F[X], x is a root of f (in E), x' is a root of ϕ{ f } (in Ω). We obtain an
isomorphism ϕ' : F(x) → ϕ(F)(x') that extends ϕ. Since ϕ(F)(x') ⊆ Ω,
ϕ' is an extension of ϕ to F(x), contradicting the maximality of (F, ϕ).!
−1 −1
σ : Ω' → Ω, we obtain analogously that σ (R') ⊆ R, so σ estab-
lishes a bijection between R and R'. If we remember the form of the
elements in K(R), we obtain that σ (K(R)) = K(σ (R)) = K(R'), so the
restriction of σ to K(R) is a K-isomorphism between K(R) and K(R').
b) is a particular case of c).
d) Let K ⊆ L be algebraic. Then the canonical inclusion ι : K → Ω
extends to a K-homomorphism ϕ : L → Ω by 2.19. Thus, L is
K-isomorphic to ϕ(L), a subfield of K. !
Part d) above says that a given algebraic closure Ω of K includes
“all” the algebraic extensions of K.
49
Also known as the d’Alembert-Gauss theorem. Jean le Rond d’Alembert
proposes an incomplete proof in 1746. C.F. Gauss gives four correct proofs of this
theorem, the first one in 1797. Other proofs are due to Jean Argand (1814), Augustin
Louis Cauchy (1820). The “theorem of Liouville” (which is due in fact to Cauchy,
1844) –“any holomorphic bounded function on C is constant”– proves the theorem
in one line. The present proof belongs to Pierre Samuel and has the advantage (?) of
being more “algebraic”. Note that all proofs use some analysis, because fundamental
(topological) properties of R do not possess purely algebraic descriptions. The
essential role is played in fact by the order properties of R.
206 IV. Field extensions
Exercises
50
The skew field of quaternions was discovered (invented?) by W.R. Hamilton.
208 IV. Field extensions
⎛1 0⎞ ⎛i 0 ⎞ ⎛ 0 1⎞ ⎛0 i ⎞
b) Let 1 := ⎜ ⎟ , i := ⎜ ⎟ , j := ⎜ ⎟ , k := ⎜ ⎟ . Show
⎝0 1⎠ ⎝0 − i⎠ ⎝ −1 0⎠ ⎝ i 0⎠
that any element of H is written uniquely as a1 + bi + cj + dk, where a,
b, c, d ∈ R.
c) Let Q be the multiplicative subgroup of H* generated by i and j.
Prove that Q is not commutative and has 8 elements. Write down the
multiplication table of Q. (Q is called the quaternion group). Deduce
that H is a skew field.
2
d) Prove that X + 1 ∈ H[X] has an infinity of solutions in H. Does
this contradict 2.3?
e) Prove that H is an extension of C.
f) Give an example of a countable skew field.
3. Give an example of an extension K ⊆ L, where L is algebraically
closed and L is not algebraic over K.
4. Let K ⊆ L ⊆ E be a tower of extensions and let f ∈ K[X]. If E is a
splitting field of f over K, then E is a splitting field of f over L.
5. Let K ⊆ L ⊆ E be field extensions and let f ∈ K[X]. Suppose
L = K(x1, …, xn), where x1, …, xn are roots of f (not necessarily all of
them). Show that E is a splitting field of f over K if and only if E a
splitting field of f over L.
6. Let f ∈ K[X] be a polynomial of degree n and let x1, …, xn be the
roots of f in some extension L of K. Show that K(x1, …, xn −1) is a split-
ting field of f over K.
7. If the extension K ⊆ L ⊆ Ω is such that any nonconstant polynomial
in K[X] splits over L, then L = Ω.
8. Show that (Q'R)(i) = Q'C.
9. Let char K = 0 and let g ∈ K[X], irreducible, deg g ≥ 2. If α, β ∈ Ω
are roots of g, then α − β ∉ K. (Hint. Suppose α − β ∈ K and let
β = α + b, b ∈ K. Then K(α) = K(α + b). Because g is irreducible,
there exists a K-isomorphism ϕ : K(α) → K(β) = K(α) that takes α to
IV.3 Finite fields 209
2 n
β = α + b. Then α, ϕ(α), ϕ(ϕ(α)) = ϕ (α),…, ϕ (α),… are roots of ϕ,
n
for any n. Since ϕ (α) = α + nb and char K = 0, these are all distinct.)
10. Let f, g ∈ K[X], nonconstant and let β be a root of f in Ω. Then:
f(g(X)) is irreducible in K[X] ⇔ f is irreducible in K[X] and g(X) − β is
irreducible in K(β)[X]. (Hint. Let f = a∏(X − βi), where β = β1, …,
βn ∈ Ω. If α is a root of g(X) − β, then f(g(α)) = 0 and: f(g(X)) is
irreducible in K[X] ⇔ [K(α) : K] = deg f(g(X)) = deg f ·deg g. In the
tower of extensions K ⊆ K(β) ⊆ K(α), [K(β) : K] = deg Irr(β, K) and
[K(α) : K(β)] = deg Irr(α, K(β)).)
∑ ( pi ) x
p p p
ϕ(x + y) = (x + y) = p −i
yi = x + y ,
0 ≤i ≤ p
51
Ferdinand Georg Frobenius (1849-1917), German mathematician.
IV.3 Finite fields 211
n
Proof. a) Un(Ω) has n elements if and only if f = X − 1 has no
n −1 n
multiple roots ⇔ { f ', f } = 1 ⇔ {nX , X − 1} = 1⇔ n·1 ≠ 0 ⇔
char K does not divide n.
t
b) In general, m | n implies Um ⊆ Un. Let p =: q. If x ∈ Un, then
x = (x m ) = 1. But y & y , ∀y ∈ K, is a field endomorphism
n q q
3.10 Remark. In the complex plane, the complex nth roots of unity
are the vertices of a regular polygon with n sides inscribed in the unit
circle. This justifies the name cyclotomic, of Greek origin and mean-
ing approximately “circle dividing”.
p p n p p p p
π(g ) = π(g) = (π(a0) + π(a1)X + … + X ) = π(a0) + π(a1) X + … +
pn p pn
X = π(a0) + π(a1)X + … + X = π(g1).
p p
Since g1 = fh, π(g1) = π(g) = π{ f }π(h). Thus π{ f } | π(g) in Zp[X],
so all irreducible divisors of π{ f } divide also π(g). Because deg π{ f }
= deg f, there exists an irreducible common factor h of π{ f } and π(g),
2
deg h ≥ 1. But then π(Φn) = π{ f }π(g) is divisible with h , so it has
n
multiple roots. This is absurd, since Φn | X − 1, so π(Φn) divides
n n −1
π(X − 1), who has no multiple roots: its derivative is nX ≠ 0 in
n
Zp[X] (p - n implies n invertible in Zp) and π(X − 1) is coprime to
n −1
nX . !
We will encounter cyclotomic extensions again in the study of Ga-
lois Theory. We need now some elementary facts on the conjugacy
classes of a finite group, which will be used here to prove that any fi-
nite division ring is commutative.
−1
d) It is clear that Ca = {z az | z ∈ G}. Let
G/C(a) := {C(a)x | x ∈ G}
be the set of right cosets of the subgroup C(a) in G. Define
−1
ϕ : G/C(a) → Ca by ϕ(C(a)x) = x ax, ∀x ∈ G. ϕ is correctly defined
−1
(x ax is independent on the representative x of the class C(a)x). In-
−1 −1 −1
deed, for any x, y ∈ G: C(a)x = C(a)y ⇔ xy ∈ C(a) ⇔ axy = xy a
−1 −1
⇔ x ax = y ay. The injectivity of ϕ also follows, since
−1 −1
x ax = y ay implies C(a)x = C(a)y. Because ϕ is obviously surjec-
tive, |Ca| = |G/C(a)| = [G : C(a)].
e) G is the disjoint union of the conjugacy classes. Clearly, a
∈ C(G) ⇔ C(a) = {a} ⇔ |Ca| = 1. Let S be a system of representatives
as in the statement. Then S ∪ C(G) is a system of representatives for
the conjugacy classes, so
G = ∪{Ca | a ∈ S ∪C(G)} = {∪{Ca | a ∈ S }}∪ C(G).
The unions are disjoint, so, taking cardinals and using
|Ca| = [G : C(a)], we obtain the formula. !
We can prove now the following celebrated result:
52
Joseph Henry Maclagen Wedderburn (1882-1948), Scottish mathematician.
220 IV. Field extensions
d(a)
Z(a)* := Z(a)\{0}. Let d(a) = [Z(a) : C], so |Z(a)| = q . By 1.27, the
multiplicativity of degrees holds also for extensions of division rings,
*
so Zp ⊆ C ⊆ Z(a) ⊆ K implies da | n. In the group (K , ·) apply the
conjugacy classes formula:
∑a∈S [K * : Z (a ) ],
* * *
|K | = |C | +
S being a system of representatives of the conjugacy classes of ele-
ments not contained in C*. Note that a ∈ S implies da ≠ n (otherwise
Z(a) = K, so a ∈ C*, contradicting the choice of S). In the formula
* n * * qn − 1
above, |K | = q − 1, |C | = q − 1, [K : Z(a)*] = , so:
qd (a ) − 1
qn − 1
q −1=q−1+∑
n
d (a )
. (1)
a∈S q −1
n d(a)
The cyclotomic polynomial (over C) Φn divides (X − 1)/(X − 1)
in Z[X], ∀a ∈ S. Indeed, 3.12.b) implies X − 1 = Φn· ∏ Φ d ; since
n
d n , d ≠n
d(a)
da | n, da ≠ n and X ∏d d (a ) Φ d , the claim follows. So, Φn(q)
−1=
n n d(a)
divides q − 1 and also divides (q − 1)/(q − 1), ∀a ∈ S; from (1)
we obtain that Φn(q) | (q − 1).
On the other hand, |Φn(q)| = ∏ζ ∈P n
q − ζ . We have |q − ζ | > q − 1,
∀n ≥ 2, ∀ζ ∈ Pn (to see this, represent the complex nth roots of unity
in a plane). So |Φn(q)| > q − 1, contradiction with Φn(q) | q − 1. !
IV.3 Finite fields 221
Exercises
(q − 1) | m ; Tm, q = 0 if (q − 1) - m.
d) Let f ∈ F[X1,…, Xn], of (total) degree < n(q − 1). Then:
IV.4 Transcendental extensions 223
5 5
ent over Q because f (e, e ) = 0, where f = Y − X ∈ Q[X, Y]. It is not
known if {e, π} is algebraically independent over Q.
b) The empty set is algebraically independent over any field.
c) Let K be a field and let n ∈ N*. The symmetric elementary
polynomials s1, …, sn are algebraically independent over K in
K(X1,…, Xn). This statement is equivalent to the uniqueness part in the
fundamental theorem of symmetric polynomials: “Any symmetric
polynomial in K[X1,…, Xn] is written uniquely as a polynomial with
coefficients in K of s1, …, sn.”
5
d) In K ⊆ K(X, Y), the elements X and Y are algebraically
independent over K. Prove this. Can you generalize? So,
5 5
K[X, Y] ≅ K[X , Y] (K-isomorphism), although K[X , Y] ( K[X,Y].
∑ f k ( x1,… , xm )α k = 0 ,
0≤ k ≤ n
for some x1, …, xm ∈ S and fk ∈ K[X1, …, Xm], ∀k ∈ {0, …, n}, with
fn(x1, …, xm) ≠ 0.
b) S is called a transcendence basis for the extension K ⊆ L (or a
transcendence basis of L over K) if S is algebraically independent over
K and S algebraically generates L over K.
Exercises
The idea behind modern Galois Theory is the following: for a given
field extension, one associates to it a group (the Galois group of the
extension). Various properties of the extension can then be deduced by
investigating its Galois group. The idea of studying a certain structure
(in our case, a field extension) by associating to it another structure (in
our case, a group) has been very fertile in 20th century mathematics. It
can be found in many areas: Algebraic Topology, Class Field Theory
(recently generalized in the form of the “Langlands Correspondence”),
Algebraic Geometry, Representation Theory, and the list is far from
complete. The recent proof of Fermat's Last Theorem 1 uses Galois
Theory as a basic tool.
1
Pierre de Fermat (1601-1665), French mathematician. “Fermat’s Last
n n n
Theorem” claims that if n ≥ 3 is an integer, then the equation x + y = z has no
positive integer solutions. All attempts at proving this assertion failed until 1995,
although a lot of partial results were proven. In 1995, the English mathematician
Andrew Wiles proved the “Shimura-Taniyama conjecture”, a statement which
implies FLT (a fact proven by Gerhard Frey and Ken Ribet in 1986). It appeared
soon that some parts of the proof were wrong, but A. Wiles and R. Taylor finally
gave a proof which is now accepted as correct. A major part of modern number
theory and Algebra owes its existence to the efforts of proving FLT.
232
V.1 Automorphisms 233
V.1 Automorphisms
S
2
More generally, for any subset S of G(L/K), defining L = {x ∈ L | σ(x) = x,
S
∀σ ∈ S}, one can easily verify that L is a subfield of L containing K and that
S <S>
L = L , where < S > is the subgroup generated by S.
234 V. Galois Theory
ai1… in ∈ K and ∑' indicates the fact that the sum has a finite number of
terms. For such an element, we have:
V.1 Automorphisms 235
3
This is in fact the original view that Evariste Galois had on the notion of group
associated to a polynomial. We shall exploit this point of view later in order to
obtain data on the Galois group of a polynomial.
236 V. Galois Theory
over R (find it!). The same remark can be made for the “conjugates”
γ = a + b d and δ = a − b d , where a, b ∈ Q; γ and δ are conjugate
over Q, in the sense of the above definition.
An algebraic element α over K can have at most n conjugates,
where n is the degree of α over K (n = deg Irr(α, K)). The corollary
3.15.a) says that σ(α) is a conjugate of α, ∀σ ∈ G(L/K).
2 , 3 2ω , 3 2ω 2 and so Q(3 2 ,ω ) =
3
deed, the roots of X − 2 are 3
Exercises
V.3 Separability
4
This terminology, introduced by B. L. van der Waerden, expresses the idea that
the roots of f are "separated" (distinct).
V.3 Separability 245
p
c) If char K = p > 0, then f is inseparable iff f ∈ K[X ]. Further-
more, there exists e ∈ N* and an irreducible separable polynomial
( )
g ∈ K[X], such that f = g X p .
e
5
Recall that Ω is a fixed algebraic closure of K that contains L.
V.3 Separability 249
3.18 Remark. For the extensions of the type K ⊆ K(α, β), with K
infinite, lemma 3.16 gives also a practical procedure to find a primi-
tive element (or, at least, a class of good candidates for it, if the condi-
tion (i) is hard to verify).
The remaining results (until the end of the section) are a somewhat
deeper study of (in)separability. The reader interested primarily in the
Fundamental Theorem of Galois theory may skip directly to the next
section.
Recall that every algebraic extension of a field of characteristic 0 is
separable; thus, all that follows is relevant only in characteristic p > 0.
s
cluded in L, (K L includes any separable extension K ⊆ E with E ⊆ L).
The separable closure of K in Ω (an algebraic closure of K) is called
s s
the separable closure of K, denoted K . Notice that K is the splitting
s
field over K of the family of all separable polynomials in K[X] (thus K
is unique up to a K-isomorphism).
The following concept is, in a certain sense, the opposite of
separability:
(
f = ( X − α )p
e
) = (X
m
pe
−α p
e
)
m
,
so the coefficient of X p ( ) e m −1 e
is (m ⋅ 1)α p . Since m·1 ≠ 0 in K
e
(because p - m), we get α p ∈ K. The polynomial f is irreducible in
e d
K[X], so m = 1; thus f = ( X − α ) p . If α p ∈ K, for some d ∈ N, then
g := X p − α p = ( X − α )p ∈ K[X] and g(α) = 0.
d d d
(α − β ) pe+d
=α ( ) ( )
pe
− β
−1
pd
∈ K.
i
pd
pe
s i
KL KL
and hence are not mutually prime; being irreducible and monic, they
t
are equal. So f = g , where t ∈ N*; t is a power of p, because
m t
t·grad g = p . Let b := g(0) ∈ K. We have b = f(0) = −a. If t > 1,
p p p
−a ∈ K . Since K is a subfield, a ∈ K , contradiction. It follows that
t = 1, so f = g, which is irreducible.
Conversely, if a = b
p
for some b ∈ K, then f = X p ( m −1
−b )
p
,
contradicting that f is irreducible. !
This proposition gives a method to exhibit purely inseparable ele-
ments. In addition, any purely inseparable element can be constructed
by this method (cf. 3.21).
Exercises
i
c) KL = K.
14. Let K ⊆ L ⊆ E be a tower of algebraic extensions, with L ⊆ E nor-
mal and K ⊆ L purely inseparable. Then K ⊆ E is normal.
15. Suppose K ⊆ L is a finite extension of characteristic p > 0, such
p
that L ⊆ K.
a) Prove that K ⊆ L is purely inseparable.
b) Suppose {x1, …, xn} ⊆ L is such that
K ( K(x1) ( K(x1, x2) ( … ( K(x1,…, xn) = L.
n
Show that [L : K] = p (a set {x1, …, xn} with these properties is
called a p-basis of K ⊆ L).
c) Any two p-bases of K ⊆ L have the same cardinal (called the
p-dimension of the extension K ⊆ L).
d) The extension K ⊆ L has a p-basis.
{ }
e) {x1, …, xn} is a p-basis ⇔ x1i1 … xnin i1 ,… , in ∈ {0,… , p − 1} is a
linearly independent set over K ⇔ ∀i ∈ {1, …, n}, xi ∉
K({x1, …, xn} \ {xi}).
16. Let char K = p > 0 and consider L = K(X, Y), the field of rational
p p
functions in two indeterminates X and Y over K. Let F = K(X , Y ).
Show that:
a) F ⊆ L is purely inseparable.
2
b) [L : F] = p .
p
c) For any α ∈ L, α ∈ F.
d) F ⊆ L is not simple.
e) Deduce that F ⊆ L has an infinity of intermediate fields. Prove
that ∀β, γ ∈ F, F(X + βY) = F(X + γY) iff β = γ.
f) {X, Y} is a p-basis of F ⊆ L.
Can an extension of degree p have an infinity of intermediate
fields?
17. Take p = 2 and K = F2 in the previous problem.
262 V. Galois Theory
6
Julius Wihelm Richard Dedekind (1831-1916), German mathematician, one of
the creators of algebraic number theory.
V.4 The Fundamental Theorem of Galois Theory 263
α1σ1(x) + … + αnσn(x) = 0
for any x ∈ G, then α1 = … = αn = 0.
b) If K ⊆ L is a finite extension, then |G(L/K)| ≤ [L : K].
Proof. a) The statement can be rephrased as follows: in the
G
K-vector space K of functions defined on G with values in K, σ1, …,
σn are linearly independent.
Suppose the statement is false. Relabelling if necessary, there exists
m ≤ n and α1, …, αm ∈ K, all nonzero, such that
α1σ1(x) + … + αmσm(x) = 0, ∀x ∈ K. (1)
We may even suppose that m is the smallest having this property,
in the sense that any linear dependence relation between σ1, …, σn has
at least m terms. Since σ1 ≠ σ2, there exists y ∈ G such that
σ1(y) ≠ σ2(y). Replacing x with xy in (1), we have:
α1σ1(xy) + … + αmσm(xy) = α1σ1(x)σ1(y) + … + αmσm(x)σm(y) = 0 (2)
α1σ1(x)σ1(y) + … + αmσm(x)σ1(y) = 0, (3)
(3) is obtained by multiplying (1) with σ1(y). Subtracting (2) from (3)
we get
α2(σ2(y) − σ1(y))σ2(x) + … + αm(σm(y) − σ1(y))σm(x) = 0, ∀x ∈ K.
In this equality, α2(σ2(y) − σ1(y)) is nonzero, so we obtained a linear
dependence relation with less than m terms, contradiction with the
minimality of m.
b) First, we notice that G(L/K) is finite. Indeed, let [L : K] = n and
take {x1, …, xn} a K-basis of L. Then any σ ∈ G(L/K) is determined
by its values in x1, …, xn. But σ(x) is a conjugate of x, ∀x ∈ L, and the
number of conjugates of x is finite. So let G(L/K) = {σ1, …, σm} and
suppose, by contradiction, that m > n. Consider the matrix:
⎡ σ 1 ( x1 ) σ 1 ( x2 ) + σ 1 ( xn ) ⎤
⎢σ ( x ) σ ( x ) + σ ( x ) ⎥
A =⎢ 2 1 2 2 2 n
⎥ ∈ Mm, n(L).
⎢ ! ! * ! ⎥
⎢σ ( x ) σ ( x ) + σ ( x )⎥
⎣ m 1 m 2 m n ⎦
264 V. Galois Theory
7
Emil Artin (1898-1962), Austrian mathematician (he lived most in Germany
and the USA).
V.4 The Fundamental Theorem of Galois Theory 265
G(L E)
Proof. a) L / = {x ∈ L|σ(x) = x, ∀σ ∈ G(L/E)} ⊇ E.
b), c), d) are proposed as exercises. !
arises: “Which are all field extensions for which the Galois connec-
tions are bijective?” The answer to this problem was given in 1951 by
the Romanian mathematician Dan Barbilian 8 (in the paper Soluţia
exhaustivă a problemei lui Steinitz (The exhaustive solution of the
Steinitz problem), Acad. R.P.R., Stud. Cerc. Mat. 2 (1951), 195-259).
His result states that: Any field extension for which the Galois connec-
tions are bijective and inverse to each other is a finite Galois exten-
sion. Notice that, if we suppose that the extension is finite, the result is
a consequence of 4.3.
If K ⊆ L is finite Galois, IF(L/K) and Subg(G(L/K)) are anti-isomor-
phic as ordered sets, since the Galois connections are inclusion-
reversing bijections. Moreover, they are anti-isomorphic as lattices:
sup(E, F) = EF in IF(L/K) corresponds to inf(Φ(E), Φ(F)) = Φ(E)∩Φ(F)
∈ Subg(L/K), that is: G(L/EF) = G(L/E)∩G(L/F). A similar statement
holds for inf(E, F) = E∩F (see problem 7).
We remark that, if K ⊆ L is Galois of degree n and x ∈ L, the
conjugates of x over K (the roots of Irr(x, K)) are exactly
{σx | σ ∈ G(L/K)} =: {x1, …, xm} (where necessarily m | n. Why? We
have m = n iff L = K(x)). So, Irr(x, K) = (X − x1)…(X − xm).
8
Also known as a poet, under the pen name Ion Barbu (1895-1964).
V.4 The Fundamental Theorem of Galois Theory 269
ments, and the number of all possible choices is also 4, the automor-
phisms in G are determined by their action on generators, according to
the following table:
id σ τ η
2 2 − 2 2 − 2
3 3 3 − 3 − 3
For instance, τ( 2 ) = 2 and τ( 3 ) = − 3 . Based on the table
above one can compile the multiplication table of G.
We remark that G ≅ Z2×Z2 (the 4 Klein group), any element of G
being of order 2. The subgroups of G are {id}, <σ > = {id, σ},
<τ > = {id, τ}, <η > = {id, η} and G. So, the extension has 3 proper
intermediate fields, corresponding to the proper subgroups
<σ >, <τ >, <η >. On the other hand, one sees immediately that
Q( 2 ) , Q( 3 ) , Q( 6 ) are proper distinct intermediate fields, so
<σ>
these are all intermediate fields. We have σ( 3 ) = 3 , so L
⊇ Q( 3 ) ; the equality holds since “the degrees match”:
[Q( 3 ) : Q] = [L< σ > : Q]. But [L< σ > : Q] = [L : Q]/[L : L< σ >] = 4/2 = 2
= [Q( 3 ) : Q] . The connections between the remaining subgroups and
the remaining intermediate fields are established similarly.
Here is a sample of the applications of Galois Theory. The follow-
ing result is often used in arguments on field extensions:
ML
L M
L∩M
K
Proof. L is the splitting field over K of a separable polynomial
f ∈ K[X], so ML is the splitting field over M of f (considered in M[X]).
So, M ⊆ ML is normal, finite and separable. Let σ be an automor-
phism in G := G(ML/M). Then σ|L ∈ G(L/K). Consider the group
homomorphism res : G → G(L/K), res(σ) = σ|L, ∀σ ∈ L. It is injective:
∀σ ∈ G with σ|L = id, we have σ = id (since σ and id agree on M and
on L, they agree on ML). Thus G ≅ I, where I is the image of the res
homomorphism; I is a subgroup of G(L/K). Let us show that
I = G(L/M ∩ L). By the fundamental theorem, this is tantamount to the
fact that the fixed fields of I and G(L/M∩L) are equal. We have
I
L = {x ∈ L|σ(x) = x, ∀σ ∈ G(ML/M)}. But {x ∈ ML| σ(x) = x,
I
∀σ ∈ G(ML/M)} = M, so L = M ∩ L. !
The classical Galois theory that we presented here using a “lin-
earized” approach, due to Dedekind and Artin, has generalizations and
counterparts in multiple directions:
Infinite Galois Theory treats the case of an extension that is alge-
braic, normal and separable, but not necessarily finite. The idea is to
make the Galois group of the extension a topological group (by means
of the Krull 9 topology) ; the fundamental theorem reads in this case:
9
Wolfgang Adolf Ludwig Helmuth Krull (1899-1971), German mathematician.
V.4 The Fundamental Theorem of Galois Theory 271
Exercises
3
1. Let L be the splitting field over Q of X − 2. Find G(L/Q) and all
the subfields of L. (Hint: Look for the action of the automorphisms on
2
the generators 3 2 and ω, where ω + ω + 1 = 0).
4
2. The same problem, for the polynomial X − 2 ∈ Q[X].
3. Let G = G(L/Q), where L = Q( 8 2 ). Find G and the fixed field of G.
4. Let K be a field and let K(X) be the rational function field over K.
272 V. Galois Theory
10
A more general way to construct such extensions uses discriminants.
274 VI. Applications of Galois Theory
These are the only constructions allowed. Note that drawing a line
(a circle) does not mean that all points belonging to it are constructed.
A point is constructed only if it is identified as an intersection between
lines (or circles, or lines and circles).
Analyzing the various problems of construction, one realizes that
all are equivalent to a problem of the following type:
“For a given set S of points in the plane, construct a set T of points
(satisfying some property)”.
Indeed, the construction of a line reduces to the construction of two
distinct points on that line; an angle is determined by three points (the
vertex of the angle and a point on each side) etc. The reader is invited
to formulate in this form the problems in the list above.
11
Sometimes CS is defined as the set of circles centered in a point in S with
radius equal to the distance between two arbitrary points in S. The definition we
have adopted for CS corresponds to a “collapsible compass”: one cannot “transport”
with the compass the distance between two points. The two definitions are in fact
equivalent, in the sense that they lead to the same set of constructible points C(S).
Prove this!
12
We omit saying “by the ruler and compass” in what follows.
276 VI. Applications of Galois Theory
13
Of course, we mean the real numbers constructible from S. Omitting S from
the notation K simplifies the notation.
VI.1 Ruler and compass constructions 279
2 2 2 2 2 2
r cos α + r sin α = r , so r ∈ K, r and r ∈ K (which is closed un-
der square roots). We get also cosα, sinα ∈ K. We have
cos(α 2 ) = ± (1 + cosα ) 2 ∈ K. Likewise, sin(α/2) ∈ K, so
z ∈ K[i]. The next lemma shows that any subfield of C containing
the affixes of points in S and is closed under conjugation and square
roots must include K[i]. !
14
The result belongs to M.L. Wantzel, who proved it and published it in 1837
(when he was “elève-ingénieur des Ponts-et-Chaussées”). It seems though that Gauss
knew as early as 1796 this criterion of constructibility.
VI.1 Ruler and compass constructions 283
K0 ⊆ … ⊆ Kn ⊆ K n ( u1 ) ⊆ … ⊆ K n ( u1 ,… , ur ) ,
satisfying the condition that each extension has degree at most 2.
“⇐” We prove by induction on n that, for any tower
Q(SR) = K0 ⊆ … ⊆ Kn ⊆ R, with [Ki : Ki−1] ≤ 2, 1 ≤ i ≤ n , we have
Kn ⊆ K (recall K is the field of the real numbers constructible from S).
If n = 0, then K0 = Q(SR). Using 1.9.a), we deduce that K includes the
subfield of R generated by SR, which is exactly Q(SR). If n > 0,
[Kn : Kn−1] ≤ 2 implies either that Kn = Kn−1 (and the induction
hypothesis shows that we are done) or that Kn is an extension of de-
gree 2 of Kn−1. In this case there exists u ∈ Kn−1, u > 0, such that
Kn = Kn−1( u ). By induction, we know that Kn−1 ⊆ K; since K is
closed under square roots, we deduce Kn ⊆ K. !
15
This result does not say that no angle can be trisected (for instance, a 90° angle
can be trisected), but that some angles (the 60° angle) cannot be trisected. Thus,
there exists no ruler and compass construction of the trisection of an arbitrary angle.
VI.1 Ruler and compass constructions 285
c) Choosing the unit length to be the radius of the circle, the ini-
tially constructible points are O and I. The area of the circle of radius
1 is π, so the side of the square with area π is π . But π is
transcendental over Q and π is also transcendental, so it is not con-
structible. !
For the formulation of a necessary and sufficient criterion of
constructibility, the following complex version of Theorem 1.11 is
useful:
16
Such a prime number is called a Fermat prime. For m = 0, 1, 2, 3, 4, one
obtains 3, 5, 17, 257, 65537, which are indeed prime. No other Fermat primes have
been found yet. Using computers, it has been proven that the above are all the
40000
Fermat primes less than 10 .
17
We say “the angle u” instead of “the angle of measure u”.
288 VI. Applications of Galois Theory
α
Suppose from now on that n = p , p a prime, p ≠ 2, α ∈ N*. We
α
prove that cos(2π/p ) is constructible ⇔ α = 1 and p is a Fermat
prime.
Suppose cos(2π/n) is constructible. Then, using 1.12,
[Q(cos(2π/n )) : Q] is a power of 2. Let ζ := cos(2π/n) + isin(2π/n).
The next lemma says that [Q(ζ ) : Q] = 2·[Q(cos(2π/n)) : Q]. On the
other hand, the degree of the cyclotomic extension Q ⊆ Q(ζ ) is ϕ(n) =
α α α−1 α α−1
ϕ(p ) = p − p , so p − p is a power of 2. Let e ∈ N with
α−1 e e e
p (p − 1) = 2 . If α ≥ 2, then p | 2 , absurd. So α = 1 and p − 1 = 2 ,
e b
whence p = 2 + 1. Suppose e is not a power of 2. Then e = a ·2 , for
b b
some a, b ∈ N, a ≥ 3, a odd. So, p = 22 a + 1. Let d := 22 , so
p = d a + 1. Since a is odd , d a + 1 is divisible by d + 1, so p is not a
prime, contradiction. Hence e must be a power of 2, which means that
p is a Fermat prime.
Conversely, let p be a Fermat prime. Let us prove that cos(2π/p) is
constructible. The extension Q ⊆ Q(ζ ) is normal, of degree
ϕ(p) = p − 1, which is a power of 2. Since ζ = cos(2π/p) + isin(2π/p)
−1
and ζ = cos(2π/p) − isin(2π/p) are in Q(ζ ), cos(2π/p) belongs to
Q(ζ ). The remark following Theorem 1.15. applies and we obtain the
constructibility of cos(2π/p). !
18
In other words, S is a commutative ring with identity and R is a subring of S
containing the identity.
290 VI. Applications of Galois Theory
n
ϕ(ei) = ∑ aij e j , ∀i ∈ {1, …, n}.
j =1
n n−1
Let P(ϕ) := X + an − 1X + … + a1 X + a0; the coefficients ai ∈ R
depend only on ϕ (not on the choice of the basis), so the following
definitions are correct:
Tr(ϕ) := − an − 1 (called the trace of ϕ)
n
det(ϕ) := (−1) a0 (called the determinant of ϕ).
If A = (aij) ∈ Mn(R) is the matrix of ϕ in some basis, then
Tr(ϕ) = Tr(A) = a11 + a22 + … + ann; det(ϕ) = det(A).
For any x ∈ S, the function
θx : S → S, θx(y) = xy, ∀y ∈ S,
is an R-module homomorphism. We apply the definitions above to
θx ∈ EndR(S). The following notations and terms are used:
P(θx) =: P(x, S/R) ∈ R[X] is called the characteristic polyno-
mial of x;
Tr(θx) =: TrS/R(x) ∈ R is called the trace of x;
det(θx) =: NS/R(x) ∈ R is called the norm of x.
It is clear that these notions depend not only on x, but on the exten-
sion S of R. The notations we use take this into account, but not the
terminology, so some caution is recommended. We defined therefore
the mappings trace TrS/R : S → R, and norm NS/R : S → R.
The case we are interested in is when R and S are fields. Fix a finite
extension of fields K ⊆ L. We suppose that all algebraic extensions of
K are subfields of Ω, a fixed algebraic closure of K.
19
This fact is proven exactly as the theorem of transitivity of finite extensions.
296 VI. Applications of Galois Theory
Exercises
n
1. Let K ⊆ L be an extension of finite fields, | K | = q, | L | = q .
a) Write explicit formulas for NL/K(x) and TrL/K(x), ∀x ∈ L.
b) Prove that Tr : L → K is surjective. Can the hypothesis “L is a fi-
nite field” be weakened?
c) Prove that the norm NL/K is a homomorphism of multiplicative
groups N : L* → K*.
d) If α is a generator of the cyclic group L*, then N(α) is a genera-
tor of K*. Deduce that the norm N : L → K is surjective.
e) Calculate the cardinal of the set {x ∈ L | NL/K(x) = 1}.
2. Let n > 1 and let ω be a primitive complex nth root of unity. Show
that NL/Q(ω) = 1, where L = Q(ω).
3. Write down formulas for the trace and the norm of an arbitrary ele-
ment of a quadratic extension Q ⊆ Q ( d ) , with d ∈ Z squarefree.
The same problem for Q ⊆ Q (3 2 ) .
4. Let K ⊆ L be a finite extension and let x ∈ L such that L = K(x) and
n n−1
Irr(x, K) = X + an − 1 X + … + a0.
n
Then NL/K(x) = (−1) a0 and TrL/K(x) = − an − 1.
5. Let p > 2 be a prime and let ω be a primitive complex pth root of
unity. Show that NL/Q(1 − ω) = p, where L = Q(ω). Compute
NL/Q(a − ω), where a ∈ Q. (Hint. Find Irr(1 − ω, Q)).
6. Let L/K be finite Galois. If K ⊆ F ⊆ L is an intermediate field, then
F = K({TrL/F(x) | x ∈ L}). Is it true in general that F = K ({NL/F(x)
| x ∈ L})?
7. Let K be a field and let K(X) be the field of fractions of K[X]. Let
ϕ : K(X) → K(X) be the unique K-homomorphism with
ϕ(X) = 1/(1 − X). Prove that ϕ ∈ AutK(K(X)). Determine the fixed field
3
of ϕ. (Ind. ϕ = id. Use the trace to find an element in the fixed field.)
VI.3 Cyclic extensions and Kummer extensions 297
n i
Proof. a) The roots of X − b are {aω | 0 ≤ i < n}, thus K ⊆ L is
normal. It is also separable (because char K does not divide n), so it is
i
Galois. If σ ∈ G := G(L/K), then σ(a) ∈ {aω | 0 ≤ i < n}. For any
i
σ ∈ G, there exists a unique i < n such that σ(a) = aω ; define
ϕ : G → Z/nZ, ϕ(σ ) = i + nZ. The function ϕ is a injective
homomorphism, since ϕ(σ ) = 0 + nZ implies σa = a ⇒ σ = id. Thus,
G is isomorphic to Imϕ, a subgroup of Z/nZ, so it is cyclic. |G| di-
vides n, by the theorem of Lagrange.
b) Let η be a generator of G. Then m = |G| = ordη. There exists
s
s < n such that η(a) = aω . We have
NL/K(a) = ∏σ ∈G σa = ∏0 ≤ i < m aω = a ω ∈ K,
si m t
m m n m m n n
with t ∈ N, so a ∈ K. Therefore, b = (a ) = (a ) ∈ K* . Let q de-
n n
note the order of bK* in grupul (K*/K* , ·). We have q | m. Let us
q n q n
show that m | q. Since b ∈ K* , b = c , for some c ∈ K. So,
q n q n q i q q
(a ) = b = c , whence a = cω , with i < n. Thus, a ∈ K, so a is
q q q sq q sq q
fixed by η: η(a ) = η(a) = a ω = a . We obtain ω = 1 ⇒ η = id.
This shows that ordη = m divides q. From m | q and q | m we deduce
that m = q.
m m
c) deg Irr(a, K) = [L : K] = m, and a is a root of X − a ∈ K[X]. !
20
This result is included as "Satz 90" in the Hilbert's monumental Bericht über
die Theorie der algebraischen Zahlkörper (Zahlbericht for short, published in 1897).
VI.3 Cyclic extensions and Kummer extensions 301
n n n n n n
σ(a ) = ω a = a , a is fixed by the subgroup <σ > = G, so a ∈ K.
n n
The polynomial X − a ∈ K[X] has the root a and its degree is n, so it
is equal to Irr(a, K). Thus, [K(a) : K] = n and K(a) = L. !
The propositions 3.3 and 3.6 determine all cyclic extensions of de-
gree n of a field K of characteristic exponent p, such that p - n and a K
contains a primitive nth root of unity. The cyclic extensions of degree
p of a field of characteristic p > 0 are described in what follows. This
part is not used in the theory of solvability by radicals and may be
skipped in a first reading.
3.9 Remark. The proof also shows that, for any b ∈ K, where
p
char p > 0, X − X − b is either irreducible in K[X] or splits over K.
21
Otto Schreier (1901-1929), German mathematician.
VI.3 Cyclic extensions and Kummer extensions 303
22
Ernst Eduard Kummer (1810-1893),German mathematician.
304 VI. Applications of Galois Theory
If K contains a primitive nth root of unity and b ∈ K*, 3.6 says that
K( n b )/K is a cyclic extension and its degree divides n, so it is an
n-Kummer extension.
n jn
so σ ◦τ = τ ◦σ. Thus, G(L/K) is Abelian. Also, σ (bi) = biω = bi, so
n
σ = id and the exponent of G(L/K) divides n.
Suppose now that K ⊆ L is n-Kummer. By 3.11, L = L1…Lm, where
each Li is a cyclic extension of K, included in L; G(Li/K) has order di,
with di|n. For the extension K ⊆ Li we can apply prop. 3.6, since ω n di
is a primitive di-th root of unity in K. So, Li = K(bi), with bi ∈ L* and
bidi ∈ K*. Then L = K(b1, …, bm) and clearly bin =: ai ∈ K*. We have
n n
di > 1 (otherwise K = Li), so ai ∉ K* (if ai = c , for some c ∈ K, then,
j
for some j, bi = cω ∈ K and Li = K(bi) = K, absurd).
The last statement is justified as follows: S is a subgroup in (K*, ·)
and K( n S ) ⊆ L. But L = K(b1,…, bm), with bi ∈ L* and bin = ai ∈ K*,
1 ≤ i ≤ m, so ai ∈ S. So, L ⊆ K( n S ). !
This result generalizes 3.3 and 3.6. In order to investigate the struc-
ture of the Galois group of an n-Kummer extension, we need some
background.
n
Proof. ϕ is well defined: if xK*∈ tn(L*/K*), then x ∈ K*; if x,
−1
y ∈ L* such that xK* = yK* ∈ tn(L*/K*), then xy ∈ K*, so
−1 n n n n n n
(xy ) ∈ K* , i.e. x K* = y K* . It is immediate that ϕ preserves
multiplication.
n n n
Let us show Kerϕ = {1K*}. Let x K* = 1K* for x ∈ L* with
n n n n n
x ∈ K. Hence x ∈ K* , so there exists a ∈ K* such that x = a .
Then x = aζ, where ζ is an nth root of unity (ζ ∈ K by hypothesis).
So, x ∈ K* and x K* = 1K*.
n n n n n n n
Imϕ = {x K* | x ∈ L*, x ∈ K*} = {x K* | x ∈ L* ∩ K*}
n n
= (L* ∩ K*)/K* .
The inverse of ϕ is
n n n
ψ : (L* ∩ K*)/K* → tn(L*/K*), ψ(yK* ) = n y K*.
n
The definition of ψ is correct: ∀y ∈ L* ∩ K*, ∃x := n y ∈ L* such
n
that x = y. The class xK* is independent on the choice of the root x of
n
X − y, since any other root is of the form xα, with α ∈ Un(K), so
xαK* = xK*. !
The main theorem says that the n-Kummer extensions of K are in
n
one-to-one correspondence with the finite subgroups of K*/K* . If
K ⊆ L is n-Kummer, then
n n
G(L/K) ≅ tn(L/K) ≅ (L* ∩ K*)/K* .
We need some group theoretical facts, stated in the following
lemma.
3.15 Lemma. Let (G, ·), (H, ·) be finite Abelian groups and let (C, ·)
be a cyclic group. Denote by 1 the neutral elements.
a) Hom(G, C*) ≅ G. (C* is the multiplicative group of nonzero com-
plex numbers). If exp G divides |C|, then
Hom(G, C*) ≅ Hom(G, C) ≅ G.23
23
Hom(G, C*) is called the dual of the group G.
VI.3 Cyclic extensions and Kummer extensions 307
n n n
∀ bK* ∈ (L* ∩ K*)/K* , ∀σ ∈ G(L/K).
Therefore, G(L/K) is isomorphic to the finite subgroup
*n n n
(L ∩ K*)/K* of K*/K* .
b) There exists an order preserving bijection between the set of all
n-Kummer extensions of K included in Ω and the finite subgroups of
n
K*/K* :
VI.3 Cyclic extensions and Kummer extensions 309
n n
the extension K ⊆ L corresponds to the subgroup (L* ∩ K*)/K* ;
n n
the subgroup S/K* , with K* ≤ S ≤ K*, corresponds to the extension
K ⊆ K( n S ).
The degree of the extension is equal to the order of its correspond-
ing subgroup.
Proof. a) Let Un = Un(K) be the group of the nth roots of unity in K
(we know that Un = Un(Ω) is a cyclic group of order n). Define:
p : G(L/K)×tn(L*/K*) → Un, p(σ, xK*) := σ(x)/x,
for any σ ∈ G(L/K) and any x ∈ L* with xK*∈ tn(L*/K*).
The definition is correct: ∀x ∈ L* with xK*∈ tn(L*/K*), we have
n n n
x = a ∈ K, so σ(x) is a root of X − a, so (σ(x)/x) = 1; thus,
σ(x)/x ∈ Un. It is easily seen that p(σ, xK*) is independent on the
choice of the representative of the class xK*.
p is bilinear. Indeed, ∀σ, τ ∈ G(L/K) and ∀xK*, yK*∈ tn(L*/K*), we
have:
στ ( x ) στ ( x ) τ ( x ) ⎛ σ ( x ) ⎞ τ ( x ) σ ( x ) τ ( x )
p(στ, xK*) = = ⋅ =τ⎜ ⎟⋅ = ⋅ =
x τ (x) x ⎝ x ⎠ x x x
p(σ, xK*)p(τ, xK*)
We used first that στ = τσ (G(L/K) is Abelian), then that
σ(x)/x ∈ K, so it is fixed by τ. Also, p(σ,xK*·yK*) = p(σ,xyK*) =
σ(xy)/xy = (σ(x)/x)·(σ(y)/y) = p(σ, xK*)p(σ, yK*).
p is nondegenerate: if σ ∈ G(L/K) and p(σ, xK*) = 1,
n
∀xK* ∈ tn(L*/K*), then σ(x) = x, ∀x ∈ L with x ∈ K. But
n
L = K({x ∈ L*| x ∈ K}) (see 3.12), so σ = id. If xK* ∈ tn(L*/K*) with
p(σ, xK*) = 1, ∀σ ∈ G(L/K), then x is fixed by any σ ∈ G(L/K), so
x ∈ K. Thus, xK* = 1K*.
We have tn(L*/K*) ≅ Hom(G(L/K), Un), via the isomorphism in
n n
Lemma 3.15.c). Using ψ : (L* ∩ K*)/K* → tn(L*/K*), the isomor-
n
phism in 3.14, ψ(yK* ) = n y K*, we obtain the other isomorphism. Fi-
nally, 3.15.a) implies
310 VI. Applications of Galois Theory
n n
tn(L*/K*) ≅ (L* ∩ K*)/K* ≅ G(L/K).
n n
b) Any finite subgroup of K*/K* can be written uniquely as S/K* ,
n n
with K* ≤ S ≤ K* and [S : K* ] finite.
n n
If K ⊆ L is n-Kummer, (L* ∩ K*)/K* ≅ G(L/K), a finite group.
Thus, the subgroup corresponding to the extension K ⊆ L is finite.
n n
Conversely, if K* ≤ S ≤ K*, S/K* is finite, and a1, …, am ∈ K* are the
n
representatives of the classes in S/K* , then
K ⊆ K{ n S } = K{ n a1 ,…, n am }
is an n-Kummer extension (see 3.12). Clearly, these correspondences
are inclusion preserving. They are also inverse to each other, as we
now prove.
n
Let K ⊆ L be n-Kummer. Its associated subgroup is S = L* ∩ K*.
n
To S/K* corresponds K( n S ), equal to L (by 3.12).
n n
Conversely, let K* ≤ S ≤ K*, where ∆ := S/K* is a finite group.
Its associated n-Kummer extension is K ⊆ K( n S ) =: L. Let
n n n
(L* ∩ K*)/K* =: ∆'. We have to prove that ∆ = ∆' (⇔ S = L* ∩ K*).
n
Evidently, S ⊆ L* ∩ K*, so ∆ ⊆ ∆'. For the other inclusion, define
n
p : G(L/K) × ∆ → Un, p(σ, xK* ) := σ( n x )/ n x , ∀σ ∈ G(L/K),
n
∀xK* ∈ ∆.
As in a), one shows that p is correctly defined and bilinear. It is
n n
nondegenerate: if σ ∈ G(L/K) and p(σ, xK* ) = 1, ∀xK* ∈ ∆, then
n n
σ( n x ) = n x , ∀x ∈ S. But L = K( n S ), so σ = id. If xK* ∈ S/K*
with p(σ, xK*) = 1, ∀σ ∈ G(L/K), then n x is fixed by any
*n n n
σ ∈ G(L/K), so x ∈ K and x ∈ K . Consequently, xK* = 1K* .
n
∆' ≅ Hom(G(L/K), Un), by a). Since ∆ ⊆ ∆' and the groups are finite,
they are equal. !
Using the same methods, an analogous result can be obtained for
Abelian extensions of exponent p of a field K having characteristic
p > 0. For any extension K ⊆ L, we use the following notations:
VI.3 Cyclic extensions and Kummer extensions 311
Exercises
5. Let K = C(X, Y) (the fraction field of the polynomial ring C[X, Y ]).
n
Let L = K{ 4 XY , 4 Y 3 }, where, ∀t ∈ K, n t denotes a root of T − t (in
some extension of K). Prove that K ⊆ L is a 4-Kummer extension and
determine its degree and all its intermediate fields.
6. Construct a 6-Kummer extension of degree 18 of C(X, Y).
24
Leonhard Euler (1707-1783), famous Swiss mathematician.
314 VI. Applications of Galois Theory
Henrik Abel gave in 1824 (at the age of 22) a clear and rigorous proof
of this impossibility. In 1829, two months before his death, Abel pub-
lished a memoir in which he describes a class of polynomials solvable
by radicals (namely the polynomials whose Galois group is commuta-
tive). The commutative groups are called today Abelian, in his honor.
In 1830, Evariste Galois, unaware of Abel's results, creates the notion
of group (of permutations) and formulates a general criterion of
solvability by radicals of a polynomial equation, using what we call
today the Galois group of the polynomial. The ideas and results of
Galois had a decisive contribution to the development of Algebra.
2
4.2 Examples. a) The equation aX + bX + c = 0, where a, b,
c ∈ C, a ≠ 0, is solvable by radicals over Q(a,b,c). Indeed, the formula
−b ± b 2 − 4ac
for the roots, x1,2 = shows that the roots xi are in the
2a
2-radical extension Q(a,b,c) ⊆ Q ( a , b, c ) ( )
b 2 − 4ac . Note that, if a,
b, c ∉ Q, then the equation may not be solvable over Q.
b) Any polynomial f ∈ R[X] is solvable by radicals over R. Indeed
the extension R ⊆ C is radical (why?) and C is algebraically closed, so
it includes the splitting field of f over R.
Historically, solvability by radicals means “solvability by radicals
over Q of polynomials in Q[X]”.
We want to prove the following result (Galois' criterion of solvabil-
ity by radicals): If char K = 0, a polynomial is solvable by radicals
over K if and only if its Galois group over K is solvable.
25
If τ ∈ Sn, the conjugation by τ is the automorphism κτ : Sn → Sn defined by
−1
κτ(η) = τ ητ, ∀η ∈ Sn.
322 VI. Applications of Galois Theory
t0 t1 … tn − 1
t1 t2 … tn
D(g) = det
! ! * !
tn − 1 tn … t2n−2
The sums tm can be computed by recurrence using the following
relations (Newton's identities):
t0 = n; t1 = − an − 1;
−tm = an − 1 tm − 1 + an − 2 tm − 2 + … + an − m + 1 t1 + an − m m, if 2 ≤ m ≤ n;
−tm = an − 1 tm − 1 + an − 2 tm − 2 + … + a0 tm − n, if m > n.
Proof. ∆ = ∏i< j (xj − xi) = det A, where A is the Vandermonde ma-
trix:
1 x1 … x1n-1
1 x2 … x2n-1
A=
! ! * !
1 xn … xnn-1
t
The matrix A·A is the matrix in the statement, and
t 2 2
det( A·A) = det(A) = ∆ = D(g).
Viète's relations coupled with Newton's identities (see Appendix)
yield the formulas for the tm. !
The discriminant can also be computed using the resultant, defined
below.
deg(r)
since g(xi) = 0, 1 ≤ i ≤ m. Bur Res(g, r) = a ∏i r(xi) and R3 follows.
Finally, R1 implies R4.
c) By induction on min(deg g, deg h), using R3 and R4. !
A polynomial g has multiple roots if and only if g and its formal
derivative g' have common roots. Thus D(g) = 0 ⇔ Res(g, g') = 0. In
fact, D(g) and Res(g, g') differ only by ± the dominant coefficient of g.
326 VI. Applications of Galois Theory
Exercises
26
The formula was obtained around 1515, by Scipione del Ferro (? - 1526), but
was not published. Niccolò Fontana “Tartaglia” (1500-1557) rediscovered the
formula in 1535 and communicated it to Girolamo Cardano (1501-1576), who
published it in his book “Ars Magna, sive de regulis algebraicis”.
27
The formula was found by Lodovico Ferrari (1522-1565), a student of
Cardano.
328 VI. Applications of Galois Theory
2 2
⎛ x 2 + p + u ⎞ = − qx − r + p + u 2 + 2ux 2 + pu
⎜ 2 ⎟ 4
⎝ ⎠
The right hand side is a perfect square of some polynomial in x if
and only if
2
p2 q ⎞
− qx − r + + u 2 + 2ux 2 + pu = ⎛⎜ x 2u − ⎟
4 ⎝ 2 2u ⎠
d) The last equality is equivalent to
( )
8u 3 + 8 pu 2 + 2 p 2 − 8r u − q 2 = 0 .
(called the cubic resolvent of Ferrari).
e) Let u be a solution of the cubic resolvent. Then the 4 solutions of
f = 0 are:
u −u p εq
x =ε ± − − , where ε ∈ {− 1, 1}.
2 2 2 2 2u
3
6. Let g = X + pX + q ∈ Q[X] be irreducible, and let α ∈ R be a root
3 2
of g. Prove that, if ∆ = − 4p − 27q is the discriminant of g, then
Q(α, ∆ ) is the splitting field of g over Q. Deduce that Q ⊆ Q(α) is
normal if and only if ∆ is the square of a rational number.
3
7. Let g = X + X + 3 ∈ Q[X]. Show that g is irreducible. Let α be the
real root of g and let K be the splitting field of g over Q. Show that
K ∩ Q(α, d ) = Q(α), for any d ∈ Z a positive squarefree integer.
8. (The resultant as a determinant) Let R be a domain, let a, b ∈ R and
let m, n ∈ N*. Consider the polynomials in X, with coefficients in
R[X1,…, Xm, Y1,…, Yn]:
f := a ∏1 ≤ i ≤ m (X − Xi) = ∑0 ≤ j ≤ m ajX
j
g := b ∏1 ≤ i ≤ n (X − Yi) = ∑0 ≤ j ≤ n bjX .
j
⎡ am bn ⎤
⎢a a bn −1 bn ⎥
⎢ m −1 m ⎥
⎢ ! am −1 ! ! bn −1 ⎥
⎢ ! ! ! am b0 ! ! ⎥
D= ⎢ ⎥
⎢ a0 ! ! am −1 b0 ! bn ⎥
⎢ ⎥
⎢ a0 ! ! ! bn −1 ⎥
⎢ ! ! ! ⎥
⎢ ⎥
⎣ a0 b0 ⎦
There are n columns of a's and m columns of b's (the empty places
contain 0's). The point is to prove that Res{ f, g} = det D.
a) Let M be the Vandermonde matrix of dimension m + n,
⎡ Y1m+ n −1 … … Y1 1⎤
⎢ ⎥
⎢ … … ⎥
⎢ Ynm+ n −1 … … Yn 1⎥
M = ⎢ m +n −1 ⎥
⎢ X1 … … X 1 1⎥
⎢ … … ⎥
⎢ m +n −1 ⎥
⎣⎢ X m … … X m 1⎦⎥
Computing det(MD) in two ways, show that det D = Res{ f, g}
b) Prove that Res{ f, g} ∈ R[a0, …, am, b0, …, bn].
9. Let K be a field and let x, y be algebraic elements over K; let
f = Irr(x, K) and let g = Irr(y, K). Let Z be an indeterminate and let
gZ := g(Z + X) ∈ K[Z][X]. View f and gZ as polynomials in X (with
coefficients in K[Z]), and let h := Res{ f, gZ} ∈ K[Z]. Prove that x + y is
a root of h. More precisely, the roots of h are xi + yj, 1 ≤ i ≤ n,
1 ≤ j ≤ m, where {x1, …, xn} are the roots of f and {y1, …, ym} are the
roots of g.
2
10. Let C be the curve in R given by the parametric representation
⎧ x = x (t ) = t 2 + t 2
⎨ , t ∈ R. This means that C = {(x, y) ∈ R | ∃t ∈ R
⎩ y = y (t ) = t − t + 1
2
330 VI. Applications of Galois Theory
331
332 Appendices
2.4 Examples. a) The ring Mn(R) of all square matrices of type n×n
with entries in R is an associative R-algebra with identity
(noncommutative if n ≥ 2). The structural homomorphism takes r ∈ R
to the matrix having r on the diagonal and 0 elsewhere.
b) The polynomial ring R[X] is a commutative R-algebra. If K ⊆ L
is a field extension, L is a K-algebra. Which are the structural
homomorphisms (equivalently, which is the module structure) for
these examples?
80
Called group algebra if G is a group.
2. Algebras. Polynomial and monoid algebras 337
(G, ·) is (N, +), the additive monoid of the natural numbers, then R[G]
is exactly the ring R[X].
⎛ ⎞
⎜ ⎟
∑ 2 ⎜ ∑ϕ2 (s )ψ (t ) ⎟η (v ) = ∑ϕ3(s )ψ (t )η (v ) .
( u ,v )∈G ⎜ ( s ,t )∈G ⎟ ( s ,t ,v )∈G
uv = g ⎝ st =u ⎠ stv = g
⎧0, if h ≠ e
∀r ∈ R, define ψr : G → R by ψ r ( h ) = ⎨ , ∀h ∈ G.
⎩ r, if h = e
The identity of the ring R[G] (the neutral element for multiplication) is
ηe = 1e.
Proof. a) It is evident that ψr + s = ψr + ψs, ∀r, s ∈ R. Computing
ψr·ψs, we obtain ψr·ψs(g) = ∑ψ r (u )ψ s (v ) . If g ≠ e, then, for any cou-
uv =g
c) Exercise.
d) For any h ∈ G, we have
⎛ ⎞ ⎧0, if h ∉ supp (ϕ )
(
⎜ ∑ ag g ⎟ ( h ) = ∑ ψ ϕ ( g )η g ) ( h ) = ⎨ϕ ( h ) , if h ∈ supp (ϕ ) = ϕ ( h )
⎝ g∈ suppϕ ⎠ g∈ supp(ϕ ) ⎩
.
⎧0, if h ≠ g
We used in the last equality that (ψ ϕ ( g )η g ) ( h ) = ⎨ , as
⎩ϕ ( g ) , if h = g
seen at c).
⎛ ⎞
The uniqueness ensues from ⎜⎜ ∑ a g g ⎟⎟(h ) = ah , ∀h ∈ G.
⎝ g∈ G ⎠
e) We show that ηe is the identity of the ring R[G]. For any g ∈ G,
ηgηe = ηge = ηg = ηeηg, by c). The general case is proven using d) and
the distributivity. !
i∈ N
81
Often encountered terminologies are “unknown” or “variable” instead of
“indeterminate”.
342 Appendices
n
Any element in N is written uniquely - up to a order of the terms -
n
as a sum of ei's (in other words, {e1, …, en} generate the monoid N ).
n
The element ηei ∈ R[N ] is denoted by Xi and is called an indetermi-
nate. A product of indeterminates (of the form X 1i1 … X nin ) is called a
n
term. Any polynomial g ∈ R[N ] is then written uniquely as a finite
sum:
g= ∑ ai1…in X 1i1 … X nin ,
( i1 ,…,in )∈' n
( S)
Any element i ∈ N is written uniquely as i = ∑ ms e s , where (ms)s∈S
s∈S
82
is a family of natural numbers indexed by S, having finite support.
( S)
Therefore, ηi = ∏ X sms . An arbitrary polynomial f ∈ R[N ] has a
s∈supp ( i )
( S)
unique writing as f = ∑ aiηi , where F is a finite subset of N ; if
i∈F
β ϕ
(S)
82
For any m ∈ N and i ∈ R[N ], mi is defined as (mi)(s) := m·i(s), ∀s ∈ S.
344 Appendices
n
g = a0 + a1X + … + anX , where an ≠ 0,
the natural number n is called the degree of g, denoted deg g. Thus,
the degree of g is the greatest degree of the monomials of g. We de-
fine by convention deg 0 = −∞. Sometimes deg 0 is not defined.
The elements a0, …, an ∈ R are called the coefficients of the
polynomial g; among these, an is called the leading coefficient of g, a0
is called the constant term.
If the leading coefficient an is 1, then the polynomial g is called
monic.
If R is a domain, then the degree is additive: ∀g, h ∈ R[X1,…, Xn],
deg (gh) = deg g + deg h.
Also:
deg (g + h) ≤ max(deg g, deg h).
If aX 1i1 … X nin is a monomial in R[X1,…, Xn] (where a ≠ 0), and
1 ≤ k ≤ n, then its degree in Xk is deg ( aX 1i1 … X nin , X k ) := ik (the expo-
nent of Xk in the monomial), also denoted deg X k ( aX 1i1 … X nin ) .
For any g ∈ R[X1,…, Xn], deg (g, Xk) is the greatest degree in Xk of
the monomials of g. If R is a domain, then the degree in Xk satisfies
the same relations as above: ∀g, h ∈ R[X1,…, Xn],
deg (gh, Xk) = deg (g, Xk) + deg (h, Xk).
deg (g + h, Xk) ≤ max(deg (g, Xk), deg (h, Xk)).
The total degree of the monomial aX 1i1 … X nin is i1 + … + in; the to-
tal degree of an arbitrary polynomial g ∈ R[X1,…, Xn] is the largest to-
tal degree of its monomials. Usually, the “degree” of a polynomial in
several indeterminates is its total degree, unless otherwise specified. A
polynomial whose monomials have all the same degree is called a
homogeneous polynomial. The total degree satisfies relations similar
to the above, if R is a domain.
348 Appendices
3. Symmetric polynomials
()
nates chosen among {X1,…, Xn}; thus sk has n monomials. By
k
convention, s0 = 1 and sk = 0 if k > n. The polynomial sk is homogene-
ous of degree k (indeed, all its monomials have degree k). Since sk
obviously depends on the number of indeterminates, the notation
sk(X1,…, Xn) is sometimes used to avoid confusions. For example, the
fundamental symmetric polynomials in 4 indeterminates are:
s0 = 1
s1 = X1 + X2 + X3 + X4
s2 = X1 X2 + X1 X3 + X1 X4 + X2 X3 + X2 X4 + X3 X4
s3 = X1 X2 X3 + X1 X2 X4 + X1 X3 X4 + X2 X3 X4
s4 = X1 X2 X3 X4
The fundamental symmetric polynomials appear in the relations be-
tween the coefficients of a polynomial and its roots (Viète's relations).
83
A relation that is reflexive and transitive, but not necessarily antisymmetric.
352 Appendices
3.7 Lemma. a) Let (A, ≤) and (B, ≤) be well ordered sets. Then A×B
is well ordered by the (lexicographic) order defined as (a, b) ≤ (a', b')
if and only if a < a' or (a = a' and b ≤ b').
b) In a well ordered set (A, ≤) there exist no infinite strictly
decreasing sequences.
c) For n ∈ N, the set Tn of the terms in R[X1,…, Xn] is well ordered
by the lexicographic order (thus any strictly decreasing sequence of
terms must be finite).
Proof. a) Recall that the ordered set (A, ≤) is called well ordered if
for any nonempty subset S of A, ∃α ∈ S such that α ≤ a, ∀a ∈ S (α is
unique with this property and is called the smallest element of S. Thus,
A is well ordered if any nonempty subset has a smallest element). Let
∅ ≠ S ⊆ A×B. Since S1 := {a ∈ A | ∃b ∈ B cu (a, b) ∈ S} ≠ ∅, and A is
well ordered, there exists its smallest element α ∈ S1 (so, ∀(a, b) ∈ S,
α ≤ a). Let S2 := {b ∈ B| (α, b) ∈ S}. There exists the smallest element
β of S2. Then (α, β) is the smallest element of S: ∀(a, b) ∈ S, we have
α < a (thus (α, β) < (a, b)) or α = a, in which case b ∈ S2, so β ≤ b.
b) Let (an)n ≥ 1 be a decreasing sequence of elements in A. Then the
set {an | n ≥ 1} has a smallest element ak. For any n ≥ k, we must have
then ak ≤ an; since an ≤ ak (the sequence is decreasing), an = ak and the
sequence is not strictly decreasing.
3. Symmetric polynomials 353
n
c) Induction on n. If n = 1, T1 = {X | n ∈ N} is isomorphic as an
ordered set to (N, ≤), which is well ordered. If n > 1, then Tn, ordered
lexicographically, is isomorphic to Tn−1 × T1 with the order defined as
in a). By induction, Tn−1 is well ordered and, by a), Tn−1 × T1 is well
ordered.
The theorem 3.6 extends easily to symmetric rational fractions.
definition 4.2 is to handle the more general case when the multiplica-
tively closed set S possibly contains zero divisors.
−1
4.6 Proposition. The operations defined above on S R are cor-
−1
rectly defined and S R becomes a commutative ring with identity.
−1
The elements 0 and 1 in S R are:
0 0
0 = = , ∀s ∈ S;
1 s
1 s
1 = = , ∀s ∈ S.
1 s
−1
The mapping ϕ : R → S R, ϕ(a) = a/1, ∀a ∈ R, is a ring
−1
homomorphism, called the canonical homomorphism (thus S R is an
R-algebra).
Proof. We check that the addition is correctly defined. Let (a, s),
(b, t), (a', s'), (b', t') ∈ R × S, such that (a, s) ∼ (a', s') and (b, t) ∼ (b', t').
We must show that (ta + sb, st) ∼ (t'a' + s'b', s't'). Let u, v ∈ S such that
u(s'a − sa') = 0 and v(t'b − tb') = 0. Multiply the first of these equalities
by tt'v and the second by ss'u and add them. We obtain
vu((ta + sb)s't' − (t'a' + s'b')st) = 0.
The rest of the proof (the multiplication is correctly defined; check-
−1
ing the axioms for the ring S R) is left to the reader. !
Note that any s ∈ S is taken by ϕ into an invertible element in
−1 −1
S R: ϕ(s) = s/1 has the inverse 1/s ∈ S R.
Moreover, the homomorphism ϕ is injective ⇔ S contains no zero
divisors. Indeed, a/1 = 0/1 ⇔ ∃u ∈ S such that ua = 0.
−1
If 0 ∈ S, then S R is the zero ring (with only one element,
0/1 = a/s, ∀a ∈ R, ∀s ∈ S); for this reason the condition 0 ∉ S is im-
posed in the definition of a multiplicatively closed set.
−1
The ring S R is called the ring of fractions (or the ring of quo-
tients) of R with respect to the multiplicatively closed set S.
−1
In the important case when R is a domain and S = R \ {0}, S R is a
field, called the field of fractions (or field of quotients) of R, and de-
noted Q(K).
358 Appendices
−1
The “complete” structure of S R is that of R-algebra, the canoni-
cal homomorphism ϕ being the structural homomorphism. In this set-
ting, the above property reads:
For any commutative R-algebra (γ, Τ), where γ : R → T is the
structural homomorphism, such that γ(s) is invertible in T for any
−1
s ∈ S, there exists a unique R-algebra homomorphism g : S R → T.
As expected, the universality property of the ring of fractions deter-
mines the ring of fractions up to a (unique) isomorphism:
x y tx + sy
+ := .
s t st
−1 −1
Moreover, S M is an S R-module with the multiplication defined
by: ∀a ∈ R,∀x ∈ M, ∀s, t ∈ S,
a x ax
⋅ := . !
s t st
−1 −1
The S R-module S M is called the module of fractions (or quo-
tients) of M relative to the multiplicatively closed subset S. The homo-
−1
morphism ϕM : M → S M, ϕM(x) = x/1, ∀x ∈ M, is called the canoni-
cal homomorphism.
The connection between the ideals of R and the ideals of the ring of
fractions is very close. An immediate property is:
The following property says that the any ring of fractions can be
constructed using a saturated multiplicative set.
5. Categories, functors
84
In the Gödel-Bernays-von Neumann theory, the notion of class is a primary
notion. The sets are exactly the classes that are elements of some class.
85
We do not define here this formal language. Roughly speaking, it consists of
expressions (strings of symbols) formed from the atomic expressions (of the type
x = y or x ∈ y) by using the logical operators ∨, ¬, ∧ and the quantifiers ∀ and ∃.
For instance "(∀x(x ∈ y))∧ (∃z(y = z)∨¬(z = a))" is an expression, in which x and z
are bound variables and y is a free variable. If a is assumed to be a constant, then
this expression is a predicate (it has one free variable, namely y). An expression with
no free variables is a proposition.
5. Categories, functors 365
5.3 Examples. a) The category Set of all sets. Its objects are sets. If
A and B are sets, HomSet(A, B) is the set of all functions ϕ : A → B.
The composition of morphisms in Set is the usual function composi-
tion. The identity morphism of A is the identity function of A.
86
Formally, it is the class defined by the predicate H(u) = "∃A∃B(A ∈ Ob C ∧
B ∈ Ob C ∧ u ∈ Hom(A, B))".
366 Appendices
cal spaces, the continuous mapping being the morphisms etc.). In each
situation it is necessary to state exactly the class of the objects of the
category, the set of morphisms between two arbitrary objects, the
composition of morphisms and check axioms 1)-3).
Often the writing A ∈ C replaces A ∈ Ob C, if no confusion arises.
87
For any set A, there exists a unique function ∅ → A, namely the function ∅.
5. Categories, functors 369
cific category the assertion “between any two initial (final) objects in
a category there exists a unique isomorphism”.
5.10 Example. a) The dual of the notion of initial object is the no-
tion of final object.
b) The dual of the notion of monomorphism is the notion of
epimorphism.
c) The notion of isomorphism is autodual.
d) We saw that: for any category C, any two initial objects in C
(if any) are isomorphic. By dualization, one obtains (no new proof
needed): for any category C, any two final objects in C (if any) are
isomorphic.
The intuitive concept of “morphism of categories” is the notion of
functor.
88
in other words, P°(C) is the same thing as P(C°) interpreted in C.
5. Categories, functors 371
89
A group G is, formally, a couple (G, ·), where G is the underlying set of the
group and · : G×G → G is the group operation. Thus, a function from a group (G, ·)
372 Appendices
to a group (H, *) is not the same thing as a function between their underlying sets G
and H. For this reason, Gr is not a subcategory of Set.
6. Solvable groups 373
F (u )
F ( A) ⎯⎯ ⎯ ⎯⎯→ F (B )
αA αB
G (u )
G ( A) ⎯⎯ ⎯ ⎯⎯→ G (B )
If , for any A ∈ C, αA : F(A) → G(A) is an isomorphism in D, then
α : F → G is called a natural isomorphism, in which case the functors
F and G are called naturally isomorphic.
Many “canonical isomorphisms” in module theory (for instance)
express the fact that there exists a natural isomorphism between some
functors.
6. Solvable groups
90
Some authors use in this case the term “subnormal” and call a “normal series”
a chain (S) of normal subgroups of G.
91
V is also called the Klein group (the “Viergruppe”).
6. Solvable groups 375
92
The notion of exact sequence of groups and group homomorphisms is defined
exactly as in the case of modules.
376 Appendices
basis of a module.........................106
A
bilinear function ..........................307
adjoining elements to a field....... 175 nondegenerate.........................307
algebra ........................................ 334 bimorphism .................................367
algebra homomorphism .............. 335
C
algebraic (element) ..................... 177
algebraic closure (absolute) ........ 198 canonical homomorphism .............75
algebraic closure (relative) ......... 189 canonical injections.......................86
algebraic generators .................... 226 canonical projections.....................81
algebraic integer ........................... 27 canonical surjection.......................75
algebraic number ........................ 178 category.......................................364
algebrically independent (set)..... 223 concrete ..................................371
alternating group......................... 374 dual.........................................369
annihilator........................... 104, 133 Cayley-Hamilton theorem ...........163
arithmetically equivalent matrices121 center of a ring ............................334
arrow........................................... 364 characteristic exponent................173
Artin-Schreier theorem ............... 302 characteristic matrix ....................290
ascending chain condition............. 34 characteristic of a ring.................172
associated in divisibility ............... 12 characteristic polynomial ............290
autodual ...................................... 370 of a matrix ..............................161
of an endomorphism...............161
B
Chevalley theorem ......................223
basis change matrix..................... 113 Chinese remainder theorem.........141
379
380
F G
free......................................... 372 Q
initial...................................... 367
quadratic integer ...........................27
zero ........................................ 367
quaternion group .........................208
object (in a category).................. 364
quaternions, skew field of ...........207
opposite of a ring.......................... 60
quotient ................................. 22, 356
order ........................................... 130
R
P
rank of a free module ..................111
p-basis......................................... 261
rational integers.............................27
p-dimension................................ 261
remainder ......................................22
perfect field ................................ 246
resultant.......................................324
p-group ....................................... 378
ring
PID ............................................... 31
integrally closed .......................52
polynomial
of quadratic integers.................28
elementary symmetric............ 349
R-Mod .........................................366
fundamental symmetric ......... 349
root
homogeneous ......................... 347
multiple ..................................194
monic ....................................... 27
simple.....................................194
symmetric .............................. 348
root of a polynomial....................177
polynomial algebras ................... 341
root of unity ................................212
polynomial function ..................... 54
primitive .................................213
prime ............................................ 18
prime field .................................. 172 S
primitive element........................ 175 separable
primitive polynomial .................... 41 element...................................244
principal ideal............................... 20 extension ................................245
principal ideal domain .................. 31 polynomial .............................244
projective linear group................ 272 separable closure.........................253
proper divisor ............................... 13 separable degree..........................248
p-submodule ............................... 133 sequence........................................97
purely inseparable....................... 254 sequence of modules .....................96
purely inseparable closure .......... 255 exact .........................................96
385
1. ALBU, T., ION, I.D. [1984] Capitole de teoria algebrică a numerelor, Ed.
Academiei R.S.R., Bucureşti.
2. ALBU, T., MANOLACHE, N. [1987] 19 Lecţii de teoria grupurilor, Ed.
Universităţii Bucureşti, Bucureşti.
3. ALBU, T., RAIANU, Ş. [1984] Lecţii de algebră comutativă, Ed. Universităţii
Bucureşti, Bucureşti.
4. ANDERSON, F.W., FULLER, K.R. [1974] Rings and categories of modules,
Springer-Verlag, New York.
5. AYAD, M. [1997] Théorie de Galois. 122 exercices corrigés, Ellipses, Paris.
6. BOREVICI, Z.I, ŞAFAREVICI, I.R. [1985], Teoria numerelor, Ed. Ştiinţifică şi
Enciclopedică, Bucureşti.
7. BOURBAKI, N. [1958] Eléments de mathématique, Fasc. VII, Livre II: Algè-
bre, Chapitre 3, Algèbre multilinéaire, Hermann, Paris.
8. BOURBAKI, N. [1967] Eléments de mathématique, Fasc. VI, Livre II: Algèbre,
Chapitre 2, Algèbre linéaire, Hermann, Paris.
9. BOURBAKI, N. [1981] Algèbre, Chapitres 4 à 7, Masson, Paris.
10. BOURBAKI, N. [1985] Eléments de mathématique: Algèbre commutative,
Chapitres 1 à 4, Masson, Paris.
11. ESCOFIER, J.P. [1997] Théorie de Galois, Masson, Paris.
12. FRIED, M., JARDEN, M. [1986] Field Arithmetic, Springer Verlag, Berlin.
13. GALBURĂ, GH. [1961] Corpuri de funcţii algebrice şi varietăţi algebrice, Ed.
Academiei R.P.R., Bucureşti.
14. GALBURĂ, GH. [1972] Algebră, Ed. Didactică şi pedagogică, Bucureşti.
15. GOZARD, I. [1997] Théorie de Galois, Ellipses, Paris.
16. HALL, M. [1959] The Theory of Groups, Macmillan, New York.
17. HERRLICH, H., STRECKER G.E. [1979], Category Theory, second edition,
Heldermann Verlag, Berlin.
18. HUNGERFORD, T.W. [1974], Algebra, Springer-Verlag, New York.
19. ION, I.D., NĂSTĂSESCU, C., NIŢĂ, C. [1984] Complemente de algebră, Ed.
Ştiinţifică and enciclopedică, Bucureşti.
20. ION, I.D., RADU, N. [1981a] Algebra, Ed. Didactică and pedagogică, Bucur-
eşti.
387
388 Bibliography
21. ION, I.D., RADU, N., NITA, C., POPESCU, D. [1981b] Probleme de algebră, Ed.
Didactică and pedagogică, Bucureşti.
22. JACOBSON, N. [1964], Lectures in Abstract Algebra III. Theory of Fields and
Galois Theory, Springer-Verlag, New York.
23. JACOBSON, N. [1974], Basic Algebra I, W.H. Freeman and Co., San Fran-
cisco.
24. KAPLANSKY, I. [1973], Fields and Rings, The University of Chicago Press,
Chicago.
25. LAFON, J.P. [1977] Algèbre commutative. Langages géometrique et algébri-
que, Hermann, Paris.
26. LANG, S. [1964], Algebraic numbers, Addison-Wesley, Reading Massachus-
setts.
27. MACCARTHY, P.J. [1966], Algebraic Extensions of Fields, Blaisdell Publish-
ing, Waltham, Massachusets.
28. MORANDI, P. [1996] Field and Galois Theory, Springer-Verlag, New York.
29. NĂSTĂSESCU, C. [1974] Introducere în teoria mulţimilor, Ed. Didactică şi
pedagogică, Bucureşti.
30. NĂSTĂSESCU, C. [1976] Inele. Module. Categorii, Ed. Academiei R.S.R., Bu-
cureşti.
31. NĂSTĂSESCU, C., NIŢĂ, C. [1979] Teoria calitativă a ecuaţiilor algebrice, Ed.
Tehnică, Bucureşti.
32. NĂSTĂSESCU, C., NIŢĂ, C., VRACIU, C. [1986] Bazele Algebrei, vol. I, Ed.
Academiei R.S.R., Bucureşti.
33. NEUKIRCH, J. [1986] Class Field Theory, Springer-Verlag, Berlin.
34. NITA, C., SPIRCU, T. [1974] Probleme de structuri algebrice, Ed. Tehnică,
Bucureşti.
35. PARENT, D.P. [1978] Exercices en théorie des nombres, Gauthier-Villars, Pa-
ris.
36. POPESCU, N. [1971] Categorii abeliene, Ed. Academiei R.S.R., Bucureşti.
37. PURDEA, I. [1982] Tratat de algebră modernă, vol II, Ed. Academiei R.S.R.,
Bucureşti.
38. RADU, GH. [1988] Algebra categoriilor şi functorilor, Ed. Junimea, Iaşi.
39. RADU, GH., TOFAN, I., GONTINEAC, V. M. [2000] Introducere în algebra
omologică, Ed Universităţii „Al. I. Cuza”, Iaşi.
40. RADU, N. [1968] Inele locale, vol. I, Ed. Academiei R.S.R., Bucureşti.
41. REGHIŞ, M. [1981] Elemente de teoria mulţimilor and logică matematică, Ed.
Facla, Timişoara.
42. SAMUEL, P. [1963] Anneaux factoriels, Sociedade de Matemática de São
Paulo.
43. SAMUEL, P. [1968] Théorie algébrique des nombres, Hermann, Paris.
44. SCORPAN, A. [1996] Introducere în teoria axiomatică a mulţimilor, Ed.
Universităţii Bucureşti, Bucureşti.
45. SPINDLER, K. [1994] Abstract Algebra with Applications, vol. II, M. Dekker,
New York.
6. Solvable389
groups