Norms and Traces

Before discussing norms and traces we introduce some notation for field extensions. If $K\subset L$ are number fields, we let

denote the dimension of

viewed as a

-vector space. If

is a number field and $a\in \overline{\mathbf{Q}}$ , let

be the extension of

generated by

, which is the smallest number field that contains both

and

. If $a\in \overline{\mathbf{Q}}$ then

has a minimal polynomial $f(x) \in \mathbf{Q}[x]$ , and the Galois conjugates of

are the roots of

. The are called the Galois conjugates because the are the orbit of

under the action of $\Gal (\overline{\mathbf{Q}}/\mathbf{Q})$ .

Example 2.4.1 The element $\sqrt{2}$ has minimal polynomial

and the Galois conjugates are $\sqrt{2}$ and $-\sqrt{2}$ . The cube root $\sqrt[3]{2}$ has minimial polynomial

and three Galois conjugates $\sqrt[3]{2}, \zeta_3\sqrt[3]{2}, \zeta_3^2\sqrt[3]{2}$ , where $\zeta_3$ is a cube root of unity.

We create the extension $\mathbf{Q}(\zeta_3)(\sqrt[3]{2})$ in SAGE.

sage: L.<cuberoot2> = CyclotomicField(3).extension(x^3 - 2)
sage: cuberoot2^3
2

Then we list the Galois conjugates of $\sqrt[3]{2}$ .

sage: cuberoot2.galois_conjugates()
[cuberoot2, (-zeta3 - 1)*cuberoot2, zeta3*cuberoot2]

Note that $\zeta_3^2 = -\zeta_3 - 1$ :

sage: zeta3 = L.base_field().0
sage: zeta3^2
-zeta3 - 1

Suppose $K\subset L$ is an inclusion of number fields and let $a\in L$ . Then left multiplication by

defines a

-linear transformation $\ell_a:L\to L$ . (The transformation $\ell_a$ is

-linear because

is commutative.)

Note that if $f\in\mathbf{Q}[x]$ is the characteristic polynomial of $\ell_a$ , then the constant term of

is $(-1)^{\deg(f)}\det(\ell_a)$ , and the coefficient of $x^{\deg(f)-1}$ is $-\tr (\ell_a)$ .

Proposition 2.4.3 Let $a\in L$ and let $\sigma_1,\ldots, \sigma_d$ , where , be the distinct field embeddings $L\hookrightarrow \overline{\mathbf{Q}}$ that fix every element of . Then

$\displaystyle \Norm _{L/K}(a) = \prod_{i=1}^d \sigma_i(a)$ and $\displaystyle \quad \tr_{L/K}(a) = \sum_{i=1}^d \sigma_i(a).$

Proof. We prove the proposition by computing the characteristic polynomial

. Let $f\in K[x]$ be the minimal polynomial of

over

, and note that

has distinct roots and is irreducible, since it is the polynomial in

of least degree that is satisfied by

and

has characteristic 0. Since

is irreducible, we have

, so $[K(a):K]=\deg(f)$ . Also

satisfies a polynomial if and only if $\ell_a$ does, so the characteristic polynomial of $\ell_a$ acting on

. Let $b_1,\ldots,b_n$ be a basis for

over

and note that $1,\ldots, a^m$ is a basis for

, where $m=\deg(f)-1$ . Then

is a basis for

over

, and left multiplication by

acts the same way on the span of $b_j, a b_j, \ldots, a^m b_j$ as on the span of $b_k, a b_k, \ldots, a^m b_k$ , for any pair $j, k\leq n$ . Thus the matrix of $\ell_a$ on

is a block direct sum of copies of the matrix of $\ell_a$ acting on

, so the characteristic polynomial of $\ell_a$ on

is $f^{[L:K(a)]}$ . The proposition follows because the roots of $f^{[L:K(a)]}$ are exactly the images $\sigma_i(a)$ , with multiplicity

(since each embedding of

into $\overline{\mathbf{Q}}$ extends in exactly

ways to

). $\qedsymbol$

It is important in Proposition 2.4.3 that the product and sum be over all the images $\sigma_i(a)$ , not over just the distinct images. For example, if $a=1\in L$ , then $\Tr _{L/K}(a) = [L:K]$ , whereas the sum of the distinct conjugates of

Proof. For the first equation, both sides are the product of $\sigma_i(a)$ , where $\sigma_i$ runs through the embeddings of

into $\overline{\mathbf{Q}}$ that fix

. To see this, suppose $\sigma:L\to \overline{\mathbf{Q}}$ fixes

. If $\sigma'$ is an extension of $\sigma$ to

, and $\tau_1,\ldots, \tau_d$ are the embeddings of

into $\overline{\mathbf{Q}}$ that fix

, then $\sigma'\tau_1,\ldots,\sigma'\tau_d$ are exactly the extensions of $\sigma$ to

. For the second statement, both sides are the sum of the $\sigma_i(a)$ . $\qedsymbol$

The norm and trace down to $\mathbf {Q}$ of an algebraic integer

is an element of $\mathbf {Z}$ , because the minimal polynomial of

has integer coefficients, and the characteristic polynomial of

is a power of the minimal polynomial, as we saw in the proof of Proposition 2.4.3.

Proof. We saw in Lemma 2.3.15 that $\mathbf{Q}\O_K = K$ . Thus there exists a basis $a_1,\ldots, a_n$ for

, where each

is in $\O_K$ . Suppose that as $x=\sum_{i=1}^n c_i a_i\in \O_K$ varies over all elements of $\O_K$ the denominators of the coefficients

are arbitrarily large. Then subtracting off integer multiples of the

, we see that as $x=\sum_{i=1}^n c_i a_i\in \O_K$ varies over elements of $\O_K$ with

between 0 and

, the denominators of the

are also arbitrarily large. This implies that there are infinitely many elements of $\O_K$ in the bounded subset

$\displaystyle S = \left\{c_1 a_1 +\cdots + c_n a_n : c_i \in \mathbf{Q}, 0\leq c_i \leq 1\right\}\subset K.$

Thus for any $\varepsilon >0$ , there are elements $a,b\in \O_K$ such that the coefficients of

are all less than $\varepsilon$ (otherwise the elements of $\O_K$ would all be a ``distance'' of least $\varepsilon$ from each other, so only finitely many of them would fit in

As mentioned above, the norms of elements of $\O_K$ are integers. Since the norm of an element is the determinant of left multiplication by that element, the norm is a homogenous polynomial of degree in the indeterminate coefficients , which is 0 only on the element 0. If the get arbitrarily small for elements of $\O_K$ , then the values of the norm polynomial get arbitrarily small, which would imply that there are elements of $\O_K$ with positive norm too small to be in $\mathbf {Z}$ , a contradiction. So the set contains only finitely many elements of $\O_K$ . Thus the denominators of the are bounded, so for some , we have that $\O_K$ has finite index in $A=\frac{1}{d}\mathbf{Z}a_1 + \cdots + \frac{1}{d}\mathbf{Z}a_n$ . Since is isomorphic to $\mathbf{Z}^n$ , it follows from the structure theorem for finitely generated abelian groups that $\O_K$ is isomorphic as a $\mathbf {Z}$ -module to $\mathbf{Z}^n$ , as claimed. $\qedsymbol$