parametric geometry of numbers and exponents of diophantine …uu.diva-portal.org › smash › get...

82
U.U.D.M. Project Report 2019:29 Examensarbete i matematik, 30 hp Handledare: Andreas Strömbergsson Examinator: Denis Gaidashev Juni 2019 Department of Mathematics Uppsala University Parametric Geometry of Numbers and Exponents of Diophantine Approximation Erik Landstedt

Upload: others

Post on 24-Jun-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

U.U.D.M. Project Report 2019:29

Examensarbete i matematik, 30 hpHandledare: Andreas StrömbergssonExaminator: Denis GaidashevJuni 2019

Department of MathematicsUppsala University

Parametric Geometry of Numbers and Exponents of Diophantine Approximation

Erik Landstedt

Page 2: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation
Page 3: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Parametric Geometry of Numbers and Exponents ofDiophantine Approximation

Erik Landstedt

June 2019

Page 4: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

2

Page 5: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Acknowledgments

I would like to express my gratitude to my supervisor, Professor Andreas Strombergsson, for hisguidence and enthusiasm about this project. Your feedback has truly been helpful together withour discussions.

3

Page 6: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

4

Page 7: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Contents

1 Introduction 7

2 Preliminaries 92.1 Lattices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92.2 Convex Bodies and Minkowski’s First Theorem . . . . . . . . . . . . . . . . . . . . 122.3 Minkowski’s Second Convex Body Theorem . . . . . . . . . . . . . . . . . . . . . . 17

2.3.1 Gauge Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172.3.2 Successive Minima . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182.3.3 Statement of Minkowski’s Second Theorem . . . . . . . . . . . . . . . . . . . 19

3 Rigid Systems and Convex Bodies 253.1 Convex Bodies and Alternating k-forms . . . . . . . . . . . . . . . . . . . . . . . . . 28

4 The Approximation Theorem of Schmidt and Summerer 354.1 Appendix: Translation between Roy’s article and the Article by Schmidt and Summer 41

5 Roy’s Contribution 45

6 Exponents of Diophantine Approximation 676.1 Khinchin’s Transference Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . 696.2 A Consequence of Roy’s Contribution . . . . . . . . . . . . . . . . . . . . . . . . . . 74

5

Page 8: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

6

Page 9: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 1

Introduction

Diophantine approximation is the part of number theory that studies how well an arbitrary realnumber can be approximated by a rational number. The famous Dirichlet approximation theoremasserts that, provided a real number θ and a positive integer n, then there are integers α and βsuch that

|αθ − β| < 1

n

where 1 ≤ α ≤ n. It follows that if θ is irrational, then there are infinitely many rational numbersα/β such that ∣∣∣∣θ − α

β

∣∣∣∣ ≤ 1

β2.

A more general idea is to consider an arbitrary unit vector u ∈ Rn and construct a rationalcodimension one subspace that approximately contains u: Define τ(u) to be the supremum ofη > 0 such that the two inequalities

||x|| ≤ Q

and

|x · u| ≤ Q−η

admit a nonzero solution x ∈ Zn for arbitrarily large values of Q. Then τ(u) is an example of anexponent of Diophantine approximation. There are other exponents as well, and the approximat-ing subspace does not necessarily need to have codimension 1.

The present work is a survey of Roy’s article: On Schmidt and Summerer Parametric Geometryof Numbers, [15], where the author develops a framework which allows many types of Diophantineexponents to be understood in a new light. The building blocks of Roy’s article are the resultsfrom Schmidt and Summerer, [16] and [17], where they construct certain one parameter familiesof convex bodies. The asymptotical behavior of the successive minima of these bodies for largeparameter values is directly connected to a certain family of Diophantine exponents. Schmidt

7

Page 10: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and Summerer show that the successive minima can be approximated with bounded difference bycertain functions which are amenable to analysis.

Roy constructs his own class of approximating functions, called rigid systems. He also proves that,conversely, given any such rigid system, there exists a point in Rn whose corresponding successiveminima approximate the rigid system with a bounded difference. The discussion of this result willform the main part of the present thesis.

Chapter two is dedicated to preliminaries where concepts such as convex bodies and successiveminima are presented. The necessary foundations from the geometry of numbers are also intro-duced here such as Minkowski’s second convex body theorem.

In Chapter three we build up Roy’s theory regarding rigid systems, starting with the constructionof a certain type of convex bodies, in order to be able to present the Approximation Theorem ofSchmidt and Summerer. This theorem is presented in Chapter four together with a translationbetween Schmidt and Summerer’s notation and Roy’s. In Chapter five we focus on Roy’s resultregarding the existence of a rigid system for any point in Rn. Finally, in Chapter 6, we discusssome of the foundations of Diophantine exponents and explain the connection between these andRoy’s result.

8

Page 11: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 2

Preliminaries

2.1 Lattices

The main objective in this introductory section will be the study of lattices and we will introducesome fundamental concepts and results from the geometry of numbers. The content can be foundin [19].

Definition 1. For any linearly independent vectors e1, ..., em in Rn (m ≤ n), the additive subgroupof (Rn,+) generated by e1, ..., em is called a lattice of dimension (or rank) m. We will denotelattices with Λ or Γ.

This means that a lattice of rank m is an abelian group of the form

Λ =m⊕k=1

Zek

and one immediate example is (Z × Z,+). This is a lattice of rank 2 in R2 and the geometricinterpretation can be seen in Figure 2.1. Intuitively we feel that a lattice should be a discretesubset of Rn and this is true together with the converse. It is possible to show that an additivesubgroup of Rn is a lattice if and only if it is discrete.

Example 2. Take e1 = (1, 1) and e2 = (0, 1) in R2. Then the additive group

Λ1 = (α1e1 + α2e2 : α1, α2 ∈ Z,+)

is a discrete subgroup of R2 and hence a lattice. Notice that Λ1 in fact will be Z2. In the sameway it can be noted that if e1, ..., en is a basis in Rn, then

Λ2 =

(n∑k=1

αkek : αk ∈ Z

,+

)

will be a lattice in Rn.

4

9

Page 12: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Figure 2.1: This is an example of a lattice of rank 2 in R2. Hence it is of full rank.

Definition 3. Given a lattice Λ of full rank in Rn, any set of the formn∑j=1

ajej : a1, ..., an ∈ [0, 1)

where e1, ..., en is a set of generators of Λ, is called a fundamental domain of Rn/Λ.

Let Λ be a lattice of rank n in Rn. Then the quotient group Rn/Λ is isomorphic to the n-dimensionaltorus, which we will denote by Tn. This follows from the fact that R/Z ∼= S where S denotes thecircle group. The latter follows using the map ϕ : R→ Z defined by

ϕ(x) = e2πix,

which is clearly a surjective homeomorphism and its kernel is Z. Then R/Z ∼= S follows from thefirst group isomorphism theorem. Then it is natural to define ψ : Rn → Tn ∼= S× · · · × S via

ψ

(n∑j=1

ajej

):= (e2πia1 , ..., e2πian)

10

Page 13: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and the isomorphism Rn/Λ ∼= Tn follows analogously. The volume of a subset D of Rn will bedenoted by vol(D):

vol(D) :=

∫D

dx1...dxn.

It is also necessary to define a volume measure on Tn and to do this we let π : Rn → Rn/Λ be thecanonical homomorphism and put for any smooth subset X ⊂ Rn/Λ

vol(X) := vol(T ∩ π−1(X)), (2.1)

where T is a fundamental domain of Λ. This volume measure will indeed be well-defined and thisstatement is formulated as the following proposition:

Proposition 4. The volume measure, vol, on T defined above is well-defined.

Proof. To prove this, let T and T ′ be two fundamental domains for Tn. The goal is to show that

vol(T ∩X) = vol(T ′ ∩X) (2.2)

where X is any Lebesgue measurable subset of Rn such that we for all x ∈ X and λ ∈ Λ havex + λ ∈ X. We will need the following lemma:

Lemma 5. For every x ∈ Rn there exists a unique λ ∈ Λ such that x ∈ λ + T , where T is afundamental domain of Rn/Λ.

Proof. With

x =n∑k=1

ckek

where c1, ..., cn ∈ R we can take

λ =n∑k=1

bckcek.

Then it follows that every element of Rn/Λ has a unique representation in T .

Lemma 5 above now gives that

Rn =⋃λ∈Λ

(T ′ + λ)

and hence

T ∩X =⋃λ∈Λ

(T ∩X ∩ (T ′ + λ))

11

Page 14: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and since these sets are pairwise disjoint it follows that

vol(T ∩X) =∑λ∈Λ

vol (T ∩X ∩ (T ′ + λ)) .

In a similar way one gets

vol(T ′ ∩X) =∑λ∈Λ

vol (T ∩X ∩ (T ′ + λ))

and the equality in (2.2) is obtained.

2.2 Convex Bodies and Minkowski’s First Theorem

Convex bodies play a fundamental role in the geometry of numbers and we will see that manytheorems require the relevant set to be e.g. convex. What is normally called Minkowski’s firsttheorem asserts that if we have a convex body of a certain size, then it must contain a point of agiven lattice. The precise statement and proof will be discussed below. One of the main objectsthat are studied in both [16] and in [15] are convex bodies but these are also used in many otherareas of mathematics such as optimization theory and geometry.

Definition 6. Let S be a subset of Rn. Then S is called convex if for any x,y ∈ S and λ ∈ [0, 1]we have λx + (1 − λ)y ∈ S. The expression λx + (1 − λ)y (with λ ∈ [0, 1]) is often referred toas a convex combination. Moreover, S is called a convex body if S is a compact, convex andsymmetric region that contains a neighborhood of the origin.

It is somewhat non-standard to require, as we do, every convex body to be symmetric (viz.,−S = S); however, this convention is used in [15], which we will give a survey of later on.Moreover, S will be allowed to be a subset of a finite dimensional Euclidean space and the notionof a convex body translates directly to this situation.

Example 7. The closed unit ball

B = x ∈ Rn : ||x|| ≤ 1

is an example of a convex body in Rn. It is clearly nonempty and symmetric. By the Heine-Boreltheorem it is compact. To show convexity we can pick x,y ∈ B and form the convex combination

λx + (1− λ)y

for 0 ≤ λ ≤ 1 and show that it lies in B. By the triangle inequality we have

||λx + (1− λ)y|| ≤ λ||x||+ (1− λ)||y|| ≤ λ+ (1− λ) = 1

12

Page 15: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

=⇒ λx + (1− λ)y ∈ B.

4

Figure 2.2: The picture shows two sets. The one to the left is convex and the one to the right isnot convex.

The lattice Λ ⊆ Rn will still be assumed to have rank n. We have the following lemma, [19, p.136], that is used to prove Minkowski’s first theorem.

Lemma 8. Let π : Rn → Tn be the canonical homomorphism and let X be a bounded Lebesguemeasurable subset of Rn. Then vol(π(X)) ≤ vol(X) and if the restriction map π|X is injective thenwe have equality.

Proof. Let T be a fundamental domain for Γ. We have, using Lemma 5, that

vol(π(X)) = vol(T ∩ π−1(π(X))

)= vol

(T ∩

⋃λ∈Λ

(X + λ)

).

Since Γ is countable, a property of the Lebesgue measure implies that

vol

(T ∩

⋃λ∈Λ

(X + λ)

)≤∑λ∈Λ

vol(T ∩ (X + λ)

)=∑λ∈Λ

vol(X ∩ (T − λ)

)

= vol

(⋃λ∈Λ

(X ∩ (T − λ)

))= vol

(⋃λ∈Λ

(X − λ) ∩X

)

13

Page 16: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and since ⋃λ∈Λ

(T − λ)

tessellates Rn we get

vol

(⋃λ∈Λ

(T − λ) ∩X

)= vol(X).

Now, if π|X is an injective map, then(T ∩ (X + λ1)

)∩(T ∩ (X + λ2)

)= ∅

for all λ1 6= λ2 ∈ Λ. To see this, assume that x ∈(T ∩ (X + λ1)

)∩(T ∩ (X + λ2)

). Then, in

particular π(x−λ1) = π(x−λ2) since (x−λ1)− (x−λ2) = λ2−λ1 ∈ Λ and injectivity of π|X nowimplies x− λ1 = x− λ2; thus λ1 = λ2, a contradiction to the original assumption. This concludesthe proof of the lemma.

We are now ready to state Minkowski’s first theorem. We use the formulation found in [19,Theorem 7.1, page 140.]

Theorem 9. Let Λ be a lattice of full rank in Rn with fundamental domain T and let X be abounded symmetric convex subset of Rn. If

vol(X) > 2nvol (Rn/Λ)

then X ∩ Λ 6= 0.

Proof. We follow the proof presented in [19]. The idea is to double the size of the given lattice, Λ,and use Lemma 8. The proof idea is rather standard and similar arguments can be found in e.g. [18]where the lattice is is scaled with 1/2 instead. So, let us consider Γ = 2Λ. Any fundamental domainof Γ has volume 2nvol(T ) where T is a fundamental domain of Λ. By assumption 2nvol(T ) < vol(X)and hence the canonical homomorphism π : Rn → Rn/Γ fulfills that

vol(π(X)) ≤ vol(Rn/Γ) < vol(X)

and by Lemma 8 it is given that π|X is not injective. This in turn means that there are elementsx1, x2 ∈ X with x1 6= x2 such that x1 − x2 ∈ Γ. By symmetry and convexity of X we get

1

2(x1 − x2) ∈ X

14

Page 17: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Figure 2.3: This is an illustration of Minkowski’s first convex body theorem.

and since

1

2(x1 − x2) ∈ Λ

we have

1

2(x1 − x2) ∈ X ∩ Λ.

This means that X ∩ Λ 6= 0 and the proof is done.

Example 10. An easy example would be to take Λ = Z3 and X as any ellipsoid centered atthe origin with volume V > 8. Then Minkowski’s convex body theorem implies that the ellipsoidcontains a nonzero lattice point.

4

Example 11. A number-theoretic application of Minkowski’s first theorem is to show the followingfamous result proved in 1770 by Lagrange:

Proposition 12. Every positive integer n is the sum of four integer squares.

Proof. This proof can be found in [19, p. 143] and [5, Theorem 5]. The idea is to first show thestatement for primes and then extend it to all positive integers. We may also assume p is oddsince 2 = 02 + 12 + 02 + 12. If p is an odd prime, then there are integers x and y such that

x2 + y2 ≡ −1 (mod p) (2.3)

15

Page 18: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

holds. Define the lattice Λ as

Λ := Z(x, y, 1, 0)⊕ Z(−x, y, 0, 1)⊕ Z(p, 0, 0, 0)⊕ Z(0, p, 0, 0).

Then the volume of the fundamental domain becomes

vol(Rn/Λ) =

∣∣∣∣∣∣∣∣x −y p 0y x 0 p1 0 0 00 1 0 0

∣∣∣∣∣∣∣∣ = p2

∣∣∣∣ 1 00 1

∣∣∣∣ = p2.

As a symmetric convex set we will consider the sphere centered at the origin with radius R > 0,called BR, in R4. The volume becomes

vol(BR) =

∫∫∫∫x21+x22+x23+x24≤R2

dx1dx2dx3dx4 =π2

Γ(

42

+ 1)R4 =

π2

2R4.

Then

π2

2R4 !

> 24p = 16p

if we for instance take R2 = 19p10

. Then Minkowski’s theorem insures the existence of a nonzerolattice point (a, b, c, d) ∈ Λ ∩BR. This gives

0 < a2 + b2 + c2 + d2 ≤ R2 < 2p.

Hence

a2 + b2 + c2 + d2 ≡ 0 (mod p)

and this means that a2 + b2 + c2 + d2 is a multiple of p and since it is positive and strictly smallerthan 2p we get

a2 + b2 + c2 + d2 = p.

Let n be a composite integer with prime factorization

n =m∏j=1

ptjj .

We have just shown that each prime can be written as the sum of four squares, so

n =m∏j=1

(a2j + b2

j + c2j + d2

j

)tj .16

Page 19: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

This will also be a sum of four squares since

(a2 + b2 + c2 + d2)(e2 + f 2 + g2 + h2) = (ae− bf − cg − dh)2 + (af + be+ ch− dg)2+

+(ag − bh+ ce+ df)2 + (ah+ bg − cf + de)2.

4

2.3 Minkowski’s Second Convex Body Theorem

In order to formulate and prove Minkowski’s second convex body theorem some theory is required.We will give a proof by contradiction following [18].

2.3.1 Gauge Functions

This section is dedicated to the concept of a gauge function introduced by Minkowski. The materialbelow can be found in [18].

Definition 13. Let X be a bounded, convex and open set in Rn containing the origin. The gaugefunction for X is the function f : Rn → [0,∞) such that f(0) = 0, and for every w 6= 0, f(w)is the unique positive real number such that f(w)−1w ∈ ∂X.

We provide some examples to explore what this concept means.

Example 14. Let X be the interior of the unit circle. Then the gauge function corresponding toX is given by

f(x1, x2) =√x2

1 + x22.

4

Example 15. If X is the interior of an ellipsoid with axes a > 0, b > 0 and c > 0, then its gaugefunction is

f(x1, x2, x3) =

√x2

1

a2+x2

2

b2+x2

3

c2.

4

17

Page 20: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Let us note two fundamental properties satisfied by any gauge function f . First,

f(µv) = µf(v), ∀v ∈ Rn, µ ≥ 0. (2.4)

Indeed, f is the gauge function of X, and if µ > 0 and v 6= 0, then (2.4) follows from the factthat (µf(v))−1µv = f(v)−1v ∈ ∂X; and in the remaining case when µ = 0 or v = 0, both sides of(2.4) clearly vanish; hence (2.4) is proved. Secondly,

f(u + v) ≤ f(u) + f(v), ∀u,v ∈ Rn. (2.5)

In order to establish the subadditivity condition, (2.5), we notice that f(v) ≤ 1 for all vectorsv ∈ X := X ∪ ∂X. In particular this means that for any convex combination λu1 + (1−λ)u2 ∈ Xthe following holds:

f(λu1 + (1− λ)u2) ≤ 1, ∀u1,u2 ∈ X. (2.6)

Assume, without loss of generality, that u,v 6= 0 and notice that u/f(u) ∈ X and v/f(v) ∈ Xbecause of the positive-homogeneous condition

f

(u

f(u)

)= 1

and similarly for v/f(v). By defining

λ =f(u)

f(u) + f(v)

and applying (2.6) with u1 := u/f(u) and u2 = v/f(v) it follows that

f

(u + v

f(u) + f(v)

)≤ 1 ⇐⇒ f(u + v) ≤ f(u) + f(v)

and (2.5) is fulfilled.

2.3.2 Successive Minima

One of the main objects that are studied in both [16] and in [15] are convex bodies. These areused in many areas of mathematics such as optimization theory and geometry, but the purposethey will have for us is in the construction of successive minima and hence helping us define a veryfundamental and important function.

Definition 16. Let C be a convex body in Rn, and let Λ be a lattice of full rank in Rn. Forj = 1, ..., n we define λj(C) to be the smallest positive λ ∈ R such that |SpanλC ∩ Λ| ≥ j. Thenumbers λ1(C), ..., λn(C) are called the successive minima of C with respect to Λ.

18

Page 21: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

2.3.3 Statement of Minkowski’s Second Theorem

Let λ1, ..., λn denote the successive minima of a convex body, C, with respect to some latticeΛ ⊂ Rn. Observe that this will correspond to an even gauge function. Minkowski’s second convexbody theorem states the following:

Theorem 17. The successive minima fulfill

2n

n!vol(Rn/Λ) ≤

n∏j=1

λj · vol(C) ≤ 2nvol(Rn/Λ). (2.7)

Proof. The following proof can be found in [18] (we modify it slightly since our C is compactwhereas Siegel works with open convex bodies). Let f be the gauge function of C; then

C = x ∈ Rn : f(x) ≤ 1.

Also, without loss of generality, let Λ = Zn. We will start by proving the second inequality in(2.7). Assume, towards a contradiction, that

n∏j=1

λj · vol(C) > 2nvol(Rn/Zn).

Since vol(Rn/Zn) = 1 the assumption is that

n∏j=1

λj · vol(C) > 2n. (2.8)

Take vectors x1, ...,xn ∈ Zn such that the corresponding successive minima of the gauge functionf are attained at these integral points. Since these vectors are linearly independent they form abasis for Rn.

We now define a map T : C → Rn as follows: Let x ∈ C be given, and choose a1, ..., an ∈ Rn sothat

x =n∑j=1

ajxj.

For k ∈ 0, ..., n− 1 we set

Lk =

k∑j=1

cjxj +n∑

j=k+1

ajxj : c1, ..., ck ∈ R

; (2.9)

this is an affine linear subspace of Rn and so Lk ∩ C is a bounded convex set containing the pointx. Therefore the center of mass of Lk ∩ C, which is denoted c = c(ak+1, ..., an), is a well-definedpoint lying in Lk ∩ C. In particular for k = 0 we get c(a1, ..., an) = x. Finally we define

19

Page 22: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

T (x) := λ1c(a1, ..., an) + (λ2 − λ1)c(a2, ..., an) + · · ·+ (λn − λn+1−1)c(an). (2.10)

This will be an injective map because if

T (x) =n∑j=1

bjxj

then it follows after some calculations that

bj = λjaj +n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an), (2.11)

which means that it is possible to uniquely determine aj from bk for j, k = 1, ..., n. Hence T isinjective. Assume now that

n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an) = 0

for all j = 1, ..., n. Then T will be a rescaling and hence

vol(T (C)) = vol(C)n∏j=1

λj.

If

n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an) 6= 0

and if A is the matrix corresponding to the linear map changing basis to x1, ...,xn we get thatthe volume becomes

vol(T (C)) = detA ·∫∫· · ·∫

(y1,...,yn)∈Rn:∑n

j=1 yjxj∈T (C)

dy1...dyn = detA ·∫∫· · ·∫

D

dy1...dyn.

Without loss of generality, assume that the back substitution of (2.11) isb1 = λ1a1 +

∑n−1k=2(λk+1 − λk)g2(ak+1, ..., an)

b2 = λ2a2

...

bn = λnan

and let the integration interval for y1 be given by the functions ϕ1(a2, ..., an) and ϕ2(a2, ..., an).

20

Page 23: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Then

vol(T (C)) = detA ·∫∫· · ·∫

D

(∫ A2(a2,...,an)

A1(a2,..,an)

λ1dy1

)dy2...dyn

where

A1(a2, ..., an) =1

λ1

(ϕ1(a2, ..., an)−

n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an)

)

and

A2(a2, ..., an) =1

λ1

(ϕ2(a2, ..., an)−

n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an)

).

Now,

vol(T (C)) = detA ·∫∫· · ·∫

D

(∫ A2(a2,...,an)

A1(a2,..,an)

λ1dy1

)dy2...dyn

= detA ·∫∫· · ·∫

D

(ϕ1(a2, ..., an)−

n−1∑k=j

(λk+1 − λk)gj(ak+1, ..., an)− ϕ2(a2, ..., an)

+n−1∑k=j

(λk+1−λk)gj(ak+1, ..., an)

)dy2...dyn = detA·

∫∫· · ·∫

D

(ϕ1(a2, ..., an)− ϕ2(a2, ..., an)) dy2...dyn

= detA ·∫∫· · ·∫

D

dy1...dyn = vol(C).

Hence the volume of T (C) is independent of whether the latter part of (2.11) is zero or not, whichmeans that

vol(T (C)) = vol(C). (2.12)

It is convenient, just as in the proof of Minkowski’s first convex body theorem, to scale the lattice,or equivalently C. Follwing the proof presented in [18], we consider the set 1

2T (C). Since

vol

(1

2T (C)

)= 2−nvol(C)

n∏j=1

λj > 1

by (2.8) and (2.12), it follows by Lemma 8 that there exist two distinct points 12y1,

12y2 ∈ 1

2T (C).

21

Page 24: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Since T is injective we have T−1(y1) 6= T−1(y2), and of course these two points lie in C. Thenthere are two vectors y1 6= y2 ∈ C so that

0 6= 1

2(y1 − y2) ∈ C ∩ Zn.

Notice that T−1(y1), T−1(y2) ∈ C. By expressing both these as linear combinations using vectorsfrom x1, ...,xn we get

T−1(y1) =n∑j=1

αjxj

and

T−1(y2) =n∑j=1

βjxj,

implying that αj 6= βj for some j ∈ 1, ..., n due to injectivity. From the definition of the map Tit follows that:

1

2(y1 − y2) = λ1

1

2(c(α1, ..., αn)− c(β1, ..., βn)) +

1

2(λ2 − λ1)(c(α2, ..., αn)− c(β2, ..., βn)) + · · ·

+1

2(λn − λn+1−1)(c(αn)− c(βn)).

This yields that

f

(1

2(y1 − y2)

)= f

(λ1

1

2(c(α1, ..., αn)−c(β1, ..., βn))+

1

2(λ2−λ1)(c(α2, ..., αn)−c(β2, ..., βn))+· · ·

+1

2(λn − λn+1−1)(c(αn)− c(βn))

)= f

(λ1

1

2(c(α1, ..., αn)− c(β1, ..., βn))

+1

2(λ2 − λ1)(c(α2, ..., αn)− c(β2, ..., βn)) + · · ·+ 1

2(λk − λk−1)(c(αk, ..., αn)− c(βk, ..., βn))

)

≤ λ1 + (λ2 − λ1) + · · · (λk − λk−1 = λk,

where we used that f(x) ≤ 1 for all x ∈ C. This is a contradiction to the choice of the vectorsx1, ...,xn. This finishes the proof of the second inequality of (17).

In order to show the first inequality of (17), it is enough to observe that xk ∈ λk∂C, which inparticular means that 1

λkxk ∈ ∂C. By symmetry of C we have − 1

λkxk ∈ ∂C. After placing out all

2n points we get an n-dimensional octahedron, Oc. See Figure 2.4 for an illustration.

22

Page 25: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Figure 2.4: This figure describes the octahedron in three dimensions.

The volume becomes, after iterating,

vol(Oc) =2n

n!λ1 · · ·λm

and since Oc ⊂ C it follows that

2n

n!λ1 · · ·λm≤ vol(C) ⇐⇒ 2n

n!vol(Rn/Λ) ≤

n∏j=1

λjvol(C).

and the theorem is proved.

23

Page 26: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

24

Page 27: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 3

Rigid Systems and Convex Bodies

The goal of this chapter is to build up the theory required to deal with Roy’s contribution madein [15, Th. 1.3]. Roy’s work is a continuation of what was developed by Schmidt and Summererin [16] and [17]. Hence parts of their work will also be presented in this chapter and we willsee a connection between Roy’s article and that of Schmidt and Summerer. This chapter andChapter 4 will serve as a survey of Roy’s article [15], which means that we follow it closely throughthese chapters also regarding notation. Notationwise, V will always denote a finite dimensionalEuclidean vector space and Λ will denote a lattice of V. We also write V = U⊕W where W 6= 0and U are subspaces of V to split V into a direct sum of orthogonal subspaces.

Initially, the construction of a special type of convex bodies will take place, where the shape of thebody depends on a parameter Q > 0.

Definition 18. Let dimR(V) = m and let W 6= 0 be a fixed linear subspace of V. We then definethe following one parameter family of convex bodies:

C(Q) := x ∈ V : ||x|| ≤ 1 and ||x||W|| ≤ Q−1

for Q ≥ 1 and where x||W denotes the orthogonal projection on W. Furthermore, we let Γ be afixed lattice of full rank in V, and for each j = 1, ...,m, we define Lj : [0,∞)→ R by

Lj(q) = log λj(C(eq))

for q ≥ 0. We also put

L(q) = (L1(q), ..., Lm(q)).

Let the focus now be the foundations of the theory presented in [15]. Some fundamental propertiesof these functions presented in the above definition will be described later on. One thing we mightestablish directly is that L1(q) ≤ · · · ≤ Lm(q) for q ≥ 0 since the natural logarithm function ismonotonically increasing.

25

Page 28: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Figure 3.1: This is an illustration of C(Q) when V = R3. We have two hyperplanes at a distance2Q−1 giving us the set of vectors inside the unit ball that also are in C(Q). These are illustratedas the blue part of the unit sphere.

Example 19. The situation which will be most common for us is when V = Rn (thus m = n) andW = Ru for a fixed unit vector u. In this case we write Cu := C(Q), and we clearly have

Cu(Q) = x ∈ V : ||x|| ≤ 1 and |x · u| ≤ Q−1. (3.1)

Note that since u is a unit vector, the scalar product x · u stands for the orthogonal projectionof the vector x onto u, in accordance with Definition 18. Observe also that if we consider Rn forn ≥ 2 as

Rn ∼= U⊕W (3.2)

with W = Ru for a unit vector u ∈ R and with U = W⊥ it is natural to consider Cu(Q) for Q ≥ 1as a convex body. From this it is possible to consider a special case of L. Initially, define

∆n = x ∈ Rn : x1 ≤ · · · ≤ xn (3.3)

to be the set of all monotone increasing n-tuples of real numbers. Next we define a mapLu : [0,∞)→ ∆n by

Lu(q) = (Lu,1(q), ..., Lu,n(q)) (3.4)

for q ≥ 0 where we for each j = 1, ..., n have

Lu,j(q) := infyy ∈ Rn

≥0 : |Span(Zn ∩ eyCu(eq))| ≥ j.

Then Lu,j(q) = log λj(Cu(eq)) with Γ = Zn is indeed a special case of L.

26

Page 29: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

4

We now prove some first properties of the functions defined above.

Proposition 20. The function L(q) is Lipschitz continuous.

Proof. If 0 ≤ r1 ≤ r2 it follows that C(er2) ⊆ C(er1) and since

e−r2 = er1−r2e−r1

we get

er1−r2C(er1) ⊆ C(er2).

The chain of inclusions then implies that

Lj(r1) ≤ Lj(r2) ≤ Lj(r1) + (r2 − r1) (3.5)

for j = 1, ...,m and since this is true for all 0 ≤ r1 ≤ r2 we see that Lj is continuous since (3.5)implies that

|Lj(r2)− Lj(r1)| ≤ |r2 − r1|,

which means that Lj is Lipschitz continuous with Lipschitz constant 1.

Example 21. Notice that then the corresponding functions

Lu,j(q) = log λj(Cu(eq)) (3.6)

will be continuous for all j = 1, ...,m.

4

27

Page 30: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Moreover, the following property holds:

Lemma 22. The function L(q) is piecewise linear with slopes 0 and 1.

Proof. As a first step, define

λx(C(Q)) = max||x||, Q||x||W||

where x ∈ V and Q ≥ 1. Using this one can define

Lx(q) = log λx(C(eq)) (3.7)

for q ≥ 0. Intuitively this means that whenever x 6= 0 the function Lx allows us to follow thetrajectory of x, which depends on C(eq). A consequence of this is that the set (q, Lx(q))must havea finite number of intersection points with any bounded subset of [0,∞)×R for each x ∈ Λ \ 0.We further notice that Lx is piecewise linear and if x||W = 0 then Lx = log ||x|| and hence it hasslope 0. If x||W 6= 0 it will first have slope 0 and then, as q increases, get slope 1. Hence allcomponent functions L1, ..., Lm will be piecewise linear functions with slopes 0 and 1.

3.1 Convex Bodies and Alternating k-forms

It is possible to somewhat generalize the concept from the previous section by considering spacesof alternating k-forms and see that the corresponding family of convex bodies for such spaceswill work as an approximation of the convex hull of the exterior products of k copies of Cu(Q).Observe that we still follow Roy’s article, [15]. However, it will here be convenient to present somebackground regarding alternating k-forms. A good account on this topic can be found in [1], [14]and [7]. Let

∧(V) denote the exterior algebra of V and recall that, by definition

∧(V) :=

∞⊕k=0

∧k(V)

where∧k(V) is the space of alternating multilinear forms

ϕ : V∗ × · · · × V∗ → R

under the ∧-product operation defined for any k, l ≥ 0 by∧k(V)×

∧l(V)→

∧k+l(V)

and

(ϕ1, ϕ2) 7→ ϕ1 ∧ ϕ2 :=(k + l)!

k!l!

∑σ∈Sn

(signσ)(σ(ϕ1 ⊗ ϕ2)) (3.8)

28

Page 31: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

where Sn denotes the permutation group on n letters. Note also that if e1, ..., en is an orthonormalbasis for Rn, then ej1 ∧ · · · ∧ejk for 1 ≤ j1 ≤ · · · ≤ jk ≤ n will be an orthonormal basis for

∧k Rn

and with U = W⊥ and W = Ru we get∧kRn ∼=

(∧kU)⊕(∧k−1

U ∧ Ru).

If we put W(k) =∧k−1 U ∧ Ru it is natural to consider the following, corresponding, parameter

family of convex bodies:

C(k)u (Q) :=

x ∈

∧kRn : ||x|| < 1 and ||x||W(k) || < Q−1

and the associated maps

L(k)u,j(q) = log(λj(C(k)

u (eq))).

Definition 23. The k:th compound convex body of Cu(Q) is the convex hull of the exteriorproducts of k elements of Cu(Q).

Example 24. For k = 1 we have∧1 Rn ∼= Rn so C(1)

u (Q) = Cu(Q) and for k = n we have∧nU = 0

and hence

C(n)u (Q) =

x ∈

∧nRn : ||x|| < Q−1

. (3.9)

We continue to follow [15] and Roy’s way to describe the theory developed in Schmidt and Sum-

merer [16] and [17]. The following lemma shows that∧kCu(Q) ⊆ kC(k)

u (Q) and can be found in [15,Lemma 2.4.]:

Lemma 25. If x1, ...,xk ∈ Cu(Q) with Q ≥ 1, then x1 ∧ ... ∧ xk ∈ kC(k)u (Q) and in particular if

z1, ..., zk are linearly independent vectors in Zn we have the inequality

L(z1 ∧ ... ∧ zk, q) ≤k∑j=1

L(zj, q) + log k, (3.10)

where q ≥ 0 and the function Lx : [0,∞)→ R is defined by

L(x, q) = maxlog ||x||, q + log ||x||W(k)||. (3.11)

Proof.Pick u ∈ U with U as in (3.2) and define yj = xj||U. For the sake of clarity the inner product x · ywill here be denoted 〈x,y〉, for x and y vectors. First note that

||x1 ∧ · · · ∧ xk|| ≤ ||x1|| · · · ||xk|| ≤ 1

29

Page 32: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and that by the standard projection formula from linear algebra

x1∧· · ·∧xk = (y1+〈x1,u〉u)∧· · ·∧(yk+〈xk,u〉u) = y1∧· · ·∧yk+k∑j=1

(−1)j+k〈xj,u〉y1∧· · ·∧yj∧· · ·∧yk∧u,

where yj stands for that the element yj is omitted in the sum. By taking norms on both sides wesee that

k∑j=1

(−1)j+k〈xj,u〉y1 ∧ · · · ∧ yj ∧ · · · ∧ yk ∧ u = (y1 ∧ · · · ∧ yk)||W(k)

=⇒ ||(y1 ∧ · · · ∧ yk)||W(k)|| ≤k∑j=1

|(−1)j+k| · |〈xj,u〉|k∏i=1i 6=j

||yi|| · ||u|| =k∑j=1

|〈xj,u〉|k∏i=1i 6=j

||yi||

≤k∑j=1

k∏i=1i 6=j

||yi|| ≤k∑j=1

Q−1 = kQ−1.

The conclusion is hence that x1 ∧ · · · ∧ xk ∈ kC(k)u (Q). In order to prove the inequality in (3.10)

we note that exp(L(zj, q))−1zj ∈ Cu(eq) because exp(L(y, q)) > eq. This means that

k∏j=1

z1 ∧ · · · ∧ zkexp(L(zj, q))

∈ kC(k)u (Q).

A consequence is then that

L(z1 ∧ ... ∧ zk) = maxlog ||z1 ∧ ... ∧ zk||, q + log ||(z1 ∧ ... ∧ zk)||W(k)||

≤ log

(k∏j=1

||kz1 ∧ · · · ∧ zk|| exp(L(zj, q))

)≤ log

(k∏j=1

k exp(L(zj, q))

)=

k∑j=1

L(zj, q) + log k.

This completes the proof of the lemma.

In 1955 Kurt Mahler published two articles, [10] and [11], on the compound of convex bodies, andthe main idea and original source for Lemma 26 comes from his theory in these articles. However,for the sake of consistency we will continue to follow Roy’s, [15], way of presenting this result andhis added estimates to the inequality. Define

N := dimR

(∧kRn)

=

(n

k

)

30

Page 33: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and then we state the lemma:

Lemma 26. Order the elements of the setk∑

m=1

Lu,jm(q) : 1 ≤ j1 < · · · < jk ≤ n

in monotone increasing order and define S(k)u,jNj=1 to be the sequence consisting of those. Then

the following inequality holds true:

− log n ≤ S(k)u,j − L

(k)u,j(q) ≤ 2nn log n (3.12)

for q ≥ 0 and 1 ≤ j ≤ N .

Proof. Pick y1, ...,yk as linearly independent vectors of Zn in such a way that L(yj, q) = Lu,j(q)for j = 1, ..., N . This can always be done since there for each j = 1, ..., n exist a successive minimafor Cu(eq). This means in particular that

y1 ∧ · · · ∧ yk

for 1 ≤ j1 < ... < jk ≤ N will be linearly independent in∧k Zn. Let (µ1, ..., µN) be the N -tuple

consisting of the numbers L(y1 ∧ · · · ∧ yk, q) for 1 ≤ j1 < ... < jk ≤ n in monotone increasingorder. Then L(yj, q) ≤ µj for all j = 1, ..., N since L(yj, q) will act as the j:th successive minima

for C(k)u (eq). Lemma 25 now implies that

L(yj, q) ≤k∑j=1

L(zj, q) + log k =⇒ L(yj, q) ≤ S(k)u,j(q) + log k

for j = 1, ..., n. Observe specially that Lu,j(q) is monotonically increasing, which was provedearlier, and hence ∣∣∣∣∣

n∑i=1

Lu,i(q)− q

∣∣∣∣∣ ≤ n log n (3.13)

holds for q ≥ 0 together with∣∣∣∣∣N∑i=1

L(k)u,i(q)−

(n− 1

k − 1

)q

∣∣∣∣∣ ≤(n

k

)log

(n

k

)= N logN. (3.14)

That this is true is due to Minkowksi’s second convex body theorem, Theorem 17. In the lattersituation the inequalities

2N

N !≤

N∏i=1

λi(C(k)u (Q))vol(C(k)

u (Q)) ≤ 2N (3.15)

31

Page 34: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

hold true. The next step is to derive an inequality for the volume of C(k)u (Q). To do this, take

E := e1, ..., eK as an orthonormal basis for∧k−1U ∧ Ru where

K = dimR

(∧k−1U ∧ Ru

).

Define the following convex body using a natural extension of E :

C :=

x ∈

∧kRn : max

1≤j≤K|x · ej| ≤ Q−1, max

K+1≤j≤N|x · ej| ≤ 1

.

The set C will then be an N -dimensional cuboid. Notice that the volume of this cuboid will be2NQ−K . Further we have that N−1C ⊂ C(k)

u (Q) ⊂ C and in terms of volume this means(2

N

)NQ−K ≤ vol(C(k)

u (Q)) ≤ 2NQ−K

=⇒ 1

N !≤

N∏i=1

λi(C(k)u (Q))Q−K ≤ NN .

Taking logarithms on both sides then implies∣∣∣∣∣log

(N∏i=1

λi(C(k)u (Q))

)−K logQ

∣∣∣∣∣ ≤(n

k

)log

(n

k

)= N logN

⇐⇒

∣∣∣∣∣N∑i=1

log λi(C(k)u (Q))−K logQ

∣∣∣∣∣ ≤(n

k

)log

(n

k

)= N logN

⇐⇒

∣∣∣∣∣N∑i=1

L(k)u,i(q)−K logQ

∣∣∣∣∣ ≤ N logN.

and if we change notation by letting Q = eq and observe that

K = dimR

(∧k−1U ∧ Ru

)=

(n− 1

k − 1

)we indeed get (3.14). Showing that (3.13) holds is done in exactly the same way. Now when wehave established (3.13) and (3.14) we continue the proof via the following estimation:

N∑j=1

(S(k)u,j(q)− L

(k)u,j(q) + log k) =

N∑j=1

S(k)u,j(q)−

N∑j=1

L(k)u,j(q) +N log k

=N∑j=1

N∑m=1

1≤j1<···<jk≤n

Lu,jm(q)−N∑j=1

L(k)u,j(q) +N log k =

(n− 1

k − 1

) n∑j=1

Lu,j(q)−N∑j=1

L(k)u,j(q) +

(n

k

)log k

32

Page 35: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

≤(

n

k − 1

)log n+N log(kN)

due to the inequalities in (3.13) and (3.14). Because(n

k − 1

)= kN ≤ nk

it follows that

N∑j=1

(S(k)u,j(q)− L

(k)u,j(q) + log k) ≤ 2

(n− 1

k − 1

)n log n ≤ 2nn log n,

which in addition means that

− log n ≤ S(k)u,j(q)− L

(k)u,j(q) ≤ 2nn log n.

33

Page 36: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

34

Page 37: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 4

The Approximation Theorem of Schmidtand Summerer

This chapter is dedicated to the approximation theorem of Schmidt and Summerer, Theorem 28.The original source for this theorem is [17, p. 57 pp.] but we will still use Roy’s notation. However,translation is needed between the two articles. Unlike Schmidt and Summerer, Roy uses the polarbodies. The representations are equivalent, which Roys points out, and we will provide a proof ofthis in the appendix to this chapter.

For now we focus on the approximation theorem of Schmidt and Summerern, namely Theorem 28.The theorem is originally found in [17]. Let us start with the concept of an (n, γ)-system in orderto state Theorem 28 in the same way as in [15, p. 750]. The notion of (n, γ)-systems is omittedin [17] but since we are following Roy’s article, [15], very closely we use his way of presenting theproof as well. The following definition can be found in [15, Def. 2.8]:

Definition 27. For γ, q0 ≥ 0, a so-called (n, γ)−system on the half line [q0,∞) is a functionP = (P1, ..., Pn) : [q0,∞)→ Rn such that the following holds:

(i) −γ ≤ Pj(q) ≤ Pj+1(q) + γ for 1 ≤ j < n and q0 ≤ q;

(ii) Pj(q1) ≤ Pj(q2) + γ for 1 ≤ j ≤ n and q0 ≤ q1 ≤ q2;

(iii) the function given by Mj := P1 + · · · + Pj : [q0,∞) → R is a continuous function for allj = 1, ..., n and piecewise linear with slopes 0 and 1;

(iv) Mn(q) = q for q0 ≤ q;

(v) if there is some j ∈ 1, ..., n− 1 such that Mj changes slope from 1 to 0 at a point q > q0,then

Pj+1(q) ≤ Pj(q) + γ.

The first natural thing to ask after a definition has been stated is whether or not such an objectexists. The existence of such a function P given in the above definition is guaranteed by thefollowing theorem, found in [15, p. 750]:

35

Page 38: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Theorem 28. For every unit vector u ∈ Rn there exists an (n, γ)-system P : [0,∞) → Rn forγ = 6n2n log(n) such that

supq≥0||P(q)− Lu(q)||∞ ≤ γ.

The original source for Theorem 28 goes back to [17, p. 57 ff.] except the precise value of γ, whichis due to Roy.

Proof. Start by defining j1 = 1, ..., jk = k in (3.12) from Lemma 26.Then for every q ≥ 0 thefollowing inequality holds: ∣∣∣∣∣L(k)

u,1(q)−k∑j=1

Lu,j(q)

∣∣∣∣∣ ≤ 2nn log n. (4.1)

It will be convenient to put Pk = L(k)u,1(q) − L(k−1)

u,1 (q) for all k = 1, ..., n (with L(0)u,1 := 0) because

the triangle inequality implies that

|Pk(q)− Lu,k(q)| ≤ 2n+1n log n =⇒ −2n+1n log n ≤ Lu,k(q)− 2n+1n log n ≤ Pk(q)

and hence the conclusion is that part (i) of the definition is satisfied and hence proved. Moreover,

Pk(q) ≤ Lu,k(q) + 2n+1n log n ≤ Lu,k+1(q) + 2n+1n log n ≤ Pk+1(q) + 2n+2n log n

and for 0 ≤ q1 ≤ q2 the inequality

Pk(q1) ≤ Lu,k(q1) + 2n+1n log n ≤ Pk(q2) + 2n+2n log n

holds for all 1 ≤ k ≤ n and notice that (ii) is fulfilled. We also see that

Mk =k∑j=1

Pj =k∑j=1

(L(j)u,1(q)− L(j−1)

u,1 (q)) = L(k)u,1(q).

By Proposition 20 the function Mk will be continuous for all k = 1, ..., n and by Lemma 22 itwill be piecewise linear with slope 0 and 1. This completes part (iii). From (3.9) it follows that||x||W(n)|| = ||x|| and hence we have that for any x ∈

∧nRn and for any q ≥ 0 that

L(n)u,1(q) = log eq = q.

This proves (iv). Finally, the part that is left is (v). This one is implied from the fact that when

the function L(k)u,1 changes slope from 1 to 0 at q > 0 the equality L

(k)u,1(q) = L

(k)u,2(q) holds. This

is true because if x and y are two linearly independent vectors in Λ such that ddqL

(k)x (q) = 1 and

ddqL

(k)y (q) = 0, then both x and y belong to exp(L1(q))C(k)(eq). Without loss of generality one can

36

Page 39: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

assume that k < n, and by letting j1 = 1, ..., jk−1 = k − 1 and jk = k + 1 in (3.12) from Lemma26 we get that ∣∣∣∣∣L(k)

u,2(q)−k−1∑j=1

Lu,j(q)− Lu,k+1(q)

∣∣∣∣∣ ≤ 2nn log n (4.2)

and combining (4.2) together with (4.1) implies that

∣∣∣∣∣L(k)u,1(q)−

k∑j=1

Lu,j(q)− L(k)u,2(q)−

k−1∑j=1

Lu,j(q) + Lu,k+1(q)

∣∣∣∣∣ ≤ 2n+1n log n ⇐⇒ |Lu,k+1−Lu,k| ≤ 2n+1n log n.

Because Lu,k+1 ≥ Lu,k we get

Lu,k+1 − Lu,k ≤ 2n+1n log n =⇒ Pk+1(q) ≤ Pk + 2 · 2nn log n ≤ Pk + 6 · 2nn log n = Pk + γ.

This proves (v) and hence the existence of an (n, γ)-system P : [0,∞) → Rn for γ = 6n2n log(n)such that

supq≥0||P(q)− Lu(q)||∞ ≤ γ

has been shown.

Schmidt and Summerer derived the same conclusion in [17]. Let µ1, ..., µN denote the successiveminima of the lattice in Rn generated by all exterior products x1 ∧ x2 ∧ · · · ∧ xk for x1, ...,xk ∈ Λ.They derived inequality (4.1) from the fact that

µ1 N∏k=1

λk(C(k)u (eq)). (4.3)

The proof of (4.3) can be found in [10]. Hence there exists a constant c depending on n giving∣∣∣∣∣log µ1 −N∑k=1

log λk(C(k)u (eq))

∣∣∣∣∣ =

∣∣∣∣∣L(k)u,1(q)−

k∑j=1

Lu,j(q)

∣∣∣∣∣ ≤ c.

The rest of the proof is almost the same except for the estimate of γ. Recall that in Example 19we defined

∆n = x ∈ Rn : x1 ≤ · · · ≤ xn.

Next the notion of a canvas will be defined. This is a mathematical object that provides aid tographically understand the rigid n-systems that will be defined later on. The definition below is

37

Page 40: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

cited from Roy’s article, [15], but we notice that Schmidt and Summerer do not use this conceptin their article, [17]. This can be seen as the point where Roy starts to create a new way of lookingat the situation. Eventually the concept of a rigid n-system will be defined and it will be used inthe proof of what is in this thesis called Roy’s contribution.

Definition 29. Let Φn : Rn → ∆n be the continuous function that lists the coordinates of a givenpoint in monotone increasing order and let δ ∈ R+ and s ∈ (N ∪ ∞) \ 0. A canvas withmesh δ and cardinality s in Rn is a triple (a, k, l) where a = aisi=1 is a sequence of pointsin ∆n and where k = kis−1

i=1 and l = lis−1i=1 are two sequences of the same cardinality s such that

for each index 0 ≤ i < s we have that

(i) the coordinates of (a1i , ..., a

ni ) of ai are a strictly increasing sequence of positive multiplies of

δ;

(ii) we have that 1 ≤ k0 ≤ l0 = n and 1 ≤ ki < li ≤ n given that i ≥ 1;

(iii) if i+ 1 < s then ki ≤ li+1, ali+1

i + δ ≤ ali+1

i+1 ;

(iv) we have that

(a1i , ..., a

kii , ..., a

ni ) = (a1

i+1, ..., ali+1

i+1 , ..., ani+1) (4.4)

for i+ 1 < s.

Before we give an example, we note that the sequence a uniquely determines all li and ki, exceptks−1 in the case s <∞.

Example 30. Let a = (2, 4, 6, 8), (2, 4, 8, 12), (2, 4, 12, 14). Then a1 form a strictly increasingsequence of positive multiples of δ = 2. This is also true in a2 and a3. We have that l0 = 4 andthe inequalities

1 ≤ k0 ≤ l1 ≤ 4

1 ≤ k1 < l2 ≤ 4

a0l1

+ 2 ≤ a1l1

a1l2

+ 2 ≤ a2l2,

which together with (4.4) gives that

l = (4, 4, 4)

and

k = (3, 3, k2).

As one can see, we can pick k2 = 3 and then get a canvas with mesh 2 and cardinality 3.

38

Page 41: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

The next definition will help us to understand these canvases graphically.

Definition 31. Let P := (P1, ..., Pn) : [c,∞) → ∆n be a function. The combined graph of Pabove I, where I is a subinterval of [c,∞), is the combined graph of its components restricted to I.

Definition 32. To each canvas of mesh δ > 0 we define P : [q0,∞)→ ∆n given by

P(q) = Φn(a1i , ..., a

kii , ..., a

ni , a

kii + q − qi) (4.5)

for 0 ≤ i < s, qi ≤ q < qi+1 and where qi = a1i + · · · + ani for 0 ≤ i < s and qs = ∞ if s < ∞.

Such a function is called a rigid n-system with mesh δ and qis−1i=0 is its sequence of switch

numbers.

We see from the definition above that if i+ 1 < s then P is a continuous function and that on theinterval [qi, qi+1) the component (akii + q − qi) gives us a line segment of slope 1 since aki < qi forall i = 1, ..., n. The rest of the line segments on this interval will be horizontal.

Example 33. Let us draw the combined graph of a rigid 4-system with mesh 2 with respect tothe canvas given in Example 30. For i = 0 we get

a10 = 2

a20 = 4

a30 = q − 14

a40 = 8

on the interval [qi, qi+1) = [20, 26). If one does similar calculations for the other intervals we getthe combined graph drawn in Figure 4.1. The switch numbers are in this case 20, 26, 32.

Figure 4.1: The combined graph of a rigid 4-system with mesh 2 with respect to the canvas givenin Example 30.

39

Page 42: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

4

Remark 34. More can be said about combined graphs of rigid n-systems. Notice for instancethat in each combined graph in Figure 4.1, the line segments with slopes 1 decrease from left toright. This is no coincidence but follows since we have strict inequality in 1 ≤ ki < li ≤ n fori ≥ 1. This means that a

(i)ki− qi < a

(i+1)ki+1− qi+1 and the statement follows.

Example 35. We will now provide the reader with one more example of a combined graph of arigid n-system. Consider the canvas (3, 9, 12, 15), (3, 9, 15, 21) of cardinality 2 and mesh 3. Thecorresponding rigid 4-system is given by

P(q) = Φn((3, 9, 15, 12 + q − 39)) = Φn((3, 9, 21, q − 27)) (39 ≤ q ≤ 48)

since k0 = 3. This means that we will have three horizontal line segments starting from (0, 3), (0, 9)and (0, 15) together with a line segment of slope one starting at (0,−27) and reaching the verticalline x = qi = 39 at the point (39, 12). Putting k1 = 3 then gives

P(q) = Φn((3, 9, 21, 15 + q − 48)) = Φn((3, 9, 21, q − 33)) (48 ≤ q <∞)

The system will continue to look like this for all q ≥ 48. See an illustration in Figure 4.2.

The idea is to consider rigid n-systems where some of the components of the corresponding com-bined graph are unbounded. The behaviour of the system when q approaches infinity is relatedto exponents of Diophantine approximation; something that Roy has noticed and uses to describethe spectrum related to some of them.

Figure 4.2: This is the combined graph of a rigid 4-system with mesh 3 with respect to the canvasgiven in Example 35.

4

40

Page 43: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

4.1 Appendix: Translation between Roy’s article and the

Article by Schmidt and Summer

Schmidt and Summerer, [17], consider another type of convex body than Roy does. Define forQ ≥ 1

K(q) := (x1, ..., xn) ∈ Rn : |x1| ≤ e(n−1)q, |xi| ≤ e−q ∀i > 1

and for ξ1, ..., ξn ∈ R, let Λ = Λ(ξ1, ..., ξn−1) be the lattice

Λ = (y, ξ1y − y1, ..., ξn−1y − yn−1) : (y, y1, ..., yn−1) ∈ Zn.

Observe that K(q) is a convex body and that it contains a nonzero lattice point by Minkowski’sfirst theorem. Then K(q) will be the convex body considered by Schmidt and Summerer in [17].The authors also define

Li(q) = log λi(Λ,K(q)) (4.6)

for i = 1, ..., n. So, in order to translate these functions to the ones which Roy studies, we fix alinear map A ∈ GLn(R) in such a way that Λ = AZn. The definition of successive minima thenimplies

λi(Λ,K(q)) = λi(A−1Λ, A−1K(q)) = λi(Zn, A−1K(q)). (4.7)

Let e1 = (1, 0, ..., 0) and define a unit vector in Rn by

u := ||A−1e1||−1A−1e1 (4.8)

and fix a rotation R ∈ SO(n) such that Re1 = u. Now A−1 maps K(q) to a “skew” long, thincylinder (for q large), whose axis is in the direction of A−1e1, i.e. the direction of u. In fact, by acomputation one shows that there exist constants 0 < α1 < α2, which may depend on A but noton q, such that

α1RK(q) ⊂ A−1K(q) ⊂ α2RK(q), ∀q ≥ 0. (4.9)

We will now change notation to Li(Λ,K(q) = log λi(Λ,K(q)) and use that (4.9) holds for all q ≥ 0in order to get

|Li(Λ,K, q)− Li(Zn, RK(q))| ≤ O(1), ∀q ≥ 0,

which means that |Li(Λ,K, q) − Li(Zn, RK(q))| is uniformly bounded for all q ≥ 0. In order tocontinue, the notation of a dual body is now needed and is defined as follows:

41

Page 44: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Definition 36. Let C be a convex body. Then the dual body, or polar body, of C is denoted C∗and is defined as the set

C∗ = x ∈ Rn : x · c ≤ 1,∀c ∈ C. (4.10)

One can show that, up to a bounded rescaling, the dual of Ce1(enq) is equal to eqK(q). That is,there exist constants 0 < α3 < α4 such that, for all q ≥ 0,

α3Ce1(enq) ⊆ (eqK(q))∗ ⊂ α4Ce1(enq),

which gives, since u = Re1, that

α3Cu(enq) ⊂ (eq ·RK(q))∗ ⊂ α4Cu(enq). (4.11)

We will use the following theorem, found in [4, Thm. VI]:

Theorem 37. Let λ1, ..., λn denote the successive minima of a lattice Λ with respect to the gaugefunction f and let λ∗1, ..., λ

∗n denote the successive minima of the polar lattice Λ∗ with respect to the

gauge function f ∗ polar to f . Then

1 ≤ λjλ∗n+1−j ≤ n! (4.12)

for 1 ≤ j ≤ n.

Remark 38. The notion of a dual lattice of Λ ⊂ Rn is defined as

Λ∗ = x ∈ Rn : x · Λ ⊂ Z.

Now Theorem 37 gives that

1 ≤ λj(Zn, eq ·RK(q)) · λn+1−j(Zn, (eqRK(q))∗) ≤ n!

and hence

α3 ≤ λj(Zn, eqRK(q)) · λn+1−j(Zn, Cu(enq)) ≤ α4n!, (4.13)

where we also used the fact that (Zn)∗ = Zn. Let L[SS]j denote the corresponding function in the

article of Schmidt and Summerer, [17], and L[Roy]j the function in [15]. Taking logarithms in (4.13)

now gives

O(1) ≤ L[SS]j (Zn, eq ·RK(q)) + L

[SS]n+1−j(Zn, Cu(enq)) ≤ O(1), ∀q ≥ 0.

42

Page 45: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

This means that

L[SS]j (Λ,K, q) = L

[SS]j (Zn, RK(q)) +O(1) = q − L[Roy]

u,n+1−j(nq) +O(1). (4.14)

43

Page 46: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

44

Page 47: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 5

Roy’s Contribution

In Roy’s discussion of the results of Schmidt and Summerer in [16] and [17] he contributes a newresult regarding the existence of rigid systems and this result has applications in Diophantineapproximation. In particular, Roy proves that for a given unit vector u in Rn, with n ≥ 2, thereexists a rigid system P : [q0,∞) → ∆n such that Lu − P is bounded on [q0,∞). In this sectionwe will state his theorem and reproduce, from [15], his proof. The following discussion will relyon the (n, γ)-systems introduced in the previous section. First, we consider a special type of sucha system.

Definition 39. For q0 ≥ 0 we call an (n, γ)-system P = (P1, ..., Pn) on [q0,∞) an (n, γ)-reducedsystem if, for any j ∈ 1, ..., n− 1, a ≥ q0 and b ≥ a+ nγ such that P1 + · · ·+ Pj is constant onthe interval [a, b], we have that each function P1, ..., Pj is constant on [a, b− nγ].

Example 40. For γ = 0 we have that every (n, 0)-system is an (n, 0)-reduced system. Indeed, letP = (P1, ..., Pn) be an (n, 0)-system. Then condition (ii) in Definition 27 says that each Pj is amonotonically increasing function on [q0,∞). Consider any j ∈ 1, ..., n− 1 and any b ≥ a ≥ q0

such that P1+· · ·+Pj is constant on [a, b]. Since each function P1, ..., Pj is monotonically increasing,it then follows that each P1, ..., Pj must be constant on [a, b]. Hence P is an (n, 0)-reduced system.

4

The following theorem will serve as a major building block later on and it can be found as Propo-sition 7.1. in [15, p. 775]:

Theorem 41. Let γ and δ be real numbers such that 0 ≤ γ < δ/2n2. Let P : [0,∞) → Rn be an(n, γ)-reduced system and let q0 = n(n + 1)δ/2. Then there is a rigid n-system R : [q0,∞) → Rn

with mesh δ such that

||P(q)−R(q)||∞ ≤ 3n2δ

for all q ≥ q0.

Proof. We follow [15] closely throughout the entire proof. As a first step, we prove by a simplescaling argument that we may, without loss of generality, assume that δ = 1. Indeed, let γ, δ,P

45

Page 48: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and q0 be given as in Theorem 41; then define the function P : [0,∞)→ Rn by P(q) = δ−1P(δq).This is then an (n, γ/δ)-reduced system, and we note that 0 ≤ γ/δ < 1/2n2. Hence, if Theorem 41holds for “δ = 1”, it follows that there exists a rigid n-system R : [n(n+1)/2,∞)→ Rn with mesh1 such that R(q) = δR(q/δ). Then R is a rigid system with mesh δ, and ||P(q)−R(q)||∞ ≤ 3n2δfor all q ≥ q0, and so we have proved the theorem in the general case.

Hence from now on we assume δ = 1; thus 0 ≤ γ < 1/2n2 and q0 = n(n + 1)/2. Define newfunctions P j : [0,∞)→ R by setting

P j(q) := γ + supPj(t); 0 ≤ t ≤ q

for all j ∈ 1, ..., n and q ≥ 0. Then it follows that

−γ ≤ supPj(t); 0 ≤ t ≤ q =⇒ 0 ≤ P j(q) ∀q ≥ 0, (5.1)

and one can also show that P j(q) ≤ P j+1(q) + γ is true for all j = 1, ..., n − 1. Since Pj(q1) ≤Pj(q2) + γ for 1 ≤ j ≤ n it follows, using q0 ≤ q1 ≤ q2, that the inequality

Pj(q) + γ ≤ P j(q) ≤ Pj(q) + 2γ ∀q ≥ 0 (5.2)

holds. Since each function Pj(t) is piecewise linear with slopes −1, 0 or 1, it follows that eachfunction P j is piecewise linear with slopes 0 or 1. Following Roy, [15], a sequence of functionsEi : [0,∞)→ N∗ is defined recursively via

E1(q) = bP 1(q)c+ 1, j = 1

Ej(q) = maxEj−1(q) + 1, bP j(q)− 2(j − 1)γc+ 1, j = 2, ..., n.(5.3)

Recall that δ = 1 by assumption and that this means that γ < 1/(2n2). A conclusion that can bedrawn here is that for all j = 2, ..., n

|Ej(q)−Pj(q)| = |maxEj−1(q)+1, bP j(q)−2(j−1)γc+1−Pj(q)| ≤ max2(j−1)γ, j(1+γ)+2γ ≤ 1+n

=⇒ ||E(q)−P(q)||∞ ≤ 1 + n, (5.4)

where (5.2) is used. To see one other similarity we state the following Lemma:

Lemma 42. For all q ≥ 0 we have that

P j(q)− 2(j − 1)γ < Ej(q) ≤ P j(q) + j(1 + γ),

for j = 1, ..., n.

46

Page 49: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Proof. The proof of this lemma can be done with the help of mathematical induction. SinceP j(q) < bP 1(q)c + 1 ≤ P j(q) + 1 + γ we see that the inequalities are true for j = 1. For aninduction hypothesis, assume that the above holds for some integer j = p where 1 ≤ p < n. Then

P p+1(q)− 2pγ < bP p+1(q)− 2γc+ 1 ≤ maxEp(q) + 1, bP p+1 − 2γc+ 1 = Ep+1(q).

By definition we have

Ep+1 = maxEp(q) + 1, bP p − 2pγc+ 1 ≤ maxEp(q) + 1, P p+1 − 2pγ + 1

and the induction hypothesis implies that

maxEp(q)+1, P p+1−2pγ+1 ≤ maxP p(q)+p(1+γ)+1, P p+1(q)−2pγ+1 ≤ P p+1(q)+(p+1)(1+γ),

since Pj(q) ≤ Pj+1(q). Hence we have proved the lemma.

Another property is that ∣∣∣∣∣n∑l=1

Ek(q)− q

∣∣∣∣∣ ≤ n(n+ 1), (5.5)

which follows since

n∑k=1

Pk(q) = q.

Indeed,

∣∣∣∣∣n∑k=1

Ek(q)− q

∣∣∣∣∣ =

∣∣∣∣∣n∑k=1

Ek(q)−n∑k=1

Pk(q)

∣∣∣∣∣ =

∣∣∣∣∣n∑k=1

Ek(q)− Pk(q)

∣∣∣∣∣ ≤n∑k=1

|Ek(q)− Pk(q)|

≤n∑k=1

(n+ 1) = n(n+ 1).

47

Page 50: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

We will also need the following lemma:

Lemma 43. For every j = 1, ..., n, the function Ej admits at most one point of discontinuity onI, where I = (a, b) is any subinterval of [0,∞) of length less than or equal to 1.

Proof. We will argue by contradiction, so assume, on the contrary, that there exist j ∈ 1, ..., nsuch that Ej admits more than one discontinuity on I. Suppose q1 < q2 are two such points withthe property that Ej is constant on (q1, q2) ⊂ I and let j be the minimal index so that Ej isdiscontinuous in these points. Assume that Ej = k on (q1, q2), i.e. that Ej is constant on thisinterval. Then by definition of Ej and the assumption about discontinuity at q1 we have

limt→q−1

Ej(t) = k − 1 (5.6)

and

limt→q−2

Ej(t) = k. (5.7)

The discontinuity assumption at q2 then implies that

Ej(q2) = k + 1.

See Figure 5.1 for an illustration.

Figure 5.1: This illustrates the equalities (5.6) and (5.7) above.

This means that

P j(q1) ≤ k − 1 + 2(j − 1)γ.

48

Page 51: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

We also see that

P j(q2) ≤ P j(q1) + 1

since q2 − q1 < 1 and P j(q2) increases at most with slope 1. Hence

bP j(q2)− 2(j − 1)γc+ 1 ≤ k.

Observe that from before it is know that Ej(q2) = k + 1. By definition,

maxEj−1(q2) + 1, bP j − 2(j − 1)γc+ 1 = Ej−1(q2) + 1!

= k + 1,

which gives Ej−1(q2) = k. So, in order for Ej to have a discontinuity at q2 the function Ej−1 mustalso have a discontinuity there. However, we chose j to be the minimal index such that Ej admitsat least two discontinuities. Hence q2 must be the only one for Ej−1 and then we get

limt→q−2

Ej−1(q2) = k − 1.

Since it only has one discontinuity we can conclude that Ej−1 must be constant on (a, q2). Butthis means that

limt→q−1

Ej−1(t) = limt→q−1

Ej(t),

which gives a contradiction. Hence we have proved the lemma above.

Using part (iv) of Definition 27 it is possible to conclude that

P j(q) ≤n∑j=1

Pj(q) + 2γ = q + 2nγ < 1 (5.8)

for all 1 ≤ j ≤ n and provided that 0 ≤ q ≤ 1/2. Hence we see that

maxEj−1(q2) + 1, bP j − 2(j − 1)γc+ 1 = Ej−1(q2) + 1

and it can be deduced that E1 = 1

E2 = 2...

En = n,

49

Page 52: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

for these q ∈ [0, 1/2]. In particular, E is constant on [0, 1/2], which means that the discontinuitiesmust be an infinite discrete subset of [1/2,∞). Following Roy’s notation we let Σ denote the setof all discontinuities of E in [1/2,∞] and we put

Σ = Σ + [−nγ, nγ]. (5.9)

We can then express Σ as

Σ =∞⋃n=0

[an, bn]

where 0 ≤ a0 < b0 < a1 < b1 < · · · such that

2nγ ≤ bi − ai < 1

for all i ≥ 0. Let Ω(i) be the set of indices j such that Ej is not constant on [ai, bi] and we let Ω(q)be the nonempty set of indices such that Ej is discontinuous at q. The reason Roy introduces thisset is because it will help in the construction of the desired rigid n-system. Namely, if we let

k(i) := maxj ∈ Ω(i)|j = 1 or Ej−1(bi) < Ej(bi)− 1 (5.10)

and

l(i) := minj ∈ Ω(i)|j = n or Ej+1(ai) > Ej(ai)− 1 (5.11)

one can prove that k(i+1) ≤ l(i). Let us formulate this as a lemma:

Lemma 44. With the above notation, the inequality

k(i+1) ≤ l(i) (5.12)

holds for all i ≥ 0.

Proof. We recall that we still follow [15] closely throughout this chapter. The first thing to observeis that if we for j < n have that Pj + γ < Pj+1 on some subinterval [a, b], then

Mj =

j∑i=1

Pi

either has constant slope on [a, b] or no slope until a point c ∈ (a, b) and thereafter slope 1. Assumethat k ∈ Ω(q). Then

Ek(q) = limt→q−

Ek(t) + 1. (5.13)

50

Page 53: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

This means that whenever k 6= 1 we have

Ek(t) = bP k − 2(k − 1)γc+ 1 (5.14)

provided that

Ek−1(q) + 1 < Ek(q). (5.15)

This implies that

limt→q−

P′k(t) = 1 (5.16)

since Ek has discontinuity at q. From (5.13) it is immediate that

Ek−1(q) ≤ limt→q−

Ek(t)− 1

and Lemma 42 then gives that

P k−1(t)− 2(k − 2)γ < Ek−1(t) ⇐⇒ P k−1(t) < Ek−1(t) + 2(k − 2)γ ≤ limt→q−

Ek(t)− 1 + 2(k − 2)γ.

Under the above assumptions equation (5.14) gives that

P k(q) = limt→q−

Ek(t) + 2(k − 1)γ

and substitution then provides with

Pk(t) ≥ limt→q−

Ek(t) + 2(k − 1)γ − γ − 1 > Pk−1(q) + γ,

which in particular means that Mk−1 must be a concave up function for q − 1 ≤ t ≤ q. However,there is something stronger to claim here, namely that Mk−1 is constant on this interval. To seethis start by noticing that

limt→q−

M ′k−1(t) = lim

t→q−M ′

k(t)− limt→q−

P ′k(t).

We have limt→q−M′k(t) ≤ 1 by definition and that

limt→q−

P ′k(t) = 1

because of equation (5.16). Hence limt→q−M′k(t) = 0 and we conclude that Mk−1 is constant on

[q − 1, q].

51

Page 54: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Assume now, on the contrary, that there exists some i ≥ 0 such that l(i) < k(i+1). Let q ∈[ai + nγ, bi − nγ] and q′ ∈ [ai+1 + nγ, bi+1 − nγ] be the point of discontinuity of El(i) respectivelyfor Ek(i+1) . Then it follows that

Ek(i+1)−1(q′) ≤ Ek(i+1)(q′)− 1,

which means that Mk(i+1)−1 must be a constant on [q′− 1, q′] and we see that all of P1, ..., Pk(i+1)−1

are constant on [q′ − 1, ai+1] because q′ − nγ ≥ ai+1. A similar argument can be made in orderto show that El(i) is constant on [q, ai+1] and one can deduce that Ml(i) must be concave up on[q − 1, ai+1]. The fact that P is a (n, γ)-reduced system gives that E1, ..., El(i) are constant on[q − 1, ai+1 − nγ]. However, q ∈ [q − 1, ai+1 − nγ], which means that El(i) is discontinuous at q.This yields a contradiction and the lemma is proved.

We are now close to the end of the proof of Theorem 41. The idea from here on is to define someti such that there is a rigid n-system R : [t0,∞) → ∆n such that R(ti) = E(ai) and then showthat

||R(t)−P(t)||∞ ≤ 3n2, (5.17)

for all t ∈ [t0,∞). By defining

ti =n∑k=1

Ek(ai), ∀i ≥ 0

we get that R(ti) = E(ai). To see this claim, recall that E(0) = (1, 2, ..., n) and that therefore

t0 =n∑k=1

k =n(n+ 1)

2.

Let us partition Ω(i) into sets Ω(i)j

rij=1 where each Ω

(i)rk consists of consecutive integers k, ..., l so

that Ek(ai), ..., El(ai) are also consecutive integers. We also choose the indices j ∈ 1, ..., ri in

such a way that max Ω(i)k+1 < min Ω

(i)k for all 1 ≤ k < ri. The goal is to express the inequality

k(i+1) ≤ l(i) in terms of the partitions defined above. To see the connection the following definitionis made:

(Ωi)i≥0 := (Ω(0)1 , ...,Ω(0)

r0,Ω

(1)1 , ...,Ωr1(1), ...). (5.18)

Then we can consider the inequality min Ω(i+1)1 ≤ max Ω

(i)ri instead because

min Ω(i)1 = k(i)

and

l(i) = max Ω(i)ri

52

Page 55: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

for all i ≥ 0. Directly from construction

max Ωi ≥ min Ωi+1

for all i ≥ 0, which implies that max Ωi ≥ min Ωi−1 for i ≥ 1. We can make the last inequalityvalid for i = 0 as well by defining Ω−1 := n. Let us now define

a(i)k = (a

(i)k,1, ..., a

(i)k,n)

by

a(i)k,j =

Ej(ai), if j ∈ Ω

(i)r ∪ · · · ∪ Ω

(i)ri ,

Ej(ai+1), else.

Notice that a(i)k ∈ Zn and that

a(i)1 = E(ai). (5.19)

The last equation, (5.19), follows since E is constant on [bi, ai+1] and that for i ≥ 0 and j ∈ 1, ..., nwe have

Ej(ai+1) =

Ej(ai) + 1, j ∈ Ω(i),

Ej(ai) else,

which holds since Ej has only one point os discontinuity for j ∈ Ω(i) in the interval [ai, bi]. This

means that if we know a(i)k , we can obtain a

(i)k+1 by adding 1 to all components that have index in

Ω(i). We define a(i)ri+1 = ai+1

1 . Then

(a(i)1 , ..., a

(i)ki, ..., a(i)

n ) = (a(i+1)1 , ..., a

(i+1)li+1

, ..., a(i+1)n ) (5.20)

if we define

ki = min Ωi

and

li+1 = max Ωi

for all i ≥ 0. We have now almost constructed a canvas of mesh 1. Unfortunately we have1 ≤ ki ≤ li ≤ n. This can, however, be fixed if we use R(ti) = E(ai) and define (im)0≤m<s to be the

increasing sequence of integers with i = 0 or ki < li. Then (a(im))0≤m<s, (k(im)im

)0≤m<s, (l(im)im

)0≤m<swill be a canvas of mesh δ = 1 because the corresponding equality in (5.20) still holds and we then

53

Page 56: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

get 1 ≤ ki < li ≤ n. Since we have constructed a rigid n-system, i.e. R, we only need to showthat (5.17) holds for all t ∈ [t0,∞). To do this, fix such a t ∈ [ti, ti+1] for i ≥ 0. Then we obtain

|ai − t| ≤

∣∣∣∣∣ai −n∑k=1

Ek(ai)

∣∣∣∣∣+n∑k=1

(Ek(ai+1)− Ek(ai)) ≤ n(n+ 1) + n

and since P is continuous with components that have slopes 0 and 1 we get the estimate

||P(ai)−P(t)||∞ ≤ |ai − t| ≤ n(n+ 1) + n = n2 + 2n.

By the previous discussion,

||R(ti)−P(t)||∞ = ||E(ai)−P(ai)||∞ ≤ 1 + n

and

||R(t)−R(ti)||∞ ≤ ||R(ti+1)−R(ti)||∞ = ||E(ai+1)− E(ai)||∞ = 1.

This means that

||R(t)−P(t)||∞ = ||R(t)−R(ti) + R(ti)−P(ai) + P(ai)−P(t)||∞

≤ ||R(t)−R(ti)||∞ + ||R(ti)−P(t)||∞ + ||R(ti)−P(t)||∞

≤ n2 + 3n+ 2 ≤ 3n2.

This completes the proof of Theorem 41 in the case δ = 1, and by the argument at the beginningof the proof, we have thus proved Theorem 41 in the general setting.

Theorem 41 will work as a lemma in the proof of Theorem 49, which is the main result of thischapter. However, Theorem 41 is not all that is needed. The concept of the so-called projectivedistance will also be of importance.

Definition 45. Let x,y ∈ Rn be two nonzero vectors. The projective distance is defined by

dp(x,y) =||x ∧ y||||x|| · ||y||

.

The geometric intuition of the projective distance comes from linear algebra and the theory re-garding alternating k-forms, since ||x∧y|| represents the area of the parallelogram spanned by thevectors x and y. This means that if the smallest angle between x and y is θ, then dp(x,y) = sin θ.A slight generalization of the definition above is when we talk about the distance from a vector toa subspace of Rn. Then we set

54

Page 57: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

dp(x,V) = infdp(x,y) : y ∈ Sn−1 ∩ V,

whenever V is a nonzero subspace of Rn and x ∈ Rn is a nonzero vector. If we write Rn = V⊕V⊥we have that ∧2

Rn =∧2

V⊕ (V⊥ ∧ V)⊕∧2

V⊥,

which means that one can split x into x = y + z with y ∈ V and z = x||V⊥ ∈ V⊥. Hence we get

dp(x,y) =||(y ∧ y) + (z ∧ y)||

||x|| · ||y||≥ ||z||||x||

=||x ||V⊥||||x||

(5.21)

for any nonzero y ∈ V.

Now the focus will lie on some more technical parts in the form of some lemmas. The followinglemma together with its proof can, again, be found in [15]:

Lemma 46. Let 1 ≤ h ≤ l ≤ n and 1 ≤ k < l ≤ n be integers and assume that x1, ...,xn is abasis of Zn. Let also A ≥ 2l(||x1|| + · · · + ||xl||). Then there exists a basis (y1, ...,yn) of Zn suchthat

(i) (y1, ..., yl, ...,yn) = (x1, ..., xh, ...,xn),

(ii) yl ∈ xh + 〈x1, ..., xh, ...,xl〉Z,

(iii) A ≤ ||yl|| ≤ 2A,

(iv) dp(yl, 〈y1, ..., yk, ...,yl−1〉R) ≥ 1− 12l−1 .

Proof. Let y1, ..., yl, ...,yn be the unique vectors satisfying

(y1, ..., yl, ...,yn) = (x1, ..., xh, ...,xn).

Then for any choice of yl as in (ii), y1, ...,yn is a basis of Zn. Hence our task is to prove thatthere exists a choice of yl as in (ii) such that also (iii) and (iv) are fulfilled. One way to do thisis to construct a chain of vector spaces W ⊂ V ⊂ U where all the subspaces have codimension onerelative to each other. We will define these spaces via

U := 〈x1, ...,xn〉R,

V := 〈y1, ...,yn〉R

and

W := 〈y1, ..., yl, ...,yn〉R.

55

Page 58: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

It is then clear that W ⊂ V ⊂ U. Observe that dimU∩W⊥ = 2 and let us construct an orthonormalbasis to this space by choosing unit vectors u ∈ U ∩ V⊥ and v ∈ V ∩W⊥. Since xh||V⊥ = λu forsome λ 6= 0 we get that

λu− xh + αv ∈ V

for α ∈ R. It will be convenient to choose α = (3/2)A and this will be seen later on. We can nowwrite the vector λu− xh + αv as a linear combination of the basis vectors in V and obtain:

λu− xh + αv =l−1∑j=1

cjyj

where c1, ..., cl−1 ∈ R. If we define

yl := xh +n∑j=1

dcjeyj ∈ xh + 〈x1, ..., xh, ...,xl〉Z

we get that

|||yl|| − α| ≤l∑

j=1

||xj|| ≤α

2l

where we in the last inequality used the assumption A ≥ 2l(||x1||+ · · ·+ ||xn||) and thatα = (3/2)A > A. This implies that

2(1− 2−l)A

3≤ ||yl|| ≤

2(1 + 2−1)A

3

l≥2⇐⇒ A ≤ ||yl|| ≤ 2A.

We need now to show (iv). However, it is known that |yl · v| ≤ ||yl||W⊥|| and since

|yl · v| =∣∣∣∣2A3 + (dcke − ck)yk · v

∣∣∣∣ ≥ 2A

3− ||yk|| ≥

2(1− 2−l)A

3

=⇒ dp(yl, 〈y1, ..., yk, ...,yl−1〉R) =||yl||W⊥||||yl||

≥ 1− 2−l

1 + 2−1≥ 1− 1

2l−1.

This proves the lemma.

The following Corollary can be found as a part of Lemma 5.2 in [15, p. 765].

Corollary 46.1. Let A1, ..., An be real numbers satisfying A1 ≥ 8 and Aj ≥ 2j+3Aj−1 for j =2, .., n. Then there is a basis (y1, ...,yn) of Zn such that

(i) Aj ≤ ||yj|| ≤ 2Aj for all j = 1, ..., n,

(ii) dp(yn, 〈y1, ...,yn−1〉R) ≥ 1− 12n−1 for k < n.

56

Page 59: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Proof. We follow the proof of Lemma 5.2. in [15, p. 765]. We use Lemma 46 in order to generate asequence of integer points in Zn with the desired properties and then use this sequence to constructa new basis of Zn satisfying the list in the Corollary. We do this by considering the standard basisof Zn. If we define this sequence x1, ...,xn ∈ Zn by first letting x1 = e1 and then requiring xj forj ≥ 2 to satisfy Lemma 46. This means that the vectors x1, ...,xj−1, ej, ..., en must be a basis forZn for each j ≥ 2 and also

(i) xl ∈ ej + 〈x1, ..., x1, ...,xj−1〉Z,

(ii) Aj−1 ≤ ||xl|| ≤ 2Aj−1,

(iii) dp(xj, 〈x1, , ...,xj−1〉R) ≥ 1− 12l−1 .

So, we are creating sequences of basis vectors of Zn starting from the initial vector e1. We canthen apply Lemma 46 to the last basis fulfilling the constraints above, namely

x1, ...,xl−1, el, ..., en.

By assumption we obtain

2l

(l−1∑k=1

||xk||+ ||el||

)≤ 2l

(2l−1∑j=2

Aj−1 + 2

)= 2l+1

(1 +

l−1∑j=2

Aj−1

)

≤ 2l+1 (1 + (l − 2)Al−1) ≤ Al−1.

Hence by Lemma 46 we get a new vector xl to add to our list. In particular this is true for l = nand we are done with the corollary.

Whether or not a sequence of vectors (x1, ...,xm) satisfies (iv) of Lemma 46 will be of such animportance that a definition is in order.

Definition 47. A nonempty sequence (x1, ...,xm) consisting of vectors in Rn is called almostorthogonal if the vectors x1, ...,xm are linearly independent and if they for all 2 ≤ l ≤ m satisfy

dp(xl, 〈x1, ...,xl−1〉R) ≥ 1− 1

2l−1. (5.22)

We will now put some more constraints on the constants A1, ..., An from the previous discussionand derive a lemma that will provide an important inequality later on. Let us fix a choice ofs ∈ N∗ ∪ ∞. We will from this point onwards assume that for each 0 ≤ i < s there is given a

fixed point (A(i)1 , ..., A

(i)n ) ∈ Rn and fixed integers ki and li, such that:

(i) 1 ≤ k0 ≤ l0 = n and 1 ≤ ki < li ≤ n whenever i ≥ 1;

(ii) A(i)1 ≥ 2n+3e4 and A

(i)j ≥ A

(i)j−12n+3e4 for all j = 2, ..., n;

57

Page 60: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

(iii) ki−1 ≤ li and A(i)li≥ A

(i−1)li

2n+3e4

(iv) (A(i)1 , ..., A

(i)l1, ..., A

(i)n ) = (A

(i−1)1 , ..., A

(i−1)ki−1

, ..., A(i−1)n ) for i ≥ 1.

We recall that we are following the construction done by Roy in [15, p. 766]. The following resultis a combination of Proposition 5.3 and Proposition 5.4 in [15, p. 766, p. 769]:

Lemma 48. Define

Qi =n∏j=1

A(i)j

for all 0 ≤ i < s and Qs = ∞ whenever s 6= ∞. Then there exists a basis (x(i)1 , ...,x

(i)n ) of Zn for

each 0 ≤ i < s and a unit vector u ∈ Rn such that for each i with 0 ≤ i < s and each Q ∈ [Qi, Qi+1)we have

(i) A(i)j ≤ λ(x

(i)j , Cu(Q)) ≤ 8e4A

(i)j for any j = 1, ..., n and j 6= ki;

(ii)QA

(i)ki

2nQi≤ λ(x

(i)ki, Cu(Q)) ≤

8QA(i)ki

Qi.

Proof. First of all we want to establish that

A(i)j ≤ ||x

(i)j || ≤ 2A

(i)j (5.23)

for all j = 1, ..., n. The above inequality can be seen as a slight generalization of (iii) in Lemma46. In order to prove (5.23) we use mathematical induction and Lemma 46. For the basis of theinduction we consider the case when i = 0. This case follows directly from part (ii) of Corollary46.1. As the induction hypothesis we assume the inequality holds for i = 1, ..., p − 1, i.e. weassume that there is a basis (x

(i)1 , ...,x

(i)n ) with the wanted properties for all such i:s. If we let

k = kp, h = kp−1 and l = lt in the assumptions of Lemma 46 we get from (ii) of the construction

of (A(i)1 , ..., A

(i)n ) that

2lplp∑j=1

||x(p−1)j || ≤ 2lp+1

lp∑j=1

A(p−1)j ≤ A

(p)lp

which means that we may apply Lemma 46 and conclude that there exists a basis (x(p)1 , ...,x

(p)n ) of

Zn such that

A(p)lp≤ ||x(p)

lp|| ≤ 2A

(p)lp. (5.24)

By the principle of mathematical induction we have proved that (5.23) holds. Assume now thats =∞. Define

Ω = 〈x1, ..., xk, ...,xm〉R ∩ 〈x1, ..., xl, ...,xm〉R

58

Page 61: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and let U be a Q-vector space with dimU = m. Choose unit vectors:

u1 ∈ U ∩ 〈x1, ..., xk, ...,xm〉R,

u2 ∈ U ∩ 〈x1, ..., xl, ...,xm〉R,

v1 ∈ Ω⊥ ∩ 〈x1, ..., xk, ...,xm〉R,

v2 ∈ Ω⊥ ∩ 〈x1, ..., xl, ...,xm〉R.

Since u1,v1 and u2,v2 are orthonormal bases of U∩Ω⊥ we see that v1 ∧ v2 = ±u1 ∧ u2. Letnow x ∈ 〈x1, ..., xk, ...,xm〉R. Then we can write x = y+αx for y ∈ Ω and x ∈ 〈x1, ..., xk, ...,xm〉R.This gives

dp(〈x1, ..., xk, ...,xm〉R, 〈x1, ..., xl, ...,xm〉R) = supdp(x, 〈x1, ..., xl, ...,xm〉R) : x ∈ 〈x1, ..., xk, ...,xm〉R\0

= sup(inf(dp(x, y)) : y ∈ Sn−1 ∩ 〈x1, ..., xl, ...,xm〉R,x ∈ Sn ∩ 〈x1, ..., xk, ...,xm〉R

)

= sup

(||(y + αx)||〈x1,...,xk,...,xm〉⊥R ||

||y + αx||: y + αx ∈ Sn ∩ 〈x1, ..., xk, ...,xm〉R

)= dp(v1,v2) = ||v1∧v2||

(5.25)

because the supremum value is attained if and only if y = 0. Let us now define

ω = x1 ∧ · · · ∧ xk ∧ · · · ∧ xl ∧ · · · ∧ xm ∈ Ω

and immediately observe that

||ω ∧ xk|| =

/ xk = y1 + α1y2

y1 ∈ Ω, y2 ∈ 〈x1, ..., xk, ...,xm〉Rα1 ∈ R

/= ||ω ∧ (y1 + α1y2)|| = |α1|||ω||.

A similar calculation shows that

||ω ∧ xl|| = |α2|||ω||

if we write xl = y3 + α2y4 with y3 ∈ Ω, y4 ∈ 〈x1, ..., xl, ...,xm〉R and α2 ∈ R. We also notice that

||ω ∧ yk ∧ yl|| = ||ω ∧ (y1 + α1y2) ∧ (y3 + α2y4)|| = |α1 · α2| · ||ω|| · ||y2 ∧ y4||

= |α1 · α2| · ||ω|| · dp(〈x1, ..., xk, ...,xm〉R, 〈x1, ..., xl, ...,xm〉R).

59

Page 62: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

We let H(U) stand for the co-volume the lattice Zn ∩ U when U is a vector space over Q ofdimension m. With this notation we get

H(Ω) = ||ω|| ≤ 1

||yk|| · ||yl||

m∏j=1

||yj||

and since

||x1 ∧ · · · ∧ xk|| ≥k∏j=1

k∏i=2

||xj||(1− 21−i) ≥k∏j=1

∞∏i=2

||xj||(1− 21−i) = e−2

k∏j=1

||xj||

for all 1 ≤ k ≤ m we get

H(〈x1, ..., xk, ...,xm〉R) ≥ 1

e2||xl||

m∏j=1

||xj|| (5.26)

and

H(〈x1, ..., xl, ...,xm〉R) ≥ 1

e2||xk||

m∏j=1

||xj|| (5.27)

because of (ii) in the assumptions. We can then conclude that

dp(〈x1, ..., xk, ...,xm〉R, 〈x1, ..., xl, ...,xm〉R) ≤ e4||x1 ∧ · · · ∧ xm||||x1|| · · · ||xm||

. (5.28)

Observe that the above inequality is only valid since

dp(xj, 〈x2, ...,xj−1〉R) ≥ 1− 1

2j−1(5.29)

for all j ≥ 3. The next step is to show that if ui is a unit vector that is orthogonal to 〈x(i)1 , ...x

(i)ki, ...,x

(i)n 〉R

then for all i and j such that 0 ≤ i < j < s we have

dp(ui,uj) ≤2e4

||x(i+1)1 || · · · ||x(i+1)

n ||. (5.30)

By induction one can prove that

dp(〈x(i)1 , ...x

(i)li, ...,x(i)

m 〉R, 〈x(i)1 , ...x

(i)ki, ...,x(i)

m 〉R) ≤ e4||x(i)1 ∧ · · · ∧ x

(i)m ||

||x(i)1 || · · · ||x

(i)m ||

,

holds for all i such that (5.29) holds. Since

〈x(i)1 , ...x

(i)ki, ...,x(i)

n 〉R ⊕ 〈x(i)1 , ...x

(i)ki, ...,x(i)

n 〉R ∼= Rn

and since the co-volume of the integer lattice in Rn is 1 we get that

60

Page 63: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

dp(〈x(i)1 , ...x

(i)ki, ...,x(i)

n 〉R, 〈x(i)1 , ...x

(i)li, ...,x(i)

n 〉R) ≤ e4

||x(i)1 || · · · ||x

(i)n ||

.

for i ≥ 1 and because of (5.25) we get

dp(ui−1,ui) ≤e4

||x(i)1 || · · · ||x

(i)n ||

. (5.31)

Now observe that i − 1 and i are consecutive indices. For arbitrary integers 0 ≤ r < t < s suchthat (5.29) holds we get

dp(ur,ut) ≤t∑

i=r+1

dp(ui−1, ui) ≤t∑

i=r+1

e4

||x(i)1 || · · · ||x

(i)n ||≤ 2e4

||x(i+1)1 || · · · ||x(i+1)

n ||.

Hence we have derived (5.30). In particular we see that there exists a sequence (ui)i≥0 ∈ Pn−1(R)of unit vectors converging to a unit vector u such that

dp(ui,u) ≤ 2e4

||x(i+1)1 || · · · ||x(i+1)

n ||

for 0 ≤ i < s. By choosing ui so that ui · u ≥ 0 it follows that

||ui − u|| ≤ 2dp(ui,u) ≤ 4e4

||x(i+1)1 || · · · ||x(i+1)

n ||≤ 4e4

Qi+1

.

This immediately gives that

|x(i)j · (u− ui)| ≤

8e4A(i)J

Qi+1

since |x(i)j · (u − ui)| ≤ ||x(i)

j || · ||u − ui|| and (5.24). If we use that Q < Qi+1 and assume thatj 6= ki we get

|x(i)j · (u− ui)| = |x(i)

j · u|Q ≤ 8e4A(i)j . (5.32)

Notice that the above implies ||x(i)j || ≤ 8e4A

(i)j and so we can conclude that

A(i)j ≤ max||x(i)

j ||, |x(i)j · u| ·Q ≤ 8e4A

(i)j

⇐⇒ A(i)j ≤ λ(x

(i)j , Cu(Q)) ≤ 8e4A

(i)j

and we have proved the first claim. To prove part (ii) we use that

||x(i)ki||

||x(i)1 || · · · ||x

(i)n ||≤ 1

||x1 ∧ · · · ∧ x(i)ki∧ · · · ∧ x

(i)n ||

.

61

Page 64: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

since we in general have

||x1 ∧ · · · ∧ xn|| = ||x1 ∧ · · · ∧ xn−1|| · |xn · u| = 1

⇐⇒ |xn · u| =1

||x1 ∧ · · · ∧ xn−1||.

If we now use (5.26) adapted to our situation we get

1

||x1 ∧ · · · ∧ xn−1||≤

e2||x(i)ki||

||x(i)1 || · · · ||x

(i)n ||

.

Now when we have established the above main inequalities we translate them into the language ofthe lemma to get:

A(i)ki

Qi

≤ 1

||x1 ∧ · · · ∧ xn−1||≤e2A

(i)ki

Qi

⇐⇒A

(i)ki

Qi

≤ |x(i)ki· u| ≤

e2A(i)ki

Qi

. (5.33)

Now, part (iii) of the definition of the point (A(i)1 , ..., A

(i)n ) ∈ Rn together with (5.33) yield

A(i)ki

2nQi

≤ |x(i)ki· u| ≤

8A(i)ki

Qi

because

Qi+1

Qi

≥ 2n+3e4.

This completes the proof of the lemma.

The above properties will be used to prove Roy’s contribution:

Theorem 49. If q0 ≥ 0 and P : [q0,∞)→ Rn is an (n, 0)-system, then there exists a unit vectoru of Rn such that

||P(q)− Lu(q)||∞ ≤ 3n2(n+ 9)

for all q ≥ q0.

What the above theorem says is that we can approximate a rigid n system P with the help ofthe functions Lu constructed before. This is a very important tool because it allows us to findexponents of Diophantine approximation.

62

Page 65: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Proof. Define

P(q) = Φn(0, ..., 0, P1(q0), ..., Pi−1(q0), q − si−1)

for si−1 ≤ q ≤ si and all 1 ≤ i ≤ n where

si =

0, i = 0∑i

k=1 Pk(q0), i ∈ 1, ..., n.

Then P is an (n, 0)-system and in particular it is reduced. Hence, by Theorem 41. we know thatthere for any given value δ > 0 exists a rigid n-system R : [n(n + 1)/2,∞) → Rn with mesh δ sothat

||P(q)−R(q)||∞ ≤ 3n2δ (5.34)

for all q ≥ n(n+ 1)/2. However, this will not immediately give us the wanted inequality. We willnow apply Lemma 48 with

A(i)j = exp(a

(i)j )

for 1 ≤ j ≤ n in order to derive a powerful inequality. Let (x(i)1 , ...,x

(i)n ) be a basis of Zn satisfying

Lemma 46 and Corollary 46.1. Moreover, let u be a unit vector with the properties from Lemma48. First we note that

logQi = log

(n∏j=1

exp(a(i)j )

)=

n∑j=1

a(i)j = qi

for 0 ≤ i < s and we put qs = logQs = ∞ if s 6= ∞. Let q ∈ [qi, qi+1) and notice that

eq ∈ [Qi, Qi+1). Since x(i)j is a basis vector of Zn for all j = 1, ..., n we have that there exists a

permutation σ ∈ Sn such that

λj(C(eq)) ≤ λ(x(i)σ(j), Cu(eq)).

If we now use Lemma 48 and that Lu,j(q) = log λj(C(eq)) we get

Lu,j(q) ≤ log(8e4A(i)j ) = log(8e4) + a

(i)j

when j 6= ki and

Lu,j(q) ≤ log(8A

(i)kieq

Qi

) = log 8 + a(i)ki

+ eq − log

(n∏j=1

exp(a(i)j )

)= log 8 + a

(i)ki

+ eq − qi

≤ log(8e4) + a(i)ki

+ eq − qi

63

Page 66: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

if j = ki. If we combine them we get

Lu,j(q) ≤ Pj(q) + log(8e4).

for all j = 1, ..., n. By (3.13) we then get

q − n log(n) ≤n∑j=1

Lu,j(q).

We also see that

0 ≤ Pj(q) + log(8e4)− Lu,j(q) ≤ n(log n+ log(8e4)) (5.35)

since

n∑j=1

(Pj(q) + log(8e4)) = q + n · log(8e4).

Also, since (5.35) holds true for all 1 ≤ j ≤ n we can conclude that

||Lu(q)−P(q)||∞ ≤ n log(8e4n). (5.36)

Now we use this inequality applied to the rigid n-system R. Then we know there exists a unitvector u such that

||Lu(q)−R(q)||∞ ≤ n log(8e4n).

Let us choose δ = n+ 7 Then it follows, after combining the above inequality with (5.34), that

||P(q)− Lu(q)||∞ = || −P(q) + R(q) + Lu(q)−R(q)||∞ ≤ ||Lu(q)−R(q)||∞ + ||P(q)−R(q)||∞

≤ n log(8e4n) + 3n2δ ≤ 3n2(n+ 9)

for q ≥ n(n + 1)δ/2 since it is for those q vales we can define R. The case when q ≤ n(n + 1)δ/2yields the same result because we then have

Pj(q) ≤n∑k=1

Pk(q) = q ≤ n(n+ 1)δ

2(5.37)

and since (3.13) holds we get∣∣∣∣∣n∑i=1

Lu,i(q)− q

∣∣∣∣∣ ≤ n log n =⇒n∑k=1

Lu,i(q) ≤ q + n log n ≤ n(n+ 1)δ

2+ n log n.

64

Page 67: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

This implies that

||P(q)− Lu(q)||∞ ≤ 3n2(n+ 9)

for 0 ≤ q ≤ n(n+ 1)δ/2 and we have proved the theorem!

Finally, we now obtain Roy’s main theorem [15, 15, Thm. 1.3]:

Theorem 50. Assume that n ≥ 2 is an integer and that δ ∈ (0,∞). For each unit vector u ofRn there exists a rigid system P : [q0,∞) → ∆n with mesh δ such that Lu −P is bounded on theinterval [q0,∞). Conversely, for each rigid system P : [q0,∞) → ∆n with mesh δ, there exists aunit vector u in Rn such that Lu −P is bounded on [q0,∞).

Proof. This is a direct consequence of Theorems 28, 41 and 49.

65

Page 68: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

66

Page 69: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Chapter 6

Exponents of DiophantineApproximation

In the study of Diophantine approximation the goal is to approximate an arbitrary real numberwith a rational number, or more generally, an arbitrary vector in Rn with a vector in Qn. Thefollowing theorem is rather standard and can be found in many elementary number theory books;we follow [13] for the statement and proof. The proof uses the Pigeonhole Principle, which weassume is known from earlier studies.

Theorem 51 (Dirichlet’s approximation theorem). Let α be a real number and n a positiveinteger. Then there are integers a and b with 1 ≤ a ≤ n such that

|aα− b| < 1/n.

Proof. Let α denote the fractional part of α, that is α := α− bαc. Then each of the numbersjα for j = 0, 1, ..., n belongs to exactly one of the disjoint intervals (k − 1)/n ≤ x < k/n,k = 1, 2, ..., n. Note that we have n+ 1 numbers and n intervals. By the Pigeonhole Principle wemust have the existence of some integers l and k with 0 ≤ k < l ≤ n so that

|kα − lα| < 1/n.

If we define a = l − k and b = blαc − bkαc we get

|aα− b| = |lα− kα− blαc+ bkαc| = |kα − lα| < 1/n

and we have proved the theorem.

A slight generalization of Theorem 51 states that for any real numbers ξ2, ..., ξn ∈ R and Q ≥ 1there exists a nonzero point y = (y1, ..., yn) ∈ Zn such that |y1| ≤ Q and

|ξiy1 − yi| ≤ Q−1/(n−1) (6.1)

67

Page 70: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

for i = 2, ..., n. This follows from Minkowski’s theorem applied to the lattice

Γ = (y1, ξ2y1 − y2, ..., ξny1 − yn) : (y1, ..., yn) ∈ Zn (6.2)

and the convex body

Cn(Q) = x = (x1, ..., xn) ∈ Rn : |x1| ≤ Q, |xi| ≤ Q−1/(n−1)

for i = 2, ..., n. The determinant of this full lattice will be

vol(Rn/Γ) = abs

∣∣∣∣∣∣∣∣∣1 0 · · · 0ξ1 −1 · · · 0...

...ξn 0 · · · −1

∣∣∣∣∣∣∣∣∣

= |(−1)n−1| = 1.

The volume of Cn(Q) is given by 2n · Q · Q−(n−1)/(n−1) = 2n and hence we get that V (Cn(Q)) ≥2n det(Γ). By the Minkowski theorem we have that Cn(Q) contains a nonzero lattice point of Γ.From this the existence claimed around (6.1) follows. We have hence proved that the exponent−1/(n− 1) is a valid one in (6.1). Are there others? Given ξ1, ..., ξn−1 ∈ R, for what values η > 0does there exist arbitrarily large integers Q such that there exists (x, y1, ..., yn−1) ∈ Zn \ 0 suchthat |x| ≤ Q and |ξix− yi| ≤ Q−η for i = 1, ..., n− 1? These types of questions form the basis ofthe theory of Diophantine exponents. Some classical exponents were studied by Khintchine [8, 9]defined as follows:

For ξ = (ξ1, ..., ξn−1) ∈ Rn−1 it is standard to define ω = ω(ξ) to be the supremum of real numbersη such that there exists arbitrary large integers Q for which the system of inequalities

|x| ≤ Q (6.3)

and

|ξix− yi| ≤ Q−η, (i = 1, ..., n− 1) (6.4)

has a nontrivial solution (x, y2, ..., yn) ∈ Zn. We will also consider an exponent denoted ω∗ = ω∗(ξ),which is defined as the supremum of η ∈ R such that there exist arbitrarily large integers Q forwhich the system of inequalities ∣∣∣∣∣x−

n−1∑k=1

ξkyk

∣∣∣∣∣ ≤ Q−η (6.5)

and

|yi| ≤ Q, (i = 1, ..., n− 1) (6.6)

68

Page 71: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

admits a nonzero integer solution (x, y1, ..., yn−1) ∈ Zn. Finally, we let ω = w(ξ) and ω∗(ξ) bethe corresponding uniform exponents, i.e. ω is the supremum of real numbers η such that forarbitrarily large integers Q the system (6.3), (6.4) admits a nontrivial integer solution, and ω∗ isthe supremum of real numbers η such that for all sufficiently large integers Q the system (6.5),(6.6) admits a nontrivial integer solution.

6.1 Khinchin’s Transference Principle

There are different ways to present Khinchin’s transference principle but we will first present it inthe same way as it is done in [16, p. 80] and in [21, Satz 3]. However, this result goes back to [8]and says, with the above notation, that

ω∗

(n− 2)ω∗ + n− 1≤ ω (6.7)

and

(n− 1)ω + n− 2 ≤ ω∗ (6.8)

for n > 2. The inequalities that Khinchin’s transference principle provides us with leads to thefollowing conclusion made by Schmidt and Summerer in [16]:

Theorem 52. Consider the lattice from (6.2) and use the convex body

C(Q) = [−Q,Q]× [−Q−1/(n−1), Q−1/(n−1)]n−1

to define the corresponding functions L1, ..., Ln. Then Khinchin’s transference principle is equiva-lent to

lim infq→∞

L1(q)

q+ (n− 1) lim sup

q→∞

Ln(q)

q≥ 0

and

lim supq→∞

Ln(q)

q+ (n− 1) lim inf

q→∞

L1(q)

q≤ 0.

The proof of Theorem 52 will require a lemma, presented in [16, Theorem 1.4]:

69

Page 72: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Lemma 53. Let

E1 = (ω + 1)

(1 + lim inf

q→∞

L1(q)

q

), (6.9)

E2 = (ω + 1)

(1 + lim sup

q→∞

L1(q)

q

), (6.10)

E3 = (ω∗ + 1)

(1

n− 1− lim sup

q→∞

Ln(q)

q

), (6.11)

E4 = (ω∗ + 1)

(1

n− 1− lim inf

q→∞

Ln(q)

q

). (6.12)

Then all the expression takes the same value:

Ei =n

n− 1(i = 1, .., 4).

Proof. We follow section 4 in [16] for the proof of this lemma. Let

Cγ(Q) := x = (x, y1, ..., yn−1)e ∈ Rn : |x| ≤ Q1+γ,

|ξix− yi| ≤ Q−1/(n−1)+γ(i = 1, ..., n− 1).(6.13)

Introducing the auxiliary variable

η :=(n− 1)−1 − γ

1 + γ= −1 +

n

(n− 1)(1 + γ)

makes it possible for us to rewrite (6.13) as

Cγ(Q) := x = (x, y1, ..., yn−1)e ∈ Rn : |x| ≤ Q1+γ,

|ξix− yi| ≤(Q1+γ

)−η(i = 1, ..., n− 1). (6.14)

Hence, letting X = Q1+γ and comparing with the definition of the Diophantine exponent ω,we conclude that γ > lim infq→∞

L1(q)q

implies that ω ≥ η. Then by the definition of L1(q) we

have that for any number γ > lim infq→∞L1(q)q

, there exist arbitrarily large values of Q for which

λ1(C(Q)) < Qγ, i.e. C(Q) ∩ Zn 6= 0. Letting γ → lim infq→∞L1(q)q

we now conclude:

ω ≥ −1 +n

(n− 1)(1 + lim infq→∞L1(q)q

). (6.15)

70

Page 73: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Equivalently,

n

n− 1≤ (1 + ω)

(1 + lim inf

q→∞

L1(q)

q

).

Since ω is defined to be the supremum of the numbers η such that there exist arbitrarily largevalues of X > 0 for which the system of inequalities |x| < X and |ξix− yi| ≤ X−η (i = 1, ..., n− 1)has a non-zero solution (x, y1, ..., yn−1) ∈ Zn, we see that if η < ω then there exist arbitrarily largevalues of Q for which Cγ(Q) contains a nonzero lattice point of Zn. By letting

X = Q1+η

and

γ =n

(n− 1)(η + 1)− 1

we get that

lim infq→∞

L1(q)

q≤ γ.

This means that we get the following inequality:

lim infq→∞

L1(q)

q≤ n

(n− 1)(ω + 1)− 1 ⇐⇒ (1 + ω)

(1 + lim inf

q→∞

L1(q)

q

)≤ n

n− 1,

which means that it is possible to conclude that

(ω + 1)

(1 + lim inf

q→∞

L1(q)

q

)=

n

n− 1.

For E2 we notice that if δ > lim supq→∞L1(q)q

then the corresponding set Cδ(Q) will contain onenonzero lattice point of Zn for arbitrarily large values of Q and we obtain in a similar way as beforethat

ω ≥ n

(n− 1)(lim supq→∞L1(q)q

+ 1)

and whenever η < ω the system |x| ≤ Q1+δ

|ξix− yi| ≤ Q−1/(n−1)δ

for i = 1, ..., n− 1 must have a solution in Zn for arbitrarily large values of X. This means that

lim supq→∞

L1(q)

q≤ δ

71

Page 74: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

and we get the inequality:

(1 + ω + 1)

(1 + lim sup

q→∞

L1(q)

q

)≤ n

n− 1,

which means that we can conclude that E2 = n/(n− 1).

Next, let νi(Q) denote the successive minima for C(Q) with respect to Cn(Q) and the dual latticeΓ∗ for 1 ≤ i ≤ n. Define now

πi(Q) :=log νi(Q)

logQ. (6.16)

Note that if C is a convex body and C∗ denotes the dual body, then we have that C∗ ⊆ C ⊆ nC∗and

νi(Q) ≤ λ∗i (Q) ≤ n · νi(Q),

which by monotonicity gives that

0 ≤ log λ∗i (Q)− log νi(Q) ≤ log n.

This in turns means that∣∣∣∣πi(Q)− log λ∗i (Q)

logQ

∣∣∣∣ ≤ ∣∣∣∣ log νi(Q)

logQ− log λ∗i (Q)

logQ

∣∣∣∣ ≤ log n

logQ≤ n

logQ.

Hence

lim infQ→∞

πi(Q) = lim infQ→∞

log λ∗i (Q)

logQ

and

lim supQ→∞

πi(Q) = lim supQ→∞

log λ∗i (Q)

logQ.

We now use the following inequality:

λi(Q)λn+1−i(Q) 1, (6.17)

which was proved by Mahler in 1939. The proof can be found in e.g. [4, Th. VI, p. 219]. Thisgives us ∣∣∣∣lim sup

Q→∞

λ∗i (Q)

logQ+ lim sup

Q→∞

λ∗n+1−i(Q)

logQ

∣∣∣∣ < A

logQ

72

Page 75: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

where A is some constant. Using the formulation found in [4] we have A = n! but other sharperestimates can be used as well. Following Banaszczyk in [22] we see that we can take A = n.However, what we want to conclude is that

lim supQ→∞

λ∗i (Q)

logQ= − lim inf

Q→∞

λn+1−i(Q)

logQ= − lim inf

Q→∞π1(Q)

and

lim infQ→∞

λ∗i (Q)

logQ= − lim sup

Q→∞

λn+1−i(Q)

logQ= − lim sup

Q→∞π1(Q).

Hence we see that

E3 = (ω∗ + 1)

(1

n− 1+ lim inf

Q→∞π1(Q)

).

Let γ > ω∗ and consider the following convex body:

Cγ(Q) =

x = (x, y1, ..., yn−1) ∈ Rn : |x| < Q1+γ,

∣∣∣∣∣x−n−1∑j=1

ξjyj

∣∣∣∣∣ ≤ Q−1/(n−1)+γ

.

By the definition of ω∗ we have that Cγ(Q) contains a nonzero integer point for some arbitrarilylarge values of Q. If we use the same reasoning as before with the same η we get

−1 +n

(n− 1)(1 + lim infQ→∞ π1(Q))≤ ω∗ ⇐⇒ (ω∗ + 1)

(1

n− 1+ lim inf

Q→∞π1(Q)

)≥ n

n− 1.

Whenever η < ω∗ we get the reversed inequality and combining them yields

(ω∗ + 1)

(1

n− 1+ lim inf

Q→∞π1(Q)

)=

n

n− 1.

Hence we have proved that E3 = n/(n − 1). The proof for E4 is analogous with the proof forE3.

We will now prove Theorem 52 using the above lemma.

Proof. If we use Kinchin’s transference principle, (6.7), we get

ω∗

(n− 2)ω∗ + n− 1≤ ω ⇐⇒ ω∗

(n− 2)ω∗ + n− 1+1 ≤ ω+1 ⇐⇒ ω∗ + (n− 2)ω∗ + n− 1

(n− 2)ω∗ + n− 1≤ ω+1

⇐⇒ n− 1

1/(ω∗ + 1) + n− 2≤ ω + 1.

73

Page 76: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Let

L1 := lim infq→∞

L1(q)

qand Ln := lim sup

q→∞

Ln(q)

q.

By lemma 53 we get

n− 1(1/(n− 1)− Ln

) (n−1n

)+ n− 2

≤ n/(n− 1)

1 + L1

⇐⇒ n− 1

n− Ln≤ n

(n− 1) (1 + L1)

⇐⇒ (n− 1) (1 + L1) ≤ n− 1− Ln

⇐⇒ Ln + (n− 1)L1 ≤ 0.

If we now use (6.8) instead:

ω∗ + 1 ≥ (n− 1)ω + n− 1 ⇐⇒ ω∗ + 1

n− 1≥ n

(n− 1) (1 + L1)

⇐⇒ n

(n− 1)2( 1n−1− Ln)

≥ n

(n− 1) (1 + L1)

⇐⇒ (n− 1)Ln + L1 ≥ 0,

which finishes the proof.

6.2 A Consequence of Roy’s Contribution

For any nonzero unit vector u ∈ Rn Roy defines various Diophantine exponents τ(u), τ(u), λ(u), λ(u);these are defined in a “projective” way and the connection with the more standard definitions isas follows: All of τ, τ , λ and λ are invariant under the map u 7→ cu(c ∈ R \ 0), and for un = 1,i.e. u = (u1, ..., un−1, 1) (and u1, ..., un−1, 1 Q-linearly independent) we have

τ(u) = ω∗(u1, ..., un−1); τ(u) = ω∗(u1, ..., un−1);

λ(u) = ω(u1, ..., un−1); λ(u) = ω(u1, ..., un−1),

where “ω∗, ω∗, ω, ω” is the notation of Schmidt and Summerer, [16].

Theorem 50 brings a very important corollary regarding exponents of Diophantine approximation.This is due to Roy, [15], and establishes the following bijection:

74

Page 77: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Theorem 54. There exists a bijection between the set of quadruples (τ(u), τ(u), λ(u), λ(u)) whereu runs through all unit vectors of Rn with Q-linearly independent coordinates, and the set ofquadruples (

lim infq→∞

P1(q)

q, lim sup

q→∞

P1(q)

q, lim infq→∞

Pn(q)

q, lim sup

q→∞

Pn(q)

q

)(6.18)

where P = (P1, ..., Pn) goes through all rigid n-systems of mesh δ > 0 for which P1 is unbounded.The bijection is given by θ : R4 → R4 defined as

θ(τ, τ , λ, λ) =

(1

1 + τ,

1

1 + τ,

λ

1 + λ,

λ

1 + λ

). (6.19)

Proof. We will se that this, in more precise terms, is a consequence of Theorem 50, combined withLemma 53 and a thorough translation of Schmidt and Summerer’s notation and Roy’s. Let usstart by connecting the definition of τ(u), τ(u), λ(u) and λ(u) with the more standard definitions:Note that the definitions verbatim make sense for any vector u ∈ Rn \ 0 and furthermore, forany u ∈ Rn and c ∈ R \ 0 we have τ(cu) = τ(u), and similarly for the exponents τ , λ and λ;thus τ, τ , λ and λ are really functions on (Rn \ 0)/R× ' Pn−1(R). Considering now the specialcase un = 1, and writing m = n − 1 together with u = (u1, ..., um, 1) ∈ Rm+1, we see that τ(u),respectively τ(u), is the supremum of all τ > 0 for which[

∃q ∈ Zm, a ∈ Z : ||(q, a)|| ≤ X and

∣∣∣∣∣m∑k=1

qkuk + a

∣∣∣∣∣ ≤ X−τ

]

holds for arbitrarily large values of X, and for all sufficiently large values of X respectively. Here(since τ > 0) a is forced to be the integer closest to −(q1u1 + · · · + qmum) for all large X; hencewe may just as well define τ(u) and τ(u) as the supremum of all τ > 0 such that[

∃q ∈ Zm, a ∈ Z : ||q|| ≤ X and 〈q1u+ · · · qmum〉 ≤ X−τ]

holds for arbitrarily large values of X, and for all sufficiently large values of X respectively.Notice that it does not matter here if || · || is the l2 or the l∞-norm; hence it follows thatτ(u) = ω∗(u1, ..., Un−1) and τ(u) = ω∗(u1, ..., un) in the notation of Schmidt and Summerer.

A remark is that Schmidt and Summerer only define ω∗(u1, ..., un−1) and ω∗(u1, ..., un−1) if 1, u1, ..., un−1

are linearly independent over Q, which is equivalent to u1, ..., un−1 being linearly independent overQ and this condition is invariant under u 7→ cu.

Next, consider λ(u) and λ(u) and note that

||x ∧ u|| =√∑

i<j

(xiuj − xjui)2

from which it follows that for all relevant choices of x ∈ Zn

75

Page 78: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

||x ∧ u|| u

n−1∑i=1

|xnui − xi| . (6.20)

Indeed, for 1 ≤ i < j < n we get

|xiuj − xjui| =∣∣∣∣xi(uj − xj

xn

)− xj

(ui −

xixn

)∣∣∣∣ ≤ ∣∣∣∣ xixn∣∣∣∣ |xnuj − xj|+ ∣∣∣∣xjxn

∣∣∣∣ |xnui − xi| .Hence we see that λ(u) = ω(u1, ..., un−1 and λ(u) = ω(u1, ..., un−1) in the notation of Schmidt andSummerer in [16].

Since u is assumed to be a unit vector the above relations become

τ(u) = ω∗(ξ); τ(u) = ω∗(ξ); λ(u) = ω(ξ); λ(u) = ω(ξ),

where

ξ = (ξ1, ..., ξn−1) =

(u1

un, ...,

un−1

un

)∈ Rn−1. (6.21)

From the discussion in the appendix to Chapter 4 it is possible to derive that

L[SS]j (Λ,K, q) = Lj((n− 1)q). (6.22)

Then it follows from (4.14) that

Lj((n− 1)q) = L[SS]j (Zn, RK(q)) +O(1) = q − L[Roy]

u,n+1−j(nq) +O(1). (6.23)

Remember that

Λ = (y, ξ1y − y1, ..., ξn−1y − yn−1) : (y, y1, ..., yn−1) ∈ Zn.

while u is proportional to A−1e1, where A is a matrix in GLn(R) with Λ = AZn. Let us choose

A =

1 0 0 · · · 0ξ1 −1 0 · · · 0...

. . ....

ξn−1 0 0 · · · −1

.

76

Page 79: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

It is then easy to verify that Λ = AZn and that

A

1ξ1...

ξn−1

= e1 ⇐⇒ A−1e1 =

1ξ1...

ξn−1

.

The conclusion from this is that the vectors u and ξ can be obtained, after a certain cyclicpermutation of u:s coordinates, from one another via (6.21). Now, let P = (P1, ..., Pn) be a rigidsystem corresponding to the unit vector u as in Roy’s main result, Theorem 50. Then for each j,the difference

Lu,n+1−j(q)− Pn+1−j(q)

is uniformly bounded for all large q. This implies that

Lj((n− 1)q) = q − Pn+1−j(nq) +O(1)

and so

lim supq→∞

Lj((n− 1)q)

q= 1−lim inf

q→∞

Pn+1−j(nq)

q⇐⇒ (n−1) lim inf

q→∞

Lj(q)

q= 1−n lim sup

q→∞

Pn+1−j(nq)

q.

Letting

Lj := lim infq→∞

Lj(q)

q;Lj := lim sup

q→∞

Lj(q)

q;P j := lim inf

q→∞

Pj(q)

q;P j := lim sup

q→∞

Pj(q)

q

then gives the relations

(n− 1)Lj = 1− nP n+1−j ⇐⇒ Lj =1

n− 1

(1− nP n+1−j

)(6.24)

and

(n− 1)Lj = 1− nP n+1−j ⇐⇒ Lj =1

n− 1

(1− nP n+1−j

)(6.25)

Applying this to Lemma 53, E1, it follows that

n

n+ 1= (ω+1)(1+L1) = (λ+1)

(1 +

1

n+ 1

(1− nP n

))⇐⇒ 1 = (λ+1)(1−P n) ⇐⇒ P n =

λ

λ+ 1.

77

Page 80: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

The formula, E2, in Lemma 53 gives

n

n− 1= (ω + 1)(1 + L1) = (λ+ 1)(1 +

1

n− 1(1− nP n)) ⇐⇒ P n =

λ

1 + λ.

The formulas E3 and E4 in Lemma 53 give

n

n− 1= (ω∗ + 1)

(1

n− 1− Ln

)= (τ + 1)

(1

n− 1− 1

n− 1(1− nP 1)

)⇐⇒ P 1 =

1

1 + τ

and

n

n− 1= (ω∗ + 1)

(1

n− 1− Ln

)= (τ + 1)

(1

n− 1− 1

n− 1

(1− nP 1

))⇐⇒ P 1 =

1

1 + τ.

78

Page 81: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

Bibliography

[1] R. Bott and L. W. Tu, Differential Forms in Algebraic Topology. Springer Verlag, New Yorkinc, 1982.

[2] H. Brezis, Functional Analysis, Sobolev Spaces and Partial Differential Equations. Springer,New York, 2011.

[3] Y. Bugeaud and M. Laurent, Diophantine approximation and parametric geometry of numbers.Monatsh. Math. 169, , 51–104, (2006).

[4] J.W.S. Cassels, An Introduction to the Geometry of Numbers. Springer-Verlag Berlin Heidelberg1997.

[5] Jesse Ira Deutsch, Geometry of Numbers Proof of Gotzky’s Four-Squares Theorem. Journal ofNumber Theory 96, 417–431, 2002.

[6] O. German, On Diophantine exponents and Khintchine’s Transference Principle.arXiv:1004.4933 [math.NT] 2010.

[7] J. Jost, Riemannian Geometry and Geometric Analysis. Springer Verlag Berlin Heidelberg,2011.

[8] A. Khintchine, Uber eine Klasse linearer Diophantischer Approximationen. PRend. Circ. Math.Palermo 50, 170–195 (1926).

[9] A. Khintchine, Zur metrischen Theorie der diophantischen Approximationen. MathematischeZeitschrift 24, 706-714 (1926).

[10] K. Mahler, On Compound Convex Bodies I. Proc. Lond. Math. Soc 5(3), 358-379 (1955).

[11] K. Mahler, On Compound Convex Bodies II. Proc. Lond. Math. Soc 5(3), 380-384 (1955).

[12] A. Marnat, About Jarnık’s-type Relation in Higher Dimension. Ann. Inst. Fourier, Grenoble68, 1 (2018) 131-150.

[13] K. H. Rosen, Elementary Number Theory and Its Applications. Pearson education, Inc, 2011.

[14] S. Roman, Advanced Linear Algebra. Springer Science+Buisness Media, LLC, 2008.

[15] D. Roy, On Schmidt and Summerer parametric geometry of numbers. Annals of Math. 182,739-786, (2015).

[16] W. M. Schmidt and L. Summerer, Parametric geometry of numbers and applications. ActaArith. 140, (2009), 67–91.

79

Page 82: Parametric Geometry of Numbers and Exponents of Diophantine …uu.diva-portal.org › smash › get › diva2:1333477 › FULLTEXT01.pdf · Introduction Diophantine approximation

[17] W. M. Schmidt and L. Summerer, Diophantine approximation and parametric geometry ofnumbers. Monatsh. Math. 169, (2013), 51–104.

[18] C. L. Siegel, Lectures on the Geometry of Numbers. Springer-Verlag Berlin Heidelberg, 1989,Notes by B. Friedman, Rewritten by Komaravolu Chandrasekharan with the assistance ofRudolf Suter, With a preface by Chandrasekharan.

[19] I. Stewart and D. Tall, Algebraic Number Theory and Fermats Last Theorem. third edition:AK Peters, 2002.

[20] M. Laurent, On transfer inequalities in Diophantine Approximation.https://arxiv.org/abs/math/0703146. 2007.

[21] V. Jarnık, Zum Khintchineschen ”Ubertragungssatz”. Trudy Tbilis. Mat. Instituta 3 (1938),pp. 193–216.

[22] W. Banaszczyk, New bounds in some transference theorems in the geometry of numbers. Math-ematische Annalen, 296(4):625–635, 1993.

80