real analysis oral exam study notes notes transcribed by ...nica/oral/real_notes.pdf · real...

Real Analysis Oral Exam study notes

Notes transcribed by Mihai Nica

3

Abstract. These are some study notes that I made while studying for my oralexams on the topic of Real Analysis, mostly covering the theory of integration.I took these notes from parts of the textbooks by Richard Bass [1] and a fewother sources which are indicated throughout. Please be extremely cautionwith these notes: they are rough notes and were originally only for me to helpme study and are not complete or guaranteed to be free of errors. I have madethem available to help other students on their oral exams.

Contents

Families of sets 41.1. Algebras and σ − algebras 41.2. Monotone Class Theorem 41.3. Pi-Lambda Theorem 5

Measures 7

Construction of Measures 93.4. Outer Measures 93.5. Lebesgue-Stieltjes Measure 113.6. Examples and related results 133.7. Nonmeasurable Sets 143.8. The Caratheodory Extension Theorem 15

Measurable Functions 174.9. Measurability 174.10. Approximation of functions 184.11. Lusin's Theorem 18

The Lebesgue Integral 20

Limit Theorems 216.12. Monotone Convergence Theorem 216.13. Linearity of the Integral 226.14. Fatou's Lemma 226.15. Dominated Convergence Theorem 22

Product Measures 247.16. Product σ−algebras 247.17. The Fubini Theorem 25

Signed Measures 268.18. Positive and Negative sets 268.19. Hahn Decomposition Theorem 268.20. Jordan Decomposition Theorem 27

The Radon-Nikodym Theorem 289.21. Absolute Continuity 289.22. The main theorem 289.23. Lebesgue Decomposition Theorem 29

Di�erentiation 3010.24. Di�erentiation of Monotone Functions 3210.25. Functions of Bounded Variation 3410.26. Di�erentiation of an Integral 3510.27. Absolute Continuity 37

Lp spaces 3911.28. Norms 3911.29. Completeness 4011.30. Convolutions 4111.31. Bounded Linear Functionals 41

Fourier Transforms 4312.32. Basic Properties 4312.33. The Inversion theorem 44

Fourier Series 4513.34. More Fourier Facts 48

Bibliography 50

Families of sets

These are notes from Chapter 2 of [1].

1.1. Algebras and σ − algebras

Fix a universal set X to work in.

Definition. (2.1) An algebra is a collection A of subsets of X that:i) ∅, X ∈ Aii) A ∈ A =⇒ Ac ∈ Aiii) closed under �nite unions and intersectionsA σ−algebra has:iv) closed under countable unions and intersections.

Lemma. (2.7.) Any arbitrary intersection of σ−algebras is a sigma algebra

Definition. We de�ne the σ−algebra generated by a subset C to be thesmallest σ− algebra cotaining C: i.e. the intersection of all σ− algebras containingC. This is denoted by σ(C).

Definition. The Borel σ − algebra is the σ − algebra that is generated byany of the following: 1) open intervals 2) closed intervals 3) half open intervals 4)semi-in�nite open intervals (a,∞]

Proof. Its easy to check by doing some intersections unions that these allgenerate the same thing. �

1.2. Monotone Class Theorem

Definition. (2.9.) A monotone class is a collection of subsets of X thatis closed under increasing, and decreasing sets. I.e. An ↑ A =⇒ A ∈ M andAn ↓ A =⇒ A ∈M.

Theorem. (2.10) Suppose A0is an algebra and, A = σ(A0) is the smallestσ − algebra containing A0 andM is the smallest monotone class containing A0.

ThenM = A.(Mihai's version) In other words: for an algebra A0, the smallest monotone

class containing A0 is in fact, a σ − algebra.

Proof. We must show A ⊂ M as the other inclusion is clear. To do this wehave to show that the monotone class is closed under intersections and unions, forthenM will be an algebra containgn A0 and hence A = σ(A0) ⊂M.

The idea of the proof is the usual trick with sigma algebras and so on: De�nethe set you are interested in and show it contains what you want.

Claim 1: Closed under complements

4

1.3. PI-LAMBDA THEOREM 5

Let N = {A ∈M : Ac ∈M}. Since A0 is an algebra and closed under com-plements, A0 ⊂ N . We claim now that N is a monotone class, for if An ↑ A withAn ∈ N then by def'n Acn ∈ M and we have Acn ↓ Ac shows Ac ∈ M since M isa monotone class. Similar for An ↓ A. Hence N is a monotone calss containg A0

and consequentlyM⊂ N i.e. M = N .Claim 2: Closed under intersections with A0

Let N = {A ∈M : A ∩B ∈M ∀B ∈ A0}. Again N contains A0 since A0 isan algebra, and N is a monotne class since if An ↑ A then An∩B ↑ A∩B for everyB ∈ A0, so the monotone-class-ness of M gives us A ∩ B ∈ M and this holds forevery B ∈ A0. (same idea for An ↓ A). As before, we get N =M.

Claim 3: Closed under intersectionsLet N = {A ∈M : A ∩B ∈M ∀B ∈M}. This contain A0 by the previous

claim! It is a monotone class by the same argument as in claim 2. Hence, we haveM = N .

This shows that M is in fact a σ − algebra (�nite unions/intersections +monotone limits yields countable unions/intersections by splitting up any countableunion/intersection up into a monotne limit of �nite unions/intersections) �

Remark. The sneaky thing in the proof is that you can't jump to closed underintersections right away, because you don't know that A0 is in there a priori. Thelittle extra step is to �rst prove that A0 is closed under intersections fromM andthen move on to the whole space.

1.3. Pi-Lambda Theorem

I'm going to include a little thinger on the Pi-Lambda theorem, as this comesup occasionally in Probability theory.

Definition. A π−system is a collection of subsets which is closed under �niteintersections.

Definition. A λ−system C is a collection of subsets which has:i) X ∈ Cii) A,B ∈ C and A ⊂ B =⇒ B −A ∈ Ciii) An ∈ C , and An ↑ A =⇒ A ∈ C

Proposition. (Alternative de�nition of a λ−system)a) X ∈ Cb) A ∈ C =⇒ Ac ∈ Cc) If An ∈ C are disjoint then

⋃∞n=1An ∈ C

Proof. Its easy to see that i), ii) and iii) imply a,b,c. We prove the conversestatements one at a time assuming a), b) and c):

ii): For any A,B with A ∩B = ∅, we have that:A ∪B ∈ C

=⇒ Ac ∩Bc ∈ CBut Ac∩Bc = Ac−B so this shows that whenver A∩B = ∅, we have Ac−B ∈ C.

Noticing that A ∩B = ∅ ⇐⇒ A ⊂ Bc and relabeling Bc = C gives the result ii)iii) Take set di�erences Bn = An −An−1which are disjoint. �

Theorem. If C is both a π−system and a λ−system, then C is a σ−algebra

1.3. PI-LAMBDA THEOREM 6

Proof. For any arbitarty sequence An ∈ C, we can create Bn ∈ C which aredisjoint with ∪nk=1Bk = ∪nk=1Ak by doing intersections (ok since C is a π−system)and complements (ok since C is a λ-system). Then, since C is a λ−system, we havethat ∪∞k=1Bk ∈ C and so ∪∞k=1Ak = ∪∞k=1Bk ∈ C too! �

Theorem. (7.4.) (Dynkin's π−λ theorem) Suppose Pis a π-system and, A =σ(P) is the smallest σ−algebra containing P and L is the smallest monotone classcontaining P.

Then L = A.(Mihai's version) In other words: for a π−system P0, the smallest λ−system

containing P0 is in fact, a σ − algebra. )

Remark. This is very similar in feel to the monotone class theorem, it has thesame �two step� trick to it.

Proof. We will show that L is a σ-algebra. By the above theorem, it su�cesto show that is is a π-system, so we will do this easier task instead.

Claim 1: For a �xed C ∈ L, de�ne LC := {D ∈ L : C ∩D ∈ L}. Then LC is aλ-system.

Pf: Use the �rst def'n of λ−system (because this one is easier to check) andthe result follows wihtout too much di�cutly because L itself is a λ−system.

Now, choose any C ∈ P. Since P is a π−system, it is closed under intersectionsand we know at least that P ⊂ LC since L contains P. Now, since LC is a λ−systemcontaining P, we have that L = LC since L is the smallest such λ-system.

Since LC = L for every C ∈ P, this is saying that C ∩D ∈ L for every C ∈ Pand every D ∈ L.

Now for any E ∈ L, we claim that LE contain P. Indeed, to check this wewould need E ∩ C ∈ L for every C ∈ P, but this is exactly the above property.Hence LE = L.

Since this holds for all E ∈ L, we have that L is in fact a π−system. �

Remark. There should be a nicer way to make the proofs of the monotoneclass theorem and the π − λ theorem look more similar, but I'm not going to dothat now.

Measures


Definition. (3.1) A measure on a set X and an σ− algebraA is a a functionµ : A → [0,∞] such that:

i) µ(∅) = 0ii) Countably additive for disjoint sets:

µ(⋃∞

i=1Ai) =

∞∑i=1

µ(Ai) for disjoint A′is

Proposition. (3.5.) (Basic Properties of Measures)i) A ⊂ B =⇒ µ(A) ≤ µ(B)ii) A = ∪∞i=1Ai =⇒ µ(A) ≤

∑∞i=1 µ(Ai)

iii) Ai ↑ A =⇒ limn→∞ µ(Ai) = µ(A)iv) If µ(A1) <∞, then ,Ai ↓ A =⇒ limn→∞ µ(Ai) = µ(A)

Proof. These all follow by doing constructions involving complements and setdi�erences and so on in such a way as to reduce the sets in question as unions ofdisjoint sets, for which we know that measures play nice.

Most of them use the �Treat Disjointly Trick� (See remark below)i) Holds since µ(A−B) ≥ 0ii) Make Bi ⊂ Ai by taking set di�erences so that ∪ni=1Bi = ∪ni=1Ai for each n

and the Bi's are disjoint. Then µ(A) = µ(∪Bi) =∑µ(Bi) ≤

∑µ(Ai)

iii) µ(An) is a montone increasing function and is bounded above by µ(A), soit is convergent (to possibly ∞if µ(A) =∞). Hence, by the monotone convergencetheorem for real numbers, this sequence has a limit. To see that the limit is actuallyµ(A), take Bi = Ai+1 −Ai so the Bi's are disjoint and have ∪ni=1Bn = ∪ni=1Ai andapply the countable additivity to get the result. Explicitly:

µ(A) = µ(∪∞n=1An)

= µ(∪∞n=1Bn)

=

∞∑n=1

µ(Bn)

= limn→∞

n∑i=1

µ(Bi)

= limn→∞

µ (∪ni=1Bi)

= limn→∞

µ (∪ni=1Ai)

iv) Take complements of the result in iii) (or, more generally if you arent in a�ntis measures spaceBk = A1 −Ak). The hypothesis that µ(A1) <∞ is needed to

7

MEASURES 8

avoid a �∞−∞” problem. Counterexample to keep in mind: An = (n,∞) whichis decreasing to the empty set but always has in�nte Lebesgue measure. �

Remark. A common theme that comes up for this type of thing: The proofis easy for disjoint sets. The arbitary case can be reduced to the disjoint case bymaking sets Bi which are disjoint and have ∪ni=1Bi = ∪ni=1Ai for every n. I willlabel this trick as [Treat Disjointly]

Definition. (3.7.) De�nition of �nite and σ-�nite measure spaces. A null

set is a set which is a subset of a 0 measure measurable set. (i.e. null sets neednot be measurable) A complete measurable space is one where all null sets aremeasurable. The completion of A is the smallest sigma algebra containing all thenull sets.

Construction of Measures

These are notes from Chapter 4 of [1].Recall the di�uculty with measures is that if you try to de�ne them on all

subsets of a space X, you get problems. The solution is to carefully choose theright σ − algebra to de�ne the measure on. As a consequence, �measurable sets�will be nice enough to work with! This is outlined here.

3.4. Outer Measures

Definition. (4.1.) An outer measure is a function µ∗ de�ned on a collectionof subsets of X satisfying:

i) µ∗(∅) = 0ii) A ⊂ B =⇒ µ∗(A) ⊂ µ∗(B)iii) µ∗(∪∞i=1Ai) ≤

∑∞i=1 µ

∗(Ai)

Remark. There is no notion of �measurable� or σ-algebra or whatever foran outer measure. It has to be de�ned for every subset. It is called an �outermeasure� because of the following common way to construct an outer measure byapproximation with sets �from the outside�. (Sometimes you'll see the constructionhere as the de�nition of an outer measure, and the properties we demand in thede�nition here would be something to be proven)

Proposition. (4.2.) Suppoce C is a collection of subsets of X containg both ∅and X. Suppse that ` : C → [0,∞] with `(∅) = 0. De�ne:

µ∗(E) = inf

{ ∞∑i=1

`(Ci) : Ci ∈ C∀i and E ⊂ ∪∞i=1Ci

}Then µ∗is an outer measure.

Proof. The points i) and ii) from the de�ntion are very easy. To prove iii)consider as follows. Given any collection of subsets A1, A2, . . . and any ε > 0, �nd acollection of subsets Cij ∈ C so that for each �xed i theC ′ijs cover Ai andµ

∗(Ai) ≥∑`(Cij)+ ε/2

i for each i. (This is by de�nition of �inf�). Notice that the collectionCij now covers all of ∪Ai and so we have that:

µ∗(∪Ai) ≤∑ij

`(Cij)

≤ . . . ≤∞∑i=1

µ∗(Ai) + ε

Since this holds for any ε, the desired inequality must be true. �

9

3.4. OUTER MEASURES 10

Example. Lebesgue outer measure is what you get when you put in C =halfopen intervals and `(a, b] = b − a . The Lebesgue-Stieltjes outer measure is whatyou get if you again choose C =half open intervals and you have a non-decreasingand right continuous function α.

Definition. (4.5) A set A ⊂ X is measurable with respect to an outer

measure µ∗ (or more simply µ∗−measurable....I like the �rst phrasing better be-cause it reminds me that outer measures are de�ned on all subsets of a space)if:

µ∗(E) = µ∗(E ∩A) + µ∗(E ∩Ac) ∀E ⊂ X

Remark. By the de�nition of an outer measure, it is ALWAYS TRUE for anyset A that:

µ∗(E) ≤ µ∗(E ∩A) + µ∗(E ∩Ac) ∀E ⊂ XSo the property of being measurable is really about the other inequality. This

is most often how one proves certain sets are measurable.One way to rememember the direction of the sign is to think that �E ∩A and

E ∩ Ac together form a cover of E. Hence, since µ∗(E) is the inf over the measurof all covers, µ∗(E) ≤the sum�. Of coruse, this is a mnemonic device only sincestricly speaking the measure µ∗ is de�ned only as the inf over covers formed by thespecial sets C.

Theorem. (4.6.) Let A = {measurble w.r.t. µ∗ sets} .Then A is a σ−algebra.If we de�ne µ := µ∗|A, then µ is a measure on A. Finally, A contains all the setsof outer measure 0.

Remark. One of the main themes to keep in mind for this proof is that weare trying hard to get the inequalities of the form µ∗(E) ≥ . . . , because this isthe non-trivial inequality for being measurable. Where will these inequalities comefrom? The only way is to use the trivial side of the inequality in a clever way:you must manipulate µ∗(E) = . . . = µ∗(U ∩ V ) + µ∗(U ∩ V c) and then remarkµ∗(U ∩ V ) + µ∗(U ∩ V c) ≥ µ∗(U) since this always holds. (i.e. use property iii)from the de�nition of outer measure.)

Proof. Claim 1: A is closed under complements.Pf: Follows by the symmetry of the de�nition of measurable w.r.t. µ∗.Claim 2: A,B ∈ A =⇒ A ∪B ∈ APf: For any E, use the measurablity property of A at the set E and use the

measurability property of the set B once at the set E∩A and once at the set E∩Ac:µ∗(E) = µ∗(E ∩A) + µ∗(E ∩Ac)

= (µ∗(E ∩A ∩B) + µ∗(E ∩A ∩Bc) +(µ∗(E ∩Ac ∩B) + µ∗(E ∩Ac ∩Bc)))

Now notice that the �rst three terms are E ∩A ∩B, E ∩Ac ∩B, E ∩A ∩Bc.Since A ∪ B = (A ∩ B) ∪ (A ∩ Bc) ∪ (B ∩ Ac), so consequently the �rst three setshere cover E ∩ (A∪B). Hence the sum of their measures is ≥ µ∗(E ∩ (A∪B)) andwe get (alspo use Ac ∩Bc = (A ∪B)cby DeMorgan):

µ∗(E) ≥ µ∗(E ∩ (A ∪B)) + µ∗(E ∩ (A ∪B)c)

So indeed, A∪B satisfy the non-trivial direction of the measurable inequality,and are hence measurable.

3.5. LEBESGUE-STIELTJES MEASURE 11

Claim 3: An ∈ A =⇒ ∪∞n=1An ∈ APf: We �rstly remark that is su�ces to check this for disjoint sets An, because

by using the property of Claim 2, we can rewrite any countable union as a countableunion of disjoint sets modulo some �nite union operations.

For disjoint sets An let Bn = ∪ni=1Ai and B = limn→∞Bn. For E ⊂ X have:

µ∗(E ∩Bn) = µ∗(E ∩Bn ∩An) + µ∗(E ∩Bn ∩Acn)= µ∗(E ∩An) + µ∗(E ∩Bn−1)

Moving the µ∗(E∩Bn−1) to the other side, and then using this as a telescopingsum, we get that:

µ∗(E ∩Bn)− µ∗(E ∩B1) =

n∑i=1

µ∗(E ∩Ai)

=⇒ µ(E ∩Bn) ≥n∑i=1

µ∗(E ∩Ai)

Hence:

µ∗(E) = µ∗(E ∩Bn) + µ∗(E ∩Bcn) ≥n∑i=1

µ∗(E ∩Ai) + µ∗(E ∩Bc)

Taking n→∞, (ok since the sum is monotone increasing) we get:

µ∗(E) ≥∞∑i=1

µ∗(E ∩Ai) + µ∗(E ∩Bc)

≥ µ∗ (∪∞i=1E ∩Ai) + µ∗(E ∩Bc)= µ∗(E ∩B) + µ∗(E ∩Bc)≥ µ∗(E)

So indeed B is µ∗-measurable!These claims together show that A is a sigma algebra.Claim 4: µ := µ∗|A is a measure.Pf: We have only to show that it is countable additive. Following the discussion

above, we had that µ∗(E) =∑∞i=1 µ

∗(E ∩Ai) + µ∗(E ∩Bc) for any set E. TakingE = B gives exactly the result for countable additivity.

Claim 5: If µ∗(A) = 0 then A is µ∗measurable.Pf: This follows from the monotone property of the outer measure, for if

µ∗(A) = 0 then:

µ∗(E ∩A) + µ∗(E ∩Ac) ≤ µ∗(A) + µ∗(E ∩Ac)≤ µ∗(A) + µ∗(E)

= µ∗(E)

Shows the non-trivial direction of the measurability inequality. �

3.5. Lebesgue-Stieltjes Measure

Let α(x) be an increasing right continuous function, let C = {(a, b] : a,< b} andde�ne `((a, b]) = α(b)− α(a). Then de�ne the outer measure:

m∗(E) = inf{∑

`(Ai) : Ai cover E}

3.5. LEBESGUE-STIELTJES MEASURE 12

By Prop 4.2 this is an outer measure, and by theorem 4.6 this de�nes a measureon the collection of m∗−measurable sets.

The convenience of using semi open intervals is due to the following fact:

Lemma. If K and L are disjoint semi open intervals, (of the form (a, b] ) andK ∪ L is also a semi open interval then:

`(K) + `(L) = `(K ∪ L)

Proof. Both are equal to α(c) − α(a) = (α(c) − α(b)) + (α(b) − α(a)) wherea, b, c denote the endpoints of the intevrals in question. �

What kinds of sets are m∗−measurable? Here is a useful fact!

Proposition. (4.7.) Every set in the Borel σ−algebra on R ism∗−measurable.

Proof. Since the collection of m∗−measurable sets is a σ−algebra (proven inProp 4.6) it su�ces to show that every interval J of the form (c, d] ism∗−measurable.Let E be any set with m∗(E) <∞. We need to show that:

m∗(E) ≥ m∗(E ∩ J) +m∗(E ∩ Jc)

Choose I1, I2, . . . of the form (ai, bi] that cover J and so thatm∗(E) ≥∑i[α(bi)−

α(ai)]− ε. Now, since E ⊂ ∪Ii we have that:

m∗(E ∩ J) ≤∑i

m∗(Ii ∩ J)

m∗(E ∩ Jc) ≤∑i

m∗(Ii ∩ Jc)

So we get in sum that:

m∗(E ∩ J) +m∗(E ∩ Jc) ≤∑i

m∗(Ii ∩ J) +m∗(Ii ∩ Jc)

Now we get to the convenience of using semi-open intervals! Since each Ii andJ are semi open intervals, Ii ∩ J is an interval too! Ii ∩ Jc is the union of zero, oneor two such intervals. This is exactly the setup for us to apply the Lemma before.Applying this zero, one or two times we have:

m∗(Ii ∩ J) +m∗(Ii ∩ Jc) = m∗(Ii)

Thus our inequality above is:

m∗(E ∩ J) +m∗(E ∩ Jc) ≤∑i

m∗(Ii) ≤ m∗(E) + ε

Since this holds for any ε > 0, we have the desired inequality. �

The next thing to verify is that m∗ and ` agree on semi-open sets. This is notat all surprising, but it needs to be veri�ed.

Lemma. (4.8.) Let Jk = (ak, bk) be a collection of �nte open intervals whichcovera �nite closed interval [C,D] then:

n∑k=1

[α(bk)− α(ak)] ≥ α(D)− α(C)

3.6. EXAMPLES AND RELATED RESULTS 13

Proof. By looking at a subset of the intervals, we may suppose WOLOG thatthe intervals are �in order� so to speak so that:

a1 ≤ C ≤ b1 and , an ≤ D ≤ bn and ak < bk−1 < bk

Then just write out both sums and see by comparison to a telescoping sumthat the inequality holds. �

Proposition. (4.9) If e and f are �nite and I = (e, f ] then m∗(I) = `(I)

Proof. (This is the one that uses compactness of closed intervals. The ideais to convert any cover by semi-open intervals to a cover using open intervals (thisgives an ε of error by the continuity of α) and then use compactness to get downto �nitely many intervals of interest. The apply the previous lemma.)

Clearly m∗(I) ≤ `(I) by the inf de�nition of m∗ since I is a cover for itself.For the other inequalty, suppose that I ⊂ ∪Ai where each Ai = (ci, di] is a aninterval. By the right continuity of the function α, choose C ∈ (e, f) so that

α(C) < α(e) + ε/2. Let D = f . Choose d′

i > d so that α(d′

i) < α(di) + ε/2i+1 andlet Bi = (ci, d

′i)

(Each (ci, d′i) aproximates the semi open interval (ci, di] and the whole thing

is done so that the total error from the point of view of the α function is no morethan ε.)

Now use the compactness of [C,D] to �nd a �nite set of the Bi's that cover allof [C,D] still. By the previous lemma we get that:

`(I) ≤ α(D)− α(C) + ε/2 ≤∑

`(Ai) + ε/2i+1 + ε/2

and since ε is arbitary the result follows. �

Now that all the major points have been hit, we will drop the ∗ and refer to mas the Lebesgue-Stieltjes measure corresponding to α.

3.6. Examples and related results

3.6.1. Approximating Sets.

Example. (4.10) For the Lebesgue measure its not hard to show that singletonsare measure 0 and so the open, half-open, closed, half-closed intervals all have thesame measure. All countable sets are Lebesgue measure 0 too.

Example. (4.11) The middle thirds Cantor set is uncontable but still hasLebesgye measure 0. The Cantor ternary function is also brei�y described in thisexample.

Example. (4.13) By changing the fraction of removed set as you go, you canend up with a positive measure generalized Cantor set. This set is closed andcontains no intervals, and every point is a limit point.

Proposition. (4.14) Let A ⊂ [0, 1] be a Borel measurable set and let m be theLebesgue measure. Then:

i) Given any ε > 0 there is an open set G so that A ⊂ G and m(G−A) < εii) Given any ε > 0 there is a closed set F so that F ⊂ A and m(A− F ) < εiii) There is a Gδ set H so that A ⊂ H and m(H −A) = 0iv) There is an Fσ set F so that F ⊂ A and m(F −A) = 0

3.7. NONMEASURABLE SETS 14

Remark. We used half open sets in our construction of the Riemann-Stietijesmeasure, so we need to adapt from our half open intervals to the open intervals thatmake up open sets. If one had used open intervals to construct the measure (whichis totally legit btw) then this proposition would be straight from the inf de�nitionof m∗; but some early results (mostly the proof that m∗ = ` for intervals would beslightly harder)

Proof. i) Approximate A be a union of semi-open intervals so that the erroris no more than ε/2 (this holds by the inf de�nition of m∗) Then approximate eachsemi-open interval by an open interval so that th error is no more than ε/2n+1.

ii) Take complements and use the result in i)iii) Choose ε = 1

n and apply the result in i) to get a sequence of open set Gn.WOLOG they are decreasing. Take their intersection to get the desired Gδset.

iv) Same procedure as in iii) applied to closed sets Fn generated from ε = 1n in

part ii) �

3.7. Nonmeasurable Sets

Theorem. (4.15) [Existence of a non-measurable set] Let m∗be the outer mea-sure de�ned in the usual way with the collection C = {(a, b]} and `((a, b]) = b − a.The m∗ is not a measure on the collection of all subsets of R.

Remark. Since we showed thatm∗IS a measure when restricted to them∗measurablesets, this is saying that there are some sets that are not m∗−measurable, and thatthese sets are badly behaived enough that they prohibit m∗from being a measurehere.

Proof. (Here we have the construction using shifting by rationals. There isalso the �x ∼ y if x − y = kα mod 1 proof where α is irrational� that works in asomeqhat similar way.)

Suppose m∗ is a measure. De�ne x ∼ y if x − y is a rational. This is anequivalence relation. Use the axiom of choice to get a representative from eachequivalence class from the points in [0, 1]. Let A be the set of these representatives.Clearly:

[0, 1] ⊂ ∪q∈[−1,1]∩QA+ q

because every point in [0, 1] belongs to some equivalence class, and is hence arational shift away from some point in A by de�nition. Moreover, each set A+ q1is disjoint from A + q2 for q1 6= q2 for if there was a point w in there intersectionthen w − q1 ∈ A and w − q2 ∈ A are both in the same equivalence class and thiscontradicts the choice of A. Hence:

1 ≤∑q

m∗(A+ q)

=∑q

m∗(A)

(m∗(A) = m∗(A+q) comes from the fact that `(I) = `(I+q) and the de�nitionof m∗). On the other hand, ∪q∈[−1,1]∩QA+ q ⊂ [−1, 2] since A ⊂ [0, 1]. So we get:

1 ≤∑q

m∗(A) ≤ 3

3.8. THE CARATHEODORY EXTENSION THEOREM 15

There is no value for m∗(A) that satis�es these inequalites! Either m∗(A) = 0and the lower bound is violated, or m∗(A) > 0 and the upper bound is viloated. �

3.8. The Caratheodory Extension Theorem

Definition. Recall the de�nition of an algebra. (It was like a σ−algebra but�ntie unions instead of countable ones.) A premeasure on an algebra A0 is afunction µ : A0 → [0,∞] satisfying:

i) µ(∅) = 0ii) Countably additive for disjoint sets that happen to stay in the algebra:If A1, . . . are disjoint and it happens that ∪∞i=1Ai ∈ A0, then

µ(⋃∞

i=1Ai) =

∞∑i=1

µ(Ai)

The next theorem repackages the constructuion of the Lebesgue-Stieltjes mea-sure in a slightly more general framework:

Theorem. (4.16) [CARATHEODORY EXTENSION THEOREM] Supposethat A0 is an algebra and that ` : A0 → [0,∞] is a premeasure on A0. De�ne:

µ∗(E) = inf

{ ∞∑i=1

`(Ai) : each Ai ∈ A0 cover E

}Then:1) µ∗ is an outer measure2) µ∗(A) = `(A) if A ∈ A0

3) Every set in A0 is µ∗−measurable4) If `is σ−�nite then there is a unique extension to σ(A0)

Proof. (1) follows by proposition 4.2(2) µ∗(E) ≤ `(E) for E ∈ A0 by the inf def'n of µ∗ since E covers itself. The

other inequality holes because if E ⊂ ∪Ai, then we can let Bn = E∩(An − ∪n−1i=1 Ai

)so that the ∪nBk = ∪nAk and since the Bi's are pairwise disjoing and each in A0

and their union is E which also happens to be in A0, so by the premeasure property:

`(E) =

∞∑i=1

`(Bi) ≤∞∑i=1

`(Ai)

So clearly then by taking inf's we have that`(E) ≤ µ∗(E)(3) To see that every set A ∈ A0 is µ∗measurable, consider as follows. For any

E ⊂ X, take B1, B2, . . . ∈ A0 that cover E and∑`(Bi) ≤ µ∗(E) + ε. Then, since

`(Bi) = `(Bi ∩ A) + `(Bi ∩ Ac) because ` is a premeasure and A,B ∈ A0. Havethen:

µ∗(E) + ε ≥∞∑i=1

`(Bi) =

∞∑i=1

`(Bi ∩A) +∞∑i=1

`(Bi ∩Ac)

≥ µ∗(E ∩A) + µ∗(E ∩Ac)

Since ε arbitrary, we get the desired non-trivial half of the measurability in-equality. �

3.8. THE CARATHEODORY EXTENSION THEOREM 16

(4) The fact that the measure exists follows from our previous work: We knowthat A = {µ∗ −measurable sets}is a σ−algebra, it contains A0 by (3), and weshowed earlier that µ = µ∗|A is a measure. Uniqueness is most easily done withthe π − λ theorem:

If µ, ν are two measures so that µ(A) = ν(A) = `(A) for all A ∈ A0, letP = {A : µ(A) = ν(A)}. This contains the algebra A0 by construction, and weverify that it is a λ−system. Hence, by the π−λ theorem, it is in fact a σ-algebra!Since it contains A0, it must in fact be all of σ(A0).

Measurable Functions


4.9. Measurability

Fix a measurable space (X,A)

Definition. (5.1.) A function f : X → R is measurable or A-measurable

if f−1 ((a,∞)) ∈ A for all a ∈ R

Proposition. (5.5.)The following are equivalent (i) f−1 ((a,∞)) ∈ A for alla ∈ R ii) f−1 ([a,∞)) ∈ A for all a ∈ R iii) f−1 ((∞, a)) ∈ A for all a ∈ R iv)f−1 ((∞, a]) ∈ A for all a ∈ R

Proof. Use compleemnts and intersections/unions with a fudge factor of 1n�

Proposition. (5.6.) If X is a metric space and A contains all the open sets,then continuous functions are always measurable

Proof. The preimage through any continuous function of any open set is open,so f−1 ((a,∞)) ∈ Open Sets ⊂ A �

Proposition. (5.7.) If f, g are measurable then so are f+g,−f, cf, fg,max(f, g)and min(f + g)

Proof. i) {f + g < a} = ∪r∈Q {f < r} ∪ {g < a− r}ii) {−f > a} = {f < −a}iii) Assume WOLOG by ii that c > 0 and then:{cf > a} = {f > a/c}iv) fg = 1

2

((f + g)2 − f2 − g2

)v) {max(f, g) < a} = {f < a} ∩ {g < a}vi) {min(f, g) > a} = {f > a} ∩ {g > a} �

Proposition. (5.8.) If fn are all measurable then so are sup fn, inf fn, lim sup fn, lim inf fn

Proof. {sup fn > a} = ∩n {fn > a} and {lim sup fn > a} = ∪n∩k>n{fk > a} =lim sup {fn > a}. Similar for inf �

Proposition. (5.10) If f is monotone, then f is Borel measurable.

Proof. SupposeWOLOG that f is increasing. Look at x0 = sup {y : f(y) ≤ a}since f is increasing, for x > x0 we have f(x) ≥ a by uisng the de�niton of inf andthe fact that f is increasing. Hence (x0,∞) ⊂ {f > a}. Similarly, (−∞, x0) ⊂{f ≤ a}. So then {f > a} = [x0,∞) or (x0,∞) and in either case it is measur-able. �

17

4.11. LUSIN'S THEOREM 18

Proposition. (5.11.) f is measurable if and only if f−1(A) ∈ A for all Borelmeasurable A

Proof. Check that the set{B : f−1(B) ∈ A

}is a sigma algebra. It is equal

to the Borel sigma algebra if and only if it contains the open intervals. �

4.9.1. Non-Borel set of Lebesgue measure 0.

Example. (5.12) Let f be the Cantor ternary function and let F be its �inverse�

F (x) := inf {y : f(y) ≥ x}This is an increasing function, so it is measurable. Morevoer, F [0, 1] = C the

Cantor set. Since F is Borel measurable, F−1 maps Borel sets to Borel sets.Now, take any non-measurable set A. Let B = F (A). Since B = F (A) ⊂ C, B

is a Lebesgue-measure-zero set. On the other hand, B cannot be Borel measureable,for it it were then we would have A = F−1(B) and that would mean that A ismeasurable, a contradiction.

4.10. Approximation of functions

Definition. (5.13) De�ne the characteristic function χE to be the 1 on Eand 0 o� E. A simple function is a a �nite linear combination of char functions.

Proposition. (5.14) Suppose f is a non-negative and measurable function.Then there exists a sequence of non-negative measurable simple functions sn in-creasing to f

Proof. Perform the �diadic decompostion�, En,i ={i−12n ≤ f <

i2n

}, and then

�round down� to the diadic sets. (You also have to increase the �ceiling� with n asyou go) �

4.11. Lusin's Theorem

Theorem. (5.15) Suppose f : [0, 1] → R is Borel measurable and m is theLebesgue measure. For any ε > 0 there exists a continuous function g : [0, 1] → Rwhich is continuous and which has {f = g} is a closed set with m {f 6= g} < ε.

Proof. [Standard ladder proof]First prove that it is true for characteristic functions. Find a closed set E

and an open set G so that E ⊂ A ⊂ G with measure m(G − E) < ε and put

δ = inf {|x− y| : x ∈ E, y ∈ Gc} and de�ne g(x) = (1− d(x,E)/δ)+. Then g = f

on F = E ∩Gc (both have the value 1 on E and 0 on Gc) and m([0, 1]−F ) < ε bythe choice of E,G .

For simple functions, f =∑Mi=1 aiχAi approximate each aiχAi by a continuous

function gi on a closed set Fi so that m ([0, 1]− Fi) < ε/M and then put F = ∩iFiwill be the set we want.

For arbitrary non-negative bounded functions, do the diadic decompositionfn =

∑i2nχAi,n

which is increasing to f . Notice that hn = fn+1 − fn is a simplenon-negative function which is bounded by 2−n and we have that (by telescoping)fn =

∑nk=1 hk. Now approximate each hk by a continous function gk so that they

agree on a set Fk so that m([0, 1] − F ) ≤ ε/2k. Then we will have that fn and∑nk=1 gk will agree on a set F = ∩kFk so thatm([0, 1]−F ) < ε . Finally, notice that

since each gk ≤ 12kthe in�nte sum

∑∞k=1 gk is uniformly converging (by say M-test)

4.11. LUSIN'S THEOREM 19

and since each gk is continuous, it converges to a continous function g. This functionis equal to

∑∞k=1 hk on all of F . But by de�nition,

∑∞k=1 hk = limn→∞ fn = f so

this is exactly what we want.Finally, for arbitarty functions, split up f into f+ and f− and do the dance for

both pieces. �

Example. (5.16) A good exmaple to see what this theorem is and is not sayingis to look at f = χ[0,1]−Q. Since the irrationals are dense in [0, 1], the function ftakes values of both 1 and 0 in any interval, so you might be tempted to thingthat its impossible for a continuous function to be equal to this on a set of positivemeasure.

The resolution is that, since m(Q) = 0, we can �nd an closed set F thatcontains no rationals withm(F ) > 1−ε (this can be done explicitly in this case: putG = ∪q∈Q∩[0,1]

(q − ε

2n , q +ε2n

)and F = [0, 1]−G contains no rational numbers.)

Put g ≡ 1 everywhere on [0, 1]. Now since F contains no rationals, g and f agreeon F (they are identically 1 there) and m([0, 1]− F ) < ε.

So in this case, Lusin's theorem gives us a very useless approximation otχ[0,1]−Q, namely the constant function 1. This example nicely illustrates that thetype of approximation �there exists a continuous function that agree except for asmall measure set� is actually not so useful.

The Lebesgue Integral


Definition. (6.1.) Let X,A, µ be a measure space. If s =∑ni=1 aiχEi

is asimple function, we de�ne the integral of a simple function to be:ˆ

sdµ =

n∑i=1

aiµ(Ei)

We de�ne the integral of a non-negative measurable function to be:ˆfdµ = sup

{ˆsdµ : s ≤ f and s is simple

}For arbitary functions we de�ne

´fdµ =

´f+dµ−

´f−dµ

Definition. (6.2.) We say f is integrable when´|f | dµ <∞

Proposition. (6.3.) Some basic conseqeunces of the de�nitionsi) If a ≤ f(x) ≤ b for all x ∈ X then aµ(X) ≤

´fdµ ≤ bµ(X)

ii) If f ≤ g then´fdµ ≤

´gdµ

iii) If f is integrable, then´cfdµ = c

´fdµ for all c

iv) If µ(A) = 0 and f is integrable then´fχAdµ = 0

Proof. These are pretty straightforward, most of them are proven by �rstproving it for simple functions, and then using the de�nition of the integral as thesup over simple functions to get the result (sometimes the trick you have to do is todo sup

{´sdµ : s ≤ f and s is simple

}= sup

{´sdµ : s ≤ f and s is simple and has a certain property

}where the certain property helps you get the result. (For example in part i) theproperty is s ≥ a) �

Proposition. (6.4.) If f is integrable, then:∣∣∣∣ˆ f

∣∣∣∣ ≤ ˆ |f |Proof. For the real case its easy using f ≤ |f | and −f ≤ |f | �

20

Limit Theorems


6.12. Monotone Convergence Theorem

Theorem. (7.1.) Suppose fn is a sequence of non-negative measurable func-tions with f1 ≤ f2 ≤ . . . for all x and with:

limn→∞

fn = f

Then: ˆfndµ→

ˆfdµ

\

Proof. Since fn is an increasing sequence of functions, by the result in thelast chapter that says integration is monotone (i.e. f ≤ g =⇒

´f ≤´g), we know

that´fn is an increasing sequence of numbers. Moreover, it is bounded above by´

f (because fn ≤ f and again because integration is monotone).By the monotone convergence theorem for real number sthen, there is a limit

limn→∞´fndµ = L . L ≤

´fdµ since each term in the sequence has this property,

so we desire only to show that L ≥´fdµ. By the de�nition of the integral, it is

su�cient to show that L ≥´sdµ for any simple function s with s ≤ f .

To do this we give ourselves a multiplicative factor of 1− ε of room. Fix ε > 0.For any simple function s with f ≤ s look at the set An = {fn > (1 − ε)s}. Sincefn ↑ f and s < f we know that An ↑ X. Now notice that (from the basic propertiesof integration we proved) that:

ˆfn ≥

ˆfnχAn

≥ (1− ε)ˆsχAn

= (1− ε)∑

aiµ (Ei ∩An)

→ (1− ε)∑

aiµ(Ei)

= (1− ε)ˆsdµ

Since this holds for any ε > 0 we have´f ≥´s and the result then follows by

the de�ntion of the integral! �

21

6.15. DOMINATED CONVERGENCE THEOREM 22

6.13. Linearity of the Integral

Theorem. (7.4.) If f, g are non-negative and measurable, then:ˆ(f + g) dµ =

ˆfdµ+

ˆgdµ

Proof. (Ladder Proof)We �rst prove this for non-negative simple functions, which is easy from the

de�ntion.Then we use the diadic decomposition to get sn ↑ f and rn ↑ g and then use

the monotone convergence theorem to get the result for non-negative f,g.For arbitarty f, g, split it into f+ and f− �

6.14. Fatou's Lemma

Theorem. (7.6.) Suppose that fn are non-negative and measurable. Then:ˆlim inf fn ≤ lim inf

ˆfn

Proof. Let gn = infk≥n fk so that gn ↑ lim inf fn. Clearly, gn ≤ fk for eachk ≥ n so we have: ˆ

gn ≤ infk≥n

ˆfk

Now take the limit of this inequality as n→∞. By the Monotone covnergencetheorem, the LHS goes to

´lim inf fn (since gn ↑ lim inf fn) while the RHS goes to

lim inf´fn just by the de�nition of the it �

Remark. Let's compare this to Fatou's lemma for sets:

P (lim inf An) ≤ lim inf P (An)

Proof. Let Bn = ∩k≥nAk so that Bn ↑ lim inf An by the de�nition of theliminf of sets. Also clearly, Bn ⊂ Ak for any k ≥ n so in particular:

P (Bn) ≤ infk≥n

P(Ak)

Take limit as n→∞now. By the continuity of probability, Bk ↑ lim inf An =⇒P(Bk) ↑ P (lim inf An) and on the other side tends to lim inf P(An) �

Remark. So the proof is exactly the same, with the continuity of probabilitydoing the work in the set case, and the monotone convergence theorem doing thework in the function case. (notice actually that An ↑ X =⇒ µ(Ei ∩An)→ µ(Ei)is also used explicitly in the MCT proof)

Once the set one is proven btw, I think you can do a standard ladder to get tothe Monotone convergence theorem for arbitary functions.

6.15. Dominated Convergence Theorem

Theorem. (7.7.) Suppose fn are measurable real-valued functions and fn → fa.e.. Suppose there exists an integrable function g such that |fn| ≤ g for all x. Then:

limn→∞

ˆfn →

ˆf

6.15. DOMINATED CONVERGENCE THEOREM 23

Proof. Use Fatous's lemma once on fn+g which is non-negative and once ong − fn which is also non-negative. You get a limsup/liminf sandwhich and get thedesired equality. �

Theorem. (Exercise 7.2.) If fn, gn, f, g are integrable with fn → f a.e., gn →g a.e. and |fn| ≤ gn for each n and

´gn →

´g, then

´fn →

´f

Proof. Since fn − gn and gn − fn are non-negative we can apply Fatou toknow that:

lim inf

ˆ(fn − gn) ≥

ˆ(lim inf fn − gn)

=

ˆf − g

=

ˆf −ˆg

On the other hand, lim inf(´fn − gn

)= lim inf(

´fn) −

´g by the fact that´

gn →´g

(Use the lemma: If bn → b then lim inf (an + bn)→ lim inf(an) + b Pf: (Subse-quences) Take a subsequence nk so that ank

is convergent and . Then bnk→ b since

bn is convergent. Have then lim (ank+ bnk

) = lim(ank) + b ≥ lim inf(an) + b. Since

every limit point of an+ bn is ≥ lim inf(an)+ b the liminf obeys this inequailty too.By taking a subsequence so that lim(ank

) = lim inf(an), we see the other inequality.)

From here the proof procedes the same as the ordinary Fatou lemma. �

Product Measures


7.16. Product σ−algebras

Suppose (X,A, µ) and (Y,B, ν) are two measure σ-�nte measure spacdes.

Definition. A measurable rectangle is a set of the form A × B with A ∈A and B ∈ B. Let C0 be the collection of �nite unions of disjoint measurablerectangles, i.e. C0 = {∪ni=1Ai ×Bi}. One can easily verify that C0 is an algebra.(Some people say the measurable rectangles are a semi algebra and then have atheory based on that, but its exactly the same as this)

We de�ne the product sigma algebra by:

A× B = σ (C0)

For E ∈ A× B we de�ne its x-section and y-section by:

sx(E) := {y ∈ Y : (x, y) ∈ E}ty(E) := {x ∈ X : (x, y) ∈ E}

Given a function f : X × Y → R that is A× B measurable, we de�ne for eachx and y Sxf : Y → R and Tyf : X → R by:

Sxf(y) := f(x, y)

Tyf(x) := f(x, y)

Lemma. (11.1) i) If E ∈ A× B then sx(E) ∈ B for each x and ty(E) ∈ A foreach y

ii) If f is A × B measurable, then Sxf and Tyf are B and A measurable re-spectively.

Proof. (Sigma Albgebra proof)Let C be the collection of sets for which sx(E) ∈ B. Check that its a sigma

algebra and that it contains the rectangles.For ii), do a ladder proof. The result holds for simple functions by i) and so

on. �

Proposition. (11.2) For E ∈ A × B, de�ne h(x) = ν (sx(E)) and k(y) =µ (ty(E)) then:

i) h, k are measurableii) We have: ˆ

h(x)µ(dx) =

ˆk(y)ν(dy)

24

7.17. THE FUBINI THEOREM 25

In other words:ˆ [ˆSxχE(y)ν(dy)

]µ(dx) =

ˆ [ˆTyχE(x)µ(dx)

]ν(dy)

This is sometimes written more succinctly as:ˆ ˆχE(x, y)ν(dy)µ(dx) =

ˆ ˆχE(x, y)µ(dx)ν(dy)

Proof. (This is where the monotone class theorem is �nally used)Let C be the collection of sets where this holds. We check that it contains the

rectangles in A× B and also similarly for a �nite union of rectangles (can assumeWOLOG they are disjoint)

Now suppose En ↑ E and each En ∈ C. The result will follow by the monotoneconvergence theorem.

If En ↓ E and each En ∈ C, then the result will follow by using the dominatedconvergence theorem. (To make this one work you must assume that µ and ν are�nite measures....for σ−�nite it will still work by proving it for each �nite piece�rst)

Hence C is a monotone class containing the algebra of �nite unions of rectangles,therefore it actually contains the whole sigma algebra generated by these, which isall of A× B. �

Definition. The product measure µ× ν is de�ned as:

µ× ν(E) =

ˆh(x)µ(dx) =

ˆk(y)ν(dy)

This is indeed a measure. One can check it is �nitely additive, and then use amonotone convergence theroem to get that it is countable additive.

7.17. The Fubini Theorem

Theorem. (11.3.) If f : X × Y → R is measurable with respect to A× B. Ifeither:

a) f is non-negativeb)´|f(x, y)| d (µ× ν) (x, y) <∞

Then:

ˆ ˆf(x, y)d (µ× ν) =

ˆ (ˆf(x, y)dµ(x)

)dν(y)

=

ˆ (ˆf(x, y)dν(y)

)dµ(y)

Remark. Even if (X,A, µ) and (Y,B, ν) are complete, the product µ × ν onA×B will not be complete. (Example a singleton cross with a non-measurable setis not measurable) For this reason

Signed Measures


8.18. Positive and Negative sets

Definition. (12.1) Let A be a sigma algebra. A signed measure is a functionµ : A → (−∞,∞] such that µ(∅) = 0 and µ

(∪∞i=1Ai

)=∑∞i=1 µ(Ai) whenever the

Ai's are pairwise disjoint.

Remark. The only di�erence between this and an ordinary measure is that anordinary measure is required to be non-negative as well.

Definition. (12.2.) Let µ be a signed measure, a set A ∈ A is called apositive set for µ if µ(B) ≥ 0 for all B ⊂ A and B ∈ A. Similarly a negative setA is one where µ(B) ≤ 0 for all B ⊂ A, B ∈ A.

Example. (12.3) If m is the Lebesgue measure then :

µ(A) =

ˆA

fdm

for some integrable f is a signed measure. The sets P = {f > 0} and N ={f < 0} are positive and negative sets for µ.

Proposition. (12.4.) Let µ be a signed measure which takes values in (−∞,∞]let E be measureable with µ(E) < 0. Then there exists a measurable subset F of Ewhich cis measurable and which is a negative set with µ(F ) < 0.\

Proof. (The idea is to start with E and cut out all the positive measure setsof size 1

nk. Any subset of the remaining will be negative or else it would have been

cut out)I'm going to skip the actual details here. �

8.19. Hahn Decomposition Theorem

Definition. A4B := (A−B) ∪ (B −A).

Theorem. (12.5.) (1) Let µ be a signed measure taking values in (−∞,∞].There exists disjoint measurable sets E and F in A whose union is X and suchthat E is a negative set and F is a positive set.

2) The decomposition is essentially unique in the sense that: If E′ and F ′ areanother pair of positive and negative sets whose union is X, then E4E′ = F4F ′is a null set w.r.t. µ

3) If µ is not a positive measure then µ(E) 6= 0. If −µ is not a positive measurethen µ(F ) 6= 0.

26

8.20. JORDAN DECOMPOSITION THEOREM 27

Proof. (1) Let L = inf {µ(A) : A is a negative set } Choose sets An so thatµ(An) → L. Let E = ∪∞n=1An. Let Bn = An − (B1 ∪ . . . ∪Bn) so that ∪nk=1Bk =∪nk=1Ak and the Bk's are pairwise disjoint. Since the An's are all negative sets, soare the Bn's. We claim that E = ∪nBn = ∪nAn is a negative set. This follows bythe continuity of measure, sicne for any C ⊂ E we have::

µ(C) = limn→∞

µ (C ∩ (∪ni=1Bi)) = limn→∞

n∑i=1

µ(C ∩Bi) ≤ 0

We claim that µ(E) = L since L ≤ µ(E) = µ (An) + µ(E − An) ≤ µ(An)→ Lmakes the desired sammich.

Now we claim that F = Ec is a positive set. Otherwise, there is a subset C ⊂ Fof negative measure. But then C contains a negative set (prop 12.4.) and so E ∪Cmakes a negative set with measure stricly less than L, contradicting L as the sizeof the �largest� negative set.

(2) Rewritting E4E′ as (E∩F ′)∪(E′∩F ) makes it clear that E4E′ = F4F ′.Any subset of this can be written as a subset of E ∩ F ′ union with a subset ofF ∩ E′. Any subset of either of these must simultaneuous have non-negative andnon-positive measure, so it is indeed a null set.

(3) Skip this for now. �

Definition. We say two (non-negative) measures µ and ν are mutually sin-

gular if there exist two disjoint subsets E and F in A whose union is all of X andwith µ(E) = 0 and µ(F ) = 0. This is often written µ ⊥ ν.

Example. (12.6) The Lebesgue measure restricted to (0, 12 ] and the Lebesgue

measure restrived to [ 12 , 1) are mutually singular. This works because the Lebsegue

measure of { 12} is 0.

Example. (12.7.) Look at the Cantor Ternary function f on [0, 1] and let ν bethe assoiciated Lebesgue-Stieltjes measure. Let µ be the Lebesgue measure. Thenµ ⊥ ν by the partitioning of the [0, 1] = C ∪Cc where C is the Cantor set. On Cc,ν is measure zero because f is constant on every interval there.

8.20. Jordan Decomposition Theorem

Theorem. (12.8.) If µ is a signed measure on a measurable space (X,A),then there exist positive measure µ+ and µ− such that µ = µ+ − µ− and µ+ ⊥ µ−.Moreover this decomposition is unique.

Proof. Use the Hanh decomposition to �nd the negative set E and positiveset F that decompose X/ Let µ+(A) = µ(A ∩ F ) and µ−(A) = −µ(A ∩ E). Thisgives the desired decomposition.

The uniqueness follows with some work from the �uniqueness up to null sets�result for the Hanh decomposition. �

Definition. The measure:

|µ| = µ+ + µ−

Is called the total variation measure of µ and |µ| (X) is called the totalvariation of µ.

The Radon-Nikodym Theorem


9.21. Absolute Continuity

Definition. (13.1) A measure ν is said to be absolutly continuous withrespect to a measure µ if µ(A) = 0 =⇒ ν(A) = 0. This is written ν � µ

Proposition. (13.2) Let ν be a �nite measure. Then ν is absolutely continuouswith respect to µif and only if for all ε > 0, there exists δ > 0 so that µ(A) < δ =⇒ν(A) < ε

Proof. (⇒) Suppose by contradiction this is not the case. Then there existsε0 so that for every δ there is a bad set Aδ with µ(Aδ) < δ and ν(Aδ) > ε0.Take δ = 2−n to get a sequence of these sets and let A = lim supAn. Then since∑µ(An) < ∞ we know that µ(A) = 0 by Borel-Ceantelli but on the other hand

ν(A) = ν (lim supAn) ≥ lim sup ν(An) > ε0 which contradicts absolute continuity.(⇐) If µ(A) = 0 then µ(A) < δ for all δ and so using the hypothesis we can

conclude that ν(A) < ε for any ε, meaning that ν(A) = 0 too. �

Remark. This proposition gives the connection with the notion of absolutecontinuity for a function. Recall that a function is said to be absolutely continuousif for all ε > 0, there exists δ > 0 so that any pairwise disjoint sequence of intervals(xk, yk) has

∑k |yk − xk| < δ =⇒

∑k |f(yk)− f(xk)| < ε.

9.22. The main theorem

Lemma. (13.3.) Let µ and ν be �nite positive measure on a measurable space(X,A). Either µ ⊥ ν or else there exists ε > 0 and G ∈ A such that µ(G) > 0 andG is a positive set for ν − εµ. (i.e. ν(A) ≤ εµ(A) for all A ⊂ G)

Proof. Conisder the Hahn decomposition for ν − 1nµ. Let F = ∪nFn be the

union over all the positive sets and E = ∩nEn be the intersection of all the negativesets. Notice that Ec = F by DeMorgan's laws since Ecn = Fn for each n.

Claim: ν(E) = 0Pf: ν(E) ≤ ν(En) ≤ 1

nµ(En) ≤1nµ(X)→ 0

If µ(Ec) = µ(F ) = 0 then the sets E and F divide up X to show that µ ⊥ ν.\Otherwise, µ(Ec) = µ(F ) > 0. In this case, µ(Fn) > 0 for some Fn. Choosing

G = Fn and ε = 1n gives the result. �

Theorem. (13.4) Suppose that µis a σ−�nite positve measure on a measurablespace (X,A) and ν is a �nite positive measure on (X,A) such that ν � µ. Then

28

9.23. LEBESGUE DECOMPOSITION THEOREM 29

there exists a µ−integrable non-negative f which is measurable with respect to Asuch that:

ν(A) =

Â

fdµ

Moreover the f is unique up toµ−a.e.

Proof. (The idea is to look at the set of functions f so thatÁfdµ ≤ ν(A)

for all A ∈ A, and then to take the �largest� of these) The uniqueness part is easysince if

Áfdµ =

Ágdµ for all A ∈ A then

Á(f − g)dµ = 0 for all A ∈ A and

then f = g a.e. (for example look at the set A = {|f − g| > ε} must be measure 0for all ε)

Let F ={g measurable : g ≥ 0,

Ágdµ ≤ ν(A) for all A ∈ A

}Let L = sup

{´gdµ : g ∈ F

}and take a sequence gn ∈ F so that

´gn → L.

Let hn = max (g1, . . . , gn) . Check that this is still in F . Let f = suphn = limhnsince the hn's are increasing. By the monotone convergence theorem,

´fdµ =

limn

´hndµ ≤ ν(A) since hn ∈ F which shows that f ∈ F and also

´fdµ ≥´

hndµ ≥´gndµ→ Lshows that

´fdµ = L.

Now there is some technical work (in partticular using the previous lemmas)to show that f is the desired function and also the fact that ν � µ to make itwork. �

9.23. Lebesgue Decomposition Theorem

Theorem. (13.5.) If µ is a sigma �nite measure and ν is a �nite measure,then there exists positive measure λand ρ so that:

i) ν = λ+ ρii) ρ� µiii) λ ⊥ µ

Proof. Take f as in the proof of the R-N theorem and set ρ(A) =Áfdµ and

λ = ν − ρ and it will work out (technical details are omitted) �

Di�erentiation

These are notes from Chapter 5 of [3]. Here is a mind map of the proofs of thissection:

30

DIFFERENTIATION 31

Vita

li C

over

A c

olle

ctio

n of

inte

rval

s is

cal

led

a V

itali

cove

r if e

very

poi

nt is

cov

ered

by

an in

terv

al o

f arb

itrar

ily s

mal

l siz

e

Vita

li C

over

Lem

ma

Giv

en ε

> 0

and

a V

itali

cove

r of a

se

t E, t

here

is a

fini

te s

ubse

t of

inte

rval

s fro

m th

e co

ver t

hat c

over

E

excp

et fo

r a s

et o

f siz

e <

ε

Incr

easi

ng F

unct

ions

are

Diff

Pf H

int:

Che

ck th

at th

e "fo

ur

deriv

ativ

es" a

re e

qual

pai

r by

pair

exce

pt fo

r nul

l set

s. L

ook

at th

e se

t w

here

D-<

u<v<

D+

with

u,v

, rat

iona

l. M

ake

a V

itali

cove

r her

e w

here

the

endp

oint

s ar

e co

ntro

lled.

Use

in

crea

sing

to g

et a

n in

equa

lity.

Fato

u B

ound

for

Incr

easi

ng f

⌠f '

< f(b

) - f(

a)

Func

tions

of B

ound

ed V

aria

tion

are

the

Diff

of T

wo

Mon

oton

e Fu

nctio

nsP

f Hin

t: Th

e tw

o fu

nctio

ns a

re th

e cu

mul

ativ

e po

sitiv

e an

d ne

gativ

e va

riatio

n

Func

tions

of

Bou

nded

Var

iatio

n A

re D

iffer

entia

ble

Inde

finite

Inte

gral

s ar

e of

bou

nded

va

riatio

n

diff

( int

f(t)

dt) =

f(x)

Pf h

int:

Pro

ve it

for b

ound

ed fu

nctio

ns

first

usi

ng th

e co

verg

ence

of f

n =

seca

nt c

urve

of s

ize

1/n

to F

' . T

hen

boot

stra

p to

inte

grab

le F

.

Inde

finite

Inte

gral

s ar

e ab

s. c

ont.

Def

n of

Abs

olut

ely

Con

tinuo

us

Abs

. Con

t. is

bo

unde

d va

riatio

n

f abs

con

t and

f' =

0 a

.e. t

hen

f =

cons

tP

f hin

t: Fi

x ε

and

δ . U

se th

e fa

ct th

at |f

' | <

δ to

m

ake

a V

italli

cov

er s

o th

at f

can'

t mov

e m

uch

on th

e in

terv

als.

The

n us

e co

ver l

emm

a to

get

fin

itely

man

y th

at c

over

it e

xcpe

t for

a s

mal

l set

. U

se a

bs c

ont t

o co

ntro

l the

mov

emen

t of f

on

the

smal

l set

.

f abs

con

t the

n f =

int (

diff

f ) d

tP

f hin

t: C

heck

that

g(x

) = f(

x) -

int f

'(t)

dt

has

zero

der

ivat

ive

ever

ywhe

re.

10.24. DIFFERENTIATION OF MONOTONE FUNCTIONS 32

10.24. Di�erentiation of Monotone Functions

Definition. Let I be a collection of intervals we say that I is a Vitali cov-ering of a set E if for all ε > 0 and any x ∈ E, ∃I ∈ I with x ∈ I and `(I) < ε.The intervals may be open, closed or half-opne but no degenerate intervals (i.e.singletons) are allowed.

Remark. Another way to say this heurstically: �I is a Vitali covering if itcovers every point in E with an arbitarily small interval�

Lemma. (Vitali) Let E be a set of �nte outer measure and I a Vitali coveringof E. Then given ε > 0 there is a �nite disjoint collection {I1, . . . , IN} of intervalsin I such that:

m∗

[E −

N⋃n=1

In

]< ε

Proof. (Pf Idea only)Assume WOLOG all the intervals are closed and O ⊃ E is an open set containg

E and every interval is a subset of O.The idea is to recusrvly construct a sequence {I1, . . .} by adding in intervals

one at a time, chosen so that they are disjoint and every new added inteval is closeto being as large (in the sense of length) as it can be. (i.e. its length is at least 1

2

the sup over the lengths of all candidate intervals) Have then∑∞k=1 `(Ik) ≤ m(O).

For any ε > 0 take N so large so that∑k≥N `(Ik) < ε then and then show that

∪Nn=1In is the set we are looking for. The fact that its a Vitalli covering, and thatwe chose the intervals to be close to as large as possible means we could not havemissed a piece of E of size more than ε. �

Definition. The derivatives (there are four of them) of a function f arede�ned as: (Can come from the left or right (denoted by+ or −) and can be thelimsup or the liminf (denoted by either a superscript or a subscript)

D+f(x) = lim suph→0+

f(x+ h)− f(x)h

D−f(x) = lim suph→0−

f(x+ h)− f(x)h

D+f(x) = lim infh→0+

f(x+ h)− f(x)h

D−f(x) = lim infh→0−

f(x+ h)− f(x)h

Clearly D+ ≥ D+ and D− ≥ D−. If all four of these are equal, then we say thefunction is di�erentiable and call its derivative f ′. If D+f = D+f we say that fhas �right-hand� derivatevs and de�ne f ′(x+) to be its value. Similary for f ′(x−)

Remark. The connection between the Vitali covering lemma and these deriva-tives is that at each point x, we can �nd (by de�niton of limsup/liminf) we can�nd a points hn → 0 so that f(x+ hn) is controlled. The intervals [x, x+ hn] willbe a nice vitalli covering for us.

Proposition. (2) If f is continuous on [a, b] and one of its derivatives (forconcreteness say D+) is everywhere non-negative on (a, b), then f is non-decreasingon [a, b] (i.e. f(x) ≤ f(y) for all x ≤ y in the interval)

10.24. DIFFERENTIATION OF MONOTONE FUNCTIONS 33

Proof. ....I tried for a bit to come up with a proof by contradiction but coudlntmake it work. �

Theorem. Let f be an increasing real-valued function on the interval [a, b].Then f is di�erentiable almost everywhere. The derivative f ′ is measurable and:ˆ b

a

f ′(x)dx ≤ f(b)− f(a)

Remark. The Cantor ternary function is an example where f ′ = 0 a.e. andso´f ′ = 0 < f(b)− f(a)

Proof. The idea is to show that any two of the the four derivateves agreeexcept on a set of measure 0. For concreteness we will handle D+f = D−f . Toshow that thesea re not equal except on a set of measure zero, it su�ces to show bycountability that Euv = {D+f > u > v > D−f} is a measure zero set for all u, vrational.

Let O be an open set so that m(O) < m(Eu,v) + ε. For each x ∈ Eu,v there isan arbitrarily small intervale [x− h, x] contained in O such that (since D− < v):

f(x)− f(x− h) < vh

This is a Vitalli covering! By the Vitali covering lemma, we choose a �nitecollection {I1, . . . , IN} of them whose interiors cover Eu,v with measure at leastm(Eu,v)− ε. Denote A = ∪Ik. Then summing over these intervals, we have:

N∑n=1

[f(xn)− f(xn − h)] ≤ v

N∑n=1

hn

≤ v (m(O) + ε)

≤ v (m(Eu,v) + ε)

Now, each point y ∈ A is the left endpoint of some interval (y, y + k)so thatf(y+ k)− f(y) > uk. Supopose WOLOG that each of these is a subset of some Ik.Using this as a Vitalli covering lemma for THIS new vitalli covering J1, . . . witheach interval being a subset of some I ′is. Have (similar to before):

M∑i=1

f(yi + ki)− f(yi) > u∑

ki

> u(m(Eu,v) + ε)

Now, SINCE f is increasing, and every interval Ji is contained in some intervalIi we have for each Jithat:∑

Ji⊂In

f(yi + ki)− f(yi) ≤ f(xn)− f(xn − hn)

so summing these up we get:

N∑n=1

[f(xn)− f(xn − h)] ≥M∑i=1

f(yi + ki)− f(yi)

But the LHS ≤ v (m(Eu,v) + ε) and the RHS > u(m(Eu,v) + ε). Since thisholds for every ε > 0 we conclude that vm(Eu,v) ≥ um(Eu,v). But since u < v thiscan only hold if m(Eu,v) = 0, as desired.

10.25. FUNCTIONS OF BOUNDED VARIATION 34

This shows that:

g(x) = limh→0

f(x+ h)− f(x)h

Is de�ned almost everywhere and that f is di�erentiable when g is �nite.To get the desired inequality on the integral of the derivative, we use Fatous

lemma and the sequence gn(x) =f(x+h)−f(x)

h

∣∣∣h= 1n

(Notice that g is non-negative

since f is increasing, so we can indeed use Fatou's lemma!) which converges point-wise to g. Have: ˆ b

a

g(x) =

ˆ b

a

lim inf gn

≤ lim inf

ˆ b

a

gn

≤ . . .

= f(b)− f(a)

�

10.25. Functions of Bounded Variation

Let f be a real-valued function de�ned on [a, b] and de�ne, for any subdivisionx0 < x1 < . . . < xn:

p =

k∑i=1

[f(xi)− f(xi−1)]+

n =

k∑i=1

[f(xi)− f(xi−1)]−

t = n+ p

=

k∑i=1

|f(xi)− f(xi−1)|

Here x+ = |x| 1x>0 and x− = |x| 1x<0. Clearly, f(b)− f(a) = p− n. Set:

P = sup p

N = supn

T = sup t

These are called the positive, negative and total variations of f . We sometimeswrite Pa,b to remind ourselves of the interval we are looking over.

Definition. We say f is bounded varaition if T <∞.

Theorem. If f is of bounded variation, then:

Ta,b = Pa,b +Na,b

and:

f(b)− f(a) = Pa,b −Na,bCorollary. If f is of bounded variation, then we can write it as di�erence of

two monotone functions

10.26. DIFFERENTIATION OF AN INTEGRAL 35

Proof. By the theorem, we can write:

f(x) = f(a) + Pa,x −Na,xPa,x and Na,x are seen to be monotone functions of x. �

Remark. The converse is also true since any monotone function is of boundedvaraition, and sums of bounded variation functions are bounded variation.

Proof. For any subdivision we have:

p = n+ f(b)− f(a)and taking suprema over all possible subdivisions we get:

P = N + f(b)− f(a)Now we can write:

t = p+ n = 2p− (f(b)− f(a))And taking superma gives:

T = 2P − (f(b)− f(a)) = P +N

�

Corollary. Every function of bounded variation is di�erentiable.

10.26. Di�erentiation of an Integral

We will show that the derivative of F (x) =´ xaf(t)dt is di�erentiable whenever

f is integrable.

Lemma. If f is integrable on [a, b] then the function F de�ned by:

F (x) =

ˆ x

a

f(t)dt

is a continuous function of bounded variation on [a, b]

Proof. The fact that its continuous follows from the �uniform integrability�property of integrals; for any ε > 0 there is a δ > 0 so that the integral over a setof size < δ is less than ε.

To show that f is bounded variation, notice that:∑|F (xi)− F (xi−1)| =

∑∣∣∣∣ˆ xi−1

xi

f(t)dt

∣∣∣∣≤

∑ˆ xi+1

xi

|f(t)| dt

=

ˆ|f(t)| dt

So taking sup, we have an upper bounder for the total varaition. �

Lemma. If f is integrable on [a, b] and:ˆ x

a

f(t)dt = 0

for all x ∈ [a, b] then f(t) = 0 a.e. in [a, b]

10.26. DIFFERENTIATION OF AN INTEGRAL 36

Proof. I think I have a creative proof using the monotone class theorem. Let:

M =

{A :

ˆf1A = 0

}We see (with a bit of work) from the hypothesis thatM contains the algebra

of �nite collections of intervals.Now M is a monotone class, since if An ↑ A or ↓ A then f1An

→ f1A a.e.and is dominated by the integrable function |f |, so by LDCT 0 =

´f1An →

´f1A

shows A ∈M. (I guess I could have done this with σ-algebra instead too)Hence M is the Borel sets. This means f is 0, becuase for example the set{

f > 1n

}must be measure 0 for each n. �

Lemma. (9) If f is bounded, |f | ≤ K and measureable on [a, b] and:

F (x) =

ˆ x

a

f(t)dt+ F (a)

Then F ′(x) = f(x) for almost all x ∈ [a, b]

Proof. Since F is of bounded variation, we know it is di�erentiable. Set fn =F (x+h)−F (x)

h

∣∣∣h= 1n

so that fn → F ′(x) a.e.. Notice also that fn = 1h

´ x+hx

f(t)dt ≤K is bounded, so we are ok to use a bounded convergence theorem. We now checkthat: ˆ

F ′dx = limn→∞

ˆfn

= limn→∞

n

ˆF (x+

1

n)− F (x)dx

= F (b)− F (a)

=

ˆ b

a

f(x)dx

So then F ′ − f integrates to zero on every interval, and consequently mustvanish everywhere. �

Theorem. The above works if f is integrable instead of boudned:

d

dx

ˆ x

a

f(t)dt = f(x)

Proof. Take fn ↑ f with each fn = f ∧ n bounded by say n. We know thatddx

´ xafn = fn(x) a.e. since these are bounded. and so we have that:

F′(x) =

d

dx

ˆf

=d

dx

ˆfn +

d

dx

ˆ(f − fn)

≥ fn(x)

Since f − fn ≥ 0. But then´ (

F′ − fn

)≥ 0 for any n and consequently´

F ′ − f ≥ 0 too (LDCT justi�es us) For any interval. On the other hand, we

showed (it was a Fatou's lemma in the end) that´F ′n ≤ F (b)−F (a) =

´ baf . So we

have cobining these inequalites that´(F ′− f) = 0 over any interval and so F ′ = f

a.e. �

10.27. ABSOLUTE CONTINUITY 37

10.27. Absolute Continuity

Definition. A real valued function f is said to be absolutly continuous on[a, b] if and only if, for all ε > 0 there is a δ > 0 such that:

n∑i=1

|x′i − xi| < δ =⇒n∑i=1

|f(x′i)− f(xi)| < ε

Remark. Compare this with the �de�nition� of absolute continuity for a mea-sure, ν � µ if for all ε > 0 there exists δ > 0 so that:

ν(A) < δ =⇒ µ(A) < ε

(The more usual way to think about this is ν(A) = 0 =⇒ µ(A) = 0, whichis equivalent.) This is how you could think of absolutly continus functions tooheuristically: the change in the function over a tiny set is tiny.

Theorem. For f non-negative integrable, the function g(x) =´ xaf(x)dx is

absolutly continuous. More precisly, for all ε > 0 there exists δ > 0 so that µ(A) <δ =⇒

Áfdx < ε

Proof. Can do this using the equivalence with absoltule continuity for measursabove, thinking of

Áf as a measure. Its not hard by bare hands either though:

take fn = f ∧ n so that fn ↑ f . By LDCT we have:ˆfn →

ˆf

Hence, for any ε > 0, there exits n large enough so that´f1{f>n} < ε/2. Now

take δ = ε/2n and we �nd that for any set A with µ(A) < δ that:ˆ

A

f =

Â∩{f≤n}

f +

Â∩{f>n}

f

≤ nµ(A) + ε/2

< ε/2 + ε/2

�

Lemma. If f is absolutly continuous on [a, b] then it is of bounded variation on[a, b]

Proof. Take ε = 1 to get a δ in the de�nition of absolute continuity. LetK = 1 + b−a

δ be the largest number of intervals of size δ one could cram into a, b.The varaition on each interval of size δ is no more than 1 (by the de�nition ofabsolutle continiuity) and so the total variation on the whole [a, b] can be no morethan K, the total number of such intervals. �

Corollary. If f is absolutly continuos then f has a derivative almost every-where.

Lemma. If f is abs. continuous on [a, b] and f ′(x) = 0 a.e. then f is constant.

Proof. (Uses vitalli covering lemma) Fix ε and η arbitrarily small. Get theδ from absolute continuity of f . The idea is as follows, for any interval (a, c) weuse the fact that |f ′(x)| < η everywhere to make a Vitalli covering of [a, c] with|f(x+ h)− f(x)| ≤ ηh on each interval [x, x + h]. Then by the Vitalli coveringlemma, extract a �nite set of intervals that cover all of [a, c] except a set of size δ.

10.27. ABSOLUTE CONTINUITY 38

Beacuse |f(x+ h)− f(x)| ≤ ηh on the intervals, the total contribution to|f(c)− f(a)| on the intervals is no more than η(c− a).

Because the δ was chosen by the abs. cont. of f at ε, the total the totalcontribution to |f(c)− f(a)| on the complement of the intervals (which is size nomore than δ) is < ε.

In sum |f(c)− f(a)| < ε + η(c − a) which can be made arbitarily small as weplease. �

Corollary. Every absolutly continuous function can be written as:

f(x) = f(a) +

ˆ x

a

f ′(x)dx

Proof. Let g(x) = f(x) −´ xaf ′(x)dx. Since inde�nite integrals are abs con-

tinuous, g is the di�ernce of absolutely continuous functions and is hence abs.continuous. Moreover, g′ = f ′ − f ′ = 0 a.e. By the previous lemma, then g is aconstant! g(a) = f(a) then gives the result. �

Lp spaces

These are notes from Chapter 15 of [1].Let (X,A, µ) be a σ−�nite meausre sapce. For 1 ≤ p <∞ de�ne the Lpnorm

of f by:

‖f‖p =(ˆ|f(x)|p dx

)1/p

For p = ∞ ‖f‖∞ is the essential supremum of f . The space Lp is the set offunctions whose Lpnorm is �nite.

11.28. Norms

Proposition. (15.1.) (Holder's Inequality) If 1 < p, q <∞ and p−1+q−1 = 1then: ˆ

|fg| dµ ≤ ‖f‖p ‖g‖q

Proof. Assume WOLOG that ‖f‖p = ‖g‖q = 1 and then integratF (x)G(x) ≤F (x)p

p + G(x)q

q to get the result. �

Proposition. (Minkowski's Inequality) If 1 ≤ p ≤ ∞ then:

‖f + g‖p ≤ ‖f‖p + ‖g‖p

Proof. Write:

|f + g|p = |f + g| |f + g|p−1

≤ |f | |f + g|p−1 + |g| |f + g|p−1

Now apply Holder's inequality with p and q =(1− 1

p

)−1to get:

ˆ|f + g|p ≤

ˆ|f | |f + g|p−1 +

ˆ|g| |f + g|p−1

≤ ‖f‖p

(ˆ|f + g|(p−1)q

)1/q

+ ‖g‖p

(ˆ|f + g|(p−1)q

)1/q

This gets us to:

‖f + g‖pp ≤(‖f‖p + ‖g‖p

)‖f + g‖p/qp

And dividing out gives the resutl. �

39

11.29. COMPLETENESS 40

11.29. Completeness

Theorem. (15.4.) If 1 ≤ p ≤ ∞ then Lp is complete.

Proof. Say fn is a Cauchy sequence. Look at the subsequence nk so that∥∥fnk− fnk+1

∥∥ ≤ 2−k. We will show that∑m

(fnm − fnm−1

)converges absolutely

and this will be our limit point for the Cauchy sequence. Let gk =∑ki=1

∣∣fnm− fnm−1

∣∣ByMinkowski we have:

‖gj‖p ≤k∑i=1

∥∥fnm− fnm−1

∥∥p

≤j∑

m=1

2−m < 1

Let g be the pointwise limit g = limn→∞ gn. By Fatou, we have´|g(x)|p ≤

limm→∞ ‖gj‖pp ≤ 1. Hence g must be �nite a.e.

Now have f(x) = limK→∞∑Km=1

(fnm− fnm−1

)(x) = limK→∞ fnK

(x) and byFatou we have:

‖f − fnk‖pp =

ˆ ∣∣f − fnj

∣∣p ≤ lim infK→∞

ˆ ∣∣fnK− fnj

∣∣p ≤∑∥∥fnk+1− fnk

∥∥pp≤ 2(−j+1)p → 0

So fnkl→ f in Lp as desired. �

Proof. To prove a space is complete, it su�ces to prove that every absolutlyconvergent sequence is convergent. Suppose hn is a sequence with

∑‖hn‖Lp <∞.

We want to de�ne f(x) =∑∞n=1 hn(x) and show that this is �nite a.e. and is the

Lp limit of∑∞n=1 hn(x).

Claim 1: The pointwise limit f(x) = limN→∞∑Nn=1 hn(x) exists and is �nite

a.e.Pf: Let gN (x) =

∑Nn=1 |hn(x)| and let g(x) = supN gN (x) = limN→∞ gN (x).

By Fatou's lemm:

ˆ(g(x))

pdx =

ˆlimN→∞

(gN (x))pdx

≤ lim infN→∞

ˆ(gN (x))

pdx

≤ lim infN→∞

N∑n=1

‖hn‖pp by Minkowski inequality

< ∞ since hn is absolutly summable

Hence g(x) is �nite a.e.. Hence for a.e. x the sum f(x) =∑∞n=1 hn(x) converges

absolutly and is �nite.

Claim 2: f(x) = limN→∞∑Nn=1 hn(x) in L

p

11.31. BOUNDED LINEAR FUNCTIONALS 41

Pf: De�ne fN (x) =∑Nn=1 hn(x) so fN → f pointwise for a.e. x. Now again by

Fatou: ˆ|f − fN |p dx =

ˆlim |fM − fN |p dx

≤ lim infM→∞

ˆ|fM − fN |p dx

= lim infM→∞

ˆ ∣∣∣∣∣M∑

k=N+1

hk(x)

∣∣∣∣∣p

dx

≤ lim infM→∞

M∑n=N+1

‖hn‖pp

≤M∑

n=N+1

‖hn‖pp

→ 0 as N →∞

�

Proposition. The set of continuous functions with compact support is densein Lp(R)

Proof. Compact sets is clear since´ ∣∣f − f1[−n,n]∣∣ → 0 by LDCT. To do

continuous function, approximate f by simple functions. Approximate each mea-surable set by a closed set. Then approximate the characteristic function of a closedset by a continuous function. �

11.30. Convolutions

The convolution of two measurable functions is de�ned by:

f ∗ g =

ˆf(x− y)g(y)dy

Proposition. (15.7.) If f, g ∈ L1 then f ∗ g is in L1 and:

‖f ∗ g‖1 ≤ ‖f‖1 ‖g‖1Proof. Write:

‖f ∗ g‖1 ≤ˆ ˆ

|f(x− y)| dx |g(y)| dy

=

ˆ ˆ|f(x)| dx |g(y)| dy

= ‖f‖1 ‖g‖1�

11.31. Bounded Linear Functionals

Theorem. (15.8) For 1 < p <∞ and p−1 + q−1 = 1 we have:

‖f‖p = sup

{ˆfgdµ : ‖g‖q ≤ 1

}

11.31. BOUNDED LINEAR FUNCTIONALS 42

Proof. ≥ follows by the Holder inequality. To see the other way choose some-

thing like g = (sgn(f(x)) |f(x)|p−1

‖f‖p/qp

. �

Proposition. Actually it su�ces to consider only simple functions:

‖f‖p = sup

{ˆfgdµ : ‖g‖q ≤ 1, g is a simple function

}Proof. Approximate f by simple functions and put again g = (sgn(f(x)) |sn(x)|

p−1

‖sn‖p/qp

�

Proposition. (15.10) If 1 < p < ∞ with p−1 + q−1 = 1 and g ∈ Lq thenintegration against g de�nes a bounded linear functional on Lp with ‖H‖ = ‖g‖q

Proof. By Holder H is bounded and by the above prop ‖H‖ = ‖g‖q. �

Theorem. (15.11) If 1 < p < ∞ and H is a real valued bounded linear func-tional on Lp then there exists g ∈ Lq such that H(f) =

´fg and ‖g‖q = ‖H‖

Proof. The idea is to de�ne the measure ν(A) = H(1A) then show that νis a measure with ν � µ. The function we want is g = dν

dµ . Once we have estab-

lished thatH(1A) =´gdµ it is easy to see that for simple functions we haveH(s) =´

sgdµ. Bu then ‖g‖q = sup(´gs : ‖s‖ ≤ 1, s is simple

)= sup (H(s) : ‖s‖ ≤ 1, s is simple) ≤

‖H‖. Hence g ∈ Lq. Finally then, by approximating any function f by simple func-tions one can check H(sn)→ H(f) which shows that H(f) =

´fg. �

Fourier Transforms


12.32. Basic Properties

For f ∈ L1(Rn) de�ne:

f(u) =

ˆ

Rn

eiu·xf(x)dx

Proposition. If f, g in L1 then:

i) f is bounded and continuousii)· is linear

Proof. Write:

f(u+ h)− f(u) =

ˆ (ei(u+h)·x − eiu·x

)f(x)dx∣∣∣f(u+ h)− f(u)

∣∣∣ ≤ ˆ ∣∣eiu·x∣∣ ∣∣eih·x − 1∣∣ |f(x)| dx

which is bounded by 2 |f(x)|. Since eix·h− 1→ 0 as h→ 0 the integral → 0 bythe LDCT. �

12.32.1. Derivatives.

Proposition. (16.2.) If f ∈ L1 and xf ∈ L1 then f is di�erentiable with:

f ′ = i

ˆeiu·xxf(x)dx

Remark. In Rn the condition is that xjf ∈ L1 =⇒ f has a partial ujderiavive.

Proof. Write:

f(u+ h)− f(u)h

=

ˆeiu·x

(eihx − 1

h

)f(x)dx

And since xf(x) is integrable, we use∣∣∣ eihx−1

h

∣∣∣ ≤ |x| then apply LDCT and get

the result. �

Proposition. If f ∈ L1, f is absolutly continuous and f ′ ∈ L1 then the

Fourier transform of f ′ is −iuf(u).\

43

12.33. THE INVERSION THEOREM 44

Proof. The trick is to do integration by parts:

f ′ =

∞

−∞

eiuxf ′(x)dx =

îueiuxf(x)dx+ boundaryterms

The fact that f ′is in L1 can be used to help show that f → 0 as x→ ±∞ thatthe boundary terms die. �

Proposition. If f, g ∈ L1 then f ∗ g = f g

Proof. Write it out and use ‖f ∗ g‖L1 ≤ ‖f‖L1 ‖g‖L1 �

12.33. The Inversion theorem

Proposition. Let Ha : R → R be the pdf of a Gaussian with mean 0 andvariance a2 then:

Ha(x) =1

a√2πe−x

2/2a2

Then, Ha is the pdf of a Gaussian with mean 0 and variance 1/a2 and NOPRECONSTANTS:

Ha(u) = e−a2u2/2

Moreover, Ha is an approximate identity as a→ 0, i.e. we have:

sup ‖Ha‖L1 < ∞ˆHa = 1

ˆ

|x|>δ

Ha → 0 as a→ 0 for any �xed δ > 0

Consequenctly we have:

Ha ∗ f‖·‖∞−−−→ f for all f ∈ C∞(R)

Ha ∗ f‖·‖p−−−→ f for all f ∈ Lp(R)

Also notice that Ha(u)→ 1 as a→ 0.

Theorem. If f and f are in L1 then:

f(y) =1

2π

ê−iu·y f(u)du converges a.e.

Proof. The trick is to hit things with Ha, then unload the integration ontoH, which we know is its own Fourier transform essentially.

Write:1

2π

ê−iu·y f(u)du = lim

a→0

ê−iu·y f(u)Ha(u)du =

ˆ êiuxe−iu·y f(u)Ha(u)dxdu

= C

ˆHa(x− y)f(x)dx

= CHa ∗ f→L1

f

�

Fourier Series

These are notes from Chapter 1 of [2]. These are rather informal and do notfollow very closely with the textbook.

Definition. For a function f ∈ L1(T) we de�ne its n-th Fourier coe�cient by:

f(n) =1

2π

ˆf(t)e−intdt

And tis Fourier series is the formal trigonmentric series:

S[f ] =

∞∑n=−∞

f(n)eint

Proposition. Have: ∣∣∣f(n)∣∣∣ ≤ ‖f‖L1

Proof. Holds since∣∣eint∣∣ = 1 �

Definition. A summability kernal is a seqeunce {kn} of 2π periodic functionssuch that:

1

2π

ˆkn(t)dt = 1

1

2π

ˆ|kn(t)| dt = ‖kn‖L1 ≤ C for some �xed C and for all n

limn→∞

2π−δˆ

δ

|kn(t)| dt = 0 for every �xed δ

Lemma. If k is a summability kernal then for continuous ϕ ∈ C (T) we have::

limn→∞

1

2π

ˆkn(s)ϕ(s)ds = ϕ(0)

Proof. Given any ε > 0, �nd δ so small by the continuity of ϕ so that|ϕ(x)− ϕ(0)| < ε whenever |x| < δ. Then:

45

FOURIER SERIES 46

∣∣∣∣ 12πˆkn(s)ϕ(s)ds− ϕ(0)

∣∣∣∣ =

∣∣∣∣ 12πˆkn(s) (ϕ(s)− ϕ(0)) ds

∣∣∣∣≤ 1

2π

2π−δˆ

δ

|kn(s)| |ϕ(s)− ϕ(0)| ds+1

2π

δˆ

−δ

|kn(s)| |ϕ(s)− ϕ(0)| ds

≤ 2 ‖ϕ‖∞

1

2π

2π−δˆ

δ

|kn(s)|

+ ε1

2π

δˆ

−δ

|kn(s)| ds

→ 0 + εC

�

Proposition. If f ∈ L1(T), and kn a summability kernal, then we have:

kn ∗ fL1

−−→ f

Proof. We �rst prove it for f continuous. If f is continuous, then for any ε�nd δ so small so that |f(x)− f(y)| < ε for |x− y| < δ. Consider:∥∥∥∥ 1

2π

ˆkn(s)f(· − s)ds− f(·)

∥∥∥∥L1

=1

(2π)2

ˆ ∣∣∣∣ˆ kn(s) (f(t− s)− f(t)) ds∣∣∣∣ dt

=1

(2π)2

ˆ ∣∣∣∣∣∣ δˆ

−δ

kn(s) (f(t− s)− f(t)) ds+2π−δˆ

δ

kn(s) (f(t− s)− f(t)) ds

∣∣∣∣∣∣ dt≤ 1

(2π)2

ˆ ∣∣∣∣∣∣‖kn‖L1 ε+ 2 ‖f‖∞

2π−δˆ

δ

kn(s)ds

∣∣∣∣∣∣ dt→ 0 + ε sup

n‖kn‖L1

This proves it for continuous f . For f ∈ L1, �nd g continuous with ‖f − g‖L1 <ε and consider:

‖kn ∗ f − f‖L1 ≤ ‖kn ∗ f − kn ∗ g‖L1 + ‖kn ∗ g − g‖L1 + ‖g − f‖L1

≤(supn‖kn‖L1

)‖f − g‖L1 + ‖g − f‖L1 + ‖kn ∗ g − g‖L1

�

Definition. The Fejer Kernal is the summability kernal:

Kn(t) =1

n+ 1

n∑j=0

j∑k=−j

eikt

=

n∑j=−n

(1− |j|

n+ 1

)eijt

Proposition. Have the identity:

Kn(t) =1

n+ 1

(sin n+1

2

sin 12 t

)2

FOURIER SERIES 47

Proof. Use the identity:

cos(t) = cos

(2

(t

2

))= cos

(1

2t

)2

− sin

(1

2t

)2

= 1− 2 sin

(1

2t

)2

So:

sin

(1

2t

)2

=1

2(1− cos(t)) = −1

4e−it +

1

2− 1

4eit

Then check that:(−1

4e−it +

1

2− 1

4eit) n∑j=−n

(1− |j|

n+ 1

)eijt

turns it into a telescoping sum. �

Definition. We de�ne:

σn(f)(t) = (Kn ∗ f) (t) =n∑

k=−n

(1− |j|

n+ 1

)f(j)eijt

Remark. Since Kn is a summability kernal, we know that:

σn(f)ptwise−−−−→ f for f ∈ C (T)

σn(f)L1

−−→ f for f ∈ L1(T)The pointwise convergence is actually uniform (go back to our argument we

used) and the convergence in L1 can be moved up to Lp convergence with thesame method of proof as before (just approximate the Lp function by a contunousfunction)

σn(f)‖·‖∞−−−→ f for f ∈ C(T)

σn(f)Lp

−−→ f for f ∈ Lp(T)

Remark. (This is kind of o� topic...) Using the same kind of naive-ish esti-mates as above, one can show that the Fourier series converge in Cα, the space ofHolder continuous functions:

Sn(f)‖·‖∞−−−→ f for f ∈ Cα(T)

13.33.1. Corollaries to Fejer Convergence.

Theorem. (Fejer Convergence Thm)

σn(f)‖·‖∞−−−→ f for f ∈ C(T)

σn(f)Lp

−−→ f for f ∈ Lp(T)

Corollary. Trig. polynomials are dense in C(T), Lp(T) for 1 ≤ p <∞

Proof. Each σn(f) is a trig. polynomial. �

13.34. MORE FOURIER FACTS 48

Corollary. The functions{eint

}∞n=−∞ form a maximal orthonormal set (i.e.

an orthonormal set) for L2(T).

Proof. Its easy to check they are orthonormal. To see they are maximal,suppose by contradiction that f ⊥ eint for all n. Find g a trig poly. so that‖f − g‖L2 < ε. Then we have that:

‖f‖2 = 〈f, f〉 = 〈f, f − g〉+ 〈f, g〉 ≤ ‖f‖ ‖f − g‖+ 0 ≤ ε ‖f‖

So we get ‖f‖ = 0. �

Corollary. We have:

‖f‖22 =∑n∈Z

∣∣∣f(n)∣∣∣2f = lim

n→∞

n∑i=−n

f(n)eint = limn→∞

n∑i=−n

⟨f, eint

⟩eint

ˆf(x)g(x)dx = 〈f, g〉L2 =

∑n∈Z

⟨f, eint

⟩ ⟨eint, g

⟩=∑n∈Z

f(n)g(n)

Proof. These follow by basic Hilbert space theory and are equivalent to thefact that eint are a basis. The �rst one is Plancharel's identity, the second one isRiesz-Fisher, and the last one is Parseval's identity. �

Corollary. (Uniquenss Theorem) If f(n) = 0 for all n and f ∈ L1(T) thenf ≡ 0

Proof. σn(f) = 0 for every f and σn(f)→ f �

Corollary. (Riemann-Lebesgue) If f ∈ L1(T) then f(n)→ 0 as n→ ±∞

Proof. Given any ε > 0, choose N so large so that ‖σNf − f‖L1 < ε. Then

for any n > N , we have that σn(f) = 0 and so:b vb∣∣∣f(n)∣∣∣ = ∣∣∣ σNf(n)− f(n)∣∣∣ ≤ ‖σNf − f‖L1 < ε

�

Remark. As we have seen in Hang's harmonic class, given any sequence an → 0there is an L1 function whose fourier seires go to 0 slower than an. (You canconstruct it as long as its convex)

13.34. More Fourier Facts

Proposition. Fejer's Theorem: For continuous function the Fejer kernal Fn(x) =1n

∑n−1k=0 Dk(x) where Dk is the Dirichlet kernal Dk(x) =

∑ks=−k e

isx.(This can be

written as Fn(x) = 1n

1−cos(nx)1−cos(x) ) has Fn ∗ f → f uniformly in [−π, π] if f is

continuous.

13.34. MORE FOURIER FACTS 49

Proof. Write

(Fn ∗ f − f) (y) =1

2π

π

−π

Fn(x)f(y − x)dx− f(y)

=1

2π

π

−π

Fn(x) [f(y − x)− f(y)] dx since Fejer kernal integrates to 1

=1

2π

π

−π

Fn(x) [fy−·(x)− fy−·(0)] dx

Where fy−·(z) := f(y−z). Now divide the region of integration into two zones,one where x is small and one where x is large. When x is small, the continuity off can be used to control it. When x large, the fact that the the Fejer kernal has´|x|>δ |Fn| → 0. �

Proposition. If f ∈ L2 then the Fourier series approx for f converges in theL2 sense Dn ∗ f → f in L2

Proof. This amounts to showing that the trigonometric polynomials einx forma BASIS for the Hilbert space L2. Indeed, we �rst remark that these are dense inL2 (by Stone-Weirestrass or by the Fejer kernal we know that these are dense inthe continuous functions....and the continuous functions are dense in L2). Next,suppose by contradiction that einx is not a basis for L2. Then there exists anf ∈ L2 perp to all einx. But then �nd a sequence of trigonometric polynomialsfn → f in L2. We have then:

‖f‖2 = 〈f, f〉= 〈f − fn, f〉 since 〈fn, f〉 = 0

≤ ‖f − fn‖ ‖f‖

→ 0 since fnL2

−−→ f

�

Proposition. The fourier series of a continuous function does not nessisarilyconverge pointwise.

Proof. (Using the Banach Steinhouse principle). Notice that the Dirichletkernal Dn has ‖Dn‖1 ∼ log n → ∞. If Dn ∗ f → f pointwise, then we wouldhave that {x : supn |Dn ∗ f(x)| < ∞} = [−π, π]. By the uniform boundenessprinciple, we would have that supn ‖Dn ∗ f‖∞ is uniformly bounded! However,‖Dn ∗ f‖∞ = ‖f‖∞ ‖Dn‖1 →∞ so this is a contradiction. �

Bibliography

[1] R.F. Bass. Real Analysis for Graduate Students. CreateSpace Independent Publishing Plat-form, 2013.

[2] Y. Katznelson. An Introduction to Harmonic Analysis. Cambridge Mathematical Library.Cambridge University Press, 2004.

[3] H.L. Royden. Real Analysis. Mathematics and statistics. Macmillan, 1988.

50

real analysis oral exam study notes notes transcribed by ...nica/oral/real_notes.pdf · real...

Documents