from hyperbolic systems to kinetic theory: a personalized quest

294
Lecture Notes of 6 the Unione Matematica Italiana

Upload: others

Post on 11-Sep-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: From hyperbolic systems to kinetic theory: a personalized quest

Lecture Notes of 6the Unione Matematica Italiana

Page 2: From hyperbolic systems to kinetic theory: a personalized quest

Editorial Board

Franco Brezzi (Editor in Chief)Dipartimento di MatematicaUniversita di PaviaVia Ferrata I27100 Pavia, Italye-mail: [email protected]

John M. BallMathematical Institute24-29 St Giles’Oxford OX1 3LBUnited Kingdome-mail: [email protected]

Alberto BressanDepartment of MathematicsPenn State UniversityUniversity ParkState CollegePA 16802, USAe-mail: [email protected]

Fabrizio CataneseMathematisches InstitutUniversitatstraße 3095447 Bayreuth, Germanye-mail: [email protected]

Carlo CercignaniDipartimento di MatematicaPolitecnico di MilanoPiazza Leonardo da Vinci 3220133 Milano, Italye-mail: [email protected]

Corrado De ConciniDipartimento di MatematicaUniversità di Roma “La Sapienza”Piazzale Aldo Moro 200133 Roma, Italye-mail: [email protected]

Persi DiaconisDepartment of StatisticsStanford UniversityStanford, CA 94305-4065, USAe-mail: [email protected],

[email protected]

Nicola FuscoDipartimento di Matematica e ApplicazioniUniversità di Napoli “Federico II”, via CintiaComplesso Universitario di Monte S. Angelo80126 Napoli, Italye-mail: [email protected]

Carlos E. KenigDepartment of MathematicsUniversity of Chicago1118 E 58th Street, University AvenueChicago IL 60637, USAe-mail: [email protected]

Fulvio RicciScuola Normale Superiore di PisaPlazza dei Cavalieri 756126 Pisa, Italye-mail: [email protected]

Gerard Van der GeerKorteweg-de Vries InstituutUniversiteit van AmsterdamPlantage Muidergracht 241018 TV Amsterdam, The Netherlandse-mail: [email protected]

Cédric VillaniEcole Normale Supérieure de Lyon46, allée d’Italie69364 Lyon Cedex 07Francee-mail: [email protected]

The Editorial Policy can be found at the back of the volume.

Page 3: From hyperbolic systems to kinetic theory: a personalized quest

Luc Tartar

From Hyperbolic Systemsto Kinetic TheoryA Personalized Quest

ABC

Page 4: From hyperbolic systems to kinetic theory: a personalized quest

Luc TartarDepartment of Mathematical SciencesCarnegie Mellon UniversityPittsburgh, PA [email protected]

ISBN 978-3-540-77561-4 e-ISBN 978-3-540-77562-1

DOI 10.1007/978-3-540-77562-1

Lecture Notes of the Unione Matematica Italiana ISSN print edition: 1862-9113ISSN electronic edition: 1862-9121

Library of Congress Control Number: 2007942545

Mathematics Subject Classification (2000): 35K05, 35L45, 35L60, 35L65, 35L67, 35Q30, 70F45, 76A02,76N15, 76P05, 82C22, 82C40

c© 2008 Springer-Verlag Berlin Heidelberg

This work is subject to copyright. All rights are reserved, whether the whole or part of the material isconcerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting,reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publicationor parts thereof is permitted only under the provisions of the German Copyright Law of September 9,1965, in its current version, and permission for use must always be obtained from Springer. Violations areliable to prosecution under the German Copyright Law.

The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply,even in the absence of a specific statement, that such names are exempt from the relevant protective lawsand regulations and therefore free for general use.

Cover design: WMXDesign GmbH

Printed on acid-free paper

9 8 7 6 5 4 3 2 1

springer.com

Page 5: From hyperbolic systems to kinetic theory: a personalized quest

Dedicated to Robert DAUTRAY

He helped me at a critical time, when I could no longer bear the rejection in theacademic world (partly for having refused the current methods of falsifications,and partly because I was too interested in science for a mathematician), andhe also guided me in my readings while I worked at Commissariat a l’EnergieAtomique, so that I did not get lost like many other mathematicians in thejungle of models which physicists have generated, and I could understand whatmathematical tools should be developed for helping understand in a betterway how nature works.

to Peter LAX

He gave an example of how a good mathematician can work, by putting someorder in a corner of the physical world where the preceding knowledge wasmade up of a few examples and too many guesses. Why have there been sofew mathematicians who wanted to follow his example?

to Lucia

to my children, Laure, Michael, Andre, Marta

and to my grandson, Lilian

Page 6: From hyperbolic systems to kinetic theory: a personalized quest

Preface

After publishing An Introduction to Navier–Stokes Equation and Oceanogra-phy [20],1,2 and An Introduction to Sobolev Spaces and Interpolation Spaces[21],3 the revised versions of my lecture notes for graduate courses that I hadtaught in the spring of 1999 and in the spring of 2000, I want to follow withanother set of lecture notes for a graduate course that I had taught in the fallof 2001, with the title “Introduction to kinetic theory”. For this one, therehad been no version available on the Internet, and I had not even written thenotes for the last four lectures, and after a few years, I find it useful to makethe text available to a larger audience by publishing a revised and completedversion, but I had to change the title in a significant way.

In [21], I had written that my reasons for publishing lecture notes is totell the readers some of what I have understood, the technical mathematicalaspects of the course, the scientific questions behind the theories, and more,and I shall have succeeded if many become aware, and go forward on the pathof discovery, not mistaking research and development, knowing when and whythey do one or the other, and keeping a higher goal in mind when for practicalreasons they decide to obey the motto of the age for a while, publish or perish.

In the fall of 2001, I had done precisely that, and I had taught the math-ematical results that I had proven during my quest for understanding about1 Claude Louis Marie Henri NAVIER, French mathematician, 1785–1836. He had

worked in Paris, France. He introduced the equation now known as the Navier–Stokes equation in 1821, although he did not understand about shear stress.

2 Sir George Gabriel STOKES, Irish-born mathematician, 1819–1903. He had workedin London, and in Cambridge, England, holding the Lucasian chair (1849–1903).

3 Sergei L’vovich SOBOLEV, Russian mathematician, 1908–1989. He had worked inLeningrad, in Moscow, and in Novosibirsk, Russia. There is now a Sobolev Insti-tute of Mathematics of the Siberian branch of the Russian Academy of Sciences,Novosibirsk, Russia. I first met Sergei SOBOLEV when I was a student, in Paris in1969, and conversed with him in French, which he spoke perfectly (all educatedEuropeans at the beginning of the 20th century learned French).

Page 7: From hyperbolic systems to kinetic theory: a personalized quest

VIII Preface

kinetic theory, which I had started in the early 1970s, but I had also taughtabout what is wrong with kinetic theory, which I had started to understand inthe early 1980s, and I had tried to teach a little about continuum mechanicsand physics with the critical mind of a mathematician, so that the studentscould understand what were the results of my detective work on this particu-lar question of kinetic theory, and understand how to attack other questionsof continuum mechanics or physics by themselves later (having in mind thedefects that have already been found on each question, by me or by others).

In [21], I had suggested to the readers who already know something aboutcontinuum mechanics or physics to look at my lecture notes, to read about thedefects which I know about in classical models, because other authors rarelymention these defects even though they have heard about them. This set oflecture notes, written with a concern towards kinetic theory, is of this type.I had suggested to the readers who do not yet know much about continuummechanics or physics, to start with more classical descriptions about the prob-lems, for example by consulting the books which have been prepared underthe direction of Robert DAUTRAY,4 and of Jacques-Louis LIONS,5 whom hehad convinced to help him, [5]–[13].

I have mentioned that my personal point of view, which is that one shouldnot follow the path of the majority when reason clearly points to a differentdirection, probably owes a lot to having been raised as the son of a (Calvinist)Protestant minister,6 but I had lost the faith when I was twelve or thirteenyears old, and I may not have explained well why I later found myself forcedto practice the art of the detective in deciding what had to be discarded fromwhat I could reasonably trust until some new information became available.Becoming a mathematician had been one of the reasons, because mathemati-cians must know what is proven and what is only conjectured, and when laterI became interested in understanding continuum mechanics and physics froma mathematical point of view, I found that the analysis that must be donein organizing the information, as well as the misinformation that “scientists”transmit about the real world, is quite similar to the analysis that must bedone in organizing the information and misinformation that various religious4 Ignace Robert DAUTRAY (KOUCHELEVITZ), French physicist, born in 1928.5 Jacques-Louis LIONS, French mathematician, 1928–2001. He received the Japan

Prize in 1991. He had worked in Nancy and in Paris, France; he held a chair(analyse mathematique des systemes et de leur controle, 1973–1998) at Collegede France, Paris. The laboratory dedicated to functional analysis and numericalanalysis which he initiated, funded by CNRS (Centre National de la RechercheScientifique) and Universite Paris VI (Pierre et Marie Curie), is now named afterhim, the Laboratoire Jacques-Louis Lions. I first had Jacques-Louis LIONS as ateacher at Ecole Polytechnique in Paris in 1966–1967, and I did research underhis direction, until my thesis in 1971.

6 Jean CALVIN (CAUVIN), French-born theologian, 1509–1564. He had worked inParis and in Strasbourg, France, in Basel and in Geneve (Geneva), Switzerland.

Page 8: From hyperbolic systems to kinetic theory: a personalized quest

Preface IX

traditions transmit, and in both these approaches, one can observe the per-verse influence of political factors.

The particular difficulty that I had encountered myself around 1980 was re-lated to the political perversion of the French academic system itself, becauseI found myself facing an unimaginable situation of forgeries, organized by a“mathematician” and continued by a “physicist”, which turned into a night-mare when I was repeatedly confronted with the racist behaviour of those whoinsisted that it was normal that I should not have the same rights as others.7

Fortunately, Robert DAUTRAY provided me with a new job outside thisstrange “academic” world,8 and I was extremely grateful to him for that, asit contrasted a lot with the rejection that I was feeling in the mathematicalworld, including the strange opposition of my mentors, Laurent SCHWARTZ

and Jacques-Louis LIONS,9 who had chosen the side of the forgers againstme, probably because they had some different, wrong information. However,I am even more grateful to Robert DAUTRAY for something that very fewpeople could have provided me, as my understanding of physics could nothave improved in the way it did without his help, which was mostly throughtelling me what to read, and it is natural that I should dedicate this set oflecture notes to him, although he may not agree entirely with my personalanalysis on the subject of kinetic theory.

My new job, or more precisely what I had understood about what I hadto do, had been both simple and impossible, to understand physics in a betterway, through a mathematical approach, of course. I felt that Robert DAUTRAY

understood that physics had reached a few dead ends, where physicists werehitting some walls which had been created before them, by other physicistswho had invented the wrong games for understanding how nature works. Itshould not have been too critical, as it is natural that guessing produces a fewanswers that are not completely right, although they may not be completelywrong, and using the art of the engineer one can make things work even thoughone does not have the correct equations for describing the processes that onewants to tame, but this approach in science has its limitations. In order to goforward, one needs to apply a scientific approach, and practice the art of thedetective to discover what has been done wrong, and then one needs to doit in a better way, ideally in the right way, if that is possible. I thought thatRobert DAUTRAY was not only aware of that, but that he saw that some of7 This happened in one of the campuses of University Paris XI (Paris Sud), Orsay,

France, from 1979 to 1982.8 I worked at CEA (Commissariat a l’Energie Atomique) in Limeil, France, from

1982 to 1987.9 Laurent SCHWARTZ, French mathematician, 1915–2002. He received the Fields

Medal in 1950. He had worked in Nancy, in Paris, France, at Ecole Polytechnique,which was first in Paris (when I had him as a teacher in 1965–1966), and then inPalaiseau, and at Universite Paris 7 (Denis Diderot), Paris.

Page 9: From hyperbolic systems to kinetic theory: a personalized quest

X Preface

this work of providing more order must be done by mathematicians, at leastwell-trained mathematicians.

The job of a detective is certainly made quite difficult if he/she is forbiddento ask questions to important witnesses, or if he/she realizes that there is a wallof silence and that there is information that could be useful for his/her searchwhich some powerful group does not want him/her to discover. That type ofdifficulty exists in physics, as well as in other sciences, including mathematics.At the beginning, some guessed rule had been successful in one situation,and although it was dangerous to apply a similar guess indiscriminately forall kinds of problems, it had been done, but what made this practice quiteunfortunate was then to create a dogma, and to teach it to new generations ofstudents. Because no hints were given that some of these rules could be slightlywrong, or even completely misleading, these physicists were not really trainedas scientists, and it is not surprising that many of them ended up working likeengineers, mistaking physics and technology, and not caring much for the factthat some of the currently taught “laws of physics” are obviously wrong: theyare simply the laws that physicists have guessed in their quest about the lawsthat nature follows, and it would have been surprising that their first guesshad been right.

Before 1982, I had mostly thought about questions concerning contin-uum mechanics, developing homogenization and the compensated compact-ness method, partly with Francois MURAT,10 but I had also understood aquestion of the appearance of nonlocal effects by homogenization of some hy-perbolic equations, and I thought that this was a more rational explanationthan the strange games of spontaneous absorption and emission that physi-cists had invented, so that their probabilistic games were just one possibleapproach to describing the correct effective equations, confirming what I hadalready discovered before, that probabilities are introduced by physicists whenthey face a situation that they do not understand, so that it should be pointedout how crucial it is to introduce probabilities as late as possible in the analy-sis of a problem, ideally not at all if possible, but certainly further and furtheraway from one generation to the next. However, up to 1982, I did not see howto include quantum mechanics and statistical mechanics in my approach tothe partial differential equations of continuum mechanics and physics.

After 1982, the first step was relatively easy, and in reading what RobertDAUTRAY had told me I identified a few points which are certainly wrong inthe laws that physicists use; however, making them right seemed to require thedevelopment of new mathematical tools. The tool of H-measures [18], whichI started describing at the end of 1986, was something that I had alreadyguessed two years before, but its extension to semi-linear hyperbolic systems10 Francois MURAT, French mathematician, born in 1947. He works at CNRS (Cen-

tre National de la Recherche Scientifique) and Universite Paris VI (Pierre et MarieCurie), Paris, France.

Page 10: From hyperbolic systems to kinetic theory: a personalized quest

Preface XI

has eluded me since, and I see that extension as necessary to explain someof the strange rules about quantum mechanics, and then derive better rulesthan those of statistical mechanics.

At the end of 1983, a year before the first hint about new mathematicaltools, I already “knew” what is wrong with kinetic theory, which is the subjectof this set of lecture notes, as a consequence of having “understood” whatis wrong with quantum mechanics. As I am a mathematician, I use quotesbecause I want to emphasize that it was not yet mathematical knowledge,and it was not about a precise conjecture either because I could not formulateone at the time, but I had acquired the certitude that some aspects of whatthe physicists say will not appear in the new mathematical framework that Iwas searching for.

The main mistake of physicists had been to stick to 18th century ideasof classical mechanics, instead of observing that if the 19th century ideasabout continuum mechanics are inadequate for explaining what is observedat a microscopic level, it is because one needs new mathematical tools for20th century mechanics/physics (turbulence, plasticity, atomic physics), whichhave no probability in them, of course, as the use of probabilities is the signthat one does not understand what is going on. It had been a mistake toconcentrate too much effort on problems of partial differential equations whichshow finite-dimensional effects, for which 18th century mechanics is adapted,instead of observing that the more interesting problems of partial differentialequations all show infinite-dimensional effects, which cannot be grasped with18th/19th century ideas; actually, my subject of research since the early 1970shad been precisely focused on studying the effect of microstructures in partialdifferential equations, a subject which I have decided to describe as beyondpartial differential equations. The certitude that mathematics brings is thatthere are absolutely no particles at atomic level, there are only waves, so thatthere cannot be any particles interacting in the way that had been assumedby MAXWELL,11 and by BOLTZMANN.12

Nevertheless, one should be careful not to disparage MAXWELL andBOLTZMANN for the fact that their pioneering work in kinetic theory hassome defects, because they had shown a good physical intuition for the wayto correct an important defect of continuum mechanics, which is that theconstitutive relations used are wrong, because they result from the inexactpostulate that the relations valid at equilibrium are true at all times.

That there are no particles and that they are waves could have been un-derstood earlier, as a consequence of an observation of POINCARE in his study11 James CLERK MAXWELL, Scottish physicist, 1831–1879. He had worked in

Aberdeen, Scotland, in London and in Cambridge, England, holding the firstCavendish professorship of physics (1871–1879).

12 Ludwig BOLTZMANN, Austrian physicist, 1844–1906. He had worked in Graz andVienna, Austria, in Leipzig, Germany, and then again in Vienna.

Page 11: From hyperbolic systems to kinetic theory: a personalized quest

XII Preface

of relativity,13 that instantaneous forces at a distance do not make any sense,which EINSTEIN after him had probably not understood so well,14 and that“particles” feel a field that transmits the interactions as waves, but POINCARE

had died many years before the wave nature of “particles” was confirmed byan observation of L. DE BROGLIE in his study of “electrons”,15 that theyare waves. Unfortunately, the idea that there are only waves and no parti-cles was then completely messed up in the following development of quantummechanics, which led to that strange dogmatic discipline where “nonexistentparticles” are assumed to play “esoteric probabilistic games”.

At the end of 1983, I had then “understood” that there are absolutely noparticles at a microscopic level, so that real gases are not made of particles,and I understood it in a mathematical way in the late 1980s, by introducingH-measures [18], which are related to oscillations and concentration effectsin weakly converging sequences, and then by proving transport equations forthem when one considers sequences of solutions of particular linear hyperbolicsystems. Better mathematical results are still needed in order to understandthe case of semi-linear hyperbolic systems, which I believe is the mathematicalproblem to study to explain all the strange effects which are observed at amicroscopic level.

Although MAXWELL and BOLTZMANN had done quite a good job in postu-lating their equations for kinetic theory, because it is not yet clear a centuryand a half after them how to write the equations correctly, it is useful todescribe some defects in their work to show some limitations of kinetic the-ory, in the same way that one shows the limitations of classical mechanicsby pointing out that NEWTON’s work was unchallenged for two centuries,16

until relativity was introduced by POINCARE, and then EINSTEIN, so that one13 Jules Henri POINCARE, French mathematician, 1854–1912. He had worked in

Paris, France. There is now an Institut Henri Poincare (IHP), dedicated to math-ematics and theoretical physics, part of Universite Paris VI (Pierre et MarieCurie), Paris.

14 Albert EINSTEIN, German-born physicist, 1879–1955. He received the Nobel Prizein Physics in 1921, for his services to theoretical physics, and especially forhis discovery of the law of the photoelectric effect. He had worked in Bern, inZurich, Switzerland, in Prague, now capital of the Czech Republic, at ETH (Eid-genossische Technische Hochschule), Zurich, Switzerland, in Berlin, Germany, andat IAS (Institute for Advanced Study), Princeton, NJ. The Max Planck Institutefor Gravitational Physics in Potsdam, Germany, is named after him, the AlbertEinstein Institute.

15 Prince Louis Victor Pierre Raymond DE BROGLIE, 7th Duc de Broglie, Frenchphysicist, 1892–1987. He received the Nobel Prize in Physics in 1929, for hisdiscovery of the wave nature of electrons. He had worked in Paris, France.

16 Sir Isaac NEWTON, English mathematician, 1643–1727. He had worked in Cam-bridge, England, holding the Lucasian chair (1669–1701). The Isaac Newton In-stitute for Mathematical Sciences in Cambridge, England, is named after him.

Page 12: From hyperbolic systems to kinetic theory: a personalized quest

Preface XIII

knows now that one needs relativistic corrections when the velocities involvedcan be compared with the speed of light c.

Some have thought that what I had understood with H-measures was wellknown, but it is exactly as if one says that Laurent SCHWARTZ’s theory ofdistributions had been introduced by DIRAC,17 and the authors of such re-marks only show that they cannot recognize mathematics when they see it.However, such deceptive statements were also made by good mathematicians,and in that case it shows something else: in each religion, there is a funda-mentalist party who is interested in enforcing dogmas, not always because allthese people believe in them, but often because some prefer to slow down theadvance of knowledge (usually for keeping the power they have over the naivewho believe in these dogmas), and in the case that I consider it means slowingdown the evolution of science in general, and physics in particular, and it isnot too difficult to understand the political motivation of those who behavein this way, and they often associate with people who do not hide that theirwork is political, but insist in brainwashing the naive that it is correct.

Although I advocate using reason for criticizing without concessions thepoints of view that are taught in order to find better “truths”, one should ob-serve that this approach is more suited to mathematicians than to physicistsor engineers, but not all mathematicians have been trained well enough forfollowing that path, and that might explain why some people initially trainedas mathematicians write inexact statements, which they often do not changeafter being told about their mistakes, which others repeat then without know-ing that they propagate errors; if their goal had not been to mislead others, abetter strategy would have been to point out that some statements were onlyconjectures.

Of course, although a few problems of continuum mechanics or physicshave led to some of the mathematical questions described in this course, Ihave added some results for the usual reason that mathematicians are sup-posed to discover general structures hidden behind particular results, anddescribe something more general after having done a systematic study, akinto a cleaning process.

I had not consciously been following the path that Peter LAX had opened,18

of developing mathematics for a better understanding of continuum mechan-ics and physics. I first heard him talk at the Lions–Schwartz seminar at IHP17 Paul Adrien Maurice DIRAC, English physicist, 1902–1984. He received the Nobel

Prize in Physics in 1933, jointly with Erwin SCHRODINGER, for the discovery ofnew productive forms of atomic theory. He had worked in Cambridge, England,holding the Lucasian chair (1932–1969).

18 Peter David LAX, Hungarian-born mathematician, born in 1926. He received theWolf Prize in 1987, for his outstanding contributions to many areas of analysisand applied mathematics, jointly with Kiyoshi ITO. He received the Abel Prizein 2005. He works at NYU (New York University), New York, NY.

Page 13: From hyperbolic systems to kinetic theory: a personalized quest

XIV Preface

(Institut Henri Poincare) in Paris, in the late 1960s, about N -waves for theBurgers equation,19 to show that there are two invariants for integrable data(whose sum is the classical invariant

∫Ru(·, t) dx), and about the Korteweg–

de Vries equation (not yet popularized as the KdV equation),20,21 to discussits infinite list of invariants. Then, I heard him talk in 1971 in Madison, WI,during my first visit to United States, at a meeting of MRC (MathematicsResearch Center) in Madison, WI, organized by Eduardo ZARANTONELLO,22

and Peter LAX talked about “entropies” for systems, but I did not knowenough about hyperbolic systems of conservation laws at the time to appre-ciate the importance of the results that he was presenting. Actually, I knewalmost nothing of that subject, which was not really known among mathe-maticians in France in the early 1970s, and I may have helped to make itbetter known by teaching a few courses on the subject in the late 1970s, butI had first heard about the details in a course by Joel SMOLLER in Orsayin 1973,23 then in discussions with Ron DIPERNA,24 and with ConstantineDAFERMOS,25 and then in a course by Takaaki NISHIDA in Orsay in the late1970s,26 before I started teaching it myself.

Although I had understood early that Laurent SCHWARTZ was not in-terested in continuum mechanics or physics, I had taken some time to makethe same observation concerning Jacques-Louis LIONS, but in the late 1970s,once that I was explaining the point of view that one should try to under-stand more about the physical meaning of the equations that one is studying,I had been surprised to hear Jacques-Louis LIONS defend the opposite posi-tion, that in his opinion this was not strictly necessary, so after that I had no19 Johannes Martinus BURGERS, Dutch-born mathematician, 1895–1981. He had

worked at University of Maryland, College Park, MD.20 Diederik Johannes KORTEWEG, Dutch mathematician, 1848–1941. He had

worked in Amsterdam, The Netherlands.21 Gustav DE VRIES, Dutch mathematician, 1866–1934. He had worked in Breda,

in Alkmaar, and then as a high school teacher in Haarlem, The Netherlands.22 Eduardo H. ZARANTONELLO, Argentinian mathematician, born in 1918. He has

worked in La Plata, in Cordoba, in San Juan, and in San Luis y Cuyo, Argentina,but when I first met him in 1971, during my first trip to the US, he was workingat MRC (Mathematics Research Center) in Madison, WI; ten years ago he wasstill working, in Mendoza, Argentina.

23 Joel Alan SMOLLER, American mathematician. He works at University of Michi-gan, Ann Arbor, MI.

24 Ronald John DI PERNA, American mathematician, 1947–1989. He had worked atBrown University, Providence, RI, at University of Michigan, Ann Arbor, MI, atUniversity of Wisconsin, Madison, WI, at Duke University, Durham, NC, and atUCB (University of California at Berkeley), Berkeley, CA.

25 Constantine M. DAFERMOS, Greek-born mathematician, born in 1941. He hasworked at Cornell University, Ithaca, NY, and at Brown University, Providence,RI.

26 Takaaki NISHIDA, Japanese mathematician, born in 1942. He works at KyotoUniversity, Kyoto, Japan.

Page 14: From hyperbolic systems to kinetic theory: a personalized quest

Preface XV

more doubts about his interests, and our paths separated. A few years ago,Peter LAX recalled a discussion from the 1950s where Jacques-Louis LIONS

had been criticized by British applied mathematicians for focusing too muchon functional analysis and for caring very little about continuum mechan-ics, and that Jacques-Louis LIONS found nothing better than replying with ajoke,27 which showed that he was already against understanding more aboutcontinuum mechanics.

In the early 1970s, after working with Francois MURAT on an extensionof the work of Sergio SPAGNOLO on G-convergence,28 before I borrowed theterm homogenization from Ivo BABUSKA for designing it and Francois MURAT

chose to call our approach H-convergence,29 it had been the work of EvaristeSANCHEZ-PALENCIA that helped me understand the connection of our workwith continuum mechanics,30 and after that I insisted more and more aboutthe usefulness of understanding about the possible physical meanings of theequations that one studies. The main features which I tried then to develop inmy research work now look to me very similar to those which Peter LAX hadchosen for himself, to learn about results in continuum mechanics and physicsand, after developing an intuition for a particular field, to select a good subjectand to put some order in it by creating an adapted mathematical framework,and eventually introduce new mathematical tools for studying it.

In some way, the qualities that Peter LAX has shown are not so commonamong mathematicians, even those who have been in contact with him. WhenI first met Ralph PHILLIPS in the spring of 1983,31 in Stanford,32 CA, I askedhim a question about a remark of Leonardo DA VINCI,33 which I thoughtmust be classical for specialists of scattering,34 but I was surprised to discover27 Jacques-Louis LIONS’s answer was that the British could not be trusted, since

the time they had burnt Jeanne D’ARC (Joan of ARC).28 Sergio SPAGNOLO, Italian mathematician, born in 1941. He works at Universita

di Pisa, Pisa, Italy.29 Ivo M. BABUSKA, Czech-born mathematician, born in 1926. He worked at Charles

University, Prague, Czech Republic, at University of Maryland, College Park, MD,and at University of Texas, Austin, TX.

30 Enrique Evariste SANCHEZ-PALENCIA, Spanish-born mathematician, born in1941. He works at CNRS (Centre National de la Recherche Scientifique) andUniversite Paris VI (Pierre et Marie Curie), Paris, France. I have always knownhim under the French form of his first name, Henri, but he now uses his secondname, Evariste.

31 Ralph Saul PHILLIPS, American mathematician, 1913–1998. He had worked atUSC (University of Southern California), Los Angeles, CA, and at Stanford Uni-versity, Stanford, CA.

32 Leland STANFORD, American businessman, 1824–1893. Stanford University, andthe city of Stanford where it is located, are named after him.

33 Leonardo DA VINCI, Italian artist, engineer and scientist, 1452–1519. He hadworked in Milano (Milan) and in Firenze (Florence), Italy.

34 In the beginning of 1982, while I was visiting the Scuola Normale Superiore inPisa, Italy, I was told to take the train to Firenze (Florence) to see an exhibition

Page 15: From hyperbolic systems to kinetic theory: a personalized quest

XVI Preface

that Ralph PHILLIPS had no physical intuition at all, and that for him scat-tering theory was just a chapter of functional analysis, so that he had notthought of using his collaboration with Peter LAX on the subject for learningabout the physical phenomena which could be covered by their mathematicaltheory. Some have worked on a subject that Peter LAX had initialized, likethat of hyperbolic systems of conservation laws, and many have pushed theirwork in directions totally disconnected from reality, despite a warning fromConstantine DAFERMOS that the umbilical cord that joins the theory of sys-tems of conservation laws with continuum physics is still vital for the properdevelopment of the subject and it should not be severed.

For example, why are there people who play with models where thereare shocks which do not satisfy some of the conditions that Peter LAX hadintroduced, and who forget to point out that the models that they use havebeen postulated by engineers, and why is it that they do not see that they areobviously incompatible with classical ideas in thermodynamics? Of course, Ihave been teaching for many years that thermodynamics is flawed and shouldbe improved, but that does not mean that any model which is incompatible withclassical thermodynamics can be considered a good model of physical reality!

When Peter LAX introduced “entropy conditions” for systems,35 he wasgeneralizing the work for the scalar case of Eberhard HOPF,36 and of KRUZH-

KOV,37 who had found an intrinsic way for expressing a condition introducedby Olga OLEINIK,38 and he had observed that if a sequence of approximationslike that created by the method of artificial viscosity converges almost every-where, then an “entropy condition” holds, but he knew how difficult it was toobtain enough estimates for proving that desired strong convergence.39 Some

of a manuscript of Leonardo DA VINCI. To explain the fact that the surface of themoon reflects the light from the sun in every direction, Leonardo had assumedthat there were oceans on the moon and that because of waves the light couldbe reflected in various directions. We know now that there are no oceans on themoon, so that he was wrong, but I had admired Leonardo’s inventiveness, and Ihad thought that he had not been too far from guessing why a rough surface canreflect light in every direction.

35 Constantine DAFERMOS prefers to call them E-conditions, as these notions arenot always linked to thermodynamic entropy, and I had chosen myself to write“mathematical entropies” in making the distinction.

36 Eberhard Frederich Ferdinand HOPF, Austrian-born mathematician, 1902–1983.He had worked at MIT (Massachusetts Institute of Technology), Cambridge, MA,in Leipzig and in Munchen (Munich), Germany, and at Indiana University, Bloom-ington, IN, where I met him in 1980.

37 Stanislav Nikolaevich KRUZHKOV, Russian mathematician, 1936–1997. He hadworked in Moscow, Russia.

38 Olga Arsen’evna OLEINIK, Ukrainian-born mathematician, 1925–2001. She hadworked in Moscow, Russia. I do not remember when I first met her, before 1976.

39 In the mid 1960s, James GLIMM had found a way to estimate the total variationof the solution of some systems, for initial data having a small variation. It wasonly a few years after that approach of Peter LAX that I introduced a different

Page 16: From hyperbolic systems to kinetic theory: a personalized quest

Preface XVII

authors do not seem to have understood that they just repeat Peter LAX’sargument when they write articles with statements that if something con-verges strongly, then the Hilbert expansion is true,40 without pointing outthe known defects of that conjecture of HILBERT that letting the “mean freepath between collisions” tend to 0 in the Boltzmann equation gives the Eulerequation for an ideal gas,41 that the Boltzmann equation has been derived byassuming that a gas is rarefied and that, apart from having also postulatedirreversibility by introducing probabilities, it does not make any sense to ap-ply it to a dense gas by making a “mean free path between collisions” tendto 0, and that as real gases are not ideal, it means that either the Boltzmannequation does not apply to real gases or that the Hilbert expansion is false.

As in my preceding lecture notes, [20] and [21], I have given informationin footnotes about the people who have participated in the creation of theknowledge related to the subject of the course, and I refer to the prefaces ofthose lecture notes in explaining my motivation, and I just want to repeat themotto of Hugo of Saint Victor,42 Learn everything, and you will see afterwardthat nothing is useless, as it corresponds to what I have understood in myquest about how creation of knowledge occurs.

I have often heard people say about famous scientists from the past, thatluck played an important role in their discoveries, but the truth must be thatthey would have missed the importance of the new hints that had occurred ifthey had not known beforehand all the aspects of their problems. Those whopresent chance as an important factor in discovery probably wish that everyesoteric subject that they like be considered important and funded, but thatis not at all what the quoted motto is about.

I hope that the many pieces of the puzzle that I describe in this coursewill help a few mathematicians to understand a way to follow the path ofPeter LAX, by doing mathematics on problems which have been selected withcare, so that in the end they help clarify a piece of that important puzzle,understanding physics in a better way.

I would not have been able to complete the publication of my first twolecture notes and to think about revising and completing this third set oflecture notes without the support of Lucia OSTONI, and I want to thank herfor that and for much more, having given me the stability that I had lacked

method, based on the results of compensated compactness that I had introducedwith Francois MURAT, but it appeared difficult to apply for systems, and the firstto succeed was Ron DIPERNA.

40 David HILBERT, German mathematician, 1862–1943. He had worked in Konigs-berg (then in Germany, now Kaliningrad, Russia) and in Gottingen, Germany.

41 Leonhard EULER, Swiss-born mathematician, 1707–1783. He had worked in StPetersburg, Russia, in Berlin, Germany, and then again in St Petersburg.

42 Hugo VON BLANKENBURG, German-born theologian, 1096–1141. He had workedat the monastery of Saint Victor in Paris, France.

Page 17: From hyperbolic systems to kinetic theory: a personalized quest

XVIII Preface

so much in the last twenty-five years, so that I could feel safer in resumingmy research of giving a sounder mathematical foundation to 20th centurycontinuum mechanics and physics.

I want to thank my good friends Carlo SBORDONE and Franco BREZZI forhaving proposed to publish my lecture notes in a series of Unione MatematicaItaliana. I also want to thank the referee for the improvements that he hassuggested.

Milano, July 2007 Luc TARTAR

Correspondant de l’Academie des Sciences, ParisMembro Straniero dell’Istituto Lombardo Accademia di Scienze e Lettere,

MilanoUniversity Professor of Mathematics

Department of Mathematical SciencesCarnegie Mellon UniversityPittsburgh, PA 15213-3890

United States of America

Notes on names cited in footnotes for the Preface, LUCAS,43 P. CURIE and M.SK�LODOWSKA-CURIE,44 FIELDS,45 DIDEROT,46 CAVENDISH,47 PLANCK,48

43 Reverend Henry LUCAS, English clergyman and philanthropist, 1610–1663.44 Pierre CURIE, French physicist, 1859–1906. He and his wife, Marie SK�LODOWSKA-

CURIE, Polish-born physicist, 1867–1934, received the Nobel Prize in Physics in1903, in recognition of the extraordinary services they have rendered by their jointresearch on the radiation phenomena discovered by Professor Henri BECQUEREL,jointly with Antoine Henri BECQUEREL. Marie SK�LODOWSKA-CURIE also re-ceived the Nobel Prize in Chemistry in 1911, in recognition of her services to theadvancement of chemistry by the discovery of the elements radium and polonium,by the isolation of radium and the study of the nature and compounds of thisremarkable element. They had worked in Paris, France. University Paris VI inParis, France, is named after them, Universite Pierre et Marie Curie.

45 John Charles FIELDS, Canadian mathematician, 1863–1932. He had worked inMeadville, PA, and in Toronto, Ontario.

46 Denis DIDEROT, French philosopher and writer, 1713–1784. He had worked inParis, France, and he was the editor-in-chief of the Encyclopedie. Universite Paris7, Paris, France, is named after him.

47 Henry CAVENDISH, English physicist and chemist (born in Nice, not yet in Francethen), 1731–1810. He was wealthy and lived in London, England.

48 Max Karl Ernst Ludwig PLANCK, German physicist, 1858–1947. He received theNobel Prize in Physics in 1918, in recognition of the services he rendered to theadvancement of physics by his discovery of energy quanta. He had worked in Kiel

Page 18: From hyperbolic systems to kinetic theory: a personalized quest

Preface XIX

SCHRODINGER,49 WOLF,50 ITO51 ABEL,52 BROWN,53 DUKE,54 CORNELL,55

CHARLES IV,56 BERKELEY,57 Jeanne D’ARC,58 and for the preceding foot-notes, NOBEL,59 BECQUEREL,60 James GLIMM.61

and in Berlin, Germany. There is a Max Planck Society for the Advancement ofthe Sciences, which promotes research in many institutes, mostly in Germany (Ispent my sabbatical year 1997–1998 at the Max Planck Institute for Mathematicsin the Sciences in Leipzig, Germany).

49 Erwin Rudolf Josef Alexander SCHRODINGER, Austrian-born physicist, 1887–1961. He received the Nobel Prize in Physics in 1933, jointly with Paul AdrienMaurice DIRAC, for the discovery of new productive forms of atomic theory. Hehad worked in Vienna, Austria, in Jena and in Stuttgart, Germany, in Breslau(then in Germany, now Wroc�law, Poland), in Zurich, Switzerland, in Berlin, Ger-many, in Oxford, England, in Graz, Austria, and in Dublin, Ireland.

50 Ricardo WOLF, German-born inventor, diplomat and philanthropist, 1887–1981.He emigrated to Cuba before World War I; from 1961 to 1973 he was Cuban Am-bassador to Israel, where he stayed afterwards. The Wolf Foundation was estab-lished in 1976 with his wife, Francisca SUBIRANA-WOLF, 1900–1981, to promotescience and art for the benefit of mankind.

51 Kiyosi ITO, Japanese mathematician, born in 1915. He received the Wolf Prize in1987, for his fundamental contributions to pure and applied probability theory,especially the creation of the stochastic differential and integral calculus, jointlywith Peter LAX. He worked in Kyoto, Japan, although he worked at some timeat Aarhus University, Aarhus, Denmark (1966–1969) and at Cornell University,Ithaca, NY (1969–1975).

52 Niels Henrik ABEL, Norwegian mathematician, 1802–1829.53 Nicholas BROWN Jr., American merchant, 1769–1841. Brown University, Provi-

dence, RI, is named after him.54 Washington DUKE, American industrialist, 1820–1905. Duke University, Durham,

NC, is named after him.55 Ezra CORNELL, American philanthropist, 1807–1874. Cornell University, Ithaca,

NY, is named after him.56 CHARLES IV of Luxembourg, 1316–1378. German king and King of Bohemia (in

1346) and Holy Roman Emperor (in 1355) as Karl IV. Charles University, whichhe founded in Prague in 1348, is named after him.

57 George BERKELEY, Irish-born philosopher and Anglican Bishop, 1685–1753. Thecity of Berkeley, CA, is named after him.

58 Jeanne D’ARC, French national heroine, and saint, 1412–1431. She was beatifiedin 1909, and canonized in 1920.

59 Alfred NOBEL, Swedish industrialist and philanthropist, 1833–1896. He createda fund to be used as awards for people whose work most benefited humanity.

60 Antoine Henri BECQUEREL, French physicist, 1852–1908. He received the NobelPrize in Physics in 1903, in recognition of the extraordinary services he has ren-dered by his discovery of spontaneous radioactivity, jointly with Pierre CURIE

and Marie SK�LODOWSKA-CURIE. He had worked in Paris, France.61 James G. GLIMM, American mathematician, born in 1934. He worked at MIT

(Massachusetts Institute of Technology), Cambridge, MA, at NYU (New YorkUniversity), New York, NY, and at SUNY (State University of New York), StonyBrook, NY.

Page 19: From hyperbolic systems to kinetic theory: a personalized quest

XX Preface

Detailed Description of Lectures

a.b: refers to definition, lemma or theorem # b in lecture # a, while (a.b)refers to equation # b in lecture # a.

Lecture 1: Historical Perspective.Conservation laws (1.1)–(1.2), linearized wave equation (1.3)–(1.5), quasi-

linear wave equation (1.6), gas dynamics (1.7)–(1.9), Burgers equation (1.10)–(1.13).

Lecture 2: Hyperbolic Systems: Riemann Invariants, Rarefaction Waves.Linear system (2.1), 2.1: linear hyperbolic or strictly hyperbolic system,

eigenvalues and eigenvectors (2.2)–(2.4), solution of linear hyperbolic system(2.5)–(2.7), 2.2: quasi-linear hyperbolic or strictly hyperbolic system (2.8),gas dynamics (2.9)–(2.28), 2.3: Riemann problem (2.29), solution of linearcase (2.30), 2.4: Riemann invariants (2.31), integral curves (2.32), Riemanninvariants for gas dynamics (2.33)–(2.35), 2.5: simple waves, equations forRiemann problem (2.36)–(2.39), 2.6: linearly degenerate of genuinely nonlinearfields (2.40)–(2.41), the case of gas dynamics (2.42)–(2.44).

Lecture 3: Hyperbolic Systems: Contact Discontinuities, Shocks.Contact discontinuities (3.1)–(3.2), conservation forms (3.3)–(3.4), 3.1:

weak solutions (3.5)–(3.7), 3.2: Rankine–Hugoniot conditions (3.8)–(3.10), thecase of gas dynamics (3.11)–(3.12), 3.3: shocks (3.13)–(3.17), 3.4: entropy andentropy flux (3.18).

Lecture 4: The Burgers Equation and the 1-D Scalar Case.Burgers equation (4.1), Burgers–Hopf equation and Hopf–Cole transform

(4.2)–(4.5), one sided inequality for ux implying uniqueness (4.6), Lax–Friedrichs scheme (4.7)–(4.8), CFL condition (4.9), order-preserving prop-erty (4.10)–(4.11), 4.1: Crandall–Tartar lemma, application to Lax–Friedrichsscheme (4.12)–(4.13).

Lecture 5: The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik.Galilean transformation (5.1), nonuniqueness (5.2)–(5.3), 5.1: Oleinik E-

condition (5.4)–(5.5), 5.2: Lax E-condition (5.6)–(5.7), rarefaction wave (5.8)–(5.11), shock (5.12).

Lecture 6: Hopf’s Formulation of the E-Condition of Oleinik.Hopf’s entropy condition (6.1)–(6.2), a family of entropy giving Oleinik

E-condition (6.3), Lax generalization to systems (6.4)–(6.7), Lax–Friedrichsscheme (6.8), viscous shock profile (6.9)–(6.14).

Lecture 7: The Burgers Equation: Special Solutions.One-sided inequality for ux implying uniqueness (7.1), perturbation of

a constant (7.2)–(7.8), perturbation of Riemann data (7.9)–(7.12), variousscalings (7.13)–(7.19), perturbation of a rarefaction wave (7.20)–(7.27).

Lecture 8: The Burgers Equation: Small Perturbations; the Heat Equation.

Page 20: From hyperbolic systems to kinetic theory: a personalized quest

Preface XXI

The danger of linearization (8.1)–(8.8), heat equation (8.9), Fokker–Planckequation (8.10), elementary solution of heat equation (8.11)–(8.12), differencescheme for 1-D heat equation (8.13)–(8.16).

Lecture 9: The Fourier Transform; the Asymptotic Behaviour for the HeatEquation.

Fourier transform of integrable functions (9.1)–(9.3), derivation and mul-tiplication (9.4)–(9.5), Fourier transform on S(RN ) and S′(RN ) (9.6)–(9.8),Plancherel formula (9.9), inverse Fourier transform (9.10), affine change ofvariable (9.11), Fourier transform of a convolution product (9.12)–(9.13),Fourier transform for the heat equation (9.14)–(9.22), semi-group (9.23)–(9.24), scaling and decay (9.25)–(9.28), 9.1: relation with moments and decay(9.29)–(9.30), matrix of inertia and anisotropic Gaussians (9.31)–(9.33), solv-ing a diffusion equation with anisotropic Gaussians (9.34)–(9.42).

Lecture 10: Radon Measures; the Law of Large Numbers.Radon measures (10.1), Fourier transform of a Radon measure (10.2)–

(10.4), centre of mass and convolution (10.5)–(10.7), 10.1: law of large numbers(10.8), matrix of inertia and convolution (10.9)–(10.11), 10.2: strong law oflarge numbers (10.12).

Lecture 11: A 1-D Model with Characteristic Speed 1ε .

Explicit difference schemes (11.1), 1-D model with velocities ± 1ε (11.2),

11.1: limit as ε→ 0 (11.3)–(11.4),

Lecture 12: A 2-D Generalization; the Perron–Frobenius Theory.2-D model with velocities ± 1

ε along axes (12.1)–(12.4), 12.1: reduciblematrices, 12.2: a condition for irreducibility, 12.3: ρ(A) is a simple eigenvaluewith positive eigenvector, 12.4: the case of other eigenvalues of modulus ρ(A),12.5: primitive or imprimitive irreducible matrices, 12.6: a criterion using thelength of loops, 12.7: asymptotic behaviour of Anw as n→ ∞.

Lecture 13: A General Finite-Dimensional Model with Characteristic Speed1ε .

The model (13.1)–(13.5), 13.1:M e = 0 and e positive, L∞ estimate (13.6),13.2: coerciveness on e⊥ (13.7), estimates (13.8)–(13.10), convergence (13.11)–(13.17).

Lecture 14: Discrete Velocity Models.Conservations in a collision (14.1), probabilities (14.2), general model

(14.3), properties of coefficients (14.4)–(14.8), entropy (14.9)–(14.10), conser-vations and decay of entropy (14.11)–(14.13), four velocities Maxwell model(14.14)–(14.15), general semi-linear case (14.16)–(14.17), 14.1: local exis-tence (14.18), 1-D four velocities model and Broadwell model (14.19)–(14.20),14.2: finite propagation speed, 14.3: condition for positivity (14.21), 14.4:forward invariant sets for ordinary differential equations, 14.5: characteri-zation of forward invariant sets (14.22), 14.6: forward invariant sets for asemi-linear system, characterization (14.23)–(14.24), a model with a bounded

Page 21: From hyperbolic systems to kinetic theory: a personalized quest

XXII Preface

forward invariant set (14.25)–(14.27), Carleman model (14.28)–(14.29), formal(Hilbert) expansion for Broadwell model (14.30)–(14.37), restriction of con-volution product on circle (14.38).

Lecture 15: The Mimura–Nishida and the Crandall–Tartar Existence Theo-rems.

15.1: Mimura–Nishida existence theorem (15.1)–(15.2) and (15.8)–(15.12),15.2: Crandall–Tartar existence theorem (15.3)–(15.5), 15.3: use of bounds onentropy (15.6)–(15.7).

Lecture 16: Systems Satisfying My Condition (S).Condition (S) (16.1)–(16.2), 16.1: spaces Vc andWc, 16.2: product onWc1×

Wc2 with c1 �= c2 (16.3), 16.3: global existence (t ∈ R) for small data in L1

(16.4)–(16.13), a case of necessity for small data (16.14), 16.4: local existencefor data in L1 (16.15), asymptotic behaviour (16.16), 16.5: a Mimura–Nishidatype estimate (16.17).

Lecture 17: Asymptotic Estimates for the Broadwell and the Carleman Mod-els.

Asymptotic behaviour for the Broadwell model (17.1)–(17.4), 2-D four ve-locities model (17.5)–(17.7), Illner–Reed estimate for Carleman model (17.8),self-similar solutions of Carleman model (17.9)–(17.12).

Lecture 18: Oscillating Solutions; the 2-D Broadwell Model.Oscillating solutions of Carleman model (18.1)–(18.2), 18.1: div-curl lemma

(18.3)–(18.5), 18.2: application (18.6), systems stable by weak convergence(18.7)–(18.9), Gagliardo–Nirenberg estimate (18.10), application to 2-D fourvelocities model (18.11)–(18.17).

Lecture 19: Oscillating Solutions: the Carleman Model.Rescaling of a solution (19.1), bounded sequences of solutions (19.2)–

(19.4), general system of two equations (19.5), extracting converging subse-quences (19.6)–(19.9), an infinite system (19.10)–(19.11), uniqueness (19.12)–(19.14), strength of oscillations and differential inequalities (19.15)–(19.18).

Lecture 20: The Carleman Model: Asymptotic Behaviour.Integrable nonnegative data and rescaling (20.1)–(20.4), 20.1: strong con-

vergence in |x| > t+ε (20.5)–(20.7), 20.2: a subsequence converges to a solutionof Carleman with support in |x| ≤ t, formal (Hilbert) limit of the Broadwellmodel (20.8)–(20.12), 20.3: the case of Carleman model (20.13)–(20.14), Kurtzscaling (20.15)–(20.18), oscillating solutions for Broadwell model (20.19)–(20.23).

Lecture 21: Oscillating Solutions: the Broadwell Model.Properties of weak limits and the weak limit X111 of unvnwn (21.1)–(21.8),

21.1: estimate for X111 (21.9), inequality for σw (21.10), periodically modu-lated case (21.11)–(21.12), 21.2: the Carleman model (21.13)–(21.20), 21.3: theBroadwell model (21.21)–(21.31), a system for Fourier coefficients (21.32).

Page 22: From hyperbolic systems to kinetic theory: a personalized quest

Preface XXIII

Lecture 22: Generalized Invariant Regions; the Varadhan Estimate.The Broadwell model (22.1)–(22.10), Varadhan potential of interaction

I(t) (22.11), 22.1: the decrease of I(t) (22.12)–(22.17), the Carleman model(22.18)–(22.29).

Lecture 23: Questioning Physics; from Classical Particles to Balance Laws.Potential in 1

r (23.1), Maxwell–Heaviside equation and Lorentz force(23.2)–(23.3), conservation of mass and balance of momentum (23.4)–(23.8),Cauchy stress (23.9)–(23.10).

Lecture 24: Balance Laws; What Are Forces?Conservation of mass in the sense of distributions (24.1)–(24.4), balance

of momentum in the sense of distributions (24.5)–(24.11), the fluid quantitiesin kinetic theory (24.12)–(24.13).

Lecture 25: D. Bernoulli: from Masslets and Springs to the 1-D Wave Equa-tion.

The evolution equation (25.1), equilibria (25.2)–(25.8), the time-dependentcase (25.9)–(25.13), vibration frequencies (25.14)–(25.17), deriving equationsfor f(x,v, t) (25.18)–(25.25).

Lecture 26: Cauchy: from Masslets and Springs to 2-D Linearized Elasticity.The 1-D case (26.1)–(26.3), a 1-D dissipative model (26.4)–(26.6), lin-

earized elasticity (26.7)–(26.9).

Lecture 27: The Two-Body Problem.Conservation of linear and angular momentum (27.1)–(27.6), parametriza-

tion of collisions (27.7)–(27.12).

Lecture 28: The Boltzmann Equation.General form (28.1)–(28.4), forces in distance−s (28.5)–(28.6), a critical

question (28.7), Fokker–Planck equation (28.8), conservations (28.9)–(28.11),fluid quantities (28.12)–(28.17), conservation laws (28.18)–(28.23), 28.1: col-lision invariants (28.24)–(28.25), 28.2: characterization of collision invariants(28.26)–(28.35), variation of entropy and importance of Maxwellian distribu-tions (28.36)–(28.43), relation with thermodynamics (28.44)–(28.47).

Lecture 29: The Illner–Shinbrot and the Hamdache Existence Theorems.The iterative method (29.1)–(29.4), the estimates to be proven (29.5)–

(29.8), a choice of function and verification (29.9)–(29.13).

Lecture 30: The Hilbert Expansion.The expansion (30.1)–(30.3), coefficient of ε−1 (30.4), coefficient of ε0 and

consequences (30.5)–(30.10), viscous stress tensor (30.11), rectangular “Gaus-sians” (30.12)–(30.13).

Lecture 31: Compactness by Integration.31.1: f, ft + vfx, fv ∈ L2 imply f ∈ H

1/2loc (31.1)–(31.4), 31.2: commutator

of ∂t + v∂x and ∂vk(31.5)–(31.8), 31.3: discrete analogue of commutation

Page 23: From hyperbolic systems to kinetic theory: a personalized quest

XXIV Preface

(31.9)–(31.10), 31.4: discrete analogue of half derivatives (31.11)–(31.17), 31.5:compactness by integration (31.18)–(31.23).

Lecture 32: Wave Front Sets; H-Measures.First-order equations and bicharacteristic rays (32.1)–(32.2), Wigner trans-

form (32.3)–(32.5), H-measures (32.6)–(32.9), localization principle (32.10).

Lecture 33: H-Measures and “Idealized Particles”.H-measures for the wave equations (33.1)–(33.5), internal energy and

equipartition of energy (33.6)–(33.7).

Lecture 34: Variants of H-Measures.Geometrical optics (34.1)–(34.5), my proposal for introducing a character-

istic length (34.6), Gerard’s proposal of semi-classical measures (34.7), P.-L.Lions & Paul’s proposal to define them with Wigner transform (34.8)–(34.9),an observation of Wigner (34.10), 34.1: k-point correlation measures (34.11)–(34.12), 34.2: properties of correlation measures (34.13)–(34.14). Conclusion.

35: Biographical Information.Basic biographical information for people whose name is associated with

something mentioned in the lecture notes.

36: Abbreviations and Mathematical Notation.

References.

Index.

Page 24: From hyperbolic systems to kinetic theory: a personalized quest

Contents

1 Historical Perspective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

2 Hyperbolic Systems: Riemann Invariants, RarefactionWaves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

3 Hyperbolic Systems: Contact Discontinuities, Shocks . . . . . . 31

4 The Burgers Equation and the 1-D Scalar Case . . . . . . . . . . . . 39

5 The 1-D Scalar Case: the E-Conditions of Laxand of Oleinik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

6 Hopf’s Formulation of the E-Condition of Oleinik . . . . . . . . . . 51

7 The Burgers Equation: Special Solutions . . . . . . . . . . . . . . . . . . . 57

8 The Burgers Equation: Small Perturbations; the HeatEquation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

9 Fourier Transform; the Asymptotic Behaviourfor the Heat Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

10 Radon Measures; the Law of Large Numbers . . . . . . . . . . . . . . 83

11 A 1-D Model with Characteristic Speed 1ε

. . . . . . . . . . . . . . . . . 91

12 A 2-D Generalization; the Perron–Frobenius Theory . . . . . . . 97

13 A General Finite-Dimensional Model with CharacteristicSpeed 1

ε. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

14 Discrete Velocity Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

Page 25: From hyperbolic systems to kinetic theory: a personalized quest

XXVI Contents

15 The Mimura–Nishida and the Crandall–Tartar ExistenceTheorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

16 Systems Satisfying My Condition (S) . . . . . . . . . . . . . . . . . . . . . . 135

17 Asymptotic Estimates for the Broadwelland the Carleman Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143

18 Oscillating Solutions; the 2-D Broadwell Model . . . . . . . . . . . . 149

19 Oscillating Solutions: the Carleman Model . . . . . . . . . . . . . . . . . 157

20 The Carleman Model: Asymptotic Behaviour . . . . . . . . . . . . . . 163

21 Oscillating Solutions: the Broadwell Model . . . . . . . . . . . . . . . . 169

22 Generalized Invariant Regions; the Varadhan Estimate . . . . 179

23 Questioning Physics; from Classical Particles to BalanceLaws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187

24 Balance Laws; What Are Forces? . . . . . . . . . . . . . . . . . . . . . . . . . . 197

25 D. Bernoulli: from Masslets and Springsto the 1-D Wave Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201

26 Cauchy: from Masslets and Springs to 2-D LinearizedElasticity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

27 The Two-Body Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213

28 The Boltzmann Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219

29 The Illner–Shinbrot and the Hamdache ExistenceTheorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229

30 The Hilbert Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233

31 Compactness by Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239

32 Wave Front Sets; H-Measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245

33 H-Measures and “Idealized Particles” . . . . . . . . . . . . . . . . . . . . . 251

34 Variants of H-Measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257

35 Biographical Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267

36 Abbreviations and Mathematical Notation . . . . . . . . . . . . . . . . . 271

Page 26: From hyperbolic systems to kinetic theory: a personalized quest

Contents XXVII

References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275

Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277

Page 27: From hyperbolic systems to kinetic theory: a personalized quest

1

Historical Perspective

The goal of these lectures is to study partial differential equations related toquestions of kinetic theory, and to elucidate some of the questions of contin-uum mechanics or physics which lie behind these problems.

One may arrive at these questions from different ways and many interestingmathematical questions arise in the various approaches.

From a classical mechanics point of view, one imagines a collection of rigidbodies moving under some set of forces, for example gravitational attractionbetween them, and one wants to study the evolution of such a system. Ofcourse, one should also consider electromagnetic effects, and ALFVEN hasexplained by electromagnetic effects some of the features observed in galax-ies,1 which astrophysicists pretend to explain by gravitational effects only,and I have read that there are anomalies in the movement of the planetJupiter, which might be related to its important magnetic properties, becauseof Lorentz forces,2 but then one could not just play with ordinary differentialequations as is usual in classical mechanics, and one would have to add what isknown as the Maxwell equation, which is a system of partial differential equa-tions, and it becomes the realm of continuum mechanics, but in these lecture1 Hannes Olof Gosta ALFVEN, Swedish-born physicist, 1908–1995. He received the

Nobel Prize in Physics in 1970, for fundamental work and discoveries in magneto-hydrodynamics with fruitful applications in different parts of plasma physics,jointly with Louis NEEL. He had worked in Uppsala and Stockholm, Sweden, inUCSD (University of California at San Diego), La Jolla, CA, and USC (Universityof Southern California), Los Angeles, CA.

2 Hendrik Antoon LORENTZ, Dutch physicist, 1853–1928. He received the NobelPrize in Physics in 1902, jointly with Pieter ZEEMAN, in recognition of the ex-traordinary service they rendered by their research into the influence of magnetismupon radiation phenomena. He had worked in Leiden, The Netherlands. The In-stitute for Theoretical Physics in Leiden, The Netherlands, is named after him,the Lorentz Institute.

Page 28: From hyperbolic systems to kinetic theory: a personalized quest

2 1 Historical Perspective

notes I shall call it the Maxwell–Heaviside equation,3 because if MAXWELL

had unified the previous results on electricity and on magnetism obtained byAMPERE,4 GAUSS,5 BIOT and SAVART,6,7 and FARADAY,8 it is to HEAVISIDE

that one owes the simplified version of the Maxwell–Heaviside equation usingvector calculus.9

In considering rigid bodies which are submitted to forces acting at a dis-tance, one uses the point of view of NEWTON, which he developed for gravi-tation. As explained by FEYNMAN in taped lectures given at Cornell Univer-sity,10 the difficulty that NEWTON had overcome was that in his day, it wasexplained that planets turn around the sun (or the moon around the earth)because angels were pulling them, and he did not question the existence ofangels, but the fact that these angels were believed to pull the planets ina tangential way, and NEWTON’s first contribution was to observe that theforce applied by the angels was towards the sun for a planet, or towards theearth for the moon. Then, he realized that the force pulling the moon mustbe the same as the force drawing apples towards the ground, so that he haddiscovered the name of these angels, gravitation. NEWTON added a curious ar-gument for having the gravitational force decay in distance−2, while he could3 Oliver HEAVISIDE, English engineer, 1850–1925. He had worked as a telegrapher,

in Denmark, in Newcastle upon Tyne, England, and then did research on his own,living in the south of England.

4 Andre Marie AMPERE, French mathematician, 1775–1836. He had worked inBourg, in Lyon, and in Paris, France.

5 Johann Carl Friedrich GAUSS, German mathematician, 1777–1855. He hadworked in Gottingen, Germany.

6 Jean-Baptiste BIOT, French mathematician and physicist, 1774–1862. He hadworked in Beauvais, and in Paris, France, holding a chair (physique mathemati-que, 1801–1862) at College de France, Paris.

7 Felix SAVART, French physicist, 1791–1841. He had worked at College de France,Paris, France (physique generale et experimentale, 1836–1841).

8 Michael FARADAY, English chemist and physicist, 1791–1867. He had worked inLondon, England, as Fullerian professor of chemistry at the Royal Institution ofGreat Britain.

9 MAXWELL had imagined mechanical devices for transmitting the electric fieldand the magnetic field, and I read that HEAVISIDE replaced a set of 20 equationsin 20 variables that MAXWELL had written by a set of 4 equations in 2 variables.HEAVISIDE had also developed an operational calculus, which was given a math-ematical explanation by Laurent SCHWARTZ, using his theory of distributions.

10 Richard Phillips FEYNMAN, American physicist, 1918–1988. He received theNobel Prize in Physics in 1965, jointly with Sin-Itiro TOMONAGA and JulianSCHWINGER, for their fundamental work in quantum electrodynamics, with deep-ploughing consequences for the physics of elementary particles. He had workedat Cornell University, Ithaca, NY, and at Caltech (California Institute of Tech-nology), Pasadena, CA.

Page 29: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 3

have deduced that from one of Kepler’s laws.11 Forces at a distance pertain toclassical mechanics, and they only involve ordinary differential equations, buta general existence theory for ordinary differential equations was not knownuntil CAUCHY for the analytic case,12 and LIPSCHITZ for more general cases.13

Again, NEWTON’s point of view is classical mechanics, but there is somethingwrong about forces acting at a distance in an instantaneous way; one difficultyis about a force acting at a distance (and about what a force is anyway, butthat goes beyond continuum mechanics too), but another difficulty is aboutaction being instantaneous, and if one tries to give a precise meaning to in-stantaneity, one is bound to find the question of relativity, which was firststudied by POINCARE, and he observed that the Maxwell–Heaviside equationis invariant by the Lorentz group, but it was EINSTEIN who really understoodthe physical meaning of the question. However, it seems to be POINCARE’sunderstanding of relativity that a particle feels the action of the field and tellsthe field that it is there, so that the field transmits (at the velocity of light c)the information between particles: a particle does not store any informationabout the positions of the other particles, and mathematically it leads to thestudy of semi-linear hyperbolic systems having only the velocity of light c ascharacteristic velocity, but again there is a problem with the precise notion ofa particle, which goes beyond continuum mechanics.

In the case of a universe made up of a finite number of classical particles,LAGRANGE had an interesting thought,14 that if one was given the initialposition of all the particles, then the whole future of the universe could bedescribed, but he overlooked a few problems; of course, there were alreadysome hints at his time that the world is not described by ordinary differentialequations, like the Euler equation for ideal fluids, but if it had been as heimagined, he had not actually proven a global existence theorem for ordinarydifferential equations because of possible collisions ; another difficulty is thatone may need infinite accuracy on the initial data because of possible chaoticeffects, as was first observed by POINCARE (although the term chaos wascoined much later, and is used now by people who usually forget to say howmuch they owe to POINCARE for the tools that they use, not always in anaccurate way if one considers the reactions provoked by those who had the idea11 Johannes KEPLER, German-born mathematician, 1571–1630. He had worked in

Graz, Austria, in Prague, now capital of the Czech republic, and in Linz, Austria.12 Augustin Louis CAUCHY, French mathematician, 1789–1857. He was made Baron

by CHARLES X. He had worked in Paris, France, went into exile after the 1830revolution and worked in Torino (Turin), Italy, returned from exile after the 1848revolution, and worked in Paris again.

13 Rudolf Otto Sigismund LIPSHITZ, German mathematician, 1832–1903. He hadworked in Breslau (then in Germany, now Wroc�law, Poland) and in Bonn, Ger-many.

14 Giuseppe Lodovico LAGRANGIA (Joseph Louis LAGRANGE), Italian-born mathe-matician, 1736–1813. He had worked in Torino (Turin) Italy, in Berlin, Germany,and in Paris, France. He was made Count in 1808 by NAPOLEON I.

Page 30: From hyperbolic systems to kinetic theory: a personalized quest

4 1 Historical Perspective

of explaining instability by saying that it is as if the movement of a butterfly inBrazil could create a storm in New York; predictably, if one considers the lowlevel of scientific knowledge nowadays, it was misunderstood that butterfliesin Brazil have an effect on the weather in New York. Some people have beenupset enough to take the time to show that this kind of effect is precluded bysome models of hydrodynamics (but they have probably not explained thatlots of terms had been thrown out of the models that they use, just becausethey were believed to be small, although the derivative of something smallis not always small, and the long-term effect of those terms had never beenascertained). Both these reactions were a little silly, because all that had beensaid was that a small cause (like the movement of a butterfly in Brazil, orany other thing that one likes to think of as very small) might create a largeeffect (like a storm in New York, or any other thing that one likes to thinkof as very large); however, one should observe that most people do not evenunderstand the difference between the quantifiers ∃ and ∀, and one shouldhave explained that for some systems of ordinary differential equations a verysmall perturbation at some point may become very large later on (at a differentpoint), but not every small perturbation at a point has this property (becauseperturbing in the direction of the flow is just a translation in time, whichremains under control), and that for a given system this is not valid for allpoints, and that this effect does not happen for all systems. Anyway, the worldis not described by ordinary differential equations, and those who believe thatpartial differential equations always behave like ordinary differential equationsshould start by learning about which terms have been neglected in arrivingat the model that they use, and they should then prove, and not postulate,that these terms can really be neglected because their later effects will alwaysremain very small, unlike the chaotic behaviour that they pretend to specializeupon.

Using rigid bodies is also an approximation, and one could think of con-sidering elastic bodies, but that would also force us to use partial differentialequations instead of ordinary differential equations, and a particular difficultywould actually arise because of questions of finite elasticity which are not yetwell understood.15

15 Finite is not opposite to infinite but to infinitesimal: if a point x in an initialconfiguration is moved to a point u(x), an hypothesis of infinitesimal deformationconsists in assuming that ∇u(x) is near I , and this leads to linearized elasticity,while in finite elasticity one only assumes that ∇u(x) is near a rotation (but itmay be far from rotations for materials like rubber), and that leads to problemswhich are not so well understood from a mathematical point of view, becauseone should look at the evolution problem, of course, and one cannot use thesimplistic view that elastic materials minimize their potential energy, which isa fake continuum mechanics point of view, which has been pushed forward bysome adepts of the calculus of variations, probably because it is irrelevant froma physical point of view.

Page 31: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 5

Knowing all these limitations is just a way to know in advance that somequestions are not too physical, like the asymptotic behaviour of an approxi-mate system of ordinary differential equations for example, and a lot of whatis said is hardly relevant from a realistic point of view, because one assumesthat the models used are exact, and there are always a few things which havebeen neglected so that the model can be accurate for a large time, but notfor an infinite time. For example, fluids are not incompressible, and one canmeasure a finite speed of progation of sound (about 300 meters per second forair and 1,500 meters per second for water) while it is infinite for an incom-pressible fluid; discussing the asymptotic behaviour of a truncated system isusually not relevant, in particular for turbulence, which is not about lettingtime go to infinity anyway, except possibly in infinite domains when one maydo a rescaling of space.

Assuming that one works with rigid bodies, one must compute the resul-tant of the forces applied to the body, and consider that it is applied to thecentre of gravity of the body, which will move according to the classical law ofmotion, resulting from the work of NEWTON, after some initial thoughts byGalileo,16 that force is mass × acceleration, and the resultant torque whichwill make the body rotate, and for which one needs to know the matrix ofinertia of the body. In most treatments of kinetic theory, torque effects areneglected as if the body were points, but in the case of colliding spheres any-one having played billiards knows about the importance of spin for the resultof a collision, and such questions should be addressed.

We are not interested in asymptotic behaviour but in the finite-time ex-istence in the case of a large number of particles;17 of course, if one letsthe number of particles go to infinity, the mass of each particle must bescaled accordingly. An important problem is to study possible collisions ornear collisions.16 Galileo GALILEI, Italian mathematician, 1564–1642. He had worked in Siena, in

Pisa, in Padova (Padua), Italy, and again in Pisa.17 Celestial mechanics is interested in a small number of particles (planets), and

apart from proving that no collisions will occur, one wants to know if a solutionstays globally bounded (once one moves with the centre of gravity), and thatmeans analysing if a planet can escape to infinity; this cannot happen if there isnot enough energy in the system, as the total energy (kinetic energy plus potentialenergy) is conserved, and in the case of two bodies the escape velocity can be easilycomputed. The escape velocity from the attraction of the earth is around 11.2

km s−1; it corresponds to the kinetic energy v2

2being able to compensate for the

difference in potential between the surface of the earth and infinity, equal to GMR

,and GM

R2 is the acceleration of gravity, around 9.81 m s−2, and the radius of theearth R is around 6,378 km (the gravitational constant G has been measured asaround 6.67 ×10−11 N m2 kg−2, where N stands for newton, the unit of force, sothat the mass of the earth M is around 5.98 ×1024 kg).

Page 32: From hyperbolic systems to kinetic theory: a personalized quest

6 1 Historical Perspective

A puzzling fact is that, although the system is Hamiltonian,18 a notionalready introduced by LAGRANGE, so that it conserves energy, one finds nu-merical evidence that the energy decreases when one computes solutions witha large number of particles. The same effect would be observed by two math-ematical observers, one using time in the usual way and the other reversingtime (and velocities), so that one observes some kind of irreversibility, whichonly occurs because the number of particles gets very large; of course, fora given number of particles, one could make the numerical methods preciseenough for avoiding the loss of energy, but if the number of particles tendsto infinity and the masses of the particles are rescaled it becomes a differentproblem to ascertain what the limit is. The observation that some energy is“lost” is not in contradiction with other approaches, where one uses internalenergy, and where irreversibility also occurs, so that one seems to be losingenergy but the “lost” part is just hidden and it can be followed as heat, and atechnical word will be associated with this effect, entropy, and one will haveto understand what it means, but the main mathematical difficulty will bethat the actual postulates concerning the second principle in thermodynamicsare inadequate and should be improved, but in ways which have not beenunderstood yet.

From the continuum mechanics point of view, partial differential equationswere used for describing the movement of a gas or a liquid, and in theseequations various thermodynamical quantities appeared like the density � andthe pressure p, as in the Euler equation for ideal (inviscid) fluids

∂�

∂t+

3∑

i=1

∂(� ui)∂xi

= 0, (1.1)

expressing the conservation of mass, and

∂(� uj)∂t

+3∑

i=1

∂(� ui uj)∂xi

+∂p

∂xj= 0 for j = 1, 2, 3, (1.2)

expressing the balance of linear momentum, and later other quantities wereadded, the absolute temperature θ (or T ), the internal energy (per unit ofmass) e, the entropy (per unit of mass) s, and so on. Early in the study ofgases, it had been found by BOYLE in 1662,19 and by MARIOTTE in 1676,20

that the product of pressure by volume is constant, and for a long time it wasimplicitly assumed that one worked at constant temperature, and it is worthrecalling how the notion of temperature had evolved.18 Sir William Rowan HAMILTON, Irish mathematician, 1805–1865. He had worked

in Dublin, Ireland.19 Robert BOYLE, Irish-born physicist, 1627–1691. He had worked in Oxford, and

in London, England.20 Edme MARIOTTE, French physicist and priest, 1620–1684. He had been prior of

Saint Martin sous Beaune, near Dijon, France.

Page 33: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 7

The first thermoscope was invented by Galileo in 1593, the first ther-mometer using air by SANTORIO,21 and the first thermometer using liquidby REY,22 but the first sealed thermometer that used liquid (alcohol) was in-vented in 1654 by FERDINAND II,23 and called a Florentine thermometer. In1661, BOYLE was shown such a Florentine thermometer by R. SOUTHWELL.24

Mercury was first substituted for alcohol in Florence, at Accademia del Ci-mento (academy of experiment), founded in 1657 by FERDINAND II and hisbrother Leopold,25 but it was in 1714 that FAHRENHEIT found a way to avoidmercury clinging to the glass,26 and introduced his scale of temperature, witha mixture of water, ice, and cooking salt at 0◦, a mixture of water and iceat 32◦ and boiling water at 212◦. REAUMUR had a scale in 1730 where wa-ter froze at 0◦ and boiled at 80◦.27 CELSIUS had a scale in 1742 with waterfreezing at 100◦ and boiling at 0◦ in 1741,28 and a few people are creditedfor inverting the scale as it is used today (and named after CELSIUS since1948, as it was called degrees centigrade before): J.-P. CRISTIN (in 1743),29

EKSTROM,30 LINNE (in 1745),31 and STROMER.32 An absolute temperaturescale was introduced in 1862, by THOMSON,33 later to become Lord Kelvin,and JOULE,34 and is now named after Lord Kelvin, where the temperature isobtained by adding 273.15 to the temperature in degrees Celsius.21 Santorio SANTORIO (SANCTORIUS of Padua), Italian physician, 1561–1636. He

had worked in Padova (Padua), Italy.22 Jean REY, French physician and chemist, 1583–1645.23 Ferdinando DE MEDICI, Italian statesman, 1610–1670. In 1621 he became Grand

Duke of Tuscany as FERDINAND II. He had lived in Firenze (Florence), Italy.24 Sir Robert SOUTHWELL, Irish-born diplomat, 1635–1702.25 Leopoldo DE MEDICI, Italian noble, 1617–1675. He was named cardinal in 1667.

He had lived in Firenze (Florence), Italy.26 Gabriel Daniel FAHRENHEIT, German-born physicist, 1686–1736. He had worked

in Amsterdam, The Netherlands.27 Rene Antoine FERCHAULT DE REAUMUR, French scientist, 1683–1757. He had

worked in Paris, France.28 Anders CELSIUS, Swedish astronomer, 1701–1744. He had worked in Uppsala,

Sweden.29 Jean-Pierre CRISTIN, French scientist, 1683–1755.30 Daniel EKSTROM, Swedish instrument maker, 1711–1755. He had worked in Up-

psala, Sweden.31 Carl LINNAEUS (Carl VON LINNE), Swedish naturalist, 1707–1778. He had worked

in Uppsala, Sweden.32 Marten STROMER, Swedish astronomer, 1707–1770. He had worked in Uppsala,

Sweden.33 William THOMSON, Irish-born physicist, 1824–1907. In 1892 he was made Baron

Kelvin of Largs, and thereafter known as Lord Kelvin. He had worked in Glasgow,Scotland.

34 James Prescot JOULE, English scientist, 1818–1889. He had lived in Manchester,England, being a brewer with an interest in science.

Page 34: From hyperbolic systems to kinetic theory: a personalized quest

8 1 Historical Perspective

Thinking that the temperature could be considered constant, it is naturalto consider that in the Euler equation the pressure p is a smooth functionof � (barotropic model) with dp

d� > 0. If one considers small and smoothperturbations around a constant solution u = u0 and � = �0, one may useGalilean invariance and assume that u0 = 0,35 and the linearized problemaround (0, �0) is then

∂�

∂t+

3∑

i=1

�0∂ui∂xi

= 0 (1.3)

from conservation of mass, and

�0∂uj∂t

+dp

d�(�0)

∂�

∂xi= 0 for j = 1, 2, 3, (1.4)

from balance of linear momentum, so that one has

∂2�

∂t2− dp

d�(�0)Δ� = 0, (1.5)

a wave equation where perturbations propagate at the velocity√

dpd�(�0). How-

ever, if one used the Boyle–Mariotte law p = A�, one found that√A is rather

different than the measured velocity at which perturbations propagate, thespeed of sound, first estimated by NEWTON.

Improving the Boyle–Mariotte law by measuring the effects of temperaturewas done by GAY-LUSSAC in 1802,36 whose law states that at fixed volumethe pressure is proportional to the absolute temperature (although the notionwas not defined yet), and he mentions a law found in 1787 (but not pub-lished) by CHARLES,37 that at constant pressure the volume is proportionalto the absolute temperature. In 1811, AVOGADRO stated his law,38 that equalvolumes of any two different gases at the same temperature and pressure con-tain an equal number of molecules,39 a number called the Avogadro number35 If a new frame moves at constant velocity a with respect to an initial frame,

then one uses x = x − t a in the new frame, so the new velocity is u(x, t) =

u(x, t)−a, but the change for any thermodynamical quantity f is f(x, t) = f(x, t);the Euler equation is invariant by such transformations (which form a group).Some authors mistakenly use the term Galilean invariance for other groups oftransformations, like the group of rotations in x space, in which case the correctqualifier is isotropic, and the Euler equation describes an isotropic fluid.

36 Joseph Louis GAY-LUSSAC, French physicist, 1778–1850. He had worked in Paris,France.

37 Jacques Alexandre Cesar CHARLES, French physicist, 1746–1823. He had workedin Paris, France.

38 Lorenzo Romano Amedeo Carlo AVOGADRO, Count of Quaregna and Cerreto,Italian physicist, 1776–1856. He had worked in Torino (Turin), Italy.

39 That law is not true at high pressure, where gases may liquefy.

Page 35: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 9

(6.0221367 ×1023) by PERRIN,40 who had measured it in relation with Brow-nian motion,41 and this had led to the law P V = nRT for perfect gases,where n is the number of moles, and R is the perfect gas constant (8.314joules per mole per kelvin).

In 1807, POISSON used a law p = C �γ ,42 which may have been suggestedby LAPLACE,43 and there is a value of γ which gives the measured valueof the speed of sound, but I doubt that LAPLACE or POISSON knew theexplanation that one teaches now in thermodynamics, related to adiabatictransformations.44 Working in a one-dimensional situation (the barrel of agun), POISSON used the equation

∂2w

∂t2− ∂

∂x

(f(∂w∂x

))= 0, (1.6)

with f(z) = C zγ , which is quasi-linear,45 and he studied special solutions(rarefaction waves), which he left in an implicit form.

This equation is related to the Lagrangian point of view (already intro-duced by EULER), where one follows material points; from an initial positiony one considers x = Φ(y, t) the solution of dx

dt = u(x(t), t) with x(0) = y(where u is the velocity field, supposed to be smooth enough), and while inthe (physical) Eulerian point of view one considers functions of x and t, the40 Jean Baptiste PERRIN, French physicist, 1870–1942. He received the Nobel Prize

in Physics in 1926, for his work on the discontinuous structure of matter, andespecially for his discovery of sedimentation equilibrium. He had worked in Paris,France.

41 Robert BROWN, Scottish-born botanist, 1773–1858. He had collected specimensin Australia, and then worked in London, England.

42 Simeon Denis POISSON, French mathematician, 1781–1840. He had worked inParis, France.

43 Pierre-Simon LAPLACE, French mathematician, 1749–1827. He had been madeCount in 1806 by NAPOLEON I and Marquis in 1817 by LOUIS XVIII. He hadworked in Paris, France. NAPOLEON I wrote in his memoir, written on St Helena,that he had removed LAPLACE from the office of minister of the interior, which heheld in 1799, after only six weeks, “because he brought the spirit of the infinitelysmall into the government”.

44 Intuitively, a wave propagates too fast for an equilibrium in temperature to takeplace, so the process is not isothermal (i.e. at constant temperature), and theBoyle–Mariotte law does not apply; as there is no time for heat exchange, theprocess is called adiabatic (i.e. without heat transfer), a term equivalent to isen-tropic (as the second law of thermodynamics is δQ = θ ds, and θ > 0, δQ = 0 isequivalent to ds = 0).

45 A semi-linear equation is linear in the highest-order derivatives with coefficientsindependent of lower-order derivatives, while a quasi-linear equation is linear inthe highest-order derivatives but with coefficients which may depend upon lower-order derivatives. For example, wtt − c2Δw = F (w,wt, wx1 , . . . , wxN ) is semi-linear, while wtt −A(w,wt, wx1 , . . . , wxN )Δw = F (w,wt, wx1 , . . . , wxN ) is quasi-linear.

Page 36: From hyperbolic systems to kinetic theory: a personalized quest

10 1 Historical Perspective

(mathematical) Lagrangian point of view expresses them as functions of y andt. In all dimensions N one has ∂f

∂t

∣∣y= ∂f

∂t +∑Ni=1 ui

∂f∂xi

; in one dimension one

has 1�(y,0)

∂f∂y

∣∣t= 1

�(x,t)∂f∂x . I do not find the Lagrangian point of view so useful

for fluids in more than one dimension, because of turbulence effects,46 and theLagrangian point of view is more often used for solids. The Lagrangian pointof view requires us to use the mathematical Piola stress tensor,47 also intro-duced by KIRCHHOFF,48 and called the Piola–Kirchhoff stress tensor, usuallynot symmetric, instead of the physical Cauchy stress tensor, always symmet-ric, which appears in the physical Eulerian point of view.49 In Lagrangiancoordinates, the Euler equation becomes ∂�

∂t + �2

�0∂u∂y = 0 and ∂u

∂t + 1�0

∂p∂y , which

imply ∂∂t

(�0�2∂�∂t

) − ∂∂y

(1�0

∂p∂y

)= 0, and w =

∫ y 1� satisfies an equation of the

type considered by POISSON in the case where �0 is constant.In 1848, CHALLIS noticed that there must be something wrong with the

formula derived by POISSON in the case of periodic initial data,50 and STOKES

explained that the profile of a solution was getting steeper and steeper until itapproached a discontinuous solution; he was then the first to derive the correctjump conditions for discontinuous solutions, as a consequence of conservationof mass and the balance of momentum. Jump conditions were rediscoveredlater by RIEMANN in his thesis in 1860,51 where he used conservation of mass,balance of momentum and conservation of entropy,52 instead of conservationof mass, balance of momentum and conservation of energy. It is importantto notice that Peter LAX has generalized some notions from gas dynamicsto other quasi-linear systems of conservation laws, and he has given a newmeaning to the term Riemann invariants, but also, extending the work donefor a scalar equation by Olga OLEINIK, and then Eberhard HOPF, he has46 Although turbulent flows are only said to occur in three dimensions, there are

effects of a similar type in two dimensions for fluids when one uses a more realisticphysical description (and exactly two-dimensional or one-dimensional flows areonly a mathematical approximation, of course), but turbulence is certainly notabout letting t tend to infinity (except possibly in an infinite region unchangedby rescaling in space).

47 Gabrio PIOLA, Italian mathematician, 1794–1850. He had worked in Milano (Mi-lan), Italy.

48 Gustav Robert KIRCHHOFF, German physicist, 1824–1887. He had worked inBreslau (then in Germany, now Wroc�law, Poland).

49 The appearance of plasticity or turbulence renders the Lagrangian point of viewproblematic, and numerical analysts tend to use a mixture or Eulerian and La-grangian points of view for this reason.

50 James CHALLIS, English astronomer, 1803–1882. He had worked in Cambridge,England.

51 Georg Friedrich Bernhard RIEMANN, German mathematician, 1826–1866. He hadworked in Gottingen, Germany.

52 The term (thermodynamic) entropy was only coined by CLAUSIUS in 1865, al-though the idea may go back to CARNOT, and RIEMANN must have used a func-tion of (thermodynamic) entropy without using the name entropy.

Page 37: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 11

used the term “entropy” in designing other functions not directly linked tothermodynamical entropy; the details of this important question will be ex-plained later, and meanwhile I shall add the qualifier thermodynamical whenreferring to the usual physical quantity appearing in the second law of thermo-dynamics. Nowadays, the jump conditions are called the Rankine–Hugoniotconditions,53,54 probably because STOKES did not reproduce his derivation of1848 of the jump conditions when he edited his complete works in 1880, apol-ogizing for his “mistake”, because he had been (wrongly) convinced by LordRayleigh,55 and by THOMSON (not yet Lord Kelvin) that his discontinuoussolutions were not physical, because they did not conserve energy. This showsthat none of them understood at that time that the missing energy had beentransformed into heat, but if one has learnt thermodynamics, one should notdisparage these great scientists of the 19th century for their curious mistake,and one should recognize that there are things which take time to understand.Actually, some mathematicians should pay more attention to what thermo-dynamics says, and by publishing too much on questions that they have notstudied enough, it tends to make engineers and physicists believe that math-ematicians do not know what they are talking about, and they should alsoobserve that thermodynamics is not a good name, as it is not about dynamicsbut about equilibria. By describing the various pieces of the puzzle that Ihave studied, and by pointing out the limitations that I know, an importantone being that the laws discovered experimentally by looking at equilibriaare used all the time, even out of equilibrium, I want to convince the readerthat one should try to go beyond the actual version of thermodynamics, andthat one should create a good theory for questions out of equilibrium; thatis usually treated by kinetic theory, the subject of these lectures, which hasother defects which I shall point out.

Because the mathematical model used has no temperature variable, thepart of the energy transformed into heat is apparently “lost”, and the firstway to correct this defect is to use a model which takes into account the firstlaw of thermodynamics, expressing the conservation of total energy by theintroduction of an internal energy (per unit mass) e (and de = −p d( 1

)+δQ for

a gas, where δQ is the heat received, which the second law of thermodynamicsrelates to thermodynamical entropy as δQ = θ ds, a form which may be dueto DUHEM,56 leading to the system of gas dynamics,53 William John Macquorn RANKINE, Scottish engineer, 1820–1872. He had worked

in Glasgow, Scotland.54 Pierre Henri HUGONIOT, French engineer, 1851–1887.55 John William STRUTT, third Baron Rayleigh (known as Lord Rayleigh), English

physicist, 1842–1919. He received the Nobel Prize in Physics in 1904, for hisinvestigations of the densities of the most important gases and for his discoveryof argon in connection with these studies. He had worked in Cambridge, England,holding the Cavendish professorship (1879–1884), after MAXWELL.

56 Pierre Maurice Marie DUHEM, French mathematician, 1861–1916. He had workedin Lille and in Bordeaux, France.

Page 38: From hyperbolic systems to kinetic theory: a personalized quest

12 1 Historical Perspective

∂�

∂t+

3∑

i=1

∂(� ui)∂xi

= 0, (1.7)

for conservation of mass,

∂(� uj)∂t

+3∑

i=1

∂(� ui uj)∂xi

+∂p

∂xj= 0 for j = 1, 2, 3, (1.8)

for the balance of linear momentum, and

∂t

(� |u|22

+ � e)

+3∑

i=1

∂xi

(� |u|2ui2

+ � e ui + p ui

)= 0, (1.9)

for the balance of energy. The unknowns are the velocity u and some ther-modynamical quantities, the density �, the pressure p and the internal energyper unit of mass e (in the absence of a force field, the total energy per unit ofmass is E = � |u|2

2 +e, but in a force field deriving from a potential V one mustadd V to the preceding quantity); of course, there are not enough equations,but there is a relation between �, p and e, given by the equation of state,which results from measurements of equilibria (and interpolation between themeasured values, of course). The model does not take into account the effectsof viscosity (which would appear in the three equations describing the bal-ance of momentum) and heat conductivity (which would appear in the lastequation describing the balance of energy). Energy cannot disappear in thismodel, because internal energy is supposed to take into account all the energytransformed into heat and stored inside the body (at a mesoscopic level), butthe analysis of this model will show that something else disappears, and thatwill involve thermodynamical entropy; for this question I shall show some ofthe general principles (i.e. valid for many other systems, but restricted to onespace dimension), which Peter LAX initiated in 1957.57

The question of appearance of discontinuities can be better described in asimpler model, the Burgers equation,

∂u

∂t+ u

∂u

∂x= 0 for x ∈ R, t > 0; u(x, 0) = u0(x) for x ∈ R, (1.10)

57 COURANT and FRIEDRICHS had written in 1948 a book on questions of shocks,which summarized many technical reports on questions which had been of impor-tance during World War II. Peter LAX once told me that he had once to lecture onthis subject, and instead of going through all the particular examples which weretreated in the book, he prefered to start by developing a general mathematicalframework encompassing all of them.

Page 39: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 13

where u has the dimension of a velocity.58 Using the method of characteristiccurves,59 the equation of a characteristic curve is dxdt = u(x(t), t) with x(0) = y,and then along this curve one has d(u(x(t),t))

dt = 0, so that the characteristiccurves are lines and the solution is given by

u(x(t), t) = u0(y) and x(t) = y + t u0(y), (1.11)

as long as it makes sense; based on the remark that y = x − t u, the implicitsolution found by POISSON was similar to writing a solution of the Burgersequation in the implicit form

u(x, t) = u0

(x− t u(x, t)

). (1.12)

Of course, the solution u is assumed to be smooth in this computation, and(1.7) shows that, apart from the case where u0 is nondecreasing, there cannotexist a smooth solution for all t > 0, but one can also deduce such a propertyfrom the implicit equation (1.8).60 What CHALLIS had noticed is similar toobserving that if u0(x) = sinx then the implicit equation cannot have a uniquesolution for all t; indeed u = 0 and x − t u = j π, i.e. x = j π, gives a fewsolutions, and u = 1 and x − t u(x, t) = 2k π + π

2 , i.e. x = t + 2k π + π2 gives

a few solutions, but one has trouble deciding between u = 0 and u = 1 ifone has j π = t + 2k π + π

2 for integers j, k, and this indeed happens (forall x ∈ R) for t = π

2 . It is simpler to use (1.7) and observe that if y1 < y2

but u0(y1) > u0(y2), the characteristic lines through y1 and y2 intersect at apositive time − y2−y1

u0(y2)−u0(y1) and that it is impossible to have a smooth solutionuntil that time as both u0(y1) and u0(y2) (which are different) compete to bethe value of u(x, t) for the point of intersection of the two characteristic lines;one deduces that the time of existence of a smooth solution is exactly

Tc =1β

if infx∈R

du0

dx(x) = −β < 0, (1.13)

58 Some people prefer to write the equation ∂v∂t

+ c v ∂v∂x

= 0 for a characteristicvelocity c, with v having no dimension.

59 For solving a first-order partial differential equation ∂u∂t

+∑N

i=1ai(x, t)

∂u∂xi

=

f(x, t, u), with initial data u0, one first computes for every y ∈ RN the char-

acteristic curve going through y, defined by the system of ordinary differentialequations dxi

dt= ai(x(t), t) for i = 1, . . . , N , and x(0) = y; then v(t) = u(x(t), t) is

the solution of a scalar differential equation dvdt

= f(x(t), t, v) with v(0) = u0(y).I do not know who developed the method in a precise way, perhaps CAUCHY whohad the first abstract theory of differential equations, but POISSON must haveunderstood that, because he had a good physical intuition, according to what Iheard about his work on the three-dimensional wave equation in the lectures ofLaurent SCHWARTZ, when I was a student at Ecole Polytechnique.

60 For example, if du0dx

> −α with α ≥ 0 then the implicit equation has a uniquesolution as long as 0 ≤ t < 1

α, because the function v �→ v − u0(x − t v) has a

derivative ≥ 1 − t α > 0; in particular if u0 is nondecreasing the solution existsfor all t > 0.

Page 40: From hyperbolic systems to kinetic theory: a personalized quest

14 1 Historical Perspective

because for t < 1β only one characteristic line goes through (x, t) whatever x is,

but if t > 1β (and u0 is bounded) there exists x with two different chracteristic

lines going through the point (x, t).

What should one do after the appearance of the first singularity? STOKES’sproposal is to accept discontinuous solutions, and that is related to consider-ing solutions in the sense of distributions, but for that one will have to useequations in conservative form, because u ux does not make sense when u isdiscontinuous, because ux has a Dirac mass at this point, and one can onlymultiply a Dirac mass by a function which is continuous at that point,61 andu is not, but one can define the derivative of u

2

2 in the sense of distributions.A way to see the difficulty is to consider a function v equal to −1 for x < 0and +1 for x > 0 (i.e. u = −1+2H , where H is the Heaviside function, whosederivative in the sense of distribution is the Dirac mass at 0), so that thederivative of u is ux = 2δ0; as u2 = 1 one has 3u2ux = 6δ0, but as u3 = u thederivative of u3 is 2δ0, one deduces (u3)x �= 3u2ux; one should be careful thenthat some nonlinear calculus rules are not allowed for discontinuous solutions.

[Taught on Monday August 27, 2001.]

Notes on names cited in footnotes for Chapter 1, NEEL,62 ZEEMAN,63 FUL-

LER,64

61 That is not exactly true because one can multiply a Dirac mass by any Borelfunction, and Borel functions are defined at every point (and are not equiva-lence classes, as locally integrable functions are). However, in order to solve par-tial differential equations one uses the theory of distributions, where Lebesgue-measurable functions are identified if they coincide almost everywhere (a functionis measurable if and only if it coincides with a Borel function outside a set ofLebesgue measure 0). If one thinks in terms of mathematics, one may well liketo use Borel functions, but the theory of distributions is adapted to the laws ofcontinuum mechanics and physics (at least for their linear equations), in partic-ular because of the point of view that I developed in the early 1970s, that weakconvergence is a good model for explaining the relations between different levels,microscopic/mesoscopic/macroscopic.

62 Louis Eugene Felix NEEL, French physicist, 1904–2000. He received the NobelPrize in Physics in 1970, for fundamental work and discoveries concerning anti-ferromagnetism and ferrimagnetism which have led to important applications insolid state physics, jointly with Hannes ALFVEN. He had worked in Strasbourg,and in Grenoble, France.

63 Pieter ZEEMAN, Dutch physicist, 1865–1943. He received the Nobel Prize inPhysics in 1902, jointly with Hendrik LORENTZ, in recognition of the extraor-dinary service they rendered by their research into the influence of magnetismupon radiation phenomena. He had worked in Leiden, and in Amsterdam, TheNetherlands.

64 John FULLER, English politician and philanthropist, 1757–1834.

Page 41: From hyperbolic systems to kinetic theory: a personalized quest

1 Historical Perspective 15

TOMONAGA,65 SCHWINGER,66 CHARLES X,67 BONAPARTE/NAPOLEON I,68

CLAUSIUS,69 CARNOT,70 COURANT,71 FRIEDRICHS,72 BOREL,73 LEBESGUE,74

and for the preceding footnotes, PURDUE,75 HARVARD.76

65 Sin-Itiro TOMONAGA, Japanese-born physicist, 1906–1979. He received the NobelPrize in Physics in 1965, jointly with Julian SCHWINGER and Richard FEYNMAN,for their fundamental work in quantum electrodynamics, with deep-ploughingconsequences for the physics of elementary particles. He had worked in Tokyo,Japan, in Leipzig, Germany, in Tsukuba, Japan, and at IAS (Institute for Ad-vanced Study), Princeton, NJ.

66 Julian Seymour SCHWINGER, American physicist, 1918–1994. He received theNobel Prize in Physics in 1965, jointly with Sin-Itiro TOMONAGA and RichardFEYNMAN, for their fundamental work in quantum electrodynamics, with deep-ploughing consequences for the physics of elementary particles. He had worked atUCB (University of California at Berkeley), Berkeley, CA, at Purdue University,West Lafayette, IN, and at Harvard University, Cambridge, MA.

67 Charles-Philippe de France, 1757–1836, comte d’Artois, duc d’Angouleme, pairde France, was King of France from 1824 to 1830 under the name CHARLES X.

68 Napoleon BONAPARTE, French general, 1769–1821. He became Premier Consulafter his coup d’etat in 1799, was elected Consul a vie in 1802, and he proclaimedhimself emperor in 1804, under the name NAPOLEON I (1804–1814, and 100 daysin 1815).

69 Rudolf Julius Emmanuel CLAUSIUS, German physicist, 1822–1888. He had workedin Berlin, Germany, in Zurich, Switzerland, in Wurzburg and in Bonn, Germany.

70 Sadi Nicolas Leonard CARNOT, French engineer, 1796–1832. He had worked inParis, France.

71 Richard COURANT, German-born mathematician, 1888–1972. He had worked inGottingen, Germany, and at NYU (New York University), New York, NY. Thedepartment of mathematics of NYU is named after him, the Courant Institute ofMathematical Sciences.

72 Kurt Otto FRIEDRICHS, German-born mathematician, 1901–1982. He had workedin Aachen and in Braunschweig, Germany, and at NYU (New York University),New York, NY.

73 Felix Edouard Justin Emile BOREL, French mathematician, 1871–1956. He hadworked in Lille and in Paris, France.

74 Henri Leon LEBESGUE, French mathematician, 1875–1941. He had worked inRennes, in Poitiers, and in Paris, France, holding a chair (mathematiques, 1921–1941) at College de France, Paris.

75 John PURDUE, American industrialist, 1802–1876. Purdue University, WestLafayette, IN, is named after him.

76 John HARVARD, English clergyman, 1607–1638. Harvard University, Cambridge,MA, is named after him.

Page 42: From hyperbolic systems to kinetic theory: a personalized quest

2

Hyperbolic Systems: Riemann Invariants,Rarefaction Waves

The book of COURANT and FRIEDRICHS [3] helped mathematicians enteringan important domain of continuum mechanics by collecting a lot of informa-tion scattered in the engineering literature, but the analysis done by PeterLAX in extracting a general mathematical framework out of it was crucial.When I looked at this book in the late 1970s for lectures that I was teach-ing, I noticed an historical section, and it had some influence on my interestin the history of ideas in mathematics, or in science in general, and when Ilectured on quasi-linear hyperbolic systems of conservation laws in the springof 1991 I tried to read some of the earlier texts that were mentioned there.Cathleen MORAWETZ,1 who had been asked to edit the book by her advisor(FRIEDRICHS) when she was a graduate student, told me a few years ago thatthe historical section was initially much larger, but had to be trimmed becausethe book was too long.

Questions of shock waves had been important in industrial or militaryapplications, and while COURANT and FRIEDRICHS had been involved onthe American side, with Peter LAX working out some of the mathematicalquestions, similar work had been done in USSR by Sergei GODUNOV andOlga OLEINIK,2 and there was some work in England and in France, wherethe mathematical community was not really aware of these questions. Aftersome initial mathematical work in the early 1950s on a scalar equation, whichwill be described later, it was time in the late 1950s to try to handle systemsin a mathematical way, and this is the trend that Peter LAX started; however,1 Cathleen SYNGE-MORAWETZ, Canadian-born mathematician, 1923. She works at

NYU (New York University), New York, NY. Her father, John SYNGE had beenthe head of the mathematics department at Carnegie Tech (Carnegie Institute ofTechnology), now CMU (Carnegie Mellon University), Pittsburgh, PA, from 1946to 1948.

2 Sergei Konstantinovich GODUNOV, Russian mathematician, born in 1929. Heworks at the Sobolev institute of mathematics of the Siberian branch of theRussian Academy of Sciences, Novosibirsk, Russia.

Page 43: From hyperbolic systems to kinetic theory: a personalized quest

18 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

one should be aware that numerical codes have been written since the early1950s on questions for which the mathematical understanding is missing, andby using experimental information and conjectures these codes perform quitewell, but Peter LAX tried to attack the problem from a mathematical pointof view, working on equations where the physical intuition might not exist.

There was a different group of mathematicians working on questions re-lated to viscous fluids, which give rise to partial differential equations ofparabolic type (because one cheats with physics by pretending that the fluidsare incompressible), the prototype being the Navier–Stokes equation,3 andthe first mathematical work was done by Jean LERAY in the 1930s,4 us-ing his work with SCHAUDER of extending the Brouwer topological degreeto an infinite-dimensional setting;5,6 the work was continued by EberhardHOPF, Olga LADYZHENSKAYA,7 and others in the 1960s, like Ciprian FOIAS,8

Jacques-Louis LIONS, and James SERRIN.9

Although real problems from continuum mechanics occur in three spacedimensions, the framework for quasi-linear hyperbolic systems mostly dealswith problems in one space dimension; even for linear hyperbolic systems, themultidimensional situation is much more difficult than the one-dimensionalcase. After understanding how to define and solve linear hyperbolic systems(in one space dimension), the elements of the general theory will be presented(Riemann problem, Riemann invariants, shocks and contact discontinuities,“entropies”), with the system of gas dynamics as an example.

One considers a linear system with constant coefficients3 NAVIER had introduced the equation in 1821 by a molecular approach, and it

was rederived more mathematically in 1843 by SAINT-VENANT and in 1845 bySTOKES.

4 Jean LERAY, French mathematician, 1906–1998. He received the Wolf Prize in1979, for pioneering work on the development and application of topological meth-ods to the study of differential equations, jointly with Andre WEIL. He hadworked in Nancy, France, in a prisoner of war camp in Austria (1940–1945), inParis, France, holding a chair (theorie des equations differentielles et fonction-nelles, 1947–1978) at College de France, Paris.

5 Juliusz Pawel SCHAUDER, Polish mathematician, 1899–1943. He had worked inLvov (then in Poland, now in Ukraine).

6 Luitzen Egbertus Jan BROUWER, Dutch mathematician, 1881–1966. He hadworked in Amsterdam, The Netherlands.

7 Olga Aleksandrovna LADYZHENSKAYA, Russian mathematician, 1922–2004. Shehad worked at the Steklov Mathematical Institute, in Leningrad, USSR, then StPetersburg, Russia. I first met her in 1991 in Bath, England.

8 Ciprian Ilie FOIAS, Romanian-born mathematician, born in 1933. He worked inBucharest, Romania, at Universite Paris-Sud, Orsay, France (where he was mycolleague in 1978–1979), at Indiana University, Bloomington, IN, and at TexasA&M, College Station, TX.

9 James B. SERRIN Jr., American mathematician, born in 1926. He works at Uni-versity of Minnesota, Minneapolis, MN.

Page 44: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 19

∂U

∂t+A

∂U

∂x= 0 for x ∈ R, t > 0; U(·, 0) = U0 in R, (2.1)

where U(x, t) is a vector with p components, and A is a p × p matrix inde-pendent of x, t or U . Every partial differential equation with constant co-efficients can be rewritten as such a system but I am only interested herein linear hyperbolic systems, which must exhibit an effect of finite speed ofpropagation. Another definition of hyperbolicity (in a given direction, whichis time in the physical examples) is that the Cauchy problem should be wellposed. Linear hyperbolic equations have been studied by Lars GARDING,10

Lars HORMANDER,11 Peter LAX, and Jean LERAY.

Definition 2.1. One says that the system is hyperbolic if A has only realeigenvalues and is diagonalizable; the system is said to be strictly hyperbolicif A has only distinct real eigenvalues (so that it is diagonalizable).

One orders the eigenvalues in increasing order

λ1 ≤ . . . ≤ λp, (2.2)

and one chooses a basis of eigenvectors rj , j = 1, . . . , p, i.e.

Arj = λjrj for j = 1, . . . , p, (2.3)

and also uses the dual basis lj , j = 1, . . . , p, i.e. lj(rk) = δj,k the Kroneckersymbol,12 for j, k = 1, . . . , p, so that

AT �k = λk�k for k = 1, . . . , p. (2.4)

Peter LAX calls the rj right eigenvectors and the �k left eigenvectors, andone may think of rj as a column vector and of �k as a row vector; of course,this is related to the fact that the rj belong to a vector space E = R

p,and the �k belong to its dual E′, and that no Euclidean structure on R

p

is necessary,13,14 so that it should not be identified with its dual. If A is10 Lars GARDING, Swedish mathematician, born in 1919. He worked at Lund Uni-

versity, Lund, Sweden.11 Lars HORMANDER, Swedish mathematician, born in 1931. He received the Fields

Medal in 1962, and the Wolf Prize in 1988, for fundamental work in modern anal-ysis, in particular, the application of pseudo-differential and Fourier integral oper-ators to linear partial differential equations, jointly with Friedrich HIRZEBRUCH.He worked in Stockholm, Sweden, at Stanford University, Stanford, CA, at IAS(Institute for Advanced Study), Princeton, NJ, and in Lund, Sweden.

12 Leopold KRONECKER, German mathematician, 1823–1891. He had worked inBerlin, Germany.

13 EUCLID of Alexandria, “Egyptian” mathematician, about 325 BCE–265 BCE. Itis not known where he was born, but he had worked in Alexandria, Egypt, shortlyafter it was founded by ALEXANDER the Great, in 331 BCE.

14 BCE = Before Common Era, CE = Common Era.

Page 45: From hyperbolic systems to kinetic theory: a personalized quest

20 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

hyperbolic, the explicit solution of the Cauchy problem is easily obtained bydecomposing the unknown vector U(x, t) on the basis of eigenvalues rj , j =1, . . . , p,

U(x, t) =p∑

j=1

uj(x, t)rj , (2.5)

and applying �k to the equation; as 〈�k, U〉 = uk and 〈�k, AU〉 = 〈AT �k, U〉 =λkuk, one finds that

∂uk∂t

+ λk∂uk∂x

= 0, k = 1, . . . , p, (2.6)

giving uk(x, t) = uk(x− λkt, 0) = 〈�k, U0(x− λkt)〉, and one deduces that thesolution is given by the formula

U(x, t) =p∑

j=1

〈�j , U0(x− λjt)〉 rj , x, t ∈ R. (2.7)

This formula shows that the eigenvalues of the matrix A are velocities, calledcharacteristic velocities, of propagation of some particular modes correspond-ing to the eigenvectors rj (the dual basis is only a technical tool). For aquasi-linear system, one starts with a similar definition.

Definition 2.2. The quasi-linear system (shown here with an initial datum)

∂U

∂t+A(U)

∂U

∂x= 0 for x ∈ R, t > 0; U(·, 0) = U0 in R (2.8)

is hyperbolic if for U in a domain D ⊂ Rp,15 the matrix A(U) has real

eigenvalues and is diagonalizable for every U ∈ D, and strictly hyperbolicif A(U) has real distinct eigenvalues for every U ∈ D (which one ordersλ1(U) < . . . < λp(U)).

Assuming that A has distinct eigenvalues and is a smooth function in D,then the eigenvalues are smooth functions and one may define a basis of righteigenvectors rj(U), j = 1, . . . , p which are smooth functions, and the dualbasis of left eigenvectors is then also smooth.15 In gas dynamics, the density � is nonnegative, and the case where � = 0 is re-

lated to cavitation and creates mathematical difficulties; the internal energy e isnonnegative, and it designates a part of the energy which is hidden at mesoscopiclevel; the pressure is nonnegative, and it is interpreted in kinetic theory as re-sulting from particles bouncing on the boundary of the container and exchangingmomentum with it (negative pressures like suction involve viscosity).

Page 46: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 21

Is then the system of gas dynamics strictly hyperbolic? One considers thesimple case where u2 = u3 = 0,16 which is a reasonable hypothesis when thegas moves in a small tube at slow velocities, so that the system is

�t + (� u)x = 0,(� u)t + (� u2 + p)x = 0,(� u2

2 + � e)

t+(� u3

2 + � e u+ p u)

x= 0,

(2.9)

and if one assumes that the solution (�, u, e) is smooth in (x, t) and that theequation of state gives p as a smooth function of (�, e), one can rewrite thesystem by noticing that

((� u)t + (� u2 + p)x

)− u(�t + (� u)x

)= � ut + � u ux + px, (2.10)

and((

�u2

2 + � e)

t+(�u3

2 + � e u + p u))

+(u2

2 − e)(�t + (� u)x

)

−u((� u)t + (� u2 + p)x

)= � et + � u ex + p ux,

(2.11)

so that, as long as � > 0, one finds the system

�t + u �x + � ux = 0,ut + u ux + 1

� px = 0,et + u ex + p

� ux = 0,(2.12)

which for Ue =

⎝�ue

⎠ belonging to the domain De = (0,∞) × R × (0,∞)

corresponds to A(Ue) given by

A(Ue) =

⎜⎝

u � 01�∂p∂�

∣∣∣e

u 1�∂p∂e

∣∣∣�

0 p� u

⎟⎠ = u I +

⎜⎝

0 � 01�∂p∂�

∣∣∣e

0 1�∂p∂e

∣∣∣�

0 p� 0

⎟⎠, (2.13)

and the fact that A(Ue) − u I only depends upon the thermodynamical vari-ables (�, e) is related to the Galilean invariance of the system of gas dynamics.The characteristic polynomial of A(Ue) − u I is −λ3 + λ

(∂p∂�

∣∣e+ p�2∂p∂e

∣∣�

)= 0,

so the gas dynamics system is strictly hyperbolic if and only if one has

c2 =∂p

∂�

∣∣∣e+p

�2

∂p

∂e

∣∣∣�> 0, (2.14)

16 Without this condition, the system is hyperbolic but not strictly hyperbolic. Thetwo added components of velocity, u2, u3 solve the equations (uj)t + u1(uj)x =0 for j = 2, 3, so that the eigenvalue u1 has multiplicity 3. The equation of

balance of energy is(

� |u|22

+ � e)

t+(

� |u|2u12

+ � e u1 + p u1

)x

= 0, and it gives

� et + �u1 ex + p (u1)x = 0.

Page 47: From hyperbolic systems to kinetic theory: a personalized quest

22 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

and c > 0 is the local speed of sound; the eigenvalues are then

λ1(Ue) = u− c; λ2(Ue) = u; λ3(Ue) = u+ c, (2.15)

and one may choose as right eigenvectors

r1(Ue) =

⎝−�2

� c−p

⎠ ; r2(Ue) =

⎜⎝

∂p∂e

∣∣∣�

0−∂p∂�

∣∣∣e

⎟⎠ ; r3(Ue) =

⎝+�2

� c+p

⎠ . (2.16)

The computations are made simpler if one uses the first and second law ofthermodynamics, so that

de = −p d(1�

)+ θ ds. (2.17)

Multiplying the equation in � by p�2 and adding to the equation in u gives

θ(st + u sx

)= 0, and as θ > 0, one deduces that

st + u sx = 0. (2.18)

If one assumes now that the equation of state gives p as a smooth function of(�, s), one considers the system

�t + u �x + � ux = 0,ut + u ux + 1

� px = 0,st + u sx = 0,

(2.19)

which for Us =

⎝�us

⎠ belonging to the domain Ds = (0,∞) × R × R corre-

sponds to A(Us) given by

A(Us) =

⎜⎝

u � 01�∂p∂�

∣∣∣s

u 1�∂p∂s

∣∣∣�

0 0 u

⎟⎠ = u I +

⎜⎝

0 � 01�∂p∂�

∣∣∣s

0 1�∂p∂s

∣∣∣�

0 0 0

⎟⎠ , (2.20)

and again the fact that A(Us)− u I only depends upon the thermodynamicalvariables (�, s) is related to Galilean invariance. As the velocity of propagationshould not depend upon the basis used for the space U , one finds the sameeigenvalues

λ1(Us) = u− c; λ2(Us) = u; λ3(Us) = u+ c, (2.21)

but the local velocity of sound c is given by the simpler formula (giving thesame value than before)

c2 =∂p

∂�

∣∣∣s, (2.22)

Page 48: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 23

and one may choose the right eigenvectors as

r1(Us) =

⎝−�c0

⎠ ; r2(Us) =

⎜⎝

−∂p∂s

∣∣∣�

0∂p∂�

∣∣∣s

⎟⎠ ; r3(Us) =

⎝+�c0

⎠ . (2.23)

One should notice that this computation has not relied on what θ is butonly on θ �= 0, so that if one replaces s by ϕ(s) and θ by θ

ϕ′(s) with a functionϕ such that ϕ′ > 0 for example, one obtains the same result; actually theequation for s implies the conservation law

(� f(s)

)t+(� u f(s)

)x

= 0, (2.24)

for all smooth functions f , and it is not clear at this point why the ther-modynamical entropy s cannot be replaced by ϕ(s).17 It is this property ofthermodynamical entropy, that it corresponds to new conserved quantities forsmooth solutions, that led Peter LAX to call any conserved quantity an “en-tropy” (which I often qualify as mathematical, so that the uninformed readerwill not be mistaken); one should notice that looking for “entropies” does notrequire a system to be hyperbolic, and classical results in this direction areoften related to a theorem of A. NOETHER.18

The laws of thermodynamics for a gas have given de = −p d( 1�

)+ θ ds,

which implies that∂s

∂�

∣∣∣e= − p

θ �2;∂e

∂s

∣∣∣�=

1θ, (2.25)

and∂e

∂�

∣∣∣s=

p

�2;∂e

∂s

∣∣∣�= θ; (2.26)

writing a function f as f(�, e(�, s)

), one has

17 The experimental facts have shown that when two separate bodies are put incontact, heat flows from one body to the other if they do not have the same tem-perature (and heat flows from the hotter one to the colder one), and equilibriumoccurs when the temperatures of the two bodies coincide, and not when theirvalues of θ

ϕ′(s) coincide; however, this requires some heat conductivity, which ismissing in the model considered here. It is for this kind of reason that one shouldbe careful about the properties of a system of equations that one uses for describ-ing physical reality, and one has to prove that the system has a property which isobserved, and if it is lacking some real property it does not mean that the modelis useless, but it points out in what situations the model should be used and inwhat other situations the model should not be used.

18 Amalie (Emmy) NOETHER, German-born mathematician, 1882–1935. She hadworked in Gottingen, Germany, and then in Bryn Mawr, PA.

Page 49: From hyperbolic systems to kinetic theory: a personalized quest

24 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

∂f∂�

∣∣∣s= ∂f

∂�

∣∣∣e+ p�2∂f∂e

∣∣∣�

∂f∂s

∣∣∣�= θ ∂f∂e

∣∣∣�

(2.27)

and writing a function f as f(�, s(�, e)

), one has

∂f∂�

∣∣∣e= ∂f

∂�

∣∣∣s− pθ �2

∂f∂s

∣∣∣�

∂f∂e

∣∣∣�= 1

θ∂f∂s

∣∣∣�.

(2.28)

Definition 2.3. The Riemann problem is a particular case of the Cauchyproblem, where the initial datum U0 has the form

U0(x) ={U− if x < 0U+ if x > 0. (2.29)

Although the computations done previously on the linear case seem tohave assumed some smoothness of the solution, they are actually true in thesense of distributions and one may take for U0 any measurable and locallyintegrable function (or any distribution, if one likes); in the linear case thesolution of the Riemann problem is then piecewise constant, of the form

U(x, t) =

⎧⎨

a0 = U− for x < λ1taj for λjt < x < λj+1t and 1 ≤ j ≤ p− 1ap = U+ for λpt < x

(2.30)

showing that the initial discontinuity splits in general into p discontinuities,propagating at one of the characteristic velocities, and there is such a discon-tinuity propagating at velocity λj if and only if 〈�j , U+ − U−〉 �= 0.

The solution of the Riemann problem is slightly different for the quasi-linear situation. There are sectors of the (x, t) plane where the solution isconstant, but these sectors do not cover the whole plane in general, and indescribing the solution one may need some sectors where the solution changescontinuously, the centred rarefaction waves, whose study involves the Riemanninvariants, and one may also need some discontinuities; two types of discon-tinuities may occur, contact discontinuities as in the linear case, and shocks.Which shocks are acceptable is a difficult question, related to explaining howto deal with irreversible phenomena, and understanding the terms “entropies”and entropy conditions will be a part of the answer.

To describe the regular parts in the solution of the Riemann problem,Peter LAX introduced the notion of Riemann invariants, which generalizewhat RIEMANN had done on a particular example, of course.

Definition 2.4. A function w, defined on the domain D ⊂ Rp, is called a

j-Riemann invariant (for some j ∈ {1, . . . , p}) if it satisfies

〈∇w(U), rj(U)〉 = 0 in D. (2.31)

Page 50: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 25

The equation for a j-Riemann invariant is a differential equation, for thevector field rj , and one can apply the method of characteristic curves to finda local solution if the value of w is given on a noncharacteristic surface S, i.e.a surface S such that for every U ∈ S the vector rj(U) is not tangent to S.The characteristic curves are obtained by solving the differential system

dV

dτ= rj(V (τ)

), (2.32)

and along each of these curves (which are defined locally because rj is smooth)a j-Riemann invariant must be constant. If w1, w2, . . . , wk are j-Riemanninvariants, then for every smooth function h of k variables, h(w1, w2, . . . , wk)is also a j-Riemann invariant, and one can describe locally all the j-Riemanninvariants if one knows p − 1 functions on a noncharacteristic hypersurfaceS whose differentials are linearly independent, and the independence staystrue along the characteristic curve for the corresponding j-Riemann invariantsthat they define, a classical result for linear ordinary differential equations;indeed, if

∑pk=1(rj)k

∂w∂xk

= 0, then deriving in x gives∑p

k=1(rj)k∂2w

∂xk∂x�+

∑pk=1

∂(rj)k

∂x�

∂w∂xk

= 0, i.e. M(τ) = ∇w(V (τ)

)satisfies a linear differential

equation dMdτ +B(τ)M(τ) = 0, where the matrix B has entries B,k = ∂(rj)k

∂x�,

and one deduces that if M vanishes at some value of τ it must vanish for allvalues of τ .

Which are the Riemann invariants for the system of gas dynamics? Usingthe variables (�, u, s), the equation for a 1-Riemann invariant is −� ∂w∂�+c ∂w∂u =0, which has two independent solutions, one being w = s and another onebeing w = u + g(�, s), where g satisfies ∂g

∂� = c� (so g is defined modulo

a function of s, and as functions of s are 1-Riemann invariants it does notmatter much), and then

the general 1-Riemann invariant is h(u+ g(�, s), s) with ∂g∂� = c

for an arbitrary smooth function h;(2.33)

the equation for a 2-Riemann invariant is −∂p∂s

∣∣�∂w∂� + ∂p

∂�

∣∣s∂w∂s = 0, which has

two independent solutions, one being w = u and another one being w = p, so

the general 2-Riemann invariant is h(u, p)for an arbitrary smooth function h; (2.34)

and, either by repeating the same computation, or by using the fact thata change of orientation of the x axis exchanges the order of eigenvalues (asit changes their sign) and changes u into −u, the formulas for 1-Riemanninvariants give 3-Riemann invariants by changing u into −u, so

the general 3-Riemann invariant is h(u− g(�, s), s) with ∂g∂� = c

for an arbitrary smooth function h.(2.35)

Page 51: From hyperbolic systems to kinetic theory: a personalized quest

26 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

Again, one sees an interesting property of functions of s, that they areboth 1-Riemann invariants and 3-Riemann invariants, so that the surfacess = constant are well defined without having to invoke thermodynamics, buts could be replaced by an arbitrary function of s (so one cannot discoverthe special role of temperature from the equations); this also shows that thesystem of gas dynamics is special, because the vector fields r1 and r3 sat-isfy an integrability condition,19 of a type studied by CLEBSCH,20 and byFROBENIUS.21

Definition 2.5. A regular solution U of the system in an open set Ω of the(x, t) plane is called a j-simple wave (or a j-wave) if, for every j-Riemanninvariant w, w

(U(x, t)

)is constant in Ω.

It means that the values taken by U in the open set Ω are all taken fromone integral curve of the vector field rj , so if one has a parametrization V (τ)of the integral curve, it just means that τ is a function of (x, t), so that

U(x, t) = V(τ(x, t)

), with

dV

dτ= rj(V (τ)

)and Arj = λjrj , (2.36)

so that τ satisfies∂τ

∂t+ λj

(V (τ)

)∂τ∂x

= 0, (2.37)

an equation already considered in the special case of the Burgers equation; thesame computation as before shows that the characteristic curves are straightlines, where τ is constant.

The solution of the Riemann problem has some constant sectors and mayhave some sectors where the solution is smooth and not constant, and it isthen a j-wave where all the characteristic lines go through 0, a centred wave(of course, the solution may have different sectors, corresponding to j-waveswith different values of j).

The reason that one looks for centred waves is a question of invariance.Because the equation is invariant by the group of transformations (x, t) �→19 The commutator of the operators A1 and A3 of derivation in the directions r1

and r3 should be a linear combination of A1 and A3. If a nonzero vector fieldv in R

N is given and one wants to find a hypersurface which is perpendicularto v at each point, it means that one looks for a smooth function f such that

v = c grad f with a function c �= 0, from which one deduces that ∂vi∂xj

− ∂vj

∂xi=

∂c∂xj

∂f∂xi

− ∂c∂xi

∂f∂xj

= ∂d∂xj

vi − ∂d∂xi

vj for all i, j, where d = log c, and conversely,

∂vi∂xj

− ∂vj

∂xi= ∂d

∂xjvi − ∂d

∂xivj for all i, j implies ∂(e−dvi)

∂xj− ∂(e−dvj)

∂xi= 0 for all i, j,

so that e−dvi = ∂w∂xi

for all i, locally.20 Rudolf Friedrich Alfred CLEBSCH, German mathematician, 1833–1872. He had

worked in Berlin, in Karlsruhe, in Giessen and in Gottingen, Germany.21 Ferdinand Georg FROBENIUS, German mathematician, 1849–1917. He had

worked in Zurich, Switzerland, and in Berlin, Germany.

Page 52: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 27

(k x, k t) and the initial datum U0 of the Riemann problem is also invariantby the same transformations, one first looks for a solution which is invariantby these transformations,22 i.e. a solution of the form

U(x, t) = W(xt

). (2.38)

Denoting σ = xt one finds that the equation says

A(W (σ)

)W ′(σ) = σW ′(σ), (2.39)

so that when W ′(σ) �= 0 it should be an eigenvector and σ a correspondingeigenvalue; as the system is strictly hyperbolic one must have σ = λj

(W (σ)

)

for a fixed value of j around the point, and one finds then that W ′(σ) must beproportional to rj

(W (σ)

), so that the values taken by W are on an integral

curve of the vector field rj , and U is then a j-simple wave; however, thereis more to it, because one must move along this curve in such a way thatσ = λj

(W (σ)

)is satisfied, and that is not always possible. In order to move

along the integral curve and satisfy σ = λj(V (σ)

), one must look at the way

λj varies along the curve, and this question led Peter LAX to the followingdefinition.

Definition 2.6. The jth characteristic field is said to be linearly degeneratein D if one has

〈∇λj(U), rj(U)〉 = 0 for all U ∈ D. (2.40)

The jth characteristic field is said to be genuinely nonlinear in D if one has

〈∇λj(U), rj(U)〉 �= 0 for all U ∈ D, (2.41)

and one may assume that 〈∇λj(U), rj(U)〉 = 1 in D, by multiplying rj by anonzero function.23

Of course, there are intermediate cases, and one should not think of havingunderstood the general case by treating only these two extreme possibilities,22 There are cases of equations invariant by a group, with solutions only invariant

by a subgroup (or not invariant at all if the subgroup is restricted to identity);for example, for the eigenvalue problem −Δu = λu with Dirichlet boundarycondition on all the boundary (or with Neumann boundary condition on all theboundary) of a disc, where the problem and the boundary data are invariant byrotation, but for particular values of λ there are solutions of the form fn(r) cosnθ,with n �= 0 (and fn is related to Bessel functions). In the case of an evolutionproblem like here, the question is different because one expects existence anduniqueness of a solution (if one finds the right way to define what kind of solutionone is looking for), and the solution must then inherit the invariance.

23 If one changes the normalization of the eigenvectors rj , the j-Riemann invariantsstay the same, as well as the integral curves of the vector field rj , and only theirparametrization changes.

Page 53: From hyperbolic systems to kinetic theory: a personalized quest

28 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

but the theory is much simpler if each field is either genuinely nonlinear orlinearly degenerate, and it was natural that Peter LAX had first consideredthis case; actually, if it was not the case that for the system of gas dynamicsthe second field is linearly degenerate, one could have thought that such acondition was purely artificial and only occurred for linear systems.

For the system of gas dynamics, using the description in (�, u, s), the quan-tity for the first characteristic field is

(− ∂c∂�

)(−�) + 1 c, while for the third

characteristic field it is(+ ∂c∂�

)(+�) + 1 c, and for the second it is 0, as r2 has

its second component 0. One deduces that the first and third characteristicfields are genuinely nonlinear if and only if

∂(� c)∂�

= �∂c

∂�+ c �= 0, or equivalently

∂2(� p)∂�2

�= 0, (2.42)

because �(∂2(� p)

∂�2

)= �2 ∂

2p∂�2 + 2� ∂p∂� =

∂(�2 ∂p

∂�

)

∂� = ∂(�2c2)∂� .

Assuming that the jth characteristic field is genuinely nonlinear and thatone has normalized rj(U) by 〈∇λj(U), rj(U)〉 = 1, so that if dV

dτ = rj(V (τ)

)

one deduces that d(λj(V (τ)))dτ = 1, and the constraint λj

(V (τ)

)= σ is then

easy to implement; moving along an integral curve of the normalized vectorfield rj from τ1 to τ2 with τ1 < τ2 corresponds to the following centred j-wavesolution:

U(x, t) = V (τ1) for x < λj(V (τ1)

)t

U(x, t) = V(xt − λj

(V (τ1)

)+ τ1

)for λj

(V (τ1)

)t < x < λj

(V (τ2)

)t

U(x, t) = V (τ2) for x > λj(V (τ2)

)t,

(2.43)

which is a smooth (locally Lipschitz continuous) solution of the Riemannproblem with U− = V (τ1) and U+ = V (τ2). The preceding solution is alsocalled a rarefaction wave, as when t increases the nonconstant part of thesolution spreads over larger and larger intervals in x.

By using the invariance of the equation by scaling (with k = −1) andtranslation in time, one sees that if T0 > 0 and U is a solution of the equation∂U∂t +A(U)∂U∂x = 0 for t > 0, then U defined by

U(x, t) = U(−x, T0 − t) for x ∈ R, 0 < t < T0, (2.44)

is a solution; if one applies this procedure to the rarefaction wave found above,one finds a compression wave, which starts from a smooth (Lipschitz contin-uous) datum at t = 0, but converges at t → T0 to the datum of a translatedRiemann problem with U(x, T0) = V (τ2) for x < 0 and U(x, T0) = V (τ1) forx > 0.

One has seen then that using the j-waves permits one to move in thespace R

p along special curves but only in some directions, so that this does

Page 54: From hyperbolic systems to kinetic theory: a personalized quest

2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves 29

not solve the general case of the Riemann problem; in other cases one must usediscontinuities, either contact discontinuities or shocks, which will be discussednext.

[Taught on Wednesday August 29, 2001.]

Notes on names cited in footnotes for Chapter 2, SYNGE,24 CARNEGIE,25

MELLON,26 DE SAINT-VENANT,27 WEIL,28 STEKLOV,29 FOURIER,30 HIRZE-

BRUCH,31 ALEXANDER the Great,32 DIRICHLET,33 F.E. NEUMANN,34

BESSEL.35

24 John Lighton SYNGE, Irish mathematician, 1897–1995. He had worked in Toronto(Ontario), at OSU (Ohio State University), Columbus, Ohio, and at CarnegieTech (Carnegie Institute of Technology), now CMU (Carnegie Mellon University),Pittsburgh, PA, where he had been the head of the mathematics department from1946 to 1948, and in Dublin, Ireland.

25 Andrew CARNEGIE, Scottish-born businessman and philanthropist, 1835–1919.Besides endowing the school that became Carnegie Institute of Technology andlater Carnegie Mellon University when it merged with the Mellon Institute ofIndustrial Research, he funded about three thousand public libraries, namedCarnegie libraries in United States.

26 Andrew William MELLON, American financier and philanthropist, 1855–1937.He had founded the Mellon Institute of Industrial Research in Pittsburgh, PA,which merged in 1967 with the Carnegie Institute of Technology to form CarnegieMellon University.

27 Adhemar Jean Claude BARRE DE SAINT-VENANT, French mathematician, 1797–1886. He had worked in Paris, France.

28 Andre WEIL, French-born mathematician, 1906–1998. He received the Wolf Prizein 1979, for his inspired introduction of algebro-geometry methods to the theory ofnumbers, jointly with Jean LERAY. He had worked in Aligarh, India, in Haverford,PA, in Swarthmore, PA, in Sao Paulo, Brazil, in Chicago, IL, and at IAS (Institutefor Advanced Study), Princeton, NJ.

29 Vladimir Andreevich STEKLOV, Russian mathematician, 1864–1926. He hadworked in Kharkov and in St Petersburg (then Petrograd), Russia. The SteklovMathematical Institute in St Petersburg, Russia, is named after him.

30 Jean-Baptiste Joseph FOURIER, French mathematician, 1768–1830. He hadworked in Auxerre, in Paris, France, accompanied BONAPARTE in Egypt, wasprefect in Grenoble, France, until the fall of NAPOLEON I, and worked in Parisagain. The first of three universities in Grenoble, Universite de Grenoble I, isnamed after him, and the Institut Fourier is its department of mathematics.

31 Friedrich HIRZEBRUCH, German mathematician, born in 1927. He received theWolf Prize in 1988, for outstanding work combining topology, algebraic and differ-ential geometry, and algebraic number theory; and for his stimulation of mathe-matical cooperation and research, jointly with Lars HORMANDER. He workedin Erlangen, Germany, at Princeton University, Princeton, NJ, and in Bonn,Germany.

(footnotes 32 to 35 on next page)

Page 55: From hyperbolic systems to kinetic theory: a personalized quest

30 2 Hyperbolic Systems: Riemann Invariants, Rarefaction Waves

32 Alexandros Philippou Makedonon, 356–323 BCE, was King of Macedon asALEXANDER III, and is referred to as ALEXANDER the Great, in relation tothe large empire that he conquered.

33 Johann Peter Gustav LEJEUNE DIRICHLET, German mathematician, 1805–1859.He had worked in Breslau (then in Germany, now Wroc�law, Poland), in Berlinand in Gottingen, Germany.

34 Franz Ernst NEUMANN, German mathematician, 1798–1895. He had worked inKonigsberg (then in Germany, now Kaliningrad, Russia).

35 Friedrich Wilhelm BESSEL, German mathematician, 1784–1846. He had workedin Konigsberg (then in Germany, now Kaliningrad, Russia).

Page 56: From hyperbolic systems to kinetic theory: a personalized quest

3

Hyperbolic Systems: Contact Discontinuities,Shocks

If the jth characteristic field is linearly degenerate, i.e. 〈∇λj(U), rj(U)〉 = 0for U ∈ D, then on every integral curve of dV

dτ = rj(V (τ)

)the eigenvalue λj

is constant, and one cannot construct (nonconstant) centred wave solutions,i.e. of the form U

(xt

)and taking values in one integral curve.

However, if λ∗j is the constant value of λj on a given integral curve whichis parametrized by V (τ) (for τ1 ≤ τ ≤ τ2), one can construct solutions of theform

U(x, t) = V(f(x− λ∗j t)

)for λ∗j t+ z1 < x < λ∗j t+ z2 and

f smooth in (z1, z2) with τ1 ≤ f(·) ≤ τ2.(3.1)

Indeed, Ux = f ′(x − λ∗j t)dVdτ and Ut = −λ∗jf ′(x − λ∗j t)

dVdτ , and dV

dτ is aneigenvector of A

(V (τ)

)for the eigenvalue λ∗j , showing that U is a solution.

One can then construct a sequence of smooth functions fn converging to adiscontinuous function f∞ which takes the value τ1 for z < 0 and τ2 forz > 0 (for example fn(z) = τ1 for z < 0, fn(z) = (1 − n z)τ1 + n z τ2 for0 < z < 1

n and fn(z) = τ2 for z > 1n ), and the corresponding Un is a sequence

of smooth (Lipschitz continuous) solutions of the equation, which convergesto a discontinuous function U∞ defined by

U∞(x, t) = V (τ1) if x < λ∗j t; U∞(x, t) = V (τ2) if x > λ∗j t. (3.2)

If the system is in conservative form, i.e.

A(U) = ∇F (U), so that∂U

∂t+A(U)

∂U

∂x= 0 is written as

∂U

∂t+∂F (U)∂x

= 0,

(3.3)then Un → U∞ and F (Un) → F (U∞) strongly (in Lploc strong for everyp ∈ [1,∞) for example), and passing to the limit in the sense of distributionsgives

∂U∞∂t

+∂(F (U∞)

)

∂x= 0, (3.4)

Page 57: From hyperbolic systems to kinetic theory: a personalized quest

32 3 Hyperbolic Systems: Contact Discontinuities, Shocks

but the writing as A(U∞)∂U∞∂x does not make sense. However, although the

meaning of a discontinuous solution is not clear, one tends to accept U∞ as agood discontinuous solution, because it is a strong limit of smooth solutions;notice that one then accepts the discontinuous function jumping from V (τ1)for x < λ∗j t to V (τ2) for x > λ∗j t, as well as the discontinuous function jumpingfrom V (τ2) for x < λ∗j t to V (τ1) for x > λ∗j t. In the conservative case, wherethe notion of a discontinuous solution is clearer, these discontinuous functionsare weak solutions, which satisfy then the Rankine–Hugoniot condition, andthey are particular cases of j-contact discontinuities, according to the followingdefinitions and properties.

Definition 3.1. A function U defined in an open set Ω of the (x, t) plane isa weak solution of the system in conservative form Ut +

(F (U)

)x

= 0 if Uand F (U) are Lebesgue-measurable and locally integrable in Ω and satisfy theequation in the sense of distributions in Ω, i.e.

Ω

(U∂ϕ

∂t+ F (U)

∂ϕ

∂x

)dx dt = 0 for all ϕ ∈ C∞

c (Ω), (3.5)

where C∞c (Ω) is the space of infinitely smooth functions with compact support

in Ω;1 U is a weak solution of the Cauchy problem

Ut +(F (U)

)x

= f for x ∈ R, t > 0, and U(·, 0) = U0 in R, (3.6)

if

∫R×(0,∞)

(−U ∂ϕ

∂t − F (U) ∂ϕ∂x)dx dt =

∫R×(0,∞) f ϕ dx dt +

∫RU0ϕ(·, 0) dx

for all ϕ ∈ C∞c (R2),

(3.7)where f is given, locally integrable in R

2, and U0 is given, locally integrablein R.

Smooth solutions of Ut +(F (U)

)x

= 0 are weak solutions, of course, as isseen by multiplying the equation by ϕ and integrating by parts (notice that ϕis scalar, but as U and F (U) are vectors, the equation is an equality betweenvectors, and it could be written separately for each component); a precise

1 A few mathematicians are afraid of this space, probably because they rememberthat Laurent SCHWARTZ had described a precise topology, which makes the dualbe the space of distributions; none of this nonelementary theory is needed ingeneral, and here it is just a set of test functions. One difficult question forquasi-linear hyperbolic systems concerns shocks, and one must understand enoughabout distributions to realize that F (U) is defined because U is a function, butthat it has no meaning for general distributions, and that once F (U) is locallyintegrable its derivative makes sense as a distribution, which is the linear mappingϕ �→ −

∫F (U)ϕx dx, but that A(U)Ux does not make sense as a distribution in

general.

Page 58: From hyperbolic systems to kinetic theory: a personalized quest

3 Hyperbolic Systems: Contact Discontinuities, Shocks 33

regularity condition needed for this integration by parts to be valid is thatthe components of U and those of F (U) belong to the Sobolev space W 1,1

loc ,i.e. functions who partial derivatives (in the sense of distributions) are locallyintegrable.

In the case where the solution is piecewise smooth, with a curve of discon-tinuity, the concept of a weak solution makes the Rankine–Hugoniot conditionappear, a condition which could as well have been called after STOKES andRIEMANN, as they had proven such conditions before RANKINE or HUGONIOT

were involved in this question. In [3], COURANT and FRIEDRICHS also men-tioned the work of EARNSHAW,2 who extended in 1860 the computationsof STOKES to a more general equation of state; EARNSHAW pointed to abook by PARRY,3 and he must have understood that a supersonic effect hadunknowingly been observed.4

Proposition 3.2. Let Ω be an open subset of the (x, t) plane, cut by a smoothcurve x = g(t), defining two nonempty open subsets Ω− = {(x, t) ∈ Ω | x <g(t)} and Ω+ = {(x, t) ∈ Ω | x > g(t)}. Assume that U is a smooth solutionof Ut +

(F (U)

)x

= 0 in Ω− which extends into a continuous function on Ω−with limits U(g(t)−, t) on the curve, and that U is also a smooth solution ofUt+(F (U)

)x

= 0 in Ω+ which extends into a continuous function on Ω+ withlimits U(g(t)+, t) on the curve; then U is a weak solution of Ut+

(F (U)

)x

= 0

2 Samuel EARNSHAW, English mathematician and clergyman, 1805–1888. He hadworked in Cambridge and in Sheffield, England.

3 Sir William Edward PARRY, English rear admiral and arctic explorer, 1790–1855.4 Thanks to the interlibrary loan system, I obtained a microfilm of the book, and

I read the corresponding appendix. It was during the first of three voyages ofPARRY to find the north-west passage, in 1819–1820, but that there would bethree tentatives was not written in the narrative of the first voyage, of course,and I learnt that information later, from the Internet. During the winter, PARRY

had to perform a few scientific experiments, and one of them was to measurethe velocity of sound at low temperature; he mentioned that someone had mea-sured the velocity of sound in Calcutta, India, with a temperature above 100◦

Fahrenheit. While his boat (Hecla) was stuck in the ice, he walked away with thephysician, and had his lieutenant order a sailor to fire one of the ten guns, andthey both measured the time between the flash of the detonation and the soundof the detonation, with chronometers precise to one fifth of a second. PARRY re-ported that one day, just after hearing the detonation, they had heard distinctlythe order fire, which had preceded the detonation; PARRY had wondered whatcould have happened and what was special on that day, and he mentioned thatthe barometer was very low. EARNSHAW must have understood that the sound ofthe detonation had started supersonically and had overtaken the preceding sound,without erasing it. What was obviously not understood at the time is that thevelocity of sound depends upon temperature and pressure (and the percentage ofhumidity in the air, which must have been important in the Calcutta measure-ment), so all these measurements of velocity had been done with a reading of thetemperature, but without a reading of the pressure!

Page 59: From hyperbolic systems to kinetic theory: a personalized quest

34 3 Hyperbolic Systems: Contact Discontinuities, Shocks

in Ω if and only if the following Rankine–Hugoniot condition is satisfied:

F(U(g(t)+, t)

)− F(U(g(t)−, t)

)= g′(t)

(U(g(t)+, t) − U(g(t)−, t)

)

almost everywhere along the curve. (3.8)

Proof : The usual notation is to write [f ] for the jump of any quantity f alongthe curve, i.e. f(g(t)+, t)−f(g(t)−, t), and denote by s the velocity g′ at whichthe discontinuity moves (except when the thermodynamical quantity s is alsoinvolved), and then write the Rankine–Hugoniot condition as

[F (U)] = s [U ], (3.9)

which is an equality between vectors, of course.For ϕ ∈ C∞

c (Ω), one decomposes∫Ω

(U ∂ϕ

∂t + F (U) ∂ϕ∂x)dx dt into a term∫

Ω−and a term

∫Ω+

, and each term is then integrated by parts; for exam-

ple,∫Ω−

(U ∂ϕ

∂t + F (U) ∂ϕ∂x)dx dt = − ∫

Ω−

(∂U∂t + ∂F (U)

∂x

)ϕdxdt+

∫∂Ω−

(U nt +F (U)nx)ϕd� =

∫∂Ω−

(U nt + F (U)nx)ϕd�, where n is the exterior normal toΩ− (used only on the part of the curve intersecting the support of ϕ); simi-larly, an integration by parts is performed for Ω+, with the important remarkthat on the curve the exterior normal to Ω+ is exactly the opposite of theexterior normal to Ω−, so that by adding the two terms one sees that U isa weak solution in Ω if and only if

∫∂Ω−

([U ]nt + [F (U)]nx)ϕd� = 0 for allϕ ∈ C∞

c (Ω); this means that [U ]nt + [F (U)]nx = 0 almost everywhere alongthe curve, and the normal to Ω− on the curve is given by

(nxnt

)

=1

√1 + (g′)2

(1

−g′)

, (3.10)

from which the Rankine–Hugoniot condition follows. ��The proof actually shows that the statement is true if the curve is Lipschitz

continuous, if both U and F (U) belong to W 1,1(Ω−)∩W 1,1(Ω+), and if theysatisfy the equation Ut +

(F (U)

)x

= 0 in Ω− and in Ω+ (in the sense ofdistributions), because the trace theorem asserts the existence of some notionof limits on the curve and the integration by parts formula holds (notice thatthis argument is valid for F continuous, in which case g′ could be infinite atsome points).

In the case of the Burgers equation, where U is scalar and F (U) = U2

2 , thediscontinuity must propagate at the speed U−+U+

2 , and it will be seen laterthat only the discontinuities with U− > U+ are “physical”. In the generalone-dimensional scalar case, for ut +

(f(u))x

= 0 the condition is that s =f(u+)−f(u−)

u+−u−.

KRUZHKOV has extended some properties known in one dimension to the

multidimensional scalar equation ∂u∂t +

∑Ni=1

∂(fi(u))

∂xi= 0, but the case N ≥ 2

Page 60: From hyperbolic systems to kinetic theory: a personalized quest

3 Hyperbolic Systems: Contact Discontinuities, Shocks 35

is not very physical, as it implies a strong anisotropy for space. I had heardof this fact long before the work of KRUZHKOV, and the remark was at-tributed to Rene THOM,5 who argued that one needs a vector unknown, andhe was actually thinking that equations of Hamilton–Jacobi type were a nat-ural generalization,6 while another generalization is obviously a system likegas dynamics, where there is a velocity field u and the space is isotropic but aderivative in the direction of u is natural; however, such an anisotropy existsalso in the case of effects arising at the interface between two materials, butthe domain of validity to be considered is limited to a small region in space,of course, moving eventually with a (shock) wave.

In the case of the system of gas dynamics, studied by HUGONIOT afterthe preliminary work of STOKES, EARNSHAW and RIEMANN,7 the Rankine–Hugoniot conditions are

[� u1

]= D

[�]

[� (u1)2 + p

]= D [� u1][

� u1u2

]= D

[� u2

]and[� u1u3

]= D

[� u3

][� |u|2u1

2 +� e u1+p u1

]= D

]� |u|2

2 +� e]

(3.11)

where one writes [f ] for the jump of any quantity f , and one uses D for thevelocity of the discontinuity, in order to avoid any confusion with the ther-modynamical entropy s (used for entropy per unit of mass); using Galileaninvariance, i.e. moving at velocityD, one may consider the simpler (but equiv-alent) question where D = 0, so that

moving at the velocity of the discontinuity, so that it appears stationary,�+u1+ = �−u1− = Q, flux of mass through the discontinuity,Q u1+ + p+ = Qu1− + p−,if Q �= 0, u2+ = u2− and u3+ = u3−,Q |u1+|2

2 +Qe+ + p+u1+ = Q |u1−|22 +Qe− + p−u1−.

(3.12)The case Q = 0 is easily checked to give u1 and p continuous through thediscontinuity, and this corresponds to a contact discontinuity, as u1 and p arethe corresponding Riemann invariants; all such contact discontinuities are ac-cepted as physical, because, as was seen in the general case, such discontinuousweak solutions are strong limits of smooth solutions.5 Rene THOM, French mathematician, 1923–2002. He received the Fields Medal

in 1958. He had worked in Grenoble, in Strasbourg, and at IHES (Institut desHautes Etudes Scientifique) at Bures-sur-Yvette, France.

6 Carl Gustav Jacob JACOBI, German mathematician, 1804–1851. He had worked inKonigsberg (then in Germany, now Kaliningrad, Russia) and in Berlin, Germany.

7 In 1848, STOKES did not use the equation of balance of energy, and EARNSHAW

was following him in 1860, the same year in which RIEMANN did his indepen-dent work, but it seems that RIEMANN had worked with the wrong system, withentropy being conserved instead of energy; HUGONIOT’s work dates from 1889.

Page 61: From hyperbolic systems to kinetic theory: a personalized quest

36 3 Hyperbolic Systems: Contact Discontinuities, Shocks

The case Q �= 0 is more technical to analyse, and some discontinuities arerejected, but the reasons for rejection are complex, and they are primarilyrelated to the laws of thermodynamics, but if they make sense for the partic-ular system of gas dynamics that I am considering, Peter LAX had to interpretwhat could be more general rules, valid for all quasi-linear hyperbolic systems,and he certainly took in consideration the question of stability of shocks, andthe understanding of the scalar case.

In the scalar case, there is a complete theory, and I shall describe laterthe conditions imposed by Peter LAX, which are necessary conditions for ad-missibility of shocks, and the conditions imposed by Olga OLEINIK, which arenecessary and sufficient, and the equivalent formulation by Eberhard HOPF,but one should observe that for the general scalar equation ut +

(f(u))x

= 0,Galilean invariance only occurs for f(v) = v2

2 +a v+ b, so that only the Burg-ers equation is a physically relevant model. For systems, a good completetheory is missing, and some conditions are named after Peter LAX, some afterConstantine DAFERMOS, and some after Tai-Ping LIU.8

The general definition that Peter LAX introduced, j-shocks and j-contactdiscontinuities, is as follows.

Definition 3.3. A discontinuous solution of the form

U(x, t) ={U− for x < x0 + s t,U+ for x > x0 + s t,

(3.13)

satisfying the Rankine–Hugoniot condition

F (U+) − F (U−) = s(U+ − U−), (3.14)

is a j-shock satisfying the Lax condition if

λj(U−) ≥ s ≥ λj(U+) (3.15)

andλj−1(U−) < s < λj+1(U+), (3.16)

forgetting the corresponding inequality for indices 0 or p+1, so that a j-shockcannot be a k-shock for k �= j. It is a j-contact discontinuity if

λj(U−) = s = λj(U+), (3.17)

but one talks of a left j-contact discontinuity if λj(U−) = s, and of a rightj-contact discontinuity if s = λj(U+).

8 Tai-Ping LIU, Chinese-born mathematician. He has worked at University of Mary-land, College Park, MD, at NYU (New York University), New York, NY, and atStanford University, Stanford, CA.

Page 62: From hyperbolic systems to kinetic theory: a personalized quest

3 Hyperbolic Systems: Contact Discontinuities, Shocks 37

In the last few years, shocks which do not satisfy (3.16) have been studied,and called overcompressive shocks, but that seems to be a specialty of peoplewho do not have much interest in continuum mechanics, because they do notseem to care that the model used by engineers which they mention contradictsthermodynamics, not in a way that seems reasonable, and some of them seemto play too much with the scalar one-dimensional equation in more than onevariable, without ever saying that it has no physical grounding. It seems thatone must be a good mathematician, like Peter LAX, to identify a mathematicalgeneralization of a problem from continuum mechanics or physics which isboth interesting from a mathematical point of view and a good testing groundfor understanding more about continuum mechanics or physics.

The reason for being so interested in shocks is that they are related toirreversibility, and irreversibility is often connected to the laws of thermody-namics, which it would be important to understand mathematically in a bet-ter way. The verb understand has a different meaning in mathematics than inother branches of science (as mathematics is a part of science) or engineering;in the present context, it is not about learning the rules of thermodynamicsand applying them correctly (which one can do with a reasonable amount ofeffort, as for many other games invented by physicists or engineers), it is aboutdiscussing which part of the rules can be deduced from more basic principlesof physics, and how to replace the other parts by more precise mathematicalrules, and go beyond the rules of equilibrium thermodynamics.

Of course, many consider the Boltzmann equation as an answer, and themathematical properties of the Boltzmann equation are a part of the subjectof these lectures, because it is a classical model of kinetic theory, but theBoltzmann equation has a similar defect than thermodynamics, because it ispostulated and irreversibility has already been put by force into the model,so that it cannot be used for studying how irreversibility occurs.

It is important for mathematicians to understand the limitations of themodels used, and to avoid pretending that one has proven something whichhas actually been postulated, a process usually referred to as a vicious circle.

In order to study how irreversibility occurs it is important to start fromconservative models which are reversible, and to explain why in some limitsomething seems to have been lost, and to study if something has really beenlost, or if a better account can be given of what other models declare lost. Com-paring various approaches to irreversibility seems then a good first step, andquasi-linear hyperbolic systems of conservation laws are particularly suited forthat purpose, because an important example is the system of gas dynamicswhich deals precisely with the type of questions that were used for guessingthe laws of thermodynamics in the first place.

One has postulated that the equation of state is always valid, but theequation of state of a gas had been discovered by looking only at various equi-libria, and it is a very questionable hypothesis to assume that it also appliesto the evolution problem. It is important to understand if there is a natural

Page 63: From hyperbolic systems to kinetic theory: a personalized quest

38 3 Hyperbolic Systems: Contact Discontinuities, Shocks

definition of what a solution can mean, and the concept of weak solutionscan be considered basic in order to define what discontinuous solutions mean,but it will be seen that too many weak solutions exist, and a choice must bemade, and up to now the choice has been purely local, accepting some shocksand rejecting others by looking only at the limiting states on both sides ofthe discontinuity. Because one does not know enough (at the moment, andfrom a mathematical point of view) about quasi-linear hyperbolic systems ofconservation laws, the description will not be conclusive, but knowing abouta few pieces of the puzzle is probably useful for the future solution of thisdifficult question, and a good way to learn more about the classical aspectsis to consult the recent book [4] of Constantine DAFERMOS. After one moredefinition for systems, I shall turn my attention towards the case of a scalarequation (in one-dimensional space), a question which had been completed inthe 1970s.

Definition 3.4. An “entropy” ϕ, together with an entropy flux ψ, for thesystem Ut + A(U)Ux = 0, is a pair of smooth scalar functions (defined in adomain D where A is defined) satisfying

∇ϕ(V )A(V ) = ∇ψ(V ) for V ∈ D. (3.18)

The important property is that any smooth solution U of Ut+A(U)Ux = 0automatically satisfies

(ϕ(U)

)t+(ψ(U)

)t= 0.

As was mentioned before, the choice of the term entropy in this definitionof Peter LAX may be a little confusing, which is why I often add the qualifiermathematical; for example, a system of conservation laws Ut +

(F (U)

)x

= 0always has the (trivial) entropies Uj, for j = 1, . . . , p, with flux Fj(U), so forthe system of gas dynamics, one has the trivial entropies �, � uj for j = 1, 2, 3,and � |u|2

2 +� e (so mass, momentum, and energy are mathematical entropies),but there are some nontrivial entropies like � f(s) for an arbitrary smoothfunction f , where s is the thermodynamical entropy per unit of mass.

[Taught on Friday August 31, 2001.]

Page 64: From hyperbolic systems to kinetic theory: a personalized quest

4

The Burgers Equation and the 1-D Scalar Case

The Burgers equation,

ut + u ux = 0 in R × (0,∞), with u(·, 0) = u0 in R, (4.1)

first appeared in the work of BATEMAN in 1915,1 but it was forgotten.BURGERS reintroduced it in the 1940s with reference to turbulence, and Eber-hard HOPF immediately pointed out that it was not the size of the velocity inthe fluid that creates turbulence, as BURGERS had seemed to think, but thefluctuation in velocity in the fluid, because of Galilean invariance, and thatis indeed a basic fact about turbulence that everyone agrees upon.2 EberhardHOPF first solved the equation in 1950, by considering for ε > 0 the equation

ut + u ux − ε uxx = 0 in R × (0,∞), with u(·, 0) = u0 in R, (4.2)

now known as the Burgers–Hopf equation, and then letting ε tend to 0.Although the added term corresponds to a viscosity effect, one calls this

approach the method of artificial viscosity, because one often adds regularizingterms for purely mathematical reasons, in which case the qualifier artificial isa way to point out that one does not claim that the model used has a soundphysical interpretation. It is sometimes useful to invent nonphysical models inorder to overcome a technical difficulty that one has encountered in the studyof a physical model, i.e. one which at some moment is supposed to give a gooddescription of a part of physical reality; what one should avoid is to let suchnonphysical models pass in describing a real situation, and when problemstake many years to be solved it is useful to remind younger people about thereasons which had led to the introduction of the various models.1 Harry BATEMAN, English-born mathematician, 1882–1946. He had worked in Liv-

erpool, in Manchester, England, in Bryn Mawr, PA, at Johns Hopkins University,Baltimore, MD, and at Caltech (California Institute of Technology), Pasadena,CA, named Throop College at the time he arrived.

2 One may wonder then about the reasons why some people have recently used theterm “Burgers turbulence”!

Page 65: From hyperbolic systems to kinetic theory: a personalized quest

40 4 The Burgers Equation and the 1-D Scalar Case

Eberhard HOPF found a way to transform (4.2) into a linear heat equation,3

by what is now called the Hopf–Cole transformation,4 because Julian COLE

discovered it independently a little later.5 This change of unknown comesnaturally if one introduces the function U defined by

U(x, t) =∫ x

−∞u(y, t) dy for x ∈ R, t ≥ 0, (4.3)

so that if the solution u is smooth and integrable, one obtains

Ux = u; Ut +U2x

2− εUxx = 0, (4.4)

so that U is a potential associated to the conservation law given by the Burgersequation. Then, using the observation that if V satisfies Vt − ε Vxx = 0 thenf(V ) satisfies

(f(V )

)t− ε(f(V )

)xx

+ ε f ′′(V )(Vx)2 = 0, one sees that theequation for U is satisfied by U = f(V ) if one has 2ε f ′′ = (f ′)2, which oneintegrates immediately into 2ε

f ′(V ) = C−V , so that choosing C = 0 the formulafor u = Ux = f ′(V )Vx becomes the Hopf–Cole transformation

u =−2ε VxV

. (4.5)

Another reason to use U is that ut + u ux = 0 transforms into a Hamilton–Jacobi equation Ut + U2

x

2 = 0, and one can then solve the equation by classi-cal techniques of calculus of variations, like those of CARATHEODORY,6 whohad introduced the method of dynamic programming, long before RichardBELLMAN made it popular.7

3 Fourier’s law, that the flux of heat is proportional (and opposite) to the gradientof temperature has been postulated, as well as Fick’s law for diffusion of mass,and although the parabolic equations that they lead to are quite popular, onemust notice the nonphysical effect that heat may travel arbitrarily fast, and oneshould consider that they are approximations corresponding to having let thevelocity of light c tend to ∞, and this will be studied in more detail later.

4 Julian David COLE, American mathematician, 1925–1999. He had worked at Cal-tech (California Institute of Technology), Pasadena, CA, at UCLA (University ofCalifornia at Los Angeles), Los Angeles, CA, and at RPI (Rensselaer PolytechnicInstitute), Troy, NY.

5 Ten years after the work of BATEMAN, who had studied the case ε→ 0, FORSYTH

had already introduced the “Hopf–Cole” transformation.6 Constantin CARATHEODORY, German mathematician (of Greek origin), 1873–

1950. He had worked in Gottingen, in Bonn, and in Hanover, Germany, in Breslau(then in Germany, now Wroc�law, Poland), and in Berlin, Germany. After WorldWar I, he worked in Athens, Greece, in Smyrna (then in Greece, now Izmir,Turkey), and in Munchen (Munich), Germany.

7 Richard Ernest BELLMAN, American mathematician, 1920–1984. He had workedat USC (University of Southern California), Los Angeles, CA.

Page 66: From hyperbolic systems to kinetic theory: a personalized quest

4 The Burgers Equation and the 1-D Scalar Case 41

This was the first approach which permitted one to see which discontinuoussolutions were approached when ε tends to 0, but the Hopf–Cole transforma-tion does not extend to scalar equations ut +

(f(u))x

= 0 for a general f ,which were studied for mathematical reasons,8 but appeared to be a goodtraining ground for understanding about admissibility conditions for shocks.The same approach of adding viscosity was also used by Olga OLEINIK, usingmore traditional compactness arguments for proving existence of a solution,and she proved uniqueness for solutions satisfying a one-sided inequality

ux ≤ 1t, (4.6)

or more general inequalities of the type ux ≤ E(t), which also hold for ut +(f(u))x

= 0 for some strictly convex f ; for example, f ′′ ≥ α > 0 impliesux ≤ 1

a t . The maximum principle is used for proving this inequality, as wellas others, and if u solves ut + u ux − ε uxx = 0, then v = ux satisfies theequation vt + u vx + v2 − ε vxx = 0, so if v(·, 0) ≤ a0 one has v(·, t) ≤ a(t),where a′ + a2 = 0 and a(0) = a0, i.e. a(t) = a0

1+t a0which is ≤ 1

t if a0 > 0 and≤ 0 if a0 ≤ 0. This bound cannot be improved, because for an initial datumwhich is bounded and Lipschitz continuous, the solution of ut + u ux = 0obtained by the method of characteristic curves is also bounded and Lipschitzcontinuous for an interval of time, and along a characteristic curve (whichis the straight lines x = y + t u0(y) for y ∈ R), the function v satisfies thedifferential equation v′+v2 = 0, i.e. v

(x(t), t

)= v0

1+t v0. Of course, the existence

part requires bounds, and if u0 ∈ L∞(R), then u stays bounded in L∞(R),while if u0 ∈ BV (R), then ux stays bounded in L1(R).

Another method is to use a numerical approximation, by finite differences,using the Lax–Friedrichs scheme. In finite-difference schemes one uses a meshsize Δx in space and a mesh size Δt in time, and one discovers that onecannot take Δt too large; it is standard to denote by Uni the approximation ofu(iΔx, nΔ t), and the Lax–Friedrichs scheme for the equation ut+

(f(u))x

=0 is the explicit scheme

1Δt

(Un+1i − Uni−1 + Uni+1

2

)+

12Δx

(f(Uni+1) − f(Uni−1)

)= 0, (4.7)

which must be supplemented by giving the initial data U0i for all i, for example,

U0i =

1Δx

∫ iΔx+ Δ x2

iΔx−Δ x2

u0 dx. (4.8)

The condition that Δt should satisfy a bound in terms of Δx is calleda Courant–Friedrichs–Lewy condition,9 abbreviated as CFL condition. This8 Despite the popularity of this model, the Galilean invariance only occurs for

f(v) = v2

2+ a v + b, so that one should wonder if other fs correspond to any

physical situation.9 Hans LEWY, German-born mathematician, 1904–1988. He received the Wolf Prize

in 1984, for initiating many, now classic and essential, developments in partial

Page 67: From hyperbolic systems to kinetic theory: a personalized quest

42 4 The Burgers Equation and the 1-D Scalar Case

elementary condition arises for numerical approximations of hyperbolic equa-tions, where there is a finite speed of propagation, and expresses the necessaryfact that the numerical domain of dependence must contain the exact domainof dependence if one wants the scheme to converge. For example, if one looksat a linear equation ut + a ux = 0, where the local speed of propagation is a,then if one has α ≤ a(x, t) ≤ β, the solution u(x, t) is equal to u0(y) wherey is the base point of the characteristic curve going through (x, t) (assumingsmoothness of a so that the characteristic curves are defined in a unique way),and one has x − β t ≤ y ≤ x − α t; on the other hand a numerical schemelike the Lax–Friedrichs scheme has a speed of propagation Δx

Δ t as the value ofUn+1i depends upon Uni−1 and Uni+1, so that depends upon the values of U0

j

for i− n ≤ j ≤ i+ n (but not all of them, and only the values with i+ j + neven are involved); the CFL condition in this case is Δx

Δ t ≥ max{|α|, |β|}, andif it is not true there is a constant a such that the sequence of approximationswith Δx and Δt converging to 0 while keeping a fixed ratio (which is whatone usually does for hyperbolic equations) does not converge to the solution.In the nonlinear problem ut +

(f(u))x

= 0, one is lucky that a maximumprinciple holds and that if M− ≤ u0 ≤M+ then the desired solution satisfiesM− ≤ u(x, t) ≤M+ for almost every x ∈ R, t > 0; as the problem is formallyut+f ′(u)ux = 0, one takes a = f ′(u), so that one needs to look at the boundsof f ′ on the interval [M−,M+], and this leads to the CFL condition

(max

v∈[M−,M+]|f ′(v)|

)Δt ≤ Δx, (4.9)

and then one writes the (explicit) scheme as

Un+1i =

(12Uni−1−

Δt

2Δxf(Uni−1)

)+(1

2Uni+1+

Δt

2Δxf(Uni+1)

)= G(Uni−1, U

ni+1),

(4.10)and one notices that G(v, w) is order preserving in v and w (i.e. Gv ≥ 0and Gw ≥ 0) if M− ≤ v, w ≤ M+, as the corresponding derivatives involvequantities like 1

2 ± f ′(z)Δt2Δx , which is ≥ 0 for z ∈ [M−,M+]. A consequence is

if M− ≤ Uni ≤M+ for all i, then M− ≤ Un+1i ≤M+ for all i; (4.11)

indeed, due to the order-preserving property of G, the value of Un+1i =

G(Uni−1, Uni+1) is a minimum when Uni−1 and Uni+1 are replaced by M−, which

gives a value M− for Un+1i , and a maximum when Uni−1 and Uni+1 are replaced

by M+, which gives a value M+ for Un+1i . One has seen that the order relation

on R plays a crucial role, and this will be seen again in obtaining a bound inBV (R); unfortunately, nothing of that sort is known for the case of generalsystems.

differential equations, jointly with Kunihiko KODAIRA. He had worked inGottingen, Germany, at Brown University, Providence, RI, and at UCB (Uni-versity of California at Berkeley), Berkeley, CA.

Page 68: From hyperbolic systems to kinetic theory: a personalized quest

4 The Burgers Equation and the 1-D Scalar Case 43

It has been noticed that the semi-group defined by the equation ut +(f(u))x

= 0 is a contraction semi-group in L1(R), by Barbara KEYFITZ,10

and the work of KRUZHKOV is related; when I heard of the work of PhilippeBENILAN on nonlinear contraction semi-groups in L1,11 I noticed that allthe examples could be treated by techniques of order preserving, and when Iarrived in Madison, WI, in the fall of 1974, I showed my argument to MichaelCRANDALL,12 and he proved the other part of our Lemma 4.1; later, withAndrew MAJDA he noticed applications to numerical schemes.13

Lemma 4.1. (Crandall–Tartar)14 Let Ω,Ω′ be endowed with nonnegativeRadon measures dμ, dμ′,15 and let X be a subset of L1(Ω; dμ) stable byinf (or by sup); let S be a mapping from X into L1(Ω′; dμ′) satisfying∫Ω′ S(v) dμ′ =

∫Ωv dμ for all v ∈ X; then the following two properties are

equivalent:i) S is order preserving, i.e. v, w ∈ X and v ≤ w almost everywhere (for

dμ) implies S(v) ≤ S(w) almost everywhere (for dμ′),ii) S is a contraction in L1, i.e.

∫Ω′ |S(v) − S(w)| dμ′ ≤ ∫Ω |v − w| dμ for

all v, w ∈ X.

Proof : For v, w ∈ X , let z = inf{v, w}, so that z ≤ v and z ≤ w, and (i) impliesS(z) ≤ S(v) and S(z) ≤ S(w), from which one deduces

∫Ω′ |S(v)−S(w)| dμ′ ≤∫

Ω′ |S(v) − S(z)| dμ′ +∫Ω′ |S(z) − S(w)| dμ′, but

∫Ω′ |S(v) − S(z)| dμ′ =∫

Ω′(S(v)− S(z)

)dμ′ =

∫Ω

(v − z) dμ and∫Ω′ |S(z)− S(w)| dμ′ =

∫Ω′(S(w) −

S(z))dμ′ =

∫Ω(w − z) dμ and

∫Ω(v − z) dμ+

∫Ω(w − z) dμ =

∫Ω |v − w| dμ.

10 Barbara Lee KEYFITZ, Canadian-born mathematician, born in 1944. She workedat Columbia University, New York, NY, in Princeton, NJ, at Arizona State Uni-versity, Tempe, AZ, in Houston, TX, and at the Fields Institute for Research inMathematical Sciences, Toronto, Ontario.

11 Philippe M. A. BENILAN, French mathematician, 1940–2001. He had worked inBesancon, France.

12 Michael Grain CRANDALL, American mathematician, born in 1940. He workedat Stanford University, Stanford, CA, at UCLA (University of California at LosAngeles), Los Angeles, CA, at University of Wisconsin, Madison, WI, and heworks now at UCSB (University of California at Santa Barbara), Santa Barbara,CA.

13 Andrew Joseph MAJDA, American mathematician, born in 1949. He worked atUCB (University of California at Berkeley), Berkeley, CA, at Princeton Univer-sity, Princeton, NJ, and at NYU (New York University), New York, NY.

14 Luc Charles TARTAR, French-born mathematician, born in 1946. I worked at Uni-versite Paris IX Dauphine, Paris, France, at Universite Paris-Sud, Orsay, at CEA(Commissariat a l’Energie Atomique), Limeil, France, and at CMU (CarnegieMellon University), Pittsburgh, PA.

15 Johann RADON, Czech-born mathematician, 1887–1956. He had worked in Ham-burg, in Greifswald and in Erlangen, Germany, in Breslau (then in Germany, nowWroc�law, Poland) before World War II, and after 1947 in Vienna, Austria.

Page 69: From hyperbolic systems to kinetic theory: a personalized quest

44 4 The Burgers Equation and the 1-D Scalar Case

Assume ii) and let v, w ∈ X with v ≤ w, then∫Ω′ |S(w) − S(v)| dμ′ ≤∫

Ω|w − v| dμ =

∫Ω

(w − v) dμ =∫Ω′(S(w) − S(v)

)dμ′, so that one must have

S(w) ≥ S(v) almost everywhere (for dμ′). ��The Lax–Friedrichs scheme is conservative, i.e. one has

∑i U

n+1i =

∑i U

ni

for every n ≥ 0, so one uses Lemma 4.1 with counting measures and the factthat it is order preserving gives∑

i

|Un+1i − V n+1

i | ≤∑

i

|Uni − V ni | so that it is ≤∑

i

|U0i − V 0

i |. (4.12)

Using the invariance by translation, for example taking V 0i = U0

i+1 for everyi, one has V ni = Uni+1 for every i and every n ≥ 0, from which one deduces

i

|Un+1i+1 −Un+1

i | ≤∑

i

|Uni+1 −Uni | so that it is ≤∑

i

|U0i+1 − U0

i |, (4.13)

giving a BV (R) bound if the bounded variation of the initial approximationstays bounded; one deduces easily from the scheme a bound for an approxi-mation of ut.

[Taught on Wednesday September 5, 2001 (Monday September 3 was LaborDay).]

Notes on names cited in footnotes for Chapter 4, HOPKINS,16 THROOP,17

FICK,18 RENSSELAER,19 FORSYTH,20 KODAIRA.21

16 Johns HOPKINS, American financier and philanthropist, 1795–1873. Johns Hop-kins University, Baltimore, MD, is named after him.

17 Amos Gager THROOP, American businessman and politician, 1811–1894.18 Adolph Eugen FICK, German physiologist/physicist, 1829–1901. He had worked

in Zurich, Switzerland, and in Wurzburg, Germany.19 Kilean VAN RENSSELAER, Dutch merchant, c. 1580–1644. The Rensselaer Poly-

technic Institute (RPI), Troy, NY, is named after him.20 Andrew Russell FORSYTH, Scottish mathematician, 1858–1942. He worked in

Cambridge, England, UK, holding the Sadleirian chair of Pure Mathematics(1895–1910), and in London, England, UK.

21 Kunihiko KODAIRA, Japanese mathematician, 1915–1997. He received the WolfPrize in 1984, for his outstanding contributions to the study of complex mani-folds and algebraic varieties, jointly with Hans LEWY. He had worked in Tokyo,Japan, at IAS (Institute for Advanced Study), Princeton, NJ, at Harvard Univer-sity, Cambridge, MA, at Johns Hopkins University, Baltimore, MD, at StanfordUniversity, Stanford, CA, and again in Tokyo.

Page 70: From hyperbolic systems to kinetic theory: a personalized quest

5

The 1-D Scalar Case: the E-Conditions of Laxand of Oleinik

Although the equation ut+u ux = 0 was first introduced by BATEMAN, I shallfollow the classical use and call it the Burgers equation, written ut+

(u2

2

)x

= 0for the correct class of discontinuous solutions, and an important property isthat it is invariant by Galilean transformations; it means that if one moves atconstant velocity a, one replaces x by x − a t and one replaces the velocity uby u− a, and one defines a function v related to u by

u(x, t) = a+ v(x− a t, t), (5.1)

then, as is easily verified, the equation for v is also vt + v vx = 0.

For the sake of understanding in a better way which discontinuities shouldbe accepted (and that should tell us more about how irreversibility occurs),it is useful to consider the more general equations ut +

(f(u))x

= 0 for otherfunctions f ; the Galilean invariance only holds if f satisfies f ′(a+v) = a+f ′(v)for all a, v ∈ R, i.e. if f(v) = v2

2 + αv + β for all v ∈ R, where α = f ′(0) andβ = f(0).

If u is smooth and satisfies the equation ut+(f(u))x

= 0, i.e. ut+f ′(u)ux =0, then w = f ′(u) satisfies wt+wwx = 0, so one may think that the knowledgeof the Burgers equation is sufficient for solving the more general equation, butthat property only holds for smooth solutions, except if f is a polynomial ofdegree ≤ 2. Indeed, if u is a discontinuity jumping from a to b �= a, theRankine–Hugoniot condition says that u satisfies ut +

(f(u))x

= 0 if andonly if the discontinuity travels at velocity s1 = f(b)−f(a)

b−a , while f ′(u) satisfies(f ′(u)

)t+( (f ′(u))2

2

)x

= 0 if and only if the discontinuity travels at velocity

s2 = f ′(a)+f ′(b)2 , and one has s1 = s2 for all a, b if and only if f ′ is affine.

The method of characteristic curves shows that if u0 is a bounded Lips-chitz continuous function, and f is assumed to have a locally bounded secondderivative, then a solution of ut +

(f(u))x

= 0 with u(·, 0) = u0 exists for

Page 71: From hyperbolic systems to kinetic theory: a personalized quest

46 5 The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik

an interval of time. Indeed, assuming that u is smooth, one defines the char-acteristic curves by dx

dt = f ′(u(x(t), t))

and x(0) = y, and as the equationimplies that u is constant along such a curve, it is then a straight line, andthat means that on the line x = y+ t f ′(u0(y)

), one has u(x, t) = u0(y); if one

has M− ≤ u0 ≤M+, |(u0)x| ≤ A and |f ′′(v)| ≤ B for v ∈ [M−,M+], then thesolution is bounded (with M− ≤ u ≤ M+) and locally Lipschitz continuousfor x ∈ R and t ∈ [−T,+T ] if AB T < 1 (with a bound A

1−AB t ), because onlyone characteristic curve goes through each point (x, t) with |t| ≤ T , and themapping (x, t) �→ y is Lipschitz continuous.

If u0 is constant, the solution is constant, but one can construct an infinityof weak solutions with the same initial datum in the case where f is not anaffine function. Indeed, because f is not affine, one can find a and b witha < u0 < b such that the chord joining

(a, f(a)

)and

(b, f(b)

)does not

contain(u0, f(u0)

); in the case where the chord goes above

(u0, f(u0)

), one

has

s1 =f(u0) − f(a)

u0 − a< s2 =

f(b) − f(a)b− a

< s3 =f(b) − f(u0)

b− u0; (5.2)

choosing a point x0 arbitrary, one defines u by

u(x, t) =

⎧⎪⎨

⎪⎩

u0 for x < x0 + s1ta for x0 + s1t < x < x0 + s2tb for x0 + s2t < x < x0 + s3tu0 for x0 + s3t < x

(5.3)

and one checks easily that the Rankine–Hugoniot conditions are satisfied foreach of the three discontinuities; in the case where the chord goes below(u0, f(u0)

), one has a similar construction.

These nonconstant weak solutions are rejected as nonphysical, the constantsolution being considered the physical one, even though the model may notcome from a reasonable modelling of physical reality.

The constant u0 is solution of the regularized equation ut+u ux−ε uxx = 0,and the choice Uni = u0 for all i ∈ Z, and n ≥ 0 is a solution of the Lax–Friedrichs scheme, so that the limiting process selects the constant solution.It will be shown that for other initial data, the limit satisfies supplementaryconditions (which are automatically satisfied for smooth solutions), namelythat the discontinuities observed in the limiting weak solution must satisfythe Oleinik E-condition,1 expressed below in the case of piecewise constantsolutions.

1 Although usually called an entropy condition, it is not related to thermodynami-cal entropy, and I follow the name E-condition used by Constantine DAFERMOS,referring to his book [4] for many more references and generalizations. My pur-pose here is not to describe everything about quasi-linear hyperbolic systems ofconservation laws, but to observe that some discontinuities occur and that othersdo not, and to understand why it is so; this is important for the topic of kinetictheory, in relation with the question of how irreversibility occurs.

Page 72: From hyperbolic systems to kinetic theory: a personalized quest

5 The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik 47

Definition 5.1. A discontinuous function

u(x, t) ={u− for x < x0 + s tu+ for x > x0 + s t

(5.4)

which is a weak solution of ut +(f(u))x

= 0, i.e. satisfies the Rankine–Hugoniot condition f(u+)−f(u−) = s(u+−u−), is said to satisfy the OleinikE-condition if

either u− < u+ and the chord joining(u−, f(u−)

)and(u+, f(u+)

)

is below the graph of for u− > u+ and the chord joining

(u−, f(u−)

)and(u+, f(u+)

)

is above the graph of f.

(5.5)

The weak solutions constructed before have three disontinuities, from u0

to a, from a to b and from b to u0, and the discontinuity from a to b fails tosatisfy the Oleinik E-condition.

Before Olga OLEINIK had introduced her condition, which appeared tobe the desired characterization for the general equations ut +

(f(u))x

= 0,Peter LAX had introduced a simpler Lax E-condition, which makes sense forsystems.

Definition 5.2. A discontinuous function

U(x, t) ={U− for x < x0 + s tU+ for x > x0 + s t

(5.6)

which is a weak solution of Ut +(F (U)

)x

= 0, i.e. satisfies the Rankine–Hugoniot condition F (U+) − F (U−) = s(U+ − U−), is said to satisfy the LaxE-condition if

one has λj(U−) ≥ s ≥ λj(U+) for some j ∈ {1, . . . , p}. (5.7)

For the scalar case, the intuition that such a condition must hold can beguessed easily from what happens for the Burgers equation.

If one looks for solutions of the Burgers equation depending upon xt , one

finds that xt is a special solution, and using invariance by translation in x or

in t, one finds that for every x0, t0 ∈ R, a particular solution is x−x0t−t0 , valid for

t �= t0. If the initial datum is u0(x) = αx+ β, which corresponds to t0 = − 1α

and x0 = − βα , then the solution is u(x, t) = αx+β

1+α t . If one restricts attentionto increasing times, one sees that if α ≥ 0 the solution exists for all t ≥ 0, andit corresponds to the solution computed using the method of characteristiccurves, of course, while if α < 0, the solution blows up for a critical timeTc = t0 = − 1

α , and in that case all the characteristic lines pass through thepoint

(− βα ,− 1

α

).

Let a < b, and for k > 0, let us consider the initial datum u0 defined by

Page 73: From hyperbolic systems to kinetic theory: a personalized quest

48 5 The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik

u0(x) =

⎧⎨

a for x ≤ 0(1 − k x)a+ k x b for 0 ≤ x ≤ 1

kb for x ≥ 1

k ,(5.8)

so that u0 is bounded and Lipschitz continuous, and the solution u, computedby the method of characteristic curves, is given by

u(x, t) =

⎧⎨

a for x ≤ a t(1−k x)a+k x b

1+k(b−a)t for a t ≤ x ≤ 1k + b t

b for x ≥ 1k + b t,

(5.9)

so that u is bounded and Lipschitz continuous for all t ≥ 0. When one lets ktend to infinity, the sequence of initial data converges to u∗0 given by

u∗0(x) ={a for x ≤ 0b for x ≥ 0, (5.10)

and the sequence of solutions converges to u∗ given by

u∗(x, t) =

{a for x ≤ a txt for a t ≤ x ≤ b tb for x ≥ b t,

(5.11)

so that one is led to decide that, for reasons of continuity,2 one prefers thesmooth solution u∗, i.e. a discontinuity corresponding to u− = a < b = u+

is unstable and creates a rarefaction wave as in the formula for u∗, and onerejects the solution with a discontinuity travelling at velocity s = a+b

2 , whichwould be u given by

u(x, t) ={a for x ≤ s tb for x ≥ s t.

(5.12)

If one applies the method of characteristic curves to the discontinuousinitial datum u∗0, one observes that the characteristic lines with y < 0 coverthe part of the (x, t) plane with x < a t and the characteristic lines with y > 0cover the part of the (x, t) plane with x > b t, leaving a gap, namely the sectora t < x < b t.

Conversely, in the case where a > b, and where one accepts the discon-tinuous solution u, the characteristic lines coming from y < 0 interact withthose coming from y > 0, and the discontinuity is obtained as a compromise.2 One postulates that there is a natural topology for deciding if two initial data are

near and that nearby initial data create nearby solutions, i.e. the mappings S(t)defined by S(t)(u0) = u(·, t) are continuous for t ≥ 0, and define a continuoussemi-group, so that S(t)(u0) → u0 as t tends to 0, and S(s + t) = S(s)S(t) fors, t ≥ 0. For the scalar case, the strong topology of L1(R) is such a topology, andit is believed to work also in the case of systems, at least in one space dimension,but as L1 is not a good functional space for dealing with partial differentialequations which do not satisfy the maximum principle, it seems that other spacesare needed, and I have proposed some interpolation spaces.

Page 74: From hyperbolic systems to kinetic theory: a personalized quest

5 The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik 49

One is led then to consider that a discontinuity occurs because the leftside of the discontinuity has information travelling faster than s, while theright side has information travelling slower than s, and that is what the LaxE-condition is about.

Another way to look at the problem is to consider in the (x, u) planethe graph of u∗0 to which one adds a vertical part from (0, a) to (0, b) at thediscontinuity (so that one has a continuous curve), and to consider that eachpoint (x, u) of the curve travels at velocity u for a small time Δt. If a < bone then finds that the vertical part transforms into the graph of a Lipschitzcontinuous function, and that leads to accepting such a rarefaction wave (themethod of characteristic curves with y = 0 and u0(y) moving from a to bgenerates characteristic lines which fill the gap that was observed before).Conversely, if a > b, one obtains a curve which is not the graph of a function,exactly as one observes for breaking waves on a sloping beach, and I had learntabout the influence of the variation of depth in my continuum mechanicscourse at Ecole Polytechnique with Jean MANDEL,3 although he had onlytreated linear effects, with the purpose of showing that momentum could betransported without any real transport of mass.4 However, because the modeldoes not allow for the breaking of waves, a compromise must be found, wherethe matter which has gone too fast at the top of the wave is used to help thematter which has gone too slow at the bottom of the wave, and the rule mustbe compatible with the fact that the integral of u should be conserved.

The crucial effect which explains why the Oleinik E-condition improvesthe Lax E-condition is that a discontinuity from a to b may well break up intosmaller discontinuities separated by smooth parts.

Constantine DAFERMOS has found a simple way to analyse such a questionby looking at functions f which are continuous and piecewise affine, and this3 Jean MANDEL, French mathematician, 1907–1982. I had him as a teacher in

1966–1967, for the course of continuum mechanics at Ecole Polytechnique, Paris,France. He had worked in Saint-Etienne and in Paris, France.

4 In the open sea, one observes some sinusoidal waves, with a profile dependingonly upon one direction, and these waves seem to move at a constant velocity,but if one linearizes the equation of hydrodynamics in an ocean of fixed depth H(around a zero velocity field), one finds that disturbances of the surface decomposeas waves travelling at a velocity V (H) (the same in every direction); it is the topof one of these unidimensional waves which travels at this velocity, and one canfollow it easily with the eye, and this velocity is a phase velocity which does notcorrespond to any transport of mass (and a floating cork moves a little when thewave goes by, but does not drift), so that it transports linear momentum (feltat the end, on the beach where the waves break). If there is a sharp decrease indepth near a beach, for example due to the presence of a submerged coral reef,then the waves from the open sea arrive too fast compared to the local velocityfavoured by the waves and one observes the breaking of waves, the delight ofsurfers, whose art is precisely about using the momentum transported by thesewaves.

Page 75: From hyperbolic systems to kinetic theory: a personalized quest

50 5 The 1-D Scalar Case: the E-Conditions of Lax and of Oleinik

simplifies the analysis of the smooth parts in the solutions (which, according tothe analysis shown before for systems, come either from the portions where f isconvex and u increases, or the portions where f is concave and u decreases).For this class of functions, if u0 is piecewise constant, then the solution ispiecewise constant at each later time, and a discontinuity from u− on the leftto u+ on the right is accepted if and only if one cannot find v such that thediscontinuity from u− to v travels at a velocity s1 strictly smaller than thevelocity s2 of the discontinuity from v to u+ (in which case one would preferto decompose the discontinuity into a succession of discontinuities which allsatisfy the condition and move faster and faster); notice that the Lax E-condition is recovered as the limiting cases, where v is taken near u− ornear u+. This manner of selecting discontinuities is precisely the Oleinik E-condition for this particular class of functions; for a general function f , oneapproaches f uniformly (on bounded intervals) by a sequence fn of continuouspiecewise affine functions, and one looks at the limit of the sequence un ofsolutions, noticing that although each un is piecewise constant, its limit maynot be and that a smooth transition in the limiting solution is approached byparts containing plenty of discontinuities of small amplitude.

The analysis using continuous curves which are not necessarily graphs andthen deducing a compromise to transform the result into a graph also leadsone to discover why the Oleinik E-condition is natural, and it was followed byYann BRENIER,5 and his arguments look to me as if one accepted breakingof the waves but one was letting the gravity g tend to ∞.

The analysis of the regularized equation ut+(f(u))x−ε uxx = 0, and that

of the Lax–Friedrichs finite-difference scheme, have relied too heavily upon themaximum principle, which is of little use for general systems of conservationlaws. The order relation on R is also used in the method described above,where one first creates a curve which is not a graph and one then dropssomething which has gone too fast at the top, so one uses a direction on R; itsuggests that for systems one might need a vector field in R

p along which topush information, but the matter is made more difficult by the fact that for asystem not all discontinuities satisfy a Rankine–Hugoniot condition, and thatone needs more than one direction anyway for transporting information, aseach of the directions of the eigenvectors rj , j = 1, . . . , p, have this role whenthe solution is smooth.

[Taught on Friday September 7, 2001.]

5 Yann BRENIER, French mathematician, born in 1957. He has worked at UCLA(University of California at Los Angeles), Los Angeles, CA, at INRIA (InstitutNational de Recherche en Informatique et Automatique), Rocquencourt, at Uni-versite Paris VI (Pierre et Marie Curie), Paris, and at CNRS (Centre National dela Recherche Scientifique) at Universite de Nice-Sophia-Antipolis, Nice, France.

Page 76: From hyperbolic systems to kinetic theory: a personalized quest

6

Hopf’s Formulation of the E-Conditionof Oleinik

In the late 1960s, Eberhard HOPF found a nice analytical way for express-ing the Oleinik E-condition (for a scalar equation), and his condition makessense without assuming that the solution is smooth enough to have limits oneach side of a discontinuity; Peter LAX then did the analysis for systems, andhe quotes Eberhard HOPF but also KRUZHKOV as having the idea indepen-dently. Eberhard HOPF observed that if ϕ and ψ are related by ψ′ = f ′ϕ′

(and following Peter LAX one calls ϕ an entropy, and ψ an entropy flux, or(ϕ, ψ) an entropy/entropy flux pair), then any solution of ut +

(f(u))x

= 0which is piecewise smooth and satisfies the Oleinik E-condition for each of itsdiscontinuities automatically satisfies the condition(ϕ(u))t+(ψ(u)

)x≤ 0 in the sense of distributions/Radon measures,

for all convex entropies ϕ,(6.1)

and for a discontinuity from u− to u+, travelling at velocity s = f(u+)−f(u−)u+−u−

,this condition is equivalent to

ψ(u+) − ψ(u−) ≤ s(ϕ(u+) − ϕ(u−)

), for all convex entropies ϕ. (6.2)

Conversely, the above condition implies the Oleinik E-condition, and also theRankine–Hugoniot condition for s, by choosing for ϕ all affine functions. Thislast equivalence is seen by noticing that every convex function can be ap-proached (uniformly on bounded sets) by convex piecewise affine functions,and that shows that the conditions for all convex ϕ can be replaced by anequivalent condition where one uses the functions ϕ(u) = ±u, correspondingto ψ(u) = ±f(u) and the family of entropy/entropy flux pairs indexed byk ∈ R

ϕk(u) ={

0 for u ≤ ku− k for u ≥ k

; ψk(u) ={

0 for u ≤ kf(u) − f(k) for u ≥ k

. (6.3)

If k ≤ min{u−, u+} or if k ≥ max{u−, u+}, the condition using the pair(ϕk, ψk) is trivially satisfied, and for min{u−, u+} < k < max{u−, u+} the

Page 77: From hyperbolic systems to kinetic theory: a personalized quest

52 6 Hopf’s Formulation of the E-Condition of Oleinik

condition tells whether the point(k, f(k)

)is above or below the chord joining(

u−, f(u−))

and(u+, f(u+)

), and using all these ks gives the Oleinik E-

condition.

If uε solves uεt +(f(uε)

)x− ε uεxx = 0 with a fixed initial datum u0, then

from the bounds obtained one may extract a subsequence uη which convergesalmost everywhere to a limit u0, and one wants to show that u0 satisfies(ϕ(u0)

)t+(ψ(u0)

)x≤ 0 for all ϕ convex. Multiplying the equation for uη by

ϕ′(uη), one deduces that(ϕ(uη)

)t+(ψ(uη)

)x−η(ϕ(uη)

)xx

+ηϕ′′(uη)(uηx)2 = 0,and the sequence of nonnegative functions μη = ηϕ′′(uη)(uηx)2 converges inthe sense of distributions (or in the sense of Radon measures) to a nonneg-ative Radon measure μ0, and because uη is bounded and converges almosteverywhere to u0, one has g(uη) → g(u0) in Lploc strong for every p ∈ [1,∞)and for every continuous function g, and passing to the limit in the equationone deduces that

(ϕ(u0)

)t+(ψ(u0)

)x

+ μ0 = 0.Peter LAX noticed that the same argument holds for a system of conser-

vation lawsUt +

(F (U)

)x

= 0, (6.4)

which one regularizes by an artificial viscosity term1

Uεt +(F (Uε)

)x− εDUεxx = 0, (6.5)

if one selects D = α I with α > 0. If ϕ is a convex entropy (and not allfunctions are entropies for systems), then one has(ϕ(Uε)

)t+(ψ(Uε)

)x− ε α

(ϕ(Uε)

)xx

+ ε αϕ′′(Uε)[Uεx, Uεx ] = 0, (6.6)

where ϕ′′(u)[v, w] is the symmetric bilinear form defined by the Hessian matrixϕ′′ at u;2 a particular difficulty is that one does not know enough a prioribounds to be able to extract a subsequence converging almost everywhere,and the mathematical result is then that if a subsequence exists which staysbounded in L∞ and converges almost everywhere, then the limit U0 satisfies

(ϕ(U0)

)t+(ψ(U0)

)x≤ 0 for all convex entropies ϕ, (6.7)

and one should limit the growth at infinity of the entropy functions used ifone only has a bounded sequence in some Lp for p <∞.1 For the system of gas dynamics, it means that one adds a term −ε �xx in the equa-

tion of conservation of mass (corresponding to a diffusion of mass, as postulatedby FICK), and a term −ε(�u)xx in the equation of balance of momentum (cor-responding to a viscosity effect quite different from that postulated by NAVIER),

and a term −ε(

� u2

2+ � e)

xxin the equation of balance of energy (corresponding

to a diffusion of heat of a much stranger form than that postulated by FOURIER).2 Ludwig Otto HESSE, German mathematician, 1811–1874. He had worked in

Konigsberg (then in Germany, now Kaliningrad, Russia), in Heidelberg, and inMunchen (Munich), Germany.

Page 78: From hyperbolic systems to kinetic theory: a personalized quest

6 Hopf’s Formulation of the E-Condition of Oleinik 53

The approximate solutions constructed by the Lax–Friedrichs scheme alsoapproach weak solutions satisfying the Oleinik E-condition (under the for-mulation of Eberhard HOPF). Indeed, let a = min{Uni−1, U

ni+1} and b =

max{Uni−1, Uni+1} (and a < b, otherwise Un+1

i = a = b and there is nothing toprove); by the CFL condition one has imposed Δt

Δx |f ′(v)| ≤ 1 for a ≤ v ≤ b,and one wants to show that

U− a+b2

Δt + f(b)−f(a)2Δx = 0 implies ϕ(U)−ϕ(a)+ϕ(b)

2Δt + ψ(b)−ψ(a)

2Δx ≤ 0for all convex entropies ϕ.

(6.8)

This is trivially satisfied if ϕ is affine, and it is equivalent then to show theimplication for the special functions ϕk which were used before; because a ≤U ≤ b, the condition is trivially satisfied if k ≤ a or if k ≥ b; the case a < k < bsplits into two subcases, according to the position of U with respect to k. Inthe case a ≤ U ≤ k < b, one has ϕ(a) = ϕ(U) = ψ(a) = 0 and ϕ(b) =b − k, ψ(b) = f(b) − f(k), and the inequality becomes − b−k

2Δt + f(b)−f(k)2Δx ≤ 0,

which follows from the mean value theorem f(b)−f(k) = (b−k)f ′(v) for somev ∈ (k, b); similarly for the case a < k ≤ U ≤ b, one has ϕ(a) = ψ(a) = 0and ϕ(U) = U − k, ϕ(b) = b − k, ψ(b) = f(b) − f(k), and the inequality

becomes U− k+b2

Δt + f(b)−f(k)2Δx ≤ 0, which after using the definition of U means

a−k2Δt + f(a)−f(k)

2Δx ≤ 0, which follows from f(a)− f(k) = (a− k)f ′(w) for somew ∈ (a, k).

Another approach for choosing or rejecting a discontinuity is to look for aviscous shock profile, i.e. a curve which describes intermediate values betweenU− and U+ (which are assumed to satisfy a Rankine–Hugoniot conditionF (U+) − F (U−) = s(U+ − U−)), for a regularized equation of the form

Ut +(F (U)

)x− ε(D(U)Ux

)x

= 0, (6.9)

where the (artificial) viscosity matrix D(U) is nonnegative, but there areother conditions to impose, in particular such a matrix should not destabilizeconstant states, as was noticed by Andrew MAJDA and Robert PEGO,3 andone looks for a solution of the form

U(x, t) = V(x− s t

ε

)with V (−∞) = U− and V (+∞) = U+. (6.10)

Notice that this is different from the Cauchy problem with an initial datumindependent of ε. I have heard that this type of question had been initializedby GEL’FAND,4 but Constantine DAFERMOS mentions that for gas dynamics3 Robert Leo PEGO, American mathematician. He worked at University of Michi-

gan, Ann Arbor, MI, at University of Maryland, College Park, MD, and at CMU(Carnegie Mellon University), Pittsburgh, PA.

4 Izrail Moiseevic GEL’FAND, Russian-born mathematician, born in 1913. He re-ceived the Wolf Prize in 1978, for his work in functional analysis, group represen-

Page 79: From hyperbolic systems to kinetic theory: a personalized quest

54 6 Hopf’s Formulation of the E-Condition of Oleinik

such an idea had already been used by RANKINE and by Lord Rayleigh. Theequation for V is then

−s V ′ +(F (V )

)′ − (D(V )V ′)′ = 0, (6.11)

which one integrates immediately as

−s V + F (V ) −D(V )V ′ = C, (6.12)

where one must have

C = F (U−) − sU− if U → U− at −∞C = F (U+) − sU+ if U → U+ at −∞,

(6.13)

so that there would be no solution if the Rankine–Hugoniot condition was notsatisfied. If D(U) is invertible, then one has an ordinary differential equation

V ′ =(D(V )

)−1(F (V ) − s V − C

), (6.14)

which has both U− and U+ as critical points, and one is looking for a connect-ing orbit. The existence of such a connecting orbit requires that U− have anunstable manifold, so that at least one eigenvalue of D(U−)(∇F (U−) − s I)has a nonnegative real part, and that U+ have a stable manifold, so that atleast one eigenvalue of D(U+)(∇F (U+)−s I) has a nonpositive real part, andthat is related to the Lax condition; it is, however, a difficult global questionto decide if by leaving U− along the unstable manifold, one is able to reachthe stable manifold of U+, and this question has been studied extensivelyby Charles CONLEY and Joel SMOLLER.5 For a scalar equation, the crite-rion selects discontinuities such that between U− and U+ there is no othercritical point of the differential equation, and this corresponds to the OleinikE-condition.6

The search to determine which discontinuities are acceptable is certainlynot over for the case of systems. One reason to be confident to have found theright condition for the scalar case is also that there is a uniqueness theorem(whose general form is probably due to KRUZHKOV), for solutions satisfyingthe Oleinik E-condition, expressed in the analytic form of Eberhard HOPF.

tation, and for his seminal contributions to many areas of mathematics and itsapplications, jointly with Carl L. SIEGEL. He worked in Moscow, Russia, and atRutgers University, Piscataway, NJ.

5 Charles Cameron CONLEY, American mathematician, 1933–1984. He had workedat University of Wisconsin, Madison, WI, where I met him during the year 1974–1975 that I spent there, and then at University of Minnesota, Minneapolis, MN.

6 Some discontinuities satisfying the Oleinik E-condition may actually be obtainedby putting together elementary discontinuities satisfying the condition with thesame velocity s, and the viscous shock profile only selects elementary discontinu-ities satisfying the Oleinik E-condition.

Page 80: From hyperbolic systems to kinetic theory: a personalized quest

6 Hopf’s Formulation of the E-Condition of Oleinik 55

In the late 1970s, I had initialized the study of oscillations (later calledmicrostructures) in the nonlinear partial differential equations of continuummechanics, and while the previous analysis for quasi-linear hyperbolic systemshad concentrated on shocks, my analysis was to study oscillating sequences ofsolutions. When the model is supposed to represent physical reality, my ap-proach is more suitable for following the physics behind the phenomena andfor discovering if the laws have been averaged correctly and if more efficienteffective equations should be derived, and this is related to the questions ofhomogenization which I have developed partly with Francois MURAT, gen-eralizing an earlier approach of Sergio SPAGNOLO. I want to emphasize thatour approaches use no periodicity assumptions, of course, and one should payattention to the fact that, among those who only use periodic modulationideas, many forget to refer to a general theory of homogenization, and theyavoid mentioning Sergio SPAGNOLO for G-convergence, or Francois MURAT

and myself for H-convergence, or even Evariste SANCHEZ-PALENCIA, who wasthe first to guess correct asymptotic expansions, or Ivo BABUSKA, who wasthe first to use the method for engineering applications, and who first used theterm homogenization in the mathematical literature,7 to which I gave a moregeneral meaning in my Peccot lectures,8 in the beginning of 1977. It shouldnot come as a surprise that most of those engaged in wide misattribution ofideas are also keen in advocating fake continuum mechanics or physics.

The question of wave breaking has been mentioned, and if an equation likethe Burgers equation is supposed to describe the vertical displacement of thesurface of the water, one knows that some situations create the breaking ofwaves, and one expects then to observe bubbles of air trapped for a while inthe water and trying to move upward, and droplets of water in the air tryingto move downward. A layer representing a mixture of air and water might bea good description for what is going on, but homogenization tells us that onecannot expect a too simple law for the evolution of such mixtures, becauseeffective properties do not depend only upon proportions, and H-measuresmight be useful for computing corrections [18], and other mathematical toolsmay have to be developed for a complete understanding of that question. It isuseful to observe then that the laws of thermodynamics have some limitations,not only because they have been derived from the observation of equilibria,but also because the effective properties of mixtures do not depend only uponproportions. One should then reject part of the rules of thermodynamics andexplain why one does not follow them, like I do, but it remains an importantmathematical question to settle here, which is to derive a new and betterthermodynamics, by understanding more about the evolution of mixtures.Of course, this should be done without using probabilities, which are alwaysused when one lacks information on the processes which must be understood.7 As pointed out to me by Michael VOGELIUS, the term homogenization was used

previously by nuclear engineers.8 Claude Antoine PECCOT, French child prodigy, 1856–1876.

Page 81: From hyperbolic systems to kinetic theory: a personalized quest

56 6 Hopf’s Formulation of the E-Condition of Oleinik

Although the introduction of probabilities is suitable for engineers, who mustcontrol situations for which one does not even know which equations to use,it is certainly not suitable for scientists, despite the bad example of those po-litically inclined physicists who coined the dogma that there are probabilitiesin the laws of nature.

Although the complete problem is too difficult at the moment, one canguess that if one lets the gravity g tend to ∞, the mixtures will separatequickly, with water on the bottom and air on the top, and the solution mightlook precisely like the discontinuous solutions of the Burgers equation whichhave been selected.

[Taught on Monday September 10, 2001.]

Notes on names cited in footnotes for Chapter 6, SIEGEL,9 RUTGERS,10

VOGELIUS,11 and for the preceding footnotes, GEORGE II.12

9 Carl Ludwig SIEGEL, German mathematician, 1896–1981. He received the WolfPrize in 1978, for his contributions to the theory of numbers, theory of severalcomplex variables, and celestial mechanics, jointly with Izrail GEL’FAND. He hadworked at Georg-August University, Gottingen, Germany.

10 Henry RUTGERS, American colonel, 1745–1830. Rutgers University, Piscataway,NJ, is named after him.

11 Michael VOGELIUS, Danish-born mathematician. He worked at University ofMaryland, College Park, MD, and at Rutgers University, Piscataway, NJ.

12 Georg Augustus, 1683–1760. Duke of Brunswick-Luneburg (Hanover), he becameKing of Great Britain and Ireland in 1727, under the name GEORGE II.

Page 82: From hyperbolic systems to kinetic theory: a personalized quest

7

The Burgers Equation: Special Solutions

Let us now use our knowledge of which discontinuities to accept for the Burgersequation, i.e. U− ≥ U+, to compute a few explicit solutions; in that case, theuniqueness result of Olga OLEINIK applies, based on the estimate

ux ≤ 1t, (7.1)

which is inherited from the same inequality for the solution of the equationregularized by adding −ε uxx.

Let u0 be of the form

u0(x) =

{a for x < x1

b for x1 < x < x2

a for x > x2,(7.2)

where a < b. By using invariance by translation and Galilean invariance, onemay assume that a = 0 and x1 = 0, i.e. one puts u(x, t) = a+v(x−x1−a t, t)and v satisfies the Burgers equation; the initial datum for v has a discontinuityat 0, where u jumps from 0 to c = b−a > 0, and this discontinuity transformsinto a rarefaction wave u = x

t and a discontinuity at L = x2 − x1 where ujumps from c to 0, and this discontinuity travels unchanged at velocity c

2 , andthis description is valid as long as the rarefaction wave has not caught up withthe slower shock in front of it, i.e. for t ≤ 2L

c , so that one has

v(x, t) =

⎧⎪⎨

⎪⎩

0 for x ≤ 0xt for 0 ≤ x ≤ c t < 2Lc for c t ≤ x < L+ c t

20 for L+ c t

2 < x

, for t <2Lc, (7.3)

At t = 2Lc , the solution has a triangular shape, v = 0 for x ≤ 0, v = x

t for0 ≤ x < 2L, and v = 0 for x > 2L; afterwards the solution has a similarstructure

Page 83: From hyperbolic systems to kinetic theory: a personalized quest

58 7 The Burgers Equation: Special Solutions

v(x, t) =

⎧⎨

0 for x ≤ 0xt for 0 ≤ x < z(t)0 for x > z(t)

, for t ≥ 2Lc, (7.4)

and besides z(

2Lc

)= 2L, the value of z is obtained by using the Rankine–

Hugoniot conditiondz

dt=

z

2t, (7.5)

because the shock jumps from z(t)t to 0; integrating the differential equation

gives z(t) = k√t, and as k

√2Lc = 2L one has k =

√2L c, so that

z(t) =√

2σ t, with σ = L c, for t ≥ 2Lc, (7.6)

which can also be obtained by observing that the integral of u in x must beconstant, and that gives z2

2t = L c, but one has to check the Rankine–Hugoniotcondition then, in order to be sure that one has a solution.

One can observe on these particular solutions of the Burgers equation howshocks create irreversible effects. Indeed, if one takes different data of thepreceding type at time 0, where for j = 1, . . . ,m, the function vj0 is 0 forx < 0, cj for 0 < x < Lj and 0 for x > Lj, and if all the products Ljcjare equal to σ, then for t large enough, and more precisely for t > maxj

2Lj

cj,

then all the solutions coincide. Actually, letting L tend to 0, with c = σL ,

gives a sequence of initial data converging to σ δ0, so the formula for z can beunderstood as the solution with initial datum σ δ0.

There are other initial data which create the same triangular shaped so-lution at a later time; for example if u0 is given by

u0(x) =

{ 0 for x < 0α(L − x) for 0 < x ≤ L0 for x ≥ L,

(7.7)

with α > 0, the solution is

u(x, t) =

⎧⎪⎪⎨

⎪⎪⎩

0 for x < 0xt for 0 < x ≤ αL tα(L−x)1−α t for αL t ≤ x ≤ L

0 for x ≥ L

, for 0 < t <1α, (7.8)

giving for t = 1α a triangular profile corresponding to σ =

∫Ru0 dx = αL2

2 .Actually, if u0 is 0 outside (0, L) and is a piecewise constant nonincreasingnonnegative function on (0, L), then after a finite time the solution has takenthe triangular profile.

However, if b > a > 0 and

Page 84: From hyperbolic systems to kinetic theory: a personalized quest

7 The Burgers Equation: Special Solutions 59

u0(x) =

{ 0 for x < 0b for 0 < x < La for x > L,

(7.9)

then after time Tc = 2Lb−a the solution takes the following form:

u(x, t) =

⎧⎨

0 for x < 0xt for 0 < x < t y(t)a for x > t y(t)

, for t > Tc =2Lb− a

, (7.10)

where

y(t) > a andd(t y)dt

=y + a

2for t ≥ Tc, and y(Tc) = b, (7.11)

giving after integration

y(t) = a+α√t, with α =

√Tc(b− a) =

√2L(b− a). (7.12)

The Burgers equation is invariant by some changes of scale, and if onemakes the change

u(x, t) = a v(b x, a b t), or u(x, t) =L

Tv( xL,t

T

), (7.13)

with a, b �= 0, or L, T �= 0, then u satisfies the Burgers equation if and only ifv satisfies the Burgers equation; of course, this corresponds to changing theunit of length and the unit of time, and the unit for measuring velocities isautomatically determined.

The solution of the Riemann problem corresponds to looking for a solutioninvariant by the change (x, t) �→ (λx, λ t), i.e. which uses the subgroup oftransformations given by a = 1, so one looks for solutions of the form

u(x, t) = f(xt

), (7.14)

and this gives the particular solution xt , or the shocks with u = U− for x < s t

and u = U+ for x > s t, with U− ≥ U+ and s = U−+U+2 .

If one considers functions with compact support, whose integral must stayconstant, one must consider the subgroup of transformations b = a, and asolution invariant by the transformations a u(a x, a2t) must be of the form

1√tf( x√

t

), (7.15)

which at first sight contains the already known solution xt , apart from the fact

that f must be integrable; a simple analysis gives the family that Peter LAX

called N-waves,

Page 85: From hyperbolic systems to kinetic theory: a personalized quest

60 7 The Burgers Equation: Special Solutions

u(x, t) =

⎧⎨

0 for x < −√2m−t

xt for −√

2m−t < x <√

2m+t0 for x >

√2m+t,

(7.16)

where m−,m+ ≥ 0; as Peter LAX has noticed, there are invariants that canbe defined for integrable solutions satisfying the Burgers equation, and twoof them are the quantities m− and m+ corresponding to a breaking of theconserved quantity

∫Ru dx = m+ − m− into two conserved quantities, the

part moving to −∞ and the part moving to +∞.

One should notice that an L∞ bound of the solution in O(

1√t

)can be

derived from the L1 norm of the initial datum u0 (and the bound is sharpby looking at the preceding triangular-shaped solutions). One uses the factthat ux ≤ 1

t , so if u(x0, t) = a > 0 for example, then one must have u(x, t) ≥x−x0+a t

t for x0 − a t < x < x0, so that the integral of |u| is greater than thecorresponding surface a2t

2 , and must be ≤ ∫R|u0| dx, giving the estimate

|u(x, t)| ≤√

2∫

R|u0| dxt

almost everywhere for x ∈ R, t > 0. (7.17)

Peter LAX had also noticed that if the initial datum is periodic, then thesolution is periodic of course, and it converges to the average over one periodin O

(1t

), and this follows easily again from the estimate ux ≤ 1

t . There areactually self-similar solutions which decay in 1

t , and they correspond to thesubgroup of transformations b = 1, for which one must look for solutions ofthe form

u(x, t) =1tf(x), (7.18)

and one finds again the case xt , but if one is interested in a global decay in

1t , one is led by a simple analysis to the following family of solutions, whereone can choose an arbitrary family of disjoint intervals Ij = (xj −Lj, xj +Lj)(with Lj > 0), finite or countable but with an upper bound on the Lj in theinfinite case,

u(x, t) =

⎧⎨

x−xj+Lj

t for xj − Lj < x < xjx−xj−Lj

t for xj < x < xj < xj + Lj0 outside the union of the intervals Ij .

(7.19)

In the summer of 1986, I had been asked by Roger CHERET to write atheoretical chapter for a book about shocks in solids,1 [2], and I first wrote aset of notes (in French) for an introduction to quasi-linear hyperbolic systems

1 Roger CHERET, French physicist. He worked at CEA (Commissariat a l’EnergieAtomique).

Page 86: From hyperbolic systems to kinetic theory: a personalized quest

7 The Burgers Equation: Special Solutions 61

of conservation laws [17], from which he was going to select some material. Ihad been led to do other computations, and one of them was to wonder aboutwhat happens to a perturbation of a rarefaction wave (or compression wavein the case κ < 0), i.e. I considered the case

u0(x) = κx+ v0(x), (7.20)

and I found that if one uses the transformation

u(x, t) =κx

1 + κ t+

11 + κ t

v( x

1 + κ t,

t

1 + κ t

), (7.21)

then v satisfies the Burgers equation with initial data v0; if v0 is bounded,one then has a correction O

(1t

)if κ > 0, and one should notice that if v0 has

compact support the solution v decays in O(

1√t

), but that does not make the

correction O(

1t√t

), because v is evaluated at time t

1+κ t which tends to 1κ as

t tends to ∞.It seemed to me at the time that the knowledge of explicit solutions should

be used for improving numerical methods, and I was worried that the numeri-cal methods are not Galilean invariant, for example, but I also thought aboutthe idea of adding explicit solutions in the treatment of finite element ap-proximations, like for domains with corners, and Louis BRUN had once men-tioned to me that this idea had already been used by LAGRANGE.2 After JeanOVADIA had told me that,3 for numerical reasons, it was useful to considerthe generalized Riemann problem

u0(x) ={u− + κ−x for x < 0u+ + κ+x for x > 0, (7.22)

I computed the corresponding explicit solution, which in the case u− ≤ u+ is

u(x, t)

⎧⎪⎨

⎪⎩

u−+κ−x1+κ−t for x ≤ u−txt for u−t ≤ x ≤ u+tu++κ+x1+κ+t

for x ≥ u+t

, as long as 1 + κ−t > 0, 1 + κ+t > 0, (7.23)

and in the case u− > u+ is

u(x, t) =

{u−+κ−x1+κ−t

for x < g(t)u++κ+x1+κ+t

for x > g(t), as long as 1+κ−t > 0, 1+κ+t > 0, (7.24)

where g satisfies a differential equation, corresponding to the Rankine–Hugo-niot condition2 Louis BRUN, French mathematician. He worked at CEA (Commissariat a

l’Energie Atomique).3 Jean OVADIA, French mathematician. He worked at CEA (Commissariat a

l’Energie Atomique).

Page 87: From hyperbolic systems to kinetic theory: a personalized quest

62 7 The Burgers Equation: Special Solutions

dg

dt=

12

(u− + κ−g(t)1 + κ−t

+u+ + κ+x

1 + κ+t

), and g(0) = 0, (7.25)

whose solution is

g(t) = tu−

√1 + κ−t+ u+

√1 + κ+t√

1 + κ−t+√

1 + κ+t, as long as 1 + κ−t > 0, 1 + κ+t > 0,

(7.26)showing that the amplitude of the shock at time t is

u(g(t)−, t) − u(g(t)+, t) = u+−u−√(1+κ−t)(1+κ+t)

,

as long as 1 + κ−t > 0, 1 + κ+t > 0.(7.27)

[Taught on Wednesday September 12, 2001.]

Page 88: From hyperbolic systems to kinetic theory: a personalized quest

8

The Burgers Equation: Small Perturbations;the Heat Equation

The Burgers equation is a good example for pointing out that one should becareful about throwing away some terms because one thinks that they aresmall.

Let us assume that for some ε > 0, an initial datum satisfies

a− ε ≤ u0 ≤ a+ ε in R, (8.1)

so that the correct solution of the Burgers equation

ut +(u2

2

)

x= 0 in R × (0,∞); u(·, 0) = u0 in R (8.2)

also satisfiesa− ε ≤ u ≤ a+ ε in R × (0,∞). (8.3)

With the usual tacit assumption that ε is a small quantity, it may seemreasonable to use the linearization

u = a+ v, and u2 ≈ a2 + 2a v, (8.4)

arguing that one keeps the term 2a v which is O(ε), but one rejects the termv2, which is O(ε2).

Mathematicians tend to be bothered by hasty simplifications which physi-cists and engineers do without guilt because they “know” that some terms aresmall; of course, in doing that they implicitly assume that the mathematicalmodel is a good approximation of reality so that they believe that what onehas observed in reality is a property that the model has, but mathematiciansinsist that one should prove that the mathematical model possesses someproperty and that one should not postulate it. After all, it could happen thatthe equation is not a good model of reality, and the only way to prove that itis not a good model is precisely to show that it does not possess a particularproperty which is observed. However, one should remember that every model

Page 89: From hyperbolic systems to kinetic theory: a personalized quest

64 8 The Burgers Equation: Small Perturbations; the Heat Equation

has some limitations, and that a bad model may still be useful if one uses itunder the conditions for which it is known to be good, and this knowledgemay come from having performed a precise mathematical analysis.

Here one has a precise bound for the dropped term v2, which one deemssmall because one norm of it is small, and one may think that v is near wgiven by

wt + awx = 0; w(·, 0) = v0, i.e. w(x, t) = v0(x− a t). (8.5)

However, if one uses a Galilean transformation

u(x, t) = a+ U(x− a t, t), (8.6)

i.e. one moves with velocity a, then U solves the Burgers equation with initialdata v0, while the chosen approximation w is a constant equal to v0, so that itis actually a quite bad approximation. One may think that the approximationis good for a small time, but looking more precisely into the matter showsthat it is very dependent upon the regularity of v0, and one then learns thatbefore discarding a term one should first try to understand what norm oneshould use for measuring how small this term is. For example, if v0 = ε, thenU = ε, but if

v0 = ε ϕ0

( xεm

), (8.7)

thenU(x, t) = ε ϕ

( xεm

,t

εm−1

), (8.8)

where ϕ is the solution of the Burgers equation with initial datum ϕ0, so ifϕ0 is periodic with average 0, ϕ decays in O

(1t

), giving the estimate O

(εm

t

)

for U , and one should observe that for m ≥ 1 the function v0 is small but notits derivative (high derivatives mean quick formation of discontinuities, whichmay imply rapid decay due to irreversible effects).

In 1987, I had suggested that noise in 1f , i.e. inversely proportional to

frequency could be related to the tendency of the Burgers equation to cre-ate solutions with a triangular shape, and I had mentioned it first to JamesGLIMM and then in a letter sent for refereeing a note on the subject, to PaulGERMAIN and Alfred JOST,1,2 because they correspond to Fourier transformsdecaying in 1

ξ . If a line of transmission is not exactly linear, but one makes theassumption that it is linear because the deviation from linearity seems small,one may have neglected small quadratic terms which have an effect whichis not completely negligible, and in particular create small triangular-shaped1 Paul GERMAIN, French mathematician, born in 1920. He worked at Universite

Paris VI (Pierre et Marie Curie), Paris, and at ONERA (Office National d’Etudeset de Recherches Aeronautiques), Chatillon, France.

2 Alfred JOST, French biologist, 1916–1991. He had worked in Paris, France, holdinga chair (physiologie du developpement, 1947–1987) at College de France, Paris.

Page 90: From hyperbolic systems to kinetic theory: a personalized quest

8 The Burgers Equation: Small Perturbations; the Heat Equation 65

structures in the solution, which it could be interesting to filter, but for whichFourier analysis does not seem the right thing to do.

Of course, there is a tendency to call noise whatever it is that one does notunderstand,3 and the reason why one does not understand it is because oneapplies linear tools in situations where there are nonlinear effects, which onehas too quickly postulated to be negligible, so that one finds oneself working onoversimplified equations that cannot explain what is observed. It is certainlynot by an invocation of probabilistic methods that one can correct previousmistakes, and the dogma that there are probabilities in the laws of naturemay be seen as a silly invention of people who wanted to hide the fact thatthey had not understood what they were doing.

The N-waves appearing in the Burgers equation have the curious effect ofhaving propagated of a length of order

√t after time t, instead of a length

of order t as one expects for linear waves, and I think that many observedeffects of this type have been wrongly attributed to linear diffusion effects,like for the heat equation, postulated by FOURIER, or the effects of viscosity,considered by NAVIER.

In connection with the heat equation, there is an important probabilisticgame which I have to describe for criticizing it, called “Brownian” motion,4

which is not really related to what BROWN had observed, but it is related tomy subject of kinetic theory. BROWN had observed pollen under a microscope,and he had noticed some erratic motions, which were wrongly interpreted asjumps in position, while they actually were the results of jumps in velocity dueto collisions with much smaller particles. With the interpretation of randomwalks, which correspond to nonphysical jumps in position, the first mathemat-ical analysis was the work of BACHELIER in 1900,5 although the objection tojumps in position does not apply to his work, as he was interested in ques-tions of finance, and the effects of buyers and sellers on the stock market arereasonably well simulated by a random walk model, but I heard from DavidHEATH that there seems to be a slight asymmetry,6 and the stock prices seemto fall a little faster than they have risen. Of course, it is important to havea large number of customers, so that some kind of effective behaviour can be3 Real noise is related to acoustic effects, which seem to come from nonperiodic

phenomena, which the human ear does not seem to process in the same waythan music (although the sounds of nature like running water or wind are notclassified as music, but are not described as noise either). The creation of soundin hydrodynamic phenomena is often neglected, but it is one way for the energydissipated to be transported away.

4 After having called it (mathematical) Brownian motion, I prefer to call it now“Brownian” motion, and the presence of quotes serves as a reminder that thename is not well chosen.

5 Louis BACHELIER, French mathematician, 1870–1946. He had worked in Be-sancon, in Dijon, in Rennes, France, and in Besancon again.

6 David HEATH, American mathematician. He worked at Cornell University, Ithaca,NY, and at CMU (Carnegie Mellon University), Pittsburgh, PA.

Page 91: From hyperbolic systems to kinetic theory: a personalized quest

66 8 The Burgers Equation: Small Perturbations; the Heat Equation

observed, and to improve the model one may need a better understanding ofthe possible behaviour of buyers and sellers. However, when EINSTEIN workedon the subject in 1905, I believe that he did not point out the nonphysicalcharacter of jumps in position, which require an infinite speed, while jumpsin velocity are more realistic, although they require an infinite acceleration,so that instantaneous jumps in velocity are an idealization, as I shall discusslater in the study of collisions.

Since the work of POINCARE on relativity, before that of EINSTEIN, it isunderstood that no information can travel faster than the velocity of lightc. Some physicists dispute that idea, mostly because they do not understandwhat errors have been made in inventing the rules of quantum mechanics. Afirst error is to rely on the Schrodinger equation, which has the same defectas the Fourier heat equation invented more than a century earlier, that it isa postulated equation which corresponds to physical models where one has(unknowingly) let the velocity of light c tend to +∞, and that it is a logicalmistake to use the real value of c in these models; a second error is thatonly waves exist at a microscopic level and that it is because physicists weresticking to 19th century ideas, like using particles for describing what happensin a gas, which were wrongly imposed on 20th century physics, that some ofthe silly games of quantum mechanics have been invented.7

According to the basic point of view on physics just described, there areno jumps in position in nature, but the “Brownian” motion is just one wayto derive the heat equation

ut − κΔu = 0, (8.9)

whose defects I shall emphasize. It was WIENER who developed the mathe-matical theory which most mathematicians call Brownian motion,8 and whichI call “Brownian” motion to recall that it is not related to what BROWN hadobserved,9 and which probabilists like, a little too much in my opinion, butwhat BROWN had observed were jumps in velocity, which are related to the7 Mathematically, the problem is not to start from some Hamiltonian and derive a

partial differential equation like the Schrodinger equation, in too many variablesso that some other dogma has to be used for getting a reasonable equation withx ∈ R

3 or (x,v) ∈ R3 × R

3, but to start from semi-linear hyperbolic systems,like the Dirac equation (from which the Schrodinger equation can be derived byletting the velocity of light c tend to +∞), and to show that in the limit of infinitefrequencies waves are reasonably described by simpler models, for which one mayuse the interpretation of “idealized particles”, elementary or not.

8 Norbert G. WIENER, American mathematician, 1894–1964. He had worked atMIT (Massachusetts Institute of Technology), Cambridge, MA.

9 It may have been called Brownian motion by EINSTEIN, who was obviouslynot such a good physicist that he could mistake jumps in velocity for jumpsin position.

Page 92: From hyperbolic systems to kinetic theory: a personalized quest

8 The Burgers Equation: Small Perturbations; the Heat Equation 67

Fokker–Planck equation,10 where the unknown is a function of position x,velocity v, and time t,

ft − v.fx − κΔvf = 0, (8.10)

but there is an unfortunate tendency now, which may have started amongprobabilists, to call the Fokker–Planck equation any diffusion equation witha drift, even if there is only one variable x and no velocity variable v.11

The Fokker–Planck equation can be created by a different probabilistic game,named after ORNSTEIN and UHLENBECK.12,13

Forgetting now about the question of physical relevance of the model, theheat equation can be solved by convolution (in x) of the initial datum with anelementary solution E, which one computes easily, either by using the Fouriertransform, or by looking for a radial function of the form

E(x, t) ={

1tN/2 f

(r√κ t

)for t > 0

0 for t < 0, (8.11)

and this form is guessed from the fact that the heat equation is invariant byrotation and by the scalings a u(b x, b2 t), and that to have the integral of u in-dependent of t one must choose a = bN ; of course, κ has the dimension length2

time−1, and although mathematically the argument of scaling introduces thequantity x√

t, it is better to use x√

κ t, which is a dimensionless quantity; writing

that E satisfies the heat equation for t > 0 gives a differential equation in f ,easily integrated, and a constant of integration is imposed by the fact thatone wants to have

∫RN E(x, t) dx = 1 for t > 0 (so that E(·, t) converges to δ0

as t tends to 0), and this gives

E(x, t) ={

1(4π κ t)N/2 e

− |x|24κ t for t > 0

0 for t < 0. (8.12)

10 Adriaan Daniel FOKKER, Indonesian-born Dutch physicist and composer, 1887–1972. He had worked in Leiden, The Netherlands. He wrote music under thepseudonym Arie DE KLEIN.

11 I wonder if this is intentional sabotage, because those who use this terminologyforget to mention the difference between a diffusion in space and a diffusion invelocity, and their wrong use of a name from kinetic theory only induces studentsto talk about things that they do not know, as they are told nothing about kinetictheory.

12 Leonard Samuel ORNSTEIN, Dutch physicist, 1880–1941. He had worked inUtrecht, The Netherlands.

13 George Eugene UHLENBECK, Indonesian-born Dutch physicist, 1900–1988. Hereceived the Wolf Prize (in Physics) in 1979, for his discovery, jointly with thelate S. A. GOUDSMIT, of the electron spin, jointly with Giuseppe OCCHIALINI. Hehad worked at University of Michigan, Ann Arbor, MI, in Utrecht, The Nether-lands, at Columbia University, New York, NY, at MIT (Massachusetts Instituteof Technology), Cambridge, MA, at Princeton University, Princeton, NJ, and atthe Rockefeller Institute, New York, NY.

Page 93: From hyperbolic systems to kinetic theory: a personalized quest

68 8 The Burgers Equation: Small Perturbations; the Heat Equation

At each positive time, the shape of E is that of a Gaussian, which isisotropic (i.e. invariant by rotations), but I shall show later a family of solu-tions of the heat equation which are anisotropic Gaussians, i.e. exponentialsof quadratic functions whose principal part is not necessarily proportional to|x|2.

It is important to notice that Gaussian functions arise naturally in a fewmathematical problems, which are not all related, and one should not deducethat if a Gaussian occurs there must be a diffusion behind; actually, stationarysolutions of the Boltzmann equation are Gaussian functions in velocity (thiscorresponds to independent work by MAXWELL and by BOLTZMANN), andthe physical rule behind the Boltzmann equation is not diffusion in velocity,as for the Fokker–Planck equation, and although the linear Fokker–Planckequation does not have constant coefficient,14 so that its Green function isnot translation invariant,15 it also involves Gaussians.

The heat equation has a smoothing effect, and this can be proven byobserving that E has all its derivatives in x integrable (with the L1 normsbeing powers of 1√

t). Although the Fokker–Planck equation has only a diffusion

in velocity, there is a combined effect of the transport part and the diffusionpart for smoothing (less rapidly) in all directions. I first heard about thisequation in the late 1960s at the Lions–Schwartz seminar, in a talk describinga work of Lars HORMANDER on a class of hypoelliptic operators (i.e. whosesolutions must be smooth where the data is smooth), and although his workwas very general, his article refers to a previous work of KOLMOGOROV onthe smoothness of solutions of the Fokker–Planck equation.16

Another simple approach for solving the heat equation is to use a numericalscheme, and for simplification I consider the one-dimensional heat equation

ut − κuxx = 0, (8.13)

and the explicit difference scheme

14 Physicists also use a nonlinear Fokker–Planck equation, either by deriving itfrom the Boltzmann equation, which is quite illogical, because the Fokker–Planckequation is about near collisions which are not well described by the postulatedBoltzmann equation, or by postulating a nonconstant diffusion in velocity for thepurpose of having Gaussians in velocity become solutions, which is dogma at itsworst: dogmas are always introduced by people who fear rational thinking andwant to force others to obey the rules that they believe in, the silly ones as wellas the interesting ones.

15 George GREEN, English mathematician, 1793–1841. He had been a miller andhad never held any academic position.

16 Andrey Nikolaevich KOLMOGOROV, Russian mathematician, 1903–1987. He re-ceived the Wolf Prize in 1980, for deep and original discoveries in Fourier analy-sis, probability theory, ergodic theory and dynamical systems, jointly with HenriCARTAN. He had worked in Moscow, Russia.

Page 94: From hyperbolic systems to kinetic theory: a personalized quest

8 The Burgers Equation: Small Perturbations; the Heat Equation 69

Un+1i − UniΔt

− κUni−1 − 2Uni + Uni+1

(Δx)2= 0 for i ∈ Z, n ≥ 0, (8.14)

and the choice of the discretization is explained by using the Taylor expan-sion,17 for a smooth function.18 Writing the scheme as

Un+1i = αUni−1 + (1 − 2α)Uni + αUni+1, with α =

κΔ t

(Δx)2, (8.15)

one sees that the conditionκΔ t

(Δx)2≤ 1

2(8.16)

is a sufficient condition to obtain a nonnegative approximation for a nonneg-ative datum, i.e. that U0

i ≥ 0 for all i implies Uni ≥ 0 for all i ∈ Z, n ≥ 0, butalso gives stability in L1(R) and L∞(R), and therefore in Lp(R) by interpola-tion for every p ∈ [1,∞]; this condition will be seen to be necessary to obtainstability in L2(R), and the proof uses Fourier series.

There is a probabilistic interpretation of the preceding scheme, related toa random walk where one jumps from i to i − 1 or to i + 1 with probabilityα, and remaining at i with probability (1 − 2α), all these jumps being doneindependently, and Uni can be expressed as the expectation of U if one startsat n = 0 with the values U0

j for all j; I consider the idea of “Brownian” motionrelated to this remark.

It should be mentioned that FEYNMAN has developed probabilistic ideasfor the Schrodinger equation, mixed with ideas of diagrams, which no math-ematician understands.19

A first observation is that one can use difference schemes similar to (8.14)for the Schrodinger equation, and that there is a similar stability condition,but the analysis does not rely on positivity or probabilities, and it is based onL2 estimates proven by using a Fourier series; actually, very few schemes usedin numerical analysis can be interpreted by probabilistic arguments, and I seeas an important limitation that probabilists only work on partial differentialequations showing positivity properties.

A second observation comes from my work in the early 1980s about nonlo-cal effects induced by homogenization, which I had started because I thought17 Brook TAYLOR, English mathematician, 1685–1731. He had worked in London,

England.18 If ϕ is smooth, then ϕ(x±Δx) = ϕ(x) ±Δxϕ′(x) + (Δ x)2

2ϕ′′(x) + O

((Δx)3

),

which shows that ϕ(x−Δ x)−2ϕ(x)+ϕ(x+Δ x)

(Δ x)2= ϕ′′(x) +O(Δx).

19 I have heard propositions to have a complex time in the construction of “Brow-nian” motion, which I find silly. I have asked James GLIMM what to read forlearning about diagrams, and I interpreted his answer, that there was no articlefor that, as meaning that no one has definitions, and that it is a state of mind,hard to understand by mathematicians, who are not used to playing games beforelearning about their rules.

Page 95: From hyperbolic systems to kinetic theory: a personalized quest

70 8 The Burgers Equation: Small Perturbations; the Heat Equation

that the physicists’ rules of spontaneous absorption or emission of “particles”that have been invented for explaining spectroscopy experiments are just away of describing a nonlocal effect in an effective equation, whose form is stillunknown. In the late 1980s, when I tried to attack a nonlinear model, I stum-bled on a question of how to handle too many terms in an expansion, and howto explain the convergence of such an expansion, and I had the feeling thatFEYNMAN’s method of diagrams might have been his answer for describing anonlocal effect and handling a similar bookkeeping problem, so that his useof probabilities had nothing to do with the way they are used in “Brownian”motion.

My approach is not to try to understand the precise rules of the gamesthat physicists play, because they are able to play games without learningfirst about all the rules, so that they actually play lots of variants, and if avariant seems to give a good result they may incorporate some of its rulesinto their own game. Some mathematicians try to put some order into the listof all these rules that various physicists use, but that corresponds to beinginterested in physicists’ problems and not necessarily in physics questions.

I advocate trying to understand what the physics problems are, behind thegames that physicists play, and then developing the necessary mathematicaltools for solving the problem that one has selected, and checking meanwhileif one still has reasons to believe that it will be useful for explaining a piece ofphysics. One may also discover something for one purpose and once it is partlydone one may observe that it is useful for explaining something else, and forexample the theory of distributions of Laurent SCHWARTZ came out of thequestion of defining

∑n∈Z

cnei n x for coefficients cn with polynomial growth,

and once it was understood it explained some of the formal computations ofDIRAC.

There is a difference between research and development, and one shouldnot be dogmatic about the way to do research, because one looks in part fornew ideas, which no one may have thought of, but certainly one should beable to explain why one spends some time working on a problem.

[Taught on Friday September 14, 2001.]

Notes on names cited in footnotes for Chapter 8, GOUDSMIT,20 OCCHIALINI,21

ROCKEFELLER,22

20 Samuel Abraham GOUDSMIT, Dutch physicist, 1902–1978. He had worked atUniversity of Michigan, Ann Arbor, MI.

21 Giuseppe OCCHIALINI, Italian physicist, 1907–1993. He received the Wolf Prize(in Physics) in 1979, for his contributions to the discoveries of electron pair pro-duction and of the charged pion, jointly with George Eugene UHLENBECK. Hehad worked in Genova (Genoa) and in Milano (Milan), Italy.

22 John Davison ROCKEFELLER Sr., American industrialist and philanthropist,1839–1937. The Rockefeller Institute, New York, NY, is named after him.

Page 96: From hyperbolic systems to kinetic theory: a personalized quest

8 The Burgers Equation: Small Perturbations; the Heat Equation 71

Henri CARTAN,23 and for the preceding footnotes, E. CARTAN.24

23 Henri Paul CARTAN, French mathematician, born in 1904. He received the WolfPrize in 1980, for pioneering work in algebraic topology, complex variables, andhomological algebra, and inspired leadership of a generation of mathematicians,jointly with Andrey N. KOLMOGOROV. He worked in Paris and at UniversiteParis-Sud, Orsay, France, retiring in 1975 just before I was hired there. Theoremsattributed to CARTAN are often the work of his father E. CARTAN.

24 Elie Joseph CARTAN, French mathematician, 1869–1951. He had worked in Paris,France.

Page 97: From hyperbolic systems to kinetic theory: a personalized quest

9

Fourier Transform; the Asymptotic Behaviourfor the Heat Equation

An important tool for studying partial differential equations with constantcoefficients (like the heat equation ut− κΔu = f in the whole space x ∈ R

N ,with usually N = 1, 2, 3 in applications, and with an initial condition) is theFourier transform. Initially, using Laurent SCHWARTZ notation,1 one definesthe Fourier transform F , as well as F , for functions f ∈ L1(RN ) by

Ff(ξ) =∫

RN

f(x)e−2i π(x.ξ) dx and Ff(ξ) =∫

RN

f(x)e+2i π(x.ξ) dx, (9.1)

so that F f = Ff for every f ∈ L1(RN ), and one notices that

|Ff(ξ)| ≤∫

RN

|f(x)| dx for all ξ ∈ RN , i.e. ||Ff ||∞ ≤ ||f ||1, (9.2)

and the same properties hold for F (which is actually the inverse of F , an ad-vantage of Laurent SCHWARTZ notation). A simple application of the Lebesguedominated convergence theorem shows that Ff is continuous; once one hasshown that Ff tends to 0 at infinity when f ∈ C∞

c (RN ) (or S(RN )) onededuces by an argument of density that

f ∈ L1(RN ) implies Ff ∈ C0(RN ), (9.3)

where C0(RN ) is the space of continuous functions tending to 0 at infinity,which is a Banach space,2 when equipped with the sup norm; the same prop-erty holds for F .1 Specialists of harmonic analysis do not put the coefficient 2π in the definition,

and a factor involving π appears in their inverse Fourier transform; one shouldthen be careful in comparing formulas from various books, but at the end onefinds the same solutions, of course. I do prefer the symmetry which appears inLaurent SCHWARTZ notation, but I am a little biased, because I was a studentof Laurent SCHWARTZ at Ecole Polytechnique in 1965–1966, and I have alwaysused his notation.

2 Stefan BANACH, Polish mathematician, 1892–1945. He had worked in Lwow (thenin Poland, now Lvov, Ukraine). There is now a Stefan Banach International

Page 98: From hyperbolic systems to kinetic theory: a personalized quest

74 9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation

If f is of class C1 with f and ∂f∂xj

belonging to L1(RN ), an integration byparts shows that

F ∂f

∂xj(ξ) = 2i π ξjFf(ξ) for all ξ ∈ R

N , (9.4)

while if f is of class C1 with f and |x| f belonging to L1(RN ), Lebesguedominated convergence shows that Ff is of class C1 and

F(−2i π xjf)(ξ) =∂(Ff)∂ξj

(ξ) for j = 1, . . . , N and all ξ ∈ RN . (9.5)

Reiteration of these two properties led Laurent SCHWARTZ to introducethe Schwartz space S(RN ) of C∞ functions f such that for every derivativeDα of any order and every polynomial P of any degree one has P Dαf ∈L1(RN ) (or equivalently all these products are asked to be bounded); thenthe Fourier transform maps continuously S(RN ) into itself (as well as itsinverse F).3 Laurent SCHWARTZ extended then Fourier transform to S′(RN )(whose elements are called tempered distributions), the dual of S(RN ), bynoticing the formula∫

RN (Ff)g dξ =∫

RN×RN f(x)g(ξ)e−2i π(x.ξ) dx dξ =∫

RN (Fg)f dxfor all f, g ∈ L1(RN ),

(9.6)

which is proven by Fubini theorem,4 and defined the extension by

〈FT, ϕ〉 = 〈T,Fϕ〉 for all T ∈ S′(RN ) and all ϕ ∈ S(RN ). (9.7)

As derivations and multiplication by polynomials map S(RN ) into itself, onededuces that

F ∂T∂xj

= 2i π ξjFT and F(−2i π xjT ) = ∂(FT )∂ξj

for all T ∈ S′(RN ) and j = 1, . . . , N.(9.8)

As 1 ∈ S′(RN ) and ∂1∂xj

= 0 for j = 1, . . . , N , one deduces that ξjF1 = 0for j = 1, . . . , N , and this implies that F1 = Aδ0, and one finds that A = 1by using the Gaussian function G(x) = e−π |x|2 , whose Fourier transform is

Mathematical Centre in Warsaw, Poland. The term Banach space was introducedby FRECHET.

3 The natural topology for the Schwartz space S(RN), defined by the family ofnorms ||xβ Dαf ||L1(RN ) for all multi-indices α, β, does not make it a Banachspace, but only a Frechet space, i.e. a locally convex space which is a completemetric space (I do not know who introduced the term Frechet space). The Fouriertransform F is continuous, together with its inverse F .

4 Guido FUBINI, Italian-born mathematician, 1879–1943. He had worked in Cata-nia, in Genova (Genoa), and in Torino (Turin), Italy, and then in New York,NY.

Page 99: From hyperbolic systems to kinetic theory: a personalized quest

9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation 75

FG(ξ)=e−π |ξ|2 ; this last result follows from the fact that ∂G∂xj

=−2π xj G for

j=1, . . . , N , which implies that ∂(FG)∂ξj

=−2π ξj FG for j=1, . . . , N , so that

FG(ξ)=B e−π |ξ|2 , and the fact that B=1 follows by taking ξ=0 and using∫Re−π x

2dx=1.5 The fact that the inverse of F is F , either from S(RN ) into

itself or from S′(RN ) into itself, is equivalent to the Plancherel formula6

RN

Ff(ξ)Fg(ξ) dξ =∫

RN

f(x)g(x) dx for all f, g ∈ S(RN ), (9.9)

which permits us to extend F to L2(RN ) into an isometry with inverseF . Indeed, using h(ξ) = Fg(ξ), the Plancherel formula means FFg =g for every g, i.e. FF = I on S(RN ) (which is the same as FF =I by complex conjugation); one has FFu(x) =

∫RN Fu(ξ)e+2iπ(x.ξ) dξ =∫

RN e+2iπ(x.ξ)

(∫RN u(y)e−2iπ(ξ.y) dy

)dξ, but the hypotheses of the Fubini theo-

rem are not satisfied for exchanging the order of integrations,7 and one noticesinstead that

FFu(x) = limn→∞∫

RN G(ξn

)Fu(ξ)e+2iπ(x.ξ) dξ

= limn→∞∫

RN

∫RN G

(ξn

)u(y)e2iπ(x−y.ξ) dy dξ

= limn→∞∫

RN u(y)nNG(n(y − x)

)dy = u(x),

(9.10)

where one has used a simple scaling property, that if f ∈ L1(RN ) and λ �= 0and g(x) = f(λx), then Fg(ξ) = 1

λN F( ξλ)

for all ξ ∈ RN , a particular case

of a linear change of variable in the definition of the Fourier transform, whosegeneral form shows that for f ∈ L1(RN ) and A ∈ L(RN ; RN ) invertible,

if g(x) = f(Ax) then Fg(ξ) =1

|det(A)| F((A−1)T ξ

)for ξ ∈ R

N ; (9.11)

one has also used that G is its own Fourier transform, and that the interme-diate result is the convolution of u by a smoothing sequence.

One important property of the Fourier transform is that it transformsconvolution into multiplication8

5 If I =∫

Re−π x2

dx, then I2 =∫

R×Re−π(x2+y2) dx dy, and using polar coordinates

(and dx dy = r dr dθ) it is∫∞0

∫ 2π

0e−π r2

r dr dθ =∫∞0

2π r e−π r2dr = 1.

6 Michel PLANCHEREL, Swiss mathematician, 1885–1967. He had worked in Gene-ve (Geneva), in Fribourg, and at ETH (Eidgenossische Technische Hochschule),Zurich, Switzerland.

7 It would make the formal quantity∫

RN e−2iπ(y−x.ξ) dξ appear, which physicistswrite as δ(y− x), and this corresponds to what happens if one wants to computeF1 as if it was defined by an integral; however, the result is consistent with thefact that F1 = δ0, which had to be proven otherwise, and for what concerns linearcomputations one can often (but not always!) transform a formal considerationinto a proof by using a suitable regularization.

8 As noticed by Laurent SCHWARTZ, derivations are convolutions with distributionshaving their support at 0, so that the property of F transforming derivations

Page 100: From hyperbolic systems to kinetic theory: a personalized quest

76 9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation

for f, g ∈ L1(RN ) and h(x) = f � g(x) =∫

RN f(y)g(x− y) dyone has Fh(ξ) = Ff(ξ)Fg(ξ) for all ξ ∈ R

N ,(9.12)

which is an easy consequence of the Fubini theorem, writing

Fh(ξ) =∫

RN

(∫RN f(x− y)g(y) dy

)e−2iπ(x.ξ) dx

=∫

RN

[g(y)e−2iπ(y.ξ)

(∫RN f(x− y)e−2i π(x−y.ξ) dx

)]dy

=∫

RN g(y)e−2iπ(y.ξ)Ff(ξ) dy = Ff(ξ)Fg(ξ).(9.13)

The formula F(f � g) = (Ff)(Fg) shows that FL1(RN ) is a multiplicativealgebra (continuously embedded into C0(RN ));9 the formula extends to f, g ∈L2(RN ), and using the fact that F is a surjective isometry from L2(RN ) ontoitself, one deduces that L2(RN ) � L2(RN ) = FL1(RN ).

The same formula holds for F , and using u = Ff and v = Fg, one deducesthat F(f � g) = Ff Fg = u v, so that F(u v) = FF(f � g) = f � g = Fu �Fv,but one should be careful about the hypotheses for u and v for such a formulaF(u v) = Fu � Fv to hold; the preceding proof works for u, v ∈ FL1(RN ),and it is easily proven also for u, v ∈ L2(RN ).10 As products and convolutionproducts are not defined for all distributions, it may happen that one side ofthe equality is defined while there is no clear way of defining the other sidedirectly; for example, the Heaviside function H is defined by H(x) = 0 for x <0 and H(x) = 1 for x > 0, so that H ∈ L∞(R) ⊂ S′(R), and as the derivativeof H is δ0, one finds that 2i π ξFH(ξ) = 1 so that FH(ξ) = 1

2i πpv.(

)+C δ0,

and the value of C is found if one notices that H − 12 is real and odd,11 so

that its Fourier transfom is purely imaginary and odd, giving C = 12 ; although

the product of H by itself is well defined, the definitions of convolutions fordistributions do not allow both distributions to be pv.1ξ .

Another approach for studying partial differential equations with constantcoefficients like the heat equation (or generalizations like ut−

∑Ni,j=1 Di,juxixj

into multiplications is the same property as F transforming convolutions intoproducts.

9 The word algebra comes from an Arabic word in the treatise Hisab al-jabr w’al-muqabala by AL KHWARIZMI, whose name has also been used for coining theword algorithm.

10 The formula is also true for u, v ∈ FL1(RN ) + L2(RN ), in which case the twosides of the formula belong to FL1(RN ) + L2(RN ) + L1(RN ).

11 For a smooth function f one defines f by f(x) = f(−x), and for a distribution Tone defines T by 〈T , ϕ〉 = 〈T, ϕ〉 for every ϕ ∈ C∞

c (RN), so that T even meansT = T and T odd means T = −T ; then δ0 is even and pv.

(1x

)is odd; pv. stands for

principal value, and pv.(

1x

)is the only odd distribution T in R satisfying x T =

1, and it is defined more precisely by⟨pv.(

1x

), ϕ⟩

= limn→∞(∫ −1/n

−∞ϕ(x)

xdx +

∫ +∞1/n

ϕ(x)x

dx)

for every ϕ ∈ C∞c (R), and this type of definition goes back to

CAUCHY, and was generalized by HADAMARD before Laurent SCHWARTZ put itin the framework of distributions.

Page 101: From hyperbolic systems to kinetic theory: a personalized quest

9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation 77

= 0, where the matrix D with entries Di,j is symmetric and positive definite,which corresponds to diffusion in an anisotropic medium), is the method ofelementary solutions; first I shall deduce it by using the Fourier transform, inthe simple case of the heat equation in an isotropic medium ut−κΔu = 0, inorder to simplify the computations. One applies a partial Fourier transform,in x alone; this means that one is looking for a solution u which is continuousin t,12 with values in S′(RN ) for example,13 and one finds that

(Fu)t + 4κπ2|ξ|2Fu = 0, , with initial datum Fu |t=0= Fu0, (9.14)

givingFu(ξ, t) = Fu0(ξ)e−4κ π2|ξ|2t for t ≥ 0, (9.15)

and this means thatu(·, t) = u0 � E(·, t), (9.16)

where the elementary solution E is the inverse Fourier transform of the Gaus-sian e−4κπ2|ξ|2t;14 as it is G(λ ξ) with λ =

√4κπ t, one finds

E(x, t) =1

(4κπ t)N/2e−|x|2/(4κ t). (9.17)

The elementary solution permits us to write the solution of ut−κΔu = fwith u |t=0= u0, by using the invariance of the equation by translations (in xand in t) and the linearity of the equation, and one obtains

u(x, t) =∫

RN

E(x−y, t)u0(y) dy+∫ t

0

(∫

RN

E(x−y, t−s)f(y, s) dy)ds. (9.18)

Laurent SCHWARTZ explained such formulas in the framework of his theoryof distributions by noticing that, once one extends functions by 0 for t < 0,i.e.

u(x, t) ={

0 for t < 0u(x, t) for t > 0, (9.19)

one hasut − κΔ u = f + u0 ⊗ δ0 (9.20)

12 Some kind of continuity is required in order to give a meaning to the value at aparticular time, so that imposing an inital datum makes sense.

13 One could also look for a solution in S ′(RN × R). One should notice that the

functions eα t+∑

jβjxj

are solutions of the equation if α = κ∑

jβ2

j , but exceptif α = 0 and βj = 0 for all j, these functions are not tempered distributions, andone must be careful about talking of their Fourier transform.

14 One should say an elementary solution, but if one asks that it be 0 for t < 0 andthat it belongs to S ′(RN × R), then one finds only one.

Page 102: From hyperbolic systems to kinetic theory: a personalized quest

78 9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation

and that the elementary solution is satisfying15

Et − κΔ E = δ(0,0) = δ0 ⊗ δ0, (9.21)

and the formula for u is just the same as

u = E �(x,t) (f + u0 ⊗ δ0), (9.22)

where �(x,t) serves to emphasize that the convolution is in both variables xand t. Another way to write such formulas is the semi-group approach, whereone defines S(t) acting on functions in x for t > 0 by

S(t)v = E(·, t) �x v, (9.23)

and the formula becomes

u(·, t) = S(t)u0 +∫ t

0

S(t− s)f(·, s) ds. (9.24)

There is another way to discover the formula for the elementary solutionE, which is to observe that the equation is invariant by the group of transfor-mations b u(aP x, a2t), where a, b ∈ R and P ∈ SO(N) is a rotation, and thatby formally integrating in x one wants the integral

∫RN u(x, t) dx to be inde-

pendent of t, and the integral is conserved by the transformation if b = aN ; aself-similar solution is then such that16

u(x, t) = aNu(aP x, a2t) for all a, P means u(x, t) =1

tN/2f( |x|√

κ t

). (9.25)

Using the notation σ = x√κ t

, the equation is −N2 f− σ

2 f′−f ′′−N−1

σ f ′ = 0, andit is natural to start with the caseN = 1, which can be integrated immediatelyinto σ

2 f − f ′ = A, giving f = B e−σ2/4 for A = 0, the other solutions being

singular at the origin; then one observes that the coefficient of N is 0 for thatparticular solution, so it is a solution for every N .

There is a larger class of explicit solutions of the heat equation, which Icall anisotropic Gaussians, for which the action of the group of translations15 In a tensor product μ⊗ ν, μ is a Radon measure or a distribution in x while ν is

a Radon measure or a distribution in t; of course, one may write δ0(x) and δ0(t)in order to see more clearly what variables are used, but it is better to avoid thephysicists’ notation of Dirac masses as if they were functions. The uncertaintyabout notation is analogous to that created by denoting by 0 the null vector inevery vector space, but engineers and physicists often like to write u for a vectorand u for a tensor, etc., while mathematicians only mention a problem of notationwhen a formula could be naturally interpreted in different ways.

16 The reason why one prefers x√κ t

to x√t

is that it is a quantity without dimension,

because the only parameter κ appearing in the equation has a dimension length2

time−1.

Page 103: From hyperbolic systems to kinetic theory: a personalized quest

9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation 79

and rotations is nontrivial, and I shall describe them in a moment, but thereason why I was led to study this class in the mid 1980s was a remark foundin a book by ZEL’DOVICH & RAIZER,17,18 concerning a quasi-linear versionof the equation (used for high-temperature phenomena). Considering the casef = 0, one has a first L∞(RN ) bound

|u(x, t)| ≤ C0

tN/2for x ∈ R

N , t > 0, (9.26)

and C0 can be taken proportional to ||u0||L1(RN ), but ZEL’DOVICH & RAIZER

argued that for large time all the information contained in the initial datumhas diffused far away, and from afar the support of u0 looks like a point, so thesolution looks like the elementary solution, and this idea leads to the betterapproximation for large t,

|u(x, t) −M E(x, t)| ≤ C1

t(N+1)/2for x ∈ R

N , t > 0, where M =∫

RN

u0 dx;

(9.27)then they argued that there is no reason to put the information at an arbitrarypoint like 0, and because in the case where u0 is thought of as a density ofmass the only natural point is the centre of mass, one is led in the case M �= 0to a better approximation for large t,

|u(x, t) −M E(x− x∗, t)| ≤ C2t(N+2)/2 for x ∈ R

N , t > 0,where M x∗k =

∫RN xku0(x) dx for k = 1, . . . , N ;

(9.28)

then they argued that when t is large there is no reason to select the particulartime t = 0, and that by comparing u to M E(x − x∗, t − t∗) with a suitablet∗ one may improve the asymptotic estimate; however, they were doing theircomputation in one space variable (for a nonlinear version ut − (um)xx = 0),but I was working in N space variables, and I found that their trick of choosingt∗ does not always work, because of a question of anisotropy. In order to do thislast step correctly, and to prove in a rigourous way the preceding statements,I found the following explanation.

Lemma 9.1. If u0 and v0 are such that∫

RN (1+ |x|k+1) |u0| dx <∞,∫

RN (1+|x|k+1) |v0| dx <∞ and

RN

(u0 − v0)P dx = 0 for all polynomials P of degree ≤ k, (9.29)

then, denoting by u and v the solutions with initial data u0 and v0, one has

|u(x, t) − v(x, t)| ≤ Ckt(N+k+1)/2

for x ∈ RN , t > 0. (9.30)

17 Yakov Borisovich ZEL’DOVICH, Russian physicist, 1914–1987. He had worked atLomonosov State University, Moscow, Russia.

18 Yuri Petrovich RAIZER, Russian physicist.

Page 104: From hyperbolic systems to kinetic theory: a personalized quest

80 9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation

Proof : One has |u(x, t)−v(x, t)| ≤ ∫RN |F(u−v)(ξ, t)| dξ, and F(u−v)(ξ, t) =

e−4κ |ξ|2tF(u0 − v0)(ξ). The hypothesis implies that the function w = F(u0 −v0) has bounded derivarives of order k+1 and that all its derivatives of order≤ k at 0 are 0, so that |F(u0 − v0)(ξ)| ≤ C|ξ|k+1 for all ξ ∈ R

N , and thenone needs to compute

∫RN |ξ|k+1e−4κ |ξ|2t dξ, for which the change of variable

ξ = η√t

gives the desired estimate. ��Lemma 9.1 is also valid if v0 is a Radon measure with finite total mass, and

the choice v0 = M δa with M =∫

RN u0 dx serves to have the same momentsof order 0, whatever the point a is, but the moments of order 1 agree if andonly if one chooses a = x∗ (in the case M �= 0), and in order to have thesecond moments agree, one must look at the analogue of a matrix of inertiaJ given by

Jj,k =∫

RN

(x− x∗j)(x− x∗k)u0(x) dx, for j, k = 1, . . . , N, (9.31)

which cannot agree with those of a Gaussian function v0(x) = M E(x −x∗,−t∗) (for a choice t∗ < 0), unless J is proportional to I. For more generalcases it is useful then to know explicit solutions for which the moments oforder up to 2 are known, and I used for that a family of solutions of the formv = equadratic(x). Besides

∫Re−π x

2dx = 1, which gives

∫Re−αx

2dx =

√πα by

rescaling, one also uses∫

Rx2e−αx

2dx =

√π

2α3/2 by integration by parts; forA symmetric positive definite and B symmetric, one deduces the followingformulas: ∫

RN e−(Ax.x) dx = πN/2√

det(A)∫RN (B x.x)e−(Ax.x) dx = πN/2

2√det(A)

trace(A−1B),(9.32)

these integrals being easily computed in an orthonormal basis of eigenvec-tors of A (using the invariance by rotations of the Lebesgue measure), thesecond integral appearing to be the first one multiplied by

∑Ni=1

Bi,i

2αiwhere

α1, . . . , αN are the eigenvalues of A, which one must write in an intrinsic wayas 1

2 trace(A−1B) in order for the formula to be valid in any orthonormal

basis. The choicev0(x) = a e−

(A(x−x∗).(x−x∗)

)(9.33)

has the same moments of order up to 2 than u0 if a πN/2√det(A)

= M and

a πN/2

2√det(A)

trace(A−1B) = trace(J B) for every B, i.e. a πN/2

2√det(A)

A−1 = J ,

giving JM = A−1

2 or A = M2 J

−1, and then a = M√det(J)

(M2π

)N/2.

It remains to compute in an explicit way the solution with initial datumv0, and this can easily be done by Fourier transform, but by looking directlyat solutions of the form equadratic(x) with a quadratic form having coefficients

Page 105: From hyperbolic systems to kinetic theory: a personalized quest

9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation 81

in t, one can solve more general equations of the form19

ut −N∑

i,j=1

Di,juxixj +N∑

k=1

Bkuxk+ C u = 0, (9.34)

with coefficients Di,j ∈ P0, Bk ∈ P1 and C ∈ P2, where Pm denotes the spaceof polynomials in x of degree at most m, with coefficients depending upon t;this is due to the fact that if one uses the change of unknown function

u = eϕ, (9.35)

then ϕ satisfies a nonlinear partial differential equation

ϕt =N∑

i,j=1

Di,j(ϕxixj + ϕxiϕxj ) −N∑

k=1

Bkϕxk− C, (9.36)

and the right side shows a nonlinear operator that maps P2 into itself, and thepartial differential equation becomes an ordinary differential equation whenrestricted to P2, which one may solve explicitly.20 We write ϕ as

ϕ(x, t) = −(A(t)x.x) + 2(b(t).x) + c(t) (9.37)

with A(t) symmetric, and then if

D(t) is symmetricBk(x, t) =

∑Nj=1(B1)j,k(t)xj + (B0)k(t) for k = 1, . . . , N

C(x, t) = (C2(t)x.x) + 2(C1(t).x) + C0(t) with C2(t) symmetric,(9.38)

one has∑

i,j Di,j∂2ϕ

∂xi∂xj= 2∑i,j Di,jAi,j = 2trace(AD)

∑Ni,j=1 Di,j

∂ϕ∂xi

∂ϕ∂xj

=∑N

i,j=1 4Di,j(−Ax+ b)i(−Ax+ b)j= 4(ADAx.x) − 8(AD b.x) + 4(D b.b)∑N

k=1Bkϕxk=∑N

k=1 2(∑N

j=1(B1)j,k(t)xj + (B0)k(t))(−Ax+ b)k

= −2(B1Ax.x) + 2(B1x.b) − 2(Ax.B0) + 2(B0.b)

(9.39)

19 This form englobes the linear Fokker–Planck equation ft−v.fx−κΔvf = 0, wherethe “space” variable is (x,v), and in this case D is nonnegative but degenerate,and some coefficients Bk are linear in v; one may also add terms in (E+v×B).fvcorresponding to the Lorentz force, with E ∈ P1 and B ∈ P0. The computationsin this case permit one to write in an explicit way what the Green function is.

20 One could also do these computations after using a partial Fourier transform inx, but I prefer to show the computations in the way I present them here, becausethe idea extends to some nonlinear equations, for which the Fourier transform isnot a good tool; for example, it can also be used in linear cases with negativediffusion, where the solutions (local in time) may not belong to S ′(RN ).

Page 106: From hyperbolic systems to kinetic theory: a personalized quest

82 9 Fourier Transform; the Asymptotic Behaviour for the Heat Equation

and one deduces that A, b, c satisfy the differential systemdAdt = −4ADA+B1A+ABT1 − C2dbdt = −4AD b−BT1 b+AB0 − C1dcdt = −2trace(AD) + 4(D b.b) − 2(B0.b) − C0.

(9.40)

The first equation gives A, the second gives b and then the third gives c; in thecase where Bk = 0 for k = 1, . . . , N and C = 0, the equation for A becomesdAdt = −4ADA, or

d(A−1)dt

= 4D, i.e. A−1(t) = A−1(0) + 4∫ t

0

D(s) ds, (9.41)

showing that if A(0) is positive definite and D nonnegative then A(t) is posi-tive definite, but in the case where A is nonnegative and degenerate, then A(t)stays degenerate in the same subspace and positive definite on the orthogonalspace; in this particular case, one also deduces

d(A−1b)dt

= 0, (9.42)

which is the stationarity of x∗ (which is not always valid if there are lower-order terms in the equation). In the mid 1980s, Jean-Pierre GUIRAUD hadpointed out that my computations were variants of what was usually done,21

which must be to use the elementary solutions E and its derivatives of order1 or 2, i.e. E multiplied by suitable polynomials.

[Taught on Monday September 17, 2001.]

Notes on names cited in footnotes for Chapter 9, FRECHET,22 AL KHWARIZ-

MI,23 HADAMARD,24 LOMONOSOV,25 and for the preceding footnotes, AL

MA’MUN.26

21 Jean-Pierre GUIRAUD, French mathematician. He worked at Universite Paris VI(Pierre et Marie Curie), Paris, and at ONERA (Office National d’Etudes et deRecherches Aeronautiques), Chatillon, France.

22 Maurice Rene FRECHET, French mathematician, 1878–1973. He had worked inPoitiers, in Strasbourg and in Paris, France.

23 Abu Ja’far Muhammad ibn Musa AL KHWARIZMI (or KHAWARIZMI), “Iraqi”mathematician, 780–850. It is not known where he was born, but he had workedin an academy (bayt al-hikmah = house of wisdom) that the Caliph AL MAMUN

had set up in his capital Baghdad (now in Iraq), with the goal of translatingGreek philosophical and scientific works into Arabic.

24 Jacques Salomon HADAMARD, French mathematician, 1865–1963. He had workedin Bordeaux, in Paris, France, holding a chair (mecanique analytique et mecaniqueceleste, 1909–1937) at College de France, Paris.

25 Mikhail Vasilievich LOMONOSOV, Russian scientist, 1711–1765. He had workedin Moscow, Russia. Lomonosov State University, Moscow, Russia, is named afterhim.

26 Abu al-Abbas Abd Allah AL MA’MUN ibn Harun, 7th Caliph of the Abbasiddynasty, 786–833. He had ruled over the Muslim world from Baghdad, now inIraq.

Page 107: From hyperbolic systems to kinetic theory: a personalized quest

10

Radon Measures; the Law of Large Numbers

Gaussian functions occur in many situations, sometimes related to their ap-pearance for the heat equation, and a classical example is related to proba-bilities and the accumulated errors in independent experiments which is thesubject of this lecture, but there are other reasons, and their appearance forthe Boltzmann equation is different.

The bases of probability theory were laid by the work of FERMAT,1

PASCAL,2 and D. BERNOULLI,3 who were concerned with discrete events,and this corresponds to using Radon measures having a finite number of Diracmasses, while more general questions may involve Radon measures, or Borelmeasures.

Having studied partial differential equations, I prefer Radon measures,which are included in the theory of distributions of Laurent SCHWARTZ asdistributions of order ≤ 0, and for an open set Ω, M(Ω) is the dual of Cc(Ω),the space of continuous functions with compact support in Ω. Actually, Radonmeasures do not require the differential structure of R

N , and they can bedefined on a locally compact topological space which is σ-compact (i.e. acountable union of compact subsets). For a compact set K, M(K) is the dualof C(K), which is a Banach space with the sup norm, and a probability onK is any nonnegative Radon measure (i.e. 〈μ, ϕ〉 ≥ 0 whenever ϕ ≥ 0 inK) with total mass 1 (i.e. 〈μ, 1〉 = 1). In dealing with probabilities in thecontext of Radon measures in an open set Ω, one restricts attention to testfunctions in C0(Ω), the space of bounded continuous functions tending to 0at the boundary and at infinity, which is a Banach space with the sup norm,and its dual is the space Mb(Ω) of Radon measures with finite total mass. An

1 Pierre DE FERMAT, French mathematician, 1601–1665. He had worked (as alawyer and government official) in Toulouse, France.

2 Blaise PASCAL, French mathematician and philosopher, 1623–1662. The Univer-site de Clermont-Ferrand II, Aubiere, France, is named after him.

3 Daniel BERNOULLI, Swiss mathematician, 1700–1782. He had worked in St Pe-tersburg, Russia, and in Basel, Switzerland.

Page 108: From hyperbolic systems to kinetic theory: a personalized quest

84 10 Radon Measures; the Law of Large Numbers

important difference between Radon measures and Borel measures is that onecannot use them on spaces of continuous paths (because infinite-dimensionalBanach spaces are not locally compact), and although FEYNMAN has usedsuch spaces of paths in his computations, I think that it is a mistake to considerthat such notions are important in physics, and as “Brownian” motion can beavoided in dealing with the heat equation, I conjecture that using spaces ofpaths will be avoided, once one has understood what physicists are really afterwhen they play such games. Until proven wrong, I shall continue to teach thatit is Radon measures which are adapted to questions in continuum mechanicsor physics.

If for example one throws a coin n times and one counts the number j ofheads (and n− j of tails), an event is a list of length n of heads or tails (whichone may consider as being 0 or 1, or view such a list as a vertex of a cube{0, 1}n), and there are 2n events, all equally probable with probability 2−n ifthe coin tossing is not biased. To compute the probability of having j heads onemust count the number of subsets with j elements in a set of n elements, andthere are

(nj

)of them, so the probability is π(j) = 2−n

(nj

), for j = 0, . . . , n; of

course, one has∑n

j=0 π(j) = 1, which also follows from the Newton binomialformula (1 + x)n =

∑nj=0

(nj

)xj by taking x = 1. The average value of j is

∑n0 j π(j), which is obviously n

2 , as π(j) = π(n − j) for every j, but a moreanalytic way is to derive (1 +x)n so that n(1 + x)n−1 =

∑nj=0

(nj

)j xj−1, and

taking x = 1 gives∑n

j=0 j(nj

)= n 2n−1, or

∑nj=0 j 2−n

(nj

)= n

2 . Similarly,n(n−1)(1+x)n−2 =

∑nj=0

(nj

)j(j−1)xj−2 gives

∑nj=0 j

2(nj

)=∑nj=0 j

(nj

)+

n(n−1)2n−2 = n 2n−1+n(n−1)2n−2 = n(n+1)2n−2, so that if one computesthe average of

(j− n

2

)2, one finds 2−n(n(n+1)2n−2−n(n 2n−1)+ n2

4 2n)

= n4 ;

one sees then that the important values of j are of the form n2 + O(

√n),

and this type of scaling is general when one repeats a process with the sameprobabilities at each stage, independent of what happened before, and this isrelated to the law of large numbers. Hidden behind these computations is thefact that one has used repeated convolutions of a Radon measure by itself,here the Radon measure 1

2δ0 + 12δ1, and it is useful to review convolutions and

Fourier transform of Radon measures (when they are tempered distributions).

A Radon measure μ on an open set Ω ⊂ RN is a linear form ϕ �→ 〈μ, ϕ〉

defined on Cc(Ω), the space of continuous functions with compact support,which satisfies the (continuity) condition

for every compact K ⊂ Ω there exists a constant CK such that|〈μ, ϕ〉| ≤ CK maxx |ϕ(x)| for all ϕ ∈ Cc(Ω) having their support in K.

(10.1)A Radon measure is said to have finite total mass if one can take CK in-dependent of K, and the total mass is then the supremum of |〈μ, ϕ〉| for all

Page 109: From hyperbolic systems to kinetic theory: a personalized quest

10 Radon Measures; the Law of Large Numbers 85

ϕ ∈ Cc(Ω) of norm ≤ 1 (i.e. satisfying |ϕ(x)| ≤ 1 for all x ∈ Ω);4 in thiscase one finds easily that the mapping ϕ �→ 〈μ, ϕ〉 extends to ϕ ∈ Cb(Ω), theBanach space of bounded continuous functions, equipped with the sup norm.For μ a Radon measure with finite total mass in R

N , one can define its Fouriertransform Fμ by

Fμ(ξ) = 〈μ, e−2i π(·.ξ)〉, (10.2)

and the Lebesgue dominated convergence theorem shows that

Fμ ∈ Cb(RN ). (10.3)

For μ = δa, the Dirac mass at a, one has

Fδa(ξ) = e−2i π(a.ξ) for all ξ ∈ RN , (10.4)

showing that the Fourier transform may not tend to 0 at infinity.5 One hasFδ0 = 1, but in order to give a meaning to the formula F1 = δ0, one must usethe extension by Laurent SCHWARTZ of the Fourier transform to the space oftempered distributions S′(RN ).6

If μ1 and μ2 are two Radon measures with finite total mass, the convolu-tion product μ1 � μ2 is defined by 〈μ1 � μ2, ϕ〉 = 〈μ1 ⊗ μ2, ϕ(x + y)〉, wherethe tensor product μ1 ⊗μ2 is defined by 〈μ1 ⊗μ2, ϕ1 ⊗ϕ2〉 = 〈μ1, ϕ1〉〈μ2, ϕ2〉for all ϕ1, ϕ2 ∈ Cc(RN ).7 The convolution product cannot be defined for all

4 Some people use the term “bounded measure” for saying that a measure has afinite total mass, and this comes from the point of view of measuring sets, so thatfor all Borel sets A one has |μ(A)| ≤ K, and in that case they call its norm the“total variation” of the measure, as it is supA μ(A)− infB μ(B). The point of viewof measuring sets is not adapted to questions of continuum mechanics or physics,and many who use this point of view have actually advocated questions of fakemechanics.

5 There are cases of Radon measures with finite total mass which are not of theform f dx for f ∈ L1(RN ) but for which the Fourier transform nevertheless tendsto 0 at infinity.

6 This extension does not give a Fourier transform for all smooth functions; inparticular if f(x) = ex, one has f ′ = f , and if the Fourier transform could bedefined with the usual formula Ff ′(ξ) = 2i π ξFf(ξ), one would have (2i π ξ −1)Ff = 0, i.e. the support of Ff would be included in the set where 2i π ξ = 1,which is empty, so Ff would be 0, and an extension of the Fourier transformwhich is not invertible is useless (of course, one may then look for extensionswhich do not give distributions).

7 For continuous functions, Φ = ϕ1 ⊗ ϕ2 means Φ(x, y) = ϕ1(x)ϕ2(y) for all(x, y) ∈ X × Y , where X and Y are open sets of finite-dimensional spaces forexample; the definition for Radon measures is a natural extension. Every contin-uous function with compact support in X × Y can be approached uniformly bylinear combinations of tensor products, for example by using the Weierstrass the-orem (that on a compact set of R

N every continuous function can be approacheduniformly by polynomials, which are linear combinations of tensor products) and

Page 110: From hyperbolic systems to kinetic theory: a personalized quest

86 10 Radon Measures; the Law of Large Numbers

Radon measures, and it is an extension of the particular case δa � δb = δa+b,which shows that the group structure of R

N plays a crucial role (while it playsno role in the definition of the tensor product), and this definition extends im-mediately as a bilinear mapping by (

∑i αiδai) � (

∑j βiδbi) =

∑i,j αiβjδai+bj

if one imposes∑i |αi| < ∞ and

∑j |βj | < ∞; then every Radon measure

with finite total mass can be approached (in weak � topology) by combina-tions of Dirac masses, and the definition extends by continuity. For functionsf, g ∈ L1(RN ) the convolution formula is (f � g)(x) =

∫RN f(y)g(x − y) dy,

and it corresponds to the fact that the choice μ1 = f dx and μ2 = g dx givesμ1�μ2 = (f �g)dx, and the property that the Lebesgue measure dx is invariantby translation is crucial (for other locally compact groups, one needs to use aHaar measure for the group,8 and this general approach may have been doneby WEIL).

From a probabilistic point of view, one wants a probability μ to act onsome (measurable) sets, and one writes μ(A) for what is written as 〈μ, χA〉when μ is a nonnegative Radon measure with total mass 1, where χA is thecharacteristic function of A; of course, one must use the Lebesgue extensionthat one can use test functions which are not necessarily continuous, and Amust then be μ-measurable (i.e. a Borel set modulo a set of μ-measure 0).

Tensor products are then natural for dealing with independent probabili-ties, i.e. one has two probabilities μ1 and μ2 on sets X1 and X2 and one wantsto define a probability μ on X1 ×X2 such that μ(A1 × A2) = μ1(A1)μ2(A2)for all measurable subsets A1 ⊂ X1 and A2 ⊂ X2.

Convolution products appear natural when one deals with an Abelian (i.e.commutative) group G and X1 = X2 = G; if one measures a first valuez1 ∈ G with a probability μ1 and independently a second value z2 ∈ G witha probability μ2 and one wants to compute the law corresponding to z1 + z2,then one finds that it is μ1 � μ2.

If G = RN , and one measures n times the value z according to the same

probability μ, each measurement being independent of the preceding ones,then one is dealing with μ�μ�. . .�μ (with n terms in the convolution product);if one averages the result obtained z1+...+zn

n , one must use a rescaling. It is thennatural to wonder if z1+...+zn

n is a good approximation of a suitably definedaveraged value, and in what sense the sequence of rescaled laws converges.

truncation, and this shows that there is at most one Radon measure satisfyingthe imposed conditions. In order to compute 〈μ1 ⊗μ2, Φ〉 for Φ ∈ Cc(X×Y ), onelets μ2 act on the function Φ(x, ·), and this gives Ψ1(x) for a function Ψ1 ∈ Cc(X),and one then lets μ1 act on Ψ1, and by the uniqueness part the same result isobtained by letting μ1 act on the function Φ(·, y), giving Ψ2(y), and letting μ2 acton Ψ2.

8 Alfred HAAR, Hungarian mathematician, 1885–1933. He had worked in Gottin-gen, Germany, in Kolozsvar (then in Hungary, now Cluj-Napoca, Romania), inBudapest and in Szeged, Hungary.

Page 111: From hyperbolic systems to kinetic theory: a personalized quest

10 Radon Measures; the Law of Large Numbers 87

If μ is a nonnegative Radon measure of total mass 1 on RN , then in order

to define an average value, one asks that |x|μ has finite total mass, then onecan define its average, or centre of mass by

average(μ) = a means aj = 〈μ, xj〉 for j = 1, . . . , N (if 〈μ, 1〉 = 1) (10.5)

If (1 + |x|)μ1 and (1 + |x|)μ2 have finite total mass, then

x(μ1 � μ2) = (xμ1) � μ2 + μ1 � (xμ2)average(μ1 � μ2) = average(μ1) + average(μ2) (if 〈μ1, 1〉 = 〈μ2, 1〉 = 1).

(10.6)If μ is a nonnegative Radon measure on R

N with 〈μ, 1〉 = 1, and one definesμn = μ � . . . � μ (with n terms in the convolution product), then assumingthat (1 + |x|)μ has finite total mass and a = 〈μ, x〉, the centre of mass of μn

is at n a, and it is then natural to rescale μn and define νn by

〈νn, ϕ〉 = 〈μn, ϕn〉 = 〈μ � . . . � μ, ϕn〉 with ϕn = ϕ( ·n

), ϕ ∈ Cb(RN ), (10.7)

so that νn is a (bounded) sequence of nonnegative Radon measures, with totalmass 1 and centre of mass at a. This gives us the framework for the first partof the law of large numbers.

Lemma 10.1. Under the preceding hypotheses (μ ≥ 0, 〈μ, 1〉 = 1, 〈μ, |x|〉 <∞, 〈μ, x〉 = a), the sequence νn converges to δa in the weak � topology of X ′,with X = Cb(RN ), i.e.

limn→∞〈νn, ϕ〉 = ϕ(a) for all ϕ ∈ X = Cb(RN ). (10.8)

Proof : One must notice that X is not separable,9 and that the dual X ′ con-tains elements which are not distributions (in agreement with the fact thatC∞c (RN ) is not dense in X),10 so one starts by considering test functions

ϕ ∈ Y = C0(RN ); as C∞c (RN ) is dense in Y , the dual Y ′ is a space of distri-

butions, and it is actually the space of Radon measures with finite total mass.As Y is separable, on bounded sets of Y ′ the weak � topology is metrizable,and one can extract a subsequence νm which converges in Y ′ weak � to ν∞;9 For example, in the case N = 1, one may consider all functions f such thatf(n) ∈ {−1,+1} for n ∈ Z, the functions being extended to be affine continuousin each of the intervals [m,m+ 1]; the distance of two different functions of thisfamily is equal to 2, and because the family is not countable, one cannot coverX by a countable number of balls of radius < 1 (as such a ball contains at mostone function f from the family), so that X is not separable.

10 The sequence δn is bounded in X ′, so that it belongs to a weakly � compact subsetof X ′ by Alaoglu theorem, but none of the accumulation points of the sequenceis a distribution, because for every ϕ ∈ Y = C0(RN) one has 〈δn, ϕ〉 → 0, but 0is not an accumulation point because 〈δn, 1〉 → 1.

Page 112: From hyperbolic systems to kinetic theory: a personalized quest

88 10 Radon Measures; the Law of Large Numbers

once one will have shown that ν∞ = δa, one will deduce that all the sequenceνn converges to δa in Y ′ weak �.

The identification of ν∞ follows from the use of the Fourier transform,which is well defined because Y ′ ⊂ S′(RN ). One has Fμn(ξ) =

(Fμ(ξ))n,

and Fνn(ξ) = Fμn( ξn)

=(Fμ( ξn

))n; the hypotheses imply that |Fμ(ξ)| ≤ 1for all ξ ∈ R

N , Fμ(0) = 1, and Fμ is of class C1, with ∂(Fμ)∂ξj

(0) = −2i π ajfor j = 1, . . . , N , so that Fμ(ξ) = 1 − 2i π (ξ.a) + o(|ξ|) for |ξ| small, sothat Fνn(ξ) → e−2i π (ξ.a) for every ξ ∈ R

N , and because |Fνn(ξ)| ≤ 1, onededuces from the Lebesgue dominated convergence theorem that Fνn(ξ) →e−2i π (ξ.a) = Fδa(ξ) in Lploc(R

N ) strong for 1 ≤ p < ∞ and L∞(RN ) weak �and therefore in S′(RN ) weak �, so that ν∞ = δa.

Let ϕ0 ∈ Cc(RN ) be such that 0 ≤ ϕ0 ≤ 1 and ϕ0(x) = 1 for |x| ≤ |a|, then〈νn, ϕ0〉 → 〈δa, ϕ0〉 = ϕ0(a) = 1, so that 〈νn, ϕ0〉 = 1 − εn with 0 < εn andεn → 0. Then for ϕ ∈ Cb(RN ) one has 〈νn, ϕ〉 = 〈νn, ϕϕ0〉 + 〈νn, ϕ(1 − ϕ0)〉and 〈νn, ϕϕ0〉 → 〈δa, ϕϕ0〉 = ϕ(a)ϕ0(a) = ϕ(a) because ϕϕ0 ∈ C0(RN ) and〈νn, ϕ(1−ϕ0)〉 ≤ ||ϕ||Cb(RN )〈νn, (1−ϕ0)〉 = εn||ϕ||Cb(RN ) → 0 because νn ≥ 0and 1−ϕ0 ≥ 0; this shows that for every ϕ ∈ Cb(RN ) one has 〈νn, ϕ〉 → ϕ(a).

��If μ is a nonnegative Radon measure of total mass 1 on R

N , and |x|2 μ hasfinite total mass, then |x|μ has finite total mass by the Cauchy–Bunyakovsky–Schwarz inequality,11,12 but apart from this first appearance, I shall call thisinequality as it is known, the Cauchy–Schwarz inequality,13 and one definesthe matrix of inertia J of μ by

J(μ)j,k = 〈μ, (x−a)j(x−a)k〉 for j, k = 1, . . . , N (if 〈μ, 1〉 = 1 and 〈μ, x〉 = a).(10.9)

If (1 + |x|)2μ1 and (1 + |x|)2μ2 have finite total mass, then

xjxk(μ1 � μ2) = (xjxkμ1) � μ2 + μ1 � (xjxkμ2)+(xjμ1) � (xkμ2) + (xkμ1) � (xjμ2)

J(μ1 � μ2) = J(μ1) + J(μ2) (if 〈μ1, 1〉 = 〈μ2, 1〉 = 1).(10.10)

Then the matrix of inertia of μn is that of μ multiplied by n, and it is thennatural to translate and rescale μn and define πn by

〈πn, ψ〉 = 〈μn, ψn〉 with ψn(n a+√nσ) = ψ(σ), ψ ∈ Cb(RN ), (10.11)

11 Viktor Yakovlevich BUNYAKOVSKY, Ukrainian-born mathematician, 1804–1889.He had worked in St Petersburg, Russia.

12 Hermann Amandus SCHWARZ, German mathematician, 1843–1921. He hadworked at ETH (Eidgenossische Technische Hochschule), Zurich, Switzerland, andin Berlin, Germany.

13 It should indeed be attributed to BUNYAKOVSKY, who had studied with CAUCHY

in Paris (1825), and had proven the “Cauchy–Schwarz inequality” in 1859, 25years before SCHWARZ.

Page 113: From hyperbolic systems to kinetic theory: a personalized quest

10 Radon Measures; the Law of Large Numbers 89

so that πn is a (bounded) sequence of nonnegative Radon measures, withtotal mass 1, centre of mass at 0 and matrix of inertia J(μ). This gives us theframework for the second part of the law of large numbers.

Lemma 10.2. Under the preceding hypotheses (i.e. adding 〈μ, |x|2〉 < ∞,J(μ) = 〈μ, (x − a) ⊗ (x − a)〉), the sequence πn converges to π∞ in the weak� topology of X ′, with X = Cb(RN ), where π∞ is the (anisotropic) Gaussiandefined by

Fπ∞ = e−2π2(J(μ)ξ.ξ) for ξ ∈ RN . (10.12)

Proof : One starts by considering test functions ψ ∈ Y = C0(RN ), one extractsa subsequence converging to π∞ in Y ′ weak � by the Banach theorem,14 andone identifies π∞ by its Fourier transform. The relation between ψ and ψnconsists in translating μ of −a and μn of −n a, so one may assume that a = 0,and in that case the hypotheses imply that Fμ is of class C2 with Fμ(0) = 1,∂(Fμ)∂ξj

(0) = 0 for j = 1, . . . , N , ∂2(Fμ)∂ξj∂ξk

(0) = 4π2J(μ)j,k for j, k = 1, . . . , N , sothat Fμ(ξ) = 1 − 2π2(J(μ)ξ.ξ) + o(|ξ|2) for |ξ| small. Then (because a = 0)one has Fπn(ξ) =

(Fμ( ξ√n

))n → e−2π2(J(μ)ξ.ξ), and |Fπn(ξ)| ≤ 1, giving

Fπ∞(ξ) = e−2π2(J(μ)ξ.ξ) for ξ ∈ RN . One concludes in the case ψ ∈ Cb(RN )

by a truncation argument, like in Lemma 10.1. ��[Taught on Wednesday September 19, 2001.]

Notes on names cited in footnotes for Chapter 10, WEIERSTRASS,15 ALAO-

GLU.16

14 Y being a separable Banach space, the weak � topology on a bounded set of Y ′

is separable.15 Karl Theodor Wilhelm WEIERSTRASS, German mathematician, 1815–1897. He

had first taught in high schools in Munster and in Braunsberg, Germany, andthen worked in Berlin, Germany.

16 Leonidas ALAOGLU, Canadian-born mathematician.

Page 114: From hyperbolic systems to kinetic theory: a personalized quest

11

A 1-D Model with Characteristic Speed 1ε

In Lemma 10.1 and Lemma 10.2, describing the law of large numbers, thehypothesis that μ is nonnegative is not really necessary, and the proof hasmostly used 〈μ, 1〉 = 1 and |Fμ(ξ)| ≤ 1 for all ξ ∈ R

N ; if μ ≥ 0, then μn

is nonnegative and has total mass 1, and this was used for showing weak� convergence with test functions in C0(RN ) and then in Cb(RN ), while ifone replaces nonnegativity by |Fμ(ξ)| ≤ 1 for all ξ ∈ R

N , one deduces fromFμn(ξ) =

(Fμ(ξ))n that Fμn is bounded in L∞(RN ), and the proof holds

for test functions in FL1(RN ), which is included in C0(RN ).

This remark is useful when approximating partial differential equationswith constant coefficients by (one step) explicit finite-difference schemes ofthe form

Un+1i =

j

ajUni+j for i ∈ Z

N , n ≥ 0, (11.1)

where there is only a finite number of j ∈ ZN in the sum and the coefficients

aj depend explicitly on Δx and Δt; usually one lets Δx and Δt tend to 0in such a way that the coefficients aj do not change,1 and one must check theconsistency and stability of the scheme.

Consistency means that the scheme is adapted to the equation that onewants to solve, and it is checked by using the Taylor expansion of a smoothsolution, or by verifying that the scheme is exact for a precise family ofpolynomial solutions of the equations; for example, if one wants to solveut − κΔu = 0, one wants the scheme to be exact for the function 1 andfor the functions xk, k = 1, . . . , N , and this condition is the same for an equa-tion ut −

∑i,j Di,j

∂2u∂xi∂xj

= 0, and then one wants the scheme to be exact forall the polynomial solutions

∑i,j ci,jxixj − (2κ

∑i ci,i)t, but this condition

1 In a hyperbolic setting it means that Δ tΔ x

is fixed, and one must impose a CFLcondition, but in a parabolic setting it usually means that Δ t

(Δ x)2is constant, and

small enough for a stability condition to hold.

Page 115: From hyperbolic systems to kinetic theory: a personalized quest

92 11 A 1-D Model with Characteristic Speed 1ε

depends upon which diffusion tensor appears in the equation that one wantsto solve.

Stability consists in showing in advance that the numbers generated bythe algorithm (11.1) give a bounded sequence of approximations in a suitablenorm, and the �2 norm

(∑i |Uni |2

)1/2 plays a special role, in part becauseit gives a stability condition in L2(RN ) after defining a suitable function byinterpolation, but mainly because one has a necessary and sufficient conditionof stability in that norm (if the coefficients aj are kept fixed), which is thatthe function M defined by M(ξ) =

∑j aje

i j ξ satisfies |M(ξ)| ≤ 1 for allξ ∈ R

N .2 One then finds the same type of condition which appeared in theproof of the law of large numbers, with the difference that this approachworks for many partial differential equations with constant coefficients, evenif Gaussian functions play no role in their solution.

It has been seen that the Lax–Friedrichs scheme with the natural CFLcondition can be interpreted in terms of a random walk, and this classicalassociation of random walks and heat equation or other diffusion equations isoften considered but it is rarely mentioned that this has not much to do withthe physics of the phenomena which one tries to describe by diffusion models.

As I mentioned before, jumps in position are not physical, because theyinvolve infinite velocities, while jumps in velocity are reasonably good ap-proximations of what happens in collisions or in almost collisions, when thevelocities involved are very small compared to the velocity of light c. It wasconsidered natural by FOURIER to postulate that the heat flux is proportionaland opposite to the gradient of the temperature, because he knew that heatflows from hot regions to cold regions, and he could hardly have argued thattemperature is a statistical concept which has no meaning at a microscopiclevel, because such ideas only appeared at least fifty years after his work,with the introduction of ideas in kinetic theory of gases by MAXWELL and byBOLTZMANN; a particle does not have a temperature, but it has a velocityand it is the fact that not all particles have the same velocity which createsthe need for mesoscopic/macroscopic quantities like internal energy, to whichtemperature is related; although everyone has a clear intuition of what is hotand what is cold, it does not mean that one understands much about what

2 This is done by considering the Unj as the coefficients of a Fourier series∑

j∈ZN Unj e

2i π(j.ξ), defining a periodic function fn(ξ) (with period the unit cube);

the algorithm has then the form fn+1(ξ) = M(2π ξ)fn(ξ); the �2 norm is adoptedbecause of the Parseval theorem which states that it coincides with the L2 normon a period (BESSEL being only credited for proving an inequality).

Page 116: From hyperbolic systems to kinetic theory: a personalized quest

11 A 1-D Model with Characteristic Speed 1ε

93

is really going on at a microscopic level,3 and the Fourier law is only foundnatural because the class of equations considered is too restrictive.4

What happens with particles is not that they jump but that they changetheir velocity, because of interactions with their environment; for simplifica-tion, one uses models of collisions which are instantaneous (and whose resultuses probabilities), and it is the fact that one has neglected the time of inter-action which creates the impression that an infinite acceleration has occurred,resulting in a jump in velocity, and I shall come back to this question in moredetail later.

The subject of this lecture is to show how a linear model in which there arelarge velocities and jumps in velocities attributed to scattering, approximatesa diffusion in space, once one lets a characteristic velocity tend to infinity. Itis precisely the source of illogical statements made by physicists, that theydo not appreciate that c = +∞ in some postulated model, and that thesemodels cannot show that something travels faster than the real velocity oflight. From a logical point of view, I am not sure why some physicists insiston using a postulated equation like the Schrodinger equation for pretendingthat something may travel faster than the velocity of light c, when it is alreadya feature of the Fourier heat equation, which was postulated a century before!From a mathematical point of view it seems better to observe that the Fourierheat equation or the Schrodinger equation appears when one lets c tend toinfinity in more precise models, like the equation of radiative transfer, or theDirac equation. Of course, physicists consider that the Dirac equation is forone relativistic electron, but a scenario in which an electron would stop beforeentering a laboratory to ask if the experience it will participate in is relativisticor not (so that it will choose between solving the Dirac equation or solving theSchrodinger equation) does not seem too serious! Of course, physicists mayobject that “quantum particles” are quite strange objects, which may not bebothered by the silliness of the games that one attributes them, but now thatmathematicians have tools for understanding more about localized solutionsof hyperbolic systems, it is quite obvious that there are no particles at all,only waves.3 If one puts one hand on a marble table, one finds it cold, and one does not have

the same sensation with the other hand on a wooden table, although both tablesare at equilibrium at the temperature of the room; a classical explanation is thatmarble is a good conductor (of electricity and heat) and that it takes the heataway from the hand, which is at a higher temperature than the room (so thatI have assumed the room to be at a much lower temperature than 37 degreesCelsius), and therefore it is not a difference in temperature that the two handshave felt, but a difference in heat flux, created by a difference in conductivity.

4 Even with a more general class like pseudo-differential operators, one should ob-serve that one does not understand much yet about nonlinear effects of a micro-local nature, and one should consider most macroscopic models as approximationswhich must be improved later, once new and more adapted mathematical toolshave been introduced.

Page 117: From hyperbolic systems to kinetic theory: a personalized quest

94 11 A 1-D Model with Characteristic Speed 1ε

I consider the following model, which I first learnt how to treat in a math-ematical way in lectures by Jacques-Louis LIONS,

∂uε

∂t + 1ε∂uε

∂x + aε2 (uε − vε) = 0 in R × (0,∞), uε(·, 0) = ϕ in R

∂vε

∂t − 1ε∂vε

∂x − aε2 (uε − vε) = 0 in R × (0,∞), vε(·, 0) = ψ in R.

(11.2)

I learnt much later about what such models are supposed to represent (ina caricatural way), and the coefficient 1

ε serves as velocity of light c, anduε(x, t), vε(x, t) represent (nonnegative) densities of “photons” moving alongthe x axis, in the positive or negative direction; the term in uε − vε describesa scattering effect, which makes some “photons” change their direction by in-teracting at some high rate with the material environment. I must say thatphysicists’ ideas concerning photons look strange to me, because the quan-tification h ν for the energy of a “photon of frequency ν” only makes senseif light interacts with matter, because the Planck constant h is a couplingparameter between light and matter; when there is no matter, photons followthe Maxwell–Heaviside equation which is linear and they are propagated with-out interacting, and this is why I mention a material environment, withoutwhich I cannot understand the origin of scattering. Physicists say that pho-tons are bosons, i.e. they follow Bose–Einstein statistics,5 so that they mayappear spontaneously with a higher probability when there are already pho-tons present, but I am still not sure about what this rule means; physicistsuse this argument for a computation attributed to EINSTEIN, to “explain”Planck’s law for black body radiation (which has dependence on frequencyand on temperature, and seems to fit well with experimental measurements,apart from the missing frequencies of absorption in a gas), and that may bethe origin of that strange dogma.

In more general models of radiative transfer, one has an unknown func-tion f(x, t, ω, ν) which is a (nonnegative) density of photons at the lo-cation x ∈ R

3 and time t, moving in the direction of the unit vectorω ∈ S

2 and having frequency ν; the equation contains a free transport term∂f∂t + c

∑3j=1 ωj

∂f∂xj

, terms of absorption/emission, and terms of scattering;assuming a linear scattering effect independent of the frequency as a sim-plification,6 there would be a nonnegative kernel K(ω′ �→ ω) for switchingfrom direction ω′ to direction ω, and the equation would contain a term(∫

S2 K(ω �→ ξ) dξ)f(x, t, ω, ν) − ∫S2 K(ω′ �→ ω)f(x, t, ω′, ν) dω′.

For ε > 0 in our simplified model, there is a unique solution for ini-tial data ϕ, ψ ∈ L1

loc(R) if the coefficient a is measurable and (essentially)bounded on compact sets, by using the finite speed of propagation property.If ϕ, ψ ∈ Lp(R) for some p ∈ [1,∞], then if a is measurable and (essentially)bounded there is a unique solution uε, vε and the norms ||uε(·, t)||Lp(R) and

5 Satyendra Nath BOSE, Indian physicist, 1894–1974. He had worked in Dhaka(now capital of Bangladesh), and in Calcutta, India.

6 Uniformity of the stationary solutions in direction requires that∫

S2 K(ω �→ξ) dξ =

∫S2 K(ξ �→ ω) dξ for all ω ∈ S

2.

Page 118: From hyperbolic systems to kinetic theory: a personalized quest

11 A 1-D Model with Characteristic Speed 1ε

95

||vε(·, t)||Lp(R) grow at most in eC t/ε2. If one assumes that a ≥ 0, and ϕ, ψ ≥ 0,

then one has uε, vε ≥ 0 for t > 0, but without knowing the sign of ϕ, ψ one has∫R

[Φ(uε(x, t)

)+Φ(vε(x, t)

)]dx ≤ ∫

R

[Φ(ϕ(x))+Φ(ψ(x)

)]dx for every convex

Φ.

Lemma 11.1. If 0 < α ≤ a(x, t) ≤ β < ∞ a.e. x ∈ R, t > 0, and ϕ, ψ ∈L2(R), then uε and vε converge weakly to z in L2

(R× (0, T )

)for every T > 0

as ε→ 0, where z is the solution of

zt −(

12a zx

)

x= 0 in R × (0,∞)

z(·, 0) = 12 (ϕ+ ψ),

(11.3)

and uε−vε

ε converges weakly to − 1a zx in L2

(R × (0,∞)

).

Proof : Multiplying the first equation by uε and the second equation by vε,and integrating from 0 to T one obtains

12

∫R(|uε(x, T )|2 + |vε(x, T )|2) dx+ α

∫ T0

∫R

∣∣∣uε−vε

ε

∣∣∣2

dx dt

≤ 12

∫R(|ϕ(x)|2 + |ψ(x)|2) dx.

(11.4)

This shows that qε = uε−vε

ε stays in a bounded set of L2(R × (0,∞)

)and

that for every T <∞, uε and vε stay in a bounded set of L2(R× (0, T )

). One

may then extract a subsequence η → 0 such that uη and vη converge weaklyto z in L2

(R × (0, T )

)for every T < ∞, and qη converges weakly to q0 in

L2(R × (0,∞)

), and the reason why the weak limits of uη and vη coincide is

that uη − vη = η qη converges strongly to 0. From the fact that the limit zwill be identified, and that q0 = − 1

a zx, one deduces that the whole sequenceconverges weakly.

Adding the equations gives (uη + vη)t + (qη)x = 0, which shows at thelimit that 2zt + (q0)x = 0 and z |t=0= 1

2 (ϕ + ψ). Subtracting the equationsand multiplying by η gives η(uη − vη)t + (uη + vη)x + 2a qη = 0, which showsat the limit that 2zx + 2a q0 = 0, from which the equation for z follows. ��

The result extends to ϕ, ψ ∈ L1(R) +C0(R), because a function in such afunctional space can be decomposed into an element of L2(R), a small termin L1(R), and a small term in L∞(R), and the fact that a ≥ 0 is importantfor showing that small initial data in L1(R) or L∞(R) give uniformly smallsolutions in these spaces.

[Taught on Friday September 21, 2001.]

Notes on names cited in footnotes for Chapter 11. PARSEVAL.7

7 Marc-Antoine PARSEVAL DES CHENES, French mathematician, 1755–1836.

Page 119: From hyperbolic systems to kinetic theory: a personalized quest

12

A 2-D Generalization; the Perron–FrobeniusTheory

A simple generalization to a situation in R2 is to consider the following system,

where indexing the solutions with ε has been omitted for simplification:

∂u1∂t + 1

ε∂u1∂x + a

ε2 (+3u1 − u2 − u3 − u4) = 0 in R2 × (0,∞)

∂u2∂t − 1

ε∂u2∂x + a

ε2 (−u1 + 3u2 − u3 − u4) = 0 in R2 × (0,∞)

∂u3∂t + 1

ε∂u3∂y + a

ε2 (−u1 − u2 + 3u3 − u4) = 0 in R2 × (0,∞)

∂u4∂t − 1

ε∂u4∂y + a

ε2 (−u1 − u2 − u3 + 3u4) = 0 in R2 × (0,∞)

uj |t=0= vj in R2.

(12.1)

Of course, this is a caricature of a plane situation where only four directions areallowed for “photons” to move and where the scattering tends to equilibrateall four directions, but the limiting equation as ε tends to 0 does not inherit abiased behaviour towards the directions of the axes, and an isotropic diffusionterm will appear.

If the initial data vj belong to L2(R2) for all j, and 0 < α ≤ a(x, y, t) ≤β <∞, then multiplying the equation #j by uj , summing in j and integratingfrom 0 to T gives the estimate

12

∫R2

(∑4j=1 |uj(x, y, T )|2

)dx dy

+ αε2

∫ T0

∫R2

[4∑4j=1 |uj|2 −

(∑4k=1 uk)

2)]dx dy dt

≤ 12

∫R2

(∑4j=1 |vj |2

)dx dy,

(12.2)

and if one observes that 4∑j |uj |2 − (

∑k uk)

2 = 12

∑j,k |uj − uk|2, one sees

that uj stays in a bounded set of L∞(0, T ;L2(R2))

for all j and all T < ∞,and uj−uk

ε stays in a bounded set of L2(R

2 × (0,∞))

for all j, k. One thenextracts a subsequence uj,η such that

uj,η ⇀ z in L2(R

2 × (0, T ))

weak, for all j and all T <∞u1,η−u2,η

η ⇀ q1 and u3,η−u4,η

η ⇀ q2 in L2(R

2 × (0,∞))

weak, (12.3)

Page 120: From hyperbolic systems to kinetic theory: a personalized quest

98 12 A 2-D Generalization; the Perron–Frobenius Theory

and the fact that the whole sequence converges weakly will come from theidentification of z, q1, q2. Adding the four equations gives (u1,η +u2,η +u3,η +u4,η)t +

(u1,η−u2,η

η

)x

+(u3,η−u4,η

η

)y

= 0, and letting η tend to 0 one deduces4zt+(q1)x+(q2)y = 0 and z |t=0= 1

4 (v1 +v2 +v3 +v4); subtracting the secondequation from the first and multiplying by η gives η(u1,η − u2,η)t + (u1,η +u2,η)x + 4a u1,η−u2,η

η = 0 and letting η tend to 0 one deduces 2zx + 4a q1 = 0;subtracting the fourth equation from the third and multiplying by η givesη(u3,η − u4,η)t + (u3,η + u4,η)y + 4a u3,η−u4,η

η = 0 and letting η tend to 0 onededuces 2zy + 4a q2 = 0. This gives

zt −(

18a zx

)

x−(

18a zy

)

y= 0, i.e. zt − div

(18a grad(z)

)= 0,

z |t=0= v1+v2+v3+v44 ; q1 = − zx

2a ; q2 = − zy

2a .(12.4)

In order to generalize the preceding example to a more general situation, withx ∈ R

N , with a finite number of large velocities and with general transitionprobabilities between the different families travelling at one of these veloci-ties, it is useful to recall some results concerning discrete Markov processes,1

which can be derived from results in linear algebra, due to PERRON,2 and toFROBENIUS.

A discrete Markov process is a probabilistic setting in which there areonly a finite number of states, numbered from 1 to m, and probabilities oftransitions Pi,j from state #j to state #i, from time n to time n + 1, andwhat happens at time n is independent of what happened before, and whatthe integer n is.3 If at time n the probability of being in the state #j is xj ,for j = 1, . . . ,m, then at time n + 1 the probability of being in the state#i is

∑mj=1 Pi,jxj . If one denotes by P the matrix with entries Pi,j , and if

zni is the probability of being in state #i at time n, and zn is the vectorwith components zni , i = 1, . . . ,m, then one has zn+1 = P zn for every n, sothat zn = Pnz0. One has Pi,j ≥ 0 for i, j = 1, . . . ,m, and

∑mi=1 Pi,j = 1

for j = 1, . . . ,m, expressing that one must necessarily be in one of the m1 Andrei Andreyevich MARKOV, Russian mathematician, 1856–1922. He had

worked in St Petersburg, Russia.2 Oskar PERRON, German mathematician, 1880–1975. He had worked in Tubingen,

in Heidelberg, and in Munchen (Munich), Germany.3 This is in essence the same idea used in semi-group theory, that the state of a

system at time t is the only information that one needs in order to predict thefuture evolution of the system: if u(t) is the state of the system at time t, thestate of the system at time t+ s is S

(s;u(t)

), and S(s; v) is the state at time s if

one starts at time 0 with the system in state v; of course, one then has S(0; v) = vfor all v and S

(s;S(t; v)

)= S(s + t; v) for all v and all s, t ≥ 0, which in the

linear case is written as S(t; v) = S(t)v with S(0) = I and S(s + t) = S(s)S(t)for all s, t ≥ 0 (the term semi-group comes from the fact that if this was true forall s, t ∈ R one would have a group of transformations, but the transformationsare only defined for t ≥ 0).

Page 121: From hyperbolic systems to kinetic theory: a personalized quest

12 A 2-D Generalization; the Perron–Frobenius Theory 99

states; the last condition is written as PT1 = 1, where 1 is the vector withall components equal to 1, which is the same as having (zn+1.1) = (zn.1)whatever zn is.

Describing the asymptotic behaviour when n tends to ∞ requires thenan understanding of the eigenvalues of P with maximum modulus; 1 is aneigenvalue of PT and so of P , and by the Hadamard–Gershgorin theorem,4,5

the eigenvalues belong to the union of the closed discs Di, centred at Pi,i andwith radius Ri =

∑k =i |Pk,i|, which is 1 − Pi,i because the entries of P are

nonnegative, and all the eigenvalues then have a modulus ≤ 1. The Perrontheorem gives more precise information if Pi,j > 0 for all i, j = 1, . . . ,m, thatthe only eigenvalue of modulus 1 is 1, that it is a simple eigenvalue, and that aneigenvector e has all its coefficients > 0; if one normalizes e by (e,1) = 1, thenas n tends to ∞ the sequence zn converges to e (because z0 has nonnegativecoefficients, and (z0,1) = 1). The case where some entries Pi,j are 0 requiresan improvement due to FROBENIUS, and as it is better to describe this moregeneral case, I need to recall some results from linear algebra.6

Definition 12.1. A m × m matrix A, with entries in an arbitrary ring, isreducible if {1, . . . ,m} = I∪J , with I, J , nonempty and disjoint and Ai,j = 0for all i ∈ I and all j ∈ J . A is irreducible if it is not reducible.

In order to check if a matrix A is irreducible, one associates to it an orientedgraph with m vertices numbered from 1 to m, by putting an oriented arc fromvertex #i to vertex #j if and only if Ai,j �= 0. It is easy to check that A isirreducible if and only if there exists a closed path following the oriented arcsand going at least once through each of the vertices.7

4 HADAMARD remarked that if A is diagonally dominant, i.e. |Ai,i| >∑

j �=i|Ai,j |

for all i (or |Ai,i| >∑

j �=i|Aj,i| for all i), then A is invertible, while GERSHGORIN

expressed the same idea in a more geometrical way: if A is an m×m matrix withcomplex coefficients and λ is an eigenvalue of A, then there exists i ∈ {1, . . . ,m}such that |λ−Ai,i| ≤

∑j �=i

|Ai,j |; indeed, if x is a corresponding eigenvector and

i is such that |xi| ≥ |xj | for all j �= i, one has λxi = (Ax)i =∑

jAi,jxj , so that

|λ− Ai,i| |xi| = |∑j �=i

Ai,jxj | ≤∑

j �=i|Ai,j | |xj | ≤ (

∑j �=i

|Ai,j |)|xi|.5 Semyon Aranovich GERSHGORIN, Belarusian-born mathematician, 1901–1933.

He had worked in Petrograd/Leningrad, Russia.6 I was not taught these results as a student, perhaps because algebraists are not

so interested in them, probably because they use the order relation on R. It wasin my first year as an assistant professor at Universite Paris IX Dauphine in 1971that I learnt about them, because I had been asked to teach complements oflinear algebra from a book by GANTMACHER (translated into French). I realizedafterwards that probabilists do learn about these questions, probably mixed withideas about Markov processes, but it is useful to see that they are results of linearalgebra, which should be taught independently of any probabilistic framework.

7 One defines an equivalence relation by saying that i is equivalent to j if and onlyif either i = j or i �= j and there exists an oriented path going from vertex #i

Page 122: From hyperbolic systems to kinetic theory: a personalized quest

100 12 A 2-D Generalization; the Perron–Frobenius Theory

Lemma 12.2. Assume that an m×m matrix A has real nonnegative entries,then A is irreducible if and only if A = I +A+ . . .+Am−1 has all its entriespositive.

Proof : One has (A2)i,k =∑

j Ai,jAj,k ≥ 0, and (A2)i,k > 0 if and only if thereexists j with Ai,j > 0 and Aj,k > 0, i.e. if and only if there is a oriented path oflength 2 going from vertex #i to vertex #k, and more generally (Ap)i,k > 0 ifand only if there is a oriented path of length p going from vertex #i to vertex#k. If A is irreducible, then for i �= j there is a path going from vertex #i tovertex #j, and one may ensure that it has length ≤ m− 1 by cutting off theloops so that it goes at most once through each of the vertices, so Ai,j �= 0,and for i = j, one has Ai,i ≥ Ii,i = 1. If A is reducible, then Ai,j = 0 for alli ∈ I and all j ∈ J with I and J disjoint implies that for every p one has(Ap)i,j = 0 for all i ∈ I and all j ∈ J . ��

For a vector x, the notation x ≥ 0 will mean that xi ≥ 0 for all i, andx > 0 will mean that xi > 0 for all i (so that, if m > 1, it is not the samething as x ≥ 0 and x �= 0). The spectral radius ρ(A) of a matrix with complexentries is maxj |λj |, where the λj are the eigenvalues of A.

Proposition 12.3. Let A be irreducible with nonnegative entries. Then r =ρ(A) is a simple eigenvalue of A, for an eigenvector e > 0. If Ax ≥ αx withx ≥ 0 and x �= 0, then α ≤ r; if Ay ≤ β y with y ≥ 0 and y �= 0, then r ≤ β;if Az = λ z with z ≥ 0 and z �= 0, then λ = r.

Proof : Let Σ = {ξ ≥ 0, (ξ,1) = 1}, and Σ ={

η(η.1) ∈ Σ | η = A ξ with

ξ ∈ Σ}, which is well defined because ξ ≥ 0 and ξ �= 0 implies A ξ > 0. One

defines ϕ on Σ by ϕ(η) = minj(Aη)j

ηj, which is well defined and > 0 because

η ∈ Σ implies η > 0. The function ϕ is continuous on Σ and Σ is compact soϕ attains its maximum at a point e ∈ Σ with ϕ(e) = r > 0. By definition onehas Ae = r e + f with f ≥ 0, and one must have f = 0; indeed, one appliesA (which commutes with A), giving AA e = r A e+ A f and if one had f �= 0it would imply A f > 0 and therefore A f ≥ ε A e for some ε > 0, implyingϕ(η) ≥ r + ε for η = A e

(A e,1)∈ Σ, contradicting the maximality of r.

If Ax ≥ αx with x ≥ 0 and x �= 0, then one applies A and η = A x

(A x.1)∈ Σ

satisfies α ≤ ϕ(η) ≤ r. If λ is an eigenvalue of A, with eigenvector u, thenfor every j one has |λ| |uj | = |∑k Aj,kuk| ≤

∑k Aj,k|uk|, so that the vector

x defined by xj = |uj| for all j satisfies x ≥ 0, x �= 0 and Ax ≥ |λ|x, so that|λ| ≤ r, showing that r must be the spectral radius ρ(A).

to vertex #j and also an oriented path going from vertex #j to vertex #i. A isirreducible if and only if there is only one equivalence class, and when there ismore than one equivalence class it gives a way to choose what I and J are forshowing that A is reducible.

Page 123: From hyperbolic systems to kinetic theory: a personalized quest

12 A 2-D Generalization; the Perron–Frobenius Theory 101

As AT has nonnegative entries and is irreducible, there exists an eigenvec-tor e′ > 0 such that AT e′ = r e′ (as AT has the same eigenvalues and the samespectral radius than A). If Ay ≤ β y with y ≥ 0 and y �= 0, then taking thescalar product with e′ gives r (e′.y) = (AT e′.y) = (Ay, e′) ≤ β (e′.y), showingr ≤ β.

The eigenspace for r is one-dimensional, because if Af = r f and f �= 0,then one may consider that f is a real vector (or one takes the real part or theimaginary part of f) and then one can choose t ∈ R such that g = f + t e ≥ 0with one component gi = 0, but this implies that

∑j Ai,jgj = r gi = 0, and

therefore Ai,j �= 0 implies gj = 0; one deduces that gk = 0 if one can join ito k by following an oriented path on the graph associated to A, and as A isirreducible one finds g = 0. The algebraic multiplicity of r is one, because ifthere was an associated Jordan block,8 there would exist f (�= 0) such thatAf = r f + e, and for t ∈ R large enough, one would have g = f + t e ≥ 0,g �= 0 and Ag = r g + e, and applying A would show that η = A g

(Ag.1)∈ Σ

satisfies ϕ(η) > r, contradicting the maximality of r. ��PERRON had proven the preceding result in the case where Ai,j > 0 for all

i, j, and the fact that the irreducible character of A implies the same result isthe work of FROBENIUS, but PERRON had also shown that the only eigenvalueof modulus r is r itself, and that is not always the case in the irreducible case,but it only happens when the nonzero entries show a special pattern.

Proposition 12.4. Let A be irreducible with nonnegative entries, and suchthat A has an eigenvalue different from r which has modulus r. Then thereexists an integer p ≥ 2 and a partition of {1, . . . ,m} into p nonempty subsetsI1, . . . , Ip such that all the nonzero entries Ai,j satisfy i ∈ Ik and j ∈ Ik+1

for some k (with Ip+1 = I1). In that case the spectrum of A is invariant byrotation of 2π/p, and if μ is any eigenvalue of A and z is any pth root of unity(e2i j π/p for j = 0, . . . , p − 1) then z μ is an eigenvalue of A with the samealgebraic multiplicity than μ, so there are at least p− 1 simple eigenvalues ofmodulus r which are distinct from r, and if m is not a multiple of p, 0 must bean eigenvalue of A with an algebraic multiplicity n such that n = m (mod p).

Proof : Let λ = r e2i π θ with 0 < θ < 2π, and let u be a correspondingeigenvector normalized by

∑j |uj| = 1, and let x ∈ Σ be defined by xj = |uj|

for all j. Then one has r xj = |λuj | = |∑k Aj,kuk| ≤∑

k Aj,kxk for all j, i.e.Ax ≥ r x, and this implies Ax = r x and therefore x = e, because if it was nottrue then η = A x

(A x.1)∈ Σ would satisfy ϕ(η) > r. In order to have equality in

the triangle inequality that has been used, it is necessary that all the nonzeroterms Aj,kuk have the same argument than λuj , and this means that the

8 Marie Ennemond Camille JORDAN, French mathematician, 1833–1922. He hadworked in Paris, France, holding a chair (mathematiques, 1883–1883) at Collegede France, Paris.

Page 124: From hyperbolic systems to kinetic theory: a personalized quest

102 12 A 2-D Generalization; the Perron–Frobenius Theory

argument of uj increases of θ each time one follows one oriented arc alongthe graph associated to A, and there must be a smaller integer p > 1 suchthat p θ is a multiple of 2π. Multiplying u by a complex number of modulus1 so that u1 > 0, Ij is then defined as the subset of indices k such that theargument of uk is (j − 1)θ modulo 2π, and A has the required structure forits nonzero entries. Such a structure implies that if z is a pth root of unity thecharacteristic polynomial P (λ) = det(λ I − A) satisfies P (z λ) = zmP (λ) forall λ ∈ C,9 and that shows that the characteristic polynomial has the formλnQ(λp) with a polynomial Q such that Q(0) �= 0, and n = m (mod p), andthat μ and z μ always have the same algebraic multiplicity, and in particularall the eigenvalues of modulus r are simple. ��

Definition 12.5. If A is irreducible with nonnegative entries, it is said to beprimitive if the only eigenvalue of modulus r = ρ(A) is r, and imprimitivewith index p ≥ 2 if there are eigenvalues of modulus r different from r and ifp is the largest integer such that r e2i π/p is an eigenvalue.

Lemma 12.6. If A is irreducible with nonnegative entries, and q is the gcd(greatest common divisor) of the length of loops on the graph associated to A,then A is primitive if and only if q = 1, and if q > 1 then A is imprimitive ofindex q.

Proof : One has seen that if A has an eigenvalue r e2i π θ with 0 < θ < 2π,then there exists an integer p ≥ 2 and A has a block structure which impliesthat all loops have a length that is a multiple of p, so that the gcd of thelength of the loops is a multiple of p. If the gcd of the length of the loops isq > 1, one defines the subsets Ij , with j = 1, . . . , q by putting i ∈ Ij if andonly if there exists a path (along the graph associated to A) going from 1 to iand with length equal to j − 1 modulo q; the definition makes sense becauseif �1 and �2 are the lengths of two such paths and �3 is the length of a pathgoing from j to 1 (which exists because A is irreducible), then one has a loopof length �1 + �3 and a loop of length �2 + �3, both of which are multiples ofq and therefore �1 = �2 (mod q); this shows that A has a block structurewhich implies that its characteristic polynomial is of the form λnQ(λq), sothat r e2i π/q is an eigenvalue of A. ��

If one coefficient Ai,i �= 0 then there is a loop of length 1 and A is primitive.If A is imprimitive of index 2, then if a1, a2 are the sizes of I1, I2, one hasa1 + a2 = m and there are at most 2a1a2 nonzero entries of A, and the9 One starts from λ I −A and one multiplies the rows and columns by powers of z

in the following way: row i is multiplied by zk if i ∈ Ik and column j is multipliedby z1−k if j ∈ Ik, then one ends up with the diagonal entries being multiplied byz, and the entries with i ∈ Ik and j ∈ Ik+1 being multiplied by 1 (and one needszp = 1 so that the entries with i ∈ Ip and j ∈ I1 are not changed), and it doesnot matter what the other entries are multiplied by as they are 0. One ends upwith the matrix z λ I −A, and the determinant has been multiplied by zm.

Page 125: From hyperbolic systems to kinetic theory: a personalized quest

12 A 2-D Generalization; the Perron–Frobenius Theory 103

maximum possible for 2a1a2 is m2

2 if m is even (and both sizes are m2 ) and

m2−12 if m is odd (and the sizes are m−1

2 and m+12 ); if A is imprimitive of

index p (with 2 < p ≤ m) and a1, . . . , ap are the sizes of I1, . . . , Ip, then thereare at most a1a2 + a2a3 + . . .+ apa1 nonzero entries of A, and the maximumpossible for real nonnegative aj of sum m is when they are all equal to m

p and

the number of nonzero entries is then ≤ m2

p . Therefore one can conclude that

A is necessarily primitive if the number of its nonzero entries is > m2

2 (i.e.more than half of the entries are different from 0).10

The fact that A is primitive gives a simple description of the asymptoticbehaviour of the sequences obtained by iterating A.

Lemma 12.7. If A is irreducible with nonnegative entries and primitive, thenfor any w0 ≥ 0 and w0 �= 0 the sequence wn = Anw0 satisfies wn

rn → c e with

c > 0, and c = (e′.w0)(e′.e) , where e and e′ are positive eigenvectors of A and of

AT for the eigenvalue r = ρ(A) = ρ(AT ).

Proof : One decomposes Rm into two subspaces which are invariant by A,

the one-dimensional span of e and the subspace X = {x | (e′.x) = 0}; therestriction of A to X has a spectral radius r′ = ρ(A |X) < r, because byhypothesis all the eigenvalues of A different from r have a modulus < r. Onedecomposes w0 = c e+x0 with x0 ∈ X , and one has (e′.w0) = c (e′.e), showingthat c > 0; as wn = Anw0 = c rne + Anx0 and lim supn→∞ ||An||1/nL(X;X) =r′ < r one deduces that r−n||Anx0|| → 0. ��

In the case whereA is imprimitive with index p ≥ 2, one must introduce theeigenvectors ej for the eigenvalues r e2i j π/p for j = 1, . . . , p−1 and w0 = c e+∑p−1j=1 cjej + y, where y ∈ Y , a subspace invariant by A where the eigenvalues

have a modulus < r; then wn = Anw0 = rn(c e+

∑p−1j=1 cje

2i j n π/pej)+Any,

so that ωn = r−nwn looks like c e+∑p−1j=1 cje

2i j n π/pej and may have no limitif some coefficient cj is not 0. If one averages on p successive iterates, one findsthat ωn + . . .+ωn+p−1 → p c e as n→ ∞, and if one does not know the valueof p one finds that 1

n (ω1 + . . .+ ωn) → c e as n→ ∞.

10 Another characterization is that A is primitive if and only if Ak has all its entriespositive for some integer k ≥ 1. If Ak > 0, then A is irreducible and as Ak

has only one eigenvalue of maximum modulus rk, A cannot have more than oneeigenvalue of modulus r and it is primitive. Conversely if A is primitive, there aretwo loops on the graph of lengths �1, �2 with gcd 1, and if L is the length of aloop going at least once through all the vertices, there are such loops with lengthL+ a1�1 + a2�2 for all nonnegative integers a1, a2 and this covers all the integers≥ N for some integer N ; for each i, j there exists n with 1 ≤ n ≤ m − 1 suchthat An

i,j > 0 because there is a path of length n from i to j and therefore thereare paths of length n+ n′ for all n′ ≥ N , so that one can find k such that for alli, j = 1, . . . ,m, there is a path of length k from i to j, and this gives Ak > 0.

Page 126: From hyperbolic systems to kinetic theory: a personalized quest

104 12 A 2-D Generalization; the Perron–Frobenius Theory

[Taught on Monday October 1, 2001 (during the preceding week, I attendeda conference in Salamanca, Spain).]

Notes on names cited in footnotes for Chapter 12, GANTMAKHER.11

11 Feliks Ruvimovich GANTMAKHER, Ukrainian mathematician, 1908–1964.

Page 127: From hyperbolic systems to kinetic theory: a personalized quest

13

A General Finite-Dimensional Modelwith Characteristic Speed 1

ε

In the examples already studied of a linear hyperbolic system with velocities in1ε and scattering terms in 1

ε2 , there were a few special circumstances that madethe proof easy for showing that an isotropic diffusion appeared in the limit.We want to consider now a more general situation in R

N with an arbitrarynumber m of velocities and general probabilities of transition between thevarious families:

∂ui

∂t + 1ε

(Ci.gradx(ui)

)+ a

ε2

∑mj=1 Mi,juj = 0 in R

N × (0, T );ui(·, 0) = vi in R

N , i = 1, . . . ,m,(13.1)

where the Ci are constant vectors and the Mi,k are constant, but a maydepend upon x and t (and I omit an index ε for the ui). We first make thehypothesis that

a ∈ L∞(RN × (0, T )

), (13.2)

so that for any p ∈ [1,∞] one can deduce existence and uniqueness theoremsfor data in

(Lp(RN )

)m, and adding the hypothesis

a ≥ 0; Mi,j ≤ 0 for all i �= j, (13.3)

one deduces that nonnegative data create nonnegative solutions for t ≥ 0. Weassume that

m∑

i=1

Mi,j = 0 for j = 1, . . . ,m (13.4)

so that one has conservation of mass for bounded data with compact supportand this extends to give a uniform bound in L1 for nonnegative integrabledata. In order to obtain uniform bounds in L∞ and in L2 which are indepen-dent of ε > 0, one assumes that

M = (Mi,j)i,j=1,...,m is irreducible, (13.5)

and one uses the Perron–Frobenius theory.

Page 128: From hyperbolic systems to kinetic theory: a personalized quest

106 13 A General Finite-Dimensional Model with Characteristic Speed 1ε

Lemma 13.1. There exists a vector e with positive components such thatM e = 0.

Proof : One considers A = s I−M , with s ≥ maxiMi,i so that A is irreduciblewith nonnegative entries. By hypothesis one has MT1 = 0, so that AT1 = s1,showing that ρ(AT ) = s, and therefore there exists e > 0 such that Ae = s e,i.e. M e = 0. ��

This helps obtain uniform L∞ estimates (i.e. independent of ε), as one has

m−ei ≤ vi(x) ≤ m+ei a.e. x ∈ RN , for i = 1, . . . ,m implies

m−ei ≤ ui(x, t) ≤ m+ei a.e. x ∈ RN , t ∈ (0, T ), for i = 1, . . . ,m, (13.6)

and theorems for describing more general forward invariant sets will be shownin the following lecture. Uniform L2 estimates follow from the following result.

Lemma 13.2. There exists γ > 0 such that

m∑

i,j=1

1eiMi,jξiξj ≥ γ

m∑

i=1

∣∣∣ξi − (ξ.1)

(e.1)ei

∣∣∣2

for every ξ ∈ Rm. (13.7)

Proof : One considers the matrix B with entries Bi,j = s δi,j −Mi,j

√ej√ei

withs ≥ maxiMi,i, so that B is irreducible with nonnegative entries. If one definesf by fi =

√ei then one has B f = s f and BT f = s f . The symmetric matrix

Bsym = B+BT

2 is irreducible with nonnegative entries and has an eigenvectorf with positive components so s is the spectral radius of Bsym and becauseit is symmetric its eigenvalues are real, and therefore the eigenvalues differentfrom s are ≤ s − β for some β > 0. This implies that for each vector η onehas the inequality

((s I − Bsym)η.η

) ≥ β|η′|2 where η′ is the projection ofη on the orthogonal of f , i.e. η′ = η − (η.f)

|f |2 f ; the left side of the inequality

is((s I − B)η.η

)=∑mi,j=1 Mi,j

√ej√eiηiηj , and if one chooses ηi = ξi√

eithen it

is∑mi,j=1

1eiMi,jξiξj ; in order to evaluate the right side of the inequality, one

notices that |f |2 =∑i ei = (e.1) and therefore η′i = ηi −

∑jηj

√ej

(e.1)

√ei =

ξi√ei

− (ξ.1)(e.1)

√ei = 1√

ei

(ξ − (ξ.1)

(e.1)e)i, and the right side is β

∑i

1ei

(ξ − (ξ.1)

(e.1)e)2i,

which is ≥ γ∣∣ξ − (ξ.1)

(e.1)e∣∣2 if γ = mini βei

. ��We then deduce a uniform L2 estimate by multiplying the ith equation by

ui

eiand summing in i, which gives

∂t

( m∑

i=1

|ui|22ei

)+

N∑

j=1

∂xj

( m∑

i=1

(Ci)j |ui|22ε ei

)+γ

ε2

m∑

i=1

(ui− (u.1)

(e.1)ei

)2

≤ 0, (13.8)

implying by integration

Page 129: From hyperbolic systems to kinetic theory: a personalized quest

13 A General Finite-Dimensional Model with Characteristic Speed 1ε

107

RN

( m∑

i=1

|ui(x, t)|2ei

)dx ≤

RN

( m∑

i=1

|vi(x)|2ei

)dx for 0 ≤ t ≤ T, (13.9)

and∫ T

0

RN

1ε2

(ui − (u.1)

(e.1)ei

)2

dx dt ≤ C (independent of ε) for i = 1, . . . ,m.

(13.10)Using the uniform bounds obtained one can extract a subsequence indexed byη such that uηi converges weakly to u0

i for i = 1, . . . ,m, and 1η

(uηi − (uη .1)

(e.1) ei)

converges weakly to qi. Denoting z = (u0.1)(e.1) , one finds that u0

i = z ei for i =1, . . . ,m. In order to obtain the limiting equation, one sums all the equationsand, because of the hypothesis of conservation of mass, one obtains

∂t

( m∑

i=1

uηi

)+

N∑

j=1

∂xj

( m∑

i=1

(Ci)jη

uηi

)= 0, (13.11)

and one is led to impose the condition

m∑

i=1

(Ci)jei = 0 for j = 1, . . . , N, i.e.m∑

i=1

Ci ei = 0, (13.12)

so that the equation can be written as

∂t

( m∑

i=1

uηi)

+N∑

j=1

∂xj

( m∑

i=1

(Ci)jη

(uηi −

(uη.1)(e.1)

ei

))= 0, (13.13)

and gives at the limit η → 0

(e.1)∂z

∂t+

N∑

j=1

∂xj

( m∑

i=1

(Ci)jqi)

= 0; (e.1)z |t=0= v1 + . . .+ vm. (13.14)

Without the condition on the Ci, all the interesting information goes to in-finity, and if this condition is not satisfied, it tells at what velocity C∗

ε onemust travel in order to follow the interesting effects. In order to identify thefunctions qi one multiplies the ith equation by η and, using M e = 0, onewrites it as

η∂uηi∂t

+N∑

j=1

∂((Ci)ju

ηi

)

∂xj+a

η

m∑

k=1

Mi,k

(uηk −

(uη.1)(e.1)

ek

)= 0, (13.15)

giving at the limit η → 0

N∑

j=1

∂((Ci)jeiz

)

∂xj+ a

m∑

k=1

Mi,kqk = 0. (13.16)

Page 130: From hyperbolic systems to kinetic theory: a personalized quest

108 13 A General Finite-Dimensional Model with Characteristic Speed 1ε

This equation has a solution if and only if the vector R with componentsRi =

∑j∂[(Ci)jeiz]

∂xjbelongs to the range of M , i.e. is orthogonal to 1, because

1 generates the nullspace of MT ; this is indeed true as∑mi=1(Ci)jei = 0 for

j = 1, . . . , N . By construction the vector q with components qi is orthogonalto 1 and because the nullspace of M is generated by e which has positivecomponents (and is then not orthogonal to 1), there is a unique solution qorthogonal to 1 (one may also use the inequality

∑i,j

1eiMi,jξiξj ≥ γ|ξ|2 if

(ξ.1) = 0 for constructing the solution). The final equation is of the form

(e.1)∂z

∂t−

N∑

i,j=1

∂xi

(Di,j

a

∂z

∂xj

)= 0, (13.17)

where D is a nonnegative matrix,1 but D is not necessarily proportional to Iand the solution z may not have all its derivative in L2; for example, if thereis an index j ∈ {1, . . . , N} such that (Ci)j = 0 for i = 1, . . . ,m, one obtainsno information on ∂z

∂xj.

The preceding examples, and the exposition of the Perron–Frobenius the-ory are useful for various reasons. One reason is to think about the origin ofdiffusion in space, not from resulting from a random walk with jumps in po-sition, which is not a physically realistic scenario (although it is one of manydifferent mathematical approaches), but from jumps in velocity, after one letsa characteristic velocity tend to ∞ (and in some examples this velocity is thevelocity of light c). However, the models used have the defect of postulatingsome scattering effects with precise probabilities of transition, independent ofthe state of the system, so that the equation obtained is linear. In the fol-lowing lecture, I shall start describing a different type of interaction, whichcreates semi-linear equations with quadratic nonlinearities, because it modelsinteraction of particles of one type with particles of the same or another type,while a linear scattering term supposes that the particles interact with a fixedbackground. Another reason is to observe that the repetition of a game whereprobabilities of transitions appear tends to create a special pattern when timetends to ∞, with only one parameter at one’s disposal, and it has some simi-larity with the rules of thermodynamics, where equilibria are indexed by onlyone parameter, the temperature; however, there is an important hypothesisfor arriving at that conclusion, which is a notion of irreducibility, and in ther-modynamics it corresponds to the necessity of having all the parts of a bodyinteracting together in order to end up with a unique temperature.

Of course, I have not addressed the question of the validity of the proba-bility assumptions yet, and I have only shown games with some probabilitiesbuilt in and then I have deduced something (and even if what I have de-duced is observed, it is in no way a proof that there are inherent probabilities,1 D is symmetric when M is symmetric, which is the case if one assumes that

the probability of transition from state i to state j is equal to the probability oftransition from state j to state i, for all i, j.

Page 131: From hyperbolic systems to kinetic theory: a personalized quest

13 A General Finite-Dimensional Model with Characteristic Speed 1ε

109

of course); in the previous games, the probabilities were said to be related toscattering, and at some time one should then wonder a little more about whatscattering is.

For the linear problems already considered, nonnegative data give rise tononnegative solutions, and because of linearity there is an order-preservingproperty, but for the semi-linear problems nonnegative data will still give riseto nonnegative solutions but the order-preserving property will be lost; it doesexist for the Carleman model,2 as was first noticed by Ignace KOLODNER,3

but the Carleman model is not a model of kinetic theory, as there is no con-servation of momentum.

Starting in the following lecture, I shall switch from the linear modelsstudied up to now to semi-linear models, and that will create important dif-ferences in properties and some of the methods already used will lose theirefficiency.

For the linear cases, there were not many difficulties working with L1,with L∞ or with L2, but for semi-linear problems it will be natural to ob-tain bounds in L1, because of conservation of mass, but L∞ bounds will nolonger be obvious, either because they must be proven by different methods,or sometimes because they are not true. Actually, when the maximum prin-ciple does not hold, L1 and L∞ are not good functional spaces for solvingpartial differential equations, but if 1 < p < ∞ the Lp spaces can be usedfor the singular integrals which appear when one uses the Green functionsfor elliptic partial differential equations with constant coefficients, because ofCalderon–Zygmund theory,4,5 which extended the one-dimensional study of2 Tage Gillis Torsten CARLEMAN, Swedish mathematician, 1892–1949. He had

worked in Lund and in Stockholm, Sweden.3 Ignacs Izaak KOLODNER, Polish-born mathematician, 1920–1996. He had worked

in Albuquerque, NM, at Carnegie Tech (Carnegie Institute of Technology) andat CMU (Carnegie Mellon University), Pittsburgh, PA, where he was head of thedepartment of mathematics from 1964 to 1971, which included the period whereCarnegie Tech became CMU. I had first met him in 1974 at a meeting at BrownUniversity in Providence, RI, long before I came to CMU in 1987.

4 Alberto Pedro CALDERON, Argentine-born mathematician, 1920–1998. He re-ceived the Wolf Prize in 1989, for his groundbreaking work on singular integraloperators and their application to important problems in partial differential equa-tions, jointly with John W. MILNOR. He had worked at Buenos Aires, Argentina,at OSU (Ohio State University), Columbus, OH, at MIT (Massachusetts Insti-tute of Technology), Cambridge, MA, and at The University of Chicago, Chicago,IL. I first heard him talk at the Lions–Schwartz seminar in the late 1960s, andI met him in Buenos Aires when I visited Argentina for two months in 1973; hekept strong ties with Argentina, as can be witnessed from the large number ofmathematicians from Argentina having studied harmonic analysis, often workingnow in United States.

5 Antoni Szczepan ZYGMUND, Polish-born mathematician, 1900–1992. He hadworked in Warsaw, Poland and in Wilno (then in Poland, now Vilnius, Lithuania),and then at the University of Chicago, Chicago IL.

Page 132: From hyperbolic systems to kinetic theory: a personalized quest

110 13 A General Finite-Dimensional Model with Characteristic Speed 1ε

the Hilbert transform, done by M. RIESZ;6 in the case p = 1, one replacesL1 by the smaller Hardy space H1,7 and in the case p = ∞, one replacesL∞ by the larger space BMO (Bounded Mean Oscillation), introduced byFritz JOHN for a question in elasticity,8 then studied by Fritz JOHN and LouisNIRENBERG,9 which is useful for studying the limiting case of the Sobolevembedding theorem.10 In the late 1970s, I had thought that BMO(R) couldbe a good functional space for questions of kinetic theory, for a reason un-related to singular integrals, and I had mentioned something about that toYves MEYER,11 but I had found later a way to prove L∞(R) bounds for thecases that I was interested in.

It is natural for a density of particles to be nonnegative and in L1(RN )if a total mass is finite, and one may wonder about using spaces of Radonmeasures with finite total mass, because bounded nonnegative sequences inL1(RN ) may approach (in a weak � topology) any nonnegative Radon measurein Mb(RN ) (i.e. with finite total mass), but a particular use of entropy (boundson∫f log(f) dx) precludes concentration effects; for what concerns L∞(RN ),

I do not know any physical reason why if one starts with nonnegative datain L∞(RN ) the densities should stay in L∞(RN ), and one should considerthat L∞(RN ) and other spaces used for proving regularity of solutions ofpartial differential equations are chosen for reasons of personal taste ratherthan for reasons related to the (expected) physical content of an equation.The reason why I thought that spaces constructed like BMO(RN ) could beuseful for some problems in kinetic theory is that they are naturally definedby integrals. The precise definition of when a function u ∈ L1

loc(RN ) belongs

to BMO(RN ) is to take any cube Q ⊂ RN , to compute the average uQ of u

on Q, so that u− uQ is related to oscillations of u on Q, and then to considerthe average of |u − uQ| on Q, which is the mean oscillation on Q; the space6 Marcel RIESZ, Hungarian-born mathematician, 1886–1969 (the younger brother

of Frigyes (Frederic) RIESZ). He had worked in Stockholm and in Lund, Sweden.7 Godfrey Harold HARDY, English mathematician, 1877–1947. He had worked in

Cambridge and in Oxford, England, holding the Savilian chair of geometry in1920–1931, and in Cambridge again, holding the Sadleirian chair of pure mathe-matics in 1931–1942.

8 Fritz JOHN, German-born mathematician, 1910–1994. He had worked in Lexing-ton, KY, and at NYU (New York University), New York, NY.

9 Louis NIRENBERG, Canadian-born mathematician, born in 1925. He received theCrafoord Prize in 1982. He works at NYU (New York University), New York, NY.

10 If one observes that one has ||u − uQ||L1(Q) ≤ Cp(Q)||grad(u)||Lp(Q) for u ∈W 1,p(Q) and 1 ≤ p ≤ ∞, then for reasons of homogeneity the case p = N givesCN(Q) = M |Q|; however, the same bound in M |Q| is true if the derivatives ofu belong to the Marcinkiewicz space LN,∞, a particular space in the family ofLorentz spaces; this is the case for log(|x|), which then belongs to BMO(RN ).

11 Yves Francois MEYER, French mathematician, born in 1939. He worked at Uni-versite Paris Sud, Orsay (where he was my colleague from 1975 to 1979), atEcole Polytechnique, Palaiseau, at Universite Paris IX-Dauphine, Paris, and atENS-Cachan (Ecole Normale Superieure de Cachan), Cachan, France.

Page 133: From hyperbolic systems to kinetic theory: a personalized quest

13 A General Finite-Dimensional Model with Characteristic Speed 1ε

111

BMO(RN ) is precisely the space of functions for which this mean oscillationis bounded by a number M independent of which cube Q one has considered,hence the choice of the name BMO; the smallest M is a semi-norm for u, andit does not change by adding a constant function to u. One could considerthat if a density of particles u is constant it corresponds to an equilibriumand then u− uQ could be like a mass out of equilibrium, and describing howmuch mass is out of equilibrium might be useful, although it may not bethe precise way that it enters the definition of BMO(RN ) that should beimportant. Of course, functions in BMO(RN ) are not necessarily bounded,because log |x| ∈ BMO(RN ), but Fritz JOHN and Louis NIRENBERG haveshown that for u ∈ BMO(RN ) there exists ε > 0 such that eε |u| ∈ L1

loc(RN ),

with ε depending only upon the semi-norm of u in BMO(RN ).

[Taught on Wednesday October 3, 2001.]

Notes on names cited in footnotes for Chapter 13, MILNOR,12 F. RIESZ,13

SAVILE,14 SADLEIR,15 CRAFOORD,16 MARCINKIEWICZ,17 G.G. LORENTZ,18

and for the preceding footnotes, WAYNE.19

12 John Willard MILNOR, American mathematician, born in 1931. He received theWolf Prize in 1989, for ingenious and highly original discoveries in geometry,which have opened important new vistas in topology from the algebraic, combina-torial, and differentiable viewpoint, jointly with Alberto CALDERON. He workedat Princeton University, Princeton, NJ, and at SUNY (State University of NewYork) at Stony Brook, NY.

13 Frigyes (Frederic) RIESZ, Hungarian mathematician, 1880–1956. He had workedin Kolozsvar (then in Hungary, now Cluj-Napoca, Romania), in Szeged and inBudapest, Hungary. He introduced the spaces Lp in honour of LEBESGUE andthe spaces Hp in honour of HARDY, but no spaces are named after him, and theRiesz operators have been introduced by his younger brother Marcel RIESZ.

14 Sir Henry SAVILE, English mathematician, 1549–1622. In 1619, he establishedprofessorships of geometry and astronomy at Oxford, England.

15 In 1701, Lady SADLEIR established a professorship of pure mathematics in Cam-bridge, England.

16 Holger CRAFOORD, Swedish industrialist and philanthropist, 1908–1982. He in-vented the artificial kidney, and he and his wife Anna-Greta CRAFOORD, 1914–1994, established the Crafoord Prize in 1980 by a donation to the royal Swedishacademy of sciences, to reward and promote basic research in scientific disci-plines that fall outside the categories of the Nobel Prize (which have includedmathematics, geoscience, bioscience, astronomy, and polyarthritis).

17 Jozef MARCINKIEWICZ, Polish mathematician, 1910–1940. He had worked inWilno (then in Poland, now Vilnius, Lithuania). He died during World War II,presumably executed by the Soviets with thousands of other Polish officers.

(footnotes 18–19 on next page)

Page 134: From hyperbolic systems to kinetic theory: a personalized quest

112 13 A General Finite-Dimensional Model with Characteristic Speed 1ε

18 George Gunther LORENTZ, Russian-born mathematician, born in 1910. Heworked in Toronto, Ontario, at Wayne State University, Detroit, MI, in Syra-cuse, NY, and in Austin, TX.

19 Anthony WAYNE, American general, 1745–1796. Wayne State University, Detroit,MI, is named after him.

Page 135: From hyperbolic systems to kinetic theory: a personalized quest

14

Discrete Velocity Models

In the classical description, a gas is made of atoms and molecules, but whenMAXWELL and BOLTZMANN developed the basic ideas for the kinetic theoryof gases, they imagined a gas made of particles with no internal structure.In celestial mechanics, all planets are assumed to have spherical symmetryso that the gravitational field created outside is the same as that of a pointmass at its centre, and the gravitational forces on another planet produceonly a resulting attraction of its centre and no torque, so that there is nochange in the angular momentum of the planets, and one neglects them. Itwould be different for planets with a magnetic field, because electromagnetismwould have to be taken into account, and in a close encounter planets couldexchange angular momentum through electromagnetic interaction. Actually,ALFVEN observed in the 1970s that some of what is observed in the cosmosshould be explained by electromagnetic effects, but those who adhere to thedogma of gravitation cannot learn about electromagnetism, and they preferto invent dark matter, dark energy, dark fields, and so on, in order to avoidquestioning their dogma.

The same problem occurs concerning the 19th century ideas in kinetictheory, that there has been enough evidence to show that they are wrong, butmost people want to stick to them. The ideas of POINCARE about relativityhave pointed out that there are no instantaneous forces at a distance, and hisreason was that one cannot define instantaneity and that interaction betweenparticles must be transmitted by a field, at the velocity of light, but a morecompelling reason has come out from quantum mechanics, despite its dogmaticerrors, that at a microscopic level there are only waves and no particles, so thatthe classical ideas about near collisions involving only two particles feeling aforce at a distance should be rejected as a naive 19th century point of view.

The classical idea is that one expects particles to collide with other par-ticles and the number of such collisions between a particle of type 1 and aparticle of type 2 is expected to be proportional to the product of the densityof particles of type 1 and the density of particles of type 2, but because this

Page 136: From hyperbolic systems to kinetic theory: a personalized quest

114 14 Discrete Velocity Models

does not seem to explain what is observed, some people have tried to add acorrection involving three types of particles, despite the fact that, from a clas-sical point of view, triple collisions are expected to be extremely rare events.The only way out is to accept the fact that the classical language that one hasbeen using since the end of the 19th century is too limited to explain whatis really going on, and that one cannot avoid treating particles as the wavesthey really are.

Before studying the collision operator imagined by MAXWELL and byBOLTZMANN, which appears in what one calls the Boltzmann equation, I wantto discuss simpler models, where velocities can only take a finite numbers ofvalues, the discrete velocity models. In the early 1970s, Renee GATIGNOL of-fered me a copy of her book [16],1 where she attributes the idea to MAXWELL,and although velocities should belong to R

3, I shall start by considering aproblem in R

N . In this approach, all particles are equal with the same mass.2

Conservation of mass in a collision just means that two particles come inand two particles come out, and one must concentrate then on conservationof momentum and conservation of energy, and that means that a collisionbetween two particles with velocities Vi and Vj may result in two particleshaving velocities Vk and V if

Vi + Vj = Vk + V|Vi|2 + |Vj |2 = |Vk|2 + |V|2, (14.1)

and particles with velocities Vk and V may result in two particles having ve-locities Vi and Vj after a collision, of course. Notice that no conservation ofangular momentum is mentioned, because one assumes that no angular mo-mentum is carried away by the particles (unlike for billiard balls), and thatthe energy has only a translational kinetic part; apart from a possible rota-tional kinetic energy, molecules also show an energy related to the variationsin distances between the atoms forming the molecule.

Again, one should emphasize that the preceding discussion supposed thatthe particles react as rigid bodies do, i.e. according to the rules of classical me-chanics, but one immediately abandons the framework of classical mechanicsby introducing probabilities for choosing the result of a collision, and if twoparticles with velocities Vi and Vj collide, one assumes that there are proba-bilities of transforming into the various possible pairs, counting the possibility1 Renee Yvonne FLANDRIN-GATIGNOL, French mathematician. She works at Uni-

versite Paris VI (Pierre et Marie Curie), Paris, France.2 Dry air is composed mostly of molecules of nitrogen N2, molecules of oxygen O2,

and atoms of argon Ar with proportions 75.5%, 23.2%, 1.3% in mass, or 78.08%,20.94%, 0.93% in volume. In the usual circumstances there is a variable amountof carbon dioxide CO2, but air is not dry and it contains variable amounts ofwater H2O as vapour, and this humidity plays an important role in the weatherconditions. This shows that the hypothesis of identical particles is not alwaysrealistic, and one should take it as a first step, for example for describing a gaslike argon, whose atoms are spherical and show no chemical activity.

Page 137: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 115

of emerging from the collision with the same velocities as before entering it,as if there had not been a collision; one denotes by Pi,j;k, the probability thata collision with velocities Vi and Vj creates particles having velocities Vk andV, and as particles cannot be discerned,3 one asks for a symmetry in i andj, and also a symmetry in k and �; the natural conditions on the coefficientsPi,j;k, are then

Pi,j;k, ≥ 0, Pj,i;k, = Pi,j;,k = Pi,j;k, for all pairs i, j; k, lPi,j;k, = 0 if Vk + V �= Vi + Vj or |Vk|2 + |V|2 �= |Vi|2 + |Vj |2

for all pairs i, j; k, l∑k,l Pi,j;k, = 1 for all pairs i, j.

(14.2)

In general one has also Pk,;i,j = Pi,j;k, for all pairs i, j and k, �. The result ofthis analysis is that if ui(x, t) denotes the (nonnegative) density of particleswith velocity Vi, for i = 1, . . . ,m, then one has

(ui)t +(Vi.grad(ui)

)+

m∑

k,=1

Ai,k,uku = 0 for i = 1, . . . ,m, (14.3)

where Ai,k, = Ai,,k for all i, k, � = 1, . . . ,m, and the coefficients Ai,k, arerelated to the probabilities Pa,b;c,d in the following way. For each of the pairsk, � and a, b, one puts a term K Pk,;a,buku in the equation for uk, a termK Pk,;a,buku in the equation for u, a term −K Pk,;a,buku in the equationfor ua, and a term −K Pk,;a,buku in the equation for ub; this expresses thefact that a collision takes away a particle with velocity Vk and a particle withvelocity V and adds a particle with velocity Va and a particle with velocityVb, and that this happens with the proportion Pk,;a,b; as a simplification,4

the formula for Ai,k, is then

Ai,k, = Km∑

a,b=1

Pk,;a,b(δi,k+δi,−δi,a−δi,b) for all i, k, � = 1, . . . ,m, (14.4)

3 If one looks at small waves on the surface of the sea, one sometimes can followone and see it interact with other waves but it is often impossible to followwhere a particular wave goes during interaction; physicists use an hypothesisof indiscernability of particles, for a similar reason, that there are actually noparticles.

4 Instead of K Pk, ;a,b one should write Kk, Pk, ;a,b, as this concerns only whathappens in the collisions of particles with velocity Vk against particles with ve-locity V . The results concerning the signs of the coefficients Ai,k, and the re-lations expressing conservation of mass, conservation of momentum and conser-vation of energy are unchanged; it is only when deriving the entropy inequalitythat one uses then a supplementary information, which in that general case isKk, Pk, ;a,b = Ka,bPa,b;k, .

Page 138: From hyperbolic systems to kinetic theory: a personalized quest

116 14 Discrete Velocity Models

where K is often written as 1ε and ε is interpreted as a mean free path between

collisions,5 and I shall discuss later the question of letting ε tend to 0. Onethen deduces some useful properties of the coefficients Ai,k,

Ai,k, ≤ 0 if i �= k and i �= � for all i, k, � = 1, . . . ,m, (14.5)

which is related to having nonnegative solutions for nonnegative data,m∑

i=1

Ai,k, = 0 for all k, � = 1, . . . ,m, (14.6)

which is conservation of mass, from which one deduces

Ai,k, ≥ 0 if i = k or i = � for all i, k, � = 1, . . . ,m, (14.7)

and the fact that Pk,;a,b = 0 unless Vk + V = Va + Vb and |Vk|2 + |V|2 =|Va|2 + |Vb|2 implies that when Pk,;a,b �= 0 one has

∑mi=1 Vi(δi,k + δi, −

δi,a − δi,b) = Vk + V − Va − Vb = 0 and∑m

i=1 |Vi|2(δi,k + δi, − δi,a − δi,b) =|Vk|2 + |V|2 − |Va|2 − |Vb|2 = 0, so that

∑mi=1Ai,k,Vi = 0 for all k, � = 1, . . . ,m∑mi=1Ai,k,|Vi|2 = 0 for all k, � = 1, . . . ,m, (14.8)

which express conservation of momentum and conservation of energy.Another important property, related to the H-theorem of BOLTZMANN, is

that when ui > 0 one has∑mi=1 log(ui)(δi,k + δi, − δi,a − δi,b) = log(uk) +

log(u) − log(ua) − log(ub) = log(uku) − log(uaub), and if one assumes nowthat one has

Pk,;i,j = Pi,j;k, for all pairs i, j and k, �, (14.9)

one deduces∑m

i,k,=1Ai,k,uku log(ui)=K∑mk,,a,b=1 Pk,;a,b

(log(uku)−log(uaub)

)uku

= K2

∑mk,,a,b=1 Pk,;a,b

(log(uku) − log(uaub)

)(uku − uaub) ≥ 0,

(14.10)and the inequality stays valid if one allows some ui to vanish. To the conser-vation of mass, conservation of momentum and conservation of energy, whichcan be written as

∂∂t

(∑mi=1 ui

)+∑N

j=1∂∂xj

(∑mi=1(Vi)jui

)= 0

∂∂t

(∑mi=1 uiVi

)+∑Nj=1

∂∂xj

(∑mi=1(Vi)juiVi

)= 0

∂∂t

(∑mi=1 ui|Vi|2

)+∑N

j=1∂∂xj

(∑mi=1(Vi)jui|Vi|2

)= 0,

(14.11)

5 As∫

RN ui(x, t) dx is a mass, one sees that ui has units mass length−N ; each

velocity Vi has units length time−1 so (ui)t and(Vi.grad(ui)

)has units mass

length−N time−1, and each term K uku having those units, one sees that Khas units mass−1 lengthN time−2, so that an interpretation of the inverse of Kas a length does not seem so good, and there are other quantities involved, likescattering cross-sections.

Page 139: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 117

one then adds the inequality

∂t

( m∑

i=1

ui log(ui))

+N∑

j=1

∂xj

( m∑

i=1

(Vi)jui log(ui))≤ 0, (14.12)

expressing the decay of entropy6

I(t) =∫

RN

( m∑

i=1

ui log(ui))dx. (14.13)

If dIdt = 0, then one must have

(log(uku) − log(uaub)

)(uku − uaub) = 0

whenever Pk,;a,b �= 0, i.e. uku = uaub; if Pk,;i,j = Pi,j;k, for all pairs i, jand k, �, one deduces that the nonlinear terms vanish.

I shall describe in more detail later some simplified versions of a two-dimensional model introduced by MAXWELL, where there are four possiblevelocities, so I shall call it the four velocities model, with V1 = (1, 0), V2 =(−1, 0), V3 = (0, 1) and V4 = (0,−1), but it is also known as a Broadwellmodel,7 and because all the velocities have the same norm, kinetic energy isautomatically conserved; this type of model is not so interesting for modellinga gas, because there is no possible temperature (as temperature is related tovariations of |v|2, as will be seen later), and the equations are

(u1)t + (u1)x + N = 0(u2)t − (u2)x + N = 0(u3)t + (u3)y −N = 0(u4)t − (u4)y −N = 0,

(14.14)

where the nonlinear term N is usually taken to be

N = K (u1u2 − u3u4), (14.15)

expressing equal transition probabilities for pairs 1, 2 and 3, 4 to transforminto each other in a collision (as these are the only different pairs correspondingto the same total momentum, equal to 0), and I shall take K = 1 in mostof the discussions, which corresponds to looking at the equations for K uj, orthe equations for uj(K x,K y,K t).

6 Mathematicians use a different sign convention than physicists, who have theentropy increasing; an interpretation is that entropy represents disorder createdby irreversible processes, and this must have been what CLAUSIUS had in mindin inventing the concept (or at least in expressing it in clearer terms, becausethere seems to have been some controversy about who had the original idea atthe time, and nationalistic questions may have obscured the facts); BOLTZMANN

then proposed that entropy is −∫

R3×R3 f(x,v, t) log(f(x,v, t)

)dx dv.

7 James E. BROADWELL, American engineer. He worked at Caltech (CaliforniaInstitute of Technology), Pasadena, CA.

Page 140: From hyperbolic systems to kinetic theory: a personalized quest

118 14 Discrete Velocity Models

For this model, as for the general model with arbitrary coefficients Ai,j,k(i.e. without imposing sign conditions or conservation properties), one has alocal existence theorem for data in L∞, and existence can be asserted at leastfor a time of the order of the inverse of the L∞ norm of the initial data.This type of result is obtained by standard techniques of ordinary differen-tial equations, and one can prove local existence and uniqueness of solutionsfor perturbations of a semi-group by a locally Lipschitz nonlinearity.8 For ageneral first-order system of the form

(ui)t +(Vi.grad(ui)

)= Fi(u1, . . . , um) for x ∈ R

N , t > 0,ui |t=0= vi for x ∈ R

N , i = 1, . . . ,m, (14.16)

with v1, . . . , vm ∈ L∞(RN ), one assumes that the nonlinearities satisfy thebounds

|z1|, . . . , |zm| ≤ r imply maxi |Fi(z1, . . . , zm)| ≤M(r)|z1|, . . . , |zm|, |ξ1|, . . . , |ξm| ≤ r imply

maxi |Fi(z1, . . . , zm) − Fi(ξ1, . . . , ξm)| ≤ K(r)maxj |zj − ξj |.(14.17)

Lemma 14.1. If ρ0 = maxi ||vi||L∞(RN ) and the solution of dρdt = M(ρ) is

finite on [0, T ], i.e. T <∫∞ρ0

dρM(ρ) , there is a unique solution in R

N × (0, T ),satisfying |ui(x, t)| ≤ ρ(t) a.e. (x, t) ∈ R

N × (0, T ), for i = 1, . . . ,m. Oneapproaches the solution by the iterative method

(u(n+1)i )t +

(Vi.grad(u

(n+1)i )

)= Fi(u

(n)1 , . . . , u

(n)m ), (x, t) ∈ R

N × (0, T );u

(n+1)i |t=0= vi, x ∈ R

N , i = 1, . . . ,m,(14.18)

Proof : If the initialization functions u(0)1 , . . . , u

(0)m are bounded (measurable)

on RN×(0, T ) with |u(0)

i (x, t)| ≤ R0, for (x, t) ∈ RN×(0, T ) and i = 1, . . . ,m,

then one has |u(n)i (x, t)| ≤ Rn(t), for (x, t) ∈ R

N × (0, T ) and i = 1, . . . ,m,where Rn(t) = r0+

∫ t0M(Rn−1(s)

)ds for n ≥ 1. Assume that Rn(t) ≤ R∞ for

all n and t ∈ (0, T ), and let K∞ = K(R∞); then if εn(t) = maxi ||u(n)i (·, t) −

u(n−1)i (·, t)||L∞(RN ) for t ∈ (0, T ) and n ≥ 1, one has Fi(u

(n)1 , . . . , u

(n)m )(·, t) −

Fi(u(n−1)1 , . . . , u

(n−1)m )(·, t)||L∞(RN ) ≤ K∞εn(t) for t ∈ (0, T ), i = 1, . . . ,m,

and n ≥ 1, and therefore εn+1(t) ≤ K∞ ∫ t0εn(s) ds, so that one deduces by

8 Some authors seem to have been lost in technical details about mild solutions,because the semi-group of translation S(t) defined by

(S(t)w

)(x) = w(x − a t)

is not a strongly continous semi-group if a �= 0, i.e. S(t)w may not convergeto w in L∞ norm as t tends to 0, but S(t)w ⇀ w in L∞ weak � anyway, andone just has to observe that the natural solution in the sense of distributionsof ut + a.ux = f in R

N × (0,∞) with u |t=0= v is indeed given by u(·, t) =

S(t)v(·) +∫ t

0S(t− s)f(·, s) ds.

Page 141: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 119

induction that εn+1(t) ≤ 1n!(K

∞t)n sups∈(0,t) ε1(s) for t ∈ (0, T ), showing theuniform convergence of un to a limit, which is then the desired solution.

If Rn−1 = ρ on (0, T ), then Rn = ρ on (0, T ), so that one way to create abounded sequence and to prove existence is to choose the initial guess u(0) suchthat |u(0)

i (x, t)| ≤ ρ(t), for (x, t) ∈ RN × (0, T ) and i = 1, . . . ,m, for example

by taking all u0i equal to 0. However, it is important to prove uniqueness

without imposing too precise bounds, so if a solution is bounded by R0, onechooses 0 < S <

∫∞R0

dzM(z) , and the argument shows that it coincides with

the obtained solution for 0 ≤ t ≤ min{T, S}, and if T > S one starts theargument again with initial time S and the solution must then coincide withthe obtained solution for 0 ≤ t ≤ min{T, 2S}, etc. ��

The preceding argument with M(r) = C r2 gives T < Cr0

, and this resultis valid without any sign condition on the coefficients Ai,j,k or on the initialdata.

The uniqueness property shows that if the initial data are periodic in adirection, then the solution is periodic in that direction, i.e. if there existsh ∈ R

N such that vi(x + h) = vi(x) a.e. x ∈ RN for i = 1, . . . ,m, then one

has ui(x + h, t) = ui(x, t) a.e. (x, t) ∈ RN × (0, T ) for i = 1, . . . ,m, where

T is chosen according to Lemma 14.1. Indeed, in all cases, if one defines uiby ui(x, t) = ui(x + h, t) for i = 1, . . . ,m, then it satisfies the equation forinitial data vi defined by vi(x) = vi(x+h) for i = 1, . . . ,m; if then vi = vi fori = 1, . . . ,m, the uniqueness property implies ui = ui for i = 1, . . . ,m. As aconsequence, if the initial data are independent of one direction, the solutionis independent of that direction; for example, if in the four velocities modelthe initial data are independent of y, then the solution is independent of y,and one finds the one-dimensional four velocities model

(u1)t+(ui)x+N = (u2)t− (u2)x+N = (u3)t−N = (u4)t−N = 0, (14.19)

with N = K(u1u2 − u3u4) and initial data depending only upon x (andbelonging to L∞(R)); the presence of the u3u4 term in N may look strangefrom a physical point of view, because one does not expect particles with thesame velocity to interact, but one should remember that 0 is not the velocityof the particles of the third and fourth families, but the projection of theirvelocity on the x axis; a new symmetry arises in this model, which was nottrue for the initial (two-dimensional) four velocities model, that the equationbecomes invariant by exchanging u3 and u4,9 and in the case where u3 = u4

9 It means that if one defines u1 = u1, u2 = u2, u3 = u4, u4 = u3 then one obtainsthe solution for initial data v1 = v1, v2 = v2, v3 = v4, v4 = v3; one deduces that ifv3 = v4 in R

2, then u3 = u4 in R2×(0, T ). A different symmetry exists for the two-

dimensional four velocities model, where one exchanges u3 and u4 but one alsochanges y into −y (and one can also exchange u1 and u2 and change x into −x),i.e. one defines u1(x, y, t) = u1(x,−y, t), u2(x, y, t) = u2(x,−y, t), u3(x, y, t) =u4(x,−y, t), u4(x, y, t) = u3(x,−y, t) then one obtains the solution for initial

Page 142: From hyperbolic systems to kinetic theory: a personalized quest

120 14 Discrete Velocity Models

one obtains the Broadwell model,10 where I have used u = u1, v = u2 andw = u3 = u4,

ut + ux + u v − w2 = 0vt − vx + u v − w2 = 0wt − u v + w2 = 0,

(14.20)

where the density of mass is u + v + 2w, the density of momentum in xis u − v (and 0 for the density of momentum in y as u3 = u4 = w), thedensity of kinetic energy is proportional to mass, and the density of entropyis u log(u) + v log(v) + 2w log(w) for the case of nonnegative data (as thesolution is nonnegative for t > 0).

The uniqueness property can be rendered more powerful by making thestatements local instead of global, and for this one should notice an importantfinite speed of propagation effect.

Lemma 14.2. If initial data belong to L∞(RN ), then for t > 0 the solution at(x, t) only depends upon the initial data at points y ∈ {x}−t conv{V1, . . . , Vm},where conv A is the convex hull of A, i.e. of the form y = x − t

∑i θiVi for

some θi ≥ 0 for i = 1, . . . ,m, with∑i θi = 1.

Proof : One initializes the iterative method with u(0)i = 0 for i = 1, . . . ,m, and

one notices by induction that each u(n)i (x, t) only depends upon the initial

data on {x} − t conv{V1, . . . , Vm}. This follows from the formula u(n)i (x, t) =

vi(x− t Vi)+∫ t

0 Fi(u

(n−1)1 (x− s Vi, t− s), . . . , u(n−1)

m (x− s Vi, t− s))dx, using

the fact that x − t Vi ∈ {x} − t conv{V1, . . . , Vm} and that {x − s Vi} − (t −s) conv{V1, . . . , Vm} ⊂ {x} − t conv{V1, . . . , Vm}. ��

This result permits us to compare solutions which are not necessar-ily defined on a strip R

N × (0, T ) but on a set A ⊂ RN × (0,∞) with

A such that (x, t) ∈ A implies (y, t − s) ∈ A for all s ∈ (0, t) and ally ∈ {x} − s conv{V1, . . . , Vm}.

It is useful then to develop criteria which are necessary, or sufficient, forthe solution to be nonnegative when the initial data are nonnegative, as thiscorresponds to the physical property that a density of particles should benonnegative.

data v1(x, y) = v1(x,−y), v2(x, y) = v2(x,−y), v3(x, y) = v4(x,−y), v4(x, y) =v3(x,−y).

10 One may start from a three-dimensional six velocities model, where one addsvelocities V5 = (0, 0,+1) and V6 = (0, 0,−1), and the nonlinearity in the firstand second families is 2u1u2 − u3u4 − u5u6 for example; if one starts with dataindependent of y, z, then the solution is independent of y, z and if one imposesalso v3 = v4 = v5 = v6, then one has u3 = u4 = u5 = u6 for t > 0, and one findsthe model ut +ux + 2u v−2w2 = vt − vx + 2u v−2w2 = wt −u v+w2 = 0, wherethe density of mass is u + v + 4w, etc.

Page 143: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 121

Lemma 14.3. If for i = 1, . . . ,m, the function Fi has (also) the propertythat zj ≥ 0 for j = 1, . . . ,m, and zi = 0 imply Fi(z1, . . . , zm) ≥ 0, then fornonnegative initial data the solution is nonnegative for t ≥ 0 (as long as itexists).

Proof : One assumes that 0 ≤ vi(x) ≤ r0 a.e. x ∈ RN for i = 1, . . . ,m, and

one chooses T <∫∞r0

drM(r) , so that the solution of dρ

dt = M(ρ) with ρ(0) = r0

is well defined on [0, T ]; one chooses λ ≥ K(ρ(T )

), and one uses a different

iterative technique, where one starts from u(0)i = 0 for i = 1, . . . ,m, but for

n ≥ 1 one defines u(n)i by

(u(n)i )t +

(Vi.grad(u

(n)i ))

+ λu(n)i = λu

(n−1)i + Fi

(u

(n−1)1 , . . . , u

(n−1)m

),

ui |t=0= vi, for i = 1, . . . ,m.(14.21)

If 0 ≤ u(n−1)j (x, t) ≤ ρ(t) a.e. x ∈ R

N , t ∈ (0, T ) for j = 1, . . . ,m, then

one has 0 ≤ λu(n−1)i + Fi

(u

(n−1)1 , . . . , u

(n−1)m

) ≤ λρ(t) + M(ρ(t))

becauseof the choice of λ and the definition of M ; this implies 0 ≤ u

(n)i ≤ r(t),

where r is the solution of drdt + λ r = λρ +M(ρ) with r(0) = r0, which gives

r(t) = ρ(t) for t ∈ (0, T ). Having uniform bounds for all u(n)i , one estimates

the differences u(n)i − u

(n−1)i in L∞ norm as was done before, and u(n) then

converges uniformly to a fixed point, which is the solution. ��The conditions imposed on the functions Fi for i = 1, . . . ,m, are then

sufficient to obtain nonnegative solutions for t > 0 when the initial dataare nonnegative, but they are actually also necessary conditions, and a moregeneral result is proven first in the case of ordinary differential equations.

Definition 14.4. A closed set C ⊂ Rm is forward invariant for the differ-

ential equation dzdt = F (z) (with F locally Lipschitz), if z(0) ∈ C implies

z(t) ∈ C for t ≥ 0 as long as the solution exists.

Lemma 14.5. If F is a locally Lipschitz mapping and C is a closed set ofRm, then it is forward invariant for the differential equation dz

dt = F (z) if andonly if C satisfies the condition

dist(c+ ε F (c);C) = o(ε) for ε > 0 small, for all c ∈ C(or equivalently, for all c ∈ ∂ C). (14.22)

Proof : If z(0) = c0 ∈ C, then one has z(ε) = c0+ε F (c0)+o(ε); if C is forwardinvariant one has z(ε) ∈ C and so the distance from c0 + ε F (c0) to C is lessthan or equal to the distance from c0 + ε F (c0) to z(ε), which is o(ε).

Conversely, assume that C satisfies the condition; to simplify the argu-ment assume that F is globally Lipschitz continuous (with constant K);then one has |z1(t) − z2(t)| ≤ eK t|z1(0) − z2(0)| for t > 0 for any two so-lutions of the differential equation. Let z(t) be any solution of the differ-ential equation, and for some time t0 choose a projection ξ0 of z(t0) onto

Page 144: From hyperbolic systems to kinetic theory: a personalized quest

122 14 Discrete Velocity Models

C, and let ξ be the solution of the differential equation with ξ(t0) = ξ0;then for ε > 0 one has |z(t0 + ε) − ξ(t0 + ε)| ≤ eK ε|z(t0) − ξ0|, andas ξ(t0 + ε) = ξ0 + ε F (ξ0) + o(ε) one has dist(ξ(t0 + ε);C) = o(ε) andtherefore dist(z(t0 + ε);C) ≤ |z(t0 + ε) − ξ(t0 + ε)| + dist(ξ(t0 + ε);C) ≤eK εdist(z(t0);C) + o(ε) = dist(z(t0);C) + K εdist(z(t0);C) + o(ε), show-ing that d[dist(z;C)]

dt |t=t0≤ K dist(z(t0);C) (where the derivative is a rightderivative), and as this holds for all t0 one deduces that dist(z(s);C) ≤eK sdist(z(0);C) for all s ≥ 0, showing that z(0) ∈ C implies z(s) ∈ C for alls ≥ 0. ��

Definition 14.6. For a system (ui)t +(Vi.grad(ui)

)= Fi(u1, . . . , um), i =

1, . . . ,m, with Fi locally Lipschitz for i = 1, . . . ,m, a closed subset C of Rm

is forward invariant if when the initial data satisfy v(x) ∈ C a.e. x ∈ RN

then the solution satisfies u(x, t) ∈ C a.e. x ∈ RN , t ∈ (0, T ) (as long as the

solution exists).

Using initial data independent of x, u must solve the differential equationdudt = F (u), and C must then be forward invariant for the differential equation,so that it must satisfy the condition dist(c+ ε F (c);C) = o(ε) for ε > 0 smalland all c ∈ C. The presence of the transport part in the equation imposessupplementary conditions on C: taking as an example the Broadwell model,which has three distinct velocities, let (u1, v1, w1) and (u2, v2, w2) be twopoints in C, and consider the initial data

u0(x) ={u1 for x < 0u2 for x > 0

, v0(x) ={v1 for x < 0v2 for x > 0

, w0(x) ={w1 for x < 0w2 for x > 0

,

(14.23)then one has for small t > 0

u(x, t) ={u1 +O(t) for x < tu2 +O(t) for x > t

,

v(x, t) ={v1 +O(t) for x < −tv2 +O(t) for x > −t ,

w(x, t) ={w1 +O(t) for x < 0w2 +O(t) for x > 0 ,

(14.24)

and this shows that a forward invariant set C must be a product, because inthe region −t < x < 0 one has points of C of the form (u1, v2, w1)+O(t), andin the region 0 < x < t one has points of C of the form (u1, v2, w2)+O(t), andas C is closed one finds that (u1, v2, w1) ∈ C and (u1, v2, w2) ∈ C, so that Chas the form Cu ×Cv ×Cw; one extends easily this condition to an arbitrarysystem.

For the Broadwell model, one can deduce easily all the forward invariantsets: they are the points corresponding to constant solutions (Cu, Cv, Cw re-duced to one point, satisfying u v = w2), the degenerate solutions (Cv =

Page 145: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 123

Cw = {0}, corresponding to v = w = 0 and u satisfying ut + ux = 0)or (Cu = Cw = {0}, corresponding to u = w = 0 and v satisfyingvt−vx = 0), the nonnegative quadrant corresponding to nonnegative solutions(Cu = Cv = Cw = [0,∞)), and the whole space (Cu = Cv = Cw = R).

It is natural then to look for bounds depending upon t, and for initial datasatisfying 0 ≤ u0 ≤ a0, 0 ≤ v0 ≤ b0, 0 ≤ w0 ≤ c0 a.e. x ∈ R, one wants todeduce that one has 0 ≤ u(x, t) ≤ a(t), 0 ≤ v(x, t) ≤ b(t), 0 ≤ w(x, t) ≤ c(t)a.e. x ∈ R, t > 0, for the best possible bounds a, b, c, but apart from the trivialnecessary condition that da

dt (0) ≥ c20,dbdt (0) ≥ c20,

dcdt (0) ≥ a0b0, one does not

know how to derive other necessary conditions.11

One may want to impose a stronger condition, that the rectangular boxused, i.e. C(t) = [0, a(t)] × [0, b(t)] × [0, c(t)] is such that if the initial datatakes its values in C(s) then the solution at time t takes its values in C(s+ t).One can write necessary conditions for this condition to hold: because onemay use points where u0(x) = a(s), v0(x) = 0, w0(x) = c(s), one finds thatone must have da

dt (s) ≥ c2(s), and similarly one must have dbdt (s) ≥ c2(s),

and dcdt (s) ≥ a(s)b(s), but these differential inequalities do not have global

solutions if a0 or b0 or c0 is > 0.12

There are only special cases in kinetic theory where one can find boundedforward invariant regions, giving easily global L∞ bounds, and one such ex-ample, which I learnt from Henri CABANNES,13 consists in starting from the(two-dimensional) four velocities model, and uses data depending only uponx+y, so that the solutions are functions of x+y and t and the system obtainedis then

(u1)t + (u1)z + N = (u2)t − (u2)z + N = (u3)t + (u3)z −N= (u4)t − (u4)z −N = 0 with N = u1u2 − u3u4,

(14.25)

and one deduces that

(u1 + u3)t + (u1 + u3)z = (u2 + u4)t − (u2 + u4)z = 0, (14.26)

and therefore for any A,B > 0 one has the forward invariant region

0 ≤ u1, u2, u3, u4, u1 + u3 ≤ A, u2 + u4 ≤ B. (14.27)11 As will be shown in the next lecture, if a0, b0, c0 < ∞, then the solution ex-

ists globally, as I proved with Michael CRANDALL in 1975, extending an idea ofTakaaki NISHIDA and MIMURA. However, the best possible bounds in L∞ are notknown.

12 The system of equations dadt

= dbdt

= c2 and dcdt

= a b gives lower bounds and itmay be solved by quadratures, because a − b and a3 − 3a2b+ 2c3 are constants,but as soon as a, b, c are all positive a lower bound which blows up in finite timeis easily obtained, as ϕ = min{a, b, c} satisfies dϕ

dt≥ ϕ2, and the time of existence

is ≤ 1ϕ(0)

.13 Henri CABANNES, French mathematician, born in 1923. He worked at Universite

Paris VI (Pierre et Marie Curie), Paris, France.

Page 146: From hyperbolic systems to kinetic theory: a personalized quest

124 14 Discrete Velocity Models

One has a system with order-preserving property if ∂Fi

∂uj≥ 0 for i �= j,

and this does not occur for systems from kinetic theory, but it occurs for theCarleman model,14

ut + ux + u2 − v2 = 0vt − vx − u2 + v2 = 0, (14.28)

which has the bounded invariant regions

0 ≤ u, v ≤M, (14.29)

in which the solution is order preserving, as was noticed by Ignace KOLODNER

in the early 1960s. Not knowing about his work, I had rediscovered that prop-erty in the early 1970s, but I knew that such a property is not shared bymodels from kinetic theory, and I decided to work on obtaining L∞ boundsfor the Broadwell model, and during the year 1974–1975 that I spent at theUniversity of Wisconsin in Madison, I discussed that question with MichaelCRANDALL. There is an L1 contraction property for the Carleman model,which had been noticed by Thomas LIGGETT,15 but I had noticed that themodels for which an L1 contraction property was known (including also theBurgers equation and the porous medium equation) were also order preserv-ing, and I had shown Michael CRANDALL why it was necessary because ofa conservation of integral, and he had shown the converse, and I have givenour result at Lemma 4.1, but there was no objection for an L1 contractionproperty based on a distance on R

m different from the Euclidean distance.We found a distance adapted to the nonlinear terms, but it is not compatiblewith the transport terms; after three months we had not found much, whenwe were given an article from Takaaki NISHIDA and MIMURA which opened anew approach.16

Although I am discussing at length mathematical questions about discretevelocity models, one should be aware of the limitations of such models; ac-tually, when I first met Clifford TRUESDELL,17 he told me that these modelsare not a good replacement for the Boltzmann equation, because they are notinvariant by rotation, but at the time I could not see why that should be so

14 The model is found in an appendix of a book by CARLEMAN [1], but as the workof CARLEMAN was not finished when he died, the book was edited by LennartCARLESON and FROSTMAN, who completed some proofs; if the model was foundin CARLEMAN’s papers, they must have known that it is not a good model ofkinetic theory, as momentum is not conserved, but it has an entropy inequality.

15 Thomas Milton LIGGETT, American mathematician, born in 1944. He works atUCLA (University of California Los Angeles), Los Angeles, CA.

16 Masayasu MIMURA, Japanese mathematician. He works in Hiroshima, Japan.17 Clifford Ambrose TRUESDELL III, American mathematician, 1919–2000. He had

worked at Indiana University, Bloomington, IN, and at Johns Hopkins University,Baltimore, MD, where I first met him, in the spring of 1975.

Page 147: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 125

important. He may have thought of the problem posed by the formal limitε→ 0, i.e. the Hilbert expansion in the case of the Boltzmann equation, whichis formal (despite having been proposed by HILBERT, so it should have beencalled the Hilbert conjecture, but it may have been known to BOLTZMANN),18

and the first term is the Euler equation for an ideal gas, which is isotropic.Conversely, the formal expansion for (14.14)–(14.15) when K → +∞ consistsin postulating that

uj = Uj +K−1U1j +K−2U2

j + . . . for j = 1, 2, 3, 4, (14.30)

so that

one conjectures that uj ⇀ Uj weakly for j = 1, 2, 3, 4, (14.31)

with

(U1 + U2 + U3 + U4)t + (U1 − U2)x + (U3 − U4)y = 0(U1 − U2)t + (U1 + U2)x = 0(U3 − U4)t + (U3 + U4)y = 0U1U2 − U3U4 = 0,

(14.32)

and the first equation of (14.32) corresponds to conservation of mass

�t + (q1)x + (q2)y = 0, (14.33)

with density of mass �, density of linear momentum q = � V , and macroscopicvelocity V given by

� = U1 + U2 + U3 + U4, q1 = � V1 = U1 − U2, q2 = � V2 = U3 − U4, (14.34)

which imply� ≥ 0, |V1| + |V2| ≤ 1, almost everywhere. (14.35)

The last equation of (14.32) then serves in expressing each Uj in terms of�, q1, q2,

U1 = �4 + q21−q22

4� + q12

U2 = �4 + q21−q22

4� − q12

U3 = �4 + q22−q21

4� + q22

U4 = �4 + q22−q21

4� − q22 ,

(14.36)

18 I have been told that BOLTZMANN expected that some macroscopic properties ofa fluid, like viscosity and heat conductivity, would be very dependent upon whichlaw describes the forces at a distance between particles, so that he could deducewhat kind of forces exist at a microscopic level from macroscopic measurements.The formal expansion shattered his belief, as the first term is the Euler equationfor an ideal gas whatever the law about forces is, so that either the formal expan-sion is wrong, or the Boltzmann equation is not a good model for describing realgases! For different reasons, one knows now that the Boltzmann equation is nota good model for describing real gases, but no one knows if the conjecture aboutthe expansion, which is a question in mathematics, is true or not.

Page 148: From hyperbolic systems to kinetic theory: a personalized quest

126 14 Discrete Velocity Models

but the second and third equations of (14.32) then give

(q1)t +(q21−q22

2� + �2

)

x= 0

(q2)t +(q22−q21

2� + �2

)

y= 0,

(14.37)

which do not resemble the corresponding equations for the balance of linearmomentum given by Newton’s law,19 and in consequence (14.32) cannot be in-terpreted as describing the motion of a fluid, because the directions of the axesplay an important role so that there is no isotropy, and there is no Galileaninvariance, and I guess that this was a reason behind Clifford TRUESDELL’sremark.

In 1989, I thought of a way that could help avoid the angular cut-off whichhas been used in the Boltzmann equation since the work of Harold GRAD,20

and I mentioned it to Pierre-Louis LIONS.21 Back in 1983, I had understoodwhy the Boltzmann equation is not a good physical model, and I had told him,maybe in too cryptic terms, as I had said that there are only mathematicalproblems on the Boltzmann equation; he may not have understood that Imeant that it is a bad physical model, and I had added that they were thequestion of removing the angular cut-off and the question of letting ε tend to0. In 1989, I had been playing with a different problem, for which the factthat the Fourier transform of the uniform measure on the unit circle decaysat infinity (and it decays in 1/

√r) was quite useful, and I could observe that

memory is a curious phenomenon because I had the feeling that I had seensomething like that before, but I could not remember where, and after a whileI thought of checking an article of Charles FEFFERMAN,22 in the proceedingsof ICM (International Congress of Mathematicians) 1974 in Vancouver, andthere it was, about his work and the work of STEIN,23 on restrictions of Fouriertransform on spheres. Because of invariance by rotations, the computation of

19 The corresponding equations for Euler equation are (q1)t+( q2

1�

+p)

x+(

q1q2�

)y

= 0

and (q2)t +(

q1q2�

)x

+( q2

2�

+ p)

y= 0.

20 Harold GRAD, American physicist, 1923–1987. He had worked at NYU (New YorkUniversity), New York, NY.

21 Pierre-Louis LIONS, French mathematician, born in 1956. He received the Fieldsmedal in 1994. He worked at Universite Paris IX-Dauphine, Paris, France, andhe holds now a chair (equations aux derivees partielles et applications, 2002–) atCollege de France, Paris.

22 Charles Louis FEFFERMAN, American mathematician, born in 1949. He receivedthe Fields Medal in 1978. He worked at the University of Chicago, Chicago, IL,and he works now at Princeton University, Princeton, NJ.

23 Elias M. STEIN, Belgian-born mathematician, born in 1931. He received the WolfPrize in 1999, for his contributions to classical and “Euclidean” Fourier analy-sis and for his exceptional impact on a new generation of analysts through hiseloquent teaching and writing, jointly with Laszlo LOVASZ. He worked at theUniversity of Chicago, Chicago, IL, and at Princeton University, Princeton, NJ.

Page 149: From hyperbolic systems to kinetic theory: a personalized quest

14 Discrete Velocity Models 127

the bilinear term Q(f, f) for the Boltzmann equation (in R3) contains the

evaluation of integrals on circles, of bilinear quantities similar to convolutionproducts, and when I told Pierre-Louis LIONS about theorems of restrictionon circles I expected him to be interested in collaborating with me on thatquestion; he was obviously not interested, but later he seemed to rediscover aproperty of smoothing by convolution with uniform measures on spheres, andI assume that it is different from what I had in mind, which I never checkedin detail. I also thought that Clifford TRUESDELL’s remark could have beenabout this kind of effect, which cannot be seen in discrete velocity models.

The basic estimate that I thought useful for estimates on the collisionkernel is to show that f � g has a restriction on the circle S

1,24 for f, g insuitable Lorentz spaces Lp,q(R2); the uniform measure dθ on S

1 has Fdθ ∈L∞(R2) ∩ L4,∞(R2) because it decays in |ξ|−1/2, and for f, g smooth one has∫

S1 f � g dθ =∫

R2 Ff FgFdθ, so that (using estimates for the absolute valuesof f and g) one bounds the L1(S1) norm of (f � g)

∣∣S1 by bounding Ff Fg in

L4/3,1(R2), giving

(f � g)∣∣S1∈ L1(S1) if f ∈ L2(R2) and g ∈ L4/3,2(R2),

or f ∈ L4/3,2(R2) and g ∈ L2(R2)or f ∈ Lp,r(R2), g ∈ Lq,r

′(R2) with 4

3 < p, q < 2, 1p + 1

q = 54 , 1 ≤ r ≤ ∞

(14.38)and that uses the Lions–Peetre interpolation theory.25,26

[Taught on Friday October 5, 2001.]

Notes on names cited in footnotes for Chapter 14, CARLESON,27 FROSTMAN,28

LOVASZ,29 and for the preceding footnotes, THOMPSON,30 YALE,31 EOTVOS.32

24 The curvature of the circle plays a role, and f �g may not have traces on segments.25 Of course, it was Jacques-Louis LIONS who developed the theory of interpolation

spaces, and not his son Pierre-Louis.26 Jaak PEETRE, Estonian-born mathematician, born in 1935. He worked at Lund

University, Sweden.27 Lennart CARLESON, Swedish mathematician, born in 1928. He received the Wolf

Prize in 1992, for his fundamental contributions to Fourier analysis, complexanalysis, quasi-conformal mappings and dynamical systems, jointly with John G.THOMPSON. He worked in Uppsala and in Stockholm, Sweden.

28 Otto FROSTMAN, Swedish mathematician, 1907–1977. He had worked in Stock-holm, Sweden.

(footnotes 29–32 on next page)

Page 150: From hyperbolic systems to kinetic theory: a personalized quest

128 14 Discrete Velocity Models

29 Laszlo LOVASZ, Hungarian-born mathematician, born in 1948. He received theWolf Prize in 1999, for his outstanding contributions to combinatorics, theoreticalcomputer science and combinatorial optimization, jointly with Elias M. STEIN.He works at Yale University, New Haven, CT, and Eotvos University, Budapest,Hungary.

30 John Griggs THOMPSON, American-born mathematician, born in 1932. He re-ceived the Wolf Prize in 1992, for his profound contributions to all aspects offinite group theory and connections with other branches of mathematics, jointlywith Lennart CARLESON. He worked in Cambridge, England.

31 Elihu YALE, American-born English philanthropist, Governor of Fort St George,Madras, India, 1649–1721. Yale University, New Haven, CT, is named after him.

32 Baron Lorand EOTVOS, Hungarian physicist, 1848–1919. Eotvos University, Bu-dapest, Hungary, is named after him.

Page 151: From hyperbolic systems to kinetic theory: a personalized quest

15

The Mimura–Nishida and the Crandall–TartarExistence Theorems

A different idea was needed for discrete velocity models, like the Broadwellmodel, a simplified version of the Maxwell four velocities model, and it camefrom the strong Japanese school specialized in questions of kinetic theory andfluid dynamics, by a result of MIMURA and Takaaki NISHIDA, who mixed L1

and L∞ estimates in the following way.

Lemma 15.1. (Mimura–Nishida) For the Broadwell model, for every k > 1there exists ε(k) > 0 such that if the initial data satisfy

u0, v0, w0 ∈ L∞(R) ∩ L1(R)0 ≤ u0, v0, w0 ≤M0 a.e. in R∫

R(u0 + v0 + 2w0) dx ≤ ε(k),

(15.1)

then the solution exists for all t > 0 and satisfies

0 ≤ u(x, t), v(x, t), w(x, t) ≤ kM0 in R × (0,∞).�� (15.2)

I shall show their proof in a moment, but in explaining the general resultthat I derived after that with Michael CRANDALL, one does not need to knowthe method of proof of the result, and the argument extends to general systemsif one can prove a preliminary result of the type obtained by MIMURA andTakaaki NISHIDA (without needing the refinement that k can be taken as nearto 1 as one likes), and one deduces a global existence theorem for nonnegativebounded data by using the finite speed of propagation property and an entropyinequality.

Proposition 15.2. (Crandall–Tartar) Let a general system satisfy Ai,j,k ≤ 0if i �= j and i �= k, so that nonnegative data give rise to nonnegative solutions,and satisfy an entropy inequality, i.e. such that

∑i,j,k Ai,j,kwjwk log(wi) ≥ 0

for all w ∈ Rm such that wi ≥ 0 for all i, so that

∫RN

(∑i ui log(ui)

)dx is

nonincreasing for nonnegative data with compact support.

Page 152: From hyperbolic systems to kinetic theory: a personalized quest

130 15 The Mimura–Nishida and the Crandall–Tartar Existence Theorems

Assume that a Mimura–Nishida estimate holds, i.e. that there exists k0 ≥ 1and ε0 > 0 such that for initial data such that

0 ≤ vi ≤M0, i = 1, . . . ,m a.e. in RN

∫RN

(∑mi=1 vi

)dx ≤ ε0,

(15.3)

the solution exists globally for t > 0 and satisfies

0 ≤ ui ≤ k0M0, i = 1, . . . ,m a.e. in RN × (0,∞). (15.4)

Then, there exists a function F (t,M) (depending upon ε0, k0,m,N but noton the precise values of the coefficients Ai,j,k for example) such that for anybounded nonnegative data the solution exists for all t > 0, and satisfies

0 ≤ vi ≤M0, i = 1, . . . ,m a.e. in RN implies

0 ≤ ui(x, t) ≤ F (t,M0), i = 1, . . . ,m a.e. in RN × (0,∞).�� (15.5)

Before showing the proof, a few remarks are in order. In principle, theproposition applies to all dimensions N , but in practice one only knows howto use it in one dimension, and it seems unlikely (but not impossible) that thereare genuine cases of N -dimensional problems where it applies. The reason isscaling,1 as all the models with linear transport and quadratic nonlinearitiesare invariant if one changes u(x, t) into λu(λx, λ t), and such transformationsleave the LN(RN ) norm invariant, and this corresponds to mass in dimension1, but in dimension N ≥ 2, the total mass can be made arbitrarily small by asuitable scaling. If such a Mimura–Nishida estimate was valid, then the L∞

norm would just be multiplied by k0 for initial data with compact support,and because of the finite speed of propagation property it would also be truewithout imposing that the total mass be finite; that does happen in cases withforward invariant sets which are bounded, but there is no hint that this couldbe true in a general case, although it does not seem to contradict any knownresult; of course, if k0 could be taken arbitrarily near 1, then in dimension Nit would imply that the L∞ norm does not increase, and that is not realistic(and one should remember that the nonlinearities appearing in the Carlemanmodel are not consistent with principles of kinetic theory).

The critical part of the proof is to use an entropy inequality for deducingan equi-integrability property, and it is related to classical results in functional1 It is natural to change the units of length and time in the same way, so that

the characteristic velocities which enter into the equation do not change, but itmay look a little strange that the unknown u scales as the inverse of a length,which leaves the mass invariant in one dimension but not in dimension N ≥ 2; onereason is that I have neglected factors K in front of the nonlinearities, which havea dimension, and which incorporate some effective scattering cross-sections (sothat the particle “collides” with the other particles present in a cylinder aroundits path).

Page 153: From hyperbolic systems to kinetic theory: a personalized quest

15 The Mimura–Nishida and the Crandall–Tartar Existence Theorems 131

analysis, the Dunford–Pettis theorem,2,3 and the De La Vallee Poussin crite-rion.4 It is important that mass is conserved, but it is equally important inkinetic theory that there cannot be concentrations of mass on sets of smallmeasure, and this results from the entropy inequality, for which Lemma 15.3is the key.

Lemma 15.3. If u1, . . . , um are nonnegative in RN , have a support with finite

measure, and∫

RN

( m∑

i=1

ui log(ui))dx ≤ I <∞, (15.6)

then for every ε > 0, there exists δ > 0 such that

ω

( m∑

i=1

ui

)dx ≤ ε for all measurable subsets ω having measure ≤ δ, (15.7)

and one can choose δ = ε2m e−2J/ε, with J = I + 1

e

∑mi=1 meas

(support(ui)

).

Proof : The function s log(s) is negative for 0 < s < 1 and it attains itsminimum at s = 1

e and the minimum is − 1e ; one deduces that s log+(s) ≤

s log(s) + 1e for s ≥ 0, where log+(s) is the nonnegative part of the log-

arithm, equal to 0 on [0, 1] and log(s) on [1,∞). One then deduces that∫RN

(∑mi=1 ui log+(ui)

)dx ≤ J = I + 1

e

∑mi=1meas

(support(ui)

), although

it is only the measure of the points where 0 < ui < 1 that should be added,but in practice one bounds this measure by an upper bound of the mea-sure of the support. For any measurable set ω of finite measure, and fori = 1, . . . ,m, one decomposes ω into the part where 0 ≤ ui ≤M and the partwhere ui ≥M , and one has

∫ω ui dx ≤M meas(ω)+ 1

log+(M)

∫ω ui log+(ui) dx,

where M > 1 has to be chosen; summing in i, one obtains∫ω

(∑mi=1 ui

)dx ≤

mM meas(ω)+ Jlog+(M) , so one chooses M = e−2J/ε, which makes the second

term ε2 , and one chooses then meas(ω) so that mM meas(ω) ≤ ε

2 . ��Proof of Proposition 15.2 : Let V = maxi |Vi|. In order to find an upper boundof ui(x0, t0) for some t0 ∈ [0, T ], one only needs to know the initial datavj , j = 1, . . . ,m, at points y such that |y−x0| ≤ V t0; so one changes the initialdata by putting all vj(y) equal to 0 if |y− x0| > V T , and keeping the formervalue if |y − x0| ≤ V T , and the bounds that one will be able to prove for thesolution will apply for bounding each ui(x0, t0). With the new initial data, theinitial entropy

∫RN

(∑mi=1 vi log(vi)

)dx is ≤ CNV

NTNmM0 log+(M0), where

2 Nelson DUNFORD, American mathematician, 1906–1986. He had worked at YaleUniversity, New Haven, CT.

3 Billy James PETTIS, American mathematician. He worked at Tulane University,New Orleans, LA, and University of North Carolina, Chapel Hill, NC.

4 Charles Jean Gustave Nicolas DE LA VALLEE POUSSIN, Belgian mathematician,1866–1962. He was made Baron in 1928. He had worked in Louvain, Belgium.

Page 154: From hyperbolic systems to kinetic theory: a personalized quest

132 15 The Mimura–Nishida and the Crandall–Tartar Existence Theorems

CN is the volume of the unit ball of RN ; by hypothesis, as long as the solution

exists in(L∞(RN )

)m, the entropy will be bounded by that quantity; to applyLemma 15.3, one needs to estimate the measure of the support of the solution,but by the finite speed of propagation property the support at time 0 is ina ball of radius V T and grows at most at speed V , so for 0 ≤ t ≤ T it isincluded in a ball of radius 2V T , and the coefficient J of Lemma 15.3 maybe taken to be CNV NTNm (M0 log+(M0) + 4N

e ). For the value ε0, Lemma15.3 gives a value δ, and one defines ρ by CNρN = δ, so that as long as thesolution exists and for any time t ∈ [0, T ], the total mass in a ball of radius ρis less or equal than the critical value for the Mimura–Nishida estimate.

Then one applies the estimate to prove that if the solution exists up totime t1, with t1 ≤ T , then it exists up to time t1+ ρ

V and its L∞ norm betweent1 and t1 + ρ

V is at most multiplied by k0. Indeed, one performs a second typeof truncation, just for the purpose of estimating the norm of the solution: onerestricts the data at time t1 inside a ball centered at a point x1 and withradius ρ, and one replaces the data at time t1 by 0 outside this ball, and thehypothesis of the Mimura–Nishida estimate is valid, and the solution existsfor t > t1 with a norm in L∞(RN ) multiplied at most by a factor k0; of course,this last solution only coincides with ours in a small cone, namely the set ofpoints (x, t) with t1 ≤ t ≤ t1 + ρ

V and |x−x1| ≤ ρ−V (t− t1), but by movingthe point x1 this small cone sweeps the entire strip R

N × (t1, t1 + ρV

), and

therefore the norm of the solution is at most multiplied by k0 in this strip.Then one repeats the process, and because ρ has been estimated uniformly,one attains the time T starting from time 0 in a finite number of operations(bounded by 1 + V T

ρ ), and the L∞ estimate is obtained. ��Proof of Lemma 15.1 : For the Broadwell model, one has the conservation ofmass and the conservation of momentum, written as (u + w)t + ux = 0 and(v+w)t−vx = 0. One starts from nonnegative bounded initial data u0, v0, w0

such that∫

R(u0 +v0 +2w0) dx ≤ ε0, and a precise value of ε0 will be obtained.

For x0 ∈ R and t0 > 0, one integrates (u + w)t + ux = 0 on a triangle withvertices (x0, 0), (x0 + t0, 0), (x0 + t0, t0), and one obtains a boundary integral∫ t

0

w(x0 + s, s) ds+∫ t

0

u(x0 + t0, s) ds =∫ x0+t0

x0

(u0 + w0) dx, (15.8)

and similarly, if one integrates (v + w)t − vx = 0 on a triangle with vertices(x0 − t0, 0), (x0, 0), (x0 − t0, t0), one obtains

∫ t

0

w(x0 − s, s) ds+∫ t

0

u(x0 − t0, s) ds =∫ x0

x0−t0(v0 + w0) dx, (15.9)

so that one deduces that∫ ∞

0

u(x+s, s) ds ≤ ε0,

∫ ∞

0

v(x−s, s) ds ≤ ε0,

∫ ∞

0

w(x, s) ds ≤ ε0 a.e. x ∈ R,

(15.10)

Page 155: From hyperbolic systems to kinetic theory: a personalized quest

15 The Mimura–Nishida and the Crandall–Tartar Existence Theorems 133

where the upper bounds ∞ are only used after one has shown global existence,but until then are restricted to the finite time of existence. Then, as long asthe solution exists one defines M(t) as the smallest number such that

u(x, s), v(x, s), w(x, s) ≤M(t) a.e. x ∈ R, s ∈ (0, t), (15.11)

so that M(0) is max{||u0||L∞(R), ||v0||L∞(R), ||w0||L∞(R)}. For almost all x0 ∈R, one can work on the characteristic line parametrized by (x0 +s, s), and onehas d

dtu(x0 + t, t) ≤ w2(x0 + t, t) ≤ M(t)w(x0 + t, t), giving after integrationu(x0+t, t) ≤ u0(x0)+

∫ t0 M(s)w(x0+s, s) ds ≤M(0)+M(t)

∫ t0 w(x0+s, s) ds ≤

M(0) + ε0M(t); therefore one finds that the essential supremum of u(x, s)for x ∈ R and s ∈ (0, t) is ≤ M(0) + ε0M(t). Similarly, working on thecharacteristic line parametrized by (x0−s, s) for v one finds that the essentialsupremum of v(x, s) for x ∈ R and s ∈ (0, t) is ≤M(0)+ε0M(t), and workingon the characteristic line parametrized by (x0, s) for w one finds that theessential supremum of w(x, s) for x ∈ R and s ∈ (0, t) is ≤ M(0) + ε0M(t).These three bounds, and the definition of M(t) as the smallest number forsome inequality to hold, shows that one has

M(t) ≤M(0) + ε0M(t), (15.12)

as long as the solution exists; therefore one finds global existence if ε0 < 1,and by choosing ε(k) = 1 − 1

k , one finds M(t) ≤ M(0)1−ε(k) = kM(0). ��

Henri CABANNES has checked that a Mimura–Nishida estimate holds forvarious classical discrete velocity models. I shall describe in the next lecturea different way to obtain similar estimates, which is not based on using con-servations, like in the proof of MIMURA and Takaaki NISHIDA.

[Taught on Monday October 8, 2001.]

Notes on names cited in footnotes for Chapter 15, TULANE.5

5 Paul TULANE, American philanthropist, 1801–1887. Tulane University, New Or-leans, LA, is named after him.

Page 156: From hyperbolic systems to kinetic theory: a personalized quest

16

Systems Satisfying My Condition (S)

In this lecture, I shall describe some local and global existence results fora special class (S) of semi-linear systems in only one space variable, that Iintroduced in 1979, those which have the form

(ui)t + Ci(ui)x +∑

j,k

Ai,j,kujuk = 0, i = 1, . . . ,m, (16.1)

with the condition (S)

(S) Cj = Ck implies Ai,j,k = 0 for all i. (16.2)

The reason why I had first looked at this class of system was that ithas a property of stability with respect to weak convergence, which I shalldescribe in the next lecture, a simple consequence of the div-curl lemma (afirst example of compensated compactness), that I had proven a few yearsbefore with Francois MURAT.

As often happens when doing research in mathematics, when one looks forsomething and one has found it, one may overlook another interesting propertythat one was not really looking for; one must stay alert and pay attention todetails, and one may discover some unexpected result. I was checking a newproof of that particular application of the div-curl lemma, different from theone using the Fourier transform, which was our initial approach in 1974, orthe one using the framework of differential forms and Hodge decomposition,1

which was shown to me in 1975 by Joel ROBBIN,2 and it seemed to applyto a more general setting using Lp spaces for p < 2; by looking to the bestpossible value of p, I finally found a simple reason why if u1, u2 ∈ L1(R2)with (u1)x1 , (u2)x2 ∈ L1(R2), then one has u1u2 ∈ L1(R2); I now call that1 William Vallance Douglas HODGE, Scottish mathematician, 1903–1975. He had

worked in Bristol and in Cambridge, England.2 Joel William ROBBIN, American mathematician, born in 1941. He works at Uni-

versity of Wisconsin, Madison, WI.

Page 157: From hyperbolic systems to kinetic theory: a personalized quest

136 16 Systems Satisfying My Condition (S)

type of result compensated integrability [19], because it should not be confusedwith compensated compactness, and it is useful for proving the existence ofsolutions for systems in the class (S), with initial data in L1(R), and it appearsto be a completely different approach from the semi-group point of view (whichhad not really succeeded, apart from cases where an L1 contraction propertyholds).

Definition 16.1. For c ∈ R, Vc is the space of functions u such that ut +c ux = f ∈ L1(R × R) and u |t=0= g ∈ L1(R), with the norm ||u||Vc =||f ||L1(R2) + ||g||L1(R); Wc is the space of functions u such that there existsh ∈ L1(R) with |u(x, t)| ≤ h(x − c t) a.e. in R

2, with the norm ||u||Wc =infh ||h||L1(R).

Notice that the time t is not restricted to be ≥ 0, because one makes nohypothesis on the sign of the coefficients Ai,j,k; taking t ∈ R will be possiblefor small L1 data, but for large L1 data one will only obtain local existence,and the definition of the spaces Vc and Wc can be restricted to functionsdefined in R × (−α, β) for positive α, β (so the interval in time contains 0),and even some other sets, as will be seen in some proofs.

Lemma 16.2. For every c ∈ R, one has Vc ⊂ Wc, with ||u||Wc ≤ ||u||Vc forall u ∈ Vc.

For c1 �= c2, one has u1u2 ∈ L1(R2) whenever u1 ∈Wc1 , u2 ∈Wc2 , with∫

R×R

|u1| |u2| dx dt ≤ 1|c1 − c2| ||u1||Wc1

||u2||Wc2for all u1 ∈Wc1 , u2 ∈Wc2 .

(16.3)

Proof : Using the Fubini theorem, one has u(x, t) = g(x − c t) +∫ t

0 f(x −c s, t − s) ds for almost every x, t, where f = ut + c ux and g = u |t=0, andthis gives |u(x, t)| ≤ h(x − c t) with h(y) = |g(y)| +

∫R|f(y + c s, s)| ds and

||h||L1(R) = ||u||Vc .For i = 1, 2, one has |ui(x, t)| ≤ hi(x − ci t) a.e. in R

2, with ||hi||L1(R) ≤||ui||Wci

+ε, so that∫

R×R|u1| |u2| dx dt ≤

∫R×R

|h1(x−c1 t)| |h2(x−c2 t)| dx dt;one changes variables in the last integral, defining y1 = x− c1 t, y2 = x− c2 t,which is a good change of variables because c1 �= c2, and one has dx dt =

1|c1−c2| dy1 dy2 and the integral is equal to 1

|c1−c2|∫

Rh1(y1) dy1

∫Rh2(y2) dy2;

then, one lets ε tend to 0. ��The technical adantage of using the functional space Vc instead of Wc

is that functions in Vc have a trace at t = 0, and that their behaviour atinfinity is also well described; indeed, if one moves with speed c and onewrites u(x, t) = U(x − c t, t) then U ∈ V0, and if one denotes E = L1(R), afunction U ∈ V0 is such that U(·, 0) ∈ E and Ut ∈ L1(R;E), so U is absolutelycontinuous in t with values in E; apart from having well defined values at every

Page 158: From hyperbolic systems to kinetic theory: a personalized quest

16 Systems Satisfying My Condition (S) 137

time U(·, t) ∈ L1(R), it implies that U also has well defined limits U+ ∈ L1(R)as t→ +∞ and U− ∈ L1(R) as t→ −∞.

Because condition (S) is assumed, it is possible to define solutions withinitial data in L1(R) in a unique way, globally for small L1 data, locally intime for large L1 data, and it is the adopted choice of functional spaces whichpermits that, and the solution is sought such that ui ∈ VCi for i = 1, . . . ,m,and thanks to Lemma 16.2 all the products ujuk appearing with a nonzerocoefficient Ai,j,k belong to L1(R × R).

Proposition 16.3. Assuming condition (S), there exists ε0 > 0 (dependingupon the coefficients Ai,j,k and the velocities Ci in an explicit way) such thatif the initial data vi, i = 1, . . . ,m, satisfy

R

( m∑

i=1

|vi|)dx < ε0, (16.4)

there is a unique solution u = (u1, . . . , um) ∈ VC1 × . . .× VCm . (16.5)

Proof : One proves the existence of a solution with small norm by applyinga fixed point argument for a strict contraction; uniqueness can be provenwithout assuming that the solution has a small norm. One looks for a fixedpoint of the map Φ, defined for u ∈ WC1 × . . . ×WCm , with Φ(u) = U ∈VC1 × . . .× VCm solution of

(Ui)t + Ci(Ui)x +∑

j,k

Ai,j,kujuk = 0, Ui |t=0= vi, i = 1, . . . ,m. (16.6)

Because of Lemma 16.2 and condition (S), all the terms Ai,j,kujuk belong toL1(R × R), so that Ui ∈ VCi for i = 1, . . . ,m. The spaces used are Banachspaces, and one looks for a closed set which is mapped into itself, and thenthat it is a strict contraction. For

αi = ||vi||L1(R), i = 1, . . . ,m, (16.7)

||ui||WCi≤ ξi, i = 1, . . . ,m implies

||Ui||VCi≤ ηi = αi +

∑j,k

′ |Ai,j,k||Cj−Ck|ξjξk, i = 1, . . . ,m,

(16.8)

where ′ means that one avoids indices for which Cj = Ck (which correspondto Ai,j,k = 0). For

β = max{j,k|Cj =Ck}

m∑

i=1

|Ai,j,k||Cj − Ck| , (16.9)

m∑

i=1

ηi ≤m∑

i=1

αi + β∑

j,k

′ξjξk ≤ ε0 + β

( m∑

j=1

ξj

)2

, (16.10)

and one checks immediately that

Page 159: From hyperbolic systems to kinetic theory: a personalized quest

138 16 Systems Satisfying My Condition (S)

ε0 ≤ 14β

andm∑

i=1

ξi ≤ 2ε0 implym∑

i=1

ηi ≤ 2ε0. (16.11)

If one imposes ε0 ≤ 14β , then one has found a closed set mapped into itself,

defined by∑mi=1 ||ui||WCi

≤ 2ε0; in order to check if Φ is a strict contractionon this set, one takes u, u′ in the set and one estimates the norm of Ui − U ′

i ,using the usual decomposition ujuk − u′ju

′k = uj(uk − u′k) + (uj − u′j)u

′k, and

one finds

||Ui−U ′i ||VCi

≤∑

j,k

′ |Ai,j,k||Cj − Ck| (ξj ||uk−u

′k||WCk

+||uj−u′j||WCjξ′j), i = 1, . . . ,m,

(16.12)so that if

∑mi=1 ||ui||WCi

≤ 2ε0 and∑m

i=1 ||u′i||WCi≤ 2ε0 one has

m∑

i=1

||Ui − U ′i ||VCi

≤ 4ε0β

m∑

i=1

||ui − u′i||WCi, (16.13)

and a strict contraction property follows from the choice ε0 <1

4β . Uniquenessof the solution is true without assuming that the second solution has a smallnorm, as will be shown later. ��

With a simple adaptation, one can obtain a local existence theorem forarbitrary data in L1, but the time of existence is not just a function of thenorm of the initial data in L1, like for ordinary differential equations (onceagain one observes the limitations of the point of view of the theory of semi-groups for this kind of problem), and that will be seen by considering theexample

ut + ux = u v; u(·, 0) = u0

vt − vx = u v; v(·, 0) = v0,(16.14)

for which the preceding proposition applies with β = 1, and gives global exis-tence if

∫R(|u0|+|v0|) dx < 1. For a > 0 and L > 1

a , one chooses the initial datau0 and v0 equal to a in (−L,+L) and 0 outside; in the domain of dependence{(x, t) | |x| ≤ L−|t|}, the solution then solves the ordinary differential systemut = vt = u v, u(0) = v(0) = a, whose solution is u(t) = v(t) = a

1−a t and itblows up at time tc = 1

a < L; the initial data satisfy∫

R(|u0|+ |v0|) dx = 4La,

which can be any number > 4, but the time of existence is 1a (because one

has 0 ≤ u(x, t), v(x, t) ≤ a1−a t ), which can be arbitrarily small by taking a

large. This explains that the time of local existence requires a more preciseanalysis than the evaluation of a few global norms (but it can be seen on thenondecreasing rearrangements of the initial data).

Proposition 16.4. Let v1, . . . , vm ∈ L1(R) and let r0 > 0 be such that∫ z+r0

z−r0

( m∑

j=1

|vj(x)|)dx ≤ ε0 for all z ∈ R, (16.15)

Page 160: From hyperbolic systems to kinetic theory: a personalized quest

16 Systems Satisfying My Condition (S) 139

with ε0 as in Proposition 16.3, for example ε0 <1

4β . Then there is a uniquesolution for |t| ≤ r0

maxi |Ci| .

For a given z ∈ R, one takes as new initial data the functions vi in the interval(z− r0, z+ r0) and 0 outside, and for these new initial data the solution existsglobally, but it may only coincide with the desired solution in the domain ofdependence {(x, t) | x− Cit ∈ (z − r0, z + r0) for i = 1, . . . ,m}.

However, one must prove that two solutions starting from two intervalscoincide on the intersection of the domains of dependence. This comes fromusing the uniqueness of small solutions as in Proposition 16.3, but observingthat it applies to domains of the form DJ = {(x, t) | x − Cit ∈ J for i =1, . . . ,m} for any interval J given at time 0, where the L1 norm of the initialdata has to be small, and this is because given gi ∈ L1(J) and fi ∈ L1(DJ)there is one solution of (Ui)t + Ci(Ui)x = fi in DJ and U |t=0= gi on J ,belonging to a space VCi defined in an obvious way, for i = 1, . . . ,m.

If a solution has large norm in L1 of a subset of R×R or R, then one alsouses the fact that for every ε > 0 there exists δ > 0 such that the integral onany subset of measure at most δ is bounded by ε, a property which has beenused in asserting the existence of r0 in (16.15). ��

The fixed point property is used on a set of WC1 × . . . ×WCm but thefixed point is necessarily in the range of Φ, in VC1 × . . .× VCm . In the case ofsmall data in L1, where the solution exists for all time, I have already pointedout that the solution ui belonging to VCi gives information on the asymptoticbehaviour t→ ±∞, i.e. there exists u−i , u

+i ∈ L1(R) such that

R

|ui(x, t) − u±i (x− Cit)| dx→ 0 as t→ ±∞. (16.16)

I have also obtained results of scattering, i.e. about the map u− �→ u+, whichI shall not describe.

The global existence result for small data, and the counter-example show-ing that the time of existence for large data is not only a function of the L1

norm, shows an important effect due to the transport term. The model of thecounter-example is similar to that of a chemical chain reaction which createsan explosion in finite time, but the reaction needs the two constituents to bepresent for sustaining itself, and if the constituents move at a different veloci-ties, the reaction started at one point must be sustained by molecules comingfrom elsewhere, and there is a problem of timing and it is important that thereis a sufficient amount to sustain the reaction to its end. This interpretationexplains one defect of using only global norms of functional spaces like Lp,which give no clue about where the information is located, and one could thinkof using rearrangement methods, as started by HARDY and LITTLEWOOD,3

3 John Edensor LITTLEWOOD, English mathematician, 1885–1977. He had workedin Manchester and in Cambridge, England, where he held the newly foundedRouse Ball professorship (1928–1950).

Page 161: From hyperbolic systems to kinetic theory: a personalized quest

140 16 Systems Satisfying My Condition (S)

but I do not know of any efficient way to do that for models of kinetic theory;techniques of maximal functions, also started by HARDY and LITTLEWOOD

and extended by WIENER should be more adapted, probably in the way usedby Lars HEDBERG,4 and he traced his idea to some earlier work of LennartCARLESON, and of STEIN.

Before generalizing the preceding results to the Broadwell model, whichviolates condition (S) because of the presence of the w2 terms, one shouldobserve that the method also permits one to give bounds in L∞, and theargument is analogous to that of MIMURA and Takaaki NISHIDA, but relieson the bounds in the VCi spaces instead of a conservation property.

Proposition 16.5. Assuming condition (S), for every k > 1 there existsε(k) > 0 such that

vi ∈ L1(R) ∩ L∞(R), i = 1, . . . ,m, and∫

R

(∑mi=1 |vi|

)dx ≤ ε(k) imply

|ui(x, t)| ≤ k maxj ||vj ||L∞(R), i = 1, . . . ,m a.e. (x, t) ∈ R × R.(16.17)

Proof : Taking ε(k) ≤ ε0 one has a global solution satisfying |ui(x, t)| ≤ hi(x−Cit) with

∑i ||hi||L1(R) ≤ 2ε0. One has local existence in L∞, and one must

find a bound for the L∞ norm. Integrating along a characteristic line withvelocity Ci one bounds each of the terms |Ai,j,kujuk| by replacing |uj| byhj(x − Cjt) and |uk| by M(t) if Ci �= Cj , or |uk| by hk(x − Ckt) and |uj | byM(t) if Ci �= Ck, the case Ci = Cj = Ck being of no consequence as Ai,j,k = 0in that case; by integrating one finds that |ui(x, t)| ≤ |vi(x−Cit)|+Kε0M(t)with K depending only on the coefficients Ai,j,k and the velocities Ci, and thisgives an estimate M(t) ≤M(0)+Kε0M(t), and one chooses ε(k) ≤ 1

K

(1− 1

k

).��

In order to treat the Broadwell model, I introduced a slightly more gen-eral framework, but there the nonnegative character of solutions is crucial,and t ≥ 0. The main idea is that an estimate for the integral of w2 is nowobtained by integration of the third equation, which gives

∫R×(0,T )

w2 dx dt+∫

Rw(x, T ) dx =

∫R×(0,T )

u v dx dt+∫

Rw0 dx, and because of u0, v0, w0 ≥ 0 one

has w ≥ 0 and therefore the missing bound is replaced by∫

R×(0,T )w2 dx dt ≤

∫R×(0,T ) u v dx dt + ||w0||L1(R) ≤ 1

2 ||u||W1 ||v||W2 + ε0; one concludes as be-fore that ||u||V1 ≤ ||u0||L1(R) +

∫R×(0,T )w

2 dx dt and ||v||V−1 ≤ ||v0||L1(R) +∫

R×(0,T ) w2 dx dt.5 The solution obtained is such that for large t one has

4 Lars Inge HEDBERG, Swedish mathematician, 1935–2005. He had worked inLinkoping, Sweden.

5 I had not wanted to repeat the same procedure as before in my report, andthat may be why Reinhard ILLNER interpreted that I had not really proven theexistence for data in L1 for the Broadwell model.

Page 162: From hyperbolic systems to kinetic theory: a personalized quest

16 Systems Satisfying My Condition (S) 141

u ≈ u+(x − t), v ≈ v+(x + t) and w ≈ w+(x), but as pointed out by RussellCAFLISCH,6 one has w+ = 0 because w ∈ L2

(R×(0,∞)

). The conservation of

mass and the conservation of momentum show that∫

Ru+ dx =

∫R(u0+w0) dx

and∫

Ruv+ dx =

∫R(v0 + w0) dx.

[Taught on Wednesday October 10, 2001.]

Notes on names cited in footnotes for Chapter 16, R. BALL,7 ILLNER.8

6 Russell Edward CAFLISCH, American mathematician. He has worked at NYU(New York University), New York, NY, and at UCLA (University of CaliforniaLos Angeles), Los Angeles, CA.

7 Walter William Rouse BALL, English mathematician, 1850–1925. He had workedin Cambridge, England.

8 Reinhard ILLNER, German-born mathematician. He has worked in Kaiserslautern,Germany, at Duke University, Durham, NC, and at University of Victoria, BritishColumbia.

Page 163: From hyperbolic systems to kinetic theory: a personalized quest

17

Asymptotic Estimates for the Broadwelland the Carleman Models

For nonnegative initial data with a small total mass∫

R(u0 + v0 + 2w0) dx,

the asymptotic behaviour is that of a free streaming solution, i.e. without thenonlinear interaction terms, but with a particularity that w tends to 0, andthat is due to the presence of the w2 term in the equations. Actually, using aremark of Raghu VARADHAN,1 which simplified a result of Thomas BEALE,2

which I shall discuss later, the result is true for all nonnegative data with afinite total mass.

In principle, models of kinetic theory have no interaction between parti-cles travelling at the same velocity, but one should remember that the fourvelocities model has a term u3u4 corresponding to particles going in oppositedirections and parallel to the y axis, and it is only because the initial datahave been assumed independent of y that the velocities seem to be the same,equal to 0, but 0 is just the projection of the velocity onto the x axis; actuallythe conservation of kinetic energy is concerned with u+ v + 2w and not withu+ v as it would have been if the particles with density w had a zero velocity.

The presence of the w2 term acts as a destruction mechanism, and indeedthe particles from the third and fourth families eventually all transform intoparticles of the first and second families by collisions. One may wonder whythe process is not symmetric and why the collisions of particles of the first andsecond families do not produce enough particles of the third and fourth fam-ily. My analysis, which went further than the result of MIMURA and TakaakiNISHIDA, explained that the hypothesis of finite mass, together with the factthat the particles created in the first and second families are taken away (be-cause they have different velocities) puts a severe limitation on the productionof particles of the third and fourth families, which are not replaced, so thesefamilies die out.1 Sathamangalam Raghu Srinivasa VARADHAN, Indian-born mathematician, born

in 1940. He works at NYU (New York University), New York, NY.2 James Thomas BEALE, American mathematician. He works at Duke University,

Durham, NC.

Page 164: From hyperbolic systems to kinetic theory: a personalized quest

144 17 Asymptotic Estimates for the Broadwell and the Carleman Models

Once one knows that u ∈ V1, v ∈ V−1, w ∈ V0 ∩L2, one deduces that thereexist u∞, v∞ ∈ L1(R) such that, as t tends to ∞, one has

∫R|u(x, t) − u∞(x− t)| dx→ 0∫

R|v(x, t) − v∞(x+ t)| dx→ 0∫

R|w(x, t)| dx → 0.

(17.1)

The conservation of mass∫

R(u+v+2w) dx and the conservation of momentum∫

R(u− v) dx imply that

∫R(u∞ + v∞) dx =

∫R(u0 + v0 + 2w0) dx∫

R(u∞ − v∞) dx =

∫R(u0 − v0) dx,

(17.2)

and solving this system gives∫

Ru∞ dx =

∫R(u0 + w0) dx∫

Rv∞ dx =

∫R(v0 + w0) dx,

(17.3)

and therefore it is more natural to express the conservation of mass and theconservation of momentum as

(u+ w)t + ux = 0(v + w)t − vx = 0, (17.4)

emphasizing u+w as the mass which will eventually go to infinity on the rightside and v+w as the mass which will eventually go to infinity on the left side.

In the two-dimensional four velocities model, one may tag particles, andthis point of view might be classical, but I only noticed it a few years ago,while trying with Chun LIU to prove global existence results for the two-dimensional four velocities model;3 I had been led to integrating along thedirection (1,−1) by a remark of Robert PESZEK,4 but such integrals hadalready appeared before in a method of Shuichi KAWASHIMA,5 and I mayhave been just finding an intuitive explanation for his estimates, which I hadnot read, but asked my student Kamel HAMDACHE to read and generalize,6

3 Chun LIU, Chinese-born mathematician. He was a post doctoral associate ofCNA (Center for Nonlinear Analysis) at CMU (Carnegie Mellon University),Pittsburgh, PA, and he works now at Penn State (Pennsylvania State University),University Park, PA.

4 Robert W. PESZEK, Polish-born mathematician. He was a post doctoral associateof CNA (Center for Nonlinear Analysis) at CMU (Carnegie Mellon University),Pittsburgh, PA, and he works now at MTU (Michigan Technological University),Houghton, MI.

5 Shuichi KAWASHIMA, Japanese mathematician. He works at Kyushu University,Fukuoka, Japan.

6 Kamel HAMDACHE, French mathematician, born in 1948. He has worked in Al-giers, Algeria, and then in various laboratories of CNRS (Centre National de la

Page 165: From hyperbolic systems to kinetic theory: a personalized quest

17 Asymptotic Estimates for the Broadwell and the Carleman Models 145

which he did. When two particles from the first and second families collide,one may decide that it is the particle from the first family which switches tothe third family, and when two particles from third and fourth families collidethat it is the particle from the third family which switches to the first family.Conservation of mass is (u1 + u2 + u3 + u4)t + (u1 − u2)x + (u3 − u4)y = 0,conservation of momentum in x is (u1−u2)t+(u1 +u2)x = 0 and conservationof momentum in y is (u3 − u4)t + (u3 + u4)y = 0, from which one deduces(u1 + u3)t + (u1)x + (u3)y = 0, and it is natural to integrate along parallelsto the direction (1,−1), because for a particle of the first or third family onecannot predict the position

(ξ(t), η(t)

)that it will occupy at time t, but one

can predict what ξ(t) + η(t) will be; indeed, one has ξ′(t) + η′(t) = 1, becauseone has ξ′(t) = 1 and η′(t) = 0 while the particle is in the first family andξ′(t) = 0 and η′(t) = 1 while the particle is in the third family. If one defines

M13(x, y, t) =∫

R

(u1 + u3

)(x+ z, y − z, t) dz, so that

(M13)x = (M13)y, or M13(x, y, t) = N13(x+ y, t), (17.5)

one obtains(M13)t + (M13)x = 0, (17.6)

so that one can compute M13 directly from the initial data,

M13(x, y, t) =∫

R

(v1 + v3

)(x− t+ z, y − z) dz. (17.7)

If one has proven that the asymptotic behaviour is that u1, u2, u3, u4 lookeventually like u∞1 (x−t, y), u∞2 (x+t, y), u∞3 (x, y−t), u∞4 (x, y+t) (which is truefor nonnegative data with small L2 norm), and if mi is the integral of u∞i , thenone has m1 +m3 =

∫R2(v1 +v3) dx dy; similarly, m1 +m4 =

∫R2(v1 +v4) dx dy,

m2 +m3 =∫

R2(v2 + v3) dx dy, and m2 +m4 =∫

R2(v2 + v4) dx dy, but it is notclear if there are simple formulas giving separately m1,m2,m3,m4.

Coming back to the asymptotic behaviour for the Broadwell model, it isimportant to realize that there is no precise shape for the limiting functions u∞and v∞, and that they can be arbitrary nonnegative functions with compactsupport for example, if one accepts to translate them. Indeed let ϕ, ψ be twononnegative functions with compact support, and consider the initial datau0(x) = ϕ(x− a), v0(x) = ψ(x− b), w0(x) = 0, with a, b ∈ R chosen in such away that the support of u0 is entirely to the right of the support of v0; in thatcase the explicit solution will be u(x, t) = ϕ(x− a− t), v(x, t) = ψ(x− b+ t),w(x, t) = 0, because these formulas imply u v−w2 = 0. However, if one wantsexactly u∞ = ϕ, and v∞ = ψ, and the support does not satisfy the condition,

Recherche Scientifique), at ENSTA (Ecole Normale Superieure des TechniquesAvancees), Palaiseau, at ENS (Ecole Normale Superieure) Cachan, in Bordeaux,at Universite Paris-Nord, Villetaneuse, and at Ecole Polytechnique, Palaiseau,France. He did his thesis (doctorat d’etat, 1986) under my supervision.

Page 166: From hyperbolic systems to kinetic theory: a personalized quest

146 17 Asymptotic Estimates for the Broadwell and the Carleman Models

one has a more technical problem of scattering (which I have only studied forsystems satisfying condition (S) for small data).

The condition (S) (or its generalization) is not satisfied by the Carlemanmodel, and I was not trying to include it in my analysis (as it is not a modelfrom kinetic theory, and global L∞ bounds were known for that model), but Ithen learnt of an estimate by Reinhard ILLNER and Michael REED,7 that thesolution of the Carleman model with nonnegative data with finite total mass,i.e.∫

R(u0 + v0) dx = m <∞ satisfies a uniform estimate

0 ≤ u(x, t), v(x, t) ≤ C(m)t

, (17.8)

and that shows that the asymptotic behaviour is quite different than for theBroadwell model; I included a simplified proof of their result in the appendixof my 1980 report, but the estimates for C(m) were much too large. Theirresult suggested me to look at self-similar solutions of the Carleman model,which I shall describe in a moment, and from that study I conjectured a boundC(m) = O(m2 + 1) in the decay estimate, which I proved a few years after,and I shall describe that later, in connection with the method of generalizedinvariant regions. One cannot hope for a faster decay, because of conservationof mass, i.e.

∫R

(u(x, t) + v(x, t)

)dx = m, and if one starts with initial data

having their support in an interval of length L, then the support at timet is included in an interval of length L + 2t, and therefore one must havem ≤ (L+2t)

(||u(·, t)||L∞(R) + ||v(·, t)||L∞(R)

); by letting L tend to 0, it shows

that C(m) ≥ m4 . One has C(m) ≥ 1 for all m > 0, because if u(x0) > 0

then f(t) = u(x0 + t, t) satisfies the differential inequality f ′ + f2 ≥ 0 withf(0) > 0, and the solution satisfies f(t) ≥ f(0)

1+f(0) t for all t > 0; by letting f(0)tend to ∞, it gives C(m) ≥ 1.

For any of our semi-linear hyperbolic systems with a quadratic nonlinear-ity, if ui, i = 1, . . . ,m, is a solution, then ui, i = 1, . . . ,m, is a solution if onedefines it by ui(x, t) = λui(λx, λ t), i = 1, . . . ,m, where λ > 0 is arbitrary. Itis then natural to look for solutions such that ui = ui for i = 1, . . . ,m, inde-pendently of λ, and that means that ui(x, t) = 1

t Ui(xt

)(by choosing λ = 1

t ),and such solutions are called self-similar.

One looks for a self-similar nonnegative solution of the Carleman model,u(x, t) = 1

t U(xt

), v(x, t) = 1

t V(xt

), with finite total mass m, which must then

be∫

R(U + V ) dσ. One finds that t2(ut + ux + u2 − v2) = −U − σ U ′ + U ′ +

U2 − V 2 = 0, and t2(vt − vx − u2 + v2) = −V − σ V ′ − V ′ − U2 + V 2 = 0,where σ = x

t and ′ = ddσ , i.e.

((1 − σ)U

)′ + U2 − V 2 = 0((1 + σ)V

)′ + U2 − V 2 = 0,(17.9)

7 Michael Charles REED, American mathematician, born in 1942. He works at DukeUniversity, Durham, NC.

Page 167: From hyperbolic systems to kinetic theory: a personalized quest

17 Asymptotic Estimates for the Broadwell and the Carleman Models 147

from which one deduces by subtracting the two equations

(1 − σ)U − (1 + σ)V = C0, (17.10)

and one only considers the case C0 = 0; the reason is that when t tends to 0,the data for U and for V converge to α δ0 and to β δ0, and one then expectsthat U and V vanish if |x− t| > t, because of the finite speed of propagation,which means that U(σ) and V (σ) vanish for |σ| > 1. Using U = (1 + σ)Z andV = (1−σ)Z, one finds that [(1−σ2)Z]′ +4σZ2 = 0, which after division byZ2 gives a linear equation (σ2−1)

(1Z

)′− 2σZ +4σ = 0, which has the particular

solution 1Z = 2, and general solution 2 + C(1 − σ2); this gives

U(σ) = 1+σ2+C(1−σ2) in [−1, 1), 0 outside,

V (σ) = 1−σ2+C(1−σ2) in (−1, 1], 0 outside, (17.11)

for which one must have C > −2. One sees that V (σ) = U(−σ), so that bothU and V have the same integral, and the relation between C and m is

m =∫ +1

−1

22 + C(1 − σ2)

dσ, (17.12)

which can be computed explicitly (and m tends to 0 as C tends to +∞ andtends to 0 as C tends to −2), and I shall come back to this computation later.

For the Broadwell model, the self-similar solutions do not have finite mass,but I have suggested that they could be useful for another question, discussedlater.

[Taught on Friday October 12, 2001.]

Page 168: From hyperbolic systems to kinetic theory: a personalized quest

18

Oscillating Solutions; the 2-D Broadwell Model

There are various reasons why it is useful to consider what happens for se-quences of solutions of evolution equations when one starts from a sequence ofinitial data which converges only weakly. My motivation in the mid 1970s wasthat topologies like weak convergence and more general topologies of weaktype, like those appearing in homogenization, are a good way to express therelations between different scales, the finest scale being called microscopic (ormesoscopic for those who are rigid enough to consider that the term micro-scopic only applies to the level of atoms) and the coarsest scale being calledmacroscopic; I had initiated that philosophy in the early 1970s, influenced bysome work of Evariste SANCHEZ-PALENCIA.

Homogenization is understood in the general context that I had developedwith Francois MURAT in the early 1970s, i.e. related to the H-convergenceapproach that we had introduced, which is a little more general than theG-convergence approach that Sergio SPAGNOLO had developed in the late1960s, with the help of Ennio DE GIORGI.1 I had borrowed the term homog-enization from Ivo BABUSKA, but I applied it in general situations, withouta restriction to a periodically modulated framework, which I had first seenin the work of Evariste SANCHEZ-PALENCIA, and the way I used that termcertainly conformed to the spirit of what Ivo BABUSKA had meant when heintroduced it. However, among those who often use the mathematical toolsthat I had developed for the general framework, many limit themselves tothe periodically modulated case, for one reason or another, but project theirlimitations on their students by not emphasizing that the method that theyuse had been developed for a general framework; probably for some other rea-son, they rarely mention the pioneering work that had been done by Evariste1 Ennio DE GIORGI, Italian mathematician, 1928–1996. He received the Wolf Prize

in 1990, for his innovating ideas and fundamental achievements in partial differ-ential equations and calculus of variations, jointly with Ilya PIATETSKI-SHAPIRO.He had worked at Scuola Normale Superiore, Pisa, Italy.

Page 169: From hyperbolic systems to kinetic theory: a personalized quest

150 18 Oscillating Solutions; the 2-D Broadwell Model

SANCHEZ-PALENCIA in the early 1970s, precisely on the periodic frameworkthat they want to limit themselves to.

According to my philosophy, the weak convergence is adapted to somequantities, which are usually coefficients of differential forms (as it appearedafter discussions with Joel ROBBIN), for example it applies to the density ofmass ρ, and to the linear momentum q, which both appear in the equationexpressing conservation of mass, ∂ρ

∂t + div(q) = 0, but if one wants to definean effective velocity u for transport of mass by writing q = ρ u, then the weakconvergence may not be adapted to u itself.2 If some physical phenomenonoccurs at a microscopic level, and a model pretends to describe the relevantphysical quantities observed at a macroscopic level, then the macroscopicequations should be stable when using the right type of weak convergencewhich describes the passage from the microscopic or mesoscopic level to themacroscopic level (and one should then be careful about identifying what theright convergence should be); if they are not stable it means that the effectiveequations have not been identified correctly.

I was wondering if equations used in kinetic theory are stable with respectto weak convergence, which is well adapted for densities of particles, but Iobserved that most discrete velocity models are not stable. This negativefact is in itself difficult to use, because these models are not believed to beexact but are considered as simplifications; this mathematical exercise shouldthen be considered in its right context, that it may help in developing bettermathematical tools that one needs for studying more complicated models,believed to describe accurately a part of the physical reality.

My first step was to show that a simple model like the Carleman model(which is not a model from kinetic theory as it does not conserve momen-tum), is not stable with respect to weak convergence. In order to see that, Iconsidered sequences an, bn satisfying 0 ≤ an, bn ≤ M in R, and I used thesolutions un, vn satisfying

(un)t + (un)x + (un)2 − (vn)2 = 0 in R × (0,∞); un(·, 0) = an in R,(vn)t − (vn)x − (un)2 + (vn)2 = 0 in R × (0,∞); vn(·, 0) = bn in R,

(18.1)

and one has 0 ≤ un, vn ≤ M . If an ⇀ a∞ and bn ⇀ b∞ in L∞(R) weak �,then I wanted to show that it is not always true that un and vn convergein L∞(

R × (0,∞))

weak � to the solutions u∞ and v∞ corresponding to theinitial data a∞ and b∞. I later characterized the oscillations created in thesequence, and I shall describe that in another lecture, but at that time I did2 For charged particles, one denotes by � the density of electric charge and by j the

density of electric current, and conservation of charge takes the form ∂�∂t

+div(j) =0, but one usually does not introduce an effective velocity for transport of chargedefined by j = �u, because the “particles” carrying the charges have very differentmasses and velocities, as they are light electrons or heavy ions, and an averagevelocity would be useless.

Page 170: From hyperbolic systems to kinetic theory: a personalized quest

18 Oscillating Solutions; the 2-D Broadwell Model 151

not need to be as precise, and integrating along characteristic lines, I firstobserved that

un(x, t) = an(x− t) +O(t); vn(x, t) = bn(x+ t) +O(t). (18.2)

Taking an = 1 + sin(nx) and bn = 1 (so that M = 2) gives a∞ = b∞ = 1but (an)2 ⇀ 3

2 , and as (un)t + (un)x = −(an(x − t) + O(t))2 +

(1 + O(t)

)2

for which a subsequence converges weakly to − 12 + O(t), one finds that the

weak � limit u∗ of a subsequence um is 1 − t2 + O(t2), different from u∞ = 1

(and the weak � limit v∗ of a subsequence vm is 1 + t2 +O(t2), different from

v∞ = 1).The same type of negative result applies to the Broadwell model, and using

un(·, 0) = vn(·, 0) = 1 and wn(·, 0) = 1 + sin(n ·) (which imply an estimate0 ≤ un, vn, wn ≤M(t) by Proposition 15.2), one obtains un = 1 +O(t), vn =1 +O(t) and wn(x, t) = sin(nx) +O(t) and then weak limits of subsequencesare of the form u∗ = 1 + t

2 +O(t2), v∗ = 1 + t2 + O(t2), w∗ = 1 − t

2 + O(t2),different from the solution u∞ = v∞ = w∞ = 1.

I know a class of semi-linear hyperbolic systems which has the propertyof being stable by weak convergence, which is precisely the class satisfyingcondition (S) that I have already described, and that property follows froma simple application of the div-curl lemma that I had proven in 1974 withFrancois MURAT,3 which we generalized a few years later to a more generaltheory, called compensated compactness.4 The initial form of the div-curllemma is as follows.

Lemma 18.1. If Ω is an open subset of RN and

E(n) ⇀ E(∞) in L2(Ω; RN ) weak, and∂E

(n)i

∂xj− ∂E

(n)j

∂xiis bounded in L2(Ω) for all i, j = 1, . . . , N,

(18.3)

andD(n) ⇀ D(∞) in L2(Ω; RN ) weak, and∑N

i=1∂D

(n)i

∂xiis bounded in L2(Ω),

(18.4)

3 As I have already mentioned, Joel ROBBIN had provided a different proof in 1975,using differential forms and Hodge decomposition.

4 The name was coined by Jacques-Louis LIONS, who had asked Francois MURAT

to generalize the div-curl lemma as part of the work for his thesis, and he hadgiven him an article of SCHULENBERGER & WILCOX, which he thought related;Francois MURAT proved a result of sequential weak continuity for a more generalquadratic setting, using a condition of constant rank, choosing a slightly differentmethod than the one that we had followed for proving the div-curl lemma. Igeneralized the framework for predicting the weak limits of all quadratic forms(without imposing a rank condition on the differential constraints used), the germof the theory of H-measures which I developed ten years after.

Page 171: From hyperbolic systems to kinetic theory: a personalized quest

152 18 Oscillating Solutions; the 2-D Broadwell Model

then (E(n).D(n)) converges to (E(∞).D(∞)) weakly � in the sense of Radonmeasures,5 i.e.

Ω

(E(n).D(n))ϕdx→∫

Ω

(E(∞).D(∞))ϕdx for all ϕ ∈ Cc(Ω). (18.5)

Proof : The initial proof that we followed, which I used later for the moregeneral compensated compactness theory, is a simple adaptation of a proofby Lars HORMANDER of the compactness of the injection of H1

0 (Ω) intoL2(Ω) for Ω bounded (and even for Ω having finite Lebesgue measure), andit uses the Fourier transform; it differs from the other proof that we hadlearnt from our advisor, Jacques-Louis LIONS, based on a characterizationof compact sets in Lp, due to FRECHET and/or KOLMOGOROV. Choosingψ ∈ Cc(Ω) equal to 1 on the support of ϕ, one replaces E(n) by ϕE(n)

and D(n) by ψD(n), which satisfy similar hypotheses, and one must showthat with the added hypothesis that E(n) and D(n) have their support ina fixed compact set of R

N (and one extends them by 0 outside Ω), onehas∫

RN (E(n).D(n)) dx → ∫RN (E(∞).D(∞)) dx, which one checks by apply-

ing the Plancherel theorem, i.e. one proves that∫

RN (FE(n).FD(n)) dξ →∫

RN (FE(∞).FD(∞)) dξ. Because FE(n) converges pointwise to FE(∞) andis uniformly bounded, it converges in L2

loc(RN ; RN ) strong by the Lebesgue

dominated convergence theorem, and the only technical point is to show that(FE(n).FD(n)) is small at infinity. This follows from decomposing the twovectors FE(n)(ξ) and FD(n)(ξ) on the subspaces parallel to ξ or perpendicu-lar to ξ; the Lagrange identity |ξ|2|a|2 = |∑i ξiai|2 +

∑i<j |ξiaj − ξjai|2 for

all a ∈ CN and all ξ ∈ RN permits us to estimate the component a‖ on the

subspace parallel to ξ by |ξ|2|a‖|2 =∑i<j |ξiaj − ξjai|2 and the component

a⊥ on the subspace perpendicular to ξ by |ξ|2|a⊥|2 = |∑i ξiai|2, and this im-plies that |ξ|2|FE(n)

⊥ (ξ)|2 =∑

i<j |ξiFE(n)j (ξ) − ξjFE(n)

i (ξ)|2 ∈ L1(RN ) and

|ξ|2|FD(n)‖ (ξ)|2 = |∑i ξiFD(n)

i (ξ)|2 ∈ L1(RN ), so that |ξ|(FE(n).FD(n)) isbounded in L1(RN ). ��

The way the div-curl lemma is used for proving the stability with respectto weak convergence of semi-linear systems in one space variable satisfyingcondition (S) is the following.

Lemma 18.2. If ω ⊂ R2 and un ⇀ u∞, vn ⇀ v∞ in L2(ω) weak and

∂un

∂t + c1∂un

∂x bounded in L2(ω)∂vn

∂t + c2∂vn

∂x bounded in L2(ω),(18.6)

then if c1 �= c2 one has unvn ⇀ u∞v∞ weakly � in the sense of Radonmeasures.5 Under the preceding hypotheses, I have shown that one cannot always take ϕ to be

the characteristic function of a smooth set with closure in Ω, so the convergencedoes not hold in general in L1

loc(Ω) weak.

Page 172: From hyperbolic systems to kinetic theory: a personalized quest

18 Oscillating Solutions; the 2-D Broadwell Model 153

Proof : Using x1 = x and x2 = t, one applies the div-curl lemma withE(n) = (un,−c1un) and D(n) = (c2vn, vn), and one deduces that (c2−c1)unvnconverges to (c2 − c1)u∞v∞ weakly � in the sense of Radon measures. ��

I had introduced the particular class (S) of first-order semi-linear hyper-bolic systems with quadratic nonlinearities because I knew that class to bestable by weak convergence, a simple example of compensated compactness,but the existence theorem that I proved after that is something of a differentnature, which I later called compensated integrability, and I also coined theterm compensated regularity for another type of result that I had obtained,after it had been improved by Raphael COIFMAN,6 Pierre-Louis LIONS, YvesMEYER and Steven SEMMES,7 using Hardy space H1, because they had cre-ated some confusion by wrongly claiming that they had improved a result ofcompensated compactness,8 and as I consider that the worst sin of a teacheris to mislead students and researchers, I coined the new terms (compensatedintegrability, compensated regularity) precisely for explaining the differences.

I had checked that the class (S) is almost the right one,9 by considering ageneral system of the form

∂ui

∂t + Ci.grad(ui) = Fi(u1, . . . , um) in RN × (0, T ), i = 1, . . . ,m

ui(·, 0) = vi in RN , i = 1, . . . ,m,

(18.7)

with C1, . . . , Cm ∈ RN , and F1, . . . , Fm locally Lipschitz functions on R

m,so that for bounded initial data in L∞(RN ) the solution exists on an interval(0, T ), with T depending eventually upon the L∞ norm of the initial data, and

6 Ronald Raphael COIFMAN, Israeli-born mathematician, born in 1941. He workedat Washington University, St Louis, MO, and at Yale University, New Haven, CT.

7 Stephen William SEMMES, American mathematician, born in 1962. He works atRice University, Houston, TX.

8 What they had done could hardly be called an improvement of the div-curl lemmaanyway, because with more hypotheses (i.e. curl(E(n)) = 0 and div(D(n)) = 0)they did not even prove the convergence of the whole sequence (En.Dn) to(E∞.D∞), which I had shown by a simple integration by parts in the partic-ular case where curl(E(n)) = 0, because if E(n) = grad(un) then (E(n).D(n)) =∑

i

∂(unD(n)i

)

∂xi− undiv(D(n)) and un converges strongly. Using more hypotheses,

they had proven that (E(n).D(n)) is bounded in H1, which is the dual of VMO,so that a subsequence converges in H1 weak �, but they did not identify limits,so their result is not about compensated compactness.

9 I know that this kind of sentence which I like to use is not so good from agrammatical point of view, but it is my way of recalling that mathematical truthsare not subject to change with time, i.e. if a mathematical result has been provenin the past, then it is still right in the present, and it will still be right in thefuture: I had proven something in the past, and my result is true. If I had writtenthat it was true, some readers may wrongly interpret that it is like some “truths”which evolve with time, like the statements which physicists often make, whichdepend upon being believed by a majority, until they are shown to be wrong!

Page 173: From hyperbolic systems to kinetic theory: a personalized quest

154 18 Oscillating Solutions; the 2-D Broadwell Model

I looked for those nonlinearities which make the system weakly � stable, i.e. forall sequences of initial data converging in L∞(RN ) weak � the correspondingsolutions converge in L∞(

RN × (0, T )

)weak � to the solution corresponding

to the limit. I proved then that it is true if and only if either all the Fi areaffine, or if N = 1 and each Fi has the form

Fi(u1, . . . , um) =∑

j,k

ai,j,kujuk + affine(u1, . . . , um), (18.8)

where the coefficients ai,j,k are such that ai,j,k = ai,k,j for all i, j, k = 1, . . . ,m,and satisfy condition (S):

Cj = Ck implies ai,j,k = 0 for all i, (18.9)

so that, apart from the added affine parts (which are not so natural in kinetictheory, except for using Galilean invariance), the condition that I had foundis indeed the more general one.

Of course, the fact that a system is not stable by weak convergence doesnot mean that one cannot prove the existence and uniqueness of solutionsfor it, and a way to see the difference between compensated compactnessand compensated integrability is to consider a classical remark of EmilioGAGLIARDO,10 and of Louis NIRENBERG, in their independent proofs of theSobolev embedding theorem, which states in the case N = 3 that

∫R3 |u1(x2, x3)| |u2(x1, x3)| |u3(x1, x2)| dx1 dx2 dx3

≤ ||u1||L2(R2)||u2||L2(R2)||u3||L2(R2),(18.10)

and is a simple consequence of the Cauchy–Schwarz inequality, because thefunction v3 defined by v3(x1, x2) =

∫R|u1(x2, x3)| |u2(x1, x3)| dx3 satisfies

|v3(x1, x2)|2 ≤ (∫R|u1(x2, x3)|2 dx3

)(∫R|u2(x1, x3)|2 dx3

), which one then in-

tegrates in (x1, x2) to obtain ||v3||L2(R2) ≤ ||u1||L2(R2)||u2||L2(R2). However,there is no analogous result of compensated compactness, i.e. if ∂un

i

∂xi= 0 and

uni ⇀ u∞i in L∞ weak � for i = 1, 2, 3, then in general un1un2u

n3 does not

converge to u∞1 u∞2 u∞3 in L∞ weak �; actually, the compensated compactnesstheory shows that only affine functions F have the property that one candeduce that F (un1 , u

n2 , u

n3 ) converges to F (u∞1 , u

∞2 , u

∞3 ) in L∞ weak �.

The preceding type of estimate is useful for proving uniform L2 estimatesfor the (two-dimensional) four velocities model, for nonnegative initial datawith a small L2 norm. In 1985, Takaaki NISHIDA had mentioned having provenan existence theorem for small data in L2, after I had mentioned my compu-tations to him; I realize now that my computations did not prove existence,and were just a first step.10 Emilio GAGLIARDO, Italian mathematician, born in 1930. He worked at Univer-

sita di Pavia, Pavia, Italy.

Page 174: From hyperbolic systems to kinetic theory: a personalized quest

18 Oscillating Solutions; the 2-D Broadwell Model 155

For nonnegative initial data v1, v2, v3, v4 belonging to L2(R2) and havingsmall norms, one looks for bounds for the (nonnegative) solutions of the (two-dimensional) four velocities model of the form

0 ≤ u1(x, y, t) ≤ U1(x− t, y)0 ≤ u2(x, y, t) ≤ U2(x+ t, y)0 ≤ u3(x, y, t) ≤ U3(x, y − t)0 ≤ u4(x, y, t) ≤ U4(x, y + t),

(18.11)

with U1, U2, U3, U4 ∈ L2(R2). From (u1)t + (u1)x + u1u2 = u3u4, and u2 ≥ 0,one finds that 0 ≤ u1(x, y, t) ≤ v1(x − t, y) +

∫ t0f(x − s, y, t − s) ds, with

f(ξ, η, τ) = U3(ξ, η − τ)U4(ξ, η + τ), so that f(x− s, y, t− s) = U3(x− s, y −t + s)U4(x − s, y + t − s) and

∫ t0f(x − s, y, t − s) ds =

∫ t0U3(x − t + σ, y −

σ)U4(x − t+ σ, y + σ) dσ, and therefore

0 ≤ u1(x, y, t) ≤ U1(x− t, y), withU1(ξ, η) = v1(ξ, η) +

∫∞0U3(ξ + σ, η − σ)U4(ξ + σ, η + σ) dσ.

(18.12)

Similarly, one has

0 ≤ u2(x, y, t) ≤ U2(x+ t, y), withU2(ξ, η) = v2(ξ, η) +

∫∞0U3(ξ − σ, η − σ)U4(ξ − σ, η + σ) dσ

0 ≤ u3(x, y, t) ≤ U3(x, y − t), withU3(ξ, η) = v3(ξ, η) +

∫∞0 U1(ξ − σ, η + σ)U2(ξ + σ, η + σ) dσ

0 ≤ u4(x, y, t) ≤ U4(x, y + t), withU4(ξ, η) = v4(ξ, η) +

∫∞0 U1(ξ − σ, η − σ)U2(ξ + σ, η − σ) dσ.

(18.13)

One wants a fixed point of the mapping (U1, U2, U3, U4) �→ (U1, U2, U3, U4);it is a well-defined mapping from

(L2(R2)

)4 into itself, because it is likethe Gagliardo–Nirenberg remark (18.10), using different directions; writingUj(a, b) = Fj(a+ b, a− b) for j = 3, 4, one has

|U1(ξ, η) − v1(ξ, η)|2 = | ∫∞0F3(ξ + η, ξ − η + 2σ)F4(ξ + η + 2σ, ξ − η) dσ|2

≤ 14 (∫∞

0|F3(ξ + η, τ)|2 dτ)(∫∞

0|F4(τ, ξ − η)|2 dτ)

(18.14)by the Cauchy–Schwarz inequality, and using dξ dη = 1

2d(ξ + η) d(ξ − η) onethen finds that∫

R2 |U1(ξ, η) − v1(ξ, η)|2 dξ dη ≤ 18 (∫

R2 |F3|2 dξ dη)(∫

R2 |F4|2 dξ dη)= 1

2 (∫

R2 |U3|2 dξ dη)(∫

R2 |U4|2 dξ dη), (18.15)

and therefore

||U1||L2(R2) ≤ ||v1||L2(R2) +1√2||U3||L2(R2)||U4||L2(R2), (18.16)

and similarly

Page 175: From hyperbolic systems to kinetic theory: a personalized quest

156 18 Oscillating Solutions; the 2-D Broadwell Model

||U2||L2(R2) ≤ ||v2||L2(R2) + 1√2||U3||L2(R2)||U4||L2(R2)

||U3||L2(R2) ≤ ||v3||L2(R2) + 1√2||U1||L2(R2)||U2||L2(R2)

||U4||L2(R2) ≤ ||v4||L2(R2) + 1√2||U1||L2(R2)||U2||L2(R2).

(18.17)

If maxi ||vi||L2(R2) = ε < 12√

2, then choosing maxi ||Ui||L2(R2) ≤ 2ε implies

maxi ||Ui||L2(R2) ≤ 2ε, and the mapping is a strict contraction on this set, withconstant θ ≤ 2

√2 ε < 1; the mapping has a (unique) fixed point, and as long

as the solution exists it satisfies the bounds with the functions U1, U2, U3, U4

found.

[Taught on Monday October 15, 2001.]

Notes on names cited in footnotes for Chapter 18, PIATETSKI-SHAPIRO,11

SCHULENBERGER,12 WILCOX,13 WASHINGTON,14 W. RICE.15

11 Ilya PIATETSKI-SHAPIRO, Russian-born mathematician, born in 1929. He re-ceived the Wolf Prize in 1990, for his fundamental contributions in the fieldsof homogeneous complex domains, discrete groups, representation theory and au-tomorphic forms, jointly with Ennio DE GIORGI. He worked in Tel Aviv, Israel.

12 John R. SCHULENBERGER, American mathematician. He worked in Denver, CO,at University of Utah, Salt Lake City, UT and at Texas Tech University, Lubbock,TX.

13 Calvin Hayden WILCOX, American mathematician. He worked at University ofWisconsin, Madison, WI, and at University of Utah, Salt Lake City, UT.

14 George WASHINGTON, American general, 1732–1799. First President of theUnited States.

15 William Marsh RICE, American financier and philanthropist, 1816–1900. RiceUniversity, Houston, TX, is named after him.

Page 176: From hyperbolic systems to kinetic theory: a personalized quest

19

Oscillating Solutions: the Carleman Model

After showing that the Carleman model is not stable by weak convergence, Idid not try immediately to characterize the oscillations. My philosophy thatgood physical models should be stable with respect to some adapted conver-gence did not apply to that model, as it is not a model of kinetic theory,and one reason why I was led to study oscillations for this model was relatedto studying the asymptotic behaviour (i.e. as t tends to ∞) of the solution.The question of looking at the asymptotic behaviour often has no physicalinterest, as most models have lost their validity long before time has becomelarge enough,1 but discrete velocity models are not very good physics, and theCarleman model is not about physics at all, and I was interested by the mathe-matical result of decay in C(m)

t for nonnegative solutions with finite total massm, that Reinhard ILLNER and Michael REED had obtained. It is easy to un-derstand that the nonlinearities describe some kind of self-destructive process,and I wanted to understand more about what was going on. If one starts frominitial data with compact support in an interval of length L, the support attime t is included in an interval of length L+2t, and the solution being O

(1t

)it

is then natural to rescale the x variable and the u and v functions in oppositeways, and one is led to consider the sequences un and vn defined by1 One example is the kind of nonsense that one often hears from some people who

pretend to work on turbulence, as letting time tend to infinity has hardly anythingto do with turbulence. Large time could be of importance if one is working in aninfinite domain and one rescales the equations in an appropriate way, but thosewho advocate this question are usually working in a box, often with periodicconditions, and any resemblance to turbulence in these conditions could only bea lucky accident. Of course, turbulent flows show complicated behaviour, and ithas been known since POINCARE that ordinary differential equations may showstrange effects as time tends to ∞, and those who have coined the word chaos havecertainly decided to translate what POINCARE did into a more recent language,but making people believe that the two problems are related is pure politicalpropaganda.

Page 177: From hyperbolic systems to kinetic theory: a personalized quest

158 19 Oscillating Solutions: the Carleman Model

un(x, t) = nu(nx, n t), vn(x, t) = n v(nx, n t). (19.1)

They stay bounded in L∞(R2) by the Illner–Reed estimate, and as they satisfythe same Carleman model, I found it natural to start by investigating whathappens for general bounded sequences of solutions, which I generated byconsidering bounded sequences of (nonnegative) initial data.2

I started with a sequence

0 ≤ an, bn ≤M in R, (19.2)

and I considered the Carleman model

(un)t + (un)x + (un)2 − (vn)2 = 0 in R × (0,∞); un(·, 0) = an in R

(vn)t − (vn)x − (un)2 + (vn)2 = 0 in R × (0,∞); vn(·, 0) = bn in R,(19.3)

for which one has the uniform estimate

0 ≤ un, vn ≤M in R × (0,∞). (19.4)

I then wondered if the knowledge of oscillations in the sequence (an, bn), forexample if the Young measure for the sequence (an, bn),3 which describes theone-point statistics for the data by identifying all the weak � limits of f(an, bn)for all continuous functions f , is sufficient for deducing the Young measure forthe sequence of solutions (un, vn). Indeed, I found that the Young measurefor (un, vn), which is a tensor product, is actually determined by the soleknowledge of the Young measure for an and the Young measure for bn (whichis less information than the Young measure for (an, bn), of course), and thisproperty is actually valid for all systems of two equations of the form

(un)t + C1(un)x = F (un, vn) in R × (0, T ); un(·, 0) = an in R

(vn)t + C2(vn)x = G(un, vn) in R × (0, T ); vn(·, 0) = bn in R,(19.5)

if C1 �= C2, if F,G are locally Lipschitz continuous and if the solutions staybounded for the time interval considered (the analysis being a little more

2 It is not necessary to consider nonnegative data, but without this condition onemust assume that the solutions stay bounded on some interval [0, T ] independentof n.

3 Laurence Chisholm YOUNG, English-born mathematician, 1905–2000. He hadworked in Cape Town, South Africa, and at University of Wisconsin, Madison,WI. I had met Laurence YOUNG in Madison in 1971 during my first visit to UnitedStates, and as my English was not so good I had conversed with him in French,which he spoke without accent, and he might have learnt it when his father (W.H.YOUNG) was teaching in Lausanne, Switzerland. I only learnt much later abouthis work in the calculus of variations, and when I pioneered the introduction ofYoung measures in the partial differential equations of continuum mechanics inthe late 1970s, I used the term parametrized measures instead, that I had heardin seminars on “control theory” in Paris, France.

Page 178: From hyperbolic systems to kinetic theory: a personalized quest

19 Oscillating Solutions: the Carleman Model 159

technical in this general case). One should notice that even for the case of twoequations, the same results would not hold if there was more than one spacevariable (except for affine functions F,G, of course).

I shall also show later that the same result does not always hold for threeequations, by investigating the case of the Broadwell model.

I assume that

(an)k ⇀ Ak in L∞(R) weak �, k = 1, . . .(bn)k ⇀ Bk in L∞(R) weak �, k = 1, . . . , (19.6)

and this is equivalent to using the Young measure for the sequence an and theYoung measure for the sequence bn.4 I extract a subsequence (um, vm) suchthat

(um)k ⇀ Uk in L∞(R × (0,∞)

)weak �, k = 1, . . .

(vm)k ⇀ Vk in L∞(R × (0,∞)

)weak �, k = 1, . . . , (19.7)

and I shall identify the list of all Ui and all Vj in terms of the list of all Ak andall B, which shows that it is not necessary to extract a subsequence. Thereis something special here, that the Young measure of (um, vm) is a tensorproduct, so that (19.7) implies

(um)j(vm)k ⇀ UjVk in L∞(R × (0,∞)

)weak �, j, k = 1, . . . , (19.8)

which is equivalent to the Young measure being a tensor product, and this isa consequence of the div-curl lemma, if one notices that

((um)j

)t+((um)j

)x

= j(um)j−1((vm)2 − (um)2

)

is bounded in L∞(R × (0,∞)

)((vm)k

)t− ((vm)k

)x

= k(vm)k−1((um)2 − (vm)2

)

is bounded in L∞(R × (0,∞)

),

(19.9)

so that (um)j(vm)k ⇀ UjVk in L∞(R×(0,∞)

)weak �, for all integers j, k ≥ 0

by Lemma 18.2 (and one takes U0 = V0 = 1, of course); one deduces easilythat the weak � limit of f(um)g(vm) is the product of the weak � limit off(um) by the weak � limit of g(vm), for all continuous functions f, g. I thendeduced the equations satisfied by the list of all Uj and all Vk by passing tothe limit in the equations for (um)j and the equations for (vm)k, and I found

4 Because the sequence an is bounded, each continuous function f can be approx-imated uniformly by polynomials on the closed bounded interval where the an

take their values, by the Weierstrass theorem, and for a polynomial P , the limitof P (an) is a finite combination of the Ak, and this permits one to identify thelimit of f(an). Although the list of all Ak is equivalent to the knowledge ofthe Young measure, it is not easy to extract the information, but after GeorgePAPANICOLAOU suggested using a particular class of oscillating initial data, whichI shall describe later, it appeared that there is a simple way to present the com-putations, where the Young measure becomes explicit.

Page 179: From hyperbolic systems to kinetic theory: a personalized quest

160 19 Oscillating Solutions: the Carleman Model

(Uj)t + (Uj)x = j Uj−1V2 − j Uj+1 in R × (0,∞); Uj(·, 0) = Aj , j = 1, . . .(Vk)t − (Vk)x = k Vk−1U2 − k Vk+1 in R × (0,∞); Vk(·, 0) = Bk, k = 1, . . . .

(19.10)To prove the uniqueness of the solution (so that the extraction of a subse-quence is not necessary), one must use the bounds

0 ≤ Uk, Vk ≤Mk in R × (0,∞), k = 1, . . . , (19.11)

which follow from the uniform bound on um and vm. For two solutions sat-isfying this infinite system, I denoted by δ Uk and δ Vk the differences of thecorresponding solutions, then by subtracting the corresponding inequalities Iobtained

|(δ Uk)t + (δ Uk)x| ≤ k |δ Uk+1| + kM2|δ Uk−1| + kMk−1|δ V2| in R × (0,∞);δ Uk(·, 0) = 0 in R, k = 1, . . .

|(δ Vk)t − (δ Vk)x| ≤ k |δ Vk+1| + kM2|δ Vk−1| + kMk−1|δ U2| in R × (0,∞);δ Vk(·, 0) = 0 in R, k = 1, . . . ,

(19.12)and I improved the initial bounds |δ Uk(x, t)|, |δ Vk(x, t)| ≤ 2Mk in R× (0,∞)by integrating the preceding inequalities in t, and I obtained

|δ Uk(x, t)|, |δ Vk(x, t)| ≤ 2.3.kMk+1t in R × (0,∞), k = 1, . . . , (19.13)

and then I used these new bounds instead of 2Mk, and I repeated this proce-dure, and that gave

|δ Uk(x, t)|, |δ Vk(x, t)| ≤ 2.32k(k + 1)Mk+2 t2

2! in R × (0,∞), k ≥ 1|δ Uk(x, t)|, |δ Vk(x, t)| ≤ . . .

|δ Uk(x, t)|, |δ Vk(x, t)| ≤ 2.3r+1k(k + 1) . . . (k + r)Mk+r tr+1

r!in R × (0,∞), k ≥ 1, r ≥ 2,

(19.14)

and letting r tend to ∞, I deduced that δ Uk = δ Vk = 0 in R × (0, T ) if3M T < 1, proving then uniqueness on

(0, 1

3K

); a reiteration of the argument

gives then uniqueness for all t ≥ 0.

I deduced an important effect from the sole knowledge of the equations forU1 and U2, by introducing the quantity

σu(x, t) =√U2(x, t) −

(U1(x, t)

)2 in R × (0,∞), (19.15)

which measures the strength of the oscillations in the sequence un. Of course,one has U2 ≥ (U1)2 a.e. in R × (0,∞), and U2 − (U1)2 is a quantity similarto what the internal energy is for a gas, measuring the amount of kineticenergy that cannot be described in terms of the macroscopic velocity; thecomputations shown here are then like deriving information for the internalenergy by using a part of the equation describing the complete phenomena,and the analogy with questions of kinetic theory may become more apparentonce the equation for Young measures is described in more detail.

Page 180: From hyperbolic systems to kinetic theory: a personalized quest

19 Oscillating Solutions: the Carleman Model 161

From (U2)t + (U2)x + 2U3 − 2U1V2 = 0, I subtracted the equation (U1)t +(U1)x + U2 − V2 = 0 multiplied by 2U1, and I obtained(U2 − (U1)2

)t+(U2 − (U1)2

)x

+ 2(U3 − U1U2) = 0 in R × (0,∞);(U2 − (U1)2

) |t=0= A2 − (A1)2 in R.(19.16)

Because un is bounded there are inequalities that U3 must satisfy once U1 andU2 are known, and because un ≥ 0 one of these inequalities has a very simpleform:5 developing un(un − U1)2,6 one finds (un)3 − 2U1(un)2 + (U1)2un ≥ 0,giving at the limit U3 ≥ 2U1U2−(U1)3, or U3−U1U2 ≥ U1U2−(U1)3 = U1

(U2−

(U1)2). Formally writing

((σu)2

)t= 2σu (σu)t for example and simplifying by

σu,7 one obtains

(σu)t + (σu)x + U1σu ≤ 0 in R × (0,∞), (19.17)

and a similar analysis for the oscillations in the sequence vn gives

(σv)t − (σv)x + V1σv ≤ 0 in R × (0,∞). (19.18)

I learnt from these inequalities that, independently of the detail of the oscil-lations in the sequence vn, the strength of the oscillations in the sequence untends to decrease along the natural characteristic lines, and the local averageof un can be seen as a factor for making the strength decrease, in accordancewith considering the process described by the equation as a self-destructionmechanism, but one should observe that the equation is not an exact one, andU1 may be replaced by the larger quantity U2

U1in the decay term.

I also learnt an important property, that the oscillations can only be cre-ated at initial time, and this can be deduced from a weaker form of the in-equality (σu)t + (σu)x ≤ 0, because if one has A2 = (a1)2 on a measurablesubset ω of the real line, so that σu starts equal to 0 on ω, then σu is 0 almosteverywhere on the points (x, t) with x − t ∈ ω; indeed, σ is nonincreasingalong the characteristic lines and as it starts 0 and cannot become negative,it must stay 0 there. This property will be used for studying the asymptoticbehaviour of solutions.

The impossibility of creating oscillations is shared by all systems with onlytwo equations, but it is not always true for some systems of three equations,as I shall show for the Broadwell model.

[Taught on Wednesday October 17, 2001.]5 The analysis of oscillations can be carried out for initial data changing sign, but

one must restrict attention to an interval in time where a bound exists.6 A better inequality can be obtained by developing un(un − w)2, giving U3 −

2wU2 +w2U1 ≥ 0 for all w, and therefore (U2)2 ≤ U1U3; this implies U3−U1U2 ≥(U2)2

U1− U1U2 = U2

U1

(U2 − (U1)2

), and one has U2

U1≥ U1, of course.

7 A natural procedure for proving such a statement, which I first learnt as a studentfrom a work of Olga OLEINIK, consists in writing the equation for

√ε2 + σ2

u forε > 0, and then letting ε tend to 0.

Page 181: From hyperbolic systems to kinetic theory: a personalized quest

162 19 Oscillating Solutions: the Carleman Model

Notes on names cited in footnotes for Chapter 19, W.H. YOUNG,8 PAPANICO-

LAOU,9 and for the preceding footnotes, CHISHOLM-YOUNG,10 HARDINGE.11

8 William Henry YOUNG, English mathematician, 1863–1942. There are many re-sults attributed to him which may be joint work with his wife, Grace, as theycollaborated extensively. He had worked in Liverpool, England, in Calcutta, In-dia, holding the first Hardinge professorship (1913-1917), in Aberystwyth, Wales,and in Lausanne, Switzerland.

9 George C. PAPANICOLAOU, Greek-born mathematician, born in 1943. He hasworked at NYU (New York University), New York, NY, and at Stanford Univer-sity, Stanford, CA.

10 Grace Emily CHISHOLM-YOUNG, English mathematician, 1868–1944.11 Sir Charles HARDINGE, first Baron HARDINGE of Penshurst, English diplomat,

1858–1944. He was Viceroy and Governor-General of India (1910–1916).

Page 182: From hyperbolic systems to kinetic theory: a personalized quest

20

The Carleman Model: Asymptotic Behaviour

I apply now what I have found about oscillating solutions for the Carlemanmodel to the study of the asymptotic behaviour, as t tends to ∞, of thesolution of the system for fixed nonnegative initial data with finite total mass,i.e.

ut + ux + u2 − v2 = 0 in R × (0,∞); u(·, 0) = a in R

vt − vx − u2 + v2 = 0 in R × (0,∞); v(·, 0) = b in R,(20.1)

with

a, b ∈ L∞(R) ∩ L1(R), a, b ≥ 0 a.e. in R,

R

(a+ b) dx = m <∞. (20.2)

Of course, my analysis uses the uniform Illner–Reed bound

0 ≤ u(x, t), v(x, t) ≤ C(m)t

a.e. in R × (0,∞), (20.3)

but a good estimate of C(m) is not necessary. In order to analyse what isgoing on for large t, I consider the sequence (un, vn) defined by

un(x, t) = nu(nx, n t), vn(x, t) = n v(nx, n t) in R × (0,∞), (20.4)

and because one has 0 ≤ un, vn ≤ C(m)t , the solutions are uniformly bounded

for t ≥ ε > 0. The first result relies on the finite speed of propagation.

Lemma 20.1. For every ε, η > 0, the sequences un and vn converge to 0in L∞ weak � and Lploc strong for 1 ≤ p < ∞ on the subsets {(x, t) | x ≤−t− η, t ≥ ε} and {(x, t) | x ≥ t+ η, t ≥ ε}.Proof : Because (un + vn)t + (un − vn)x = 0, which expresses conservation ofmass, one sees by integrating on the subset {(x, s) | x ≤ x0 − s, 0 ≤ s ≤ t}that

Page 183: From hyperbolic systems to kinetic theory: a personalized quest

164 20 The Carleman Model: Asymptotic Behaviour

∫ x0−t

−∞(un+vn)(·, t) dx+2

∫ t

0

un(x0−s, s) ds =∫ x0

−∞(un+vn)(·, 0) dx, (20.5)

and therefore, using un ≥ 0, one has∫ x0−t

−∞(un + vn)(·, t) ≤

∫ x0

−∞(an + bn) dx, (20.6)

where an(x) = n a(nx) and bn(x) = n b(nx) on R; similarly, one has

∫ +∞

x1+t

(un + vn)(·, t) ≤∫ +∞

x1

(an + bn) dx, (20.7)

and this is valid for x0 = −η and x1 = +η, and because all the mass ofan and bn concentrates at 0 and eventually enters the interval (−η,+η), onededuces that un + vn converges to 0 in L1

loc(Ω) strong, where Ω is either{(x, t) | x ≤ −t− η} or {(x, t) | x ≥ t+ η}. Adding the constraint t ≥ ε > 0permits one to use the uniform L∞ estimate and the bound in L∞ togetherwith the convergence in L1

loc strong implies the convergence in Lploc strong forevery p ∈ [1,∞) (by using Holder inequality),1 and in L∞ weak �. ��

Lemma 20.2. Some subsequence (un′ , vn′) converges, and any limit (u∗, v∗)of a subsequence is automatically a solution of the Carleman model for t > 0,having support in {(x, t) | −t ≤ x ≤ t}, and having total mass m.

Proof : Because of the uniform L∞ bound for t ≥ ε > 0, and using the diagonalargument of CANTOR,2 one may extract a subsequence such that every powerof un′ or vn′ converges in L∞ weak � for any set {(x, t) | t ≥ ε} (and one onlyneeds that un′ , vn′ , (un′)2, (vn′)2 converge in L∞ weak �). Then one observesthat σu = 0 for x < −t (and for x > t) by applying Lemma 20.1, and theinequality (σu)t + (σu)x ≤ 0 implies that σu = 0 for x < t, and thereforeσu = 0 almost everywhere; this implies strong convergence of un′ in L2

loc, andtherefore strong convergence in Lploc for 1 ≤ p < ∞ because of the uniformbound in L∞. A similar argument applies to σv, which is 0 in x > t by applyingLemma 20.1, and satisfies (σv)t−(σv)x ≤ 0 so that σv = 0 for x > −t. Becauseof strong convergence, one may take the limit of the equation for t ≥ ε forevery ε > 0, and because

∫R(un + vn) dx = m for every t > 0 one obtains∫

R(u∗ + v∗) dx = m for all t > 0. ��If one knew that all the sequence converges then the limit would automat-

ically be self-similar, and (u∗, v∗) would be the self similar solution of totalmass m; however, one has extracted a subsequence, and k u∗(k x), is the limit

1 Otto Ludwig HOLDER, German mathematician, 1859–1937. He had worked inLeipzig, Germany.

2 Georg Ferdinand Ludwig Philipp CANTOR, Russian-born mathematician, 1845–1918. He had worked in Halle, Germany.

Page 184: From hyperbolic systems to kinetic theory: a personalized quest

20 The Carleman Model: Asymptotic Behaviour 165

of k un′(k x) = (k n′)u(k n′ x), and one cannot conclude because k n′ may notbe a part of the subsequence, in which case the limit would be u∗(x). In 1980,I had derived a complicated proof that any solution with support in |x| ≤ tmust be self-similar, but I had not written it down and I have forgotten someof the details (I remember that I had used in an essential way the L1 contrac-tion property for the Carleman model, which had been noticed by ThomasLIGGETT). As for every unwritten proof, it might be that it was not complete,and one may prefer to consider this result as a conjecture.

Before trying to apply the same ideas to the Broadwell model, where im-portant differences will appear, it is useful to mention another reason why thistype of study may be useful, and it is related to what is usually described asletting the mean free path tend to 0, but after discussing the principles usedfor the derivation of the Boltzmann equation, it will be apparent that it isonly reasonable for rarefied gases, and that it does not make any sense to useit for dense gases, and to pretend that it explains the behaviour of fluids.

For ε > 0 (believed to represent a mean free path between collisions), oneconsiders the system

uεt + uεx + 1ε

(uε vε − (wε)2

)= 0 in R × (0,∞), uε(·, 0) = a in R

vεt − vεx + 1ε

(uε vε − (wε)2

)= 0 in R × (0,∞), vε(·, 0) = b in R

wεt − 1ε

(uε vε − (wε)2

)= 0 in R × (0,∞), wε(·, 0) = c in R.

(20.8)

For initial data which are nonnegative and bounded, the solution exists for allt > 0 by applying Proposition 15.2, because one may apply the estimate for thecase ε = 1, to uε

ε ,vε

ε ,wε

ε ; the bound obtained in L∞ is unfortunately much toolarge as ε tends to 0. However, the conservation of mass shows that for t > 0,the functions uε(·, t), vε(·, t), wε(·, t) are uniformly bounded in L1(R). In thecase of initial conditions with compact support, which is not a big restrictionbecause of the finite speed of propagation, the entropy inequality gives abound independent of ε for the integral of uε log(uε), vε log(vε), wε log(wε),which implies that uε(·, t), vε(·, t), wε(·, t) stay in a weakly compact set ofL1(R), and one may extract a subsequence such that uε, vε, wε converge inL1loc weak to u∗, v∗, w∗. One then defines the density of mass � and the density

of momentum q by� = u∗ + v∗ + 2w∗q = u∗ − v∗,

(20.9)

and one finds the equation�t + qx = 0 (20.10)

for conservation of mass, and the equation

qt + (u∗ + v∗)x = 0 (20.11)

for conservation of momentum.A natural problem is then to express u∗ + v∗ in terms of � and q, and

the computations are now purely formal, and not much is known about the

Page 185: From hyperbolic systems to kinetic theory: a personalized quest

166 20 The Carleman Model: Asymptotic Behaviour

validity of the procedure. For the Boltzmann equation, this procedure gives theEuler equation for ideal fluids (i.e. with no viscosity),3 but it is purely formaland has not been proven to be valid (despite the name of HILBERT beingattached to that formal expansion!). Another formal derivation, attributedto CHAPMAN and ENSKOG,4,5 the Chapman–Enskog procedure, makes theNavier–Stokes equation appear (with a small viscosity). In the context of theBroadwell model, the formal idea is that uε vε − (wε)2 must be small andtherefore one postulates that u∗v∗ − (w∗)2 = 0; under this postulate one has�2 = u2 + v2 +6w2 +4uw+4vw = q2 +4uw+4vw+8w2 = q2 +4�w, givingw as a function of �, and showing that

u∗v∗ − (w∗)2 = 0 implies u∗ + v∗ =�2 − q2

2�. (20.12)

The system in (�, q) becomes then a quasi-linear hyperbolic system of conser-vation laws.

However, using the inequality(log(a) − log(b)

)(a2 − b2) ≥ 2(a − b)2 for

all a, b > 0,6 one finds that√uεvε − wε tends to 0 in L2

loc strong, and it isindeed true, as I shall show later, that both

√uεvε and wε are bounded in L2,

but both could be oscillating and if this was the case, the formal derivationwould be wrong.7 My analysis does not address directly this question, butconsiders the case ε = 1 and studies how oscillations will propagate if oneputs oscillations in the initial data.

Before studying oscillations for the Broadwell model, it is useful to observethat letting the mean free path go to 0 for the Carleman model is a much easierquestion, without much interest.

Lemma 20.3. For a, b ∈ L∞(R) with a, b ≥ 0 in R, the solutions (uε, vε) of

3 In his lectures about physics [14], FEYNMAN wrote that the Euler equation de-scribes “dry water” and that the Navier–Stokes equation describes “wet water”.

4 Sydney CHAPMAN, English mathematician, 1888–1970. He had worked in Cam-bridge, in Manchester, in London and in Oxford, England, where he held theSedleian chair of natural philosophy.

5 David ENSKOG, Swedish mathematician, 1884–1947. He had worked in Stock-holm, Sweden.

6 The inequality is invariant if one replaces a, b by t a, t b for t > 0, and it is enoughto take a = 1 + x and b = 1, and the inequality is then log(1 + x) ≥ 2x

x+2for

x ≥ 0; one has equality for x = 0 and the right inequality between the derivatives,1

1+x≥ 4

(x+2)2for x ≥ 0.

7 Although I pointed out this possibility many years ago, most people do not seemto believe in the possibility of oscillations, and some people prove theorems sayingthat if some function of the solution converges strongly then another function ofthe solution converges strongly, and although these results could be valid, theydo not rule out the possibility that there could be oscillations and that none ofthese particular functions of the solution would converge strongly.

Page 186: From hyperbolic systems to kinetic theory: a personalized quest

20 The Carleman Model: Asymptotic Behaviour 167

(uε)t + (uε)x + 1ε

((uε)2 − (vε)2

)= 0 in R × (0,∞); uε(·, 0) = a in R

(vε)t − (vε)x − 1ε

((uε)2 − (vε)2

)= 0 in R × (0,∞); vε(·, 0) = b in R

(20.13)converge to

u∗ = v∗ =a+ b

2in R × (0,∞). (20.14)

Proof : If 0 ≤ a, b ≤ M , then one has 0 ≤ uε, vε ≤ M , and one canextract a subsequence such that uε ⇀ u∗, vε ⇀ v∗ in L∞(

R × (0,∞))

weak �. Integrating(uε log(uε) + vε log(vε)

)t+(uε log(uε) − vε log(vε)

)x

+1ε

((uε)2 − (vε)2

)(log(uε) − log(vε)

)= 0, and using the inequality 1

ε

((uε)2 −

(vε)2)(

log(uε) − log(vε)) ≥ 2

ε (uε − vε)2 shows that uε − vε tends to 0 in L2

loc

strong, and therefore u∗ = v∗, and because (u∗ + v∗)t + (u∗ − v∗)x = 0 and(u∗ + v∗) |t=0= a + b by taking the limit of (uε + vε)t + (uε − vε)x = 0, onefinds that (u∗)t = 0 and 2u∗ |t=0= a+ b. ��

A different scaling for the Carleman model creates a more technical prob-lem,

(uε)t + 1ε (u

ε)x + 1ε2

((uε)2 − (vε)2

)= 0 in R × (0,∞); uε(·, 0) = a in R

(vε)t − 1ε (v

ε)x − 1ε2

((uε)2 − (vε)2

)= 0 in R × (0,∞); vε(·, 0) = b in R,

(20.15)which was studied by Tom KURTZ.8 Like for the linear case, it creates a dif-fusion equation at the limit, but of a nonlinear degenerate type. One extractsa subsequence such that uε ⇀ u∗, vε ⇀ v∗ in L∞(

R × (0,∞))

weak �, andalso 1

ε (uε − vε) ⇀ q in L2

loc weak, and that last inequality assumes also thata, b ∈ L1(R) (because one cannot use the finite speed of propagation anymoreas it tends to ∞); of course, a consequence is that u∗ = v∗, and taking thelimit of (uε + vε)t + 1

ε (uε − vε)x = 0 one obtains

2(u∗)t + qx = 0 in R × (0,∞); u∗ |t=0=a+ b

2in R. (20.16)

In order to find a relation between q and u and ux, one subtracts the twoequations and one multiplies by ε, so that ε(uε−vε)t+(uε+vε)x+ 2

ε

((uε)2 −

(vε)2)

= 0, and formally one postulates that 2ε

((uε)2−(vε)2

)= 2

ε (uε−vε)(uε+

vε) converges to 4q u∗, so that the guess is

(u∗)x + 2q u∗ = 0 in R × (0,∞), (20.17)

showing that u∗ satisfies

(u∗)t −( (u∗)x

4u∗

)

x= 0 in R × (0,∞). (20.18)

8 Thomas Gordon KURTZ, American mathematician. He works at University ofWisconsin, Madison, WI.

Page 187: From hyperbolic systems to kinetic theory: a personalized quest

168 20 The Carleman Model: Asymptotic Behaviour

In order to prove that this is the right equation, Tom KURTZ used techniquesof contraction semi-groups in L1, and constructed enough solutions of thelimiting equation. I have a different method, which requires my improvementof the Illner–Reed bound, namely C(m) = O(1 + m2), and I shall prove itlater; the reason is that if one writes uε(x, t) = ε2Uε(ε x, t) and vε(x, t) =ε2V ε(ε x, t), then Uε and V ε satisfy the usual Carleman model but for asequence of initial data of total norm m

ε , and therefore the bound that oneobtains is 0 ≤ uε, vε ≤ ε2C

(mε

)1t , which is ≤ K

t if one has shown that C(m) =O(1 +m2); then an application of the div-curl lemma shows that (uε + vε)2

converges weakly to (u∗ + v∗)2 for t ≥ η > 0, and therefore uε + vε convergesstrongly to 2u∗ and the preceding formal computation is proven.

If one considers a sequence of solutions of the Broadwell model with asequence of nonnegative bounded data,

ut + ux + u v − w2 = 0 in R × (0,∞); un(·, 0) = an in R

vt − vx + u v − w2 = 0 in R × (0,∞); vn(·, 0) = bn in R

wt − u v + w2 = 0 in R × (0,∞); wn(·, 0) = cn in R,(20.19)

with 0 ≤ an, bn, cn ≤ M , then one obtains a sequence of solutions satisfying0 ≤ un, vn, wn ≤ F (M, t), by Proposition 15.2. One extracts a subsequence(for which one keeps the index n for simplification) such that the sequence ofinitial data corresponds to a Young measure, and for example

(an)i(bn)j(cn)k ⇀ Di,j,k in L∞(R) weak �, i, j, k = 0, . . . , (20.20)

with the notation Ai = Di00, Bj = D0j0, Ck = D00k, for i, j, k = 0, . . ., andone wonders if the sequence of solutions corresponds to a Young measure, i.e.if one can identify all the following weak � limits:

(un)i(vn)j(wn)k ⇀ Xi,j,k in L∞(R) weak �, i, j, k = 0, . . . , (20.21)

with the notation

Ui = Xi,0,0, Vj = X0,j,0,Wk = X0,0,k, for i, j, k = 0, . . . . (20.22)

The equations for (un)i, (vn)j , (wn)k and the div-curl lemma show that

Xi,j,0 = UiVj , X0,j,k = VjWk, Xi,0,k = UiWk in R × (0,∞), i, j, k = 0, . . . ,(20.23)

but one needs at least to identify X1,1,1, and I shall show that it is not alwaysequal to U1V1W1.

[Taught on Wednesday October 24, 2001 (Friday October 19 and MondayOctober 22 were mid-semester break).]

Notes on names cited in footnotes for Chapter 20, SEDLEY.9

9 Sir William SEDLEY, English philanthropist, 1558–1618. He endowed a chair ofnatural philosophy at Oxford, England.

Page 188: From hyperbolic systems to kinetic theory: a personalized quest

21

Oscillating Solutions: the Broadwell Model

For the sequence of solutions of the Broadwell model, one can write equationsfor powers

((un)i

)t+((un)i

)x

+ i(un)ivn − i(un)i−1(wn)2 = 0in R × (0,∞), i = 1, . . .(

(vn)j)t− ((vn)j

)x

+ j un(vn)j − j(vn)j−1(wn)2 = 0in R × (0,∞), j = 1, . . .(

(wn)k)t− k unvn(wn)k−1 + k(wn)k+1 = 0

in R × (0,∞), k = 1, . . .

, (21.1)

and one observes an important difference between the equations for powersof un or vn on one side, and the equations for powers of wn on the otherside. In the equations for powers of un or vn, there only appear products(un)ivn, (un)i−1(wn)2, un(vn)j , (vn)j−1(wn)2 whose limits can be expressedin terms of the list of all Ui, Vj , Wk, and one deduces

(Ui)t + (Ui)x + i UiV1 − i Ui−1W2 = 0 in R × (0,∞);Ui(·, 0) = Ai in R, i = 1, . . .

(Vj)t − (Vj)x + j U1Vj − j Vj−1W2 = 0 in R × (0,∞);Vj(·, 0) = Bj in R, j = 1, . . . ,

(21.2)

while in the equation for powers of wn, there is a term unvn(wn)k−1 whoselimit in the case k ≥ 2 cannot be determined in the same way.1 Taking thelimit of the equations for un and for (un)2 gives1 In my 1978 lectures at Heriot–Watt University, I had already used that idea for

finding more necessary conditions for sequential weak continuity under differen-tial constraints. The basic example, which I had thought of in connection with theBroadwell model, was that in R

2 if one has bounds on (fn)x, (gn)y , and (hn)x +(hn)y , then one cannot always pass to the limit in the product fngnhn, althoughthe product f g h satisfies the first necessary condition for sequential weak continu-ity; for example fn(x, y) = sin(n y), gn(x, y) = cos(nx), hn(x, y) = sin(nx− n y)define sequences converging to 0 in L∞(R2) weak �, satisfying (fn)x = (gn)y =

Page 189: From hyperbolic systems to kinetic theory: a personalized quest

170 21 Oscillating Solutions: the Broadwell Model

(U1)t + (U1)x + U1V1 −W2 = 0 in R × (0,∞); U1(·, 0) = A1 in R

(U2)t + (U2)x + 2U2V1 − 2U1W2 = 0 in R × (0,∞); U2(·, 0) = A2 in R.(21.3)

Multiplying the first equation by 2U1 and subtracting from the second, onededuces that σu =

√U2 − (U1)2 satisfies the equation

((σu)2

)t+((σu)2

)x

+2V1(σu)2 = 0, or

(σu)t + (σu)x + V1σu = 0 in R × (0,∞), (21.4)

Similarly, taking the limit of the equations for vn and for (vn)2 gives

(V1)t − (V1)x + U1V1 −W2 = 0 in R × (0,∞); V1(·, 0) = B1 in R

(V2)t − (V2)x + 2U1V2 − 2V1W2 = 0 in R × (0,∞); V2(·, 0) = B2 in R.(21.5)

Multiplying the first equation by 2V1 and subtracting from the second, onededuces that σv =

√V2 − (V1)2 satisfies the equation

(σv)t + (σv)x + U1σv = 0 in R × (0,∞). (21.6)

The equations for σu and σv show that the oscillations in the sequences unor vn cannot be created, and that the strength of these oscillations decreasesin terms of the sole local average of vn for σu, and the sole local average ofun for σv; this is in accordance with the fact that particles from the firstor second families disappear by collisions with particles from the oppositefamily; contrary to what happens with the Carleman model, the equations forσu and for σv for the Broadwell model are exact. The situation is different forstudying the oscillations of wn, and taking the limit of the equation for wngives

(W1)t − U1V1 +W2 = 0 in R × (0,∞); W1(·, 0) = C1 in R, (21.7)

while taking the limit of the equation for (wn)2 gives

(W2)t − 2X111 + 2W3 = 0 in R × (0,∞); W2(·, 0) = C2 in R, (21.8)

and one should find more about X111. When I was doing this analysis in theearly 1980s, I already knew that one cannot expect X111 = U1V1W1, but Idid not understand how to describe the evolution of oscillations, until GeorgePAPANICOLAOU proposed to restrict the class of initial data to periodicallymodulated functions, a question which I shall describe next. In the generalcase, I estimated the difference X111 − U1V1W1 in order to find informationon σw =

√W2 − (W1)2.

(hn)x + (hn)y = 0, but fngnhn = sin2(n y) cos2(nx) − 14

sin(2n x) sin(2n y) con-verges to 1

4in L∞(R2) weak �.

Page 190: From hyperbolic systems to kinetic theory: a personalized quest

21 Oscillating Solutions: the Broadwell Model 171

Lemma 21.1. One has the inequality2

|X111 − U1V1W1| ≤ σuσvσw. (21.9)

Proof : One first notices that (un−U1)(vn−V1)(wn−W1) converges to X111−U1V1W1 in L∞(R2) weak �, because by developing one finds one term unvnwnwhich converges toX111, three terms of the form −unvnW1, each of which con-verges to −U1V1W1, three terms of the form unV1W1, each of which convergesto U1V1W1, and one term −U1V1W1. Then one observes that for every α > 0one has ±(un−U1)(vn−V1)(wn−W1) ≤ α

2 (un−U1)2+ 12α (vn−V1)2(wn−W1)2,

which at the limit gives ±(X111−U1V1W1) ≤ α2 (σu)2 + 1

2α (σv)2(σw)2; outsidea subset of measure 0 the inequality is true for all positive rationals α andtherefore for all real positive α, and then for a point x where all these inequal-ities are true, one minimizes in α > 0 and the minimum is σuσvσw. ��

I then deduced a differential inequality for σw. Multiplying the equa-tion for W1 by 2W1 and subtracting from the equation for W2, one obtains((σw)2

)t+2(X111−U1V1W1)+2(W3−W1W2) = 0; as seen before, the fact that

wn ≥ 0 implies (W2)2 ≤ W1W3 and therefore 2(W3 −W1W2) ≥ 2W2W1

(σw)2 ≥2W1(σw)2, and with Lemma 21.1 one deduces that

((σw)2

)t+ 2W1(σw)2 ≤

2σuσvσw, or(σw)t +W1σw ≤ σuσv in R × (0,∞). (21.10)

This inequality shows that a factor for decreasing the strength of oscillationsin wn is the local average of wn, in accordance with the fact that particles ofthe third family disappear by interaction between themselves (as it is really aninteraction between the third and fourth family for the four velocities model),but there is a new effect, related to the right side σuσv: oscillations in wncould be amplified, and even created if they are not present, but one needsboth oscillations in un and oscillations in vn for that, because both σu andσv must be positive to make an increase in σw possible. However, becausethere is an inequality, one cannot be sure that σu > 0 and σv > 0 is enoughto create oscillations, and as I shall show next, it is not always the case, andcreation takes place or not according to a resonance effect.

My analysis failed to describe the evolution of the Young measure fora subsequence (un, vn, wn); it is not always a tensor product, as this wouldimply X111 = U1V1W1, but its three projections in (u, v), (v, w), (u,w) aretensor products. As no equation is known for unvnwn, this approach does notsay if the Young measure can be characterized in terms of the Young measurefor (an, bn, cn), and a further computation done with George PAPANICOLAOU

in the early 1980s shows that it is not always so, and it shows that somenonlocal correlations play a role, but our analysis was only done for the caseof periodically oscillating initial data, and I could not understand the general2 In the mid 1990s, Alexander MIELKE, who might not have seen my computations,

told me that he could prove the inequality with a better constant in front.

Page 191: From hyperbolic systems to kinetic theory: a personalized quest

172 21 Oscillating Solutions: the Broadwell Model

case; however, Guy METIVIER has told me that he has solved that question.3

It is useful to understand why I use Young measures, and not be mistakenabout what they say and what they cannot say. In the early 1970s, when Iwas working on homogenization with Francois MURAT, before I had heardthe word itself (that Ivo BABUSKA had borrowed from nuclear engineers),but after realizing that we had rediscovered and generalized the idea of G-convergence that Sergio SPAGNOLO had developed with Ennio DE GIORGI,we were led to try to find optimal bounds for what physicists call effectivecoefficients, a term which I learnt much later from George PAPANICOLAOU.I did not know the term Young measures then, and in my 1978 Heriot–Wattlectures I used the term parametrized measures which I had heard about in“control theory”, in the seminar of Robert PALLU DE LA BARRIERE,4 but themain difficulty was that except in dimension 1, the effective properties of amixture are not described by proportions alone. I had been quite puzzled thento find that some theoretical physicists, LANDAU and LIFSHITZ,5,6 pretendedto compute a formula for the conductivity of a mixture in terms of the pro-portions alone, but had I known a little more about the way physicists think,I would have deduced that they were only talking about an approximation. Itwas clear then for mathematicians in the early 1970s, at least those who paidattention to what I and others had proven in homogenization, that Youngmeasures are not the right tool for describing microstructures, when there isno underlying one-dimensional pattern, although they may be useful as a toolfor obtaining a partial understanding; in the late 1970s, I had used this toolfor expressing the content of the compensated compactness theory, and it wasprobably the first application of this idea to partial differential equations, out-side the restricted geometrical context which Laurence YOUNG had thoughtabout. I had first shown that there are no possible oscillations for some scalarquasi-linear equations in one space variable, but oscillations cannot be killedas fast for semi-linear systems in one space variable, and it was a little surpris-ing then that the compensated compactness theory could help characterize theoscillations in systems of two equations like the Carleman model. It is impor-tant to notice that in the compensated compactness theory, Young measuresare just used as a passive tool, because they cannot by themselves see the3 Guy METIVIER, French mathematician, born in 1950. He worked at Purdue Uni-

versity, West Lafayette, IN, at Universite de Rennes I, Rennes, France, and atUniversite de Bordeaux I, Talence, France.

4 Robert PALLU DE LA BARRIERE, French mathematician, born in 1922. He workedin Caen and at Universite Paris VI (Pierre et Marie Curie), Paris, France.

5 Lev Davidovich LANDAU, Azerbaijan-born physicist, 1908–1968. He received theNobel Prize in Physics in 1962, for his pioneering theories for condensed matter,especially liquid helium. He had worked in Leningrad, in Kharkov, and in Moscow,Russia.

6 Evgenii Mikhailovich LIFSCHITZ, Russian physicist, 1915–1985. He had workedin Moscow, Russia.

Page 192: From hyperbolic systems to kinetic theory: a personalized quest

21 Oscillating Solutions: the Broadwell Model 173

differential structure used to express the equations; Young measures are onlya language for expressing what the compensated compactness theory says,and as the compensated compactness uses micro-local objects (and I madethis point more precise by the introduction of H-measures [18]), the Youngmeasures can only express some of the consequences which do not make use ofthe differential structure. For a more interesting situation like the Broadwellmodel, the compensated compactness theory is not powerful enough for de-scribing what is happening, and if one had found a better mathematical tool,some of the consequences could probably be expressed in terms of Young mea-sures, but Young measures cannot be the important part of the argument, andone should not use the term Young measures (or come back to the old termof parametrized measures) as if it had a magical power. More and more, onehears people who replace knowledge by incantation, believing that by usingtechnical words their message will be thought to be deep, a question thatFEYNMAN had considered in [15].7

In the early 1980s, George PAPANICOLAOU mentioned that when the ini-tial data are periodically modulated, i.e. of the form

an(x) = a(x,

x

εn

), (21.11)

and for a quantity propagating at speed c, he guessed that the solution wouldhave the form

A(x,x− c t

εn, t), (21.12)

and he expected a simple equation for the function A(x, y, t). We checkedeasily the case of the Carleman model.

Proposition 21.2. The solutions (un, vn) of the Carleman model with initialdata

un(x, 0) = a(x, xεn

)in R

vn(x, 0) = b(x, xεn

)in R,

(21.13)

with 0 ≤ a, b ≤M in R × R, periodic of period 1 in y (and smooth enough topresent no difficulties with measurability, and for weak limits to be obtainedby averaging in y), are such that

7 FEYNMAN described the teaching of his father on that question, saying that whenhis father was taking him for a walk and observed a bird, he would tell him thename of the bird and give him his imagined version of what people call that bird invarious parts of the world, and his father concluded by telling him that if he knewthe name of the bird in all these languages, he would still know nothing aboutthe bird. He also described the behaviour of some graduate students in physics,who learned physics as if it was a foreign language, and did not understand therelation with the real world.

Page 193: From hyperbolic systems to kinetic theory: a personalized quest

174 21 Oscillating Solutions: the Broadwell Model

un(x, t) −A(x, x−tεn

, t)→ 0 strongly in R × (0,∞)

vn(x, t) −B(x, x+t

εn, t)→ 0 strongly in R × (0,∞),

(21.14)

where the convergence holds in Lploc strong for 1 ≤ p < ∞ and L∞ weak �,and A,B are periodic with period 1 in y and are the solutions of

At(x, y, t) +Ax(x, y, t) +A2(x, y, t) − ∫ 1

0B2(x, z, t) dz = 0

in R × (0, 1) × (0,∞)Bt(x, y, t) −Bx(x, y, t) −

∫ 1

0A2(x, z, t) dz +B2(x, y, t) = 0

in R × (0, 1) × (0,∞),

(21.15)

with initial data

A(x, y, 0) = a(x, y), B(x, y, 0) = b(x, y) in R × (0, 1). (21.16)

Proof : One extracts a subsequence such that un ⇀ U1, vn ⇀ V1, (un)2 ⇀ U2

and (vn)2 ⇀ V2 in L∞ weak �, and one solves

At(x, y, t) +Ax(x, y, t) +A2(x, y, t) − V2(x, t) = 0 in R × (0, 1) × (0,∞);A(x, y, 0) = a(x, y) in R × (0, 1),

(21.17)and one wants to show that un(x, t) − A

(x, x−tεn

, t)

tends to 0 strongly inR × (0,∞); one observes that 0 ≤ a(x, y) ≤ M and 0 ≤ V2 ≤ M2 imply0 ≤ A(x, y, t) ≤ M for t > 0. One defines un by un(x, t) = A

(x, x−tεn

, t), and

one observes that

(un)t + (un)x + (un)2 − V2 = 0 in R × (0,∞); un(·, 0) = un(·, 0). (21.18)

One wants to show that un − un converges to 0 strongly, and one writes anequation for (un − un)2, namely((un−un)2

)t+((un−un)2

)x+2(un+un)(un−un)2 = 2(un−un)

((vn)2−V2

),

(21.19)and besides using 2(un+un)(un−un)2 ≥ 0, one notices that 2(un−un)

((vn)2−

V2

)⇀ 0 in L∞ weak � by an application of the div-curl lemma, because

(un − un)t + (un − un)x is bounded in L∞ and((vn)2 − V2

)t− ((vn)2 − V2

)x

is bounded in L∞; if a subsequence of (un− un)2 converges weakly to �, thenone finds �t + �x ≤ 0 and � |t=0= 0 and therefore � = 0, as one cannot have� < 0. As a consequence, one deduces that U2, the weak � limit of (un)2, isthe weak � limit of A2

(x, x−tεn

, t), which is given by averaging with respect to

the fast variable, i.e. U2(x, t) =∫ 1

0 A2(x, z, t) dz. Similarly, one solves

Bt(x, y, t) −Bx(x, y, t) − U2(x, t) +B2(x, y, t) = 0 in R × (0, 1) × (0,∞);B(x, y, 0) = b(x, y) in R × (0, 1),

(21.20)

Page 194: From hyperbolic systems to kinetic theory: a personalized quest

21 Oscillating Solutions: the Broadwell Model 175

and one shows that vn(x, t)−B(x, x+t

εn, t)

tends to 0 strongly in R×(0,∞), and

one deduces that V2(x, t) =∫ 1

0 B2(x, z, t) dz. This shows that A,B satisfies

the desired equations, and because one has uniqueness for that system (whichis a locally Lipschitz perturbation of something explicit), one deduces that itis true for the whole sequence. ��

The case of the Broadwell model is a little more technical.

Proposition 21.3. The solutions (un, vn, wn) of the Broadwell model withinitial data

un(x, 0) = a(x, xεn

)in R

vn(x, 0) = b(x, xεn

)in R

wn(x, 0) = c(x, xεn

)in R,

(21.21)

with 0 ≤ a, b, c ≤M in R×R, periodic of period 1 in y (and smooth enough),are such that

un(x, t) −A(x, x−tεn

, t)→ 0 strongly in R × (0,∞)

vn(x, t) −B(x, x+t

εn, t)→ 0 strongly in R × (0,∞)

wn(x, t) − C(x, xεn

, t)→ 0 strongly in R × (0,∞),

(21.22)

where the convergence holds in Lploc strong for 1 ≤ p <∞ and L∞(R× (0, T )

)

weak � for every 0 < T <∞, and A,B,C are periodic with period 1 in y andare the solutions of

At(x, y, t) +Ax(x, y, t) +A(x, y, t)∫ 1

0 B(x, z, t) dz − ∫ 1

0 C2(x, z, t) dz = 0

in R × (0, 1) × (0,∞)Bt(x, y, t) −Bx(x, y, t) +B(x, y, t)

∫ 1

0A(x, z, t) dz − ∫ 1

0C2(x, z, t) dz = 0

in R × (0, 1) × (0,∞)Ct(x, y, t) −

∫ 1

0 A(x, y − z, t)B(x, y + z, t) dz + C2(x, y, t) = 0in R × (0, 1) × (0,∞),

(21.23)with initial data

A(x, y, 0) = a(x, y), B(x, y, 0) = b(x, y), C(x, y, 0) = c(x, y) in R × (0, 1).(21.24)

Proof : One extracts a subsequence such that un ⇀ U1, vn ⇀ V1, wn ⇀ W1,(un)2 ⇀ U2, (vn)2 ⇀ V2 and (wn)2 ⇀ W2 in L∞ weak �, and because onehas 0 ≤ un, vn, wn ≤ F (M, t), one deduces that 0 ≤ U1, V1,W1 ≤ F (M, t) and0 ≤ U2, V2,W2 ≤ F 2(M, t). One solves

At(x, y, t) +Ax(x, y, t) +A(x, y, t)V1(x, t) −W2(x, t) = 0in R × (0, 1) × (0,∞); A(x, y, 0) = a(x, y) in R × (0, 1), (21.25)

Page 195: From hyperbolic systems to kinetic theory: a personalized quest

176 21 Oscillating Solutions: the Broadwell Model

and one wants to show that un(x, t) − A(x, x−tεn

, t)

tends to 0 strongly inR × (0,∞); one defines un by un(x, t) = A

(x, x−tεn

, t), and one observes that

(un)t + (un)x + unV1 −W2 = 0 in R × (0,∞); un(·, 0) = un(·, 0). (21.26)

One wants to show that un − un converges to 0 strongly, and one writes anequation for (un − un)2, namely

((un − un)2

)t+((un − un)2

)x

+ 2V1(un − un)2

= 2un(un − un)(vn − V1) + 2(un − un)((wn)2 −W2

),

(21.27)

and besides using 2V1(un−un)2 ≥ 0, one notices that 2un(un−un)(vn−V1) ⇀0 and 2(un−un)

((wn)2−W2

)⇀ 0 in L∞ weak � by an application of the div-

curl lemma, because(un(un−un)

)t+(un(un−un)

)x

and (vn−V1)t−(vn−V1)xare bounded in L∞, and because (un − un)t + (un − un)x and

((wn)2 −W2

)t

are bounded in L∞; if a subsequence of (un−un)2 converges weakly to �, thenone finds �t + �x ≤ 0 and � |t=0= 0 and therefore � = 0; as a consequence, onededuces that U1(x, t) =

∫ 1

0A(x, z, t) dz. Similarly, one solves

Bt(x, y, t) −Bx(x, y, t) +B(x, y, t)U1(x, t) −W2(x, t) = 0in R × (0, 1) × (0,∞); B(x, y, 0) = b(x, y) in R × (0, 1), (21.28)

and one shows that vn(x, t) − B(x, x+t

εn, t)

tends to 0 strongly in R × (0,∞),

and one deduces that V1(x, t) =∫ 1

0 B(x, z, t) dz.The next step is more technical, and consists in replacing the term unvn

by a simpler term, and considering the solution zn of the equation

(zn)t − hn + (zn)2 = 0 in R × (0,∞); zn(·, 0) = wn(·, 0), (21.29)

where

hn(x, t) = H(x, xεn

, t)

in R × (0,∞)

H(x, y, t) =∫ 1

0A(x, y − z, t)B(x, y + z, t) dz in R × (0, 1) × (0,∞),

(21.30)

so that one has

zn(x, t) = C(x, xεn

, t)

in R × (0,∞)

Ct(x, y, t) −∫ 1

0A(x, y − z, t)B(x, y + z, t) dz + C2(x, y, t) = 0

in R × (0, 1) × (0,∞); C(x, y, 0) = c(x, y) in R × (0, 1).

(21.31)

The estimates are technical, because it is not that unvn − hn is small, butthat after integrating in t the difference is small, and one must use bounds onA,B but also their moduli of continuity in t helps. ��

For a system of two equations, the assumption that the initial data areperiodically modulated is not a big restriction, because in that case the Young

Page 196: From hyperbolic systems to kinetic theory: a personalized quest

21 Oscillating Solutions: the Broadwell Model 177

measure of the solution is determined by the Young measure of the initial data;apart from measurability questions (which I am not so fond of), for every se-quence an creating a Young measure, one can create a periodically modulatedfunction a such that an and a

(x, xεn

)define the same Young measure; one can

also perform rearrangements in the y variable without changing the Youngmeasure. However, for the Broadwell model, the term

∫ 1

0 A(y− z)B(y+ z) dzchanges if one rearranges A or B, and therefore the oscillations in the solu-tions depend upon something more precise than Young measures, as there areresonance effects which play a role. One should also notice that if one preparesperiodically modulated oscillations with different periods for a, b, c, then theresonance effects cannot occur if some ratios are irrational.

Using Fourier series, i.e. A(x, y, t) =∑

m∈ZAm(x, t)e2i πmy, B(x, y, t) =

∑m∈Z

Bm(x, t)e2i π my, C(x, y, t) =∑

m∈ZCm(x, t)e2i πmy, then

∫ 1

0 A(x, y −z, t)B(x, y + z, t) dz =

∑m∈Z

Am(x, t)Bm(x, t)e2i πmy, and the system forA,B,C can be written as an infinite system

(A0)t + (A0)x +A0B0 −∑

k∈ZCkC−k = 0

(Am)t + (Am)x +AmB0 = 0 for m �= 0(B0)t − (B0)x +A0B0 −

∑k∈Z

CkC−k = 0(Bm)t − (Bm)x +A0Bm = 0 for m �= 0(C0)t −A0B0 +

∑k∈Z

CkC−k = 0(C2m)t −AmBm = 0 for m �= 0(C2m+1)t = 0 for all m,

(21.32)

with the corresponding Fourier coefficients of a, b, c as initial data. The coeffi-cients A0, B0, C0 are nonnegative, but the other coefficients may be complex,with A−m = Am for example. It is important to observe that such a system isa natural consequence of the Broadwell model, once one follows my philosophyof checking stability with respect to an adapted weak convergence; physicistsoften derive similar systems for what they call particles, and they invent somegames for explaining the equations that they use, but there is no need to in-vent a game for solving the preceding infinite system, or to use a language ofparticles for talking about the solution of the system, as any mathematicianwho has learnt functional analysis knows. Actually, the term particle itself isjust a remnant of an 18th century point of view on mechanics (called classicalmechanics), which deals with rigid bodies and ordinary differential equations,by opposition to continuum mechanics, which is an 18/19th century pointof view on mechanics, and deals with partial differential equations. It is im-portant to understand that there are no particles, but just waves, i.e. partialdifferential equations with a hyperbolic character.

[Taught on Friday October 26, 2001.]

Page 197: From hyperbolic systems to kinetic theory: a personalized quest

178 21 Oscillating Solutions: the Broadwell Model

Notes on names cited in footnotes for Chapter 21, HERIOT,8 WATT,9

MIELKE.10

8 George HERIOT, Scottish goldsmith, 1563–1624. Heriot–Watt University in Ed-inburgh, Scotland, is partly named after him.

9 James WATT, Scottish engineer, 1736–1819. He had worked in Glasgow, Scotland.Heriot–Watt University in Edinburgh, Scotland, is partly named after him.

10 Alexander MIELKE, German mathematician, born in 1958. He works in Stuttgart,Germany.

Page 198: From hyperbolic systems to kinetic theory: a personalized quest

22

Generalized Invariant Regions; the VaradhanEstimate

Around 1984, I learnt of a computation by Thomas BEALE, who had shownthat for bounded nonnegative data with finite total mass, the solution of theBroadwell model is globally bounded in L∞(R) (but his global bound wasnot expressed in an explicit way). I then simplified a part of his analysis, anddeveloped a method which I called the generalized invariant region method.1

In his analysis, Thomas BEALE introduced two functions, which are potentialfunctions related to the conservation of mass and the conservation of momen-tum, expressed in the form

(u+ w)t + ux = 0 in R × (0,∞)(v + w)t − vx = 0 in R × (0,∞). (22.1)

In view of these, it is natural to introduce the functions U and V by

U(x, t) =∫ x−∞(u(z, t) + w(z, t)

)dz

V (x, t) =∫ +∞x

(v(z, t) + w(z, t)

)dz,

(22.2)

and the important properties of U and V are that they are nonnegative andthat both their derivatives in x and in t are expressed in terms of u, v, w,namely

limx→−∞ U(x, t) = 0; limx→+∞ U(x, t) =∫

R(u0 + w0) dx

limx→−∞ V (x, t) =∫

R(v0 + w0) dx; limx→+∞ V (x, t) = 0

Ux = u+ w ≥ 0; Ut = −u ≤ 0; Vx = −(v + w) ≤ 0; Vt = −v ≤ 0.(22.3)

I had shown that for bounded nonnegative data with small norm in L1 theasymptotic behaviour as t tends to ∞ is that u looks like u∗(x − t), v looks1 In 1985, Takaaki NISHIDA mentioned that the method is the same as one that

Tai-Ping LIU had used, for regularization by artificial viscosity of systems ofconservation laws, I believe. Henri CABANNES also mentioned that he had useda similar idea in the 1950s.

Page 199: From hyperbolic systems to kinetic theory: a personalized quest

180 22 Generalized Invariant Regions; the Varadhan Estimate

like v∗(x + t) and w looks like 0, and the integral of u∗ is∫

R(u0 + w0) dx,

while the integral of v∗ is∫

R(v0 + w0) dx. Conservation of mass expresses

that∫

R

(u(·, t) + v(·, t) + 2w(·, t)) dx is independent of t, and conservation

of momentum expresses that∫

R

(u(·, t) − v(·, t)) dx is independent of t, and

it is equivalent to say that∫

R

(u(·, t) + w(·, t)) dx and

∫R

(v(·, t) + w(·, t)) dx

are independent of t, and the physical interpretation of these quantities isthat the first one is the mass which eventually finds its way to +∞ and thatthe second one is the mass which eventually finds its way to −∞. Actually,although I had only shown that for small initial mass, it is true for any finiteinitial mass; I am not sure if Thomas BEALE had shown that, but it doesfollow from an improvement by Raghu VARADHAN, which was shown to meby Kamel HAMDACHE.

The introduction of U and V is then quite natural. The function U in-creases from 0 to

∫R(u0 + w0) dx (the mass ending up to +∞), and U(x, t)

measures how much of the mass going to +∞ has already gone to the rightof the point x at time t, and because Ut = −u ≤ 0 the flow to the right isirreversible. Similarly, the function V decreases from

∫R(v0 +w0) dx (the mass

ending up to −∞) to 0, and V (x, t) measures how much of the mass goingto −∞ has already gone to the left of the point x at time t, and becauseVt = −v ≤ 0 the flow to the left is irreversible. As I shall show in more detail,U(·, t) and V (·, t) permit one to give a measure of the amount of interactionbetween the particles which will take place after time t.2

The method of invariant regions, which does not give any interesting resultfor nonnegative solutions of the Broadwell model, consists in looking for aset C ⊂ R

3, necessarily of the form [0, α] × [0, β] × [0, γ], such that if theinitial data take their values in C then the solution has values in C for allt > 0. A natural improvement is to have α, β, γ functions of t and to ask thestronger requirement that if at time s the values taken belong to C(s) thenat any later time t the values taken belong to C(t), and this implies somedifferential inequalities for α, β, γ which have no globally bounded solution(the requirement is much stronger than the physical one, that initial datataking their values in C(0) give rise to a solution with values in C(t) at timet, a problem that one does not know how to analyse well).

What I call the method of generalized invariant regions consists, in theexample of the Broadwell model, in looking for inequalities of the form2 Before these results, I had already pointed out an analogy with the method that

James GLIMM had introduced for quasi-linear systems of conservation laws, wherehe used an hypothesis of small variation. The relation between his problem andmine is that his estimates were for equations of the form Ut +

(F (U)

)x

= 0,

and that it is V = Ux which satisfies a semi-linear equation Vt + ∇F (U).Vx +∇2F (U) : (V, V ) = 0; however, even around a constant U my condition (S) is notsatisfied, because of genuine nonlinearity hypotheses. The analogy between thesetwo questions became much clearer after the estimate of Raghu VARADHAN.

Page 200: From hyperbolic systems to kinetic theory: a personalized quest

22 Generalized Invariant Regions; the Varadhan Estimate 181

0 ≤ u(x, t) ≤ α(t, U(x, t), V (x, t)

)

0 ≤ v(x, t) ≤ β(t, U(x, t), V (x, t)

)

0 ≤ w(x, t) ≤ γ(t, U(x, t), V (x, t)

),

(22.4)

and this takes advantage of the fact that one can express the derivatives of Uand V in terms of u, v, w.

Traditionally, proving L∞ estimates consists in comparing the solutionto a constant function, but I observed that the solution does not look likea constant function, as for large t the solution u looks like u∗(x − t), forexample; however, I noticed that for large t the function U also looks likeU∗(x − t), and therefore it seems much more natural to compare u to U inorder to obtain an L∞ bound. Similarly it seems natural to compare v to V .I wrote the inequalities that the general functions α, β, γ must satisfy, but Isoon restricted my attention to particular inequalities

0 ≤ u ≤ λ(ε+ U), 0 ≤ v ≤ μ(ε+ V ), 0 ≤ w ≤ ν, (22.5)

where ε, λ, μ, ν are positive constants. One uses Ut + Ux = Vt − Vx = w,and one wants that if u = λ(ε + U) then ut + ux ≤ λ(Ut + Ux) so that theinequality cannot change in the evolution; this gives w2 − u v ≤ λw, andconsidering the worst case v = 0, one is led to impose ν ≤ λ, so that w ≤ λ.Similarly, one wants that if v = μ(ε+V ) then vt−vx ≤ μ(Vt−Vx), which givesw2 − u v ≤ μw, and considering the worst case u = 0, one is led to imposeν ≤ μ; finally, one wants that if w = ν then wt ≤ 0, which gives u v−w2 ≤ 0,and one is led to impose λμ(ε + U)(ε + V ) ≤ ν2, and because Ut ≤ 0 andVt ≤ 0, it is enough to impose that λμ(ε+U0)(ε+ V0) ≤ ν2 in R. In the casewhere the initial data (nonnegative with finite total mass) satisfy

U0V0 ≤ θ < 1 in R, (22.6)

then one chooses ε > 0 such that

(ε+ U0)(ε+ V0) ≤ θ′ < 1 in R, (22.7)

and one computes

λ0 =∣∣∣∣∣∣

u0

ε+ U0

∣∣∣∣∣∣L∞(R)

, μ0 =∣∣∣∣∣∣

v0

ε+ V0

∣∣∣∣∣∣L∞(R)

, ν0 = ||w0||L∞(R), (22.8)

and one must satisfy the inequalities λ0 ≤ λ, μ0 ≤ μ, ν0 ≤ ν and ν ≤ λ, ν ≤μ, λμ θ′ ≤ ν2; one may take λ = μ = ν = max{λ0, μ0, ν0}, for example, andthis shows that if (ε+ U0)(ε+ V0) ≤ 1 in R, then

ξ(t) = max{∣∣∣∣∣∣ u(·,t)ε+U(·,t)

∣∣∣∣∣∣L∞(R)

,∣∣∣∣∣∣ v(·,t)ε+V (·,t)

∣∣∣∣∣∣L∞(R)

, ||w(·, t)||L∞(R)

}

is nonincreasing in t ∈ (0,∞),(22.9)

and one deduces global L∞ bounds,

Page 201: From hyperbolic systems to kinetic theory: a personalized quest

182 22 Generalized Invariant Regions; the Varadhan Estimate

0 ≤ u(x, t) ≤ max{λ0, μ0, ν0}(ε+∫

R(u0 + w0) dx

)in R × (0,∞)

0 ≤ v(x, t) ≤ max{λ0, μ0, ν0}(ε+∫

R(v0 + w0) dx

)in R × (0,∞)

0 ≤ w(x, t) ≤ max{λ0, μ0, ν0} in R × (0,∞),

(22.10)

recalling that the hypothesis (ε+ U0)(ε+ V0) ≤ 1 in R has been used.

Of course, if the total mass m =∫

R(u0 + v0 + 2w0) dx is small enough one

has U0V0 ≤ θ < 1, and more precisely if m < 2 one can take θ = m2

4 , becauseone has U0V0 ≤ (∫

R(u0 + w0) dx

)(∫R(v0 + w0) dx

) ≤ 14

(∫R(u0 + w0) dx +

∫R(v0 + w0) dx

)2. However, the condition U0V0 ≤ θ < 1 can be valid for datawith large mass if the initial distribution of mass is adequate, and actuallyone may have U0V0 = 0 everywhere if w0 = 0 and the support of u0 isentirely to the right of the support of v0, and in that case the solution isu(x, t) = u0(x − t), v(x, t) = v0(x − t), w(x, t) = 0. This kind of hypothesis istherefore much better that an hypothesis of small mass, and it has also anotherinteresting feature, that it is not conserved by rearrangement. This type ofcondition reminds one more of the idea used by James GLIMM for quasi-linearsystems of conservation laws, and the analogy became even clearer after anidea of Raghu VARADHAN,3 who considered the quantity

I(t) =∫∫

x<y

(u(x, t) + w(x, t)

)(v(y, t) + w(y, t)

)dx dy, (22.11)

which measures a potential of interaction left at time t.

Lemma 22.1. (S.R.S. Varadhan) For initial data which are nonnegative andwith finite total mass, I(t) is nonincreasing and

dI

dt= −

R

(2u v + uw + v w

)(x, t) dx. (22.12)

Proof : I had noticed that if one applies the div-curl lemma to a sequencesatisfying

(u+ w)t + ux = 0 in R × (0,∞)(v + w)t − vx = 0 in R × (0,∞), (22.13)

one can pass to the limit in v(u + w) + u(v + w), but I had not found howto use that information; actually, it is exactly the same computation whichgives the result of Raghu VARADHAN, but I had not thought of attaching anyimportance to the functions U or V , and multiplying the first equation by Vor the second equation by U gives the desired result:

(V (u+ w)

)t+(V u)x

+ v(u+ w) + (v + w)u = 0(U(v + w)

)t− (U v)

x+ u(v + w) + (u+ w)v = 0, (22.14)

3 The result was mentioned to me by Kamel HAMDACHE, and I do not know if ithad been motivated by simplifying the computations of Thomas BEALE, or hadbeen obtained independently.

Page 202: From hyperbolic systems to kinetic theory: a personalized quest

22 Generalized Invariant Regions; the Varadhan Estimate 183

if one observes that∫

R

(V (u+w)

)dx =

R

((u+w)(x, t)

∫ +∞

x

(v+w)(y, t) dy)dx = I(t), (22.15)

or∫

R

(U(v+w)

)dx =

R

((v+w)(y, t)

∫ y

−∞(u+w)(x, t) dx

)dy = I(t), (22.16)

and I(t) is easily understood as a measure of the interaction that can takeplace after time t. ��

The estimate of Raghu VARADHAN has at least two interesting conse-quences.

The first application is that what I had proven for small mass is true forany finite mass; the difficulty that I had was to find a bound for the integral ofu v, and now, by integrating

(V (u+w)

)t+(V u)x+ v(u+w)+ (v+w)u = 0,

one has∫

R

∫ ∞

0

u v dx dt ≤ I(0) ≤(∫

R

(u0 + w0) dx)(∫

R

(v0 + w0) dx). (22.17)

Then a bound for the integral of w2 follows, and the solutions belong to thefunctional spaces that I had introduced, u ∈ V1, v ∈ V−1, w ∈ V0, whichimplies the asymptotic behaviour for large t, i.e. u looks like u∗(x− t), v lookslike v∗(x + t), and w tends to 0 (as w∗(x) = 0 because w ∈ L2).

The second application is that in the problem with ε > 0, supposedto represent a mean free path between collisions, one had previously foundthat

√uεvε − wε converges strongly to 0 in L2

(R × (0,∞)

), but one did

not know if each term belonged to L2(R × (0,∞)

); now the estimate gives∫

R

∫∞0 uε vε dx dt ≤ I(0), because only the conservation laws have been used

in proving Lemma 22.1, and the results are then valid for all ε > 0.

The problem of letting ε tend to 0, which is more a mathematical ques-tion than a physical one, is still open in general. What Russell CAFLISCH

and George PAPANICOLAOU have proven, is that when the formal limitingequation, which is a quasi-linear hyperbolic system, has a smooth solutionfor 0 ≤ t ≤ T , then on that interval of time uε, vε, wε converge to u∗, v∗, w∗satisfying w∗ =

√u∗v∗, and � = u∗ + v∗ + 2w∗ and q = u∗ − v∗ is the smooth

solution of the quasi-linear system; it is not known if this is valid after theappearance of a shock for the (�, q) system. Russell CAFLISCH has consideredthe case of Riemann data for the (�, q) system, in the case where the solu-tion is a single shock, but he has not succeeded in proving that the formalexpansion is valid. I have conjectured that it does not always converge to theformal limit, and it was one particular reason why I had studied oscillating se-quences of the Broadwell model, but I have also thought that the equation forself-similar solutions (used only locally as they do not have finite total mass)

Page 203: From hyperbolic systems to kinetic theory: a personalized quest

184 22 Generalized Invariant Regions; the Varadhan Estimate

could be the key to some of the missing estimates. Although the Broadwellmodel is far removed from physics, it is an important training ground for de-veloping better mathematical tools for more interesting models, so that onemust consider all these questions as interesting challenges.

The method of generalized invariant regions also gives interesting L∞

bounds for the Carleman model, and I proved in this way the global Illner–Reed estimate with a bound in O(m2 + 1) for C(m), and the order cannotbe improved because it appears for the self-similar solutions. The study ofself-similar solutions, i.e.

u(x, t) = 1t U(xt

)

v(x, t) = 1t V(xt

),

(22.18)

which after using the variable σ = xt and ˙= d

dσ leads to the system

−U − σ U + U + U2 − V 2 = 0−V − σ V − V − U2 + V 2 = 0,

(22.19)

was solved byU = (1 − σ)ZV = (1 + σ)Z(1 − σ2)Z − 2σ Z + 4σZ2 = 0,

(22.20)

and 1Z satisfies a linear equation, giving

Z =1

2 + γ(σ2 − 1), (22.21)

and the parameter γ must be < 2. For γ near 2, Z behaves like 1σ2+ε2 with

ε > 0 small, and the L∞ norm behaves like 1ε2 , while the mass m =

∫ +1

−1Z dσ

behaves like πε ; for self-similar solutions, the L∞ norm is then O(m2) for large

m.For applying the method of generalized invariant regions, one uses

(u+ v)t + (u − v)x = 0, (22.22)

and one introduces

W (x, t) =∫ x

−∞

(u(z, t) + v(z, t)

)dz, (22.23)

so thatWx = u+ v; Wt = v − u, (22.24)

and in particular Wt +Wx = 2v and Wt −Wx = −2u. One looks for boundsof the form

u ≤ A(W )t

v ≤ B(W )t ,

(22.25)

Page 204: From hyperbolic systems to kinetic theory: a personalized quest

22 Generalized Invariant Regions; the Varadhan Estimate 185

which one can easily replace by u ≤ A(W )t+ε , v ≤ B(W )

t+ε with ε > 0 small

in order to avoid the singularity at t = 0. When u = A(W )t , one wants to

have ut + ux ≤ the corresponding derivative −A(W )t2 + A′(W )

t (Wt + Wx), i.e.v2 − u2 ≤ −A(W )

t2 + A′(W )t 2v; checking v = 0 gives A2 ≥ A, or A ≥ 1 and

checking v = B(W )t gives

2BA′ ≥ A+B2 −A2 and A ≥ 1, (22.26)

and similarly, when v = B(W )t , one wants to have vt− vx ≤ the corresponding

derivative −A(W )t2 +A′(W )

t (Wt−Wx), i.e. u2−v2 ≤ −B(W )t2 −B′(W )

t 2u; checkingu = 0 gives B2 ≥ B, or B ≥ 1 and checking u = A(W )

t gives

−2AB′ ≥ B +A2 −B2, and B ≥ 1. (22.27)

Using the analogy with the computation for self-similar solutions, one chooses

A(W ) = (1 + σ)Z(σ)B(W ) = (1 − σ)Z(σ), (22.28)

anddW

dσ= 2Z(σ) (22.29)

shows that equality is obtained instead of inequalities; due to the constraintsA ≥ 1, B ≥ 1, one cannot use the entire interval −1 < σ < 1, and as W mustbe allowed to vary between 0 and m, one must use the self-similar solutionwith a value of γ corresponding to a larger mass, and m′ = m + 2 ensuresthat the integral for the interval where both A and B are ≥ 1 has an integralat least m. This proof provides an L∞ bound O(m2) for large m.

[Taught on Monday October 29, 2001.]

Page 205: From hyperbolic systems to kinetic theory: a personalized quest

23

Questioning Physics; from Classical Particlesto Balance Laws

I have been discussing discrete velocity models for a few reasons. One of themis that they are simpler than the Boltzmann equation, which I shall investigatenow; although this type of model was introduced by MAXWELL around thesame period (around 1860), they seem to have been neglected for a long time;maybe the work of Renee GATIGNOL [16] was one of the first attempts (around1970) to go beyond a few classical examples and study this type of model ina general way. From the physical point of view, models with all the velocitiesof the same length lack the possibility of showing temperature effects, buteven if this is not the case, there is another defect which was pointed out tome by Clifford TRUESDELL in 1975, that they lack the important property ofinvariance by rotation; however, I only understood why invariance by rotationcould be important after 1990, after I had thought that one way to avoid theangular cut-off hypothesis for the kernel in the Boltzmann equation is to usetechniques like those known to specialists of harmonic analysis, like CharlesFEFFERMAN and STEIN, for proving the restriction theorem on spheres, andI have mentioned that at the end of Chapter 14.

The defect of having little physical relevance is not so important if onementions it,1 and discrete velocity models are still an interesting mathematicalarena because there are a few questions which have not been answered yet,suggesting that better mathematical tools must be created.2

1 It becomes quite important if someone pretends that these models have any phys-ical relevance, as it either shows some limited understanding of what continuummechanics or physics are about, or much worse, an intention to mislead. I hadstarted in 1984 to point out a few defects of the Boltzmann equation, but I havemade a curious observation, in that case and in other situations: as many mathe-maticians only want to pretend that the equations that they study are related tocontinuum mechanics or physics, they close their ears to any information aboutthe defects of the models that they use, and the result is that knowledge spreadsat a much slower pace than misinformation does.

2 Many people mistake development for research, but in research it is difficult toascertain in advance what the important features for tackling an unsolved problem

Page 206: From hyperbolic systems to kinetic theory: a personalized quest

188 23 Questioning Physics; from Classical Particles to Balance Laws

The Boltzmann equation was introduced after analysing the behaviourof particles submitted to forces at a distance, and then postulating someprobabilistic outputs of collisions (or nearby collisions).

As has been mentioned already, one must know at what step one haspostulated a probabilistic game, or made any other assertion which one hasnot proven, because if one wants to understand more about the gigantic puzzleof the real world, one must first backtrack to a point where one had not yetpostulated something about the answer, in order to look for a better wayto solve the problem.3 From this point of view, the discrete velocity modelsare postulated at too early a stage, and they are lacking the beginning ofthe derivation of the Boltzmann equation, where one invokes a computationinvolving two particles and forces at a distance before postulating the form ofa kernel; however, although the defect of postulating probabilities occurs later,some defects appear already in the first stage, and they give more reasons whythe Boltzmann equation is not really suitable for describing gases which arenot rarefied.

Classical mechanics is an 18th century point of view of mechanics, whereordinary differential equations are used as the basic mathematical tool, and itdeals with rigid bodies, often assimilated to points. The particles invoked forderiving the Boltzmann equation are assimilated to points and their kineticenergy only has a translation part, unlike in a game of billiards, where ballshave a rotational kinetic energy and spin is an important effect to be taken intoaccount for predicting the result of a collision. I shall show later a particular

will be. The simplified versions of a problem that are invented or the new oneswhich are proposed may have lost some important feature of the physical problem,and new obstacles may be created in the “simpler” versions, which are not presentin the initial problem. It is part of the reseach work to decide if one should pursuein one direction or investigate in another one, and at the end it might appearthat the mathematical problems concerning discrete velocity models, althoughinteresting mathematically, are not really relevant to realistic questions, but oneshould not forget that the Boltzmann equation itself is not so realistic.

3 When I learnt about ionic solutions in chemistry, I was puzzled by the type ofargument which the teacher used: after applying the law of action of masseshe obtained a polynomial equation, and then he assumed that the unknown xwas small so that he would neglect all powers of x compared to x and solve alinear equation, and obtaining a value like 10−2 he would observe that indeedx was small. Of course, if one considers an equation x3 − 3x + ε = 0, where εis a small positive quantity, this argument says that one looks for the simplifiedequation −3x + ε = 0, and one accepts x = ε

3; however, the equation does have

a solution x = ε3

+ O(ε3), but also two other solutions x = ±√3 − ε

6+ O(ε2),

and I find it better to mention that there are theorems about the way roots ofpolynomials depend upon the coefficients of the polynomial, and that one shouldcheck for other solutions even if they are not small, and that one should consideran evolution equation so that a study of stability could be performed around eachof the solutions in order to ascertain which ones have some chance to be observed.

Page 207: From hyperbolic systems to kinetic theory: a personalized quest

23 Questioning Physics; from Classical Particles to Balance Laws 189

case of the Boltzmann equation, called the hard-sphere case, where particlesare spheres of radius a which only interact when two spheres collide, i.e. whenthe distance of two centres is 2a, but for all other cases one assumes thatparticles feel a force which depends upon the distance between the particles.

Forces at a distance is the first defect of this approach, but this point ofview which goes back to NEWTON was only challenged by POINCARE (andmaybe EINSTEIN) in the theory of relativity, and this defect was not knownthen to BOLTZMANN or to MAXWELL in the 1860s. If I have understood cor-rectly,4 one problem created by the notion of instantaneous forces acting ata distance is the question of instantaneity; it is not difficult for a mathemati-cian to imagine that each particle is paired with an angel who computes thetotal force that his particle feels by adding the forces created by all the otherparticles of the universe, because he is in telepathic connection with all thecorresponding angels,5 and this is what a mathematician means by writingv(x) =

∫K(x, y)u(y) dy, independently of the way one will evaluate u(y), the

kernel K(x, y), or the integral itself. If POINCARE and EINSTEIN understoodthat there is a problem for putting all the clocks (of the particles) at the sametime, physicists do not seem to be as bothered by the notion of distance, andthey talk about a universe in expansion while hiding the strange methods usedfor computing “distance”.6

4 Physicists like to make fun of mathematicians for not understanding some ofthe games that they invent, but one reason is that mathematicians are not sogood at guessing, and physicists rarely express clearly all the rules of the gamesthat they play; they often discard some old rule and replace it by a new one,and sometimes they even discard completely a game that they have been playingfor many years. Mathematicians’ duty is to be precise and they are trained tounderstand implications, but although a theorem proven today will remain trueforever, mathematicians should be careful when claiming that what they do isimportant because it is related to applications; often, they have not learnt enoughabout the practical applications that they mention, or they do not care much if amodel that they use may be soon discarded as obsolete, and it may have alreadybeen obsolete before they started working on it.

5 I follow the French, where the word for angel is masculine (un ange), and genderis a grammatical notion, and it is not related to an early debate, when peoplehad argued if angels were male or female, and they should have thought thatthey could be both, or after all that they might be neither. Of course, it does notmatter at all for my argument if angels exist or not.

6 The nearby stars move slightly with respect to the background, so their distanceis measured by parallax, up to a few light years or about one parsec, I suppose,which is the distance at which the diameter of the earth orbits around the sun,which is about 280 million kilometers, is seen under an angle of one second ofarc. After that the distance of the stars is too great to measure, but one hasobserved some relation between luminosity and distance for those stars which arenear enough, and so one switches to measuring luminosity, and one pretends thatone is measuring “distance”, and far away one switches to something else by wayof another observed relation that one postulates to be always true, so that when

Page 208: From hyperbolic systems to kinetic theory: a personalized quest

190 23 Questioning Physics; from Classical Particles to Balance Laws

As I mentioned earlier, what I call the Maxwell–Heaviside equation iswhat others call the Maxwell equation, because it was HEAVISIDE who wrotethe equation that one uses now, a huge simplification of what MAXWELL

had derived, because MAXWELL was thinking in purely mechanical termsfor transmitting the electric field and the magnetic field, probably becausethey correspond to transversal waves which were supposed to propagate onlythrough solids, so I gather that it was related to what physicists called etherfor a while, which might be the same as what they call the vacuum nowa-days. Although some mathematical theories were first developed because ofquestions in physics or in continuum mechanics, the results proven for ordi-nary differential equations or for partial differential equations are not linkedto what one thought were good equations for describing the physical world,and mathematicians do not really need to know what were all the philosophi-cal problems that physicists had in changing their intuitive description of theworld, but the mathematicians who have doubts about the validity of somemodels, and who start enquiring about how the equations that one proposesfor them to solve had been derived, certainly face quite challenging situationsfor their talents of detective.

The experience of MICHELSON and MORLEY seemed to show that the ve-locity of light c does not change in a frame which moves at a constant velocitywith respect to a first one,7,8 and it might have been a reason why POINCARE

(and maybe EINSTEIN) was led to replace NEWTON’s point of view of forcesacting instantaneously at a distance, and develop the new point of view whereparticles feel a field and interact with it, the field being a solution of a hy-perbolic system having only the velocity of light c as the characteristic speed,a mathematical consequence being that to relate the measurements in thetwo frames, one must use the Lorentz group of transformations, instead ofthe Galilean group of transformations. From a mathematical point of view,the result is that instead of ordinary differential equations one must workwith partial differential equations (of hyperbolic type), and this should nothave been a surprise to anyone who had understood the passage from classicalmechanics, which is an 18th century point of view of mechanics based on ordi-nary differential equations, to continuum mechanics, which is a 19th centurypoint of view of mechanics, based on partial differential equations. However,the attitude of using a classical mechanics point of view and talking aboutparticles is still prevalent in physics, and one of the reasons why physicistsstill interpret quantum mechanics in terms of probabilities is that they want

astronomers say that the redshift is proportional to distance, one has to wonder ifthey have not postulated it and use the redshift as a measure of their “distance”.

7 Albert Abraham MICHELSON, Polish-born physicist, 1852–1931. He received theNobel Prize in Physics in 1907, for his optical precision instruments and thespectroscopic and metrological investigations carried out with their aid. He hadworked in Worcester, MA, and in Chicago, IL.

8 Edward Williams MORLEY, American physicist, 1838–1923. He had worked inCleveland, OH.

Page 209: From hyperbolic systems to kinetic theory: a personalized quest

23 Questioning Physics; from Classical Particles to Balance Laws 191

to describe the behaviour of nonexistent particles, while they are actuallylooking at waves. The mathematical way to understand waves deals with par-tial differential equations of hyperbolic type, and certainly not with ordinarydifferential equations, even Hamiltonians, but it might be because of somelimiting situations, like geometrical optics derived from the scalar wave equa-tion, that physicists may have thought that there was nothing wrong aboutkeeping an 18th century point of view, instead of learning the consequencesof the 19th century point of view and going forward.

A similar situation exists when one starts from a problem in linearized elas-ticity, and one derives the Saint-Venant approximation for elongated bodies,and the set of formulas obtained form the basic rules of resistance of materials,which engineers use for computing the behaviour of systems of bars and beamsin buildings. There are two attitudes if one needs to deal with a structure likethe hyperbolic paraboloids used as cooling towers for power plants;9 the firstone is to go back to the theory of linearized elasticity and to derive equationsvalid for thin shells, and then to discretize the equations obtained in orderto perform numerical simulations; the second one is to imagine the structureas an assemblage of a huge number of bars and beams. Obviously, the sec-ond solution resembles the first after one has performed a discretization, butone learns in numerical analysis that not all discretizations are good,10 butalthough good engineers have often invented interesting numerical schemes, itseems that the proofs that a numerical scheme converges always rely on thefirst approach and the identification of an adapted variational framework.

The particles that the physicists use are like the bars and beams that theengineers use for computing a thin shell structure; contrary to the appearance,the structure is not full of holes but resists the wind, and these strange barsand beams which oppose the wind are a little similar to the strange particleswhich manage to be in many places at the same time. Obviously, this typeof difficulty disappears if one understands that continuum mechanics recre-ates classical mechanics in some limiting cases, and there is no doubt thatchronologically, continuum mechanics was partly obtained as a limit of classi-9 The cooling towers are about one hundred metres high and thin enough so that

one uses shell theory for studying their elastic behaviour, or better their visco-elastic behaviour (because concrete is a visco-elastic material), in particular forthe way they react to the wind. From afar, one does not always see that theydo not touch the ground, and they are built on pylons, because their purpose isto create an upward draught of air, and I had thought that this shape had beenfound very efficient for creating a strong draught, but I was told that the shapehas been used since the 19th century for a much simpler reason, because it isvery easy to build, and that is because of the two families of straight lines whichgenerate these hyperboloids.

10 There are questions of consistency to check, or one may approach the solution of adifferent equation, and there are conditions of stability to check, or the numericalscheme may diverge.

Page 210: From hyperbolic systems to kinetic theory: a personalized quest

192 23 Questioning Physics; from Classical Particles to Balance Laws

cal mechanics,11 like in the work of D. BERNOULLI and of CAUCHY, that willhelp us understand a little more about forces; what a force is will not be foundin this way, and the physicists’ description of what happens at the atomic levelis not so clear. In the early 1980s, I was already wondering about what a forceis, and I asked the question to a few people; Robin KNOPS pointed out thatsome definitions can be circular,12 because a force is something which is mea-sured with a dynamometer, and a dynamometer is based on the theory oflinearized elasticity, and what one has measured is a displacement, so at theend one has not really defined what a force is, but some terms are called forcesin the equations of linearized elasticity, or the equations of finite elasticity. Iheard later that experiments cannot be independent of a theory for interpret-ing the result of the experiment, and this shows why physics is necessarilyquite different from mathematics. I have understood some questions about“particles” because of some mathematical results for H-measures [18], which Ihad developed for another purpose, and I hope to derive a better mathemat-ical tool, which will explain more questions about “particles” and about the“forces that bind them”.

In some simple linear partial differential equations, the relation betweenforces acting at a distance and the equivalent effect of a field can be seeneasily. If one considers a repartition of fixed electric charges �, and one usesthe Maxwell–Heaviside equation for the vacuum and without a magnetic field,so that curl(E) = 0, div(D) = � and D = ε0E, then using E = −grad(V ) fordefining the electrostatic potential V chosen to be 0 at infinity, one has theLaplace/Poisson equation −ε0ΔV = �. Using the elementary solution 1

4π r of−Δ in R

3, one has V = 14π ε0r

� �, i.e.

V (x) =∫

R3

14π ε0|x− y| �(y) dy, (23.1)

showing that a charge q′ at y creates a potential q′

4π ε0|x−y| at x; in that casethe force on a charge q at x is q E, and it looks as if the charge q′ at y iscreating a force of magnitude |q q′|

4π ε0|x−y|2 on the particle at x, the force beingrepulsive if the two charges have the same sign, and attractive if they haveopposite sign.

Forces inversely proportional to the square of the distance suggest then thepresence of a Laplacian in an equation, and when one knows that an elemen-tary solution of −Δ+α2 is e−α r

4π r , which had been introduced by YUKAWA,13 for

11 The Euler equation for ideal fluids was guessed directly, and not obtained after alimiting process, I believe.

12 Robin John KNOPS, English mathematician, born in 1932. He worked at Heriot–Watt University, Edinburgh, Scotland.

13 Hideki YUKAWA, Japanese physicist, 1907–1981. He received the Nobel Prize inPhysics in 1949, for his prediction of the existence of mesons on the basis oftheoretical work on nuclear forces. He had worked in Kyoto, Japan.

Page 211: From hyperbolic systems to kinetic theory: a personalized quest

23 Questioning Physics; from Classical Particles to Balance Laws 193

describing the short range of nuclear forces, one understands that this type oftruncated potential may appear because of the presence of a term of order zeroin an equation with a Laplacian. Francois MURAT and Doina CIORANESCU

have studied the apparition of a zero-order term by homogenization,14 whichthey call a strange term coming from nowhere, but George PAPANICOLAOU,who has studied a probabilistic version with Raghu VARADHAN, mentionedthat such examples of screening effects are common in physics, for examplein plasmas, where one uses α = 1

RD, where RD is called the Debye radius.15

However, there are other potentials which physicists use, like the Lennard-Jones potential,16 which is attractive with a force in r−6 when particles arefar apart and repulsive with a force in r−12 when particles are close together,for which I do not know any relation with a system of partial differentialequations.

Physicists also use forces which do not depend only upon the position ofa particle but also upon its velocity v, and the electrostatic force q E alreadymentioned is actually a truncated form of the Lorentz force q(E + v×B). Ofcourse, one needs to use the Maxwell–Heaviside equation

div(B) = 0; Bt + curl(E) = 0div(D) = �; −Dt + curl(H) = j,

(23.2)

and as the density of charge � is interpreted as an average of point chargesqi and the current density j is interpreted as an average of qivi, there is adensity of Lorentz force

�E + j ×B, (23.3)

whose power density is (j.E).17

14 Doina POP-CIORANESCU, Romanian-born mathematician. She works at CNRS(Centre National de la Recherche Scientifique) and Universite Paris VI (Pierre etMarie Curie), Paris, France.

15 Petrus (Peter) Josephus Wilhelmus DEBYE, Dutch-born physicist, 1884–1966.He received the Nobel Prize in Chemistry in 1936, for his contributions to ourknowledge of molecular structure through his investigations on dipole momentsand on the diffraction of X-rays and electrons in gases. He had worked in Zurich,Switzerland, in Utrecht, The Netherlands, in Gottingen, in Leipzig and in Berlin,Germany, and then at Cornell University, Ithaca, NY.

16 Sir John Edward LENNARD-JONES, British chemist, 1894–1954. He had workedin Bristol and in Cambridge, England.

17 The Maxwell–Heaviside equation can be expressed in terms of differential forms,one 2-form ω2 having coefficients E and B whose exterior derivative is 0 (andtherefore the exterior derivative of a 1-form ω1 having coefficients V and A, thescalar and vector potentials), and another 2-form ω′

2 having coefficients D andH , whose exterior derivative is a 3-form ω3 having coefficients � and j. Usingthe Euclidean structure of R

3 one can associate a 1-form ω′1 to ω3, and the

exterior product of ω2 and ω′1 (or an interior product of ω2 and ω3) is a 3 form

ω′3 whose coefficients are �E + j × B and (j.E). What puzzled me in the mid

1970s, after I had learnt about this formulation from Joel ROBBIN, was that the

Page 212: From hyperbolic systems to kinetic theory: a personalized quest

194 23 Questioning Physics; from Classical Particles to Balance Laws

I shall show that if one denotes by f(x,v, t) a density of particles at pointx and time t and having velocity v, then in the case where there are noforces acting on the particles, f satisfies the equation ∂f

∂t + v.∂f∂x = 0, so thatf(x,v, t) = f(x− tv, v, 0).

Let us consider what happens when there are forces acting on the particles.If one tries to understand what forces are, one is bound to stumble upon otherconcepts which have not been defined in a clear way, like mass. In tryingto understand what forces between particles are, and what the mass of aparticle is, one difficulty is that there are no particles in the real world andthey are only idealizations; only waves exist and any explanation of originmust start at a very small scale (where physicists talk of quantum effects,which I have proposed to look at in a different way), and then one mustexplain what important quantities are needed at mesoscopic levels and at ourmacroscopic level. Unfortunately, there is much left to be understood in thisdirection, and one is bound to use the intermediate description of continuummechanics (where questions appearing at microscopic levels and mesoscopiclevels are often mentioned), and its simplification of classical mechanics (whereone always forgets to mention the restrictive assumptions which are made);rigid particles are in the realm of classical mechanics and I shall start inthat way, but the limiting behaviour for letting a number of particles tend toinfinity will take us into the realm of continuum mechanics, if not further.

Let M(t) denote the position of a particle, which then has velocityv = dM(t)

dt and acceleration a = d2M(t)dt2 ; if there are forces acting on the parti-

cle, Newton’s law is then force = mass× acceleration, i.e. m d2M(t)dt2 = F (t),

and mass is just a positive parameter. A force is actually known by itswork,18 or its power,19 and multiplying the equation by dM(t)

dt one finds thatddt

(m2

∣∣dM(t)

dt

∣∣2)

=(F (t).dM(t)

dt

), and m

2

∣∣dM(t)

dt

∣∣2 is the kinetic energy of the

particle. Relativistic effects are not taken into account here, and in that casea particle with rest mass m0 is said to have a mass depending upon its velocityby the formulam = m0√

1−v2/c2 , and its energy is given by the formula e = mc2,

usually attributed to EINSTEIN, but which had been in print before,20 so thatfor v small compared to the velocity of light c one has e − e0 ≈ m0

2 v2, giv-

ing the classical formula for the kinetic energy, but I think that it does notreally make much sense using a classical mechanics framework for discussing“relativistic particles”.21

weak topology is natural for ω1, ω2, ω′2 and ω3, but �E+ j×B and (j.E) are not

among the sequentially weakly continuous functionals, and it suggested that theweak topology is not adapted to forces.

18 work = force × displacement.19 power = force × velocity.20 It seems that POINCARE had used it in 1900, and DE PRETTO in 1903.21 FEYNMAN wrote that, because of the Lorentz compression of length in the di-

rection of the movement, he thought of electrons moving at a velocity near the

Page 213: From hyperbolic systems to kinetic theory: a personalized quest

23 Questioning Physics; from Classical Particles to Balance Laws 195

The force usually depends upon the position of the particle, and some-times upon its velocity, like for the Coriolis force,22 which was actually firstintroduced by LAGRANGE, or the Lorentz force in electromagnetism,23 andone often talks of a force field defined everywhere and not only at places wherethere are particles, and one may think that the force field at a point could bemeasured if one could add a new particle at that point.

If one considers many particles with the same mass m and the same chargeq, feeling the Lorentz force created by an electric field E and a magneticinduction field B (depending upon (x, t)), then each particle position satisfiesan equation

md2M

dt2= q(E +

dM

dt×B), (23.4)

and if there are many particles and one takes a limit for an infinite number ofparticles while keeping the ratio q

m = γ, the limit density f(x,v, t) satisfiesthe equation

∂f

∂t+

3∑

i=1

vi∂f

∂xi+ γ

3∑

i=1

(Ei +

3∑

j,k=1

εi,j,kvjBk

) ∂f∂vi

= 0. (23.5)

Let us deduce the equations for fluid quantities, like the density � and themomentum P related to the (macroscopic) velocity u by P = � u, defined by

�(x, t) =∫

R3 f(x,v, t) dvPi(x, t) =

∫R3 vif(x,v, t) dv for i = 1, 2, 3. (23.6)

Integrating the equation in v over R3 gives the conservation of mass (or con-

servation of charge)∂�

∂t+

3∑

i=1

∂Pi∂xi

= 0, (23.7)

at least if f tends to 0 fast enough as v tends to ∞, so that integrals inv of derivatives in v are 0, because one finds that γ

(∑iEi∫

R3∂f∂vi

dv +∑i,j,k εi,j,kBk

∫R3 vj

∂f∂vi

dv)

= 0, because εi,j,kvj ∂f∂vi= εi,j,k

∂(vjf)∂vi

, as the com-pletely antisymmetric tensor εi,j,k is such that εi,j,k = 0 if two of the indicesi, j, k are equal. If one multiplies by vk before integrating in v, one obtainsthe equation expressing the balance of momentum,

velocity of light c as flat pancakes, but using an idea of BOSTICK, I think theymay look more like flat doughnuts.

22 Gaspard Gustave DE CORIOLIS, French mathematician, 1792–1843. He hadworked in Paris, France.

23 Using the formula (u.∇)u = −u×curl u+grad |u|22

, Euler equation for an incom-pressible ideal (inviscid) fluid, �0(∂t + u.∇)u + gradp = f (and div u = 0) takes

the form of the Lorentz force f = �0(E+ u×B) with E = ∂tu+ grad(

p�0

+ |u|22

)

and B = −curl u, which satisfy the corresponding part of Maxwell–Heavisideequation, div B = 0 and ∂tB + curl E = 0.

Page 214: From hyperbolic systems to kinetic theory: a personalized quest

196 23 Questioning Physics; from Classical Particles to Balance Laws

∂Pk

∂t +∑3i=1

∂Ri,k

∂xi= γ(�E + P ×B)k

Ri,k =∫

R3 vivkf dv for i, k = 1, 2, 3,(23.8)

and if one defines the symmetric Cauchy stress tensor σ by

σi,k = −∫

R3(v − ui)(v − uk)f dv for i, k = 1, 2, 3, (23.9)

then

3∑

i=1

∂Ri,k∂xi

=3∑

i=1

∂(� uiuk)∂xi

−3∑

i=1

∂σi,k∂xi

for k = 1, 2, 3. (23.10)

Similar computations were done in the 1860s by BOLTZMANN, but in theBoltzmann equation a force different from the Lorentz force appears, whichis supposedly computed from the interaction of pairs of particles (and thatimplicitly assumes that one is dealing with a rarefied gas).

[Taught on Wednesday October 31, 2001.]

Notes on names cited in footnotes for Chapter 23, BOSTICK,24 DE PRETTO,25

and for the preceding footnotes, STEVENS.26

24 Winston Harper BOSTICK, American physicist, 1916–1991. He had worked atStevens Institute of Technology, Hoboken, NJ.

25 Olinto DE PRETTO, Italian industrialist, 1857–1921.26 Edwin Augustus STEVENS, American engineer and philanthropist, 1795–1868.

The Stevens Institute of Technology, Hoboken, NJ, is named after him.

Page 215: From hyperbolic systems to kinetic theory: a personalized quest

24

Balance Laws; What Are Forces?

When one considers a finite number of particles, with particle i having massmi, and position M i(t), and feeling a force F i(t), conservation of mass is justthe fact that the masses mi are independent of t.

The condition dmi

dt = 0 for all i is equivalent to the equation

∂�

∂t+

3∑

k=1

∂Pk∂xk

= 0 in the sense of distributions, (24.1)

where one writes formally

� =∑im

iδ(Mi(t),t)

P =∑im

i dMi

dt δ(Mi(t),t),(24.2)

but � and P should be considered as Radon measures (or distributions) in(x, t), acting as

〈�, ϕ〉 =∑

i

∫Rmiϕ(M i(t), t) dt

〈P, ϕ〉 =∑i

∫Rmi dM

i(t)dt ϕ(M i(t), t) dt,

(24.3)

for all ϕ which are continuous with compact support (or C∞ with compactsupport).

In order to check (24.1), one takes ϕ to be C1 with compact support, andone has

⟨∂�∂t +

∑3k=1

∂Pk

∂xk, ϕ⟩

= −⟨�, ∂ϕ∂t

⟩−∑3

k=1

⟨Pk,

∂ϕ∂xk

= −∑i

∫Rmi d[ϕ(Mi(t),t)]

dt dt =∑

i

∫R

dmi

dt ϕ(M i(t), t) dt,(24.4)

so that, assuming the position of the particles to be distinct, one sees that(24.1) is equivalent to dmi

dt = 0 for all i.

When one lets the number of particles tend to infinity, one rescales themasses of the particles in order to have the corresponding � (density of mass)

Page 216: From hyperbolic systems to kinetic theory: a personalized quest

198 24 Balance Laws; What Are Forces?

and the corresponding P (density of linear momentum) converge to Radonmeasures (or distributions), and equation (24.1) stays valid at the limit, as itis written in the sense of distributions.

For a subsequence of � to converge to a Radon measure it is sufficient thatfor every compact K and every T > 0 there exists a constant C(K;T ) suchthat

∑{i|Mi(t)∈K}m

i ≤ C(K;T ) for 0 < t < T (assuming that the initial timeof interest is 0). For a subsequence of P to converge to a Radon measure it issufficient that for every compact K and every T > 0 there exists a constantC1(K;T ) such that

∑{i|Mi(t)∈K}m

i∣∣dMi

dt

∣∣ ≤ C1(K;T ) for 0 < t < T .

Because the masses mi are positive, the condition for a subsequence of � toconverge to a distribution is the same, from the remark of Laurent SCHWARTZ

that nonnegative distributions coincide with nonnegative Radon measures;however, it is different for the density of linear momentum, and it may happenthat a subsequence of P converges to a distribution without converging to aRadon measure, or even that it converges to a Radon measure in the sense ofdistributions but not in the sense of Radon measures; the same considerationswill arise when dealing with forces.

To study the balance of linear momentum, one uses the equations of motion

mi d2M i

dt2= F i for all, (24.5)

but it is not yet important to know what the forces F i are, i.e. I shall not usethe information

F i =∑

j =i Fi,j , with F i,j the force exerted on particle i by particle j

F i,j is in the direction of the particle j,with F i,j + F j,i = 0 for all i �= jF i,j depending only upon the distance between particle i and particle j.

(24.6)Besides � and P already used, the equation of balance of momentum uses thetensor R and the resultant force F defined by

R =∑

imi(dMi

dt ⊗ dMi

dt

)δ(Mi(t),t)

F =∑

i Fiδ(Mi(t),t),

(24.7)

which should be considered as Radon measures (or distributions) in (x, t), andthe equation of balance of linear momentum takes the form

∂P∂t

+3∑

k=1

∂Rk,∂xk

= F for � = 1, 2, 3, (24.8)

because⟨∂P�

∂t +∑3k=1

∂Rk,�

∂xk, ϕ⟩

= −∑i

∫Rmi dM

i�

dt∂ϕ∂t (M

i(t), t) dt

−∑i,k

∫Rmi dM

ik

dtdMi

dt∂ϕ∂xk

(M i(t), t) dt

=∑

i

∫Rmi d

2Mi�

dt2 ϕ(M i(t), t) dt = 〈F, ϕ〉,(24.9)

Page 217: From hyperbolic systems to kinetic theory: a personalized quest

24 Balance Laws; What Are Forces? 199

as a consequence of

ddt

(dMi

dt ϕ(M i(t), t))

= dMi�

dt

(∂ϕ∂t (M

i(t), t) +∑3k=1

dMik

dt∂ϕ∂xk

(M i(t), t))

+ d2Mi�

dt2 ϕ(M i(t), t).(24.10)

A limit in the sense of Radon measures requires that∑

{i|Mi(t)∈K} |F i(t)| ≤C2(K,T ) for 0 < t < T , and this is questionable, and it will be discussedlater.

The macroscopic velocity u of transport of mass is defined by writingPk = � uk for k = 1, 2, 3, and then one writes

Rk, = � uku − σk,, for k, � = 1, 2, 3, (24.11)

and σ is the Cauchy stress tensor, which is symmetric.The basic idea in kinetic theory is to introduce a density of particles

f(x,v, t) which sees the position and the velocity of the particles, and thenone writes

�(x, t) =∫

R3 f(x,v, t) dvP (x, t) =

∫R3 v f(x,v, t) dv

R(x, t) =∫

R3(v ⊗ v)f(x,v, t) dv.(24.12)

From these formulas, one deduces that the Cauchy stress tensor is given by

σ = −∫

R3

((v − u) ⊗ (v − u)

)f(x,v, t) dv, (24.13)

and in the case of a gas at equilibrium, the density is a Gaussian and theCauchy stress tensor reduces to a hydrostatic pressure, i.e. σi,j = −p δi,j.

In this new point of view, the particle i will be a Dirac mass of weight mi

at the point(M i(t), dM

i(t)dt , t

)in the (x,v, t) space, and in order to derive an

equation for f one needs to understand more about the forces.

I have mentioned that forces may correspond to objects which are notnecessarily Radon measures, but are distributions in the sense of LaurentSCHWARTZ. One classical notion in physics is that of a dipole, and it is thelimit of a sequence k(δa − δb) when the points a and b get very near andthe coefficient k tends to ∞ in such a way that |k| |a − b| converges to a(nonzero) constant; Laurent SCHWARTZ had noticed that these objects arejust derivatives (in the sense of distributions, of course) of Dirac masses.1

Recalling that 〈δa, ϕ〉 = ϕ(a), one has 〈 ∂δa

∂xj, ϕ〉 = −〈δa, ∂ϕ∂xj

〉 = − ∂ϕ∂xj

(a). Forexample, in one dimension, the sequence μn = n(δ1/n − δ0) is not boundedin the space of Radon measures M(R), and if one uses the Banach space ofRadon measures with finite total mass (dual of C0(R), the space of continousfunctions tending to 0 at ∞, with the sup norm), the norm of μn is 2n.

1 DIRAC might have used this intuition for using derivatives of his “function”.

Page 218: From hyperbolic systems to kinetic theory: a personalized quest

200 24 Balance Laws; What Are Forces?

However, μn is bounded in the space of distributions of order ≤ 1, because forϕ a function of class C1 with compact support, one has |〈μn, ϕ〉| ≤ max |ϕ′|,and actually μn converges to −(δ0)′, because 〈μn, ϕ〉 = n

(ϕ(

1n

) − ϕ(0))

=ϕ′(0) + o(1) → ϕ′(0) = 〈−(δ0)′, ϕ〉; this is consistent with the fact that ifwn(x) = n for 0 < x < 1

n and 0 elsewhere, then wn converges to δ0 in the senseof Radon measures, and in the sense of distributions one has (wn)′ = −μn.

One is in a similar situation, where limits may not exist in the sense ofRadon measures but may exist in the sense of distributions, when particles getvery near, and forces between them become large, for example when one dealswith forces depending upon the distance as negative powers of the distance.

It is important then to have some idea about what are reasonable hy-potheses concerning forces, and for this I shall show a model, which has beenused by D. BERNOULLI for approximating the movement of a string by thatof small masses linked by springs; of course, this is a model for a solid, andone should be careful about the way one uses it for questions about liquids,or about gases. BERNOULLI was interested in the frequencies of vibrations,as for a violin string, and he did not derive the string equation (i.e. the waveequation in one dimension), and D’ALEMBERT is credited for writing the one-dimensional wave equation,2 but I do not think that he derived it by goingfurther than BERNOULLI’s analysis, and he may just have written an equa-tion that would have solutions of the form u(x, t) = f(x− c t) for an arbitrary(smooth) function f , as well as solutions of the form v(x, t) = g(x + c t) foran arbitrary (smooth) function g, and checking that the equation that he hadwritten, utt−c2uxx = 0, had the general solution f(x−c t)+g(x+c t). POISSON

may have been the first to work on the three-dimensional wave equation.3

[Taught on Friday November 2, 2001.]

2 Jean LE ROND, known as D’ALEMBERT, French mathematician, 1717–1783. Hehad worked in Paris, France.

3 His motivation may have been the study of pressure waves in gases.

Page 219: From hyperbolic systems to kinetic theory: a personalized quest

25

D. Bernoulli: from Masslets and Springsto the 1-D Wave Equation

In a gas, the forces between particles can become quite large when two par-ticles are near, but these forces are of the same magnitude but opposite indirection and there is then some cancellation effect; in a limiting process, whenthe number of particles gets large and one rescales their masses, one will findthat some sequence may converge in the sense of distributions but not in thesense of Radon measures.

In order to study this phenomenon, I shall use a discrete model of a vibrat-ing string, by considering small masses connected with springs, an idea goingback to D. BERNOULLI, but in order to do the analysis completely I shall con-sider longitudinal waves,1 while the vibration of a violin string is a transversalwave,2 for which the analysis uses a linearization, and at the limit one obtainsthe string equation, i.e. the one-dimensional wave equation. I shall show laterthe corresponding analysis for two- or three-dimensional bodies, which uses alinearization too; it was first used by CAUCHY, and it generates the equationfor linearized elasticity.

I consider small masses m1, . . . ,mN−1 moving on a line between 0 and L,and occupying positions 0 < x1(t) < . . . < xN−1(t) < L, and I use x0(t) = 0and xN (t) = L. For j = 1, . . . , N , there is a spring with constant κj > 0 andequilibrium length �j > 0 between the masses at xj−1 and at xj (but x0 andxN are actually walls, which do not move). The increase in length of spring jis xj − xj−1 − �j , so that the force exerted at xj is −κj(xj − xj−1 − �j) andthe force exerted at xj−1 is κj(xj − xj−1 − �j); one deduces that the equationfor the movement of the mass j is1 One generates a longitudinal wave in a metallic bar by hitting it with a hammer,

in the direction of the length, so that the motion of the points and the directionof propagation of the wave are along the length of the bar.

2 Because the movement of a point is perpendicular to the string, along which thewave propagates.

Page 220: From hyperbolic systems to kinetic theory: a personalized quest

202 25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation

mjd2xjdt2

+κj(xj−xj−1−�j)−κj+1(xj+1−xj−�j+1) = 0, for j = 1, . . . , N−1,

(25.1)and one must pay attention to the fact that, once these equations are written,it is not clear if the evolution will enforce xj−1(t) < xj(t) for j = 1, . . . , Nand all t > 0.3 If one denotes by yj the equilibrium position of mass j, thenone must have

κj(yj − yj−1 − �j) − κj+1(yj+1 − yj − �j+1) = 0,for j = 1, . . . , N − 1, with y0 = 0, yN = L,

(25.2)

and the existence of a solution of (25.2) is equivalent to its uniqueness, as itis a linear system with N − 1 equations for N − 1 unknowns, but one mustcheck if the solution satisfies yj−1 < yj for j = 1, . . . , N . To prove uniqueness,one considers the homogeneous version of (25.2),

κj(zj − zj−1) − κj+1(zj+1 − zj) = 0, for j = 1, . . . , N − 1, with z0 = zN = 0,(25.3)

then multiplying by zj and summing in j gives∑N

j=1 κj(zj − zj−1)2 = 0, andtherefore all the zj are equal and must be 0 as z0 = zN = 0.

Existence being proven, let FL be defined by

FL = κj(zj − zj−1) = κj(yj − yj−1 − �j) for all j = 1, . . . , N, (25.4)

which uses (25.3), so that FL is the force that one should apply at the pointL in order to maintain equilibrium with the last point at position L; one findseasily from (25.4) that

( 1κ1

+ . . .+1κN

)FL = L− (�1 + . . .+ �N ), (25.5)

so that

if L ≥ �1 + . . .+ �N , then FL ≥ 0 and yj−1 < yj for j = 1, . . . , N, (25.6)

and this is the case when the springs under no tension have lengths that donot add up to L and one must stretch them. In the case when the springsunder no tension have total length larger than L, and one must compress thesprings (remembering that the model has ruled out buckling), one wants tohave yj−1 < yj for j = 1, . . . , N , which is �j + FL

κj> 0 for j = 1, . . . , N , or

FL > − minj=1,...,N

κj�j, (25.7)

and using (25.5) one deduces that

3 The reason is that the law of force applied to a spring to compress it has beenlinearized, and as a consequence only a finite force is necessary to squeeze it tozero length. Of course, buckling is not taken into account in the model.

Page 221: From hyperbolic systems to kinetic theory: a personalized quest

25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation 203

�1 + . . .+ �N < L+(

1κ1

+ . . .+ 1κN

)minj=1,...,N κj�j

implies yj−1 < yj for j = 1, . . . , N,(25.8)

and the condition (25.8) is automatically true if all the �j are equal and allthe κj are equal.

Writingxj(t) = yj + zj(t) for j = 0, . . . , N, (25.9)

one obtains the equations

mjd2zjdt2

+ kj(zj − zj−1)− kj+1(zj+1 − zj) = 0, for j = 1, . . . , N − 1. (25.10)

Multiplying by dzj

dt and summing in j gives

N−1∑

j=1

mj

2

(dzjdt

)2

+N∑

j=1

kj2

(zj − zj−1)2 = constant, (25.11)

but this constant is not always the total energy, sum of the kinetic energy,which is here

K(t) =N−1∑

j=1

mj

2

(dxjdt

)2

=N−1∑

j=1

mj

2

(dzjdt

)2

, (25.12)

and of the potential energy (i.e. the elastic energy stored inside the springs),which is here

P(t) =∑Nj=1

κj

2 (xj − xj−1 − �j)2 =∑N

j=1κj

2 (zj − zj−1 + yj − yj−1 − �j)2

=∑Nj=1

κj

2

(zj − zj−1 + FL

κj

)2

=∑Nj=1

κj

2 (zj − zj−1)2 + FL

2

(L− (�1 + . . .+ �N)

),

(25.13)because

∑Nj=1(zj−zj−1) = zN −z0 = 0, and (25.5) gives FL

(1κ1

+ . . .+ 1κN

)=

L− (�1 + . . .+ �N), so the constant is the total energy only in the case whenL = �1 + . . .+ �N , corresponding to FL = 0.

One may naively think that if one starts with the springs under no tension,occupying the length �1 + . . .+ �N and one applies the force FL until the endpoint is at position L, then the work of the force is FL

(L−(�1 + . . .+�N )

)and

this is not the value FL

2

(L− (�1 + . . .+ �N )

)which appears in the preceding

formula, but if one behaves in such a naive way, one will not end up with massj at position yj and with velocity 0, of course. If at time 0 one starts withx1(0) = �1, x2(0) = �1 + �2, . . . , xN (0) = �1 + . . . + �N , with dxj

dt (0) = 0 forj = 1, . . . , N , and one applies a force F (t), then the system is changed andone no longer has xN (t) = L but mN

d2xN

dt2 + κN (xN − xN−1 − �N) = F (t),while before there was no mass mN involved; between time 0 and T the work

Page 222: From hyperbolic systems to kinetic theory: a personalized quest

204 25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation

of the force is going to be∫ T

0 F (t)dxN

dt dt; multiplying equation j by dxj

dt andsumming in j one obtains that d

dt

[K(t) + mN

2

(dxN

dt

)2 +P(t)]

= F (t)dxN

dt , andtherefore the work done between time 0 and T is K(T )+mN

2

(dxN

dt (T ))2+P(T );

if at time T one has succeeded in having xN (T ) = L and dxN

dt (T ) = 0, thenthe work done by the force is exactly the total energy of the system that onehad considered from the start. The possibility of finding a force F (t) suchthat at time T the xj(T ) and dxj

dt (T ) take given values for j = 1, . . . , N is aquestion of controllability; there is an algebraic characterization for that andit can be checked that the system is indeed controllable if one has κj > 0 forall j.4

For simplicity, let us assume now that all the masses are equal to m and allthe strengths of the springs are equal to κ, and one looks at periodic solutionsof the form x(t) = y+eiω ta, where y is the equilibrium solution, and one findsthat one must have M0a = ω2a, where M0 is the symmetric matrix definedby

(M0)i,j = 0 for j �= i or j �= i± 1(M0)i,i = 2κ

m for i = 1, . . . , N − 1(M0)i,i−1 = (M0)i,i+1 = −κ

m for i = 1, . . . , N − 1,(25.14)

and one either discards the condition for (M0)1,0 and (M0)N−1,N in this list,or one considers that one must have a0 = aN = 0. Recalling trigonometricformulas, one can find explicitly the eigenvectors and eigenvalues of M0 bychoosing p = 1, . . . , N − 1, and then a defined by

ai = sin( i p πN

)for i = 1, . . . , N − 1, (25.15)

which gives the eigenvalue

ω2p = 2

κ

m

[1 − cos

(p πN

)]=

4κm

sin2( p π

2N

), (25.16)

so the corresponding frequencies of vibration of the system are

νp =ωp2π

=√κ

π√m

sin( p π

2N

), (25.17)

4 In the case of a system dXdt

= AX + B u, with X of size M , the necessary andsufficient condition for controllability is that the rank of the matrix with block

columns Y = (B AB . . . AM−1B ) is M . Here M = 2N with X =

(xdxdt

)

and A =

(0 I

−M0 0

), where M0 is a tridiagonal matrix, and B = e2N ; AB

puts eN in the span of the columns of Y , and then A2B puts M0eN , which is acombination of e2N−1 and e2N if κN > 0, so e2N−1 is in the span and A3B putseN−1 and A4B puts M0eN−1, which adds e2N−2 in the span if κN−1 > 0, and soon.

Page 223: From hyperbolic systems to kinetic theory: a personalized quest

25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation 205

and therefore if N is large the lowest frequency is ν1 ≈√k

2N√m

.Then, the problem is to let N tend to infinity, while rescaling correctly

m and κ. To rescale mass, it is natural to take m = m∗N , where m∗ is the

mass of the vibrating string that one is trying to model, which one thinksas divided into N equal parts. The rescaling for κ is less intuitive; as forceis mass × acceleration, the unit for κ is mass × time−2 and if mass is m∗

N

one may choose time = time∗N , which corresponds to κ = N κ∗, and this

corresponds to keeping the lowest frequency almost fixed, and it might be theway BERNOULLI thought. One may also choose to have a constant (nonzero)speed of propagation so length and time should be rescaled in the same way,but BERNOULLI could not have thought in terms of wave speed, as he hadnot derived the one-dimensional wave equation. One may also ask that forcesstay bounded and do not tend to 0, because from the experimental evidenceof tuning a violin (which was probably the intuition of BERNOULLI) somehigh tension must be put on the strings but certainly not an infinite one; apiece of the string of length L

N might show an increase in length of the sameorder O

(1N

)but for a force O(1) and therefore κ must be O(N). Another way

of looking at the problem would be to have a bounded kinetic energy and abounded potential energy; for the kinetic energy one has N terms which areO(

1N

)as m is O

(1N

)and dxj

dt is of order O(1), and for the potential energy oneexpects xj−xj−1−�j to be O

(1N

)and in order to have κ(xj−xj−1−�j)2 also

of order O(

1N

), one needs to have κ of order O(N); I do not know if thinking

in terms of potential energy was a natural approach for BERNOULLI.

All the preceding considerations consist in arguing about physical intu-ition, but one should be aware that one’s physical intuition might be wrong,and in facing a physical problem that one is not sure about, one should turn tothe mathematical side and prove various theorems, under different hypotheses.Suppose then that one scales m = m∗

N , as this part is clear, and that one usesa sequence κ(N) which one is ready to let behave in various ways as N → ∞;suppose that one starts from an initial datum where xj(0) = yj = j L

N , withdxj

dt = O(1) for j = 1, . . . , N − 1, so that K(0) = O(1) and P(0) = 0; fromthe solution of the differential system one may construct a function uN whichat any time t is continuous and piecewise affine with uN

(j LN , t)

= xj(t) − yj,and one wonders what happens to uN as N → ∞.

If κ(N)N → 0, one finds that every weak limit u∞ of a subsequence extracted

from the sequence uN satisfies ∂2u∂t2 = 0; this is the case where there is only

kinetic energy and no elastic behaviour giving rise to a nonzero potentialenergy. If κ(N)

N → ∞, one finds that every weak limit u∞ satisfies ∂2u∂x2 = 0;

this is the case where the string behaves as a rigid body. If κ(N)N → κ �= 0, one

finds that every weak limit u∞ satisfies ∂2u∂t2 −c2 ∂

2u∂x2 = 0, with c2 = κL2

m∗ ; this isthe case corresponding to a vibrating string (without dissipation, while the realone slowly loses energy). These results can be proven with standard variationaltechniques, commonly used in the abstract part of numerical analysis, where

Page 224: From hyperbolic systems to kinetic theory: a personalized quest

206 25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation

the problem considered is usually the opposite one, starting from the waveequation and wanting to approach its solution, but the ideas are the sameand use the bound on the total energy in a crucial way; notice that the choicexj(0) = yj for all j was chosen as a simplification, but if one wants to startwith a nonzero potential energy, then it is better to take it bounded if onewants a subsequence of uN to converge in a reasonable functional space.

One important result in the preceding computation is that forces wereO(1), and this suggests that one must consider the convergence in the sense ofdistributions for the sequence of Radon measures denoted before

∑i FiδMi(t),

as it may not stay bounded in the space of Radon measures. However, oneshould pay attention that the preceding model is a model for a one-dimensionalsolid, and one should think about the differences between fluids and gases. Ifone considers water (H2O), then a mole weighs around 18 grams and occupiesaround 18 cm3 if it is liquid,5 and slightly more if it is solid,6 while thecorresponding amount of water vapour occupies 22.4 dm3, so there is a factor1,244 for increase in volume, corresponding to a factor around 10.8 for increasein distance between molecules; the number of molecules is huge, about theAvogadro number, 6.022 ×1023 in 22.4 dm3, but I am not able to determinethe size of the forces between molecules, for which one certainly needs moreexperimental information. For example, one transforms one gram of ice intowater at 0◦C with 80 calories, then one needs about 100 calories to heat itup to the boiling temperature of 100◦C under the usual atmospheric pressureof 1 bar (= 105 pascals), and then 537 calories to transform it into vapour,and one calorie is 4.18 joules. One difficulty is that a part of this energythat one has supplied has been used for breaking bonds and a part used forgiving internal energy to the gas, whose origin is supposed to be the kineticenergy of molecules. In understanding bonds, physicists play with Lennard-Jones potentials, which have the form A

r6

(6

r6 − 2), where � is a characteristic

length, where the potential attains it minimum, and A6 is the energy in the

bond; however, it seems to me to be one possible model among millions, sothat for real forces between molecules I cannot really explain anything forsure.

Anyway, one important thing to observe is that internal forces requiregoing beyond Radon measures and this is how the Cauchy stress tensor σappears, corresponding to a force g given by gi =

∑j∂σi,j

∂xj, and distributions

of order 1 have therefore appeared in a natural way.5 I suppose that it was one basic idea of the French scientists who developed the

metric system at the end of the 18th century, that one just had to carry a gradedstick to know the unit of length, and then the unit of mass was derived, with a kilo-gram being the mass of a litre of water, occupying a cube of side 10 centimetres.

6 The case of water is special, as for almost all other liquids there is a loss of volumeduring solidification, and I have only heard of bismuth as an other exception.

Page 225: From hyperbolic systems to kinetic theory: a personalized quest

25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation 207

In the point of view of looking at particles in the (x,v, t) space, one con-siders the Radon measure denoted formally by

f =∑

i

miδ(Mi(t),V i(t)) with V i =dM i

dt, (25.18)

but it is actually a Radon measure in (x,v, t) acting on a function ϕ by

〈f, ϕ〉 =∫ T

0

i

miϕ(M i(t),

dM i(t)dt

, t)dt. (25.19)

In this framework one is led to denote

Φ(M,V, t) the force acting on a particleof unit mass at M having velocity V at time t, (25.20)

and under suitable hypotheses one can derive an equation expressing bothconservation of mass and the balance of linear momentum, and this equationis

∂f

∂t+

3∑

j=1

vj∂f

∂xj+

3∑

j=1

Φj∂f

∂vj= 0, under the hypothesis divvΦ = 0. (25.21)

Indeed, for a finite sum and for a function ϕ of class C1 with compact support,one has

⟨∂f∂t, ϕ⟩

= −⟨f,∂ϕ

∂t

⟩= −

i

∫ T

0

mi ∂ϕ

∂t

(M i(t),

dM i(t)dt

, t)dt, (25.22)

and if one notices that

ddt

[ϕ(M i(t), dM

i(t)dt , t

)]= ∂ϕ

∂x

(M i(t), dM

i(t)dt , t

).dM

i

dt

+∂ϕ∂v

(M i(t), dM

i(t)dt , t

).d

2Mi

dt2 + ∂ϕ∂t

(M i(t), dM

i(t)dt , t

),

(25.23)

one deduces that

⟨∂f∂t, ϕ⟩

=3∑

k=1

⟨vkf,

∂ϕ

∂xk

⟩+

3∑

k=1

⟨Φkf,

∂ϕ

∂vk

⟩, (25.24)

from which one deduces that

∂f

∂t+

3∑

j=1

vj∂f

∂xj+

3∑

j=1

Φj∂f

∂vj+ f divvΦ = 0. (25.25)

The Lorentz force in electromagnetism, or the Coriolis force in moving frames,satisfy divvΦ = 0.

Page 226: From hyperbolic systems to kinetic theory: a personalized quest

208 25 D. Bernoulli: from Masslets and Springs to the 1-D Wave Equation

Another aspect that should be kept in mind is that one should actuallybe interested not only in the density of particles in some regions of x spaceor (x,v) space, but also in correlations of distances between particles; forexample if particles are all spheres of radius a, one looks at the probabilityof finding particles whose centres are at a distance d, for d ≥ 2a. Using amolecular dynamics approach, i.e. computing the evolution of a large numberof particles with interactions following a given force law, one can computea few averages and correlations of positions, and compare to experimentalmeasurements (done for example by using neutron scattering), and one can fitthe best values of the parameters of the force law used (two for Lennard-Jonespotentials). This approach has the advantages and defects of all numericalmethods in the absence of a well developed theory, that it can only provideconjectures. There is a Percus–Yevick equation for correlations,7,8 but I havenot studied the subject enough to judge its validity.

[Taught on Monday November 5, 2001.]

7 Jerome Kenneth PERCUS, American mathematician. He works at NYU (NewYork University), New York NY.

8 George J. YEVICK, American physicist. He works at Stevens Institute of Tech-nology, Hoboken NJ.

Page 227: From hyperbolic systems to kinetic theory: a personalized quest

26

Cauchy: from Masslets and Springs to 2-DLinearized Elasticity

We have seen that in the case of equal small masses m and springs of equalstrength κ, the equations are

m d2xi

dt2 + κ(2xi − xi−1 − xi+1) = 0for i = 1, . . . , N − 1, with x0(t) = 0, xN (t) = L.

(26.1)

By rescaling,

m = m∗N , κ = N κ∗, give m∗ d

2xi

dt2 +N2κ∗(2xi − xi−1 − xi+1) = 0for i = 1, . . . , N − 1.

(26.2)

Letting N tend to +∞, gives an approximation of the wave equation

∂2u

∂t2− c2

∂2u

∂x2= 0, with c2 =

κ∗L2

m∗. (26.3)

Choosing � = LN as mesh size (often denoted Δx or h), and defining xj(t) =

u(j �, t), one checks the consistency of the scheme by considering a smoothsolution of (25.3) and using the Taylor expansion of u at the point (j �, t), oneobtains xj+1(t) = u

((j + 1)�, t

) ≈ u + � ∂u∂x + 2

2∂2u∂x2 + o(�2) and xj−1(t) =

u((j−1)�, t

) ≈ u− � ∂u∂x + 2

2∂2u∂x2 +o(�2), where u, ∂u∂x and ∂2u

∂x2 are evaluated atthe point (j �, t), and therefore as N2κ∗

m∗�2 = c2, one deduces that N2κ∗

m∗(2xi −

xi−1 − xi+1) = −c2 ∂2u∂x2 + o(1).

This computation is the consistency of the difference scheme with respectto the wave equation, and it helps in proving that, if the numerical scheme con-verged, the limit would satisfy the wave equation; that the numerical schemedoes converge is related to a different property, the stability of the numericalscheme, and this property can be deduced from the bound on the total en-ergy that was derived for the discrete approximation. It is actually a generalremark, due to Peter LAX, that for linear partial differential equations, the

Page 228: From hyperbolic systems to kinetic theory: a personalized quest

210 26 Cauchy: from Masslets and Springs to 2-D Linearized Elasticity

consistency and the stability of a numerical scheme usually imply its conver-gence; of course, the linearity is a crucial assumption in this remark.1

The wave equation ∂2u∂t2 − c2 ∂

2u∂x2 = 0 is conservative, but real materials

are slightly dissipative, and for this reason one often considers the dissipativemodel

∂2u

∂t2− c2

∂2u

∂x2+

∂u

∂t= 0, (26.4)

where τ is a characteristic time. Multiplying by ∂u∂t , one deduces that2

dE(t)dt + 1

τ

∫ L0

(∂u∂t

)2

dx = 0, with E(t) =∫ L

0

[12

(∂u∂t

)2

+ c2

2

(∂u∂x

)2]dx,

the total energy at time t,(26.5)

so that E(t) is nonincreasing. Actually, the total energy tends to 0 exponen-tially, and one way to see this is to multiply the equation by ∂u

∂t + ετ u and one

obtains

dFdt +G = 0, with F (t) =

∫ L0

[12

(∂u∂t

)2

+ c2

2

(∂u∂x

)2

+ ετ u

∂u∂t + ε

2τ2 u2]dx

and G(t) =∫ L

0

[1−ετ

(∂u∂t

)2 + ε c2

τ

(∂u∂x

)2]dx,

(26.6)and one then notices that for 0 < ε < 1 both F and G are equivalent to theenergy, so that the differential inequality implies their exponential decay.

Another way to see the exponential decay of total energy is to decomposefunctions on the basis of eigenvectors of −∂2u

∂x2 with Dirichlet conditions, i.e.en =

√2 sin nπ x

L for n = 1, . . .. Looking for solutions of the form eλ ten(x)gives the equation λ2 + λ

τ + c2n2π2

L2 = 0; for n ≥ L2c πτ the real part of λ is

−12τ and for 1 ≤ n < L

2c πτ the values of λ are real and negative. This showsthat the exponential decay is uniform above some threshold. In elastic bars(which are modelled in a different way), the decay of modes has been studiedby David RUSSELL,3 and as the experimental evidence is not compatible witha dissipative term in ∂u

∂t , he has proposed heuristic convolution terms.

Actually, the model studied with small masses and springs is not re-ally a good model for the motion of a violin string, because I have studied1 The stability permits one to extract a subsequence which converges weakly, or

weakly � in some adapted functional space; using linearity and transpositionfor making the translations act on the test functions, the consistency makes thecorresponding translation operator converge to the transposed partial differrentialoperator.

2 This computation requires enough smoothness for the solution, and the proof thatthe result is indeed true with initial data of finite total energy requires a littlemore care.

3 David L. RUSSELL, American mathematician. He worked at University of Wis-consin, Madison, WI, and at Virginia Tech (Virginia Polytechnic Institute andState University), Blacksburg VA.

Page 229: From hyperbolic systems to kinetic theory: a personalized quest

26 Cauchy: from Masslets and Springs to 2-D Linearized Elasticity 211

longitudinal waves, where the displacements of material points are in the direc-tion of the propagating wave, while the waves in a violin string are transversalwaves, where the displacements of material points are in a direction perpen-dicular to the propagating wave. It would have been better to consider themasses moving in a two-dimensional (or three-dimensional) space, and askthat the mass initially at i � (with � = L

N ) is at the point (xi, yi), but as theincrease in length is

√(xi − xi−1)2 + (yi − yi−1)2 − �, the formulas become

more difficult to study (and one loses the important linearity hypothesis inthe argument of Peter LAX about consistency and stability).

A limitation of many models used for solids is that they only considernearest neighbour interactions, and this is not compatible with the beliefthat particles attract or repel each other depending upon their distances,except for the case of very short-range potentials, which is not what Lennard-Jones potentials are about, for example. If one was studying vibrations aroundan equilibrium position, then it would be like having point i and point jlinked by a spring of weight κi,j , and one would have to consider the potentialenergy

∑i=j

κi,j

2 (xj − xi − (j − i)�)2, and if u was a smooth function and

xi(t) = u(i �, t), then besides quantities proportional to∫ L

0

∣∣∂u∂x

∣∣2 dx one could

well see quantities of the type∫∫

(0,L)×(0,L)|u(x)−u(y)|2w(x, y) dx dy for some

weight function w, and it is worth mentioning that the norms of fractionalSobolev spaces show similar quantities.4

I consider now the two-dimensional linearized elasticity, as studied byCAUCHY. He considered a square lattice with small masslets of size m atthe points (i �, j �) for integers i, j, with springs of strength κ along the hori-zontal and vertical lines, but also springs of strength κ′ along the two diagonaldirections.5 The scaling is now � = L

N and m = m∗N2 , corresponding to a finite

density of mass at the limit N → ∞, but κ = κ∗ independent of N , becauseκm must have the dimension 1

time2 , and time scales as length in order to havea fixed velocity of propagation of waves; another way to interpret the scalingfor κ is that one does not impose forces O(1) at each point of the boundary,but a force per unit of length, i.e. a (two-dimensional) pressure, so that forces

4 For 0 < s < 1 and 1 ≤ p < ∞, the Sobolev space W s,p(RN) is the space of

function u ∈ Lp(RN) such that∫∫

RN×RN

|u(x)−u(y)|p|x−y|N+s p dx dy < ∞. For a bounded

open set Ω ⊂ RN with smooth boundary, one must be careful with boundary

conditions, and for 0 < s ≤ 1p

one has W s,p0 (Ω) = W s,p(Ω), while for 1

p< s < 1

one has W s,p0 (Ω) �= W s,p(Ω), but in the case s = 1

pthere is another natural

space, the Lions–Magenes space W1/p,p00 (Ω), for which the functions extended by

0 outside Ω belong to W 1/p,p(RN), also equal to the space of u ∈ W 1/p,p(I)satisfying u

d1/p ∈ Lp(Ω) where d is the distance to the boundary ∂Ω.5 Without the diagonal springs, the lattice is quite weak, and the infinite lattice

has a family of equilibria, where all the squares become lozenges, without increas-ing the length of any of the springs; with the diagonal springs these equilibriadisappear.

Page 230: From hyperbolic systems to kinetic theory: a personalized quest

212 26 Cauchy: from Masslets and Springs to 2-D Linearized Elasticity

are O(

1N

); for a three-dimensional problem, mass scales as m = m∗

N3 and κ

scales as κ = κ∗N , and forces are O

(1N2

), so that pressure is O(1).

The displacement has two components, denoted u1 and u2, and one usesthe notation uki,j for uk(i �, j �); one assumes that u1 and u2 are smooth func-tions in order to use Taylor expansion for identifying the partial differentialequation governing the motions of the masslets in the limit N → ∞. Onemakes an assumption of linearized elasticity, corresponding to the approxi-mation that the directions of the springs are almost fixed and only the dis-placements in the directions of the springs are felt. Considering the forcesat the point (i �, j �), there are two horizontal forces κ(u1

i+1,j − u1i,j) and

κ(u1i−1,j − u1

i,j), two vertical forces κ(u2i,j+1 − u2

i,j) and κ(u2i,j−1 − u2

i,j), andforces along the diagonals, but it is only for a particular value of κ′

κ that onefinds the behaviour of an isotropic (linearized) elastic material. CAUCHY’sapproach only gave the case λ = μ in the more general family of isotropic(linearized) elastic materials proposed by LAME,6 where the Cauchy stresstensor has the form

σi,j = μ( ∂ui∂xj

+∂uj∂xi

)+ λ δi,j

k

∂uk∂xk

, (26.7)

and the general equations of linearized elasticity are

�∂2ui∂t2

−∑

j

∂σi,j∂xj

= 0 for all i, (26.8)

which in the isotropic case (26.7) give the Lame equation

�∂ui∂t2

− μΔui − (λ+ μ)∂[div(u)]∂xi

= 0 for all i, (26.9)

which imply that both div(u) and curl(u) satisfy wave equations, but withdifferent speeds of propagation; the P-waves (or pressure waves) correspond

to the wave equation for div(u) and have velocity√

2μ+λ� and the S-waves

(or shear waves) correspond to the wave equation for curl(u) and have thevelocity

√μ� , which in practice is smaller.7

[Taught on Wednesday November 7, 2001.]

Notes on names cited in footnotes for Chapter 26, MAGENES.8

6 Gabriel LAME, French mathematician, 1795–1870. He had worked in St. Peters-burg, Russia and in Paris, France.

7 Because most real materials have λ > 0. In seismology, one makes the approxima-tion that the ground is linearly elastic (and even isotropic!), and it is useful thatP-waves travel faster than S-waves, because in earthquakes the P-waves are notdangerous and they signal the danger coming, as it is the S-waves which destroythe buildings which have not been designed carefully.

8 Enrico MAGENES, Italian mathematician, born in 1923. He worked at Universitadi Pavia, Pavia, Italy.

Page 231: From hyperbolic systems to kinetic theory: a personalized quest

27

The Two-Body Problem

The preceding computations, motivated by understanding the magnitude offorces of interaction, have dealt with quite simplified models where only lin-earized elastic effects were involved, so that no large displacement could betaken into account, where there was no temperature, and where a much toosimple crystalline framework was involved,1 but they have shown the differ-ence of order of magnitude in one, two or three dimensions.

The lack of temperature is an important restriction for a model whichis supposed to be realistic, but temperature is actually an equilibrium con-cept, and not much is understood about nonequilibrium situations, but if oneincreases temperature slowly it is reasonable to assume that one will movealong equilibria without noticeable dynamical effects. What is observed forreal materials is that after increasing the temperature of a solid one eventu-ally reaches a critical temperature where there is a change of phase, either atransformation into a solid phase with a different organization of atoms, or atransformation into a liquid phase, and at a higher critical temperature theliquid transforms into a gas.2 An important effect is the latent heat, whichseems to be the energy necessary for breaking bonds, which are either in-terpreted in terms of classical mechanics, in which case one talks about theminimum energy that one needs for escaping from the attraction of a stable1 Crystals are not good for elasticity, and polycrystals, which show different grains

with various crystalline orientations are observed, but not much is understoodabout how grain boundaries move (as it is certainly not a local question!).

2 The experimental physicists who have studied phase transitions have consideredthe various crystalline orientations that a solid may prefer under various condi-tions (of temperature and pressure), the temperature of fusion (or sublimation)of a solid, the temperature of boiling of a liquid, with the latent heat involved,and the question of triple points, where one goes continuously from one phase toanother without latent heat involved (for water, it happens at a temperature of374◦ Celsius and a pressure of 220 bars).

Page 232: From hyperbolic systems to kinetic theory: a personalized quest

214 27 The Two-Body Problem

equilibrium, or in terms of quantum mechanics, where physicists’ ideas alwayslook a little strange.

According to the classical point of view, there are not many bonds leftbetween atoms in a gas, and particles may move quite freely, but one shouldremember that only rare gases are made of individual atoms, and oxygen ornitrogen prefer binary molecules O2 or N2, for example. One basic assumptionin kinetic theory is to avoid molecules, which besides kinetic energy show ro-tational energy and internal energy of vibration (when the distance of the twoatoms forming the molecule varies). Another basic assumption of kinetic the-ory is to consider only binary interactions between particles, and one talks ofcollisions or nearby collisions, and one estimates the probability of such colli-sions, and such a description could only be reasonable for a rarefied gas.3 Fora rarefied plasma, where particles are electrically charged (lighter electronsand heavier ions), one usually considers that there are no collisions at all, andone works with the Vlasov equation,4 coupled with the Maxwell–Heavisideequation, of course, or a simplified version of it, as the Laplace/Poisson equa-tion. Physicists often mention particles which cannot be discerned one fromthe other, and most of the mathematical work in kinetic theory deals with agas made of identical atomic particles, despite the fact that in applicationsmost gases are mixtures of different molecules, but that particles cannot bediscerned is not a bad hypothesis at all, because talking about particles is justan approximation for describing localized waves, and these particles do notreally exist as classical ones.

One considers then only two classical particles rushing towards each otherwithout noticing the crowd of other particles around them, but most of thetime they do not really collide. In the case where particles are rigid spheres ofradius a, the two particles collide only if the distance of their centres becomes≤ 2a at some time; in other words, if a particle is fixed and one wants to knowif another moving particle will collide it, one considers that the moving particlesweeps a circular cylinder of section 4π a2 and will hit the fixed particle onlyif its centre belongs to the cylinder. If particles attract each other with a lawdepending upon their distances, then particles which are too far apart areessentially undisturbed, i.e. they do not acquire much kinetic energy becauseof a close encounter, and one talks about a scattering cross-section (whichwould be π a2 in the case of rigid spheres) by considering the particles whichwould change their direction by more than π

2 , i.e. are reflected backwards.

3 Because one basic assumption in the kinetic theory of gases is that the gas israrefied, I strongly disagree with the physical interpretation of letting the meanfree path between collisions tend to 0, which one calls the fluid dynamical limit,and I suggest considering that as a strictly mathematical problem, because it isbad physics (which probably explains the interest of some political group for thattype of questions). Actually, I conjecture that the Hilbert expansion is false ingeneral, because of the appearance of oscillations.

4 Anatoliı Aleksandrovich VLASOV, Russian physicist, 1908–1975.

Page 233: From hyperbolic systems to kinetic theory: a personalized quest

27 The Two-Body Problem 215

The motion of two particles in a field of central forces was solved long ago inclassical mechanics (while the n-body problem is still not so well understood),and the equations are

m1d2M1

dt2= F1, m2

d2M2

dt2= F2, (27.1)

and with the only information that

F1 + F2 = 0, (27.2)

one already finds that the centre of gravity G, defined by

(m1 +m2)G = m1M1 +m2M2, (27.3)

moves with constant speed, because adding the equations gives

(m1 +m2)d2Gdt2

= 0, so thatdGdt

= constant. (27.4)

If one moves with the centre of gravity, which one then takes as the origin, i.e.G = 0, then the information that the forces are along the line joining the twoparticles leads to the first two Kepler laws; the angular momentum computedat G = 0 is

Ω = m1M1 ∧ dM1

dt+m2M2 ∧ dM2

dt, (27.5)

which is m1m2

(m1 +m2)M1 ∧ dM1dt , so that

dΩdt

= m1M1 ∧F1 +m2M2 ∧F2 = 0, (27.6)

because both F1 and F2 are along M1M2, which are parallel to 0M1 and0M2 by the choice G = 0. If Ω �= 0, i.e. particles are not both moving on aline, then M1 and dM1

dt are in the plane perpendicular to Ω, giving the firstKepler law, that the two particles move in a plane, while the second Keplerlaw that the area swept by GM1 is proportional to time comes precisely fromthe fact that M1 ∧ dM1

dt is a constant vector. The third Kepler law only holdsfor forces in 1

distance2 , that particles follow ellipses (or more generally conicsections) with G a focus with a precise relation between size and period.

I was told that KEPLER had postulated that the planets follow ellipseswith the sun at a focus, and he had needed precise astronomical measurementsfor discovering how the planets moved on these ellipses, like those made byBRAHE.5 When I visited Klaus KIRCHGASSNER in Stuttgart in 1987,6 he had5 Tyge BRAGE (Tycho BRAHE), Danish-born astronomer, 1546–1601. He had

worked in Prague, now capital of the Czech republic.6 Klaus KIRCHGASSNER, German mathematician, born in 1931. He worked in

Stuttgart, Germany.

Page 234: From hyperbolic systems to kinetic theory: a personalized quest

216 27 The Two-Body Problem

told me that BRAHE did not want to give his measurements to KEPLER, whohad then managed to steal them after BRAHE had died. Of course, KEPLER

had wrongly guessed that the sun is at a focus, because if there was only oneplanet it would be the centre of gravity of the sun/planet pair which wouldbe at a focus, but the centre of gravity of the solar system falls inside the sunanyway. The orbits are not exactly ellipses, because there are many planets,and LAGRANGE had been the first to develop a theory of perturbations forstudying that question, which became useful after Uranus had been found in1781 by HERSCHEL in a systematic survey of the sky,7 because its irregularmotion led to the hypothesis that it was perturbed by another planet.8

Actually, the situation considered in kinetic theory is not to think abouttrajectories as ellipses but as hyperbolas (as in the trajectories of somecomets), and consider the limiting velocities before and after a “collision”(which should only be thought of as a near collision). This creates a picture ofa gas which does not allow for the possibility of having particles moving withtheir cohort of satellites, like small solar systems or better like double stars(or multiple ones) if all particles are considered identical. This restriction isrelated to the postulate that the only type of energy considered for a gas istranslational kinetic energy.

In the study of the possible outputs of a “collision”, the timing is usuallynot considered, and it is assumed that two particles with velocities v and wat a point x and at a time t may transform into two particles with velocitiesv′ and w′ in an instantaneous way, at the same point x and at the same timet; moreover one postulates some probability distribution among the outputs.

Because one assumes that all particles are identical, conservation of massis just the fact that two particles colliding give two particles as the output;conservation of linear momentum is equivalent to

v + w = v′ + w′, (27.7)

conservation of angular momentum is automatically verified because the twoparticles are at the same point, before and after the collision, and conservationof kinetic energy is equivalent to

|v|2 + |w|2 = |v′|2 + |w′|2, (27.8)

and one deduces that7 William HERSCHEL, German-born astronomer, 1738–1822. He had worked in

England.8 Both J.C. ADAMS and LE VERRIER successfully applied the theoretical work of

LAGRANGE and found the correct position of that planet, but CHALLIS failedto see it; LE VERRIER was better served by GALLE, and got full credit for thediscovery, and the right to call it Neptune, although it had actually been observedbefore, by LALANDE in 1795, and even by Galileo in 1613.

Page 235: From hyperbolic systems to kinetic theory: a personalized quest

27 The Two-Body Problem 217

|v′ − w′|2 = |v − w|2. (27.9)

Writing v′ = v + aα with a ∈ R and α ∈ R3 with |α| = 1, one must have

w′ = w − aα, and then |w′ − v′|2 = |v − w + 2aα|2 = |v − w|2, so that4a(v − w,α) + 4a2 = 0, and apart from the trivial solution a = 0, one musthave a = (w − v,α), and as the case a = 0 is then obtained by choosing αperpendicular to w − v, the general solution is

for α ∈ S2,

{v′ = v + (w − v,α)αw′ = w − (w − v,α)α , (27.10)

and in particular one has

w′ − v′ = (I − 2α ⊗ α)(w − v). (27.11)

If one defines the angle θ ∈ [0, π] by

(w − v,α) = |w − v| cos θ, (27.12)

then θ = π2 corresponds to v′ = v and w′ = w, which happens if particles

miss each other in the collision, while the case θ = 0 corresponds to v′ = wand w′ = v. In the frame of the centre of gravity G, where v + w = 0, onehas v′ = (I − 2α⊗α)v, so one sees two particles arriving with the velocity ofapproach |w−v|

2 and leaving with the same velocity but in a direction makingan angle 2θ. In the frame linked with the centre of gravity there is a symmetryaround the line of approach of the particles, and therefore one postulates thatthe various angles θ are obtained as outputs with a probability which onlydepends upon |w − v| and θ.

[Taught on Friday November 9, 2001.]

Notes on names cited in footnotes for Chapter 27, J.C. ADAMS,9 LE VER-

RIER,10 GALLE,11 LALANDE.12

9 John Couch ADAMS, English astronomer, 1819–1892. He had worked in Cam-bridge, England.

10 Urbain Jean Joseph LE VERRIER, French astronomer, 1811–1877. He had workedin Paris, France.

11 Johann Gottfried GALLE, German astronomer, 1812–1910. He had worked inBerlin, Germany.

12 Joseph-Jerome LE FRANCOIS DE LA LANDE, French astronomer, 1732–1807. Hehad worked at College de France, Paris, France.

Page 236: From hyperbolic systems to kinetic theory: a personalized quest

28

The Boltzmann Equation

If there were no forces on the particles, the evolution equation for the densityof particles f(x,v, t) would be the free transport equation

∂f

∂t+ v.

∂f

∂x= 0. (28.1)

The presence of collisions transforms this equation into a form

∂f

∂t+ v.

∂f

∂x= Q(f, f), (28.2)

where the nonlinearity Q(f, f) takes into account the disappearance of par-ticles with velocity v by collision against particles with velocity w (creatingparticles with velocities v′ and w′), but also the appearance of particles withvelocity v (by collisions of particles with velocities v′ and w′). Q(f, f) is cho-sen to be quadratic in f , by an argument that the probability of collision ofparticles with velocity v1 and particles with velocity v2 is proportional tof(x,v1, t)f(x,v2, t), the product of the densities of the two types of particles,and this assumes that some independence property holds. I think that thisargument only makes sense for a rarefied gas, where the picture is like thatof hyperbolic orbits of some comets, but if one is not in a rarefied situation,either one thinks in terms of classical mechanics, and the simple descriptionusing f(x,v, t) seems too naive, and it seems natural to add correlations ofposition to the description, or one thinks from a modern point of view wherethere are only waves which in some limiting situation may look as “particles”;one is not in such a limiting situation and one must understand better aboutthe wave nature of these particles that one is dealing with.1

1 Using analogies with my H-measures [18], and their variants, which are quadraticmicro-local objects, the function f(x,v, t) looks like the density of a such a micro-local measure, and if the underlying equation was a linear hyperbolic system inx with a quadratic conservation law, I would expect a linear transport equation

Page 237: From hyperbolic systems to kinetic theory: a personalized quest

220 28 The Boltzmann Equation

Formally

Q(f, f)=∫

R3×S2 k(v,w,α)(f(x,v′, t)f(x,w′, t)−f(x,v, t)f(x,w, t)

)dw dα,

with notation (27.10),(28.3)

i.e. v′ = v + (w− v,α)α and w′ = w− (w− v,α)α, and where the kernel kis nonnegative. Due to symmetries, the kernel k(v,w,α) has the form

k(v,w,α) = K(|v − w|, θ), with notation (27.12), (28.4)

i.e.((w − v),α

)= |w − v| cos θ, and an analytic expression of K can be de-

duced from the precise force law used for attraction (or repulsion) of particles,and more precisely, for an attractive force,

a law in 1distances gives K proportional to |v − w|γ , γ = s−5

s−1 ,

s = 5, γ = 0, giving K(θ), is referred to as Maxwellian molecules,s = +∞, γ = +1, is referred to as the hard-sphere case.

(28.5)

However, the main problem is that the kernel tends to ∞ for θ = π2 , and for

an attractive force,

a law in1

distancesgives a singularity in

1| cos θ|ν , ν =

s+ 1s− 1

. (28.6)

Of course, if θ = π2 then one has f(x,v′, t)f(x,w′, t)− f(x,v, t)f(x,w, t) = 0

(because v′ = v and w′ = w), and therefore one has an indeterminate formin the integrand. One way to avoid this problem is to use the angular cut-offassumption made by Harold GRAD, which consists in changing the kernel nearθ = π

2 so that it becomes integrable in θ.

The Boltzmann equation is postulated, and one should not exaggerate itsimportance and pretend (as too many seem to believe) that starting fromthe Boltzmann equation and deducing by purely formal considerations other(postulated) equations used for describing the behaviour of real fluids, like theEuler equation or the Navier–Stokes equation, gives more credence to theseequations. One may be interested in purely mathematical questions concerningthe Boltzmann equation, and one interesting mathematical question is to avoidmaking the angular cut-off assumption; in doing that it seems that one shouldbe able to estimate

cancellations in the difference f(x,v′, t)f(x,w′, t) − f(x,v, t)f(x,w, t),(28.7)

in (x, ξ) (and there is no problem about denoting this dual variable v), but theunderlying equation should be a semi-linear hyperbolic system in x instead, withthe same kind of quadratic conservation laws, and more general objects than H-measures must be developed for the analysis, i.e. I do not think that one shouldsearch for a nonlinear equation for the density of an H-measure at all.

Page 238: From hyperbolic systems to kinetic theory: a personalized quest

28 The Boltzmann Equation 221

but most of the time one estimates f(x,v′, t)f(x,w′, t) and f(x,v, t)f(x,w, t)independently, so that no cancellation can be studied, and one is led to limitthe strength of the kernel for the purpose of proving results, and this is not avery scientific point of view.

The problem of θ being near π2 is that of grazing collisions, for which

particles only change their velocity very slightly in the interaction, and theresult of many such small changes in velocity is often described by a diffusionin velocity space, giving rise to the Fokker–Planck equation

∂f

∂t+ v.

∂f

∂x− κΔvf = 0. (28.8)

However, some people write a nonlinear Fokker–Planck equation with a diffu-sion depending upon f , so that Maxwellian distributions satisfy it, or derivesuch a nonlinear equation from the Boltzmann equation, which is not very log-ical, as grazing collisions are not well taken care of in the Boltzmann equation.

Because of the invariance of the number of particles in each collision, onededuces that ∫

R3Q(f, f) dv = 0, (28.9)

if the Fubini theorem can be applied, of course; similarly, because v′ + w′ =v + w for each collision, one deduces that

R3vjQ(f, f) dv = 0 for j = 1, 2, 3, (28.10)

and because |v′|2 + |w′|2 = |v|2 + |w|2 for each collision, one deduces that∫

R3|v|2Q(f, f) dv = 0. (28.11)

From these equalities, one deduces conservation laws for fluid quantitiesdefined by integration in v. One defines the density of mass �(x, t) by

�(x, t) =∫

R3f(x,v, t) dv, a.e. x ∈ R

3, (28.12)

the (macroscopic) velocity u(x, t) by

�(x, t)uj(x, t) =∫

R3vjf(x,v, t) dv, for j = 1, 2, 3, a.e. x ∈ R

3, (28.13)

the Cauchy stress tensor σ by

σi,j = −∫

R3

(vi − ui(x, t)

)(vj − uj(x, t)

)f(x,v, t) dv, a.e. x ∈ R

3, (28.14)

Page 239: From hyperbolic systems to kinetic theory: a personalized quest

222 28 The Boltzmann Equation

the internal energy per unit of mass e(x, t) by

�(x, t)e(x, t) =∫

R3

|v − u(x, t)|22

f(x,v, t) dv, a.e. x ∈ R3, (28.15)

the density of total energy E(x, t) by

E(x, t) =�(x, t)|u(x, t)|2

2+ �(x, t)e(x, t), a.e. x ∈ R

3, (28.16)

and the heat flux q(x, t) by

qi(x, t) =∫

R3

(vi − ui(x, t)

) |v−u(x,t)|22 f(x,v, t) dv

for i = 1, 2, 3, a.e. x ∈ R3.

(28.17)

By integrating the Boltzmann equation in v, conservation of mass becomes

∂�

∂t+

3∑

j=1

∂(� uj)∂xj

= 0 in R3, (28.18)

by multiplying the Boltzmann equation by vi and integrating in v, the balanceof linear momentum becomes

∂(� ui)∂t

+3∑

j=1

∂(� uiuj)∂xj

−3∑

j=1

∂σi,j∂xj

= 0 for i = 1, 2, 3, in R3, (28.19)

and by multiplying the Boltzmann equation by |v|22 and integrating in v, the

balance of energy becomes

∂E

∂t+

3∑

j=1

∂(E uj)∂xj

−3∑

i,j=1

∂(σi,jui)∂xj

+3∑

j=1

∂qj∂xj

= 0 in R3, (28.20)

and conservation of angular momentum then follows from the symmetry ofthe Cauchy stress tensor.

There is an important identity which is always valid,

2�(x, t)e(x, t) +3∑

i=1

σi,i(x, t) = 0, (28.21)

and in the case of a gas at local equilibrium, one has

σi,j = −p δi,j , for i, j = 1, 2, 3, where p is the pressure, (28.22)

so that

Page 240: From hyperbolic systems to kinetic theory: a personalized quest

28 The Boltzmann Equation 223

p(x, t) =2�(x, t)e(x, t)

3at equilibrium for such a gas. (28.23)

Property (28.23) is valid for perfect gases, but not for real gases, whose equa-tion of state is not compatible with the preceding relation between �, p ande, so that real gases are not so well described by the Boltzmann equation.

Definition 28.1. A function ϕ defined on R3 is a collision invariant, if it

satisfies

ϕ(v′) + ϕ(w′) = ϕ(v) + ϕ(w), whenever (27.7) and (27.8) are satisfied.(28.24)

Each function ϕ defined by

ϕ(v) = a |v|2 + (b.v) + c for all v ∈ R3, (28.25)

for a, c ∈ R and b ∈ R3 is a collision invariant, and one may wonder if there

are other collision invariants besides those given by (28.24). BOLTZMANN hadshown that if ϕ is of class C1 then every collision invariant has this form, andGRONWALL removed the smoothness hypothesis.2 Below I follow the proofgiven by Clifford TRUESDELL and Robert MUNCASTER in their book [22],3

where they mention that Lennart CARLESON and FROSTMAN included a proofof theirs when they edited the posthumous book of CARLEMAN [1].

Proposition 28.2. Every measurable collision invariant has the form (28.25).

Proof : One looks for

ϕ(v) + ϕ(w) = ψ(v + w, |v|2 + |w|2) for all v, w ∈ R3, (28.26)

and ψ must be measurable on{(u, s) | s ≥ |u|2

2

},4 and as one may add a

constant to ϕ, one assumes that ϕ(0) = 0 so that ψ(0, 0) = 0, and usingw = 0 gives

ψ(v, |v|2) = ϕ(v), so that ψ(v, |v|2) + ψ(w, |w|2) = ψ(v + w, |v|2 + |w|2)for all v, w ∈ R

3.(28.27)

2 Thomas Hakon GRONWALL, Swedish-born mathematician, 1877–1932. He hadworked as an engineer, then at Princeton University, Princeton, NJ, and atColumbia University, New York, NY.

3 Robert Gary MUNCASTER, American mathematician, born in 1948. He works atUniversity of Illinois, Urbana-Champaign, IL.

4 If w = u−v, then |v|2 + |u−v|2 is minimum for v = u2

so that ψ is only evaluated

at points (u, s) with s ≥ |u|22

. That ψ is measurable can be seen from an explicitchoice, for example if u �= 0 by taking v = a u, w = (1−a)u, with 2a2−2a+1 = s

|u|2

and choosing the root a ≥ 12, and if u = 0 by taking v = −w =

√s√2e for a fixed

unit vector e.

Page 241: From hyperbolic systems to kinetic theory: a personalized quest

224 28 The Boltzmann Equation

Using w = −v gives

ψ(0, 2|v|2) = ψ(v, |v|2) + ψ(−v, |v|2) for all v, w ∈ R3, (28.28)

and then in the case where v.w = 0, one deduces that

ψ(0, 2|v|2 + 2|w|2) = ψ(0, 2|v + w|2)= ψ(v + w, |v + w|2) + ψ(−v − w, |v + w|2)

= ψ(v + w, |v|2 + |w|2) + ψ(−v − w, |v|2 + |w|2)= ψ(v, |v|2) + ψ(w, |w|2) + ψ(−v, |v|2) + ψ(−w, |w|2)

= ψ(0, 2|v|2) + ψ(0, 2|w|2),

(28.29)

so thatψ(0, a) + ψ(0, b) = ψ(0, a+ b) for all a, b ≥ 0, (28.30)

and it is classical that (28.30) implies that there exists a constant C such that

ψ(0, a) = C a for all a ≥ 0. (28.31)

One defines the (measurable) function g by

g(v) = ψ(v, |v|2) − ψ(0, |v|2), for all v ∈ R3, (28.32)

so that g is odd by (28.28), and additive on orthogonal pairs by (28.27) and(28.31), and it remains to show that g is additive.

Let m and n be unit vectors which are orthogonal, then

g(α2m+ αβ n) = g(α2m) + g(αβ n)g(β2m− αβ n) = g(β2m) − g(αβ n), (28.33)

but as α2m± αβ n and β2m∓ αβ n are orthogonal one deduces that

g((α2 + β2)m

)= g(α2m+ αβ n) + g(β2m− αβ n) = g(α2m) + g(β2m)

g((α2 − β2)m

)+ g(2αβ n) = g(α2m+ αβ n) + g(−β2m+ αβ n)

= g(α2m) − g(β2m) + 2g(αβ n),(28.34)

and as g(2x) = 2g(x) by the preceding case, one deduces that

g((α2 − β2)m

)= g(α2m) − g(β2m). (28.35)

This shows that g(x + y) = g(x) + g(y) if x and y are parallel. Then for twoarbitrary vectors v and w one writes w = αv + z with z orthogonal to v andtherefore g(v+w) = g

((1 +α)v

)+ g(z) = g(v) + g(αv) + g(z) = g(v) + g(w),

and the classical result then implies that g is linear. ��Another important observation of BOLTZMANN is that∫

R3×R3f(x,v, t) log

(f(x,v, t)

)dx dv is nonincreasing with time, (28.36)

Page 242: From hyperbolic systems to kinetic theory: a personalized quest

28 The Boltzmann Equation 225

and this follows from ∫

R3Q(f, f) log f dv ≤ 0, (28.37)

which is proven by showing that∫

R3×R3 K(|v − w|, θ)(f(v′)f(w′) − f(v)f(w))log f(v) dv dw ≤ 0

for x, t,α given.(28.38)

One observes that the kernel is invariant by the exchange of v and w, andalso by the change of variables (v,w) �→ (v′,w′) and that dv′ dw′ = dv dw,so that the integral considered is

−14

∫R3×R3 K(|v − w|, θ)(f(v′)f(w′) − f(v)f(w)

)(log f(v′) + log f(w′) − log f(v) − log f(w)

)dv dw,

(28.39)

and then one uses the fact that K ≥ 0 and that with a = f(v′)f(w′) andb = f(v)f(w) one has (log a− log b)(a− b) ≥ 0 (if a, b > 0, and by continuityif a or b is 0).

One also deduces from the preceding computation that

if∫

R3×R3 f(x,v, t) log(f(x,v, t)

)dx dv is constant

then f(v′)f(w′) − f(v)f(w) = 0,(28.40)

because (log a − log b)(a − b) = 0 implies a = b (and one has assumed thatK > 0); this means that log f is a collision invariant, and therefore that

log f(x,v, t) = a(x, t)|v|2 + (b(x, t).v) + c(x, t), i.e. f is a local Maxwellian(28.41)

which implies Q(f, f) = 0, and one needs to have a < 0 in order to have fintegrable in v, so that �(x, t) is defined. Of course, if

Q(f, f) = 0 implies∫

R3×R3f(x,v, t) log

(f(x,v, t)

)dx dv = constant,

(28.42)so that the two conditions are “equivalent” (one should check that integra-bility properties of the solution f are sufficient for the Fubini theorem to beapplicable for proving that this “equivalence” is true).

If Q(f, f) = 0 then f satisfies a free transport equation, and thereforeg(x,v, t) = log f(x,v, t) = a(x, t)|v|2 +(b(x, t).v)+ c(x, t) also satisfies a freetransport equation. In the expression of ∂g∂t +v. ∂g∂x = 0, the coefficient of vi|v|2is ∂a

∂xi= 0, the coefficient of vivj is ∂bi

∂xj+ ∂bj

∂xi= 0 if i �= j and ∂bi

∂xi+ ∂a

∂t = 0

if i = j, the coefficient of vi is ∂bi

∂t + ∂c∂xi

= 0, and the constant coefficient is∂c∂t = 0. One deduces that a is a function of t alone and c is a function of xalone. Using the identity

2 ∂2bi

∂xj∂xk= ∂

∂xj

(∂bi

∂xk+ ∂bk

∂xi

)− ∂

∂xi

(∂bk

∂xj+ ∂bj

∂xk

)+ ∂

∂xk

(∂bj

∂xi+ ∂bi

∂xj

)

for all i, j, k = 1, 2, 3,(28.43)

Page 243: From hyperbolic systems to kinetic theory: a personalized quest

226 28 The Boltzmann Equation

one finds that ∂2bi

∂xj∂xk= 0 for all i, j, k = 1, 2, 3, because a depends only upon t,

and therefore one has b(x, t) = M(t)x+b0(t) and M(t)+MT (t)+2 dadt I = 0.Using dM

dt x + db0

dt + gradxc = 0, one deduces that dMdt is symmetric and

independent of t and that db0

dt is independent of t. One has dMdt + d2a

dt2 I = 0, sothat M(t) = − da

dt I+N with N independent of t and satisfying NT +N = 0; amust be a quadratic in t and b0 affine in t, and then c(x) = 1

2d2adt2 |x|2+ db0

dt .x+c0 for a constant c0. All this shows that g(x,v, t) is a linear combination of|x− tv|2, (x− tv).v, xivj−xjvi for all i �= j, |v|2, vi for all i and 1 (solutionsof the free transport equation must have the form h(x−tv,v), and one shouldnotice that xivj − xjvi can be written as (xi − vit)vj − (xj − vjt)vi).

Of course, if f is a stationary solution of the Boltzmann equation, i.e.a solution independent of t, then it must be a global Maxwellian, f(x,v) =ea |v|

2+(b.v)+c with a < 0, and it is useful to relate the coefficients a,b, c to themacroscopic quantities �,u, e defined by � =

∫R3 f(v) dv, �u =

∫R3 v f(v) dv

and � e =∫

R3|v−u|2

2 f(v) dv. From∫

Re−π x

2dx = 1 one deduces by a change

of variable that∫

Re−ax

2dx =

√πa for a > 0, and by an integration by parts

that∫

Rx2e−ax

2dx = 1

2a

√πa for a > 0; one deduces that

R3e−a |w|2 dw =

π3/2

a3/2, and

R3|w|2e−a |w|2 dv =

3π3/2

2a5/2for a > 0. (28.44)

One may then write the global Maxwellian distribution as

f(v) = �a3/2

π3/2e−a |v−u|2 , and it gives e =

32a. (28.45)

In this model, one has σi,j = −p δi,j and p = 2� e3 , and if one uses the

relationde+ p d

(1�

)= T ds, (28.46)

where T is the absolute temperature and s the entropy per unit of mass, thisgives de− 2e d�

3� = T ds, so that 1T is an integrating factor of de− 2e d�

3� , and oneof these integrating factors is 1

e , giving s0 = log e − 23 log � + constant (and

the other multiplying factors are then of the form ϕ(s0)e ). What BOLTZMANN

found is that one may define s by

� s =∫

R3f(v) log

(f(v))dv. (28.47)

Indeed, it is∫

R3 �a3/2

π3/2 e−a |v−u|2(log � + 3

2 log a − 32 log π − a |v − u|2) dv =

�(log � + 3

2 log a − 32 log π − 1

2

), so that s = log � + 3

2 log a − 32 log π − 1

2 =23s0 + constant. As the unit of temperature was already chosen, the term

Page 244: From hyperbolic systems to kinetic theory: a personalized quest

28 The Boltzmann Equation 227

a |v−u|2 in the exponential is written as 1k T

|v−u|22 , where k is the Boltzmann

constant.5

Based on the knowledge of equilibrium solutions for the Boltzmann equa-tion, with or without exterior force potentials, BOLTZMANN devised the rulesnow used in statistical physics; one only considers systems in thermal equi-librium in this framework, and one postulates that the state of a system isindexed by the absolute temperature T , and the rule says that there is a“probability” to find the system in a state of energy W , which is proportionalto exp

(− WkT

). Of course, the basic rule of this game makes no sense but for

large systems whose parts are connected enough to interact and settle quicklyto a unique temperature.6

[Taught on Monday November 12, 2001.]

5 k = 1.3807 10−23 joule kelvin−1; the joule is the unit for energy, newton metre,or kilogram metre2 second−2.

6 Specialists of plasma physics have observed that in their experiments lighter elec-trons tend to settle quickly to some temperature, while heavier ions tend to settlequickly to another temperature, and their experiments do not last long enoughfor these two temperatures to come together.

Page 245: From hyperbolic systems to kinetic theory: a personalized quest

29

The Illner–Shinbrot and the HamdacheExistence Theorems

In the early 1980s, I had asked my student Kamel HAMDACHE to try to extendto the Boltzmann equation the method that I had created for some discretevelocity models in one space dimension, namely use functions satisfying 0 ≤f(x,v, t) ≤ F (x − tv,v), and the problem was to discover a good class offunctional spaces for the functions F so that a fixed point argument could beused for small initial data.

The first to obtain a result in this direction were Reinhard ILLNER andSHINBROT,1 who in 1983 treated the case K(|v−w|, θ) = |v−w|κ(θ) with κintegrable, corresponding to hard spheres; their choice was to take F (x,v) =e−α |x|2 , i.e. Maxwellians in x (instead of the classical Maxwellians in v), andthey proved global existence for small (nonnegative) initial data.

Then Kamel HAMDACHE extended their result by considering F (x,v) =e−α |x|2h(v) for h ∈ Lp with p �= ∞; in the case of forces in 1

distances withangular cut-off, he was able to treat the case of small (nonnegative) initialdata and prove global existence for s > 7

3 (the value of p depending upon s).In the summer of 1984, at a meeting in Santa Fe, NM, I checked with him

that, without the hypothesis of smallness for the (nonnegative) initial data,one can prove a local existence theorem, which requires 2 < s < ∞ (we didnot publish this result).

Then Kamel HAMDACHE extended the method to the case with diffusionin x or diffusion in v (the Fokker–Planck equation), in such a way that hecould let the diffusion coefficient tend to 0 and recover the results withoutdiffusion, but his solution is more technical in that case and it uses a family ofexplicit solutions which are exponentials of quadratic functions in (x,v); heremarked that in the case of the Boltzmann equation, the choice of F is suchthat f(x,v, t) = F (x − tv,v) satisfies both Q(f, f) = 0 and the Boltzmann

1 Marvin SHINBROT, American-born mathematician, 1928–1987. He had worked inVictoria, British Columbia.

Page 246: From hyperbolic systems to kinetic theory: a personalized quest

230 29 The Illner–Shinbrot and the Hamdache Existence Theorems

equation, and having changed the linear part in order to include diffusionterms, he had to use a class of explicit solutions of the linear equation.

I shall sketch the basic idea behind the computations of Reinhard ILLNER

& SHINBROT and of Kamel HAMDACHE.

One considers an iterative method f (n) �→ f (n+1) defined by the equation

∂f(n+1)

∂t + v.∂f(n+1)

∂x

=∫

R3×S2 K(|v − w|, θ)(f (n)(v′)f (n)(w′) − f (n+1)(v)f (n)(w))dw dα,

(29.1)with f (n+1)(x,v, 0) = g(x,v), and this method is chosen because if g ≥ 0 andf (n) ≥ 0 then one has f (n+1) ≥ 0; indeed, f (n+1) satisfies a linear equation

∂f (n+1)

∂t+ v.

∂f (n+1)

∂x+ a(n)f (n+1) = b(n), with f (n+1)(x,v, 0) = g(x,v),

(29.2)and g ≥ 0 with b(n) ≥ 0 implies f (n+1) ≥ 0; that this is the case if f (n) ≥ 0follows from

b(n)(x,v, t) =∫

R3×S2K(|v − w|, θ)f (n)(x,v′, t)f (n)(x,w′, t) dw dα. (29.3)

The sign of a(n) is not so important for proving that f (n+1) ≥ 0, but it isuseful for obtaining an upper bound for f (n+1), and indeed f (n) ≥ 0 impliesa(n) ≥ 0, because

a(n) =∫

R3×S2K(|v − w|, θ)f (n)(w) dw dα. (29.4)

One deduces that 0 ≤ f (n+1) ≤ ϕ(n+1), where ϕ(n+1) is the solution of

∂ϕ(n+1)

∂t+ v.

∂ϕ(n+1)

∂x= b(n) with ϕ(n+1)(x,v, 0) = g(x,v), (29.5)

and ϕ(n+1) is given explicitly by

ϕ(n+1)(x,v, t) = g(x − tv,v) +∫ t

0 b(n)(x − (t− s)v,v, s) ds

= g(x − tv,v) +∫

R3×S2 K(|v − w|, θ)(∫ t0f (n)(x − (t− s)v,v′, s)f (n)(x − (t− s)v,w′, s) ds

)dw dα.

(29.6)

One wants to find a function F such that if 0 ≤ f (n)(x,v, t) ≤ F (x − tv,v)then one has 0 ≤ f (n+1)(x,v, t) ≤ F (x − tv,v); of course, one will also needthe mapping f (n) �→ f (n+1) to be a strict contraction in an adapted norm,but that is essentially the same type of estimate which is needed. Of course,it is enough to show that ϕ(n+1)(x,v, t) ≤ F (x − tv,v), and because

Page 247: From hyperbolic systems to kinetic theory: a personalized quest

29 The Illner–Shinbrot and the Hamdache Existence Theorems 231

ϕ(n+1)(x,v, t) ≤ g(x − tv,v) +∫

R3×S2 K(|v − w|, θ)(∫ t0F (x − (t− s)sv − sv′,v′)F (x − (t− s)sv − sw′,w′) ds

)dw dα,

(29.7)it is enough to find F satisfying

g(ω,v) +∫

R3×S2 K(|v − w|, θ)(∫ t0 F (ω + s (v − v′),v′)F (ω + (v − sw′),w′) ds

)dw dα ≤ F (ω,v).

(29.8)One now chooses

F (ω,v) = e−α |ω|2h(v), with α > 0, (29.9)

and one notices that

F (ω+s (v−v′),v′)F (ω+s (v−w′),w′) = h(v′)h(w′)e−α |ω|2e−α |ω−s (v−w)|2 ,(29.10)

because

|ω + s (v − v′)|2 + |ω + (v − w′)s|2 = |ω|2 + |ω − (v − w)s|2. (29.11)

One deduces that

if g(ω,v) ≤ e−α |ω|2g0(v), one must find h such thatg0(v)+

∫R3×S2 K(|v−w|, θ)h(v′)h(w′)

(∫ t0 e

−α |ω−s (v−w)|2 ds)dw dα≤h(v).

(29.12)Then, for a unit vector e parallel to v − w, one uses2

∫ t

0

e−α |ω−s (v−w)|2 ds≤∫ ∞

0

e−α |ω−s (v−w)|2 ds=1

|v−w|∫ ∞

0

e−α |ω−s e|2 ds,

(29.13)(and +∞ if w = v), and the supremum of

∫∞0e−α |ω−s e|2 ds is obtained by

letting ω tend to +∞ in the direction of e, and the supremum is∫

Re−αx

2dx =√

πα .In the case considered by Reinhard ILLNER and SHINBROT, where K(|v−

w|, θ) = |v − w| k(θ) with k integrable, if g0(v) ≤ β0 then one may takeh(v) = β with β ≤ β0 + C β2, with C =

√πα

∫R3×S2 k(θ) dw dα.

It is a purely mathematical problem to consider a gas filling out the wholespace and to wonder about what happens to such a gas which at time 0 hasa finite mass, finite momentum and finite kinetic energy, but an importantfeature for real gases is that they must be contained,3 and the boundaryconditions are important.

2 If one only looks at local existence, then one also uses∫ t

0e−α |ω−s (v−w)|2 ds ≤ t.

3 One may consider that the atmosphere around the earth is not contained andthat indeed a few particles escape the earth’s gravitational field.

Page 248: From hyperbolic systems to kinetic theory: a personalized quest

232 29 The Illner–Shinbrot and the Hamdache Existence Theorems

For a discrete velocity model like the Broadwell model, restricted to hav-ing x ∈ (0, L), one may consider a purely mathematical question like pe-riodic solutions in space, i.e. u(L, t) = u(0, t) and v(L, t) = v(0, t), whichis like considering (0, L) as a circle, but more realistic boundary conditionsare u(0, t) = v(0, t) and u(L, t) = v(L, t), which express the fact that par-ticles bounce on the boundary of the interval. In a three-dimensional set-ting, this is the case of specular reflection, whose expression involves the(exterior) normal n; if (v,n) > 0 the particle is hitting the boundary, butif (v,n) < 0 it is coming from the boundary, and the particle hitting theboundary with velocity vin comes out instantaneously with velocity vout,and conservation of energy gives |vin| = |vout| and the change in momen-tum vin − vout is considered to be parallel to n,4 so that the formula isvout = (I − 2n⊗ n)vin, equivalent to vin = (I − 2n⊗ n)vout. The boundarycondition is then f(x,v, t) = f(x, (I − 2n⊗ n)v, t) for all v.

MAXWELL had already imagined another type of boundary condition, thatthe particles hitting the boundary are first absorbed by the boundary and thenare (immediately) re-emitted by the boundary in all directions, accordingto Lambert’s law,5 and with the distribution in velocity of the Maxwelliandistribution corresponding to the temperature of the boundary. Reality seemsto be between these two extremes.6

[Taught on Wednesday November 14, 2001.]

4 The exchanges of momentum by all these particles hitting the boundary are re-sponsible for the pressure.

5 Johann Heinrich LAMBERT, French-born mathematician, 1728–1777. He hadworked in Berlin, Germany.

6 At a meeting in Grado, Italy, in 1986, I heard about an experiment which hadbeen done on the space shuttle, for which particles had a very high velocity andarrived all with the same incidence on a plate, and were reflected in variousdirections; the highest probability of reflection was near the specular reflection,but a few particles were reflected in quite odd directions. The explanation seemsto be that particles may enter inside the boundary and interact with the atomsthere, and this process might be very sensitive to the velocity of the particles, theangle of incidence, and the nature of the material of which the boundary is made;physicists involve questions of quantum mechanics in these calculations, and oneis then reminded that particles are just localized waves anyway; of course, onesees that one should not expect the nonspecular reflections to be instantaneous.

Page 249: From hyperbolic systems to kinetic theory: a personalized quest

30

The Hilbert Expansion

There is a formal procedure, called the Hilbert expansion, which considersthe Boltzmann equation with a small parameter ε, often called the mean freepath between collisions. Another parameter is used in bounded domains, theKnudsen number,1 which is a dimensionless number, the ratio of a charac-teristic length of the container to the mean free path between collisions. Oneconsiders

∂f

∂t+ v.

∂f

∂x=

1εQ(f, f), (30.1)

and the Hilbert expansion postulates that

f(x,v, t) = f0(x,v, t) + ε f1(x,v, t) + ε2f2(x,v, t) + . . . , (30.2)

and formally one finds that Q(f, f) = 0, so that f is a local Maxwellian, andthe macroscopic parameters solve the Euler equation, for an ideal fluid. Avariant of this formal procedure, the Chapman–Enskog procedure, makes theNavier–Stokes equation appear (with a small viscosity).

Of course, one should always be careful with formal expansions, becausethere is no good reason to believe that the solution will appear the way thatone postulates, and it may happen that the expansion is valid in some casesbut not in others; actually, it is known that there are boundary layer effects to

1 Martin Hans Christian KNUDSEN, Danish Physicist, 1871–1949. He had workedin Copenhagen, Denmark.

Page 250: From hyperbolic systems to kinetic theory: a personalized quest

234 30 The Hilbert Expansion

consider too, either near the boundary or near the initial time.2,3 I conjecturethat there might be oscillation effects in some cases, whose presence wouldrender the expansion wrong.4 Anyway, letting ε tend to 0 is only a mathe-matical question, because the assumptions used for deriving the Boltzmannequation were that the gas was rarefied, and that pairs of particles could in-teract without being bothered by other particles, i.e. that there were onlytwo-body problems to consider and no n-body problems with n ≥ 3.

One has

Q(f, f) = Q(f0, f0) + 2εB(f0, f1) + ε2(Q(f1, f1) + 2B(f0, f2)

)+ . . . , (30.3)

where B is the symmetric bilinear mapping defining the quadratic mappingQ. The only term in ε−1 in the equation, is Q(f0, f0), and one is led to imposethat

Q(f0, f0) = 0, so that f0(x,v, t) = �(x, t)a(x,t)3/2

π3/2 e−a(x,t) |v−u(x,t)|2

with a(x, t) = 12k T (x,t) , or e(x, t) = 3

2a(x,t) .(30.4)

Looking at the terms in ε0, one deduces that

∂f0

∂t+ v.

∂f0

∂x= 2B(f0, f1), (30.5)

and the problem is to find an equation for f0 alone. One observes that, what-ever f1 is, one has

∫R3 B(f0, f1) dv = 0∫R3 viB(f0, f1) dv = 0 for i = 1, 2, 3,∫R3 |v|2B(f0, f1) dv = 0,

(30.6)

2 One may start from initial data which are not local Maxwellians, and in theBroadwell model it means that u0v0 − w2

0 �= 0 on a set of positive measure. Inthat case, the intuition is that, because of the factor 1

ε, there is a boundary layer

in time where the transport does not play any role (at least for the first term). Forthe Broadwell model, it means that one studies the ordinary differential equationdudt

= dvdt

= − dwdt

= −u v + w2, u(0) = a ≥ 0, v(0) = b ≥ 0, w(0) = c ≥ 0, with anaccelerated time, and as u+ v+w = a+ b+ c and u− v = a− b, one can solve thesystem explicitly by a quadrature, but because u log(u) + v log(v) + 2w log(w)is a Lyapunov function which stops decreasing only where u v −w2 = 0, one cancompute the limit as t→ ∞ without writing the solution.

3 Aleksandr Mikhailovich LYAPUNOV, Russian mathematician, 1857–1918. He hadworked in Kharkov and in St Petersburg, Russia, and in Odessa (then in Russia,now in Ukraine).

4 Some people have shown that the expansion is valid under some assumptions,but if there were oscillations their assumptions would not hold, and it would notmean that their proofs are wrong (i.e. they are not proofs), but that their resultsmay not be applicable in some cases. In other words, these statements say thatif there are no problems, then everything is OK, and there are different ways toexpress the hypothesis that there are no problems.

Page 251: From hyperbolic systems to kinetic theory: a personalized quest

30 The Hilbert Expansion 235

as these terms are the coefficients of ε in the identities∫

R3 Q(f, f) dv = 0∫R3 viQ(f, f) dv = 0 for i = 1, 2, 3,∫R3 |v|2Q(f, f) dv = 0.

(30.7)

One then multiplies (30.5) by 1, by vi and by |v|2 and one integrates in v,and this gives equations satisfied by the macroscopic quantities �, u, and e; inthese equations the pressure p appears, defined by 2� e = 3p, because the factthat f0 is a function of |v−u|2 gives σi,j = −p δi,j for i, j = 1, 2, 3; one shouldnotice that it also gives the heat flux q = 0. This shows that the macroscopicquantities defined by f0 satisfy the Euler equation with the equation of state2� e = 3p, i.e.

∂�

∂t+

3∑

j=1

∂(� uj)∂xj

= 0, (30.8)

for conservation of mass,

∂(� ui)∂t

+3∑

j=1

∂(� ui uj)∂xj

+∂p

∂xi= 0 for i = 1, 2, 3, (30.9)

for the balance of linear momentum and

∂E

∂t+

3∑

j=1

∂((E + p)uj)∂xj

= 0, (30.10)

for the balance of energy, where E = �( |u|2

2 + e).

The limiting problem is a quasi-linear hyperbolic system of conservationlaws, and one knows that discontinuities may happen in finite time for thiskind of equation, if the data are too large for example; however, in spacedimension > 1 there is a dispersion effect which may win over the nonlineartendency of creating discontinuities (shocks or contact discontinuities), andthere are small smooth data for which the solution exists for all time andstays smooth. For some situation of this kind, Takaaki NISHIDA has shownthat the solution of the Boltzmann equation exists for all time and convergesas ε tends to 0 to the (smooth) solution of the Euler equation.5

Russell CAFLISCH and George PAPANICOLAOU have worked out the analo-gous result for the (one-dimensional) Broadwell model, but for the finite time5 In order to give a meaning to such comparisons, one associates to a functionf defined on R

3 × R3 × (0,∞) its moments � =

∫R3 f dv, �u =

∫R3 f v dv,

� |u|22

+ � e =∫

R3 f|v|22dv, which give macroscopic quantities �, u, e which one

may compare to those appearing in the Euler equations, and to three functions�, u, e defined on R

3 × (0,∞) one associates the local Maxwellian f having thesecharacteristics, which one may compare to the one appearing in the Boltzmannequation.

Page 252: From hyperbolic systems to kinetic theory: a personalized quest

236 30 The Hilbert Expansion

where a solution of the quasi-linear hyperbolic system of conservation lawshas a smooth solution. For the case of a Riemann problem giving rise to asingle shock solution of the quasi-linear hyperbolic system, Russell CAFLISCH

has tried without success to perform the same analysis and show that a solu-tion of the Broadwell model does exist and converges as ε tends to 0 to thediscontinuous solution of the quasi-linear hyperbolic system of conservationlaws. I have suggested that as ε tends to 0 the sequence of solutions mightdevelop oscillations and might converge only in a weak topology to a differentfunction, in which case some effective equation would have to be discoveredand studied.

The Chapman–Enskog procedure is slightly different from the Hilbert ex-pansion, and creates the (compressible) Navier–Stokes equation, where theCauchy stress tensor is given by

σi,j = 2μ εi,j − p δi,j , with εi,j =12

( ∂ui∂xj

+∂uj∂xi

)for i, j = 1, 2, 3, (30.11)

and the viscosity μ is > 0; when μ tends to 0 one formally finds the Eu-ler equation again, but flows with small μ (or high Reynolds number) mayshow turbulent effects,6 and an effective equation for turbulent flows is notknown; although one should always be careful not to exchange the order oflimits without first proving that one is allowed to do so, it lends credence tothe possibility of oscillations in the sequence of solutions of the Boltzmannequation when ε tends to 0.

Most mathematicians working on the Navier–Stokes equation nowadaysuse a simplified incompressible model,7 following the pioneering work of JeanLERAY in the 1930s, followed in the 1950s by Eberhard HOPF and by OlgaLADYZHENSKAYA. In three space dimensions, global existence of smooth so-lutions for the incompressible Navier–Stokes equation is conjectured,8 and for6 Osborne REYNOLDS, Irish-born mathematician, 1842–1912. He had worked in

Manchester, England.7 The simplification comes from the fact that � and μ are independent of the

temperature, and therefore one may solve the equation for u independently of theequation for balance of energy. Incompressibility is expressed by � = constant,which implies div(u) = 0; the condition div(u) = 0 is also true for a mixture offluids if each one is incompressible but the fluids are not miscible, because d�

dt= 0

in this case (as usual, ddt

= ∂∂t

+∑3

j=1uj

∂∂xj

).8 Jean LERAY seems to have thought that singularities may appear, and that this

was related to turbulent flows, but turbulence has not much to do with regularity(or with letting t tend to ∞ as many deluded mathematicians think), but hasbeen related to fluctuations in velocity at least since REYNOLDS. Jindrich NECAS

told me at some time that he thought that singularities do occur, but later thathe was not so sure anymore. I believe that solutions stay smooth, but I insistthat it is a mathematical problem without much physical relevance, because themathematical difficulty is that there could be large gradients that one does not

Page 253: From hyperbolic systems to kinetic theory: a personalized quest

30 The Hilbert Expansion 237

the Euler equation it was usually thought that singularities would appear ina finite time (but it is not so clear now that it is so); in the early 1980s,Shmuel KANIEL had proposed an approach,9 which he was not able to followcompletely, for proving smoothness of solutions, and one interesting feature(which I was hearing for the first time then) was to create a kinetic equationwith equilibria described by rectangular curves instead of Gaussian curves,namely

f(x,v, t) = a(x, t) if |v − u(x, t)| ≤ r(x, t), andf(x,v, t) = 0 if |v − u(x, t)| > r(x, t); (30.12)

one deduces that

� =∫|v−u|≤r a dv = 4π a r3

3 , so that∫|v−u|≤r av dv = �u

� e =∫|v−u|≤r

|v−u|22 a dv = 2π a r5

5 , so that e = 3r2

10 .(30.13)

This idea was later used with more success in one dimension for proving theexistence of some quasi-linear hyperbolic systems of conservation laws, byPierre-Louis LIONS, Benoıt PERTHAME & Eitan TADMOR.10,11

[Taught on Friday November 16, 2001.]

Notes on names cited in footnotes for Chapter 30, NECAS.12

know how to control, but that would mean a lot of energy dissipated by viscosity,and in a real fluid it would make the temperature increase, and therefore theviscosity would decrease and evacuation of the heat would become easier then,and this realistic scenario (for which the flow may look turbulent) cannot occurin the mathematical problem where the equation of balance of energy has beendecoupled, because the viscosity has been chosen to be independent of tempera-ture.

9 Shmuel KANIEL, Israeli mathematician. He works at The Hebrew University,Jerusalem, Israel.

10 Benoıt PERTHAME, French mathematician. He worked in Orleans, and works nowat Universite Paris VI (Pierre et Marie CURIE), Paris, France.

11 Eitan TADMOR, Israeli-born mathematician. He has worked at UCLA (Universityof California at Los Angeles), Los Angeles, CA, and at University of Maryland,College Park, MD.

12 Jindrich NECAS, Czech-born mathematician, 1929–2002. He had worked at North-ern Illinois University, De Kalb, IL, and at Charles University, Prague, first inCzechoslovakia, then capital of the Czech Republic.

Page 254: From hyperbolic systems to kinetic theory: a personalized quest

31

Compactness by Integration

In proving existence for some problems of transport, there is an interestingeffect of compactness by integration, called the averaging lemma, which wasfirst mentioned to me by Benoıt PERTHAME as a question, which he solvedafterward,1 that if a sequence fn converges weakly to f∞, and is such that∂fn

∂t + v.∂fn

∂x is nice enough, and all fn are 0 outside a compact set in v, then�n defined by �n(x, t) =

∫fn(x,v, t) dv in R

N × R converges strongly to �∞defined by �∞ =

∫f∞ dv.

This result cannot be proven with the compensated compactness ideasthat I had developed with Francois MURAT (because they are restricted topartial differential equations with constant coefficients), but it reminded me ofa result of Lars HORMANDER concerning a class of hypoelliptic operators thathe had introduced,2 because an example of his general theory is that f ∈ L2

loc,∂f∂t +v.∂f∂x ∈ L2

loc and ∂f∂vi

∈ L2loc for i = 1, . . . , N implies that f ∈ H

1/2loc ; for the

particular class of operators considered by Lars HORMANDER, the regularitydepends upon the number of levels of commutators that one needs to computein order to generate derivatives in all directions, and in the example one has[ ∂∂vi, ∂∂t + v. ∂∂x ] = ∂

∂xifor i = 1, . . . , N .3

I thought that the lack of information on the partial derivatives in v wasbalanced by an integration in v instead, and a precise mathematical result uni-fying the two types of results was obtained later by Patrick GERARD,4 using1 It then appeared as a joint work of Francois GOLSE, Pierre-Louis LIONS, Benoıt

PERTHAME and Remi SENTIS.2 A (linear) differential operator P (x,D) is hypoelliptic if, when P (x,D)u = f andf is of class C∞ in an open set ω, then u is necessarily of class C∞ in ω.

3 Another example in R2 is that u ∈ L2

loc, ∂u∂x

∈ L2loc and xm ∂u

∂y∈ L2

loc im-

ply u ∈ H1/(m+1)loc (if m is a nonnegative integer); here one has [ ∂

∂x, xm ∂

∂y] =

xm−1 ∂∂y, . . . , [ ∂

∂x, x ∂

∂y] = ∂

∂y. Using a partial Fourier transform in y, one easily

proves the same result for any nonnegative real m.4 Patrick GERARD, French mathematician, born in 1961. He works at Universite

Paris-Sud, Orsay, France.

Page 255: From hyperbolic systems to kinetic theory: a personalized quest

240 31 Compactness by Integration

his micro-local defect measures, which are almost the same objects which I hadcalled H-measures (but he defined them independently), the difference beingthat he had developed his theory for functions with values in a Hilbert space(for applying it to L2 in the variable v), while the only examples which I hadthought of were of finite dimensions. I had actually tried to use my H-measuresfor proving results of compactness by integration, without success, but whatI had tried was different from the idea that Patrick GERARD used, and Ichecked afterward that his line of proof works with H-measures, i.e. it is notnecessary to develop a theory for functions with values in infinite-dimensionalHilbert spaces. However, this approach is not good enough for finding a moreprecise result of Pierre-Louis LIONS, that � belongs to a fractional Sobolevspace.

One can avoid the general theory of Lars HORMANDER in some cases, likeour example with information on f, ∂f∂t + v.∂f∂x ,

∂f∂vj

∈ L2(RN × RN × R) for

j = 1, . . . , N , by using a partial Fourier transform.

Lemma 31.1. If f, ∂f∂t +v.∂f∂x ,∂f∂vj

∈ L2(RN ×RN ×R) for j = 1, . . . , N , then

f ∈ H1/2loc (RN × R

N × R).

Proof : Denoting by (ξ, τ) the dual variable of (x, t), one obtains Ff, (τ +v.ξ)Ff, ∂(Ff)

∂vj∈ L2; then for (ξ, τ) fixed one has

2�∫

RN

(τ + v.ξ)Ff ∂(Ff)∂vj

dv =∫

RN

(τ + v.ξ)∂(|Ff |2)∂vj

dv = −∫

RN

ξj |Ff |2 dv,(31.1)

which is true for smooth functions with compact support, and extends by adensity argument in our case.5 Multiplying (31.1) by sign(ξj) and integratingin (ξ, τ) gives∫

RN×RN×R

|ξj | |Ff |2 dξ dτ dv ≤ 1π

∣∣∣∣∣∣∂f

∂t+ v.

∂f

∂x

∣∣∣∣∣∣L2

∣∣∣∣∣∣∂f

∂vj

∣∣∣∣∣∣L2

for j = 1, . . . , N.

(31.2)Then, one notices that

|τ |1 + |v| ≤

|τ + v.ξ|1 + |v| +

|v.ξ|1 + |v| ≤ |τ + v.ξ| + |ξ|, (31.3)

so that ∫RN×RN×R

|τ |1+|v| |Ff |2 dξ dτ dv

≤ ∫RN×RN×R

(|τ + v.ξ| + |ξ|) |Ff |2 dξ dτ dv <∞.(31.4)

5 The truncation step is based on the Lebesgue dominated convergence theorem,and then the regularizing step is done by convolution, noticing that τ + v.ξ hasbounded partial derivatives, as one works on a compact set.

Page 256: From hyperbolic systems to kinetic theory: a personalized quest

31 Compactness by Integration 241

One deduces a bound in H1/2 in all variables, if one restricts attention to abounded set in v. ��

Of course, a proof by Fourier transform is restricted to an L2 framework,and it is useful to consider a different type of proof, valid in an Lp frameworkfor 1 ≤ p ≤ ∞; it will also show how commutators appear in a natural way inthe theory.

Lemma 31.2. One denotes L0 = ∂∂t +

∑Nj=1 vj

∂∂xj

, and Lk = ∂∂vk

for k =

1, . . . , N (so that the commutator [Lk, L0] = LkL0 − L0Lk is ∂∂xk

for k =1, . . . , N). For s ∈ R, one denotes by S0(s) the group of operators defined by

(S0(s)f)(x, v, t) = f(x− s v, v, t− s) a.e. in RN × R

N × R, (31.5)

which is a group of isometries in Lp(RN×RN×R), with infinitesimal generator

L0, so that

||S0(s)f − f ||p ≤ |s| ||L0f ||p for all s ∈ R, f ∈ Lp(RN × RN × R), 1 ≤ p ≤ ∞.

(31.6)For s ∈ R, one denotes by Vk(s) for k = 1, . . . , N the group of operatorsdefined by

(Vk(s)f)(x, v, t) = f(x, v − s ek, t) a.e. in RN × R

N × R, (31.7)

where e1, . . . , eN is the canonical basis of RN , which is a group of isometries

in Lp(RN × RN × R), with infinitesimal generator Lk, so that

||Vk(s)f − f ||p ≤ |s| ||Lkf ||p for all s ∈ R, f ∈ Lp(RN × RN ×R), 1 ≤ p ≤ ∞.

(31.8)

Proof : The fact that they are isometries comes from the fact that the mappings(x, v, t) �→ (x− s v, v, t− s) as well as (x, v, t) �→ (x, v − s ek, t) have Jacobiandeterminant 1, on R

N ×RN ×R and for all s ∈ R. The infinitesimal generator

of S0 is obtained by looking for the limit as s→ 0 of f−S0(s)fs , and for p = +∞

one only asks for this limit to exist for the L∞ weak � topology; of course,for f ∈ L1

loc(RN × R

N × R) the limit exists in the sense of distributions andis L0f . The same remarks hold for the infinitesimal generator of Vk. That(31.6) and (31.8) hold follows from the characterization of the infinitesimalgenerators and from the fact that one deals with groups of isometries. ��

There is a discrete analogue of the commutation relation[∂∂vk

, ∂∂t+v.∂∂x

]=

∂∂xk

.

Lemma 31.3. For s ∈ R, one denotes by Xk(s) for k = 1, . . . , N the groupof operators defined by

(Xk(s)f)(x, v, t) = f(x− s ek, v, t) a.e. in RN × R

N × R, (31.9)

Page 257: From hyperbolic systems to kinetic theory: a personalized quest

242 31 Compactness by Integration

which is a group of isometries in Lp(RN×RN×R), with infinitesimal generator

∂∂xk

. For a, b ∈ R, and 1 ≤ k ≤ N , one has

Xk(a b) = S0(−b)Vk(−a)S0(b)Vk(a) (31.10)

Proof : Indeed, let g1, g2, g3, g4 be defined by g1 = Vk(a)f , g2 = S0(b)g1,g3 = Vk(−a)g2, and g4 = S0(−b)g3. One has g1(x, v, t) = f(x, v − a ek, t),g2(x, v, t) = g1(x − b v, v, t − b) = f(x − b v, v − a ek, t − b), g3(x, v, t) =g2(x, v+a ek, t) = f(x−b v−a b ek, v, t−b), and g4(x, v, t) = g3(x+b v, v, t+b) =f(x− a b ek, v, t) = (Xk(a b)f)(x, v, t). ��

Then one has the following discrete version of Lemma 31.1.

Lemma 31.4. Let 1 ≤ p ≤ ∞, k ∈ {1, . . . , N}, and let f ∈ Lp(RN ×RN ×R)

satisfy

there exists α ∈ (0, 1] such that ||S0(s)f − f ||p ≤ A |s|α for all s ∈ R,(31.11)

there exists βk ∈ (0, 1] such that ||Vk(s)f − f ||p ≤ Bk |s|βk for all s ∈ R,(31.12)

then one has

||Xk(s)f − f ||p ≤ c(α, βk)Aβk/(α+βk)Bα/(α+βk)|s|αβk/(α+βk) for all s ∈ R.(31.13)

Proof : By (31.10), one has

Vk(a)S0(b)[Xk(a b)f − f ] = S0(b)Vk(a)f − Vk(a)S0(b)f, (31.14)

and because Vk(a) and S0(b) are isometries,

||Xk(a b)f − f ||p = ||S0(b)Vk(a)f − Vk(a)S0(b)f ||p, (31.15)

and then one observes that

S0(b)Vk(a)f − Vk(a)S0(b)f = S0(b) (Vk(a)f − f) − Vk(a) (S0(b)f − f)+ S0(b)f − f + f − Vk(a)f,

(31.16)and one deduces that

||Xk(a b)f − f ||p ≤ 2||Vk(a)f − f ||p + 2||S0(b)f − f ||p ≤ 2A |a|α + 2Bk|b|βk ,for all a, b ∈ R.

(31.17)Finally, one minimizes the right-hand side of (31.17) for a b = s. ��

If ||L0f ||p < ∞ and ||Lkf ||p < ∞, then one can take α = βk = 1, so thatαβk

α+βk= 1

2 , and (31.13) then corresponds to f having half a derivative, but factually belongs to an interpolation space slightly larger than H1/2 (in xk).

Page 258: From hyperbolic systems to kinetic theory: a personalized quest

31 Compactness by Integration 243

Using (31.11) and (31.13), one can estimate ||Tsf − f ||Lp(RN×K×R) for acompact K ⊂ R

N , where (Tsf)(x, v, t) = f(x, v, t− s).

Finally, I adapt the argument of Patrick GERARD to a simple situation ofcompactness by integration, but I refer to [18] for the definitions of the termsand properties of H-measures used in the proof.

Lemma 31.5. Writing functions of (x, t, v), if

fn ⇀ 0 in L2(RN × R × RN ) weak and

∂fn

∂t +∑Nj=1 vj

∂fn

∂xj→ 0 in H−1

loc (RN × R × R

N ) strong, (31.18)

then defining �n by

�n(x, t) =∫

RN

fn(x, v, t)ϕ(v) dv for ϕ ∈ L2(RN ), x ∈ RN , t ∈ R, (31.19)

one has�n → 0 in L2

loc(RN × R) strong. (31.20)

Proof : It is equivalent to show that for any sequence un converging weakly to0 in L2(RN × R) and keeping its support compact, the scalar product of �nand un converges to 0, i.e. the scalar product of fn and gn = un⊗ϕ convergesto 0. If μ is the H-measure of a subsequence (fm, gm), it means that one mustshow that μ12 = 0.

Denoting by ξ, τ, ω the dual variables of x, t, v, the localization principletransforms (31.18) into

(τ + (v, ξ)

)μ11 =

(τ + (v, ξ)

)μ12 = 0, (31.21)

and because ∂gn

∂vj→ 0 in H−1

loc (RN ×R×R

N) strong, the localization principleimplies

ωjμ21 = ωjμ

22 = 0 for j = 1, . . . , N. (31.22)

On the support of μ12, one then has τ + (v, ξ) = 0 by (31.21) and ω = 0 by(31.22), so that one cannot have ξ = 0, and therefore for each (x, t, ξ, τ) theset of v such that (x, t, v, ξ, τ, 0) belongs to the support of μ12 is included ina hyperplane and thus has Lebesgue measure 0. It remains to show that μ12

has an L1 density in v to deduce by the Fubini theorem that μ12 = 0, andthis comes from the fact that

μ22 = ν ⊗ |ϕ|2, (31.23)

where ν is the H-measure corresponding to a subsequence of um. ��Although the proof of Patrick GERARD is a little more general, his method

does not seem suitable to deduce some generalizations of Pierre-Louis LIONS,alone or in collaboration with Ron DIPERNA and Yves MEYER.

Page 259: From hyperbolic systems to kinetic theory: a personalized quest

244 31 Compactness by Integration

[Taught on Monday November 19, 2001.]

Notes on names cited in footnotes for Chapter 31, GOLSE,6 SENTIS.7

6 Francois GOLSE, French mathematician. He works at Universite Paris 7 (DenisDiderot), Paris, France.

7 Remi SENTIS, French mathematician. He works at CEA (Commissariat al’Energie Atomique), France.

Page 260: From hyperbolic systems to kinetic theory: a personalized quest

32

Wave Front Sets; H-Measures

In the late 1970s, I had developed the method of compensated compactness,partly with Francois MURAT, and I had used Young measures for explaining inmore “classical” terms what it meant,1 and I was wondering how to introducea new object with a dual variable ξ to describe the transport of oscillations.2

Why was I looking for a dual variable ξ? I agree that I was short sighted,but in the late 1970s, I wanted to find if oscillations were transported in away similar to the “propagation of singularities” in linear hyperbolic equationswhich Lars HORMANDER and his school were studying. I knew that propaga-tion of singularities is fake physics,3 and I understood later that it was pushed1 I thought that the parametrized measures which I had heard about in seminars on

“control theory” were a classical concept, and in the summer of 1978 I had paidattention to introducing them in my Heriot–Watt course without any probabilisticlanguage, as it is completely irrelevant to the questions of continuum mechanicsand physics that I was interested in. I was the first to use Young measures forquestions of partial differential equations, but I must warn the reader that a fewhave afterward claimed, explicitly or by omitting to mention my contributions,that it was their idea, and as they have unfortunately written a lot of nonsensecorresponding to what Young measures are good for, I must say that I have hadno part in their unscientific method of misleading students and researchers.

2 Some authors insist on distinguishing between “oscillations” and “concentrationeffects”, but their reasons are not always very good, and the basic compensatedcompactness result treats these two questions in a unified way, as do the H-measures, which I introduced ten years after [18].

3 Light is described by the Maxwell–Heaviside equation, and not by the wave equa-tion, but Lars HORMANDER seems to have found it too challenging to developmathematical tools for the systems of partial differential equations that one en-counters in continuum mechanics and in physics. Even if he had never studiedmuch continuum mechanics or physics, and had not felt the difference betweenthe Maxwell–Heaviside equation and the wave equation, he should have knownthat a ray of light transports energy, and could not then be related to the question

Page 261: From hyperbolic systems to kinetic theory: a personalized quest

246 32 Wave Front Sets; H-Measures

forward for that particular reason,4 but it was possible that the oscillationsfor a first-order linear hyperbolic equation, satisfying

N∑

j=1

bj∂un∂xj

= fn (32.1)

would also involve the associated bicharacteristic rays, defined by

dxj

dt = bj(x(t))

= ∂P∂ξj

, j = 1, . . . , N, with P (x, ξ) =∑N

j=1 bj(x)ξjdξj

dt = − ∂P∂xj

, j = 1, . . . , N,(32.2)

so my first idea was to look for a mathematical object more general than aYoung measure, in that it would have a variable ξ, which would play a rolein proving results of propagation. I thought of introducing functionals of theform

∫Ω F(x, un, grad(un)

), with F being positively homogeneous of degree

0 in the last variable, but I did not find much in that direction.After I had mentioned my idea of adding a ξ variable, George PAPANICO-

LAOU had mentioned the Wigner transform,5 which consists in associating toa function u on R

N the wave function W defined on RN × R

N by

W (x, ξ) =∫

RN

u(x+

y

2

)u(x− y

2

)e−2i π (y,ξ) dy, (32.3)

which makes sense for u ∈ L2(RN ), giving W ∈ Cb(RN × RN ). If one adds

u ∈ L1(RN ), so that W is bounded in ξ with values in L1(RN ), one has∫

RN

W (x, ξ) dx = |Fu(ξ)|2, (32.4)

and

of micro-local regularity which interested him, as his wave front set is a no man’sland where one does not study what happens.

4 It was pure propaganda to call the results of propagation of micro-local regularity“propagation of singularities”, and one might be surprised that Lars HORMANDER

had fallen hostage to that political propaganda. Others before him had fallenhostages to a political propaganda of a different kind, which consisted in brain-washing students into believing that the world is described by differential equa-tions, as if 19th century continuum mechanics and physics had never happened,and no one had understood the difference between ordinary and partial differentialequations during the whole 20th century!

5 Jeno Pal (Eugene Paul) WIGNER, Hungarian-born physicist, 1902–1995. He re-ceived the Nobel Prize in Physics in 1963, for his contributions to the theory ofthe atomic nucleus and the elementary particles, particularly through the dis-covery and application of fundamental symmetry principles, jointly with MariaGOEPPERT-MAYER and J. Hans D. JENSEN. He had worked at Princeton Uni-versity, Princeton, NJ.

Page 262: From hyperbolic systems to kinetic theory: a personalized quest

32 Wave Front Sets; H-Measures 247

RN

W (x, ξ) dξ = |u(x)|2. (32.5)

I did not find a way to use this idea either.In 1984, I had the idea of a mathematical tool for computing a correction

in a problem of homogenization which had shown an unexpected quadraticeffect, but I only tried to develop it in 1986 to prove results of small amplitudehomogenization, where it served for computing a correction which is quadraticwith respect to a small parameter, and there is not yet a general theory forcomputing the following terms.6

After having defined H-measures for that question of small amplitude ho-mogenization, with a variable ξ in the definition, I wondered if this mathemat-ical tool helps in proving propagation results for oscillations and concentrationeffects for equation (32.1), and indeed it does, with the bicharacteristic rays(32.2) playing a role.

My definition has a vague analogy with Lars HORMANDER’s definition ofthe wave front set of a distribution T , also called the essential singular supportof T , whose projection onto the x space is the singular support of T which hadbeen defined by Laurent SCHWARTZ as the complement of the largest open setω ⊂ Ω such that the restriction of T to ω is a C∞ function.7 After localizing inx ∈ Ω ⊂ R

N by considering ϕT for ϕ ∈ C∞c (Ω), Lars HORMANDER declares

that T is micro-locally regular at (x0, ξ0) if ϕ(x0) �= 0, ξ0 �= 0 and F(ϕT )decays fast in a conic neighbourhood of the direction ξ0. Then the set of points6 In a periodic framework, one can describe all the terms, and the formula for the

quadratic correction suggests the general formula proven with H-measures [18],which is valid in a general case, but some people are misled by this similarity anddo not understand what the mathematics says. If one invents weak convergence(which F. RIESZ did) and then for a continuous periodic function f one considersthe sequence un defined by un(x) = f(nx), it is easy to see that un converges inL∞(R) weak � to a constant, which is the average of f in a period. If one listensto a physicist postulating a behaviour of a physical quantity to be f

(x, x

εn

)where

f(x, y) is periodic in y and εn is a small characteristic length, one understandseasily that the average f(x) of f(x, y) in y may serve as a macroscopic value,but it is doubtful that one will invent weak convergence to explain that whatthis physicist has been doing is to say that f

(x, x

εn

)and f(x) are very near

in a weak topology, without wondering if that weak topology is adapted to theequation that this quantity satisfies (which is why homogenization was not reallyunderstood before mathematicians became interested in the question, because oneneeds a different topology than the classical weak topology!). No one having seenthe formula for quadratic corrections in the periodic case had deduced a correctmathematical definition of H-measures [18]. Even now that I have given such adefinition, no one has yet understood the definition of a mathematical object thathelps calculate the following corrections in a general framework!

7 If for a nonempty family ωi, i ∈ I , of open subsets of Ω the restriction of T toωi is a C∞ function, then using a C∞ partition of unity one deduces that therestriction of T to the union ω = ∪i∈Iωi is a C∞ function, hence there exists alargest open set where T is C∞.

Page 263: From hyperbolic systems to kinetic theory: a personalized quest

248 32 Wave Front Sets; H-Measures

where T is micro-locally regular is open, and the complement of these pointsis the wave front set of T , which is then closed.

Conversely, I work with a sequence un converging weakly in L2loc(Ω) to

u∞, and I localize in x by considering (un − u∞)ϕ for ϕ ∈ Cc(Ω), and then Ilocalize in all directions ξ �= 0 by extracting a subsequence m→ ∞ such thatfor every ψ ∈ C(SN−1) (and every ϕ ∈ Cc(Ω)) one has

RN

∣∣F((um − u∞)ϕ

)∣∣2 ψ( ξ|ξ|)dξ → L(ϕ, ψ) as m→ ∞, (32.6)

and it is obvious that for each ϕ ∈ Cc(Ω) there exists a nonnegative Radonmeasure μϕ ∈ M(SN−1) such that L(ϕ, ψ) = 〈μϕ, ψ〉 for all ψ ∈ C(SN−1),but the interesting question is the dependence of μϕ with respect to ϕ, and Iproved that

there exists μ ∈ M(Ω × SN−1), μ ≥ 0, such that L(ϕ, ψ) = 〈μ, |ϕ|2 ⊗ ψ〉

for all ϕ ∈ Cc(Ω), ψ ∈ C(SN−1),(32.7)

where μ denotes the H-measure associated to the subsequence um. For vector-valued functions, if Un ⇀ U∞ in L2

loc(Ω; Cp) weak, one can extract a subse-quence m → ∞ such that for every ϕ1, ϕ2 ∈ Cc(Ω) and every ψ ∈ C(SN−1)one has∫

RN F((Umj − U∞j )ϕ1

)F((Umk − U∞k )ϕ2

)ψ(ξ|ξ|)dξ → Lj,k(ϕ1, ϕ2, ψ)

as m→ ∞, for j, k = 1, . . . , p,(32.8)

and it is obvious that for each ϕ1, ϕ2 ∈ Cc(Ω) there exists a complex Radonmeasure μj,kϕ1,ϕ2

∈ M(SN−1) such that Lj,k(ϕ1, ϕ2, ψ) = 〈μj,kϕ1,ϕ2, ψ〉 for all

ψ ∈ C(SN−1), but the interesting question is the dependence of μj,kϕ1,ϕ2with

respect to ϕ1, ϕ2, and I proved that

there exists an Hermitian symmetric nonnegative μ = (μj,k)j,k=1,...,p,μj,k ∈ M(Ω × S

N−1), j, k = 1, . . . , p, such thatLj,k(ϕ1, ϕ2, ψ) = 〈μj,k, ϕ1ϕ2 ⊗ ψ〉

for all ϕ1, ϕ2 ∈ Cc(Ω), ψ ∈ C(SN−1),(32.9)

where μ denotes the H-measure associated to the subsequence Um.8 In con-structing my theory of H-measures [18], I wanted to avoid the regularity hy-potheses which Joseph KOHN and Louis NIRENBERG had chosen for theirtheory of pseudo-differential operators,9 and which Lars HORMANDER hasalso used for his theory of Fourier integral operators, because they are not8 Charles HERMITE, French mathematician, 1822–1901. He had worked in Paris,

France.9 Joseph John KOHN, Czech-born mathematician, born in 1932. He works at Prince-

ton University, Princeton, NJ.

Page 264: From hyperbolic systems to kinetic theory: a personalized quest

32 Wave Front Sets; H-Measures 249

adapted to problems from continuum mechanics or physics, where interfacesoccur and discontinuous coefficients appear, and I developed a calculus of“pseudo-differential” operators with minimal regularity hypotheses.10 An im-portant property, from which the compensated compactness theorem follows,is what I called the localization principle, where for continuous coefficientsAj,k, j = 1, . . . , N, k = 1, . . . , p, I proved that

if∑N

j=1

∑pk=1

∂(Aj,kUmk )

∂xjbelongs to a compact of H−1

loc (Ω) strong, then∑N

j=1

∑pk=1 ξjAj,kμ

k, = 0 for � = 1, . . . , p.(32.10)

If T satisfies∑

j bj∂T∂xj

= g with b1, . . . , bn, g ∈ C∞(Ω), Lars HORMANDER

proves that the wave front set of T is included in the zero set of P definedin (32.2), using an argument related to the stationary phase principle. Usinga first commutation lemma (that a commutator is compact),11 I prove thatif (32.1) holds with b1, . . . , bN of class C1 and fn belonging to a compact ofH−1loc strong, then the support of μ is included in the zero set of P .Assuming that the bj are real, Lars HORMANDER proves (using his theory

of Fourier integral operators) that micro-local regularity for T is propagatedalong the bicharacteristic rays defined by (32.2), so that the wave front setof T is a union of bicharacteristic rays. Using a second commutation lemma(and a result of Alberto CALDERON for avoiding more than C1 regularityon b1, . . . , bN ), assuming that fn converges in L2

loc strong,12 I prove that μsatisfies an homogeneous first-order partial differential equation in x and ξ,whose characteristic curves are related to the bicharacteristic rays defined by(32.2).13

10 Because I deal with Radon measures, I use continuous test functions, and somecare must be taken for the case of coefficients in L∞, but one must pay attentionthat the transport properties use C1 coefficients, therefore refraction effects atinterfaces cannot be studied yet.

11 In proving the existence of H-measures, I also used Laurent SCHWARTZ’s kerneltheorem in order to prove that a distribution kernel exists and then that it is anonnegative measure by a positivity argument, another much simpler remark ofLaurent SCHWARTZ. Jacques-Louis LIONS had told me once that he had writtena simple proof of the kernel theorem with Lars GARDING, which I then read,so that I knew that I could avoid sending my readers to the initial proof, but Ieven simplified the argument a little more so that I only used classical results infunctional analysis, that one teaches with Hilbert–Schmidt operators, but I didnot explain that when I wrote [18].

12 If fn converges in L2loc weak, one may need to extract another subsequence, and

the first-order equation has a source term, related to the H-measure for the pair(un, fn).

13 The equations for bicharacteristic rays are not really defined on Ω × SN−1, and

SN−1 should be replaced by the quotient space obtained from R

N \ {0} by iden-tifying half lines. One can enforce ξ ∈ S

N−1 by replacing the second line of (32.2)

Page 265: From hyperbolic systems to kinetic theory: a personalized quest

250 32 Wave Front Sets; H-Measures

Pseudo-differential operators were introduced (by Joseph KOHN and LouisNIRENBERG) for questions concerning elliptic operators,14 and the mappingwhich to initial data associates the solution at time t of the wave equation, forexample, is not given by a pseudo-differential operator, and Lars HORMANDER

introduced the larger class of Fourier integral operators for working with ques-tions of linear hyperbolic equations, and his approach works for the scalar waveequation (with C∞ coefficients), but I do not think that it applies to systemswith smooth coefficients if they cannot be reduced to scalar equations.15

It may seem a miracle then that I was able to deal with the propagationof oscillations and concentration effects for a large class of linear hyperbolicsystems with C1 coefficients (the wave equation, the Maxwell–Heaviside equa-tion, the linearized elasticity equation), by using only H-measures [18], whichmimic methods from pseudo-differential operators.

[Taught on Monday November 26, 2001 (Wednesday 21 and Friday 23 fellduring Thanksgiving recess).]

Notes on names cited in footnotes for Chapter 32, GOEPPERT-MAYER,16 J.H.JENSEN,17 SCHMIDT.18

bydξj

ds= − ∂P

∂xj+ξj

(∑N

k=1ξk

∂P∂xk

), for j = 1, . . . , N . It is actually useful to distin-

guish ξ and −ξ, although for real sequences the H-measures charges in the sameway ξ and −ξ, and this has consequences which physicists know, that one needsnonlinearity to send a beam of light in one direction without sending the sameamount of energy in the opposite direction (one puts a light bulb at the focus ofa parabola to send a beam in one direction, but the parabola must be a mirrorto reflect forward the energy from the light bulb which is sent backward, and thenonlinearity comes from what happens inside the mirror).

14 They are linked to singular integrals, but the specialists of harmonic analysis whohad specialized on questions of singular integrals had not created a calculus wherethe symbols of the operators play an important role.

15 I first heard Lars HORMANDER talk at a conference in Jerusalem, Israel, in thesummer of 1972, and I understood that he had introduced these ideas as anattempt to characterize lacunas, i.e. describe the precise support of the elementarysolution of a linear hyperbolic equation with constant coefficients, and he couldat least say what the singular support of the elementary solution is.

16 Maria GOEPPERT-MAYER, German-born physicist, 1906–1972. She received theNobel Prize in Physics in 1963, with J. Hans D. JENSEN, for their discoveries con-cerning nuclear shell structure, jointly with Eugene P. WIGNER. She had workedin Chicago, IL, and at USCD (University of California San Diego), La Jolla, CA.

17 J. Hans D. JENSEN, German physicist. He received the Nobel Prize in Physicsin 1963, with Maria GOEPPERT-MAYER, for their discoveries concerning nuclearshell structure, jointly with Eugene P. WIGNER. He had worked in Hannover, andin Heidelberg, Germany.

18 Erhard SCHMIDT, German mathematician, 1876–1959. He had worked in Bonn,Germany, in Zurich, Switzerland, in Erlangen, Germany, in Breslau (then in Ger-many, now Wroc�law, Poland), and in Berlin, Germany.

Page 266: From hyperbolic systems to kinetic theory: a personalized quest

33

H-Measures and “Idealized Particles”

For a general scalar wave equation

∂t

(�∂un∂t

)−

N∑

i,j=1

∂xi

(ai,j

∂un∂xj

)= fn in Ω × (0, T ), (33.1)

with continuous coefficients, one assumes that un ⇀ u∞ in H1loc

(Ω × (0, T )

)

weak, and one applies the theory to Un = gradt,x(un), i.e. p = N + 1, andone denotes x0 = t. For a subsequence Um defining an H-measure μ, thelocalization principle (using curl(Um) = 0) gives μj,k = ξjξkν, for j, k =0, . . . , N for a nonnegative Radon measure ν. Then, if fm stays in a compactof H−1

loc

(Ω × (0, T )

)strong, the localization principle (now using (33.1)) gives

Qν = 0, with Q = � ξ20 −

N∑

i,j=1

ai,jξiξj . (33.2)

Then, if one assumes that fm converges in L2loc

(Ω × (0, T )

)strong, and that

one really has a wave equation, i.e. the coefficients are independent of t, � isreal and a is real and symmetric,1 and there exist α, β ∈ (0,∞) with a ≥ α Iand � ≥ β a.e. in Ω, one deduces that ν satisfies a partial differential equationin (x0, . . . , xN ) and (ξ0, . . . , ξN ), written in weak form as

〈ν, {Q,Φ}〉 = 0 for all Φ ∈ C1c

(Ω × (0, T )× S

N), (33.3)

where the Poisson bracket of two functions in (x, ξ) is defined by

{g, h} =N∑

j=0

( ∂g∂ξj

∂h

∂xj− ∂g

∂xj

∂h

∂ξj

), (33.4)

1 One could allow a to have complex entries and be Hermitian symmetric, butcomplex coefficients are not so natural for the wave equation, while they do appearnaturally for the Dirac equation.

Page 267: From hyperbolic systems to kinetic theory: a personalized quest

252 33 H-Measures and “Idealized Particles”

so that the characteristic curves of the first-order equation (33.3) satisfiedby ν are the bicharacteristic rays associated to Q, i.e.

dxj

ds = ∂Q∂ξj

, j = 0, 1, . . . , N,dξj

ds = − ∂Q∂xj

, j = 0, 1, . . . , N.(33.5)

If(x(s), ξ(s)

)is a solution of (33.5), then for λ �= 0,

(x(λ s), λ ξ(λ s)

)is also a

solution, so that (33.5) induces a differential equation on the quotient spacementioned.2 Patrick GERARD has pointed out to me that if � and ai,j areonly of class C1 for i, j = 1, . . . , N , then ∂Q

∂xjare only continuous in x for

j = 1, . . . , N , and uniqueness of solutions of (33.5) might not hold.

Conservation of energy for (33.1) is

∂∂t

(�2

∣∣∣∂un

∂t

∣∣∣2

+∑N

i,j=1ai,j

2∂un

∂xi

∂un

∂xj

)−∑N

i,j=1∂∂xi

(ai,j

∂un

∂xj

∂un

∂t

)= fn

∂un

∂t

in Ω × (0, T ),(33.6)

so that the difference between the limit of �2∣∣∣∂um

∂t

∣∣∣2

+∑Ni,j=1

ai,j

2∂um

∂xi

∂um

∂xjand

�2

∣∣∣∂u∞∂t

∣∣∣2

+∑N

i,j=1ai,j

2∂u∞∂xi

∂u∞∂xj

corresponds to a part of the energy hiddenat a mesoscopic level, i.e. a form of internal energy, which is then

internal energy =∫

SN

(� ξ20

2+

N∑

i,j=1

ai,jξiξj2

)dν(x, ξ), (33.7)

and because Qν = 0 there is equipartition of energy,3 i.e. half the in-ternal energy has a kinetic origin,

∫SN

� ξ202 dν(x, ξ), which is the limit of

�2

∣∣∂un

∂t

∣∣2 − �

2

∣∣∂u∞∂t

∣∣2, and half the internal energy has a potential origin,

∫SN

(∑Ni,j=1

ai,jξiξj

2

)dν(x, ξ), which is the limit of

∑Ni,j=1

ai,j

2∂un

∂xi

∂un

∂xj−∑N

i,j=1ai,j

2∂u∞∂xi

∂u∞∂xj

. One should observe that it is not the internal energy that sat-isfies a partial differential equation in x (and it means also time, which isx0), but another object, linked to an H-measure of the sequence, which doessatisfy an equation in (x, ξ).

It is important to observe that this transport equation has not been postu-lated like all the equations from kinetic theory, and it has been deduced from2 It means that one can enforce ξ ∈ S

N by replacing the second line of (33.5) bydξj

ds= − ∂Q

∂xj+ ξj

(∑N

k=0ξk

∂Q∂xk

), for j = 0, 1, . . . , N .

3 Oscillating solutions of the Maxwell–Heaviside equation show another form ofequipartition of energy, because (Dn.En) − (Bn.Hn) ⇀ (D∞.E∞) − (B∞.H∞)(i.e. the action is a robust quantity), but the density of electromagnetic energy is12

((Dn.En)+(Bn.Hn)

), so that for the electromagnetic energy which is hidden at

a mesoscopic level, half has electric origin, the limit of 12

((Dn.En)− (D∞.E∞)

),

and half has magnetic origin, the limit of 12

((Bn.Hn) − (B∞.H∞)

).

Page 268: From hyperbolic systems to kinetic theory: a personalized quest

33 H-Measures and “Idealized Particles” 253

a balance law. The explanation is that some part of a conserved quantity mayhide itself at a mesoscopic level, and because of the linearity of the equationa complete analysis was possible, and all kinds of ways of hiding energy ata mesoscopic level have been automatically taken into account. Of course,it was important that only oscillating solutions compatible with (33.1) wereconsidered: if a guess or probabilities had been used, it would have made theresult doubtful.

In the case � = 1 and ai,j = c2δi,j , one has Q = ξ20 −c2

(∑Ni,j=1 ξ

2j

), so that

ν lives on ξ20 = c2

(∑Ni,j=1 ξ

2j

), and one can parametrize those points on S

N

by choosing η ∈ SN−1 and then having ξ0 = ±c√

1+c2and ξj = ηj√

1+c2for j =

1, . . . , N (becauseQ is independent of x one has dξds = 0, so that ξ stays on SN ).

One has {Q,Φ} = 2ξ0 ∂Φ∂t − 2c2∑Nj=1 ξj

∂Φ∂xj

= 2c√1+c2

(±∂Φ∂t − c

∑Nj=1 ηj

∂Φ∂xj

),

so that the equation for ν corresponds to a transport with velocity c in thedirection ∓η. One could say that the energy hidden at a mesoscopic level istransported by “idealized particles”, moving in all directions with velocity c,and because the equation is linear, these “idealized particles” do not interactwhen they go through the same point with different directions.

One might be tempted to call these “idealized particles” photons, butthere is of course no possible quantification h ν, and because H-measures donot use any characteristic length they cannot distinguish between differentfrequencies, so that if there were photons, the H-measure would only see thetotal energy of all the photons moving in a given direction, for all frequencies(supposed to be very large). Actually, it is still not clear to me what are thesephotons that physicists mention,4 but they cannot be properties of the waveequation or of the Maxwell–Heaviside equation, which are linear and do notcontain the Planck constant h in their coefficients, and my conjecture is thatthey are related to the coupling of the Maxwell–Heaviside equation and theDirac equation, in the way DIRAC had proposed but without the zero-orderterm containing the mass of the electron; in this coupled equation the densityof charge � and the density of current j are expressed in terms of ψ ∈ C

4

which describes matter, and the equation for ψ has a coupling term with acoefficient in 1

h , which is linear in ψ, and linear in the scalar potential V andin the vector potential A.5

4 I find appealing the proposition of BOSTICK concerning electrons, but I cannotguess what his proposition concerning photons means.

5 Photons seem to result from interaction of light and matter, but as I have notbeen able yet to develop a theory valid for semi-linear hyperbolic systems, Iconjecture that for oscillating solutions with large frequency ν of the Maxwell–Heaviside/Dirac coupled equation, the only possible transfers of energy betweenthe electromagnetic field and the matter field described by ψ are (almost) mul-tiples of h ν. In some way, I think about photons in the way gusts of wind arejust a particular type of solution of the equations of hydrodynamics, and no onethinks of explaining laminar flows as a superposition of small gusts of wind.

Page 269: From hyperbolic systems to kinetic theory: a personalized quest

254 33 H-Measures and “Idealized Particles”

I find a sign that EINSTEIN had not understood the ideas of POINCARE

about relativity, is that he seemed to believe in forces at a distance in havingproposed a quite impossible scenario where light rays are bent near the sunbecause of the mass of the sun, and if he had understood that the Maxwell–Heaviside equation is hyperbolic (as is the wave equation, which does notreally describe light!), he would have known that a light ray only feels thelocal properties of matter along its way (although he could not know muchabout what is going on near the surface of the sun, as it is not so clearthat we know enough about that now), but he should certainly have thoughtabout two other phenomena concerning light. The first one is about mirages,which correspond to objects hidden behind the horizon, where it is not themass of the earth which plays a role, but the Brillouin effect,6 that the indexof refraction of air depends upon its temperature. The second one is abouta computation by AIRY,7 who had wondered why the solution of the waveequation is not zero in the shadow of an obstacle, i.e. the shadow only existsin the approximation of geometrical optics, but there what happens is not soclear, because it was only in the 1950s that Joseph KELLER developed hisgeometric theory of diffraction (GTD),8 where in guessing how grazing raysfollow the geodesics of the boundary, he had taken into account some explicitcomputations made in the same spirit as AIRY. Although Joseph KELLER

had mentioned early on that his expansions are not good near caustics, itis still not really understood why they give good results away from caustics,but he had mentioned something else to me more recently (in the fall of1990 in Stanford, I think), that the phenomenon of grazing rays which hehad studied is similar to the tunnelling effect in quantum mechanics, and Iconsider that a good possibility to avoid the probabilistic ideas that physicistsuse for this question, but I have not been able to find a way to explain hiscomputations; however, after discussing this question with Michael VOGELIUS

(in the summer of 2005, in Grenoble, France), I had the feeling that one mightexplain his computation by the existence of a boundary layer with a width oforder ν−1/3 in places with a finite radius of curvature (for large frequenciesν).

As there is not yet a generalization of my theory of H-measures to semi-linear hyperbolic systems, there are still guesses of quantum mechanics thatcannot be explained in a rational way, but certainly the proof of trans-port theorems for H-measures has already shown a crucial error of quantum6 Leon BRILLOUIN, French physicist, 1889–1969. He had worked in Paris, France.7 George Biddell AIRY, English mathematician, 1801–1892. He had worked in

Greenwich, England, as the seventh Astronomer Royal.8 Joseph Bishop KELLER, American mathematician, born in 1923. He received the

Wolf Prize for 1996/97, for his innovative contributions, in particular to elec-tromagnetic, optical, acoustic wave propagation and to fluid, solid, quantum andstatistical mechanics, jointly with Yakov G. SINAI. He worked at NYU (New YorkUniversity), New York, NY, and at Stanford University, Stanford, CA.

Page 270: From hyperbolic systems to kinetic theory: a personalized quest

33 H-Measures and “Idealized Particles” 255

mechanics: there are no particles playing esoteric games, there are only waves,but waves may hide conserved quantities like energy and momentum at var-ious mesoscopic levels, and one needs effective equations for describing themacroscopic effects of these hidden quantities. Of course, thermodynamicswas only a first guess for that question, and this theory should be generalizedin view of the new understanding which came out of the new mathematicaltools which I developed in the last quarter of the 20th century for understand-ing this question which I call beyond partial differential equations, which is forme the key to understanding the continuum mechanics and the physics of the20th century, plasticity, turbulence, and atomic physics.

It was a mistake to start from ordinary differential equations (with Hamil-tonian structure) and to deduce partial differential equations of Schrodingertype, and before H-measures it was already clear that Schrodinger equationsare simplified models where one has let the velocity of light c tend to ∞,9

but after H-measures it is clear that one should start from partial differentialequations, preferably of hyperbolic type, or of an intermediate type that onewould have proven to be natural (i.e. without postulating it), and one shouldderive effective equations, without postulating either that they correspond toan ordinary differential system of Hamiltonian type!

[Taught on Wednesday November 28, 2001.]

Notes on names cited in footnotes for Chapter 33, SINAI.10

9 In the spring of 1983, while I visited MSRI (Mathematical Sciences ResearchInstitute) in Berkeley, CA, I had stumbled upon an article from a physics journalwhere one started from the Dirac equation and one deduced the Schrodingerequation by letting c tend to ∞, but I do not know if everything was proventhere, because I was not able to find that article again when I looked for itsreference a few years after.

10 Yakov Grigor’evich SINAI, Russian-born mathematician, born in 1935. He receivedthe Wolf Prize for 1996/97, for his fundamental contributions to mathematicallyrigorous methods in statistical mechanics and the ergodic theory of dynamicalsystems and their applications in physics, jointly with Joseph B. KELLER. Heworks at Princeton University, Princeton, NJ.

Page 271: From hyperbolic systems to kinetic theory: a personalized quest

34

Variants of H-Measures

Homogenization, in the way I had developed it with Francois MURAT in theearly 1970s, has no small characteristic length in it (as the periodically modu-lated framework is only a particular example, which too many only consider,as if they could not understand the general case), and no probabilities (whichis one of the diseases which has plagued 20th century sciences, for which Iam trying to find a cure), and it was natural that I should first develop H-measures, which use no characteristic length, because the test functions ψ arehomogeneous of degree zero.

From my proof of a transport theorem for the H-measures associated tosolutions of (32.1), I saw how to generalize it to a large class of systems,those admitting a sesqui-linear balance law for their complex solutions,1 andthe first example of the wave equation (33.1) led to the transport equation(33.3), and what it says is that in the limit of infinite frequencies the rules ofgeometrical optics apply to all solutions of (33.1), and it is worth pointing outthat this is not what the usual understanding of geometrical optics is about.

The classical geometrical optics approach to the wave equation2

∂2u

∂t2− c2Δu = 0, (34.1)

is to construct asymptotic solutions of the form

u(x, t) = Aei ν ϕ, (34.2)

1 Sesqui is a prefix meaning one and a half, as the antilinearity is counted as half.2 The transport result for H-measures applies to many hyperbolic equations or sys-

tems (like the Maxwell–Heaviside equation, the equation of linearized elasticity,or the Dirac equation), but geometrical optics only seems to apply to the waveequation, apart from homogeneous isotropic media, where the components of so-lutions of the Maxwell–Heaviside equation or of the Lame equation satisfy scalarwave equations.

Page 272: From hyperbolic systems to kinetic theory: a personalized quest

258 34 Variants of H-Measures

where the amplitude A(x, t) and the phase ϕ(x, t) have an expansion in termsof the large frequency ν,

A = A0 +A1

ν+ . . . ; ϕ = ϕ0 +

ϕ1

ν+ . . . . (34.3)

Putting (34.2) in (34.1), and identifying the coefficients of ν2 gives anHamilton–Jacobi equation for ϕ0 (called the eikonal equation)

|ϕ0t |2 = c2|gradx(ϕ0)|2, (34.4)

whose solution stops being smooth at caustics, and then identifying the coef-ficients of ν gives a linear transport equation for A0:

A0tϕ

0t − c2

(gradx(A0), gradx(ϕ0)

)+ϕ0tt − c2Δϕ0

2A0 = 0. (34.5)

Based on similar considerations, it has been guessed that energy is transportedalong bicharacteristic rays, but if one looks at what has been done, one seesthat at best, i.e. if one estimates all the coefficients Aj and ϕj and one provesconvergence of the series (34.3) in some domain, one has only proven that itis true for a particular type of solutions, and away from caustics. For whathappens at caustics, MASLOV has proposed a formal expansion,3 which I thinkpredicts a jump of π

2 for the phase when one crosses caustics.4

Conversely, what my theorem with H-measures says is that for all solutionsof the wave equation, in the limit of infinite frequencies where some energymay be hidden at a mesoscopic level, this energy is transported along bichar-acteristic rays and the amounts moving in various directions are taken intoaccount by a new variable ξ, which is related to the direction of the gradientof the phase in the particular case considered by geometrical optics, and thereis no difficulty in having waves moving in infinitely many directions at thesame time because the Radon measure ν takes care of recording how muchenergy moves in each direction. Geometrical optics gets in trouble at causticsbecause it is designed to follow just one distorted plane wave, and causticsare precisely the points where one needs to consider plane waves arriving withslightly different directions. Actually, a difficulty appears at caustics for theH-measures, if one wants to study the regularity of its density in ξ, because3 Victor P. MASLOV, Russian mathematician. He works in Moscow, Russia.4 Because Jean LERAY had written a one page preface to the French translation

(by LASCOUX) of a book by MASLOV, I asked him (in the early 1990s, I think)if my interpretation was right, and I was surprised by his answer. Although hehad written in his preface that Lars HORMANDER’s theory of Fourier integraloperators is the wrong thing and that MASLOV was looking at the right question,he only answered to me that what MASLOV had done is formal, i.e. it is notmathematics. Of course, I knew this, but I had asked that question to JeanLERAY because I thought that he had read the book, and that he could then tellme what MASLOV was conjecturing.

Page 273: From hyperbolic systems to kinetic theory: a personalized quest

34 Variants of H-Measures 259

it is precisely at caustics that a limitation of the regularity occurs, and onethen sees the advantage of the weak formulation (33.3).

Having learnt more continuum mechanics or physics than most mathe-maticians, I have difficulty in being interested in oversimplified physical sit-uations, and if I have to consider an oversimplified model, for example onewhich uses only one characteristic length, I usually warn about the limitationsof such questions. However, for a simple question of showing one limitationof H-measures and how to overcome it, I had proposed a way to introduceone characteristic length εn (tending to 0), by adding one variable xN+1 forintroducing the sequence

V n(x1, . . . , xN , xN+1) = Un(x1, . . . , xN ) cosxN+1

εn, (34.6)

and then by considering the H-measure μ ∈ M(Ω×R×SN ) for the sequence

V n; actually, because μ is independent of the variable xN+1, it really cor-responds to an element of M(Ω × S

N ). Shortly after, I learnt that PatrickGERARD had already made a more elaborate proposal where, assuming un

scalar as a simplification, he considered a subsequence for which

limm→∞∫

RN |F(ϕum)|2ψ(εmξ) dξ = 〈μsc, |ϕ|2 ⊗ ψ〉,for all ϕ ∈ C∞

c (RN ), ψ ∈ S(RN ), (34.7)

and he called μsc ∈ M(Ω × RN ) the semi-classical measure associated to

the subsequence (because some examples that he had in mind are related toquestions that physicists call semi-classical). For technical reasons he usedF(ϕum) and not F(ϕ (um − u∞)

)in his definition, and his regularity hy-

pothesis on ψ has the reason that he had in mind a more general localizationprinciple, using higher-order derivatives (multiplied by the correct power ofεn). Although rather different, our two definitions are actually quite related,and if my choice of S

N as a quotient of RN+1 is not so good, his definition

consists in using {x ∈ RN+1 | xN+1 = 1}, which misses what my definition

puts on the equator {x ∈ SN | xN+1 = 0}, and this defect is related to hav-

ing chosen test functions ψ which vanish at ∞, but another defect is to havechosen test functions ψ which are continuous at 0, and my approach has thisdefect too. The motivation for the use of ψ(εmξ) in (34.7) comes from the factthat if un(x) = v(x)w

(xεn

)with v smooth with compact support and w pe-

riodic, then F(ϕun) is mostly localized at distances O(

1εn

)from the origin.5

As a consequence, if δn

εn→ 0, and un(x) = v(x)w

(xδn

), then the semi-classical

measure computed with εn is 0, because ψ is 0 at ∞; if ηn → 0 and ηn

εn→ ∞,

and un(x) = v(x)w(xηn

), then the semi-classical measure computed with εn

is concentrated at 0 but it mixes the information corresponding to various di-rections, because ψ is continuous at 0. This second defect can be corrected by5 Because ξ is the dual variable of x, and the use of e±2i π (ξ,x) forces (ξ, x) to have

no dimension, if εn is used to scale x, then 1εn

is used to scale ξ.

Page 274: From hyperbolic systems to kinetic theory: a personalized quest

260 34 Variants of H-Measures

using test functions ψ which behave like ψ0

(ξ|ξ|)

near 0 (with ψ0 ∈ C(SN−1)),and the first defect can be corrected by using test functions ψ which behavelike ψ∞

(ξ|ξ|)

near ∞ (with ψ∞ ∈ C(SN−1)), for example.6 Using such testfunctions ψ corresponds to considering R

N \ {0} and compactifying it with asphere S

N−1 at 0 and a sphere SN−1 at ∞, and a generalized measure (for

which I prefer not to use a term like semi-classical, because it is not wise togive too many different names to variants of H-measures) may charge thesetwo spheres and by a natural projection of this compactified space on S

N−1

one recovers the H-measure.Without taking such precautions near 0 and at ∞, it is false that the

knowledge of a semi-classical measure for a sequence gives the H-measure ofthat sequence,7 as was written by Pierre-Louis LIONS and Thierry PAUL,8

when they later found a different way to define the same objects that PatrickGERARD had introduced, which they wanted to call a different name, Wignermeasures, because they had discovered a way to introduce semi-classical mea-sures by using the Wigner transform. After George PAPANICOLAOU had toldme about the Wigner transform (32.3), I could not have thought of doingwhat Pierre-Louis LIONS and Thierry PAUL did, i.e. to look at

Wn(x, ξ) =∫

RN

u(x+

εny

2

)u(x− εny

2

)e−2i π (y,ξ) dy, (34.8)

and to show thatWm ⇀ μsc as m→ ∞ (34.9)

because I did not want to use any characteristic length in my general con-struction. I understood later that WIGNER had observed that (32.3) impliesthat

if i ut −Δu = 0, then∂W

∂t−

N∑

j=1

ξj∂W

∂xj= 0, (34.10)

and he would have liked to interpret W (x, ξ) as a density of particles movingwith velocity ξ, if W had been nonnegative. Marc FEIX told me afterwardthat WIGNER had proven that the convolution in ξ with e−α |ξ|2 is nonnega-tive,9 and he had characterized the best α > 0, and he told me that he had6 A careful analysis concerning a commutation lemma shows that at ∞ it is enough

to ask that ψ belong to the space BUC(RN ) of bounded uniformly continuousfunctions. However, one must pay attention that this space is not separable.

7 Except, of course, if the spheres at 0 and at ∞ are not charged by the generalizedmeasure, and Patrick GERARD had coined two words to express this fact. In otherwords, it is only true in a dull physical world with only one characteristic length,and it is worth pointing out that there are people who know the statement tobe wrong but nevertheless repeat it, probably because they like to advocate fakecontinuum mechanics or physics.

8 Thierry PAUL, French mathematician. He works at Universite Paris IX-Dauphine,Paris, France.

9 Marc R. FEIX, French physicist, 1928–2005. He had worked in Orleans, France.

Page 275: From hyperbolic systems to kinetic theory: a personalized quest

34 Variants of H-Measures 261

mentioned this fact to Pierre-Louis LIONS, who with his coauthor neverthelessattributed the idea to a Japanese. I had not tried to read the detail of whatthey had written, as I thought that they only wanted to show that they hadread about physics, while they had also shown a complete lack of physicalintuition in thinking that H-measures could be deduced from semi-classicalmeasures, but after Patrick GERARD had explained to me that they wantedto show that the limit of Wn is nonnegative, I immediately suggested a sim-pler proof using correlations, which actually opens the road to a new kind ofgeneralization.

Before talking about correlations, I find it useful to mention another im-portant observation learnt from H-measures, which shows a new kind of defectof the classical equations of kinetic theory; I had mentioned it once to Pierre-Louis LIONS in the early 1990s, but I may not have mentioned it in printbefore. It is that the density of particles f(x,v, t), that one uses in the Boltz-mann equation, or other equations in kinetic theory, looks pretty much likethe density of a variant of H-measures, i.e. one should think of it already asa quadratic micro-local object with respect to the waves, which are the onlyreal thing behind all that. It is then not so logical to introduce quadraticquantities in f , and it would be more natural to have a (micro-local) cubicquantity in the waves appear, and although such a general object has notbeen constructed yet, one can have a guess about that by using three-pointcorrelations.

I have mentioned before the Percus–Yevick equation for correlations, whichI think was postulated, so that I would not attach too much faith to it, but Isuggest that it should be understood as a hint that the ideas used in kinetictheory have been terribly simplistic, and that new ideas like correlations ofpositions should be thought about. When I discussed Kepler’s laws I pointedout that in the Boltzmann equation one mostly thinks in terms of two-bodyproblems with hyperbolas as trajectories, forgetting completely the case oftrajectories looking like ellipses, which must occur more and more if the gasis less and less rarefied, and in describing a gas with plenty of trajectories likethat, one might find that correlations of positions play an important role.

I should also recall the delays which take place during the close encounters,and mention the importance of considering equations with nonlocal terms, intime, but also in space.

However, after making a list of other ideas to use in classical descriptions,one must recall that in the end, the main problem is that real gases are notclassical at all, and they are made of waves!10

10 One may be interested in what happens to a “gas” of small metallic spheres rollingon a smooth plane surface and colliding, and one may compare theoretical resultsto experiments, and there is no doubt that one may improve on the Boltzmannequation for that, but in the end one will know more about a hard-sphere model,which no real gas follows!

Page 276: From hyperbolic systems to kinetic theory: a personalized quest

262 34 Variants of H-Measures

Apart from probabilistic frameworks, which I do not recommend for ex-plaining what happens in the real world, one needs a characteristic length(or time) for defining correlations. If u is periodic with period T , one definesk-point correlations by computing Ck(h1, . . . , hk) = 1

T

∫ T0u(t + h1) · · ·u(t +

hk) dt, and by applying this idea to the fast variable in a periodically modu-lated framework u

(x, xεn

)one is led to the following natural definition.

Definition 34.1. If for k ≥ 2 one has un ⇀ u∞ in Lkloc(Ω) weak, one definesthe k-point correlation measure Ck(h1, . . . , hk) using the characteristic lengthεn tending to 0 by

〈Ck(h1, . . . , hk), ϕ〉 = limn→∞∫Ωun(x+ εnh1) · · ·un(x+ εnhk)ϕ(x) dx,

for ϕ ∈ Cc(Ω), h1, . . . , hk ∈ RN ,

(34.11)for real functions, but also

〈C2(h1, h2), ϕ〉 = limn→∞∫Ω un(x+ εnh1)un(x+ εnh2)ϕ(x) dx,

for ϕ ∈ Cc(Ω), h1, h2 ∈ RN ,

(34.12)

for complex functions, in the particular case k = 2.

The definition makes sense because εn → 0 and for x ∈ support(ϕ), onehas x+ εnh1, . . . , x + εnhk ∈ Ω for n large enough. For given h1, . . . , hk, thesequence un(· + εnh1) · · ·un(· + εnhk) is bounded in L1

loc(Ω), so that thereexists a subsequence which converges in M(Ω) weak �, and using the Cantordiagonal argument one can extract a subsequence such that (34.11) holds forall h1, . . . , hk ∈ Q

N , and then using the uniform continuity of ϕ it also holdsfor all h1, . . . , hk ∈ R

N . One should notice that although a local bound in L3

seems natural for defining three-point correlations, such an hypothesis is notreally adapted to hyperbolic equations, because of an observation of WalterLITTMAN concerning the lack of Lp estimates for the wave equation if p �= 2,11

and it might be that either new functional spaces must be invented, or thatone must use ideas of compensated regularity (which is not the same thing ascompensated compactness!) for defining some special parts of the three-pointcorrelation measures.

Lemma 34.2. One has

Ck(h1 + z, . . . , hk + z) = Ck(h1, . . . , hk) for all h1, . . . , hk, z ∈ RN , (34.13)

andm∑

i,j=1

C2(hi, hj)λiλj ≥ 0 for all m ≥ 1, h1, . . . , hm ∈ RN , λ1, . . . , λm ∈ C

N ,

(34.14)11 Walter LITTMAN, American mathematician. He worked at University of Min-

nesota, Minneapolis, MN.

Page 277: From hyperbolic systems to kinetic theory: a personalized quest

34 Variants of H-Measures 263

so that C2(h, 0) is the Fourier transform (in its second argument ξ ∈ RN ) of

a nonnegative measure ∈ M(Ω × RN ).

Proof : Translating all the hj by z corresponds to evaluating ϕ at x − εnz in(34.11), and the uniform continuity of ϕ gives (34.13), while (34.14) is justsaying that the weak � limit of

∣∣∑m

j=1 λjun(x + εnhj)∣∣2 is ≥ 0. Denoting

Γ2(h) = C2(h + z, z) for all h, z ∈ RN , (34.14) says that

∑mi,j=1 Γ2(hi −

hj)λiλj ≥ 0, so that a theorem of BOCHNER on functions of positive type,12

extended by Laurent SCHARTZ to (tempered) distributions of positive type,tells us that Γ2 is the Fourier transform of a nonnegative measure. ��

It is not too difficult to check that it is the semi-classical measure μscwhich is behind this formula, but the interest of this lemma is more that ithelps understand what is behind the definition of the Wigner transform, thatit is like the Fourier transform of a two-point correlation function, and thatfor questions of symmetry it is better to use Γ2(h) = C2

(h2 ,

−h2

).

Although I know of no analogous result that would play the role of theBochner theorem for what concerns k-point correlations with k ≥ 3, one maynevertheless obtain partial differential equations satisfied by Ck directly,13

but it might be important to investigate “natural formulations” for the casesk ≥ 3, in parallel with the search for cubic and higher-order corrections insmall amplitude homogenization, or a question which I consider of a greaterimportance, extending the theory of H-measures to semi-linear hyperbolicsystems. My approach is not to try to read what physicists have done, asthey often use what I call pseudo-logic,14 or put in their hypotheses whatthey want to find in the conclusion, but I would not be surprised that amathematical answer might explain some of the strangely efficient formalmethods introduced by FEYNMAN, although by using completely differentideas.15

12 Salomon BOCHNER, Polish-born mathematician, 1899–1982. He had worked inMunchen (Munich), Germany, and at Princeton University, Princeton, NJ.

13 For k = 2, I had made such an observation for an equation of the form unt −

ε2nΔun = fn, where I used natural bounds in the L2 norm of εngrad(un), butPatrick GERARD then taught me a simpler derivation which uses only bounds inthe L2 norm of un, and we then checked that this method gives partial differentialequations for Ck for k ≥ 3.

14 This is how I qualify an “argument” that an hypothesis A seems to imply aconclusion like B, and as one observes something that looks like B it must bethat A is true! Apart from showing a strange lack of imagination, it suggests thatwhoever uses that kind of “reasoning” has never heard of basic logic.

15 One attributes all kinds of statements to FEYNMAN, and it might be true thatafter having shown a formal argument, and then heard a mathematician mentionthat he could prove that in a mathematical way, he had wondered why anyonewould bother to do such a (useless) thing. In the presence of a formal argumentpresented by a physicist, I think that the question for a mathematician is not to

Page 278: From hyperbolic systems to kinetic theory: a personalized quest

264 34 Variants of H-Measures

The first question which I had overlooked concerning H-measures con-cerned taking into account initial conditions for the transport equations thatI had obtained for H-measures, because the scalar case that I had solved wasnot general enough. For the wave equation with smooth coefficients, it wasdone by Gilles FRANCFORT and Francois MURAT,16 with the technical adviceof Patrick GERARD, using classical pseudo-differential operators, but moreremains to be done for general systems.

In the same way, there is not much understood about the boundary con-ditions to impose for H-measures.

The second question which I had overlooked concerned the Dirac equation,and I had asked my student Nenad ANTONIC to look at it,17 but I was sur-prised to see that there was nothing in the answer that suggested a questionabout electrons. The reason was that the zero-order term containing m0 (themass of the electron) plays no role if one considers it independent of n.

What Patrick GERARD observed is that the zero-order term is large andhas a behaviour in 1

ε2nfor a small characteristic length, and that one should

then consider the semi-classical measure associated with εn → 0. To do thisanalysis, he had to freeze the coefficients involving the potentials V and A,assumed to be smooth and given, so that the Dirac equation is then linear inψ, with the velocity of light c, the charge of the electron e and the mass of theelectron m0 appearing in its coefficients. He observed that the equation thathe obtains for the semi-classical measure can be interpreted as describing twotypes of particles, of charge ±e and relativistic mass m0√

1−v2/c2 evolving under

the Lorentz force ±e(E + v×B), with E and B related to V and A as usual,but E and B have not been asked to solve the Maxwell–Heaviside equation. Itshows that DIRAC had really done a superb job in creating his equation, andthe work of Patrick GERARD explains in a mathematical way what the physi-cists had meant by saying that the Dirac equation both describes “electrons”and “positrons”.

It is important to notice that Patrick GERARD’s computation shows thatthe Lorentz force does not exist at the level of the “particles”, here called“electrons” and “positrons” because of the values of their mass and theirelectric charge, but that it is dependent upon m0 appearing explicitly in theequation, and on V and A being smooth enough on a scale much larger thanεn.

In 1984, I had suggested that the term containing m0 should appear as ahomogenization correction and would correspond to the mass being entirelymade of electromagnetic energy stored inside the particles (with “Einstein

make sense of the path that he/she has followed, but usually to understand whathe/she was really looking for.

16 Gilles FRANCFORT, French mathematician, born in 1957. He works at UniversiteParis-Nord, Villetaneuse, France.

17 Nenad ANTONIC, Croatian mathematician. He works in Zagreb, Croatia. He wasmy PhD student (1992) at CMU (Carnegie Mellon University), Pittsburgh, PA.

Page 279: From hyperbolic systems to kinetic theory: a personalized quest

34 Variants of H-Measures 265

equation” e = mc2), and shortly afterwards I read of a similar propositionby BOSTICK (but not involving the Dirac equation), and this should make Vand A change enough on a scale of the order of εn.

These considerations seem to imply that once again one is thrown into thesemi-linear world, which is not understood yet.

Conclusion: It is important to observe then which mathematical results areproven, for what equations, and which are the hypotheses used. It is importantto understand enough about how the accepted “laws of physics” should bemodified when the rules used by physicists seem illogical, because physicistshave assumed some macroscopic equations to be valid at a mesoscopic levelor a microscopic level, while it seems clear that the equations have a differentform at these levels, even though this form might not be understood yet.

It might be useful to recall how ideas about chemistry have evolved. Firstone observed reactions in given proportions, and one assumed that the sameproportions were used at a microscopic level. Then one invoked time for thereactions to take place, so that one entered the realm of ordinary differentialequations. Then one invoked also space for the constituents to be moved tothe place where reactions took place, so that one entered the realm of par-tial differential equations. Then one observed the appearance of small scalescreated by turbulent mixing, and one invented all kind of variants of thermo-dynamics, in order to avoid having to think about what was really happeningat a microscopic level or at a mesoscopic level: one had then moved from earlychemistry to chemical engineering.

At some point one started changing the equations at a microscopic level,because the physicists had invented quantum mechanics,18 without question-ing some of the strange rules that had been invented, and one started com-puting orbitals and requiring larger and larger computers for playing a gamewhich should have been criticized from the start, for example because therules of quantum mechanics had been invented in order to fit what one hadobserved for electromagnetism in the vacuum, and that is hardly the kind ofenvironment that one finds in chemistry!

The art of the engineers makes it possible to tame some phenomena forwhich one does not have the right equations, but it is the role of the scientiststo discover what these missing equations are, as part of their duty to find thereal laws of nature.

It is my feeling that one has not really found the laws of nature, because onehas made the mistake to continue thinking in terms of the classical mechanicsof the 18th century and the continum mechanics of the 19th century, with the18 Many chemists think that they should mimic physicists, and many physicists think

that they should mimic mathematicians, and choose to do astrophysics, probablybecause of an irrational tendency of believing a questionable classification of aphilosopher of sciences, COMTE, who had put mathematics above astronomy,physics and chemistry, in that order.

Page 280: From hyperbolic systems to kinetic theory: a personalized quest

266 34 Variants of H-Measures

mathematics created in the 19th and the 20th century, instead of observingthat the continuum mechanics and the physics of the 20th century requiremathematical tools that are beyond partial differential equations, which shouldbe developed in the 21st century, maybe along the line of what I have starteddoing since the 1970s.

[Taught on Friday November 30, 2001.]

Notes on names cited in footnotes for Chapter 34, LASCOUX,19 COMTE.20

19 Jean LASCOUX, French physicist. He worked at Ecole Polytechnique, Palaiseau,France.

20 Auguste COMTE, French philosopher, 1798–1857. He had worked in Paris, France.

Page 281: From hyperbolic systems to kinetic theory: a personalized quest

35

Biographical Information

[In a reference a-b, a is the lecture number, 0 referring to the Preface, and bthe footnote number in that lecture.]

ABEL, 0-52 BOCHNER, 34-12 CARNEGIE, 2-25ADAMS J.C., 27-9 BOLTZMANN, 0-12 CARNOT S., 1-70AIRY, 33-7 BONAPARTE N., 1-68 CARTAN E., 8-24ALAOGLU, 10-16 BOREL E., 1-73 CARTAN H., 8-23D’ALEMBERT, 24-2 BOSE, 11-5 CAUCHY, 1-12ALEXANDER the G., 2-32 BOSTICK, 23-24 CAVENDISH, 0-47ALFVEN, 1-1 BOYLE, 1-19 CELSIUS, 1-28AL KHWARIZMI, 9-23 BRAHE, 27-5 CHALLIS, 1-50AL MAMUN, 9-26 BRENIER, 5-5 CHAPMAN S., 20-4AMPERE, 1-4 BRILLOUIN, 33-6 CHARLES IV, 0-56ANTONIC, 34-17 BROADWELL, 14-7 CHARLES X, 1-67D’ARC, 0-58 DE BROGLIE L., 0-15 CHARLES J., 1-37AVOGADRO, 1-38 BROUWER, 2-6 CHERET, 7-1−−− BROWN N., 0-53 CHISHOLM-YOUNG, 19-10BABUSKA, 0-29 BROWN R., 1-41 CIORANESCU, 23-14BACHELIER, 8-5 BRUN, 7-2 CLAUSIUS, 1-69BALL W.R., 16-7 BUNYAKOVSKY, 10-11 CLEBSCH, 2-20BANACH, 9-2 BURGERS, 0-19 COIFMAN, 18-6BATEMAN, 4-1 −−− COLE, 4-4BEALE, 17-2 CABANNES, 14-13 COMTE A., 34-20BECQUEREL, 0-60 CAFLISCH, 16-6 CONLEY, 6-5BELLMAN, 4-7 CALDERON A., 13-4 CORIOLIS, 23-22BENILAN, 4-11 CALVIN, 0-6 CORNELL, 0-55BERKELEY, 0-57 CANTOR, 20-2 COURANT, 1-71BERNOULLI D., 10-3 CARATHEODORY, 4-6 CRAFOORD, 13-16BESSEL, 2-35 CARLEMAN, 13-2 CRANDALL, 4-12BIOT, 1-6 CARLESON, 14-27 CRISTIN, 1-29

Page 282: From hyperbolic systems to kinetic theory: a personalized quest

268 35 Biographical Information

CURIE, 0-44 FRANCFORT, 34-16 HERMITE, 32-8−−− FRECHET, 9-22 HERSCHEL, 27-7DAFERMOS C., 0-25 FRIEDRICHS, 1-72 HESSE, 6-2D’ALEMBERT, 24-2 FROBENIUS, 2-21 HILBERT D., 0-40D’ARC, 0-58 FROSTMAN, 14-28 HIRZEBRUCH, 2-31DAUTRAY, 0-4 FUBINI, 9-4 HODGE, 16-1DA VINCI, 0-33 FULLER, 1-64 HOLDER O., 20-1DE BROGLIE L., 0-15 −−− HOPF E., 0-36DEBYE, 23-15 GAGLIARDO, 18-10 HOPKINS, 4-16DE GIORGI, 18-1 GALILEI, 1-16 HORMANDER, 2-11DE KLEIN, 8-10 GALLE, 27-11 Hugo of St Victor, 0-42DE PRETTO, 23-25 GANTMAKHER, 12-12 HUGONIOT, 1-54DE VRIES, 0-21 GARDING, 2-10 −−−DIDEROT, 0-46 GATIGNOL, 14-1 ILLNER, 16-8DI PERNA, 0-24 GAUSS, 1-5 ITO, 0-51DIRAC, 0-17 GAY LUSSAC, 1-35 −−−DIRICHLET, 2-33 GEL’FAND, 6-4 JACOBI, 3-6DUHEM, 1-56 GEORGE II, 6-12 JENSEN J.H., 32-17DUKE, 0-54 GERARD P., 31-4 JOHN, 13-8DUNFORD, 15-2 GERMAIN, 8-1 JORDAN C., 12-8−−− GERSHGORIN, 12-5 JOST, 8-2EARNSHAW, 3-2 GLIMM, 0-61 JOULE, 1-34EINSTEIN, 0-14 GODUNOV, 2-2 −−−EKSTROM, 1-30 GOEPPERT MAYER, 32-16 KANIEL, 30-9ENSKOG, 20-5 GOLSE, 31-6 KAWASHIMA, 17-5EOTVOS L., 14-32 GOUDSMIT, 8-20 KELLER J.B., 33-8EUCLID, 2-13 GRAD, 14-20 Kelvin, 1-33EULER, 0-41 GREEN, 8-15 KEPLER, 1-11−−− GRONWALL, 28-2 KEYFITZ, 4-10FAHRENHEIT, 1-26 GUIRAUD, 9-21 KIRCHGASSNER, 27-6FARADAY, 1-8 −−− KIRCHHOFF, 1-48FEFFERMAN C., 14-22 HAAR, 10-8 KNOPS, 23-12FEIX, 34-9 HADAMARD, 9-24 KNUDSEN, 30-1FERDINAND II, 1-23 HAMDACHE, 17-6 KODAIRA, 4-21FERMAT, 10-1 HAMILTON, 1-18 KOHN J., 32-9FEYNMAN, 1-10 HARDINGE, 19-11 KOLMOGOROV, 8-16FICK, 4-18 HARDY, 13-7 KOLODNER, 13-3FIELDS, 0-45 HARVARD, 1-76 KORTEWEG, 0-20FOIAS, 2-8 HEATH, 8-6 KRONECKER, 2-12FOKKER, 8-10 HEAVISIDE, 1-3 KRUZHKOV, 0-37FORSYTH, 4-20 HEDBERG, 16-4 KURTZ, 20-8FOURIER J.-B., 2-30 HERIOT, 21-8 −−−

Page 283: From hyperbolic systems to kinetic theory: a personalized quest

35 Biographical Information 269

LADYZHENSKAYA, 2-7 MELLON A., 2-26 PETTIS, 15-3LAGRANGE, 1-14 METIVIER, 21-3 PHILLIPS, 0-31LALANDE, 27-12 MEYER Y., 13-11 PIATETSKI-SHAPIRO, 18-11LAMBERT, 29-5 MICHELSON, 23-7 PIOLA, 1-47LAME, 26-6 MIELKE, 21-10 PLANCHEREL, 9-6LANDAU L.D., 21-5 MILNOR, 13-12 PLANCK, 0-48LAPLACE, 1-43 MIMURA, 14-16 POINCARE H., 0-13LASCOUX, 34-19 MORAWETZ, 2-1 POISSON, 1-42LAX, 0-18 MORLEY, 23-8 PURDUE, 1-75LEBESGUE, 1-74 MUNCASTER, 28-3 −−−LENNARD JONES, 23-16 MURAT F., 0-10 RADON, 4-15LERAY, 2-4 −−− RAIZER, 9-18LE ROND, 24-2 NAPOLEON I, 1-68 RANKINE, 1-53LEVERRIER, 27-10 NAVIER, 0-1 Rayleigh, 1-55LEWY, 4-9 NECAS, 30-12 REAUMUR, 1-27LIFSCHITZ, 21-6 NEEL, 1-62 REED, 17-7LIGGETT, 14-15 NEUMANN F., 2-34 VAN RENNSELAER, 4-19LINNE, 1-31 NEWTON, 0-16 REY, 1-22LIONS J.-L., 0-5 NIRENBERG, 13-9 REYNOLDS, 30-6LIONS P.-L., 14-21 NISHIDA, 0-26 RICE W.M., 18-15LIPSCHITZ, 1-13 NOBEL, 0-59 RIEMANN, 1-51LITTLEWOOD, 16-3 NOETHER A., 2-18 RIESZ F., 13-13LITTMAN, 34-11 −−− RIESZ M., 13-6LIU C., 17-3 OCCHIALINI, 8-21 ROBBIN, 16-2LIU T.P., 3-8 OLEINIK, 0-38 ROCKEFELLER, 8-22LOMONOSOV, 9-25 ORNSTEIN L., 8-12 RUSSELL D., 26-3LORENTZ G.G., 13-18 OVADIA, 7-3 RUTGERS, 6-10LORENTZ H.A., 1-3 −−− −−−LOVASZ, 14-29 PALLU DE LA B., 21-4 SADLEIR, 13-15LUCAS H., 0-43 PAPANICOLAOU, 19-9 SAINT-VENANT, 2-27LYAPUNOV A.M., 30-3 PARRY, 3-3 SANCHEZ-PALENCIA, 0-30−−− PARSEVAL, 11-7 SANTORIO, 1-21MAGENES, 26-8 PASCAL, 10-2 SAVART, 1-7MAJDA A., 4-13 PAUL, T., 34-8 SAVILE, 13-14MANDEL, 5-3 PECCOT, 6-8 SCHAUDER, 2-5MARCINKIEWICZ, 13-17 PEETRE, 14-26 SCHMIDT, 32-18MARIOTTE, 1-20 PEGO, 6-3 SCHRODINGER, 0-49MARKOV, 12-1 PERCUS, 25-7 SCHULENBERGER, 18-12MASLOV, 34-3 PERRIN, 1-40 SCHWARTZ L., 0-9MAXWELL, 0-11 PERRON, 12-2 SCHWARZ, 10-12DE MEDICI F., 1-23 PERTHAME, 30-10 SCHWINGER, 1-66DE MEDICI L., 1-25 PESZEK, 17-4 SEDLEY, 20-9

Page 284: From hyperbolic systems to kinetic theory: a personalized quest

270 35 Biographical Information

SEMMES, 18-7 TADMOR, 30-11 WATT, 21-9SENTIS, 31-7 TARTAR, 4-14 WAYNE, 13-19SERRIN, 2-9 TAYLOR B., 8-17 WEIERSTRASS, 10-15SHINBROT, 29-1 THOM, 3-5 WEIL A., 2-28SIEGEL, 6-9 THOMSON, 1-33 WIENER, 8-8SINAI, 33-10 THOMPSON, 14-30 WIGNER, 32-5SMOLLER, 0-23 THROOP, 4-17 WILCOX, 18-13SOBOLEV, 0-3 TOMONAGA, 1-65 WOLF, 0-50SOUTHWELL, 1-24 TRUESDELL, 14-17 −−−SPAGNOLO, 0-28 TULANE, 15-5 YALE, 14-31STANFORD, 0-32 −−− YEVICK, 25-8STEIN, 14-23 UHLENBECK G., 8-13 YOUNG L.C., 19-3STEKLOV, 2-29 −−− YOUNG W.H., 19-8STEVENS, 23-26 DE LA VALLEE POUSSIN, 15-4 YUKAWA, 23-13STOKES, 0-2 VARADHAN S.R.S., 17-1 −−−STROMER, 1-32 VLASOV, 27-4 ZARANTONELLO E., 0-22STRUTT, 1-55 VOGELIUS, 6-11 ZEEMAN, 1-63SYNGE, 2-24 −−− ZEL’DOVICH, 9-17−−− WASHINGTON, 18-14 ZYGMUND, 13-5

Page 285: From hyperbolic systems to kinetic theory: a personalized quest

36

Abbreviations and Mathematical Notation

Abbreviations for states: For those not familiar with geography, I have men-tioned England, Scotland, and Wales, without mentioning that they are partof UK (United Kingdom), I have mentioned British Columbia and Ontario,without mentioning that they are part of Canada, and I have mentioned afew of the fifty states in the United States of America: AZ = Arizona, CA =California, CO = Colorado, CT = Connecticut, IL = Illinois, IN = Indiana,KY = Kentucky, LA = Louisiana, MA = Massachusetts, MD = Maryland,MI = Michigan, MN = Minnesota, MO = Missouri, NC = North Carolina,NJ = New Jersey, NM = New Mexico, NY = New York, OH = Ohio, PA =Pennsylvania, RI = Rhode Island, TX = Texas, UT = Utah, VA = Virginia,WI = Wisconsin.

• a.e.: almost everywhere.• B(x, r): open ball centred at x and radius r > 0, i.e. {y ∈ E | ||x−y||E < r}(in a normed space E).• BMO(RN ): space of functions of bounded mean oscillation on R

N , i.e. semi-

norm ||u||BMO < ∞, with ||u||BMO = supcubes Q

∫Q

|u−uQ| dx|Q| < ∞ (uQ =

∫Qu dx

|Q| , |Q| = meas(Q)).• BV (Ω): space of functions of bounded variation in Ω, whose partial deriva-tives (in the sense of distributions) belong to Mb(Ω), i.e. have finite totalmass.• C(Ω): space of scalar continuous functions in an open set Ω ⊂ R

N (E0(Ω)in the notation of L. SCHWARTZ).• C(Ω; Rm): space of continuous functions from an open set Ω ⊂ R

N intoRm.

• C(Ω): space of scalar continuous and bounded functions on Ω, for an openset Ω ⊂ R

N .• C0(Ω): space of scalar continuous bounded functions tending to 0 at theboundary of an open set Ω ⊂ R

N , equipped with the sup norm.

Page 286: From hyperbolic systems to kinetic theory: a personalized quest

272 36 Abbreviations and Mathematical Notation

• Cc(Ω): space of scalar continuous functions with compact support in anopen set Ω ⊂ R

N .• Ckc (Ω): space of scalar functions of class Ck with compact support in anopen set Ω ⊂ R

N .• Ck(Ω): space of scalar continuous functions with continuous derivatives upto order k in an open set Ω ⊂ R

N .• Ck(Ω): restrictions to Ω of functions in Ck(RN ), for an open set Ω ⊂ R

N .• C0,α(Ω): space of scalar Holder continuous functions of order α ∈ (0, 1)(Lipschitz continuous functions if α = 1), i.e. bounded functions for whichthere exist M such that |u(x) − u(y)| ≤ M |x− y|α for all x, y ∈ Ω ⊂ R

N ; itis included in C(Ω).• Ck,α(Ω): space of functions of Ck(Ω) whose derivatives of order k belongto C0,α(Ω) ⊂ C(Ω), for an open set Ω ⊂ R

N .• curl: rotational operator (curl(u))i =

∑jk εijk

∂uj

∂xk, used for open sets Ω ⊂

R3.

• Dα: ∂α1

∂xα11. . . ∂

αN

∂xαNN

(for a multi-index α with αj nonnegative integers, j =

1, . . . , N).• D′(Ω): space of distributions T in Ω, dual of C∞

c (Ω) (D(Ω) in the no-tation of L. SCHWARTZ, equipped with its natural topology), i.e. for ev-ery compact K ⊂ Ω there exists C(K) and an integer m(K) ≥ 0 with|〈T, ϕ〉| ≤ C(K) sup|α|≤m(K) ||Dαϕ||∞ for all ϕ ∈ C∞

c (Ω) with support inK.• div: divergence operator div(u) =

∑i∂ui

∂xi.

• F : Fourier transform, Ff(ξ) =∫

RN f(x)e−2iπ(x,ξ) dx.• F : inverse Fourier transform, Ff(ξ) =

∫RN f(x)e+2iπ(x,ξ) dx.

• grad(u): gradient operator, grad(u) =(∂u∂x1

, . . . , ∂u∂xN

).

• Hs(RN ): Sobolev space of temperate distributions (∈ S′(RN )), or functionsin L2(RN ) if s ≥ 0, such that (1 + |ξ|2)s/2Fu ∈ L2(RN ) (L2(RN ) for s = 0,W s,2(RN ) for s a positive integer).• Hs(Ω): space of restrictions to Ω of functions from Hs(RN ) (for s ≥ 0), foran open set Ω ⊂ R

N .• Hs

0(Ω): for s ≥ 0, closure of C∞c (Ω) in Hs(Ω), for an open set Ω ⊂ R

N .• H−s(Ω): for s ≥ 0, dual of Hs

0(Ω), for an open set Ω ⊂ RN .

• H(div;Ω): space of functions u ∈ L2(Ω; RN ) with div(u) ∈ L2(Ω), for anopen set Ω ⊂ R

N .• H(curl;Ω): space of functions u ∈ L2(Ω; R3) with curl(u) ∈ L2(Ω; R3), foran open set Ω ⊂ R

3.• H1(RN ): Hardy space of functions f ∈ L1(RN ) with Rjf ∈ L1(RN ), j =1, . . . , N , where Rj , j = 1, . . . , N are the (M.) Riesz operators.• H(θ): class of Banach spaces satisfying (E0, E1)θ,1;J ⊂ E ⊂ (E0, E1)θ,∞;K .• ker(A): kernel of a linear operator A ∈ L(E;F ), i.e. {e ∈ E | Ae = 0}.• L(E;F ): space of linear continuous operators M from the normed space Einto the normed space F , i.e. with ||M ||L(E;F ) = supe=0

||M e||F||e||E <∞.

Page 287: From hyperbolic systems to kinetic theory: a personalized quest

36 Abbreviations and Mathematical Notation 273

• Lp(A), L∞(A): Lebesgue space of (equivalence classes of a.e. equal) mea-surable functions u with ||u||p =

(∫A |u(x)|p dx)1/p < ∞ if 1 ≤ p < ∞, with

||u||∞ = inf{M | |u(x)| ≤ M a.e. in A} < ∞, for a Lebesgue measurable setA ⊂ R

N (spaces also considered for the induced (N−1)-dimensional Hausdorffmeasure if A = ∂Ω for an open set Ω ⊂ R

N with a smooth boundary).• Lploc(A): (equivalence classes of) measurable functions whose restriction toevery compact K ⊂ A belongs to Lp(K) (for 1 ≤ p ≤ ∞), for a Lebesguemeasurable set A ⊂ R

N .• Lp((0, T );E

): (weakly or strongly) measurable functions u from (0, T ) into

a separable Banach space E, such that t �→ ||u(t)||E belongs to Lp(0, T ) (for1 ≤ p ≤ ∞).• |α|: length of a multi-index α = (α1, . . . , αN ), |α| = |α1| + . . .+ |αN |.• Lip(Ω): space of scalar Lipschitz continuous functions, also denoted C0,1(Ω),i.e. bounded functions for which there exists M such that |u(x) − u(y)| ≤M |x− y| for all x, y ∈ Ω ⊂ R

N ; it is included in C(Ω).• loc: for any space Z of functions in an open set Ω ⊂ R

N , Zloc is the spaceof functions u such that ϕu ∈ Z for all ϕ ∈ C∞

c (Ω).

• M f : maximal function of f , i.e. M f(x) = supr>0

∫B(x,r)

|f(y)|dy|B(x,r)| .

• M(Ω): space of Radon measures μ in an open set Ω ⊂ RN , dual of Cc(Ω)(equipped with its natural topology), i.e. for every compact K ⊂ Ω thereexists C(K) with |〈μ, ϕ〉| ≤ C(K)||ϕ||∞ for all ϕ ∈ Cc(Ω) with support in K.• Mb(Ω): space of Radon measures μ with finite total mass in an open setΩ ⊂ R

N , dual of C0(Ω), the space of continuous bounded functions tendingto 0 at the boundary of Ω (equipped with the sup norm), i.e. there exists Cwith |〈μ, ϕ〉| ≤ C ||ϕ||∞ for all ϕ ∈ Cc(Ω).• meas(A): Lebesgue measure of A, sometimes denoted |A|.• | · |: norm in H , or sometimes the Lebesgue measure of a set.• || · ||: norm in V .• || · ||∗: dual norm in V ′.• p′: conjugate exponent of p ∈ [1,∞], i.e. 1

p + 1p′ = 1.

• p∗: Sobolev exponent of p ∈ [1, N), i.e. 1p∗ = 1

p − 1N for Ω ⊂ RN and N ≥ 2.

• R+: (0,∞).• R

N+ : {x ∈ R

N | xN > 0}.• R(A): range of a linear operator A ∈ L(E;F ), i.e. {f ∈ F | f = Ae forsome e ∈ E}.• Rj : Riesz operators, j = 1, . . . , N , defined by F(Rju)(ξ) = i ξjFu(ξ)

|ξ| onL2(RN ); natural extensions to R

N of the Hilbert transform, they map Lp(RN )into itself for 1 < p <∞, and L∞(RN ) into BMO(RN ).• S(RN ): Schwartz space of functions u ∈ C∞(RN ) with xαDβu bounded forall multi-indices α, β with αj , βj nonnegative integers for j = 1, . . . , N .• S′(RN ): temperate distributions, dual of S(RN ), i.e. T ∈ D′(RN ) and thereexists C and an integer m ≥ 0 with |〈T, ψ〉| ≤ C sup|α|,|β|≤m ||xαDβψ||∞ forall ψ ∈ S(RN ).

Page 288: From hyperbolic systems to kinetic theory: a personalized quest

274 36 Abbreviations and Mathematical Notation

• �: convolution product (f � g)(x) =∫

RN f(x− y)g(y) dy.• supp(·): support; for a continuous function u from a topological space intoa vector space, it is the closure of {x | u(x) �= 0}, but for a locally integrablefunction f , a Radon measure μ, or a distribution T defined on an open setΩ ⊂ R

N , it is the complement of the largest open set ω where f , μ, or T is 0,i.e. where

∫ω ϕf dx = 0, or 〈μ, ϕ〉 = 0 for all ϕ ∈ Cc(Ω), or 〈T, ϕ〉 = 0 for all

ϕ ∈ C∞c (Ω).

• Wm,p(Ω): Sobolev space of functions in Lp(Ω) whose derivatives (in thesense of distributions) of length ≤ m belong to Lp(Ω), for an open setΩ ⊂ R

N .•Wm,p(Ω; Rm): Sobolev space of functions fromΩ into R

m whose componentsbelong to Wm,p(Ω), for an open set Ω ⊂ R

N .• x′: in R

N , x = (x′, xN ), i.e. x′ = (x1, . . . , xN−1).• xα: xα1

1 . . . xαN

N for a multi-index α with αj nonnegative integers for j =1, . . . , N , for x ∈ R

N .

• Δ: Laplacian∑N

j=1∂2

∂x2j

, defined on any open set Ω ⊂ RN .

• δij : Kronecker symbol, equal to 1 if i = j and equal to 0 if i �= j (fori, j = 1, . . . , N).• εijk: for i, j, k ∈ {1, 2, 3}, completely antisymmetric tensor, equal to 0 if twoindices are equal, and equal to the signature of the permutation 123 �→ ijk ifindices are distinct (i.e. ε123 = ε231 = ε312 = +1 and ε132 = ε321 = ε213 =−1).• γ0: trace operator, defined for smooth functions by restriction to the bound-ary ∂Ω, for an open set Ω ⊂ R

N with a smooth boundary, and extended bydensity to functional spaces in which smooth functions are dense.• Λ1: Zygmund space, |u(x+h)+u(x−h)− 2u(x)| ≤M |h| for all x, h ∈ R

N .• ν: exterior normal to Ω ⊂ R

N , open set with Lipschitz boundary.• ρε: smoothing sequence, with ρε(x) = 1

εN ρ1

(xε

)with ε > 0 and ρ1 ∈

C∞c (RN ) with

∫x∈RN ρ1(x) dx = 1, and usually ρ1 ≥ 0.

• τh: translation operator of h ∈ RN , acting on a function f ∈ L1

loc(RN ) by

τhf(x) = f(x− h) a.e. x ∈ RN .

• ΩF : {x ∈ RN | xN ≥ F (x′)}, for a continuous function F , where x′ =

(x1, . . . , xN−1).

Page 289: From hyperbolic systems to kinetic theory: a personalized quest

References

[1] CARLEMAN T., Problemes mathematiques dans la theorie cinetique des gaz,Publ. Sci. Inst. Mittag-Leffler. 2, Almqvist & Wiksells Boktryckeri Ab, Upp-sala 1957, 112 pp.

[2] CHERET R., Detonation of Condensed Explosives, Springer-Verlag, New York,1993.

[3] COURANT R. & FRIEDRICHS K.O., Supersonic Flow and Shock Waves, In-terscience Publishers, Inc., New York, 1948, xvi+464 pp. Reprinting of the1948 original, Applied Mathematical Sciences, Vol. 21. Springer-Verlag, NewYork-Heidelberg, 1976, xvi+464 pp.

[4] DAFERMOS C., Hyperbolic Conservation Laws in Continuum Physics (Grund-lehren der Mathematischen Wissenschaften, 325. Springer-Verlag, Berlin,2000, xvi+443 pp.

[5] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 1. Physical Origins andClassical Methods, xviii+695 pp., Springer-Verlag, Berlin-New York, 1990.

[6] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 2. Functional and Varia-tional Methods, xvi+561 pp., Springer-Verlag, Berlin-New York, 1988.

[7] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 3. Spectral Theory and Ap-plications, x+515 pp., Springer-Verlag, Berlin, 1990.

[8] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 4. Integral Equations andNumerical Methods, x+465 pp., Springer-Verlag, Berlin, 1990.

[9] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 5. Evolution Problems. I,xiv+709 pp., Springer-Verlag, Berlin, 1992.

[10] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 6. Evolution Problems. II,xii+485 pp., Springer-Verlag, Berlin, 1993.

[11] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 7. Evolution: Fourier,Laplace, xliv+344+xix pp., INSTN: Collection Enseignement. Masson, Paris,1988 (reprint of the 1985 edition).

Page 290: From hyperbolic systems to kinetic theory: a personalized quest

276 References

[12] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 8. Evolution: semi-groupe,variationnel, xliv+345–854+xix pp., INSTN: Collection Enseignement. Mas-son, Paris, 1988 (reprint of the 1985 edition).

[13] DAUTRAY Robert & LIONS Jacques-Louis, Mathematical Analysis and Nu-merical Methods for Science and Technology, Vol. 9. Evolution: numerique,transport, xliv+855–1303 pp., INSTN: Collection Enseignement. Masson,Paris, 1988 (reprint of the 1985 edition).

[14] FEYNMAN R., LEIGHTON R.B. & SANDS M., The Feynman Lectures onPhysics: The Definitive and Extended Edition, 3 vol, Addison-Wesley, 2005.

[15] FEYNMAN R., Surely You’re Joking, Mr. Feynman!, Vintage, UK, 1992.[16] GATIGNOL R., Theorie cinetique des gaz a repartition discrete de vitesses,

Lecture Notes in Physics 36, Springer, Berlin, 1975.[17] TARTAR L., Une introduction a la theorie mathematique des systemes hyper-

boliques de lois de conservation, Publicazioni 682, Istituto di Analisi Numer-ica, Pavia, 1989.

[18] TARTAR L., H-measures, a new approach for studying homogenisation, oscil-lations and concentration effects in partial differential equations. Proc. Roy.Soc. Edinburgh Sect. A 115, 1990, no. 3-4, 193–230.

[19] TARTAR L., Compensation effects in partial differential equations. Memoriedi Matematica e Applicazioni, Rendiconti della Accademia Nazionale delleScienze detta dei XL, Ser. V, vol. XXIX, 2005, 395–454.

[20] TARTAR L., An Introduction to Navier–Stokes Equation and Oceanography,271 pp., Lecture Notes of Unione Matematica Italiana, Vol. 1, Springer,Berlin-Heidelberg-New York, 2006.

[21] TARTAR L., An Introduction to Sobolev Spaces and Interpolation Spaces, 248pp., Lecture Notes of Unione Matematica Italiana, Vol. 3, Springer, Berlin-Heidelberg-New York, 2007.

[22] TRUESDELL C. & MUNCASTER R., Fundamentals of Maxwell’s Kinetic The-ory of a Simple Monatomic Gas. Treated as a Branch of Rational Mechanics.Pure and Applied Mathematics, 83. Academic Press, Inc. [Harcourt BraceJovanovich, Publishers], New York-London, 1980, xxvii+593 pp.

Page 291: From hyperbolic systems to kinetic theory: a personalized quest

Index

balance of momentum, 10, 209

Boltzmann equation, 37, 68, 85, 116,126–129, 168, 189–191, 198,222–225, 228, 229, 231, 235–238,263

Borel measures, 85, 86Broadwell model, 119, 122, 124, 126,

131, 134, 142, 147–149, 153, 161,163, 167, 168, 170–172, 175, 177,179, 181, 182, 185, 186, 234,236–238

Burgers equation, 12, 13, 26, 34, 36, 39,40, 45, 47, 55–61, 63–65, 126

Burgers–Hopf equation, 39

Carleman model, 111, 126, 132, 148,152, 159, 160, 165–170, 172, 174,175, 186

Cauchy problem, 20, 24, 32, 53Cauchy stress, 10, 198, 201, 208, 214,

223, 224, 238CFL condition, 41, 42, 53, 94

Chapman–Enskog procedure, 168, 235,238

classical mechanics, 1, 3, 179, 190, 192,193, 196, 215, 216, 221, 267

collision invariants, 225, 227

collision operator, 116collisions, 3, 5, 65, 66, 94, 95, 115–117,

119, 129, 145, 167, 172, 185, 190,216, 218, 219, 221, 223, 235

condition (S), 137, 139, 142, 148, 153,154, 156, 182

conservation laws, 10, 17, 23, 37, 38, 40,46, 50, 52, 61, 168, 181, 182, 184,185, 223, 237–239

conservation of angular momentum,116, 218, 224

conservation of charge, 152, 197conservation of energy, 10, 11, 116–118,

145, 218, 234, 254conservation of mass, 6, 8, 10, 12, 52,

107, 109, 111, 116–118, 127, 134,143, 146–148, 152, 165, 167, 181,182, 197, 199, 209, 218, 224, 237

conservation of momentum, 111,116–118, 134, 143, 146, 147, 167,181, 182, 218

contact discontinuities, 18, 24, 29, 32,35, 36, 237

continuum mechanics, 1, 3, 6, 14, 17,18, 49, 54, 86, 179, 189, 192, 193,196, 250, 257, 261, 267, 268

Coriolis force, 197, 209correlation measures, 264

Dirac equation, 95, 255, 266, 267

eikonal equation, 260Einstein equation, 266entropies, 11, 18, 23, 24, 38, 51, 52entropy, 6, 10–12, 23, 35, 38, 113, 117,

119, 122, 131–134, 167, 228entropy condition, 24equation of state, 12, 21, 22, 33, 37,

225, 237equipartition of energy, 254

Page 292: From hyperbolic systems to kinetic theory: a personalized quest

278 Index

Euler equation, 3, 6, 8, 10, 127, 168,222, 235, 237–239

Eulerian point of view, 10

finite differences, 41, 50, 93finite elasticity, 4, 194finite speed of propagation, 5, 19, 42,

96, 122, 131, 132, 134, 149, 165,167, 169

fluid dynamical limit, 216fluid quantities, 197, 223Fokker–Planck equation, 66, 68, 223,

231

Galilean invariance, 8, 21, 22, 35, 36,41, 45, 57, 61, 128, 156

Galilean transformations, 45, 64, 192gas dynamics, 10, 12, 18, 21, 25, 26, 28,

35–38, 53geometric theory of diffraction, 256geometrical optics, 193, 256, 259, 260

H-measures, 55, 175, 194, 242, 245, 249,250, 252–257, 259–263, 265, 266

Hamilton–Jacobi equation, 40, 260heat, 6, 11, 12, 23, 52, 94, 208, 239heat conductivity, 12, 23, 127heat equation, 40, 65–68, 73, 76–78, 85,

86, 94, 95heat flux, 94, 224, 237Hilbert expansion, 216, 235, 238hyperbolic equations, 42, 264hyperbolic orbits, 221hyperbolic systems, 95, 192hyperbolicity, 19–21, 23, 27, 179, 192,

193, 256, 257

ideal fluids, 3, 6, 168, 235internal energy, 6, 11, 12, 20, 94, 162,

208, 216, 224, 254internal forces, 208internal structure, 115

jump condition, 10, 11

Lagrangian point of view, 9, 10Lame equation, 214Laplace/Poisson equation, 194, 216latent heat, 215Lax condition, 36, 54

Lax E-condition, 47, 49, 50linear hyperbolic equations, 247, 248,

252linear hyperbolic systems, 18, 19, 107,

252linearized elasticity, 193, 194, 203, 213,

214, 252Lorentz force, 1, 81, 195, 197, 198, 209,

266Lorentz group, 3, 192

Maxwell equation, 1–3, 96, 192, 194,195, 216, 252, 255, 256, 266

micro-local defect measures, 242molecular dynamics, 210

Navier–Stokes equation, 18, 168, 222,235, 238

Oleinik E-condition, 46, 47, 49–54

parametrized measures, 174, 175Percus–Yevick equation, 210, 263Piola–Kirchhoff stress, 10

quantum mechanics, 66, 193, 215, 256,267

quasi-linear hyperbolic systems, 17, 18,20, 21, 32, 36–38, 46, 55, 60, 168,185, 237–239

Radon measures, 43, 51, 52, 85–89, 91,112, 154, 155, 174, 199–203, 208,209, 250, 253, 260

Rankine–Hugoniot condition, 11, 32–36,45–47, 50, 51, 53, 54, 58, 61

rarefaction waves, 9, 24, 28, 48, 49, 57,61

real fluids, 222, 239Riemann invariants, 10, 18, 24–27, 35Riemann problem, 18, 24, 26–29, 59, 61,

238

Schrodinger equation, 66, 69, 95, 257semi-classical measures, 261–263, 265,

266semi-linear hyperbolic systems, 3, 148,

153, 155, 256, 265shocks, 12, 17, 18, 24, 29, 32, 35–38, 41,

53–55, 57–60, 62, 185, 237, 238

Page 293: From hyperbolic systems to kinetic theory: a personalized quest

Index 279

thermodynamical quantities, 6, 11, 12,21–23, 34, 35, 38

thermodynamics, 6, 9, 11, 22, 23, 26,36, 37, 55, 110, 257, 267

viscous fluids, 18

Vlasov equation, 216

wave equation, 8, 13, 193, 202, 203, 207,208, 211, 212, 214, 252, 253, 255,256, 259, 260, 264, 266

weak solutions, 32–35, 38, 46, 47, 53Wigner measures, 262

Young measures, 162, 174, 175, 179,247, 248

Page 294: From hyperbolic systems to kinetic theory: a personalized quest

Editor in Chief: Franco Brezzi

Editorial Policy

1. The UMI Lecture Notes aim to report new developments in all areas of mathematicsand their applications - quickly, informally and at a high level. Mathematical textsanalysing new developments in modelling and numerical simulation are also welcome.

2. Manuscripts should be submitted (preferably in duplicate) toRedazione Lecture Notes U.M.I.Dipartimento di MatematicaPiazza Porta S. Donato 5I – 40126 Bolognaand possibly to one of the editors of the Board informing, in this case, the Redazioneabout the submission. In general, manuscripts will be sent out to external referees forevaluation. If a decision cannot yet be reached on the basis of the first 2 reports, furtherreferees may be contacted. The author will be informed of this. A final decision topublish can be made only on the basis of the complete manuscript, however a refereeingprocess leading to a preliminary decision can be based on a pre-final or incompletemanuscript. The strict minimum amount of material that will be considered shouldinclude a detailed outline describing the planned contents of each chapter, abibliography and several sample chapters.

3. Manuscripts should in general be submitted in English. Final manuscripts shouldcontain at least 100 pages of mathematical text and should always include

– a table of contents;– an informative introduction, with adequate motivation and perhaps some

historical remarks: it should be accessible to a reader not intimately familiarwith the topic treated;

– a subject index: as a rule this is genuinely helpful for the reader.

4. For evaluation purposes, manuscripts may be submitted in print or electronic form (printform is still preferred by most referees), in the latter case preferably as pdf- or zippedps- files. Authors are asked, if their manuscript is accepted for publication, to use theLaTeX2e style files available from Springer’s web-server atftp://ftp.springer.de/pub/tex/latex/svmonot1/ for monographsand atftp://ftp.springer.de/pub/tex/latex/svmultt1/ for multi-authored volumes

5. Authors receive a total of 50 free copies of their volume, but no royalties. They areentitled to a discount of 33.3% on the price of Springer books purchased for theirpersonal use, if ordering directly from Springer.

6. Commitment to publish is made by letter of intent rather than by signing a formalcontract. Springer-Verlag secures the copyright for each volume. Authors are free toreuse material contained in their LNM volumes in later publications: A brief written (ore-mail) request for formal permission is sufficient.