numerical approximation of the ohta{kawasaki...

Numerical Approximation of

the Ohta–Kawasaki Functional

Quentin Parsons

Kellogg College

University of Oxford

A thesis submitted for the degree of

M.Sc. in Mathematical Modelling and Scientific Computing

Trinity 2012

Acknowledgements

First and foremost, I would like to say a special word of thanks to my de-

partmental and thesis project supervisor, Prof. Endre Suli. His assistance

in crafting the results presented in Chapter 2 and his patient explanation

of various aspects of the problem to me are very much appreciated. His

enthusiasm for his subject and his vast, yet seemingly instantly available

knowledge of this branch of mathematics are consistently awe-inspiring.

I’m also indebted to him for repeatedly reading and reviewing my clumsy

attempts at type-setting this document.

I would also like to express my gratitude to Dr Kathryn Gillow for her

help, guidance and faith in me throughout this year at Oxford. Her help

in uncovering the source of ‘the missing mass’ in my finite element code

was instrumental in keeping me sane and on track at a very trying time.

This publication was based on work supported in part by Award No KUK-

C1-013-04, made by King Abdullah University of Science and Technology

(KAUST).

Abstract

We consider the Ohta–Kawasaki functional as a model for the free energy

of a diblock copolymer melt.

Following a brief overview of the physical parameters and quantities in-

volved, we derive the related Ohta–Kawasaki dynamic equation using a

suitable gradient flow method, and exhibit the result as a coupled system

of partial differential equations. Mass conservation is demonstrated after

which boundedness and stability proofs are presented in detail.

We then examine the time-sequence of finite element approximations to

the evolution problem. Using the weak form analysis as a model, we es-

tablish mass conservation, boundedness and finally, stability. We defer

convergence and error analysis of the numerical method, but we do im-

plement it in Matlab in one and two spatial dimensions. Full details

of the numerical algorithm are presented, including a simple, fixed point

method for resolving the non-linearity in the problem.

Thereafter, we use the Matlab code to study the effects of varying the

mass and non-local energy coefficient in two dimensions and find that our

results are consistent with those derived by other numerical methods, as

well as physical laboratory experiments conducted with diblock copoly-

mers.

We conclude with a set of suggestions for further work on the problem.

A website (http://people.maths.ox.ac.uk/parsons/) was created to accom-

pany this thesis. It includes links to animations of the two dimensional

evolution simulations, a full specification of the Matlab implementation

and finally, results of some of the one dimensional experiments that were

conducted.

This on-line version includes corrections made to the hardcopy that was

submitted to the Oxford Exam Schools on 6 September, 2012.

http://people.maths.ox.ac.uk/parsons/

Contents

1 Introduction 1

1.1 Statement of the Ohta–Kawasaki functional . . . . . . . . . . . . . . 2

1.2 Deriving the Ohta–Kawasaki functional . . . . . . . . . . . . . . . . . 3

2 Analysis 5

2.1 Context . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2.2 The dynamic equation by H−1(Ω) gradient flow . . . . . . . . . . . . 7

2.3 The weak form of the dynamic equation . . . . . . . . . . . . . . . . 8

2.3.1 Mass conservation . . . . . . . . . . . . . . . . . . . . . . . . . 9

2.3.2 Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

2.3.3 Boundedness . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

2.3.4 Stability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

2.4 The finite element approximation . . . . . . . . . . . . . . . . . . . . 18

2.4.1 Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

2.4.2 Mass conservation . . . . . . . . . . . . . . . . . . . . . . . . . 21

2.4.3 Boundedness . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

2.4.4 Stability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

3 Implementation 36

3.1 A specific numerical scheme . . . . . . . . . . . . . . . . . . . . . . . 36

3.2 Results summary: two dimensional space . . . . . . . . . . . . . . . . 40

3.2.1 The effect of varying mass . . . . . . . . . . . . . . . . . . . . 40

3.2.2 The effect of varying the non-local energy coefficient . . . . . . 44

4 Conclusion 49

4.1 Opportunities for further study . . . . . . . . . . . . . . . . . . . . . 49

References 51

i

A Useful Mathematical Results 54

A.1 Gronwall’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

A.2 Sobolev inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

A.3 The Holder inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . 56

A.4 Other useful identities and inequalities . . . . . . . . . . . . . . . . . 57

B Calculation Essentials 59

B.1 Derivation of the dynamic equation . . . . . . . . . . . . . . . . . . . 59

B.2 Approach and results highlights: the weak form . . . . . . . . . . . . 61

B.3 Approach and results highlights: the finite element approximation . . 62

C Supplementary Results and Graphs 63

C.1 Varying mass . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

C.1.1 Computational effort/cost . . . . . . . . . . . . . . . . . . . . 64

C.1.2 Component energy evolution . . . . . . . . . . . . . . . . . . . 65

C.2 Varying the non-local energy coefficient . . . . . . . . . . . . . . . . . 66

C.2.1 The initial condition . . . . . . . . . . . . . . . . . . . . . . . 66

C.2.2 Computational effort/cost . . . . . . . . . . . . . . . . . . . . 66

C.2.3 Total energy evolution . . . . . . . . . . . . . . . . . . . . . . 68

C.2.4 The extended run for σ = 2 . . . . . . . . . . . . . . . . . . . 68

C.2.5 h-independence . . . . . . . . . . . . . . . . . . . . . . . . . . 70

ii

List of Figures

1.1 Diblock copolymers . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1.2 Diblock copolymer phase diagram (adapted from [mit12]) . . . . . . . 4

2.1 The orthogonal projection of the initial condition u0 ∈ H1(Ω) onto Vh 20

3.1 Typical initial conditions (2D) . . . . . . . . . . . . . . . . . . . . . . 42

3.2 Typical ‘metastable’ end-state solutions (2D) for various m . . . . . . 42

3.3 Typical total free energy evolution over time (2D) . . . . . . . . . . . 43

3.4 SCMFT context . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

3.5 End-state solutions (2D) for various σ . . . . . . . . . . . . . . . . . . 47

3.6 Competing free energy components (2D) for various σ. The red circles

depict the component energy levels of the initial condition. . . . . . . 48

B.1 Argument summary – weak form boundedness and stability . . . . . 61

B.2 Argument summary – finite element approximation boundedness and

stability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

C.1 The two finite element grids (2D) that were used . . . . . . . . . . . 63

C.2 Non-linear work (2D) for various m . . . . . . . . . . . . . . . . . . . 64

C.3 Typical component energy evolution over time (2D) . . . . . . . . . . 65

C.4 Initial condition used to test the effect of varying σ . . . . . . . . . . 66

C.5 Non-linear work (2D) for various σ . . . . . . . . . . . . . . . . . . . 67

C.6 Total free energy evolution (2D) for various σ . . . . . . . . . . . . . 68

C.7 Extended run results for σ = 2 (2D) . . . . . . . . . . . . . . . . . . . 69

C.8 Results for σ = 200 (2D) after 500 time steps on two different meshes 71

iii

List of Tables

3.1 Physical and control parameters – varying mass in two dimensions . . 40

3.2 Physical and control parameters – varying σ in two dimensions . . . . 44

iv

Chapter 1

Introduction

A diblock coplymer is a linear molecule comprising two subchains of what we will

term ‘type A’ and ‘type B’ monomers that repel each other, but that are joined co-

valently. This strong bond prevents them from breaking apart. In this project, we

focus on what happens as these materials are rapidly cooled. At the molecular level

according to [Cho03], ‘below a critical temperature, even a weak repulsion between

unlike monomers A and B induces a strong repulsion between the subchains caus-

ing [them] to segregate’. It is this repulsive action that gives these molecules their

remarkable ability to rearrange into ordered, periodic structures. A simple diblock

copolymer and the way in which these molecules rearrange themselves is depicted in

Fig. 1.1. A large collection of such molecules is called a ‘melt’.

Figure 1.1: Diblock copolymers

As will become apparent presently, studies of these materials have led to models

that can accurately predict the morphologies into which they will rearrange, based

on a few simple physical parameters. As such, these materials are often referred

1

to as ‘designer soft materials’ ([BF99]) with applications as diverse as upholstery

and bedding to computer memory. The book by Hamley ([Ham03]) provides an

excellent background to these materials – indeed, given their commercial application,

the literature related to polymers is vast. Other variants such as triblock and branched

copolymers (see [BF99]) exist but we restrict our attention to the diblock case here.

1.1 Statement of the Ohta–Kawasaki functional

The physical tendency of copolymer melts is to rearrange themselves in such a way as

to minimise their free energy by forming structures that minimise contacts between

the unlike monomers ([Cho03]). We model this free energy with the (scaled) Ohta–

Kawasaki functional (originally derived in [OK86]) which we now present. In doing

so, we are introduced to all of the important physical parameters:

E(u) = 12

∫Ω

(ε2 |∇u|2 + 1

2

(1− u2

)2+ σ

∣∣(−∆)−1/2(u−m)∣∣2) dΩ. (1.1)

We start by designating the type A monomer density1 in the melt by a; then the

density of type B monomers is clearly 1− a (if we assume incompressibility), and the

difference between the type A and type B monomer densities is given by 2a− 1. We

label this quantity the ‘mass’, m, of the system so that

a =m+ 1

2(1.2)

where a and m are parameters that describe the melt on average. Note that we

assume that all of the molecules in the melt have exactly the same composition.

The function u = u(x, t) in (1.1) describes the difference between the type A and

type B monomer densities as a function of space and time. It is clear that we may

describe a particular melt as comprising (on average), a certain proportion of type A

or B monomers, but obviously at small scales we will have regions where we will find

‘only type A’ or ‘only type B’ monomers. We therefore identify u(x, t) = 1 with the

(local) presence of type A monomers and u(x, t) = −1 with the presence of type B

monomers. E(u) is then a functional (of u) that describes the total free energy in the

melt.

It is obvious that the average of u over the space occupied by the melt is equivalent

to its ‘mass’ and so intuitively we expect that this mass should be conserved as the

1Equivalently, we can think of this quantity as the length of the type A monomer chain as a

proportion of the total length of each diblock copolymer macromolecule.

2

molecules rearrange themselves. These ideas are formalised later in Equations (2.2)

and (2.18). Note that (1.1) is non-dimensional and has been re-scaled in space so that

we work on the unit cube (which we denote Ω and model as (0, 1)d for d = 1, 2, 3)

whereas the melt physically occupies the volume D.

Additionally, ε represents the (scaled) interfacial thickness at the A and B monomer

intersections ([CR03]), and depends on various physical parameters and characteris-

tics of the melt according to

ε2 =l2

3a(1− a)χ|D|2/3. (1.3)

We have the following interpretations:

• |D| is the volume of the physical domain which the melt occupies;

• l (a dimensional parameter) is the so-called Kuhn statistical length measuring

the average distance between two adjacent monomers or ‘the average monomer

space size’ ([CPW09]); and

• χ is the Flory-Huggins interaction parameter, which measures the incompati-

bility of the type A and B monomers.

Finally, σ (which we will term the ‘non-local energy coefficient’) is specified by

σ =36|D|2/3

a2(1− a)2l2χN2P

. (1.4)

The new parameter here is NP , the index of polymerisation measuring the number of

monomers that occur per macromolecule.2

1.2 Deriving the Ohta–Kawasaki functional

The derivation of (1.1) was first completed by Ohta and Kawasaki ([OK86]) using

mean field theory. The Self-Consistent Mean Field Theory (‘SCMFT’) subsequently

gained prominence as the best theoretical device for predicting the behaviour of di-

block copolymers based on a and the product χNP . Matsen and Schick ([MS94]) then

developed a spectral method whereby predictions of low energy morphologies could

be made in the (a, χNP ) plane, the results of which can be seen in Fig. 1.2. A key lim-

itation of this approach, however, is that the analysis of energy minimisers follows an

2Note: we use NP instead of the usual N because the latter is used later on in our numerical

method to describe the number of time steps over which we run the finite element method.

3

ansatz-based approach, whereas (1.1) is more suited to analysing minimisers without

any such bias. Choksi and Ren showed ([CR03]) in fact that (1.1) can be regarded as

an ‘offspring’ of the SCMFT (subject to various approximations and scalings) and it

is for this reason that we use Fig. 1.2 to guide some of our work. This figure predicts

the stable physical morphology (in three dimensions) resulting from combinations of

a and χNP and exhibits an uncanny agreement with laboratory experiments. The

phases are labeled ‘L’ for lamellar, ‘G’ for gyroid, ‘C’ for hexagonally packed cylin-

ders, ‘S’ for spheres, and ‘CPS’ for close packed spheres. Structures with a < 0.5 are

formed by the Type A monomers (the Type B monomers being in the majority, fill

the free space between the ‘visible’ structures) while those resulting when a > 0.5 are

formed by the type B monomers; the latter structures are denoted with apostrophes

’ in Fig. 1.2 (as in C’, S’ and CPS’).

Figure 1.2: Diblock copolymer phase diagram (adapted from [mit12])

A two-dimensional version of Fig. 1.2 would depict lamellae for small m, spots

(approximate circles) for non-zero m and disorder for small χNP (see [CMW11]).

We proceed now to explore various theoretical aspects of the functional in (1.1)

and its finite element approximation.

4

Chapter 2

Analysis

2.1 Context

We start by defining the mathematical context within which the analysis will be

performed.1 For the time being, we largely ignore the physical significance of the

various components of the problem but focus instead on a mathematical analysis

of the underlying equations. Our starting point is the Ohta–Kawasaki Free Energy

Functional (‘OKFEF’) given above2 in (1.1) as

E(u) = 12

∫Ω

(ε2 |∇u|2 + 1

2

(1− u2

)2+ σ

∣∣(−∆N)−1/2(u−m)∣∣2) dΩ

where

−∫

Ω

u dΩ :=1

|Ω|

∫Ω

u dΩ = m (2.2)

1In this chapter, the notation ‘‖·‖’ will imply the L2(Ω)-norm and similarly, ‘(·, ·)’ with no

subscript will refer to the L2(Ω)-inner product.2An alternate form of the OKFEF is

E(u) = 12

∫Ω

(ε2 |∇u|2 + 1

2

(1− u2

)2+ σ |∇v|2

)dΩ (2.1)

where −∆v = u−m. It is possible to make sense of this definition of the OKFEF, but in order to do

so, conditions need to be specified on v to ensure the existence of a unique solution to the problem

−∆v = u−m. Specifically, if we require

∂v

∂n

∣∣∣∣∂Ω

= 0

then it is possible to specify a function space H2(Ω)∩H1∗ (Ω) (see Equation (2.4)) within which the

Lax-Milgram Theorem assures us of the existence of a unique solution to this linear problem.

5

for the unit d-cube Ω = (0, 1)d, d = 1, 2, 3, and we require

∂u

∂n

∣∣∣∣∂Ω

= 0. (2.3)

We now formally define the operator (−∆N)−1/2. In preparation for this, we consider

the space

H1∗ (Ω) =

u ∈ H1(Ω) :

∫Ω

u dΩ = 0

(2.4)

where, as usual, we use Hk(Ω) to denote the Sobolev space W k,2(Ω) of all locally

summable functions u : Ω → R with the property that Dαu ∈ L2(Ω), in the weak

sense, for |α| ≤ k (see [Eva98], Section 5.2.2).

The subscript ‘N ’ on the operator (−∆N)−1/2 specifies that it ‘carries’ with it a

zero Neumann boundary condition (without which writing down a negative power of

the Laplace operator is meaningless) such that if

(−∆N)−1w = g ⇒ −∆Ng = w with w ∈ H1∗ (Ω), (2.5)

we must have

∂g

∂n

∣∣∣∣∂Ω

= 0.

We specify that the operator (−∆N)−1/2 should act in such a way that the so-called

‘H−1(Ω) inner product’ can be written in one of the following three equivalent forms:

〈w, v〉H−1(Ω) :=

((−∆N)−1w, v)((−∆N)−1/2w, (−∆N)−1/2v

)(w, (−∆N)−1v)

∀ w, v ∈ H1∗ (Ω). (2.6)

In this context, we define H−1(Ω) as the dual of the space H1∗ (Ω); the reason for our

interest in this space is explained in Section 2.2. Specifically, if w = v, we define

〈w,w〉H−1(Ω) := ‖w‖2H−1(Ω) and then we can write

‖w‖2H−1(Ω) =

((−∆N)−1w,w

)=((−∆N)−1/2w, (−∆N)−1/2w

)=∥∥(−∆N)−1/2w

∥∥2

=

∫Ω

∣∣(−∆N)−1/2w∣∣2 dΩ (2.7)

and if u(·, t) ∈ H1(Ω) then u−m ∈ H1∗ (Ω) by virtue of (2.2). A comparison of (2.7)

and (1.1) thus reveals that we can write the OKFEF as

E(u) = 12

∫Ω

(ε2 |∇u|2 + 1

2

(1− u2

)2)

dΩ +σ

2‖u−m‖2

H−1(Ω) . (2.8)

6

Our approach in analysing the OKFEF in the forthcoming sections is summarised in

Fig. B.1 on p. 61. This figure need not be consulted at all, but could prove useful as

a map in what follows.

2.2 The dynamic equation by H−1(Ω) gradient flow

Given any initial state u(x, 0), the system is assumed to evolve in such a way that

u ∈ arg minv∈H1(Ω)E(v) := 12

∫Ω

ε2 |∇v|2 + 12

(1− v2

)2dΩ + 1

2σ ‖v −m‖2

H−1(Ω).

It is clear that we need an expression for ut to model the system evolution, and we

get the weak form of this by setting

〈ut, φ〉H−1(Ω) = −〈DE(u), φ〉 ∀ φ ∈ H1∗ (Ω), (2.9)

where DE(u) is the ‘variational’ or Gateaux derivative of the functional E(u) defined

by

〈DE(u), φ〉 = limα→0

E(u+ αφ)− E(u)

α. (2.10)

We use the H−1(Ω) inner product on the left-hand side of (2.9) because using an

alternative (for instance, the L2(Ω) inner product) nullifies the mass conservation

property we establish in Section 2.3.1 (see [Zha06]). Now,


1

2α

[∫Ω

(ε2 |∇ (u+ αφ)|2 + 1

2

(1− (u+ αφ)2)2

−ε2 |∇u|2 − 12

(1− u2

)2)

dΩ

+12σ ‖u+ αφ−m‖2

H−1(Ω) −12σ ‖u−m‖2

H−1(Ω)

]. (2.11)

At this point, we observe that the term ‖u+ αφ−m‖2H−1(Ω) above is meaningful since

we can write

‖u+ αφ−m‖2H−1(Ω) =

∥∥∥∥ (u−m)︸︷︷︸∈ H1

∗ (Ω)

+ αφ︸︷︷︸∈ H1

∗ (Ω)

∥∥∥∥2

H−1(Ω)

,

i.e. u+αφ−m ∈ H1∗ (Ω) as is required for our definition of the H−1(Ω) inner product

and its induced norm ‖·‖H−1(Ω) to make sense. We now define

N (u) = u(u2 − 1) (2.12)

7

and after much fairly straightforward manipulation and the application of Equation

(2.6) (see Appendix B.1 for full details) we arrive at

〈DE(u), φ〉 = −∫

Ω

φε2∆Nu−N (u)− σ(−∆N)−1

(u−m

)dΩ. (2.13)

If we now substitute (2.13) into (2.9) and rearrange, we have

〈ut, φ〉H−1(Ω) −∫

Ω

φε2∆Nu−N (u)− σ(−∆N)−1

(u−m

)dΩ = 0 ∀ φ ∈ H1

∗ (Ω).

Combining this result with the definition noted in (2.6) we obtain∫Ω

φ

(−∆N)−1ut − ε2∆Nu+N (u) + σ(−∆N)−1(u−m

)dΩ = 0 ∀ φ ∈ H1

∗ (Ω)

which means that

(−∆N)−1ut − ε2∆Nu+N (u) + σ(−∆N)−1(u−m

)= 0.

Finally, if we apply the operator (−∆N) to both sides of the previous equation, we

are led to the Ohta–Kawasaki Dynamic Equation (‘OKDE’)

ut + ∆N

(ε2∆Nu−N (u)

)+ σ(u−m) = 0. (2.14)

2.3 The weak form of the dynamic equation

We start with (2.14) and introduce a new function w = −ε2∆u + N (u) so that we

can write the OKDE as the coupled system

ut −∆w + σ(u−m) = 0

w = −ε2∆u+N (u)

in Ω× (0, T ] (2.15)

with

∂u

∂n

∣∣∣∣∂Ω

= 0 and∂w

∂n

∣∣∣∣∂Ω

= 0 on (0, T ] (2.16)

and

u(x, 0) = u0(x), x ∈ Ω, (2.17)

8

with u0 ∈ H2(Ω). Additionally, we define3

m =1

|Ω|

∫Ω

u0 dΩ

(:= −∫

Ω

u0 dΩ

)(2.18)

and note that

N (u) ≡ Φ′(u) where Φ(u) = 14(1− u2)2. (2.19)

2.3.1 Mass conservation

We start by demonstrating that the system defined above exhibits mass conservation

and in so doing, illustrate the importance and utility of the chosen boundary condition

on u and w. We have∫Ω

(u−m)t dΩ =

∫Ω

ut dΩ (since m is a constant)

=

∫Ω

∆w dΩ− σ∫

Ω

(u−m) dΩ (from (2.15))

=

∫∂Ω

∂w

∂nds︸︷︷︸

= 0

−σ∫

Ω

(u−m) dΩ (Divergence Theorem).

It follows that

d

dt

∫Ω

(u−m) dΩ + σ

∫Ω

(u−m) dΩ = 0

which we recognise as a differential equation for∫

Ω(u−m) dΩ with solution∫

Ω

(u−m)(t) dΩ =

∫Ω

(u0 −m) dΩ

︸︷︷︸

= 0 by (2.18)

e−σt, ∀ t ∈ [0, T ]. (2.20)

The implication of this result is that

m =1

|Ω|

∫Ω

u(x, t) dΩ =1

|Ω|

∫Ω

u0 dΩ ∀ t ∈ [0, T ]; (2.21)

i.e. the model (and specifically its zero Neumann spatial boundary condition) auto-

matically implies the conservation of relative concentration of monomer types over

time. This agrees with physical intuition since the process whereby diblock copoly-

mers rearrange themselves does not result in the creation or destruction of type A

or B monomers. Formally, we see that the coupled system of equations is consistent

with the definition expressed in (2.2), as required.

3We will see presently that the forthcoming definition is entirely consistent with that given in

(2.2) i.e. the system we work with exhibits ‘mass conservation’ as it should.

9

2.3.2 Formulation

We develop the two-part weak form by taking an L2(Ω)-inner product between each

of the two equations in (2.15) and a test function v ∈ H1(Ω). For the first equation

we seek u(·, t) ∈ H1(Ω) such that

(ut, v)−∫

Ω

v∆w dΩ + σ

∫Ω

(u−m)v dΩ = 0 ∀ v ∈ H1(Ω)

or

(ut, v) + (∇w,∇v) + σ (u−m, v) = 0 ∀ v ∈ H1(Ω) and t ∈ (0, T ], (2.22)

after using Theorem 9, p. 58. For the second equation in (2.15), we note the zero

Neumann boundary condition on u, and recalling that N (u) = Φ′(u), obtain

(w, v) = ε2 (∇u,∇v) + (Φ′(u), v) ∀ v ∈ H1(Ω).

So in this case, we seek w(·, t) ∈ H1(Ω) such that

(w, v) = ε2 (∇u,∇v) + (Φ′(u), v) ∀ v ∈ H1(Ω) and t ∈ (0, T ]. (2.23)

2.3.3 Boundedness

We will now show that solutions u(·, t) to Equations (2.22) and (2.23) are bounded

in the L∞(Ω) and H2(Ω) norms for all t ∈ [0, T ]. To start, we take v = w in (2.22)

to see that

(ut, w) + ‖∇w‖2 + σ

∫Ω

(u−m)w dΩ = 0. (2.24)

Then, setting v = ut in (2.23) we have

(w, ut) = ε2 (∇u,∇ (ut)) + (Φ′(u), ut)

⇒ (ut, w) = 12ε2

d

dt‖∇u‖2 +

d

dt(Φ(u), 1) . (2.25)

Subtracting (2.24) from (2.25) and rearranging then gives

12ε2

d

dt‖∇u‖2 +

d

dt(Φ(u), 1) + ‖∇w‖2 + σ

∫Ω

(u−m)w dΩ = 0. (2.26)

We consider the last term in (2.26):∫Ω

(u−m)w dΩ =

(u−−

∫Ω

u dΩ, w −−∫

Ω

w dΩ

)(using (2.20) and (2.21))

≤∥∥∥∥u−−∫

Ω

u dΩ

∥∥∥∥∥∥∥∥w −−∫Ω

w dΩ

∥∥∥∥ (Cauchy–Schwarz ineq.)

≤ c2P ‖∇u‖ ‖∇w‖ (2.27)

10

using Poincare’s inequality (Theorem 8, p. 57). Using (2.27) in (2.26), we have

12ε2

d

dt‖∇u‖2 +

d

dt(Φ(u), 1) + ‖∇w‖2 ≤ σc2

P ‖∇u‖ ‖∇w‖

≤ 12σ2c4

P ‖∇u‖2 + 1

2‖∇w‖2

by Young’s inequality (Theorem 7, p. 57), and so

ε2d

dt‖∇u‖2 + 2

d

dt(Φ(u), 1) + ‖∇w‖2 ≤ σ2c4

P ‖∇u‖2 .

Integrating this in time over [0, t] and rearranging gives

ε2 ‖∇u(t)‖2 + 2 (Φ(u(t)), 1) +

∫ t

0

‖∇w(s)‖2 ds ≤ ε2 ‖∇u0‖2 + 2 (Φ(u0), 1)

+σ2c4

P

ε2

∫ t

0

ε2 ‖∇u(s)‖2 + 2 (Φ(u(s)), 1) +

∫ s

0

‖∇w(r)‖2 dr

ds,

where we have added non-negative terms on the right-hand side in preparation for

Gronwall’s Lemma (Theorem 1, p. 54). Using this result, we see that

ε2 ‖∇u(t)‖2 + 2 (Φ(u(t)), 1) +

∫ t

0

‖∇w(s)‖2 ds

≤ε2 ‖∇u0‖2 + 2 (Φ(u0), 1)

exp

(σ2c4

P

ε2t

)≤ C (ε, σ, u0, cP , T ) ∀ t ∈ [0, T ]. (2.28)

Consequently, ∫ t

0

‖∇w(s)‖2 ds ≤ C (ε, σ, u0, cP , T ) (2.29)

and also

maxt∈[0,T ]

‖∇u(t)‖ ≤ C (ε, σ, u0, cP , T ) . (2.30)

We now take v = ut in (2.22) and arrive at

‖ut‖2 + (∇w,∇ut) + σ

∫Ω

(u−m)ut dΩ = 0. (2.31)

If we differentiate (2.23) with respect to t, we obtain

(wt, v) = ε2 (∇ut,∇v) + (Φ′′(u)ut, v)

11

and so setting v = w in this latter equation yields

12

d

dt‖w‖2 = ε2 (∇w,∇ut) + (Φ′′(u)ut, w) . (2.32)

Adding (2.32) to ε2 times (2.31) then gives

ε2 ‖ut‖2 + 12

d

dt‖w‖2 + 1

2σε2

d

dt

∫Ω

(u−m)2 dΩ = (Φ′′(u)ut, w)

or

12

d

dt

[‖w‖2 + σε2

∫Ω

(u−m)2 dΩ

]+ ε2 ‖ut‖2 =

((3u2 − 1)ut, w

)(2.33)

since Φ′′(u) = 3u2 − 1. Now((3u2 − 1)ut, w

)≤ 3

∥∥u2w∥∥ ‖ut‖+ ‖w‖ ‖ut‖ (Cauchy–Schwarz ineq.)

≤(

9

ε2∥∥u2w

∥∥2+ε2

4‖ut‖2

)+

(1

ε2‖w‖2 +

ε2

4‖ut‖2

)=ε2

2‖ut‖2 +

9

ε2∥∥u2w

∥∥2+

1

ε2‖w‖2 ,

where we used Young’s inequality (Theorem 7, p. 57) twice to obtain the second line.

If we substitute this into the right-hand side of (2.33), rearrange and make use of

Holder’s inequality (Theorem 5, p. 56) with conjugate exponents p = 3/2 and q = 3,

we see that

d

dt

[‖w‖2 + σε2 ‖u−m‖2]+ ε2 ‖ut‖2 ≤ 18

ε2‖u‖4

L6(Ω) ‖w‖2L6(Ω) +

2

ε2‖w‖2

≤ 18

ε2c6S ‖u‖

4H1(Ω) ‖w‖

2H1(Ω) +

2

ε2‖w‖2 ,

where we used Sobolev’s inequality (Theorem 3, p. 55) in d = 1, 2, 3 dimensions to

obtain the last line. Integrating this in time over [0, t] and noting the definition of

the H1(Ω)-norm in the ‖w‖2H1(Ω) term, leads to

‖w(t)‖2 + σε2 ‖u(t)−m‖2 + ε2∫ t

0

‖ut‖2 ds

≤ ‖w0‖2 + σε2 ‖u0 −m‖2 +18c6

S

ε2maxt∈[0,T ]

‖u‖4H1(Ω)

∫ t

0

‖∇w(s)‖2 ds

+

(2

ε2+

18c6S

ε2maxt∈[0,T ]

‖u‖4H1(Ω)

)∫ t

0

‖w(s)‖2 + σε2 ‖u(s)−m‖2 + ε2

∫ s

0

‖ut‖2 dr

ds,

where, as in the lead-up to Equation (2.28), we have again added non-negative terms

on the right-hand side in preparation for Gronwall’s Lemma (Theorem 1, p. 54).

12

Using this result again, we see that, for all t ∈ (0, T ],

‖w(t)‖2 + σε2 ‖u(t)−m‖2 + ε2∫ t

0

‖ut‖2 ds

≤(‖w0‖2 + σε2 ‖u0 −m‖2 +

18c6S

ε2maxt∈[0,T ]

‖u‖4H1(Ω)

∫ t

0

‖∇w(s)‖2 ds

)× exp

(2

ε2+

18c6S

ε2maxt∈[0,T ]

‖u‖4H1(Ω)

)t

. (2.34)

Now

‖u‖2H1(Ω) = ‖u−m+m‖2 + ‖∇u‖2

≤ 2

∥∥∥∥u−−∫Ω

u dΩ

∥∥∥∥2

+ 2

∫Ω

m2dΩ + ‖∇u‖2 (Equation A.6, p. 57)

≤(2c2P + 1

)‖∇u‖2 + 2m2

∫Ω

dΩ (Poincare’s ineq. – Thm. 8, p. 57)

=(2c2P + 1

)‖∇u‖2 + 2m2|Ω|

≤ C (ε, σ, u0, cP , T,m, |Ω|) ,

where we used (2.30) in the last step to eliminate ‖∇u‖2. From this result and (2.29),

we see that all of the terms on the right-hand side of (2.34) are bounded by a constant;

specifically

‖w(t)‖2 + σε2 ‖u(t)−m‖2 + ε2∫ t

0

‖ut‖2 ds

≤ C (w0, ε, σ, u0, cP , cS, T,m, |Ω|) . (2.35)

Since w0 := −ε2∆u0 +N (u0) depends on ε and u0 only, we can suppress w0 in (2.35)

and since all of the terms on the left-hand side of (2.35) are non-negative, we see that

‖w(·, t)‖ ≤ C (ε, σ, u0, cP , cS, T,m, |Ω|) (2.36)

and also

‖u(·, t)−m‖ ≤ C (ε, σ, u0, cP , cS, T,m, |Ω|) . (2.37)

Now as in [S12], we set v = ∆u and then,

(∇u,∇(∆u)) = (∇u,∇v)

= (−∆u, v) (Theorem 9, p. 58)

= −‖∆u‖2 (2.38)

13

since ∂u/∂n|∂Ω = 0. As a result, setting v = ∆u in (2.23) gives

(w,∆u) = −ε2 ‖∆u‖2 + (Φ′(u),∆u) .

Consequently,

ε2 ‖∆u‖2 ≤ (‖w‖+ ‖Φ′(u)‖) ‖∆u‖ ,

which implies that

ε2 ‖∆u‖ ≤ ‖w‖+∥∥u3 − u

∥∥≤ ‖w‖+ ‖u‖+ ‖u‖3

L6(Ω)

≤ ‖w‖+ ‖u‖+ c3S ‖u‖

3H1(Ω)

using Sobolev’s inequality (Theorem 3, p. 55). So we have

‖∆u‖ ≤ 1

ε2

(‖w‖+ ‖u‖+ c3

S ‖u‖3H1(Ω)

)for any t ∈ [0, T ] and thus

maxt∈[0,T ]

‖∆u‖ ≤ 1

ε2

(maxt∈[0,T ]

‖w‖+ maxt∈[0,T ]

‖u−m‖+ ‖m‖+ c3S maxt∈[0,T ]

(‖u‖2 + ‖∇u‖2)3

)≤ 1

ε2

(maxt∈[0,T ]

‖w‖+ maxt∈[0,T ]

‖u−m‖+m |Ω|1/2

+c3S maxt∈[0,T ]

(2 ‖u−m‖2 + 2m2|Ω|+ ‖∇u‖2

)3)

≤ C (ε, σ, u0,m, cS, cP , T, |Ω|) ,

where we use Equation A.6, p. 57 and then apply (2.36) to ‖w‖, (2.37) to ‖u−m‖and (2.30) to ‖∇u‖2.

Assuming that ∂Ω is of class C2 or that Ω is a convex polygonal (for d = 2) or poly-

hedral (for d = 3) domain, we have by Theorem 10, p. 58 that maxt∈[0,T ] ‖u(·, t)‖H2(Ω) ≤C maxt∈[0,T ] ‖∆u‖ for some C > 0, and so

maxt∈[0,T ]

‖u(·, t)‖H2(Ω) ≤ C (ε, σ, u0,m, cS, cP , T, |Ω|) . (2.39)

By Sobolev embedding, which says that the H2(Ω)-norm bounds the infinity-norm

for d = 1, 2, 3 (Theorem 4, p. 55), we therefore have

maxt∈[0,T ]

‖u(·, t)‖L∞(Ω) ≤ C (ε, σ, u0,m, cS, cP , T, |Ω|) . (2.40)

14

2.3.4 Stability

Now that we have established the boundedness of the (weak form of the) PDE, we

move on to show its stability. For convenience we adopt the following simple notation:

we suppose that the weak solution can be expressed as a pair of functions U = (u,w)

that solve (2.22) and (2.23).

Suppose then that the problem has two solutions U1 = (u1, w1) and U2 = (u2, w2).4

For the solution U1 we have

(u1,t, v) + (∇w1,∇v) + σ (u1 −m, v) = 0 ∀ v ∈ H1(Ω)

and

(w1, v) = ε2 (∇u1,∇v) + (Φ′(u1), v) ∀ v ∈ H1(Ω).

Similarly for U2 we can write

(u2,t, v) + (∇w2,∇v) + σ (u2 −m, v) = 0 ∀ v ∈ H1(Ω)

and

(w2, v) = ε2 (∇u2,∇v) + (Φ′(u2), v) ∀ v ∈ H1(Ω).

Subtracting these two problems leads to the following problem in u ≡ u1 − u2 and

w ≡ w1 − w2 for which we define U = (u,w) = (u1 − u2, w1 − w2) = U1 − U2:

find u(·, t) ∈ H1(Ω) s.t. (ut, v) + (∇w,∇v) + σ (u, v) = 0 ∀ v ∈ H1(Ω) (2.41)

and

find w(·, t) ∈ H1(Ω) s.t. (w, v) = ε2 (∇u,∇v) + (Φ′(u1)− Φ′(u2), v) ∀ v ∈ H1(Ω)

(2.42)

for every t ∈ (0, T ]. We notice immediately the non-linearity in the second equation

of the problem statement. To analyse this problem, we set v = u in (2.41) to obtain

ε2 12

d

dt‖u‖2 + ε2 (∇u,∇w) + ε2σ ‖u‖2 = 0 (2.43)

after multiplying through by ε2 and observing that (ut, u) = 12

∫Ω

(u2)t dΩ = 12

ddt‖u‖2.

Then, putting v = w in (2.42) gives

‖w‖2 = ε2 (∇u,∇w) + (Φ′(u1)− Φ′(u2), w) . (2.44)

4In what follows, the notation u1,t will denote the time derivative of the solution u1; the subscript

‘1’ will not imply an x (or first spatial) derivative.

15

Subtracting (2.44) from (2.43) and rearranging leads to

ε2 12

d

dt‖u‖2 + ‖w‖2 + ε2σ ‖u‖2 = (Φ′(u1)− Φ′(u2), w) . (2.45)

Now Φ′(u) = u(u2− 1) is a differentiable function and so by the mean value theorem,

Φ′(u1)− Φ′(u2) = Φ′′ (θu1 + (1− θ)u2)u

for some θ ∈ [0, 1]. Substituting this into (2.45), and noting that Φ′′(u) = 3u2 − 1

gives

ε2 12

d

dt‖u‖2 + ‖w‖2 + ε2σ ‖u‖2 =

∫Ω

uw[3 (θu1 + (1− θ)u2)2 − 1

]dΩ (2.46)

for some θ ∈ [0, 1]. According to (2.40), u1 and u2 are bounded in the infinity norm

as they are two solutions of the problem. Consequently, if we define two constants

K1 = K1(ε, σ, (u1)0,m, cS, cP , T, |Ω|) and K2 = K2(ε, σ, (u2)0,m, cS, cP , T, |Ω|) such

that

‖u1‖L∞(Ω) = K1 <∞ and ‖u2‖L∞(Ω) = K2 <∞,

then from Equation (2.46) we obtain

12ε2

d

dt‖u‖2 + ‖w‖2 + ε2σ ‖u‖2 ≤

∫Ω

|u||w|[3 (θK1 + (1− θ)K2)2 + 1

]dΩ

≤ C ‖u‖ ‖w‖ (Cauchy–Schwarz ineq.)

≤ 12‖w‖2 + 1

2C2 ‖u‖2

by Young’s inequality (Theorem 7, p. 57), and where we defined a new constant

C = 3 (θK1 + (1− θ)K2)2 + 1 = C(ε, σ, (u1)0, (u2)0,m, cS, cP , T, |Ω|).

Moving the 12‖w‖2 term over to the left, we have

12ε2

d

dt‖u‖2 + 1

2‖w‖2 + ε2σ ‖u‖2 ≤ 1

2C2 ‖u‖2 . (2.47)

At this stage, there are two cases to consider:

1) ε2σ ≥ 12C2: in this case, we can rearrange (2.47) to obtain

ε2d

dt‖u‖2 + ‖w‖2 ≤ 0

16

which we integrate in time over [0, t] to see that

ε2 ‖u(t)‖2 +

∫ t

0

‖w(s)‖2 ds ≤ ε2 ‖u0‖2

for any t ∈ [0, T ], where u0 = u(x, 0) ≡ (u1)0 − (u2)0. We conclude that

‖u(·, t)‖ ≤ ‖u0‖ for any t ∈ [0, T ],

i.e. u ∈ L∞ (0, T ;L2(Ω)) where

‖u‖L∞(0,T ;L2(Ω)) := ess sup0≤t≤T ‖u(·, t)‖ ≤ ‖u0‖ <∞. (2.48)

In addition, we have(∫ t

0

‖w(·, s)‖2 ds

)1/2

≤ ε ‖u0‖ for any t ∈ [0, T ],

and so, if we let the upper limit of integration t = T , then w ∈ L2 (0, T ;L2(Ω))

where

‖w‖L2(0,T ;L2(Ω)) :=

(∫ T

0

‖w(·, s)‖2 ds

)1/2

≤ ε ‖u0‖ <∞. (2.49)

If we now define the norm of U = (u,w) ∈ L∞ (0, T ;L2(Ω)) × L2 (0, T ;L2(Ω)) in

such a way that

‖U‖2L∞(0,T ;L2(Ω))×L2(0,T ;L2(Ω)) := ‖u‖2

L∞(0,T ;L2(Ω)) + ‖w‖2L2(0,T ;L2(Ω)) , (2.50)

then adding the squares of (2.48) and (2.49) leads to

‖U‖L∞(0,T ;L2(Ω))×L2(0,T ;L2(Ω)) ≤√

1 + ε2 ‖(u1)0 − (u2)0‖ , (2.51)

which implies that the weak form of the problem is stable if ε2σ ≥ 12C2.

2) ε2σ < 12C2: in this case, we can rearrange (2.47) to obtain

ε2d

dt‖u‖2 + ‖w‖2 ≤

(C2 − 2ε2σ

)‖u‖2 .

We integrate this in time over [0, t] and obtain

ε2 ‖u(t)‖2 +

∫ t

0

‖w(s)‖2 ds ≤ ε2 ‖u0‖2

+C2 − 2ε2σ

ε2

∫ t

0

ε2 ‖u(s)‖2 +

∫ s

0

‖w(r)‖2 dr

ds,

17

where we have added a strategic non-negative term on the right-hand side in the

usual way in preparation for the application of Gronwall’s Lemma (Theorem 1,

p. 54). In turn, this result implies that

ε2 ‖u(t)‖2 +

∫ t

0

‖w(s)‖2 ds ≤ ε2 ‖u0‖2 exp

(C2 − 2ε2σ

ε2t

)= ε2C2 ‖u0‖2 ,

where we defined another new constant

C =

√√√√exp

(C2 − 2ε2σ

ε2t

)= C(ε, σ, (u1)0, (u2)0,m, cS, cP , T, |Ω|).

As in the first case of the stability proof, we conclude that

‖u(·, t)‖ ≤ C ‖u0‖ for any t ∈ [0, T ]

i.e. u ∈ L∞ (0, T ;L2(Ω)) using the definition in (2.48). Furthermore,(∫ t

0

‖w(·, s)‖2 ds

)1/2

≤ εC ‖u0‖ for any t ∈ [0, T ],

and so, once again, w ∈ L2 (0, T ;L2(Ω)) using the definition in (2.49). Using

the definition in (2.50), we see again that U = (u,w) ∈ L∞ (0, T ;L2(Ω)) ×L2 (0, T ;L2(Ω)) and

‖U‖L∞(0,T ;L2(Ω))×L2(0,T ;L2(Ω)) ≤ C√

1 + ε2 ‖(u1)0 − (u2)0‖ , (2.52)

that is, the weak form is stable if ε2σ < 12C2.

We conclude that the weak form of the problem is stable under all conditions.

Next, we shall define a finite element approximation of the problem in (2.22) and

(2.23) with implicit Euler time-stepping. We shall then show that bounds analogous

to those derived for the weak form of the problem also hold for the numerical method.

2.4 The finite element approximation

Our approach in analysing the finite element approximation to the OKDE is sum-

marised in Fig. B.2, p. 62. It is clear that this figure, and the structure of the

underlying arguments, is very similar to that presented in Fig. B.1. As with the

analysis of the weak form, this graphical argument summary can be ignored entirely,

but could prove useful as a guide through the lengthy arguments that follow.

18

2.4.1 Formulation

Let Th be a triangulation of Ω (i.e. the spatial portion of the space-time domain)

consisting of shape-regular triangles and let Vh denote a finite-element subspace of

H1(Ω) defined on Th. Let ∆t = T/N,N ≥ 1. The finite element problem has the

following ingredients:

• A finite element approximation to the problem in physical space where unh rep-

resents the approximation of u(·, n∆t) and wnh represents the approximation of

w(·, n∆t) in Vh; and

• A backward Euler approximation of the time derivative ut.

Such a finite element approximation to the weak form of the problem can be stated

in two parts as respective counterparts to (2.22) and (2.23) as follows: we wish to

find unh, wnh ∈ Vh such that(unh − un−1

h

∆t, vh

)+ (∇wnh ,∇vh) + σ (unh −m, vh) = 0 ∀ vh ∈ Vh (2.53)

and

(wnh , vh) = ε2 (∇unh,∇vh) + (Φ′(unh), vh) ∀ vh ∈ Vh (2.54)

for n = 1, 2, . . . N where Φ′(u) = u(u2 − 1). For the sake of completeness, we remark

that the zero Neumann boundary condition specified earlier for u and w has no explicit

weak (or finite element) formulation per se. It is implicitly present in Equations (2.53)

and (2.54) in that it was used when Theorem 9 (p. 58) was applied to create the weak

forms of the equations in Equations (2.22) and (2.23).

For the initial condition, we consider what we will call the ‘H2(Ω) orthogonal

projection’ of the initial condition u0 ∈ H1(Ω) onto the space Vh, a geometric inter-

pretation of which is depicted in Fig. 2.1.

Conceptually, we require u0 − u0h to be ‘perpendicular’ to the space Vh so that

our approximation u0h of u0 in Vh is as close as possible (in some norm) to u0 itself.

In order to define ‘perpendicular’ functions, we require an inner product and to this

end, it is convenient to define the ‘H2(Ω) inner product’ on H1(Ω) as

〈u, v〉H2(Ω) = (u, v) + (∇u,∇v) + (∆hu,∆hv) for u, v ∈ H1(Ω), (2.55)

where, for u ∈ H1(Ω), we define ∆hu ∈ Vh such that

(−∆hu, vh) = (∇u,∇vh) ∀ vh ∈ Vh.

19

Figure 2.1: The orthogonal projection of the initial condition u0 ∈ H1(Ω) onto Vh

The significance of the ‘h’-subscript stems from the fact that ∆hu depends on the

discretisation being considered. We note that for a given function u ∈ H1(Ω), ∆hu ∈Vh is unique.5

Using this inner product, we define the representation u0h ∈ Vh of the initial con-

dition u0 ∈ H1(Ω) such that the following orthogonality relation holds (see Fig. 2.1):⟨u0 − u0

h, vh⟩H2

(Ω)= 0 ∀ vh ∈ Vh. (2.56)

The implication of this definition is that u0h ∈ Vh is the closest element to u0 ∈ H1(Ω)

as measured by the (induced) H2(Ω)-norm which is given by

‖u‖2H2

(Ω) = ‖u‖2 + ‖∇u‖2 + ‖∆hu‖2 . (2.57)

Equivalently, we can state the condition in Equation (2.56) as⟨u0h, vh

⟩H2

(Ω)= 〈u0, vh〉H2

(Ω) ∀ vh ∈ Vh

or, if u0 ∈ H2(Ω) with ∂u0/∂n = 0|∂Ω, as we shall henceforth suppose,(u0h, vh

)+(∇u0

h,∇vh)

+(∆hu

0h,∆hvh

)= (u0, vh) + (∇u0,∇vh) + (∆Nu0,∆hvh) ∀ vh ∈ Vh. (2.58)

5As in [S12], suppose we have some given u ∈ H1(Ω) and seek wh ∈ Vh such that

(wh, vh) = (∇u,∇vh) ∀ vh ∈ Vh.

We will show that there exists one such wh ∈ Vh and label it ‘−∆hu’. Define the linear functional

l (vh) := (∇u,∇vh) and observe by the Cauchy–Schwarz inequality that

|l (vh)| ≤ ‖∇u‖ ‖∇vh‖ ≤ ‖∇u‖ ‖vh‖H1(Ω) ≤ C(h) ‖∇u‖ ‖vh‖

since all norms on the finite-dimensional space Vh are equivalent. It follows that l is a bounded

linear functional on Vh; consequently by the Riesz Representation Theorem there exists a unique

wh ∈ Vh such that l(vh) = (wh, vh) for every vh ∈ Vh, which we label −∆hu.

20

Notice that in the last inner product on the right-hand side of (2.58) we have retrieved

(∆hu0,∆hvh) = (∆Nu0,∆hvh) because for such a function u0,

(−∆hu0, vh) = (∇u0,∇vh) = (−∆Nu0, vh) ∀ vh ∈ Vh,

by Theorem 9, p. 58 and noting that ∆hu0 ∈ Vh.

2.4.2 Mass conservation

We start by demonstrating that the sequence of finite element approximations exhibits

mass conservation. We take vh ≡ 1 in (2.53) to obtain(unh − un−1

h

∆t, 1

)+ σ (unh −m, 1) = 0.

If we now write unh − un−1h = (unh −m)− (un−1

h −m), we retrieve((unh −m)(1 + σ∆t)− (un−1

h −m), 1)

= 0

which is the same as

(1 + σ∆t) (unh −m, 1) =(un−1h −m, 1

)which, in turn, implies by induction that

(unh −m, 1) = (1 + σ∆t)−1(un−1h −m, 1

)= · · · = (1 + σ∆t)−n

(u0h −m, 1

).

However, if we set vh ≡ 1 ∈ Vh in (2.58) and note that ∆h1 = 0, then we are left with(u0h, 1)

= (u0, 1) ⇒(u0h −m, 1

)= (u0 −m, 1)

if we subtract (m, 1) from each side. We know that (u0 −m, 1) = 0 because this is

just a rearranged form of (2.18), and so

(unh −m, 1) = 0 for n = 0, 1, 2, . . . , N, (2.59)

which expresses conservation of mass in the sequence of finite element approximations.

2.4.3 Boundedness

We now show that the sequence of finite element approximations is bounded, uni-

formly in h. Taking vh = wnh in (2.53) gives(unh − un−1

h

∆t, wnh

)+ ‖∇wnh‖

2 + σ (unh −m,wnh) = 0 (2.60)

21

and then setting vh =(unh − un−1

h

)/∆t in (2.54) yields(

wnh ,unh − un−1

h

∆t

)= ε2

(∇unh,∇

unh − un−1h

∆t

)+

(Φ′(unh),

unh − un−1h

∆t

). (2.61)

Subtracting (2.60) from (2.61) and noting the symmetry of terms in the inner product

gives

ε2(∇unh,

∇unh −∇un−1h

∆t

)+

(Φ′(unh),

unh − un−1h

∆t

)+ ‖∇wnh‖

2 + σ (unh −m,wnh) = 0.

Now we re-write ∇unh in the left-most inner product of this last equation as

∇unh =∆t

2

∇unh −∇un−1h

∆t+∇unh +∇un−1

h

2

so that

ε2

2∆t

(∇(unh − un−1

h

),∇(unh − un−1

h

))+(∇unh +∇un−1

h ,∇unh −∇un−1h

)+

(Φ′(unh),

unh − un−1h

∆t

)+ ‖∇wnh‖

2 + σ (unh −m,wnh) = 0. (2.62)

If we multiply Equation (2.59) by −∫

Ωwnh dΩ ∈ R, we see that(

unh −m,−∫

Ω

wnh dΩ

)= 0 ⇒ (unh −m,wnh) =

(unh −m,wnh −−

∫Ω

wnh dΩ

)(2.63)

and so if we substitute for (unh −m,wnh) in (2.62) using (2.63) and rearrange, we

obtain

ε2

2∆t

(‖∇unh‖

2 −∥∥∇un−1

h

∥∥2)

+ε2

2∆t

∥∥∇(unh − un−1h )

∥∥2+

1

∆t

(Φ′(unh), unh − un−1

h

)+ ‖∇wnh‖

2 + σ


∫Ω

wnh dΩ

)= 0. (2.64)

From the Taylor series expansion of Φ(b) about a we determine that

Φ′(a)(a− b) = Φ(a)− Φ(b) + 12Φ′′(η)(b− a)2

for some η ∈ (a, b). We have Φ(u) = 14(1 − u2)2 so Φ′(u) = u3 − u and finally

Φ′′(u) = 3u2 − 1 ≥ −1. Hence

Φ(a)− Φ(b)− 12(b− a)2 ≤ Φ′(a)(a− b)

so certainly,

(Φ(a), 1)− (Φ(b), 1)− 12‖a− b‖2 ≤ (Φ′(a), a− b) . (2.65)

22

Consequently, if we make the associations a = unh and b = un−1h in (2.65), we obtain

(Φ(unh), 1)−(Φ(un−1

h ), 1)− 1

2

∥∥unh − un−1h

∥∥2 ≤(Φ′(unh), unh − un−1

h

).

If we substitute this for(Φ′(unh), unh − un−1

h

)in (2.64), multiply through by 2∆t, and

rearrange, we obtain

ε2 ‖∇unh‖2 + 2 (Φ(unh), 1) + ε2

∥∥∇(unh − un−1h )

∥∥2+ 2∆t ‖∇wnh‖

2

≤ ε2∥∥∇un−1

h

∥∥2+ 2

(Φ(un−1

h ), 1)

+∥∥unh − un−1

h

∥∥2 − 2σ∆t


∫Ω

wnh dΩ

)≤ ε2

∥∥∇un−1h

∥∥2+ 2

(Φ(un−1

h ), 1)

+∥∥unh − un−1

h

∥∥2

+2σ∆t ‖unh −m‖∥∥∥∥wnh −−∫

Ω

wnh dΩ

∥∥∥∥ (Cauchy–Schwarz ineq.)

≤ ε2∥∥∇un−1

h

∥∥2+ 2

(Φ(un−1

h ), 1)

+∥∥unh − un−1

h

∥∥2+ 2σ∆tc2

P ‖∇unh‖ ‖∇wnh‖

since

‖unh −m‖ =

∥∥∥∥unh −−∫Ω

unh dΩ

∥∥∥∥ ≤ cP ‖∇unh‖ and

∥∥∥∥wnh −−∫Ω

wnh dΩ

∥∥∥∥ ≤ cP ‖∇wnh‖

by Poincare’s inequality (Theorem 8, p. 57). We now apply Young’s inequality (The-

orem 7, p. 57) to the last term on the right-hand side above and subtract ∆t ‖∇wnh‖2

from both sides of the result to conclude that

ε2 ‖∇unh‖2 + 2 (Φ(unh), 1) + ε2

∥∥∇(unh − un−1h )

∥∥2+ ∆t ‖∇wnh‖

2

≤ ε2∥∥∇un−1

h

∥∥2+ 2

(Φ(un−1

h ), 1)

+∥∥unh − un−1

h

∥∥2+ σ2c4

P∆t ‖∇unh‖2 . (2.66)

Setting vh = unh − un−1h in (2.53) and multiplying through by ∆t gives∥∥unh − un−1

h

∥∥2+ ∆t

(∇wnh ,∇

(unh − un−1

h

))+ σ∆t

(unh −m,unh − un−1

h

)= 0 (2.67)

which implies by the Cauchy–Schwarz inequality that∥∥unh − un−1h

∥∥2 ≤ ∆t ‖∇wnh‖∥∥∇ (unh − un−1

h

)∥∥+ σ∆t ‖unh −m‖∥∥unh − un−1

h

∥∥≤ ∆t ‖∇wnh‖

∥∥∇ (unh − un−1h

)∥∥+ σ∆tcP ‖∇unh‖∥∥unh − un−1

h

∥∥≤ ∆t ‖∇wnh‖

∥∥∇ (unh − un−1h

)∥∥+σ2∆t2c2

P

2‖∇unh‖

2 + 12


∥∥2

using the Young and Poincare inequalities (Theorems 7, 8, p. 57). It follows that∥∥unh − un−1h

∥∥2 ≤ ∆t ‖∇wnh‖ 2∥∥∇ (unh − un−1

h

)∥∥+ σ2c2P∆t2 ‖∇unh‖

2

≤ 12∆t ‖∇wnh‖

2 + 2∆t∥∥∇ (unh − un−1

h

)∥∥2+ σ2c2

P∆t2 ‖∇unh‖2 , (2.68)

23

again, by Young’s inequality (Theorem 7, p. 57). Substituting (2.68) into (2.66) for∥∥unh − un−1h

∥∥2gives for each n that

ε2 ‖∇unh‖2 + 2 (Φ(unh), 1) + (ε2 − 2∆t)

∥∥∇(unh − un−1h )

∥∥2+ 1

2∆t ‖∇wnh‖

2

≤ ε2∥∥∇un−1

h

∥∥2+ 2

(Φ(un−1

h ), 1)

+ σ2c2P (c2

P + ∆t)∆t ‖∇unh‖2 .

Summing this through n = 1, 2, . . . , k, (k ≤ N) gives

ε2k∑

n=1

‖∇unh‖2 +2

k∑n=1

(Φ(unh), 1)+(ε2−2∆t)k∑

n=1

∥∥∇(unh − un−1h )

∥∥2+ 1

2∆t

k∑n=1

‖∇wnh‖2

≤ ε2k∑

n=1

∥∥∇un−1h

∥∥2+ 2

k∑n=1

(Φ(un−1

h ), 1)

+ σ2c2P (c2

P + ∆t)∆tk∑

n=1

‖∇unh‖2.

Canceling like terms on each side then leads to

(ε2 − σ2c2

P (c2P + ∆t)∆t

) ∥∥∇ukh∥∥2+ 2

(Φ(ukh), 1

)+ (ε2 − 2∆t)

k∑n=1

∥∥∇(unh − un−1h )

∥∥2

+ 12

k∑n=1

∆t ‖∇wnh‖2 ≤ ε2

∥∥∇u0h

∥∥2+ 2

(Φ(u0

h), 1)

+ σ2c2P (c2

P + ∆t)k−1∑n=1

∆t ‖∇unh‖2.

Note that in the above, we moved the last term in the sum on the end of the right-

hand side (i.e. σ2c2P (c2

P +∆t)∆t∥∥∇ukh∥∥2

) over to the left-hand side. If we now assume

that

∆t ≤ ε2

4⇒ ε2

2≤ ε2 − 2∆t (2.69)

and

ε2

2≤ ε2 − σ2c2

P (c2P + ∆t)∆t ⇒ σ2c2

P (c2P + ∆t)∆t ≤ ε2

2, (2.70)

then we can replace the coefficients of∥∥∇ukh∥∥2

and∑k

n=1

∥∥∇(unh − un−1h )

∥∥2on the

left-hand side with ε2/2 to obtain

ε2

2

∥∥∇ukh∥∥2+ 2

(Φ(ukh), 1

)+ε2

2

k∑n=1

∥∥∇(unh − un−1h )

∥∥2+ 1

2

k∑n=1

∆t ‖∇wnh‖2

≤ ε2∥∥∇u0

h

∥∥2+ 2

(Φ(u0

h), 1)

+2

ε2σ2c2

P (c2P + ∆t)

k−1∑n=1

∆t

(ε2

2‖∇unh‖

2

). (2.71)

24

The discrete Gronwall Lemma (Theorem 2, p. 55) then gives

12ε2∥∥∇ukh∥∥2

+ 2(Φ(ukh), 1

)+ 1

2ε2

k∑n=1

∥∥∇(unh − un−1h )

∥∥2+ 1

2

k∑n=1

∆t ‖∇wnh‖2

≤(ε2∥∥∇u0

h

∥∥2+ 2

(Φ(u0

h), 1))

exp

(2σ2c2

P (c2P + ∆t)

ε2T

)= C (ε, σ, u0, cP , T ) (2.72)

for every k = 1, 2, . . . , N . Because all of the terms on the left-hand side of (2.72)

above are non-negative, we have

k∑n=1

∆t ‖∇wnh‖2 ≤ C (ε, σ, u0, cP , T ) ∀ k = 1, 2, . . . , N (2.73)

and ∥∥∇ukh∥∥2 ≤ C (ε, σ, u0, cP , T ) ∀ k = 1, 2, . . . , N. (2.74)

The inequality (2.72) can be seen as the discrete analogue of (2.28) while (2.73) and

(2.74) correspond respectively to (2.29) and (2.30).

We shall now establish a discrete version of (2.34). We start by writing (2.54) at

time tn and tn−1 and subtracting the two equations that result to get(wnh − wn−1

h

∆t, vh

)= ε2

(∇unh −∇un−1

h

∆t,∇vh

)+

(Φ′(unh)− Φ′(un−1

h )

∆t, vh

),

so setting vh = wnh in this, we have(wnh − wn−1

h

∆t, wnh

)= ε2

(∇unh −∇un−1

h

∆t,∇wnh

)+


h )

∆t, wnh

).

If we now re-write wnh in the left-most inner product of this last equation as

wnh =wnh − wn−1

h

2+wnh + wn−1

h

2

and rearrange, we obtain

ε2(∇unh −∇un−1

h

∆t,∇wnh

)=

1

2∆t

(‖wnh‖

2 −∥∥wn−1

h

∥∥2)

+1

2∆t

∥∥wnh − wn−1h

∥∥2

−(

Φ′(unh)− Φ′(un−1h )

∆t, wnh

). (2.75)

If we multiply (2.67) by ε2/∆t2, we obtain

ε2∥∥∥∥unh − un−1

h

∆t

∥∥∥∥2

+ ε2(∇wnh ,∇

unh − un−1h

∆t

)+ σε2

(unh −m,

unh − un−1h

∆t

)= 0, (2.76)

25

so then if we substitute (2.75) into (2.76) we see that

ε2∥∥∥∥unh − un−1

h

∆t

∥∥∥∥2

+1

2∆t

(‖wnh‖

2 −∥∥wn−1

h

∥∥2)

+1

2∆t


∥∥2

+σε2(unh −m,

unh − un−1h

∆t

)=


h )

∆t, wnh

). (2.77)

If we now re-write unh −m in the last inner product on the left-hand side of (2.77) as

unh −m =unh − un−1

h

2+

(unh −m) + (un−1h −m)

2

and expand, we determine that

ε2∥∥∥∥unh − un−1

h

∆t

∥∥∥∥2

+1

2∆t

(‖wnh‖

2 −∥∥wn−1

h

∥∥2)

+1

2∆t


∥∥2

+σε2

2∆t

(‖unh −m‖

2 −∥∥un−1

h −m∥∥2)

+σε2

2∆t


∥∥2

=


h )

∆t, wnh

). (2.78)

We focus our attention on the last term in (2.78). Since Φ′(u) = u3 − u, we see that(Φ′(u)− Φ′(v)

∆t, w

)=

((u3 − u)− (v3 − v)

∆t, w

)=

(u− v

∆t

[(u2 + uv + v2)− 1

], w

).

Hence, the right-hand side of (2.78) is(Φ′(unh)− Φ′(un−1

h )

∆t, wnh

)≤

∥∥∥∥unh − un−1h

∆t

∥∥∥∥∥∥[(unh)2 + unhun−1h + (un−1

h )2]wnh∥∥

+


∆t

∥∥∥∥ ‖wnh‖ (2.79)

by the triangle and Cauchy–Schwarz inequalities in L2(Ω). Now∥∥[(unh)2 + unhun−1h + (un−1

h )2]wnh∥∥2

≤ 3(∥∥(unh)2wnh

∥∥2+∥∥unhun−1

h wnh∥∥2

+∥∥(un−1

h )2wnh∥∥2)

(Cauchy–Schwarz ineq.)

≤ 3(‖unh‖

4L6(Ω) ‖w

nh‖

2L6(Ω) + ‖unh‖

2L6(Ω)

∥∥un−1h

∥∥2

L6(Ω)‖wnh‖

2L6(Ω)

+∥∥un−1

h

∥∥4

L6(Ω)‖wnh‖

2L6(Ω)

)(see Note (1) below)

≤ 3(c6S ‖unh‖

4H1(Ω) ‖w

nh‖

2H1(Ω) + c6

S ‖unh‖2H1(Ω)

∥∥un−1h

∥∥2

H1(Ω)‖wnh‖

2H1(Ω)

26

+c6S

∥∥un−1h

∥∥4

H1(Ω)‖wnh‖

2H1(Ω)

)(see Note (2) below)

= 3c6S

(‖unh‖

4H1(Ω) + ‖unh‖

2H1(Ω)

∥∥un−1h

∥∥2

H1(Ω)+∥∥un−1

h

∥∥4

H1(Ω)

)‖wnh‖

2H1(Ω)

≤ 9c6S

(max

0≤k≤N

∥∥ukh∥∥H1(Ω)

)4

‖wnh‖2H1(Ω) . (2.80)

Notes:

1) We use the Holder inequalities (Theorems 5, 6, p. 56) as follows: firstly,∥∥(unh)2wnh∥∥2 ≤

(∫Ω

(unh)4p dΩ

)1/p(∫Ω

(wnh)2q dΩ

)1/q

= ‖unh‖4L6(Ω) ‖w

nh‖

2L6(Ω) ,

where we set p = 32

and q = 3. Then∥∥unhun−1h wnh

∥∥2 ≤(∫

Ω

(unh)2p dΩ

)1/p(∫Ω

(un−1h )2q dΩ

)1/q (∫Ω

(wnh)2r dΩ

)1/r

= ‖unh‖2L6(Ω)

∥∥un−1h

∥∥2

L6(Ω)‖wnh‖

2L6(Ω) ,

where we set p = q = r = 3. Finally,∥∥(un−1

h )2wnh∥∥2 ≤

∥∥un−1h

∥∥4

L6(Ω)‖wnh‖

2L6(Ω) using

the same reasoning as for the ‖(unh)2wnh‖2

term above, but replacing unh with un−1h .

2) We use Sobolev’s inequality (Theorem 3, p. 55) with ‖unh‖L6(Ω) ≤ cS ‖unh‖H1(Ω) and

‖wnh‖L6(Ω) ≤ cS ‖wnh‖H1(Ω).

Taking square roots on both sides of (2.80) then gives∥∥[(unh)2 + unhun−1h + (un−1

h )2]wnh∥∥ ≤ 3c3

S

(max

0≤k≤N


)2

‖wnh‖H1(Ω) .

Substituting this back into (2.79) for∥∥[(unh)2 + unhu

n−1h + (un−1

h )2]wnh∥∥ then yields(

Φ′(unh)− Φ′(un−1h )

∆t, wnh

)≤∥∥∥∥unh − un−1

h

∆t

∥∥∥∥‖wnh‖+ 3c3

S

(max

0≤k≤N


)2

‖wnh‖H1(Ω)

. (2.81)

Noting the triangle inequality and Equation (2.74) we have

max0≤k≤N

∥∥ukh∥∥H1(Ω)≤ max

0≤k≤N

∥∥ukh −m∥∥H1(Ω)+ ‖m‖H1(Ω)

= max0≤k≤N

(∥∥ukh −m∥∥2+∥∥∇ukh∥∥2

)1/2

+ ‖m‖H1(Ω)

≤ max0≤k≤N

(c2P

∥∥∇ukh∥∥2+∥∥∇ukh∥∥2

)1/2

+ ‖m‖H1(Ω)

= (1 + c2P )1/2 max

0≤k≤N

∥∥∇ukh∥∥+m |Ω|1/2

= C(ε, σ, u0,m, cP , T, |Ω|),

27

where we applied Poincare’s inequality (Theorem 8, p. 57) to obtain the third line

and (2.74) in the last step. Hence, from (2.81) we deduce that(Φ′(unh)− Φ′(un−1

h )

∆t, wnh

)≤∥∥∥∥unh − un−1

h

∆t

∥∥∥∥‖wnh‖+ C ‖wnh‖H1(Ω)

,

where we define C := C (ε, σ, u0,m, cP , cS, T, |Ω|) as a shorthand in the forthcoming

manipulations. We substitute this into the right-hand side of (2.78) and obtain

ε2∥∥∥∥unh − un−1

h

∆t

∥∥∥∥2

+1

2∆t

(‖wnh‖

2 −∥∥wn−1

h

∥∥2)

+1

2∆t


∥∥2

+σε2

2∆t

(‖unh −m‖

2 −∥∥un−1

h −m∥∥2)

+σε2

2∆t


∥∥2

≤∥∥∥∥unh − un−1

h

∆t

∥∥∥∥‖wnh‖+ C ‖wnh‖H1(Ω)

≤ ε2

2


∆t

∥∥∥∥2

+1

2ε2

(‖wnh‖+ C ‖wnh‖H1(Ω)

)2

(Young’s ineq.; Thm. 7, p. 57)

≤ ε2

2


∆t

∥∥∥∥2

+1 + C2

2ε2

(‖wnh‖

2 + ‖wnh‖2H1(Ω)

)(Cauchy–Schwarz ineq.)

=ε2

2


∆t

∥∥∥∥2

+1 + C2

ε2‖wnh‖

2 +1 + C2

2ε2‖∇wnh‖

2 .

If we rearrange terms, we see that

ε2∥∥∥∥unh − un−1

h

∆t

∥∥∥∥2

+1

∆t

(‖wnh‖

2 −∥∥wn−1

h

∥∥2)

+1

∆t


∥∥2

+σε2

∆t

(‖unh −m‖

2 −∥∥un−1

h −m∥∥2)

+σε2

∆t


∥∥2

≤ 2(1 + C2)

ε2‖wnh‖

2 +1 + C2

ε2‖∇wnh‖

2 .

If we sum this through n = 1, 2, . . . k, multiply through by ∆t, and rearrange we get(1− 2(1 + C2)

ε2∆t

)∥∥wkh∥∥2+ ε2

k∑n=1

∆t


∆t

∥∥∥∥2

+k∑

n=1


∥∥2+ σε2

∥∥ukh −m∥∥2+ σε2

k∑n=1


∥∥2 ≤∥∥w0

h

∥∥2

+ σε2∥∥u0

h −m∥∥2

+ C +2(1 + C2)

ε2

k−1∑n=1

∆t ‖wnh‖2

where we apply (2.73) to∑k

n=1 ∆t ‖∇wnh‖2, and define C = C (ε, σ, u0,m, cP , cS, T, |Ω|)

in the above as the greater of C and the constant C(ε, σ, u0, cP , T ) from (2.73). If we

28

now assume that

1− 2(1 + C2)

ε2∆t ≥ 1

2⇒ ∆t ≤ ε2

4(1 + C2)(2.82)

then we can write

∥∥wkh∥∥2+ 2ε2

k∑n=1

∆t


∆t

∥∥∥∥2

+ 2k∑

n=1


∥∥2

+ 2σε2∥∥ukh −m∥∥2

+ 2σε2k∑

n=1


∥∥2

≤ 2(∥∥w0

h

∥∥2+ σε2

∥∥u0h −m

∥∥2+ C(ε, σ, u0, cP , T )

)+

4(1 + C2)

ε2

k−1∑n=1

∆t ‖wnh‖2.

The discrete Gronwall Lemma (Theorem 2, p. 55), then allows us to deduce that

∥∥wkh∥∥2+ 2ε2

k∑n=1

∆t


∆t

∥∥∥∥2

+ 2k∑

n=1


∥∥2

+2σε2∥∥ukh −m∥∥2

+ 2σε2k∑

n=1


∥∥2

≤ 2(∥∥w0

h

∥∥2+ σε2

∥∥u0h −m

∥∥2+ C(ε, σ, u0, cP , T )

)exp

(4(1 + C2)

ε2T

)= C(ε, σ, u0,m, cP , cS, T, |Ω|). (2.83)

Consequently, ∥∥wkh∥∥ ≤ C(ε, σ, u0,m, cP , cS, T, |Ω|) ∀ k = 1, 2, . . . , N (2.84)

and ∥∥ukh −m∥∥ ≤ C(ε, σ, u0,m, cP , cS, T, |Ω|) ∀ k = 1, 2, . . . , N. (2.85)

Note: here w0h ∈ Vh is defined by(

w0h, vh

)= −ε2

(∆hu

0h, vh

)+(Φ′(u0h

), vh)∀ vh ∈ Vh,

and therefore, if we set vh = w0h in the above we have∥∥w0

h

∥∥2= −ε2

(∆hu

0h, w

0h

)+(Φ′(u0h

), w0

h

)≤ ε2

∥∥∆hu0h

∥∥∥∥w0h

∥∥+∥∥Φ′

(u0h

)∥∥ ∥∥w0h

∥∥ (Cauchy–Schwarz ineq.)

⇒∥∥w0

h

∥∥ ≤ ε2∥∥∆hu

0h

∥∥+∥∥Φ′

(u0h

)∥∥ .29

However, if we put vh = u0h in (2.58) we get(

u0h, u

0h

)+(∇u0

h,∇u0h

)+(∆hu

0h,∆hu

0h

)=(u0, u

0h

)+(∇u0,∇u0

h

)+(∆Nu0,∆hu

0h

)i.e. if we use the Cauchy–Schwarz inequality repeatedly,∥∥u0

h

∥∥2+∥∥∇u0

h

∥∥2+∥∥∆hu

0h

∥∥2 ≤ ‖u0‖∥∥u0

h

∥∥+ ‖∇u0‖∥∥∇u0

h

∥∥+ ‖∆Nu0‖∥∥∆hu

0h

∥∥≤

(‖u0‖2 + ‖∇u0‖2 + ‖∆Nu0‖2)1/2

×(∥∥u0

h

∥∥2+∥∥∇u0

h

∥∥2+∥∥∆hu

0h

∥∥2)1/2

.

So then ∥∥u0h

∥∥2+∥∥∇u0

h

∥∥2+∥∥∆hu

0h

∥∥2 ≤ ‖u0‖2 + ‖∇u0‖2 + ‖∆Nu0‖2 . (2.86)

Since Φ′(unh) = (unh)3 − unh for n = 0, 1, 2, . . . N , we also have that

‖Φ′(unh)‖ ≤∥∥(unh)3

∥∥+ ‖unh‖

= ‖unh‖3L6(Ω) + ‖unh‖

≤ c3S ‖unh‖

3H1(Ω) + ‖unh‖ (2.87)

by Sobolev’s inequality (Theorem 3, p. 55). Specifically, for n = 0,

∥∥u0h

∥∥3

H1(Ω)=(∥∥u0

h

∥∥2

H1(Ω)

)3/2

≤(‖u0‖2 + ‖∇u0‖2 + ‖∆Nu0‖2)3/2

by (2.86). Substituting for ‖u0h‖

3H1(Ω) in (2.87) we have∥∥Φ′(u0

h)∥∥ ≤ c3

S

(‖u0‖2 + ‖∇u0‖2 + ‖∆Nu0‖2)3/2

+(‖u0‖2 + ‖∇u0‖2 + ‖∆Nu0‖2)1/2

≤ C(u0, cS)

if we apply (2.86) to ‖u0h‖ as well. Hence ‖w0

h‖ ≤ C(ε, u0, cS) and so we were able to

absorb w0h in the constant C appearing on the right-hand side of (2.83).6

Next, we take vh = ∆hunh in (2.54) and obtain

(wnh ,∆hunh) = ε2 (∇unh,∇(∆hu

nh)) + (Φ′(unh),∆hu

nh)

= −ε2 (∆unh,∆hunh) + (Φ′(unh),∆hu

nh)

6See Equation (2.35) for the analogous situation in the continuous case.

30

so then

ε2 ‖∆hunh‖

2 = − (wnh ,∆hunh) + (Φ′(unh),∆hu

nh)

≤ ‖wnh‖ ‖∆hunh‖+ ‖Φ′(unh)‖ ‖∆hu

nh‖

by the Cauchy–Schwarz inequality. Consequently, if we divide through by ‖∆hunh‖,

we are left with

ε2 ‖∆hunh‖ ≤ ‖wnh‖+ ‖Φ′(unh)‖

≤ ‖wnh‖+ c3S ‖unh‖

3H1(Ω) + ‖unh‖

by (2.87). Hence, arguing as on p. 14 in the build-up to Equation (2.39), we see that

max0≤n≤N

‖∆hunh‖ ≤

1

ε2

[max

0≤n≤N‖wnh‖+ c3

S max0≤n≤N

(‖∇unh‖

2 + ‖unh‖2)3/2

+ max0≤n≤N

‖unh‖]

≤ 1

ε2

[max

0≤n≤N‖wnh‖+ c3

S max0≤n≤N

(‖∇unh‖

2 + 2 ‖unh −m‖2

+2 ‖m‖2)3/2+ max

0≤n≤N‖unh −m‖+ ‖m‖

].

Noting (2.74), (2.84) and (2.85) (and (2.86) for the case n = 0) leads to:

‖∆hunh‖ ≤ C(ε, σ, u0,m, cP , cS, T, |Ω|) for n = 0, 1, . . . , N. (2.88)

Assuming that the finite element triangulation is quasi-uniform,7 we also have that∥∥∥∥unh −−∫Ω

unh dΩ

∥∥∥∥L∞(Ω)

≤ C ‖unh‖1−θ ‖∆hu

nh‖

θ

where θ = 1/2 if d = 2, and θ = 3/4 if d = 3 (see [BB99] and [BB01]). Thus we

deduce that

‖unh‖L∞(Ω) ≤ C(ε, σ, u0,m, cP , cS, T, |Ω|) for n = 0, 1, . . . , N. (2.89)

The inequality (2.89) will be crucial in establishing stability in the next section, and

in the error analysis of the method, which we schedule as a future opportunity.8

7A quasi-uniform family of triangulations is one in which each triangulation has the property

that the ratio of the longest side of any triangle therein, h, to the radius of the smallest inscribed

circle is bounded by some constant across the family, as h → 0. (See Defn. 4.4.13 of [BS07].) The

uniform triangulations depicted in Fig. C.1, which we use for our two-dimensional simulations, can

clearly be regarded as members of the same quasi-uniform family.8We note here from [S12] that the error analysis of the finite element method will proceed along

similar lines to the stability analysis in Section 2.3.4, and therefore the bound (2.89) will be crucial,

just as (2.40) was in Section 2.3.4.

31

2.4.4 Stability

We are now in a position to establish a stability relation similar to (2.47) for the finite

element approximation. Following similar reasoning to that noted in Section 2.3.4,

we suppose that the finite element problem has two solutions at time n∆t which we

denote(unh,1, w

nh,1

)and

(unh,2, w

nh,2

). We recall that the physical problem is solved by

u(x, t) and its approximation unh, but that the weak and finite element forms of the

problem have two-part solutions (u(x, t), w(x, t))). Now we define

Unh := (unh, w

nh) =

(unh,1, w

nh,1

)−(unh,2, w

nh,2

)and observe that Un

h solves the problem of finding unh, wnh ∈ Vh such that(

unh − un−1h

∆t, vh

)+ (∇wnh ,∇vh) + σ (unh, vh) = 0 ∀ vh ∈ Vh (2.90)

and

(wnh , vh) = ε2 (∇unh,∇vh) +(Φ′(unh,1)− Φ′(unh,2), vh

)∀ vh ∈ Vh. (2.91)

These equations are obtained by writing Equations (2.53) and (2.54) for(unh,1, w

nh,1

)and

(unh,2, w

nh,2

)and subtracting the results. In (2.90), we set

vh = unh =∆t

2

unh − un−1h

∆t+unh + un−1

h

2

to see that

ε2

2∆t


∥∥2+

ε2

2∆t

(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ ε2 (∇wnh ,∇unh) + ε2σ ‖unh‖2 = 0

(2.92)

after multiplying through by ε2. Then, setting vh = wnh in (2.91) gives

‖wnh‖2 = ε2 (∇unh,∇wnh) +

(Φ′(unh,1)− Φ′(unh,2), wnh

). (2.93)

Adding (2.92) and (2.93) then yields

ε2

2∆t


∥∥2+

ε2

2∆t

(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ ‖wnh‖2 + ε2σ ‖unh‖

2

=(Φ′(unh,1)− Φ′(unh,2), wnh

).

Arguing as we did after (2.45) on p. 16, we know that we can write this last result as

ε2

2∆t


∥∥2+

ε2

2∆t

(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ ‖wnh‖2 + ε2σ ‖unh‖

2

=

∫Ω

unhwnh

[3(θunh,1 + (1− θ)unh,2

)2 − 1]

dΩ

32

for some θ ∈ [0, 1]. We know according to (2.89) that∥∥unh,1∥∥L∞(Ω),∥∥unh,2∥∥L∞(Ω)

≤ C(ε, σ, u0,m, cP , cS, T, |Ω|) for n = 0, 1, . . . , N,

since they are both finite element solutions, and so in the same way as we did fol-

lowing (2.46) on p. 16, we define three constants K(h)1 , K

(h)2 and Ch (corresponding

respectively to K1, K2 and C) and argue that

ε2

2∆t


∥∥2+

ε2

2∆t

(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ 12‖wnh‖

2 + ε2σ ‖unh‖2 ≤ 1

2C2h ‖unh‖

2 .

Then, since∥∥unh − un−1

h

∥∥2 ≥ 0, we can drop this term on the left and write

1

2∆tε2(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ 12‖wnh‖

2 + ε2σ ‖unh‖2 ≤ 1

2C2h ‖unh‖

2 . (2.94)

This is the discrete analogue of Equation (2.47). As in Section 2.3.4, we again have

two cases to consider:

1) ε2σ ≥ 12C2h: in this case, (2.94) gives

ε2(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ ∆t ‖wnh‖2 ≤ 0.

If we sum this through n = 1, 2, . . . k, (k ≤ N) and rearrange, we obtain

ε2∥∥ukh∥∥2

+k∑

n=1

∆t ‖wnh‖2 ≤ ε2

∥∥u0h

∥∥2

for any k = 1, 2, . . . , N , where u0h ≡ u0

h,1 − u0h,2. We conclude that∥∥ukh∥∥ ≤ ∥∥u0

h

∥∥ for any k = 1, 2, . . . , N,

i.e. uh ∈ `∞ (0, T ;L2(Ω)) where we define

‖uh‖`∞(0,T ;L2(Ω)) := max0≤k≤N

∥∥ukh∥∥ ≤ ∥∥u0h

∥∥ <∞. (2.95)

In addition, we have(k∑

n=1

∆t ‖wnh‖2

)1/2

≤ ε∥∥u0

h

∥∥ for any k = 1, 2, . . . , N,

and so if we set the upper limit of summation to k = N (as we may), then

wh ∈ `2 (0, N ;L2(Ω)) where we define

‖wh‖`2(0,T ;L2(Ω)) :=

(N∑n=1

∆t ‖wnh‖2

)1/2

≤ ε∥∥u0

h

∥∥ <∞. (2.96)

33

If we now define the norm of Uh = (uh, wh) ∈ `∞ (0, T ;L2(Ω))× `2 (0, T ;L2(Ω)) in

such a way that

‖Uh‖2`∞(0,T ;L2(Ω))×`2(0,T ;L2(Ω)) := ‖uh‖2

`∞(0,T ;L2(Ω)) + ‖wh‖2`2(0,T ;L2(Ω)) (2.97)

then addition of the squares of (2.95) and (2.96) gives

‖Uh‖`∞(0,T ;L2(Ω))×`2(0,T ;L2(Ω)) ≤√

1 + ε2∥∥u0

h,1 − u0h,2

∥∥ (2.98)

which is the discrete version of (2.51) and means that the finite element approxi-

mation to the problem is stable if ε2σ ≥ 12C2h.

2) ε2σ < 12C2h: in this second case, (2.94) gives

ε2(‖unh‖

2 −∥∥un−1

h

∥∥2)

+ ∆t ‖wnh‖2 ≤ ∆t

(C2h − 2ε2σ

)‖unh‖

2 .

Summing the above through n = 1, 2, . . . k, (k ≤ N) in the usual way and rear-

ranging gives

ε2∥∥ukh∥∥2

+k∑

n=1

∆t ‖wnh‖2 ≤ ε2

∥∥u0h

∥∥2+(C2h − 2ε2σ

) k∑n=1

∆t ‖unh‖2.

We move the k-th term in the series on the right-hand side over to the left-hand

side and obtain

(ε2 −∆t

(C2h − 2ε2σ

))∥∥ukh∥∥2+

k∑n=1

∆t ‖wnh‖2

≤ ε2∥∥u0

h

∥∥2+(C2h − 2ε2σ

) k−1∑n=1

∆t ‖unh‖2.

Assuming(ε2 −∆t

(C2h − 2ε2σ

))≥ ε2

2⇒ ∆t ≤ ε2

2(C2h − 2ε2σ

) (2.99)

then implies that

ε2

2

∥∥ukh∥∥2+

k∑n=1

∆t ‖wnh‖2 ≤ ε2

∥∥u0h

∥∥2

+2(C2h − 2ε2σ

)ε2

k−1∑n=1

∆t

ε2

2‖unh‖

2 +n∑

m=1

∆t ‖wmh ‖2

,

34

where we have strategically added a non-negative term on the right-hand side

in preparation for the application of the discrete Gronwall Lemma (Theorem 2,

p. 55). Using this result, we conclude that

ε2

2

∥∥ukh∥∥2+

k∑n=1

∆t ‖wnh‖2 ≤ ε2C2

h

∥∥u0h

∥∥2,

where we define

Ch :=

√√√√√exp

2(C2h − 2ε2σ

)ε2

n∆t

= Ch(ε, σ, u0

h,1, u0h,2,m, cS, cP , T, |Ω|

).

We notice immediately that∥∥ukh∥∥ ≤ √2Ch∥∥u0

h

∥∥ for any k = 1, 2, . . . , N,

i.e. uh ∈ `∞ (0, T ;L2(Ω)). Moreover,(k∑

n=1

∆t ‖wnh‖2

)1/2

≤ εCh∥∥u0

h

∥∥ for any k = 1, 2, . . . , N,

and so once again, wh ∈ `2 (0, T ;L2(Ω)). Taking the sum of squares as in the Case

(1) proof, we have

‖Uh‖`∞(0,T ;L2(Ω))×`2(0,T ;L2(Ω)) ≤ Ch√

2 + ε2∥∥u0

h,1 − u0h,2

∥∥ (2.100)

which is the discrete version of (2.52) and means that the finite element approxi-

mation to the problem is stable if ε2σ < 12C2h.

We conclude that the sequence of finite element approximations of the problem is

bounded and stable under all circumstances if ∆t satisfies the conditions listed in

(2.82) and (2.99). We notice that these conditions are sufficient (but not necessary)

for boundedness and stability, and that they are also independent of the discretization

parameter h, which measures the fineness of the triangulation.

35

Chapter 3

Implementation

We are now in a position to write down the details related to a specific finite element

implementation of the problem. The matrix form derived below is formally the same

for d = 1, 2, 3 but is easier to imagine with a specific dimension and space Vh in mind.

As a result, we tend to develop the implementation model with one spatial dimension

(d = 1) in mind. Full details related to the Matlab implementation can be seen in

[Par12b].

3.1 A specific numerical scheme

We impose a uniform discretisation on the spatial domain involving M equal subdi-

visions in each direction, assumed constant for all tn, 0 ≤ n ≤ N . We then define

the corresponding set of piecewise linear basis ‘hat’ functions φi in the usual way, for

instance in one dimension as

φi(x) =

(1− |x− xi|

h

)+

for 0 ≤ i ≤ (M + 1)d − 1

with d = 1. Thereafter, we specify the (M + 1)d-dimensional space Vh from the

previous chapter in terms of these basis functions as

Vh = span φi0≤i≤(M+1)d−1 .

In order to find solutions unh and wnh to the finite element problem, we need to write

the coupled problem specified in Equations (2.53) and (2.54) in matrix form. To start,

we express unh and wnh as members of Vh, viz.

unh =

(M+1)d−1∑i=0

Uni φi(x), n = 0, 1, . . . N, (3.1)

36

and

wnh =

(M+1)d−1∑i=0

W ni φi(x), n = 0, 1, . . . N. (3.2)

Notice that we can identify the (approximate) solutions unh and wnh at time tn = n∆t

respectively with the vectors Un (with components Uni ) and W n (with components

W ni ) in R(M+1)d−1.

Instead of taking the full H2-projection of some function u0 ∈ H2(Ω) onto Vh (see

Section 2.4.1), we specify an initial condition ‘on the grid’ of the form

u0(x) =

(M+1)d−1∑i=0

Riφi(x)

and then require that the following simplified version of Equation (2.58) holds:(u0h, vh

)= (u0, vh) for all vh ∈ Vh,

that is, (M+1)d−1∑i=0

U0i φi, vh

=

(M+1)d−1∑i=0

R0iφi, vh

for all vh ∈ Vh.

Writing the equation above for each φj in the finite-dimensional basis of Vh and using

linearity leads to a system of (M + 1)d equations for the initial condition as follows:

(M+1)d−1∑i=0

U0i (φi, φj) =

(M+1)d−1∑i=0

R0i (φi, φj) (3.3)

for 0 ≤ j ≤ (M + 1)d − 1. Proceeding in the normal way, we now define a ‘mass

matrix’

M = (mij) where mij = (φj, φi) =

∫Ω

φjφi dΩ (3.4)

and a ‘stiffness matrix’

S = (sij) where sij = (∇φj,∇φi) =

∫Ω

∇φj · ∇φi dΩ. (3.5)

Using these, it is clear that an equivalent matrix problem to that in Equation (3.3) is

MU0 = MR

37

or, if M is invertible (which it is),

U0 = M−1MR = IR = R. (3.6)

We consider Equations (2.53) and (2.54). As with the initial condition, we write these

equations for each φj in the basis of Vh, use linearity and obtain the coupled system

(1 + σ∆t)

(M+1)d−1∑i=0

Uni (φi, φj) + ∆t

(M+1)d−1∑i=0

W ni (∇φi,∇φj)

=

(M+1)d−1∑i=0

Un−1i (φi, φj) + σ∆t (m,φj) (3.7)

and

(M+1)d−1∑i=0

W ni (φi, φj) = ε2

(M+1)d−1∑i=0

Uni (∇φi,∇φj) +

Φ′

(M+1)d−1∑i=0

Uni φi

, φj

(3.8)

for 0 ≤ j ≤ (M + 1)d − 1. We see that we have a significant non-linearity in the last

term on the right-hand side of (3.8). To address this, we define an iterative scheme

for each time step as follows: we know that Φ′(u) = u(u2− 1) and so we approximateΦ′

(M+1)d−1∑i=0

Uni φi

, φj

≈

((M+1)d−1∑

i=0

Un,ki φi︸︷︷︸

‘u’

(M+1)d−1∑l=0

Un,k−1l φl

2

− 1

︸︷︷︸

‘u2 − 1’

, φj

)

=

(M+1)d−1∑i=0

Un,ki

∫Ω

(M+1)d−1∑l=0

Un,k−1l φl

2

φiφj dΩ

−

(M+1)d−1∑i=0

Un,ki

(∫Ω

φiφj dΩ

)using linearity, for each j, 0 ≤ j ≤ (M + 1)d − 1. Now we define the matrix

L(n,k) =(l(n,k)ij

)where l

(n,k)ij =

∫Ω

(M+1)d−1∑l=0

Un,k−1l φl

2

φjφi dΩ (3.9)

and write the coupled system in (3.7) and (3.8) as an iterative scheme using (3.4),

(3.5) and (3.9) as

(1 + σ∆t)MU (n,k) + ∆tSW (n,k) = F (n) (3.10)

38

with (−ε2S + M − L(n,k)

)U (n,k) + MW (n,k) = 0 (3.11)

where the vector F (n) =(F

(n)0 , F

(n)1 , . . . , F

(n)

(M+1)d−1

)Tis defined component-wise via

F(n)j =

(MUn−1

)j

+ σ∆t (m,φj) for 0 ≤ j ≤ (M + 1)d − 1. (3.12)

We summarise the scheme as((1 + σ∆t)M ∆tS

−ε2S + M − L(n,k) M

)(U (n,k)

W (n,k)

)=

(F (n)

0

)(3.13)

where we define

Un := limk→∞

U (n,k) and W n := limk→∞

W (n,k) (3.14)

and specify the starting condition at each time level as the ending condition from the

previous one, that is to say, we set U (n,0) := Un−1.

We summarise the high level algorithm for arbitrary spatial dimensions as follows:

1) We solve (3.6) to find U0, the projection of the initial condition onto Vh.1

2) We then define U (1,0) = U0 and use this in Equation (3.9) to build the matrix

L(1,1) and in Equation (3.12) to build the vector F (1).

3) We solve the matrix problem (3.13) to find U (1,1) and W (1,1) from((1 + σ∆t)M ∆tS

−ε2S + M − L(1,1) M

)(U (1,1)

W (1,1)

)=

(F (1)

0

).

4) U (1,1) is then used in Equation (3.9) to build the matrix L(1,2). The matrix problem

(3.13) is then solved to find U (1,2) and W (1,2). The process is repeated until sub-

sequent vector pairs(U (1,k), U (1,k−1)

)are considered sufficiently close2 as to have

‘converged’. The result is defined as the solution (U1,W 1) at the first time-step.

1In practice, we wish to specify a precise initial mass, but observe that a set of appropriately

distributed random numbers typically has a mass that is only close to what we want. We achieve fine

control by managing the value of the initial condition on one node in two dimensions (see Section

1.5 of [Par12b] for details). Hence, we speak of ‘all-but pseudo-random’ initial conditions.2In the code, we use a stopping condition of the following form to terminate the process:∥∥∥U (1,k) − U (1,k−1)

∥∥∥∞< TOL.

39

5) We repeat the process above at subsequent time-steps, increasing n by one at each

step, until we have n = N .

3.2 Results summary: two dimensional space

Our aim in this section is to use our Matlab implementation to examine the effect

on the free-energy evolution and end-state morphology, of varying the initial system

mass, m, and the non-local energy coefficient, σ, in two spatial dimensions.3

3.2.1 The effect of varying mass

Our exploration of the effects of varying the system mass is based on the parameters

depicted in Table 3.1.

Type Parameter Values

Physical Mass (m) 0.0 0.4

ε 0.08

σ 10

Control ∆t (:= ε2) 0.0064

Time Steps (N) 2 000

Total Simulated Time (T ) 12.8 secs

h (in x and y) 1/20

Number of Elements 20× 20× 2 = 800

Number of Nodes 21× 21

Max Non-Linear Iterations 500

Non-Linear Tolerance (TOL) 10−9

Table 3.1: Physical and control parameters – varying mass in two dimensions

Some remarks in respect of these parameter values are in order:

• We consider two cases: the symmetric case m = 0 (in which the polymers have

an equal number of type A and B monomers) and the non-symmetric cases with

3A more fundamental set of Matlab results and analysis including those relating to the one-

dimensional case can be found in [Par12a].

40

m = 0.4 which corresponds to a situation where we have 70% type A and 30%

type B monomers on each copolymer molecule (see Equation (1.2)).

• The remaining physical parameters are not chosen to be particularly physically

meaningful and are based, in part, on parameters used in [Zha06] and our

experiences reported in [Par12a]. Note that we do not take Equations (1.2),

(1.3) and (1.4), into account in compiling the list of parameters used in this

section (or the next one). Our interest here is in the end-values of m, ε and σ

and not how they, in turn, depend on more fundamental quantities.

• Much experimentation went into selecting the values of N noted in Tables 3.1

and 3.2. We note here that setting N = 2 000 seems to offer a fair balance

between program runtime and the requirement that N be large enough for the

simulated system to achieve what we tentatively term ‘metastability’.4

• As noted in [Par12a], we find that setting ∆t = ε2 does not appear to impact

stability or boundedness but it does allow us to achieve longer simulated time

horizons than would have been the case had we meticulously applied the suf-

ficient boundedness and stability conditions specified in inequalities (2.82) and

(2.99).

• The spatial discretisation used for these experiments is relatively coarse (see

Fig. C.1(a), p. 63) but is based on experimentation with reasonable performance

and system runtime. Additionally, in comparing the effect of varying mass, we

do not anticipate much variation in the fine structure of our end-state solutions.

We turn our attention now to a brief discussion of the results that were observed.5

Given that we are varying the system mass, we anticipate significantly different end-

state solutions for the different mass scenarios and so we focus on these.

The two distinct, random initial conditions (we need to start with two different

system masses) are depicted in Fig. 3.1. We can ‘eyeball’ these distinct masses if

we look at the ranges of values expressed in the horizontal colour bars beneath each

graph; Fig. 3.1(b) clearly represents a non-physical initial condition in which |u0h| > 1

at several points yet the method is robust enough for a physically reasonable end-state

(∣∣uNh ∣∣ < 1 everywhere) to emerge at the end.

4We use this term as we have no way of knowing whether lower energies may be achieved at much

later times.5Animations are available on-line as follows: the m = 0.0 simulation can be viewed at

http://youtu.be/bRSco2N018k, and m = 0.4 at http://youtu.be/p95Q8C1o9sU.

41

http://youtu.be/bRSco2N018k

http://youtu.be/p95Q8C1o9sU

x

y

ε=0.08, σ=10 and m=0 at t=0

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5

(a) m = 0

x

y

ε=0.08, σ=10 and m=0.4 at t=0

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5 1

(b) m = 0.4

Figure 3.1: Typical initial conditions (2D)

The evolution pattern observed in most of the simulations is the same: from

a highly variable initial condition, the system quickly moves to a state in which

unh ≈ m after which periodic structures slowly emerge. Our end-state ‘metastable’

solutions are as expected: the symmetric case (m = 0) in Fig. 3.2(a) settles into

a pattern of approximately straight lines of alternating patches of uNh = ±1, while

the non-symmetric case in Fig. 3.2(b) settles into a pattern of repeating circles of

uNh = −1. An informal discussion of the likely stability characteristics of these end-

state solutions is deferred to Appendix C.1.1. The zero Neumann boundary condition

on uNh is clearly satisfied as the contour lines are perpendicular to ∂Ω.

x

y

ε=0.08, σ=10 and m=0 at t=12.8

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5

(a) m = 0

x

y

ε=0.08, σ=10 and m=0.4 at t=12.8

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5

(b) m=0.4

Figure 3.2: Typical ‘metastable’ end-state solutions (2D) for various m

Finally, we notice from Fig. 3.3 that total free energy is a non-increasing function

42

of time irrespective of the system mass, as expected. The end-state morphologies

(lamellar, circular) that result do so because of the interaction and competition be-

tween the three components of the free energy.

0 2 4 6 8 10 120

0.05

0.1

0.15

0.2

0.25

time

Fre

e E

nerg

y

Total Free Energy Evolution over Time

(a) m = 0

0 2 4 6 8 10 120

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y


(b) m = 0.4

Figure 3.3: Typical total free energy evolution over time (2D)

In two dimensions, we put Ω := (0, 1)2 and write the free energy functional6 as

E(unh) =

∫(0,1)2

ε2

2|∇unh|

2︸︷︷︸Part 1

+ 14

(1− (unh)2

)2︸︷︷︸Part 2

dxdy +σ

2‖unh −m‖

2H−1(0,1)2︸︷︷︸

Part 3

.

The ‘Part 1’ contribution to the free energy7 is minimised when |∇unh| = 0 i.e. unhis constant and/or the regions on which |∇unh| 6= 0 are as small as possible. It is

clear from Fig. 3.2 that the system has evolved to a state where∣∣∇uNh ∣∣ ≈ 0 whenever

uNh ≈ ±1 over large portions of the space that are coloured black/white. The internal

layers, on which∣∣∇uNh ∣∣ is large, are also visibly narrow as these penalise (i.e. increase)

the contribution of this part of the energy.

The ‘Part 2’ contribution to the free energy is minimised when the double-well

potential function (1− (unh)2)2

is minimised i.e. when unh ≈ ±1. It is almost as if the

‘Part 1’ contribution wants unh to be constant and the ‘Part 2’ contribution specifies

the value of this constant to be ±1. This is clear from Fig. 3.2.

The minimisation of the ‘Part 3’ contribution to the free energy is expressed in

the patterns that emerge. This part of the energy favours a solution in which unh ≈ m

as often as possible, but since energy is the sum of this and two other components

(that together favour unh = ±1 wherever possible), the effect of this nonlocal8 term

6Appendix C.1.2 includes a graphical view of the evolution of the three free energy components

for these experiments.7What we have labeled ‘Part 1’, ‘Part 2’ and ‘Part 3’ are called the ‘Interfacial’, ‘Double-Well’

and ‘Nonlocal Interaction’ energies, respectively, in [Zha06].8. . . so-called because it involves m which is a global parameter, and an integral over (0, 1)2.

43

is to introduce oscillations into the end-state solutions. Numerical experiments and

ansatz-driven analysis (see [CMW11]) predict that in two dimensions, when m = 0

we can expect linear structures to emerge that have low energy (as in Fig. 3.2(a))

and when m 6= 0, we can expect curved (circular) structures (as in Fig. 3.2(b)).

3.2.2 The effect of varying the non-local energy coefficient

In the previous section, we examined the effect of the three components of the free

energy on pattern formation and observed that whereas the ‘Part 1’ and ‘Part 2’

components are minimised when unh = ±1, the ‘Part 3’ component is minimised when

unh = m. The resulting competition between these components is what results in the

regular patterns that emerge. In this section, we explore the effects of varying the

non-local (‘Part 3’) energy coefficient, σ, based on the parameters presented in Table

3.2. For these four experiments, we set m = 0.4, hold ε constant at 0.02 and make

use of the same random initial condition (see Fig. C.4, p. 66).

Type Parameter Values

Physical Mass (m) 0.4

ε 0.02

σ 2 20 200 800

Control ∆t (:= ε2) 0.0004

Time Steps (N) 3 000 2 000 146

Total Simulated Time (T ) 1.2 secs 0.8 secs 0.06 secs

h (in x and y) 1/30

Number of Elements 30× 30× 2 = 1 800

Number of Nodes 31× 31

Max Non-Linear Iterations 500

Non-Linear Tolerance (TOL) 10−9

Table 3.2: Physical and control parameters – varying σ in two dimensions

We are mostly interested here in the effect on solution periodicity of the param-

eter σ and anticipate the effect depicted in Fig. 3.4 i.e. smaller structures (increased

periodicity) as σ increases. We expect the underlying cause of this effect to manifest

itself in the way the components of the free energy interact and on the size of the

44

end-state structures9 that result. We also make use of a somewhat refined (uniform,

30 × 30) mesh for these experiments in hopeful anticipation of some distinctive fine

structure in the results (see Fig. C.1(b), p. 63).

Figure 3.4: SCMFT context

As expected, we observe in Fig. 3.510

that smaller, ‘more periodic’ structures

generally result from larger values of σ.

The reason we associate increasing σ

(for constant ε) with a move ‘down’ the

SCMFT phase diagram in Fig. 3.4 has

to do with Equations (1.3) and (1.4)

where it is clear that increasing σ is

associated with decreasing NP (smaller

molecules) which in turn decreases the

product χNP . In turn, the way this

change in σ causes these effects becomes clear when we examine how the three com-

ponents of the free energy11 relate over time for various σ in Fig. 3.6:

(a) For σ = 2, Fig. 3.5(a) reveals that the system evolves into a state with a small

number of relatively large structures per unit ‘cube’.12 Fig. 3.6(a) reveals that

with this small value of σ, the ‘Part 3’ energy is dominated by the other two parts

so that its impact on structure formation is minimal.

(b) As σ is increased to σ = 20 (i.e. by a factor of ten), Fig. 3.5(b) shows an end-

9An informal discussion of the likely stability characteristics of these end-state solutions is deferred

to Appendix C.2.2.10Animations for the four experiments are available on-line at http://youtu.be/VIH9cNrG8JQ,

http://youtu.be/qd4CXzlI9Bk, http://youtu.be/z2BlRbPcpjk, and http://youtu.be/2z0K8-9IKxI

respectively.11Appendix C.2.3 includes a graphical view of the evolution of the total free energy for these

experiments.12Empirical evidence suggests that the smaller the value of σ, the longer it takes for the system

to become stable and this experiment was no exception. The structures in Fig. 3.5(a) are clearly

non-uniform, and the total energy evolution graph in Fig. C.6(a), p. 68 shows a marked decline in

energy relatively late in time. This explains why the value chosen for N in Table 3.2 was higher for

this experiment than for the others.

To explore this further, the end-state in Fig. 3.5(a) was subsequently used as the initial condition

for a supplementary simulation of another N = 2 000 steps. This extended the total system runtime

for this experiment from ∼40 hours to ∼60 hours. The results of this extended run are visually

virtually indistinguishable from those in Fig. 3.5(a). Nevertheless, they are included in Appendix

C.2.4 on p. 68.

45

http://youtu.be/VIH9cNrG8JQ

http://youtu.be/qd4CXzlI9Bk

http://youtu.be/z2BlRbPcpjk

http://youtu.be/2z0K8-9IKxI

state solution that includes approximately twice as many structures as we saw for

σ = 2. This is consistent with the prediction in [Zha06] that to double periodicity,

we should increase σ by a factor of approximately eight. If we look at Fig. 3.6(b),

we find that the corresponding ‘Part 3’ energy is proportionately much larger (by

a factor of approximately ten) which we expect for the larger value of σ. In this

case, the effect is richer structure to keep this ‘Part 3’ contribution small. As

a result, the ‘Part 2’ energy approximately doubles as the number of structures

and the volume occupied by internal layers increases.

(c) As we increase σ by another factor of ten to σ = 200 we see the number of

structures in Fig. 3.5(c) approximately doubles again – as expected. In Fig. 3.6(c),

we observe little change in the actual ‘Part 3’ energy compared to Fig. 3.6(b) as

the finer structure keeps its contribution more or less steady. However, we do

see that the ‘Part 2’ energy has approximately doubled again, as the number of

structures and the volume occupied by internal layers increases further.13

(d) The disordered state14 depicted in Fig. 3.5(d) is especially interesting (as is its

animation) as σ is so large in this case that even small deviations from m lead

to the ‘Part 3’ energy dominating the other two components. Consequently, the

system is pushed to a state with uNh ≈ m everywhere.15 As a result, Fig. 3.6(d)

shows that the ‘Part 3’ energy is essentially zero precisely because the solution

tends to ‘disorder’. This result is consistent with Theorem 3.1 of [CPW09] which

predicts that “u ≡ m is the unique global minimiser. . . if 1 −m2 ≤ 2ε√σ.” We

also see that the ‘Part 1’ energy (favouring constant solutions) tends to zero,

while the ‘Part 2’ energy (favouring unh ≈ ±1) increases even further.

We conclude with an observation in respect of the ‘Part 1’ energy: in all cases in

Fig. 3.6, this initially drops dramatically and then increases. This is consistent with

our earlier observation of typical system evolution: the system initially moves to a

13We use this experiment to start examining h-independence in Appendix C.2.5, p. 70. The idea

is to run 500 steps of this experiment on a 100× 100 mesh.14The term ‘disorder’ has physical significance in the literature. It describes the way in which the

copolymer molecules do not exhibit any regular structure at large times.15Fig. 3.5(d) essentially depicts machine noise about an average of u = 0.4 – the range of values

depicted (i.e. the difference between the largest and smallest components of uNh ) is 4.7×10−15. Any

colour variation in the figure is therefore somewhat meaningless and in some sense, the depicted

solution can be regarded as anything but numerically disordered. In addition, this end-state emerged

extremely quickly: after 146 time-steps, the step-by-step solution change was less than machine

epsilon which is why we selected N = 146 for this experiment in Table 3.2.

46

http://youtu.be/2z0K8-9IKxI

state with unh close to m, after which structures emerge. We therefore expect |∇unh|and the ‘Part 1’ energy to be small briefly, while unh ≈ m.

x

y

ε=0.02, σ=2 and m=0.4 at t=1.2

0 0.5 10

0.2

0.4

0.6

0.8

1

−1 −0.5 0 0.5(a) σ = 2

x

y

ε=0.02, σ=20 and m=0.4 at t=0.8

0 0.5 10

0.2

0.4

0.6

0.8

1

−1 −0.5 0 0.5(b) σ = 20

x

y

ε=0.02, σ=200 and m=0.4 at t=0.8

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5(c) σ = 200

x

y

ε=0.02, σ=800 and m=0.4 at t=0.0584

0 0.5 10

0.2

0.4

0.6

0.8

1

0.4 0.4 0.4 0.4 0.4(d) σ = 800 (‘disorder’)

Figure 3.5: End-state solutions (2D) for various σ

47

0 0.5 10

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 0.5 10

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y

Part 2

0 0.5 10

2

4

6

8x 10

−3

time

Fre

e E

nerg

y

Part 3

(a) σ = 2

0 0.50

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 0.50

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

yPart 2

0 0.50

0.005

0.01

0.015

timeF

ree

Ene

rgy

Part 3

(b) σ = 20

0 0.50

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 0.50

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y

Part 2

0 0.50

0.01

0.02

time

Fre

e E

nerg

y

Part 3

(c) σ = 200

0 0.050

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 0.050

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y

Part 2

0 0.050

0.02

0.04

time

Fre

e E

nerg

y

Part 3

(d) σ = 800 (‘disorder’)

Figure 3.6: Competing free energy components (2D) for various σ. The red circles

depict the component energy levels of the initial condition.

48

Chapter 4

Conclusion

The project made a useful start to the analysis of the OKDE and its finite element ap-

proximation. Theoretical boundedness and stability results were established in both

the analytic and approximate cases and the finite element method was implemented

in one and two spatial dimensions in Matlab. Numerical results agreed with phys-

ical expectation and results obtained by other researchers using alternate (spectral)

methods (see for instance [CMW11] and [Zha06]). They are also consistent with the

theoretical boundedness and stability results we established.

4.1 Opportunities for further study

Several pieces of theoretical analysis could be explored including the existence and

uniqueness of solutions of the OKDE and its finite element approximation. A related

analysis of local and global functional minimisers would be useful and might consider,

inter alia questions such as: can we only expect local, metastable energy minimisers

to emerge or does the OKDE lead to a (unique?) global minimiser? If stable, global

minimisers do exist, are they always numerically accessible? Can we guarantee that

(estimated) free energy is a non-increasing function of time in the finite element

approximation? Many open questions exist in this area; the paper by Choksi et al

([CPW09]) gives an overview of some of the current thinking on these issues.

An analysis of the error and convergence characteristics of the finite element

method would also be useful and should be possible within the framework established

by the stability argument presented in this manuscript. In addition, the impact of any

variational crimes should be understood. The most obvious targets for this analysis

include the integration approximations and the non-linear fixed point iteration that

49

we implemented in Matlab.

The biggest shortcoming of our implementation was that it took several hours to

simulate relatively short time horizons.1 A number of technical options were iden-

tified in this regard, including the use of an alternate spatial discretisation and/or

p-adaptivity. Such measures could allow a more accurate solution to be represented

by a smaller set of data.2 Additionally, the fixed point iteration we used to deal with

the non-linear portion of the problem was computationally expensive. An alterna-

tive involving a variant of Newton’s method might be possible which would likely

substantially reduce overall computational cost. Several opportunities are also noted

in respect of upgrading the underlying linear system solvers (conjugate gradients,

GMRES, multigrid etc.) as well as the integration modules. Bespoke three-point

Gaussian quadrature routines seemed a pragmatic choice when the implementation

was designed, but it is possible that alternatives could reduce runtime and improve

accuracy. Moreover, an implementation in a compiled language such as C/C++ or

FORTRAN could speed up processing (in preparation for three spatial dimensions).

Fundamental implementation and modelling enhancements are also possible. The

most obvious, pressing and interesting enhancement is a three-dimensional spatial

implementation. Such a code would allow for an exploration of the full set of ex-

perimentally observed, physically relevant and complex morphologies illustrated in

Fig. 1.2 on p. 4.

Finally, we observe that all of the analysis performed thus far relates to diblock

copolymers in the pure melt. There is also significant industrial interest in block

copolymer solutions (dilute and concentrated) and it would be interesting to study

these. The key complexity anticipated in this regard entails coupling the system

studied in this document with the Navier Stokes equations for a viscous solvent in

some way. To an extent, this would represent a fundamentally new model which would

require its own analysis in respect of solution existence, uniqueness, boundedness and

stability. This work will be the author’s primary focus for the next few years.

1For ε = 0.02 and σ = 2 in two dimensions for instance, it took ∼40 hours to simulate a mere

1.2 seconds (N = 3 000), whereas we would have preferred to simulate 200+ seconds! This example

is quite extreme but it illustrates the general point of the performance intensity of the calculations

we performed.2h-Adaptivity would add significant analytical complexity.

50

References

[Ada75] R.A. Adams. Sobolev Spaces. Pure and applied mathematics. Academic

Press, 1975.

[AW09] K. Atkinson and H. Weimin. Theoretical Numerical Analysis, A Functional

Analysis Framework. Texts in Applied Mathematics. Springer, third edi-

tion, 2009.

[BB99] J.W. Barrett and J.F. Blowey. An improved error bound for a finite element

approximation of a model for phase separation of a multi-component alloy.

IMA Journal of Numerical Analysis, 19(1):147–168, 1999.

[BB01] J.W. Barrett and J.F. Blowey. An improved error bound for a finite element

approximation of a model for phase separation of a multi-component alloy

with a concentration dependent mobility matrix. Numerische Mathematik,

88:255–297, 2001.

[BF99] F.S. Bates and G.H. Fredrickson. Block Copolymers - Designer Soft Ma-

terials. Phys. Today, 52(2), 1999.

[BS07] S.C. Brenner and R. Scott. The Mathematical Theory of Finite Element

Methods. Texts in Applied Mathematics. Springer, 2007.

[Cho03] R. Choksi. Mathematical Aspects of Microphase Separation of Diblock

Copolymers. In Surikaisekikenkyusko Kokyuroku, volume 1330 of Confer-

ence Proceedings, pages 10–17. RIMS, 2003.

[CMW11] R. Choksi, M. Maras, and J. F. Williams. 2D Phase Diagram for Minimiz-

ers of a Cahn–Hilliard Functional with Long-Range Interactions. ArXiv

e-prints, March 2011. arXiv:1103.2964.

[Coh80] D.L. Cohn. Measure Theory. Birkhauser, first edition, 1980.

[CPW09] R. Choksi, M.A. Peletier, and J.F. Williams. On the Phase Diagram for

Microphase Separation of Diblock Copolymers: An Approach via a Non-

local Cahn-Hilliard Functional. SIAM Journal on Applied Mathematics,

69(6):1712–1738, 2009.

[CR03] R. Choksi and X. Ren. On the Derivation of a Density Functional The-

ory for Microphase Separation of Diblock Copolymers. J. Statist. Phys.,

113:151–176, 2003.

51

http://arxiv.org/abs/1103.2964

[Eva98] L.C. Evans. Partial Differential Equations. Graduate Studies in Mathe-

matics. American Mathematical Society, 1998.

[Gri85] P. Grisvard. Elliptic problems in nonsmooth domains. Monographs and

studies in mathematics. Pitman Advanced Pub. Program, 1985.

[GT01] D. Gilbarg and N.S. Trudinger. Elliptic Partial Differential Equations of

Second Order. Classics in Mathematics. Springer, 2001.

[Ham03] I.W. Hamley. The Physics of Block Copolymers. Oxford, reprinted edition,

2003.

[mit12] mitopencourseware. Diblock Copolymer Morphology Diagram. 2012.

This work is licensed under the Creative Commons Attribution-

NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this

license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/ or send

a letter to Creative Commons, 444 Castro Street, Suite 900, Mountain

View, California, 94041, USA. Available from: http://www.flickr.com/

photos/mitopencourseware/3323137283/in/set-72157614617563539

[cited 13 August 2012].

[MS94] M. W. Matsen and M. Schick. Stable and unstable phases of a diblock

copolymer melt. Phys. Rev. Lett., 72:2660–2663, Apr 1994.

[OK86] T. Ohta and K. Kawasaki. Equilibrium morphology of block copolymer

melts. Macromolecules, 19(10):2621–2632, 1986.

[Par12a] Q.K. Parsons. Numerical Approximation of the Ohta–Kawasaki Functional

Concept Testing. 2012. Supporting notes in respect of conceptual tests of

the MATLAB implementation. Available from: http://people.maths.

ox.ac.uk/parsons/Results.pdf.

[Par12b] Q.K. Parsons. Numerical Approximation of the Ohta–Kawasaki Functional

MATLAB Implementation Notes. 2012. Supporting notes in respect of

the MATLAB implementation design. Available from: http://people.

maths.ox.ac.uk/parsons/Specification.pdf.

[S10] E. Suli. Numerical Solution of Ordinary Differential Equations. 2010. Tu-

torial notes supplied for this course as part of the Mathematical Modelling

and Scientific Computing MSc at the University of Oxford.

52

http://www.flickr.com/photos/mitopencourseware/3323137283/in/set-72157614617563539

http://www.flickr.com/photos/mitopencourseware/3323137283/in/set-72157614617563539

http://people.maths.ox.ac.uk/parsons/Results.pdf

http://people.maths.ox.ac.uk/parsons/Results.pdf

http://people.maths.ox.ac.uk/parsons/Specification.pdf

http://people.maths.ox.ac.uk/parsons/Specification.pdf

[S12] E. Suli. Private communication. 2012.

[Zha06] P. Zhang. Periodic Phase Separation: A Numerical Study via a Modified

Cahn–Hilliard Equation, 2006. MSc Thesis, Simon Fraser University.

53

Appendix A

Useful Mathematical Results

Note: several of the results that follow are drawn from [Eva98], and as such, apply to

regions with C1 and C2 boundaries. Given that we work on a unit d-cube Ω = (0, 1)d

in the main text, a more appropriate source for these results would be [Ada75]. This

latter, more advanced work quotes analogous results for Lipschitz domains (i.e. do-

mains that are C1 or C2 a.e.).

A.1 Gronwall’s Lemma

We quote a useful theorem (from [AW09]):

Theorem 1. (Gronwall – continuous) Suppose f is a continuous function on [a, b]

which satisfies

f(t) ≤ g(t) +

∫ t

a

h(s)f(s) ds, t ∈ [a, b]

where g is continuous, h ∈ L1(a, b) and h(t) ≥ 0 a.e.. Then

f(t) ≤ g(t) +

∫ t

a

g(s)h(s) exp

(∫ t

s

h(τ) dτ

)ds for all t ∈ [a, b].

In addition, if g is nondecreasing, then

f(t) ≤ g(t) exp

(∫ t

a

h(s) ds

)for all t ∈ [a, b].

In the case where h(s) = c > 0, these inequalities reduce to:

f(t) ≤ g(t) + c

∫ t

a

g(s)ec(t−s) ds for all t ∈ [a, b].

54

and

f(t) ≤ g(t)ec(t−a) for all t ∈ [a, b].

The discrete counterpart to this, drawn from a result in [S10], is:

Theorem 2. (Gronwall – discrete) If

|ek| ≤ K∗ + L∗k−1∑n=0

∆t|en|, n = 1, 2, . . . N

then

|ek| ≤ K∗eL∗n∆t, n = 1, 2, . . . N.

The notation in this last theorem is drawn from the context of error analysis (as

this was its application in [S10]); but the result is nonetheless formally clear and

applied directly in this paper.

A.2 Sobolev inequalities

Combining Theorem 2 from Section 5.6 of [Eva98] and [S12], we have

Theorem 3. (Sobolev’s inequality) If Ω is a bounded, open subset of Rd with a C1

boundary ∂Ω, and u ∈ W 1,p(Ω) then

‖u‖Lp(Ω) ≤ cS ‖u‖H1(Ω) , 1 ≤ p <∞ for d = 1, 2 (A.1)

and

‖u‖L

2dd−2 (Ω)

≤ cS ‖u‖H1(Ω) , for d > 2.

Specifically, for d = 1, 2, 3,

‖u‖L6(Ω) ≤ cS ‖u‖H1(Ω) . (A.2)

We now quote Theorem 6, part (ii) of Section 5.6 of [Eva98] with k = 2, p = 2,

d = 1, 2, 3:

Theorem 4. (Sobolev embedding) Let Ω be a bounded, open subset of Rd, with a C1

boundary and suppose that u ∈ H2(Ω). If

d < 4

55

then

u ∈ C0,γ(Ω),

where

γ =

12, if d = 1, 3;

any positive number < 1, if d = 2.

Additionally,

‖u‖C0,γ(Ω) ≤ C ‖u‖H2(Ω) ,

where C = C(d, |Ω|), and since C0,γ(Ω) is embedded in L∞(Ω),

‖u‖L∞(Ω) ≤ cS ‖u‖H2(Ω) .

A.3 The Holder inequalities

We quote Proposition 3.3.2 from Section 3 of [Coh80]:

Theorem 5. (Holder’s inequality) Let p and q satisfying 1 ≤ p, q ≤ +∞ be conjugate

exponents i.e.

1

p+

1

q= 1.

If f ∈ Lp(Ω) and g ∈ Lq(Ω), then fg ∈ L1(Ω) and∫Ω

fg dΩ ≤ ‖f‖Lp(Ω) ‖g‖Lq(Ω) . (A.3)

A general form of this theorem (from Result (g) of Appendix B.2 of [Eva98]) is

the following:

Theorem 6. (General Holder inequality) Let 1 ≤ p1, . . . , pm ≤ ∞ with

1

p1

+ . . .+1

pm= 1

and uk ∈ Lpk(Ω) for k = 1, . . . ,m. Then∫Ω

|u1 · · · um| dΩ ≤m∏k=1

‖uk‖Lpk (Ω). (A.4)

56

A.4 Other useful identities and inequalities

We start with a version of Young’s inequality (see [Eva98], Result (c) of Appendix

B.2).

Theorem 7. (Young’s inequality with p = q = 1/2)

ab ≤ 1

2γa2 +

γ

2b2 for any γ > 0. (A.5)

Setting γ = 1 in Equation (A.5) leads to

ab ≤ a2

2+b2

2

whence

2ab ≤ a2 + b2

and so

a2 + 2ab+ b2 ≤ 2(a2 + b2),

that is

(a+ b)2 ≤ 2(a2 + b2). (A.6)

Next we quote Theorem 1 from Section 5.8 of [Eva98]:

Theorem 8. (Poincare’s inequality) Let Ω be a bounded, connected, open subset of

Rd with a C1 boundary ∂Ω and suppose 1 ≤ p ≤ ∞. Then there exists a constant cP ,

depending only on d, p and Ω such that∥∥∥∥v −−∫Ω

v dΩ

∥∥∥∥Lp(Ω)

≤ cP ‖∇v‖Lp(Ω)

for each function v ∈ H1(Ω). Specifically, for p = 2 as in the main body of this paper,

we use ∥∥∥∥v −−∫Ω

v dΩ

∥∥∥∥ ≤ cP ‖∇v‖ . (A.7)

Finally, we apply the Divergence Theorem to manipulate inner products involving

functions in H2(Ω) that have a zero Neumann boundary condition:

57

Theorem 9. (A useful integral identity) If w ∈ H2(Ω) and

∂w

∂n

∣∣∣∣∂Ω

= 0

then ∫Ω

∇v · ∇w dΩ = −∫

Ω

v∆w dΩ (A.8)

for all v ∈ H1(Ω).

Proof. We have

∇ · (v∇w)) = ∇v · ∇w + v∆w

so ∫Ω

v∆w dΩ =

∫Ω

∇ · (v∇w)) dΩ−∫

Ω

∇v · ∇w dΩ

=

∫∂Ω

v∇w · n ds︸︷︷︸Divergence Theorem

−∫

Ω

∇v · ∇w dΩ

=

∫∂Ω

v∂w

∂nds︸︷︷︸

= 0 by the BC

−∫

Ω

∇v · ∇w dΩ

= − (∇w,∇v)

and we are done.

We combine Theorem 8.12 of Section 8.4 of [GT01] with Theorem 4.3.1.4 of [Gri85]

and some remarks in [S12]. The theorem from [GT01] applies to a convex polygon

Ω in R2 while the result in [Gri85] applies to Ω, a subset of Rd with a boundary ∂Ω

that is of class C2. Together, these theorems imply the following:

Theorem 10. Under the conditions noted above in respect of Ω, there is a positive

constant C, independent of u such that

‖u‖H2(Ω) ≤ C ‖∆u‖ . (A.9)

58

Appendix B

Calculation Essentials

This chapter contains details of some of the longer calculations that are abbreviated

in the main body of the text. These sections only make sense with reference to those

parts of Chapter 2 whence they are referenced.

B.1 Derivation of the dynamic equation

Continuing the calculation from Equation (2.11) on p. 7, we see that


1

2α

∫Ω

ε2(|∇u|2 + 2α(∇u · ∇φ) + α2|∇φ|2

)+

1

2

(1− u2 − αφ(2u+ αφ)

)2

+σ[(−∆N)−1

((u−m) + αφ

)][(u−m) + αφ]

−ε2 |∇u|2 − 12

(1− u2

)2

− σ[(−∆N)−1

(u−m

)][u−m]

dΩ

(where we used (2.7) twice)

= limα→0

1

2α

∫Ω

ε2(|∇u|2 + 2α(∇u · ∇φ) + α2|∇φ|2

)+

1

2

(1− u2

)2 − αφ(1− u2

)(2u+ αφ) +

1

2α2φ2 (2u+ αφ)2

+σ[(−∆N)−1

(u−m

)+ α(−∆N)−1φ

][(u−m) + αφ]

−ε2 |∇u|2 − 12

(1− u2

)2

− σ[(−∆N)−1

(u−m

)][u−m]

dΩ

(where we used the fact that (−∆N)−1 is a linear operator)

59

= limα→0

1

2α

∫Ω

ε2(2α(∇u · ∇φ) + α2|∇φ|2

)−αφ

(1− u2

)(2u+ αφ) + 1

2α2φ2 (2u+ αφ)2

+ ασφ[(−∆N)−1

(u−m

)]+ ασ [(u−m) + αφ] (−∆N)−1φ

dΩ

= limα→0

∫Ω

ε2(

(∇u · ∇φ) +α

2|∇φ|2

)−1

2φ(1− u2

)(2u+ αφ) + 1

4αφ2 (2u+ αφ)2

+ 12σφ[(−∆N)−1

(u−m

)]+ 1

2σ [(u−m) + αφ] (−∆N)−1φ

dΩ

=

∫Ω

ε2 (∇u · ∇φ)− φu

(1− u2

) dΩ

+σ

2

∫Ω

φ[(−∆N)−1

(u−m

)]+ [u−m] (−∆N)−1φ

dΩ.

We focus on the last line and write it as

σ

2

(φ, (−∆N)−1

(u−m

))+((−∆N)−1φ, u−m

) =σ

2

(φ, (−∆N)−1

(u−m

))+(φ, (−∆N)−1

(u−m

)) by (2.6) and conclude that

〈DE(u), φ〉 = ε2∫

Ω

(∇u · ∇φ) dΩ−∫

Ω

φu(1− u2

)− σφ(−∆N)−1

(u−m

)dΩ

= −ε2∫

Ω

φ∆Nu dΩ−∫

Ω

φu(1− u2

)− σφ(−∆N)−1

(u−m

)dΩ

after applying Theorem 9 (Appendix A.4, p. 58) to the first integral on the right

above and observing the zero Neumann boundary condition on u (note that we do

not require a zero Neumann boundary condition of φ (or more generally, the space

H1∗ (Ω))). We now define

N (u) = u(u2 − 1

),

which is Equation (2.12), to arrive at

〈DE(u), φ〉 = −∫

Ω

φε2∆Nu−N (u)− σ(−∆N)−1

(u−m

)dΩ,

which is Equation (2.13).

60

B.2 Approach and results highlights: the weak

form

Key results are numbered in Fig. B.1 as in the main body of the text in red.

Figure B.1: Argument summary – weak form boundedness and stability

61

B.3 Approach and results highlights: the finite el-

ement approximation

Key results are numbered in Fig. B.2 as in the main body of the text in red.

Figure B.2: Argument summary – finite element approximation boundedness and

stability

62

Appendix C

Supplementary Results and Graphs

The graphs and results presented here are supplementary to those presented in Section

3.2 of the main text. We start with a view of the two two-dimensional discretisations

that were used: the 20× 20 grid used in Section 3.2.1 (see Fig. C.1(a)) and another,

finer 30× 30 grid used in Section 3.2.2 (see Fig. C.1(b)).

0 0.2 0.4 0.6 0.8 10

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

x

Domain Triangulation: 20 × 20

y

(a) 20× 20 grid (showing elements)

0 0.2 0.4 0.6 0.8 10

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

x

Domain Triangulation:MxN=30×30

y

(b) 30× 30 grid (showing elements)

Figure C.1: The two finite element grids (2D) that were used

63

C.1 Varying mass

C.1.1 Computational effort/cost

Fig. C.2 depicts the work (as measured by the number of iterations of the fixed

point method) that was required to resolve the non-linear portion of the problem

to the tolerance (TOL) specified in Table 3.1 for the two mass-variance experiments.

The low iteration counts at late times in Fig. C.2 suggest that the solutions at the

ends of these time-steps served as excellent initial guesses for the fixed point method

at the start of each subsequent step. Given the relatively large time-step used for

these experiments, we were able to simulate relatively long time periods in these

experiments (∼ 12 seconds) and, evidently, achieve quite stable end-state solutions.

0 2 4 6 8 10 120

5

10

15

20

25

30

35

time

# Ite

ratio

ns

Convergence iterations #elements=20

(a) m = 0

0 2 4 6 8 10 120

5

10

15

20

25

30

time

# Ite

ratio

ns


(b) m = 0.4

Figure C.2: Non-linear work (2D) for various m

The contrast with most of the results depicted in Fig. C.5, in which we simulate

approximately one second in each experiment is quite stark and suggests that the

end-state solutions returned for these latter experiments could well be metastable.

64

C.1.2 Component energy evolution

Fig. C.3 depicts the per-component free energy evolution that resulted from the ex-

periments conducted in Section 3.2.1. The symmetric case (m = 0) is illustrated in

Fig. C.3(a) and the non-symmetric case (m 6= 0) in Fig. C.3(b).

0 5 100

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 5 100

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y

Part 2

0 5 100

0.01

0.02

0.03

0.04

time

Fre

e E

nerg

y

Part 3

(a) m = 0

0 5 100

0.05

0.1

time

Fre

e E

nerg

y

Part 1

0 5 100

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y

Part 2

0 5 100

0.005

0.01

0.015

0.02

time

Fre

e E

nerg

yPart 3

(b) m = 0.4

Figure C.3: Typical component energy evolution over time (2D)

65

C.2 Varying the non-local energy coefficient

C.2.1 The initial condition

The initial condition that was used for the four experiments discussed in Section 3.2.2

can be seen in Fig. C.4 (it was taken from the test result set returned for the σ = 800

test). The mass asymmetry (we experiment with m = 0.4) is visually clear in the

‘colorbar’ at the bottom of the figure.

x

y

ε=0.02, σ=800 and m=0.4 at t=0

0 0.5 10

0.2

0.4

0.6

0.8

1

−0.5 0 0.5 1

Figure C.4: Initial condition used to test the effect of varying σ

C.2.2 Computational effort/cost

Computational cost/effort graphs summarising the four tests of Section 3.2.2 can be

seen in Fig. C.5. Fig. C.5(d) relates to the disordered case and resembles the results

depicted in Fig. C.2 suggesting that this test ended at a highly stable end-state. The

results in the other three graphs suggest a distinct lack of stability, however, as the

66

non-linear fixed point algorithm is still ‘working very hard’ (hundreds of iterations)

to resolve each time step. These results are not unexpected if we compare the to-

tal simulated time in the experiments of Appendix C.1.1 (∼ 13 seconds) with those

here (approximately one second each). They are somewhat difficult, intuitively, to

reconcile with the results depicted in the next section in which we see free energy ap-

parently hardly changing. These phenomena have been reported by other researchers

as well – see for instance Fig. 12 of [CPW09].

0 0.2 0.4 0.6 0.8 1 1.20

50

100

150

200

250

300

time

# Ite

ratio

ns


(a) σ = 2

0 0.2 0.4 0.6 0.80

20

40

60

80

100

time

# Ite

ratio

ns


(b) σ = 20

0 0.2 0.4 0.6 0.80

5

10

15

20

25

30

35

40

45

time

# Ite

ratio

ns


(c) σ = 200

0 0.01 0.02 0.03 0.04 0.050

5

10

15

20

25

30

time

# Ite

ratio

ns



Figure C.5: Non-linear work (2D) for various σ

67

C.2.3 Total energy evolution

Fig. C.6 below depicts the total free energy evolution for each of the four experiments

discussed in Section 3.2.2. We notice immediately that each of these functions is

a non-increasing function of time – as expected. In addition, in all four cases, the

energy initially drops quickly and then remains quite stable. An exception is noted

in Fig. C.6(a) which we explore further in Appendix C.2.4 below.

0 0.2 0.4 0.6 0.8 1 1.20

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y


(a) σ = 2

0 0.2 0.4 0.6 0.80

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y


(b) σ = 20

0 0.2 0.4 0.6 0.80

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y


(c) σ = 200

0 0.01 0.02 0.03 0.04 0.050

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y



Figure C.6: Total free energy evolution (2D) for various σ

C.2.4 The extended run for σ = 2

We include here summary results from extending the experiment conducted for σ = 2

in Section 3.2.2, for an additional 2 000 time-steps (equivalent to a further 0.8 sim-

68

ulated seconds). The end-state solution is depicted in Fig. C.7(a) and the total free

energy evolution in Fig. C.7(b). Evidently very little changes during this additional

time. Given that it took ∼ 60 hours to obtain these results, we defer further exam-

ination of this experiment to a significantly more powerful computer than the one

used thus far in these experiments.

x

yε=0.02, σ=2 and m=0.4 at t=0.8

0 0.5 10

0.2

0.4

0.6

0.8

1

−1 −0.5 0 0.5(a) End-state solution u(x, 2); the graph title

shows t = 0.8 because the simulation was re-

started at t = 0 with the output from the origi-

nal experiment

0 0.2 0.4 0.6 0.80

0.05

0.1

0.15

0.2

time

Fre

e E

nerg

y


(b) Total free energy evolution for σ = 2 from

t = 1.2 secs to t = 2.0 secs

Figure C.7: Extended run results for σ = 2 (2D)

69

C.2.5 h-independence

We conclude with a basic investigation of h-independence to see if our results are in

some way an artifact of the underlying discretisation of Ω. In the short term, this

investigation will serve as a proxy for a formal analysis of the error and convergence

properties of our numerical method.

As noted previously, our finite element code is quite performance intensive and

so working on finer meshes than those depicted in Fig. C.1 is difficult. Nevertheless,

we interpolate1 the initial condition from the σ = 200 experiment from Section 3.2.2

onto a 100 × 100 mesh and then re-execute the experiment for 500 time-steps. We

compare the results on the fine grid to those obtained on the 30× 30 mesh after the

500th time-step in Fig. C.8. A few observations follow:

• It took ∼ 16 hours to obtain the results depicted in Fig. C.8(b) and so further

analysis over longer simulated time horizons is deferred until we have access to

a more powerful computer.

• The contour lines on the 100 × 100 mesh are smoother than their 30 × 30

counterparts – as expected.

• We notice that the coarse 30× 30 mesh is not a subset of the (finer) 100× 100

mesh. Consequently, we retain the same average mass in going to the finer mesh,

but the interpolation has the effect of smoothing the initial condition some-

what.2 We see the effect of this in the energy evolution diagrams in Fig. C.8,

where the initial free energy is somewhat lower on the 100×100 mesh. A detailed

investigation reveals that the ‘Part 1’ energy (related to the gradient on each

element) is lower on the fine mesh, whereas the initial ‘Part 2’ and ‘Part 3’ en-

ergies are largely mesh-independent. The pattern of energy evolution, however,

is quite similar once the numerical method takes over.

• Each of the physical ‘spots’ present in Fig. C.8(a) has a corresponding structure

in Fig. C.8(b) in approximately the same location in Ω. The two spots in the top

and bottom left corners (see the red arrows on the two sub-figures in Fig. C.8)

are in slightly different positions, but there is good empirical evidence to suggest

1We use the ‘interp2’ function in Matlab for this.2Subsequent investigation also revealed that the Matlab ‘interp2’ function does not interpolate

onto finer grids in the way we would want it to for a triangulation (in estimating values at new

nodes, it averages in all directions).

70

that these two solutions are heading towards essentially the same (stable) end-

state.

The same test was executed for the σ = 800 ‘disorder’ case from Fig. 3.5(d). The 100×100 mesh results match those observed on the 30× 30 mesh very closely, suggesting

that our results are not seriously affected by the discretisation on which we work

(notwithstanding our approach to interpolation onto the finer grid with ‘interp2’).

x

y

ε=0.02, σ=200 and m=0.4 at t=0.2000

→

→

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

0 0.05 0.1 0.15 0.20

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

time

Fre

e E

nerg

y

Free Energy Evolution Over Time

−0.5 0 0.5

(a) 30× 30 grid

x

y

ε=0.02, σ=200 and m=0.4 at t=0.2000

→

→

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

0 0.05 0.1 0.15 0.20

0.05

0.1

0.15

0.2

0.25

0.3

time

Fre

e E

nerg

y

Free Energy Evolution Over Time

−0.5 0 0.5

(b) 100× 100 grid

Figure C.8: Results for σ = 200 (2D) after 500 time steps on two different meshes

71

numerical approximation of the ohta{kawasaki...

Documents