data assimilation in high dimensions - nyu courantdtkelly/slides/data_cam.pdf · data assimilation...

22
Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant Institute New York University New York NY www.dtbkelly.com February 3, 2016 DPMMS, University of Cambridge David Kelly (CIMS) Data assimilation February 3, 2016 1 / 12

Upload: others

Post on 27-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Data assimilation in high dimensions

David Kelly

Kody Law Andy Majda Andrew Stuart Xin Tong

Courant InstituteNew York University

New York NYwww.dtbkelly.com

February 3, 2016

DPMMS, University of Cambridge

David Kelly (CIMS) Data assimilation February 3, 2016 1 / 12

Page 2: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

What is data assimilation?

Suppose u satisfiesdu

dt= F (u)

with some unknown initial condition u0. We are most interested ingeophysical models, so think high dimensional, nonlinear, possiblystochastic.

Suppose we make partial, noisy observations at times t = h, 2h, . . . , nh, . . .

yn = Hun + ξn

where H is a linear operator (think low rank projection), un = u(nh), andξn ∼ N(0, Γ) iid.

The aim of data assimilation is to say something about the conditionaldistribution of un given the observations {y1, . . . , yn}

David Kelly (CIMS) Data assimilation February 3, 2016 2 / 12

Page 3: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

How does filtering work: (initialization)

x

y

Ψ

obs

Figure: The blue circlerepresents our guess ofu0. Due to theuncertainty in u0, this isa probability measure.

David Kelly (CIMS) Data assimilation February 3, 2016 3 / 12

Page 4: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

How does filtering work: (forecast)

x

y

Ψ

obs

Figure: Apply the time hflow map Ψ. Thisproduces a newprobability measurewhich is our forecastedestimate of u1. This iscalled the forecast step.

David Kelly (CIMS) Data assimilation February 3, 2016 3 / 12

Page 5: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

How does filtering work: (make an observation)

x

y

Ψ

obs

Figure: We make anobservationy1 = Hu1 + ξ1. In thepicture, we only observethe x variable.

David Kelly (CIMS) Data assimilation February 3, 2016 3 / 12

Page 6: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

How does filtering work: (find best fit using Bayes)

x

y

Ψ

obs

Figure:P(u1|y1) ∝ P(y1|u1)P(u1)(Bayes formula)

David Kelly (CIMS) Data assimilation February 3, 2016 3 / 12

Page 7: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Problems in high dimensions

In numerical weather prediction the state dimension is O(109).

1) Difficult to store a density of this size

2) Computing the ‘forecast step’ is an integration over the statespace.

We need low dimensional approximations of the filtering problem.

We will look at the Ensemble Kalman filter.

David Kelly (CIMS) Data assimilation February 3, 2016 4 / 12

Page 8: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Ensemble Kalman filter (Evensen 94)

x

y

Ψ

obs

Figure: Start with Kensemble membersdrawn from somedistribution. Empiricalrepresentation of u0.The ensemble membersare denoted v

(k)0 .

Only KN numbers are stored.

David Kelly (CIMS) Data assimilation February 3, 2016 5 / 12

Page 9: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Ensemble Kalman filter (Forecast step)

x

y

Ψ

obs

Figure: Apply thedynamics Ψ to eachensemble member.

v(k)0 7→ Ψ(v

(k)0 )

David Kelly (CIMS) Data assimilation February 3, 2016 5 / 12

Page 10: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Ensemble Kalman filter (Make obs)

x

y

Ψ

obs

Figure: Make anobservation.

David Kelly (CIMS) Data assimilation February 3, 2016 5 / 12

Page 11: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Ensemble Kalman filter (Perturb obs)

x

y

Ψ

obs

Figure: Turn theobservation into Kartificial observations byperturbing by the samesource of observationalnoise.

y(k)1 = y1 + ξ

(k)1

David Kelly (CIMS) Data assimilation February 3, 2016 5 / 12

Page 12: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Ensemble Kalman filter (find best fit using Bayes)

x

y

Ψ

obs

Figure: Update eachmember using the‘Kalman updateformula’. This is a linearapproximation of Bayes.

v(k)1 = Ψ(v

(k)0 ) + K 1(y

(k)1 − HΨ(v

(k)0 ))

David Kelly (CIMS) Data assimilation February 3, 2016 5 / 12

Page 13: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Why should mathematicians be interested?

A widely used algorithm (NWP, disease forecasting, chemical reactions)with many questions and not so many answers:

1 - Is the filter stable to perturbations? eg. Will different initializationsconverge? (ergodicity)

2 - Is the filter accurate? Is the posterior consistent with the true signal?

3 - Can we design mathematically sensible alternative algorithms?

4 - Can we understand why/when the filter fails?

David Kelly (CIMS) Data assimilation February 3, 2016 6 / 12

Page 14: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Catastrophic filter divergenceLorenz-96: u̇j = (uj+1 − uj−2)uj−1 − uj + F with j = 1, . . . , 40. PeriodicBCs. Observe every fifth node. (Harlim-Majda 10, Gottwald-Majda 12)

True solution in a bounded set, but filter blows up to machine infinity infinite time!

David Kelly (CIMS) Data assimilation February 3, 2016 7 / 12

Page 15: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

For complicated models, onlyheuristic arguments offered as

explanation.Can we prove it for a simpler constructive model?

David Kelly (CIMS) Data assimilation February 3, 2016 8 / 12

Page 16: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

The rotate-and-lock map (K., Majda, Tong. PNAS 15.)

The model Ψ : R2 → R2 is a composition of two mapsΨ(x , y) = Ψlock(Ψrot(x , y)) where

Ψrot(x , y) =

(ρ cos θ −ρ sin θρ sin θ ρ cos θ

)(xy

)and Ψlock rounds the input to the nearest point in the grid

G = {(m, (2n + 1)ε) ∈ R2 : m, n ∈ Z} .

It is easy to show that this model has an energy dissipation principle:

|Ψ(x , y)|2 ≤ α|(x , y)|2 + β

for α ∈ (0, 1) and β > 0.

David Kelly (CIMS) Data assimilation February 3, 2016 9 / 12

Page 17: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

(a)

Figure: The red squareis the trajectory un = 0.The blue dots are thepositions of the forecastensemble Ψ(v+

0 ),Ψ(v−

0 ). Given thelocking mechanism inΨ, this is a naturalconfiguration.

David Kelly (CIMS) Data assimilation February 3, 2016 10 / 12

Page 18: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

(b)

Figure: We make anobservation (H shownbelow) and perform theanalysis step. The greendots are v+

1 , v−1 .

Observation matrix

H =

(1 0ε−2 1

)

The filter is ‘sure’ that u1 = x̂ (the dashed line). The filter deduces thatthe observation is approximately (y1, y2) = (x̂ , ε−2x̂ + u2).

Thus v±1 ≈ (x̂ ,−ε−2x̂)

David Kelly (CIMS) Data assimilation February 3, 2016 10 / 12

Page 19: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

(c)

Figure: Beginning thenext assimilation step.Apply Ψrot to theensemble (blue dots)

David Kelly (CIMS) Data assimilation February 3, 2016 10 / 12

Page 20: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

(d)

Figure: Apply Ψlock .The blue dots are theforecast ensembleΨ(v+

1 ), Ψ(v−1 ). Exact

same as frame 1, buthigher energy orbit. Thecycle repeats leading toexponential growth.

David Kelly (CIMS) Data assimilation February 3, 2016 10 / 12

Page 21: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

Theorem (K.-Majda-Tong 15 PNAS)

For any N > 0 and any p ∈ (0, 1) there exists a choice ofparameters such that

P(|v (k)n | ≥ Mn for all n ≤ N

)≥ 1− p

where Mn is an exponentially growing sequence.

ie - The filter can be made to grow exponentially for an arbitrarily longtime with an arbitrarily high probability.

David Kelly (CIMS) Data assimilation February 3, 2016 11 / 12

Page 22: Data assimilation in high dimensions - NYU Courantdtkelly/slides/data_cam.pdf · Data assimilation in high dimensions David Kelly Kody Law Andy Majda Andrew Stuart Xin Tong Courant

References

1 - D. Kelly, K. Law & A. Stuart. Well-Posedness And Accuracy Of TheEnsemble Kalman Filter In Discrete And Continuous Time. Nonlinearity(2014).

2 - D. Kelly, A. Majda & X. Tong. Concrete ensemble Kalman filters withrigorous catastrophic filter divergence. Proc. Nat. Acad. Sci. (2015).

3 - X. Tong, A. Majda & D. Kelly. Nonlinear stability and ergodicity ofensemble based Kalman filters. Nonlinearity (2016).

4 - X. Tong, A. Majda & D. Kelly. Nonlinear stability of the ensembleKalman filter with adaptive covariance inflation. To appear in Comm.Math. Sci. (2015).

All my slides are on my website (www.dtbkelly.com) Thank you!

David Kelly (CIMS) Data assimilation February 3, 2016 12 / 12