recursive calibration of ecosystem models using sequential data assimilation mingshi chen¹,...

1
Recursive Calibration of Ecosystem Models Using Sequential Data Assimilation Mingshi Chen¹, Shuguang Liu¹, Larry L. Tieszen², and David Y. Hollinger 3 ¹ SAIC, contractor to the USGS Center for Earth Resources Observation and Science (EROS), Sioux Falls, South Dakota 57198, USA. Work performed under USGS contract 03CRCN0001, ² USGS Center for EROS, Sioux Falls, South Dakota 57198, USA, 3 USDA Forest Service, Northern Research Station, Durham, New Hampshire,03824 USA Introduction The specification of model parameter values to characterize a system is a fundamental issue in contemporary ecosystem science. The conventional inversion methods that treat observations as a whole lack the flexibility to investigate possible temporal evolution of the model parameters. Their main weakness is that all errors from input, output, and model structure are attributed solely to model parameter uncertainties. Sequential data assimilation procedures such as the ensemble Kalman filter (EnKF) have the potential to overcome this drawback by explicitly taking all sources of uncertainty into account step by step. Figure 1. Temporal variations of two key parameters in the flux partition model: (a) light use efficiency (lue) and (b) reference respiration (R ref ). The grey vertical lines indicate the standard deviation around the mean of ensembles. The SEnKF reveals the strong seasonality of the key parameters. Approach Model: We use a flux partition model as our test dynamic model for the SEnKF method. The flux partition model divides net ecosystem exchange (NEE) into gross primary production (GPP) and total ecosystem respiration (RESP) as follows: where subscript t denotes time-dependent, LUE t is light use efficiency, PAR t is photosysnthetically active radiation, NDVI t is the normalized difference vegetation index, R ref,t is respiration when air temperature (T air,t ) equals reference temperature (T ref,t , usually specified as 10 o C), E 0 is temperature sensitivity, and T 0 is a datum of temperature to avoid a denominator of zero in the model, kept constant at –46.02 o C. D temp determines the effect of temperature on photosynthesis, and D VPD expresses the decrease in leaf exchange from both photosynthesis and transpiration caused by vapour pressure deficit (VPD), according to where T min , T opt , and T max denote minimum, optimal, and maximum temperatures for photosynthesis, respectively, VPD is vapour pressure deficit, and v 0 and v 1 are two unknown coefficients. Data: Net Ecosystem Exchange (NEE) of carbon, PAR, Ta, and VPD were from Ameriflux tower station in Howland (Maine, USA). RESP data were calculated from the temperature dependence curve of ecosystem respiration derived from nighttime NEE, GPP data were pseudo- observations as a total of NEE and RESP. NDVI: from MODIS. Time: from 2000 to 2004 Data Assimilation: All unknown parameters were treated as state variables. Evolution models of the parameters similar to that of the state variables were built by applying a Kernel smoothing technique on parameter samples . The initial values of parameters are optimally determined by a nonlinear inverse procedure. An Ensemble Kalman Filter (EnKF) was used to sequentially assimilate measurements into the dynamic model described above, and into evolution models of parameters to improve estimates of GPP, R, NEE and the parameter values. ) 0 , ( 2 min max min max ) ( ) )( ( ) )( ( air opt air air air air T T T T T T T T T T temp Max D ) 1 ( 5 . 0 1 0 1 1 VPD v e v VPD D Conclusion The SEnKF revealed that the parameter values (e.g., light efficiency and reference respiration) possessed strong seasonality or temporal variability (Fig.1). The SEnKF can quickly stabilize the parameter values regardless of their initial values (Fig.2). These demonstrate that the SEnKF can be used to perform recursive model calibration to diagnose the adequacy of the model structure. Figure 3 shows that the SEnKF can significantly improve the estimate of GPP, RESP and NEE (left panel) and reduce up to 70 percent of the variation of the ensemble without data assimilation (right panel). Figure 4 shows that the estimates of GPP, RESP, and NEE using the parameter values modified by the SEnKF had less bias against the observation than the estimates using the base model. This indicates that the SEnKF can extend the model parameter values to predictions when the observations are not available. This capability could be valuable for filling data gaps caused by instrument failure. Goal Developing a smoothed ensemble Kalman filter (SEnKF) to simultaneously estimate system states and model parameters when observations are progressively available in time. quickly stabilize the variation of parameters. The temporal variation of parameters is relatively smooth because a smoothing procedure was implemented in the SEnKF to control the over-dispersion of parameter sampling. y = 0.9017x + 0.2651 R 2 = 0.88 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 observed gpp(gC /m 2 /d) g p p (sen kf) 4.a y = 0.9543x + 0.0226 R 2 = 0.9406 0 2 4 6 8 10 12 0 2 4 6 8 10 12 observed resp(gC /m 2 /d) resp (sen kf) 4.b y = 0.8214x + 1.3419 R 2 = 0.8282 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 observed gpp(gC /m 2 /d) g p p (m odel 4.d y = 0.5157x + 1.0909 R 2 = 0.8037 0 2 4 6 8 10 12 0 2 4 6 8 10 12 observed resp(gC /m 2 /d) resp (m od el) 4.e y = 0.6706x + 1.3618 R 2 = 0.3383 -3 -2 -1 0 1 2 3 4 5 6 -6 -4 -2 0 2 4 6 8 observed nee(gC /m 2 /d) n ee (m od el) 4.f y = 0.7358x + 0.16 R 2 = 0.5759 -6 -4 -2 0 2 4 6 8 -6 -4 -2 0 2 4 6 8 observed nee(gC /m 2 /d) n ee(sen kf) 4.c Figure 3. The left panel compares estimates of GPP, RESP, and NEE generated by the SEnKF and the base model against 20 percent of the assimilated data. Data assimilation accounted for more than 99 percent of the variation in the observations of GPP, REPS, and NEE, The right panel shows reduced ratio of ensemble variances of GPP, RESP, and NEE generated by the SEnKF against that by the base model alone. The results show the SEnKF can more dramatically reduce variances of state variables than the ensemble based only on the Monte Carlo technique. Figure 4. The left panel shows the forecasted values of three state variables (GPP, RESP and NEE) of the model modified by the SEnKF against unassimilated data (80 percent of total observations). The right panel displays the simulation results by the base model against the same unassimilated data. Comparing three pairs of corresponding linear fitting regression equations for GPP, RESP and NEE and their corresponding coefficients of determination (R 2 ), we see that the estimates of the three flux variables using the parameters modified by the SEnKF had less bias against observations than estimates using the base model. D aily variation oflightuse efficiency 0 0.005 0.01 0.015 0.02 2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004 year lu e (m o l C lue 1.a Daily variation ofRref 0 1 2 3 4 5 6 7 8 9 2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004 year R r Rref 1.b U. S. Department of the Interior U. S. Geological Survey )] ( exp[ 0 , 0 , 1 1 0 , T T T T t ref t VPD temp t t t t t t t t air t ref E R RESP D D NDVI PAR LUE GPP RESP GPP NEE EffectofInitiallightuse efficiency 0 0.002 0.004 0.006 0.008 0.01 0.012 0.014 0.016 0.018 0.02 2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004 year lu e (m o l C large initial lue m edium initial lue sm all initial lue 2.a EffectofinitialRref 0 1 2 3 4 5 6 7 8 2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004 year R r large initial Rref m edium initial Rref sm all initial Rref 2.b ___SEnKF eq. y = 0.9814x + 0.0111 R 2 = 0.9929 ...M odel eq. y = 0.5279x + 1.0494 R 2 = 0.8278 0 2 4 6 8 10 12 0 2 4 6 8 10 12 observed resp(gC/m 2 /d) sim u lated re 3.b ___ SEnKF eq. y = 0.9946x + 0.001 R 2 = 0.9927

Upload: philomena-oliver

Post on 18-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Recursive Calibration of Ecosystem Models Using Sequential Data Assimilation Mingshi Chen¹, Shuguang Liu¹, Larry L. Tieszen², and David Y. Hollinger 3

Recursive Calibration of Ecosystem Models Using Sequential Data Assimilation

Mingshi Chen¹, Shuguang Liu¹, Larry L. Tieszen², and David Y. Hollinger 3

¹ SAIC, contractor to the USGS Center for Earth Resources Observation and Science (EROS), Sioux Falls, South Dakota 57198, USA. Work performed under USGS contract 03CRCN0001, ² USGS Center for EROS, Sioux Falls, South Dakota 57198, USA, 3USDA Forest Service, Northern Research Station, Durham, New Hampshire,03824 USA

Introduction

The specification of model parameter values to characterize a system is a fundamental issue in contemporary ecosystem science. The conventional inversion methods that treat observations as a whole lack the flexibility to investigate possible temporal evolution of the model parameters. Their main weakness is that all errors from input, output, and model structure are attributed solely to model parameter uncertainties. Sequential data assimilation procedures such as the ensemble Kalman filter (EnKF) have the potential to overcome this drawback by explicitly taking all sources of uncertainty into account step by step.

Figure 1. Temporal variations of two key parameters in the flux partition model: (a) light use efficiency (lue) and (b) reference respiration (Rref). The grey vertical lines indicate the standard deviation around the mean of ensembles. The SEnKF reveals the strong seasonality of the key parameters.

Approach

Model: We use a flux partition model as our test dynamic model for the SEnKF method. The flux partition model divides net ecosystem exchange (NEE) into gross primary production (GPP) and total ecosystem respiration (RESP) as follows:

where subscript t denotes time-dependent, LUEt is light use efficiency, PARt is photosysnthetically active radiation, NDVIt is the normalized difference vegetation index, Rref,t is respiration when air temperature (Tair,t) equals reference temperature (Tref,t, usually specified as 10 oC), E0 is temperature sensitivity, and T0 is a datum of temperature to avoid a denominator of zero in the model, kept constant at –46.02 oC. Dtemp determines the effect of temperature on photosynthesis, and DVPD expresses the decrease in leaf exchange from both photosynthesis and transpiration caused by vapour pressure deficit (VPD), according to

where Tmin, Topt, and Tmax denote minimum, optimal, and maximum temperatures for photosynthesis, respectively, VPD is vapour pressure deficit, and v0 and v1 are two unknown coefficients.

Data:

• Net Ecosystem Exchange (NEE) of carbon, PAR, Ta, and VPD were from Ameriflux tower station in Howland (Maine, USA). RESP data were calculated from the temperature dependence curve of ecosystem respiration derived from nighttime NEE, GPP data were pseudo-observations as a total of NEE and RESP.

• NDVI: from MODIS. Time: from 2000 to 2004

Data Assimilation:

• All unknown parameters were treated as state variables. Evolution models of the parameters similar to that of the state variables were built by applying a Kernel smoothing technique on parameter samples . The initial values of parameters are optimally determined by a nonlinear inverse procedure.

• An Ensemble Kalman Filter (EnKF) was used to sequentially assimilate measurements into the dynamic model described above, and into evolution models of parameters to improve estimates of GPP, R, NEE and the parameter values.

)0,( 2minmax

minmax

)())((

))((

airoptairair

airair

TTTTTT

TTTTtemp MaxD

)1(5.01

011

VPDvevVPDD

Conclusion

• The SEnKF revealed that the parameter values (e.g., light efficiency and reference respiration) possessed strong seasonality or temporal variability (Fig.1). The SEnKF can quickly stabilize the parameter values regardless of their initial values (Fig.2). These demonstrate that the SEnKF can be used to perform recursive model calibration to diagnose the adequacy of the model structure.

• Figure 3 shows that the SEnKF can significantly improve the estimate of GPP, RESP and NEE (left panel) and reduce up to 70 percent of the variation of the ensemble without data assimilation (right panel).

• Figure 4 shows that the estimates of GPP, RESP, and NEE using the parameter values modified by the SEnKF had less bias against the observation than the estimates using the base model. This indicates that the SEnKF can extend the model parameter values to predictions when the observations are not available. This capability could be valuable for filling data gaps caused by instrument failure.

Goal Developing a smoothed ensemble Kalman filter (SEnKF) to simultaneously estimate system states and model parameters when observations are progressively available in time.

Figure 2. Regardless of initial values of parameters, the SEnKF can quickly stabilize the variation of parameters. The temporal variation of parameters is relatively smooth because a smoothing procedure was implemented in the SEnKF to control the over-dispersion of parameter sampling.

y = 0.9017x + 0.2651R2 = 0.88

0

2

4

6

8

10

12

14

0 2 4 6 8 10 12 14observed gpp(gC/m 2/d)

gp

p (

sen

kf)

4.a

y = 0.9543x + 0.0226R2 = 0.9406

0

2

4

6

8

10

12

0 2 4 6 8 10 12observed resp(gC/m 2/d)

resp

(sen

kf)

4.b

y = 0.8214x + 1.3419R2 = 0.8282

0

2

4

6

8

10

12

14

0 2 4 6 8 10 12 14observed gpp(gC/m 2/d)

g

pp

(m

od

el)

4.d

y = 0.5157x + 1.0909R2 = 0.8037

0

2

4

6

8

10

12

0 2 4 6 8 10 12observed resp(gC/m 2/d)

re

sp

(m

od

el)

4.e

y = 0.6706x + 1.3618R2 = 0.3383-3

-2

-1

0

1

2

3

4

5

6

-6 -4 -2 0 2 4 6 8

observed nee(gC/m 2/d)

n

ee (

mo

del)

4.f

y = 0.7358x + 0.16R2 = 0.5759

-6

-4

-2

0

2

4

6

8

-6 -4 -2 0 2 4 6 8

observed nee(gC/m 2/d)

nee(s

en

kf)

4.c

Figure 3. The left panel compares estimates of GPP, RESP, and NEE generated by the SEnKF and the base model against 20 percent of the assimilated data. Data assimilation accounted for more than 99 percent of the variation in the observations of GPP, REPS, and NEE, The right panel shows reduced ratio of ensemble variances of GPP, RESP, and NEE generated by the SEnKF against that by the base model alone. The results show the SEnKF can more dramatically reduce variances of state variables than the ensemble based only on the Monte Carlo technique.

Figure 4. The left panel shows the forecasted values of three state variables (GPP, RESP and NEE) of the model modified by the SEnKF against unassimilated data (80 percent of total observations). The right panel displays the simulation results by the base model against the same unassimilated data. Comparing three pairs of corresponding linear fitting regression equations for GPP, RESP and NEE and their corresponding coefficients of determination (R2), we see that the estimates of the three flux variables using the parameters modified by the SEnKF had less bias against observations than estimates using the base model.

Daily variation of light use efficiency

0

0.005

0.01

0.015

0.02

2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004

year

lu

e (m

ol C

/m

ol P

AR

)

lue

1.a

Daily variation of Rref

0

1

2

3

4

5

6

7

8

9

2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004

year

Rref

Rref

1.b

U. S. Department of the InteriorU. S. Geological Survey

)](exp[

0,0,

110, TTTTtreft

VPDtemptttt

ttt

tairtrefERRESP

DDNDVIPARLUEGPP

RESPGPPNEE

Effect of Initial l ight use efficiency

0

0.002

0.004

0.006

0.008

0.01

0.012

0.014

0.016

0.018

0.02

2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004

year

lu

e(m

ol C

/m

ol P

AR

)

large initial lue

medium initial lue

small initial lue

2.a

Effect of initial Rref

0

1

2

3

4

5

6

7

8

2000 2000.5 2001 2001.5 2002 2002.5 2003 2003.5 2004

year

Rref

large initial Rref

medium initial Rref

small initial Rref

2.b

___SEnKF eq.y = 0.9814x + 0.0111

R2 = 0.9929

…... Model eq.y = 0.5279x + 1.0494

R2 = 0.8278

0

2

4

6

8

10

12

0 2 4 6 8 10 12

observed resp(gC/m2/d)

sim

ula

ted

resp

3.b

___ SEnKF eq.y = 0.9946x + 0.001

R2 = 0.9927

…... Model eq.y = 0.6351x + 1.3768

R2 = 0.3263-6

-4

-2

0

2

4

6

8

-6 -4 -2 0 2 4 6 8

observed nee(gC/m 2/d)

sim

ula

ted

nee

3.c

…... Model eq.y = 0.8011x + 1.4346

R2 = 0.8085

___SEnKF eq.y = 0.9735x - 0.0085

R2 = 0.9956

0

2

4

6

8

10

12

14

0 2 4 6 8 10 12 14observed gpp(gC/m2/d)

sim

ula

ted

gp

p

Model SEnKF

3.a

0102030405060708090

100

0 0.2 0.4 0.6 0.8 1

sdv of model gpp

red

uced

rati

o o

f g

pp3.d

0102030405060708090

100

0 0.2 0.4 0.6 0.8 1

sdv of model resp

red

uced

rati

o o

f re

sp

3.e

0

10

20

30

40

50

60

70

80

90

100

0 0.2 0.4 0.6 0.8 1

sdv of model neere

du

ce r

ati

o o

f n

ee

3.f

Feb. 22, 2007