national center for atmospheric research boulder, colorado ... · mation about how to download...

1
* *** h tk tk+2 y y h h Ensemble Data Assimilation with the Community Land Model and the US National Water Model T. Hoar, M. Gharamti, J. McCreight, A. RafieeiNasab, A. Fox, J. Anderson, N. Collins, J. Hendricks, K. Raeder National Center for Atmospheric Research Boulder, Colorado, USA [email protected] 1. Ensemble Data Assimilation with DART The Data Assimilation Research Testbed (DART) is an open source community software facility for ensemble data as- similation developed at the National Center for Atmospheric Research (NCAR). DART has been free and publically avail- able for more than 10 years. Building an interface between DART and a new model does not require an adjoint and generally requires no modifications to the model code. DART works with dozens of models of varying complexity, including : weather models, e.g. WRF, COAMPS, COSMO, MPAS Atmosphere climate models components, e.g. CAM, POP, CLM, WACCM, MPAS Ocean, ROMS, FESOM, CICE5, WRF- Hydro, NOAH atmospheric chemistry models, e.g. CAM-CHEM, WRF- CHEM low-order and simple research models DART assimilates a wide variety of observations : land observations such as snow cover fraction, ground water depth, tower fluxes, cosmic ray neutron intensity, and microwave brightness temperature observations temperature, winds, moisture from NCEP, MADIS, and SSEC total precipitable water, radar observations, radio occul- tation observations from GPS satellites some observations in the World Ocean Database ... DART provides both state-of-the-art ensemble data assimi- lation capabilities and an interactive educational platform to researchers and students. www.image.ucar.edu/DAReS/DART has infor- mation about how to download DART, the DART educational materials, and how to contact us. 2. DART ‘Manhattan’ Release Highlights Much of the development effort has been to support larger model states using less memory and improve performance, particularly I/O. F Ability to handle much larger model states. Distributing the model state across all tasks means no single task must store the entire state at any time. F One-sided MPI communication allows tasks to request remote data items from other tasks without interrupt- ing their execution or arranging which data items will be needed in advance. F Computing the forward operators for all ensemble mem- bers at the same time leads to better vector code. F Native NetCDF support eliminates conversion between NetCDF model files and DART. This also reduces the high-water mark for disk requirements. F Ensemble data can be read and distributed across all tasks on a variable-by-variable basis, reducing maximum memory requirements. F Diagnostic files are now written in parallel, resulting in faster I/O and lower memory requirements. F Supports externally-computed observation operators. F Supports per-observation-type localization radii. 3. Acknowledgments The National Center for Atmospheric Research is sponsored by the National Science Foundation. Computational resources were provided by NCAR’s Computational and Information Systems Laboratory. 4. Surface Fluxes with CLM4.5 Special Thanks to Andy Fox (University of Arizona) CLM4.5 was used at the Niwot Ridge LTER site to explore the impact of latent heat, sensible heat, and net ecosystem production observations on relevant CLM variables. 9.7 km east of the Continental Divide, in a Subalpine Forest Spun up for 1500 years with site-specific information. 64 ensemble members using an ensemble reanalysis forcing Assimilating tower fluxes of latent heat (LE), sensible heat (H), and net ecosystem production (NEP). unobserved variables: LEAFC, LIVEROOTC, LIVESTEMC, DEAD- STEMC, LITR1C, LITR2C, SOIL1C, SOIL2C, SOILLIQ The model states are being updated at about 8PM local time. FYI: We are assimilating flux observations … Sensible heat Latent heat Net Ecosystem Production Figure 1: The observations of the eddy covariance fluxes are available every 30 minutes. The free run of the model is shown in blue, the assimilation result is shown in red. Figure 2: The free run is shown in blue. The assimilation result is shown in red. Carbon Cycle We have added a number of observations to the CLM-DART workflow, such as leaf area index (LAI) and aboveground biomass - key components of the terrestrial carbon cycle. DART’s adaptive inflation algorithm allows the ensemble to become consistent with the observations and then main- tain a useful ensemble spread. This effort will be moving to CLM5.0 to take advantage of the expanded treatment of the carbon and nitrogen cycles. Forecast Biomass LAI DA Run Observations Free Run Free Run Figure 3: This experiment highlights a study in a semi-arid region in New Mexico. Monthly LAI and annual biomass ob- servations are able to provide a much better representation of the sesaonal cycle and accurate initial conditions for fore- casting. The impact persists for years for different C pools. 5. Soil Moisture with CLM5 The Community Land Model 5.0 was released in February 2018 and is the latest in a series of land models devel- oped through the Community Earth System Model (CESM). CLM4.5 biogeophysics and biogeochemistry can be run from this release code. A new river model (MOSART) is also included. This release was a land-only release. The Functionally Assembled Terrestrial Ecosystem Simula- tor (FATES) is available as a research option. Direct solar Absorbed solar Diffuse solar Downwelling longwave Reflected solar Emitted longwave Sensible heat flux Latent heat flux u a 0 Momentum flux Wind speed Ground heat flux Evaporation Melt Sublimation Throughfall Infiltration Surface runoff Evaporation Transpiration Precipitation Heterotrop. respiration Photosynthesis Autotrophic respiration Litterfall N uptake Vegetation C/N Soil C/N N mineralization Fire Surface energy fluxes Hydrology Biogeochemical cycles Aerosol deposition Soil (sand, clay, organic) Sub-surface runoff Phenology BVOCs Water table Saturated Zone Soil Dust Saturated fraction N dep N fix Denitrification N leaching CH 4 Root litter N 2 O SCF Surface water Bedrock Glacier Lake River Routing Runoff River discharge Urban Land Use Change Wood harvest Disturbance Vegetation Dynamics Growth Competition Wetland Crops Irrigation Flooding CLM5.0 Impermeable Bedrock Figure 4: The representation of the processes in CLM5.0 which was released this February. Gridcell Glacier Lake Landunit Column Patch Urban Vegetated Soil Crop PFT1 PFT2 PFT3 PFT4 … Unirrig Irrig Unirrig Irrig Crop1 Crop1 Crop2 Crop2 … Roof Sun Wall Shade Wall Pervious Impervious TBD MD HD Elevation classes G L C UIrr1 V PFT4 V PFT3 V PFT1 V PFT2 C Irr1 U T,H,M C Irr2 C UIrr2 Figure 5: The hierarchy of a single CLM5.0 gridcell. The only location information is at the gridcell level. We are working on an approach to more appropriately relate obser- vation locations to specific Landunits, Columns or Patches. 0 45 90 135 180 -30 0 30 60 0 2 4 6 8 10 12 14 16 18 kg/m 2 Figure 6: SMAP locations from 23 July 2015 were the ba- sis for a short proof-of-concept experiment. The soil mois- ture (liquid and ice) from CLM at the SMAP locations were used in an observing system simulation experiment. 01-Jul 02-Jul 03-Jul 04-Jul 05-Jul 06-Jul 07-Jul Jul.02,2001 00:00:00 start 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 rmse and bias (model - observation) SOIL_MOISTURE rmse pr=0.44353, po=0.038084 bias pr=-0.0012196, po=-0.00047593 rmse bias 29150 29200 29250 29300 29350 29400 29450 29500 # of obs : o=possible, $=assimilated Figure 7: The proof-of-concept result for soil moisture in the first layer of CLM. This is a 40 member ensemble. 6. U.S. National Water Model The NCAR-supported community Weather Re- search and Forecasting Hydrologic model (WRF-Hydro) is a modeling system and frame- work for hydrologic modeling and model cou- pling. In 2016 a configuration of WRF-Hydro was implemented as the National Water Model (NWM) for the continental United States. Figure 8: Left: The test domain ‘Matthew’ for WRF-Hydro and DART. The domain is approximately 100,000 km 2 . The triangles are at stream gage locations. The LSM runs at 1 km resolution. Right: A depiction of the Stream Order. WRF-Hydro is configured to use the Noah-MP Land Surface Model (LSM) to simulate land surface processes. Separate water routing modules perform diffusive wave surface rout- ing and saturated subsurface flow routing on a 250m grid, and Muskingum-Cunge channel routing down National Hy- drography Dataset (NHDPlusV2) stream reaches. HydroDART can be run in multiple configura- tions, including a ‘channel-only’ configuration. A small case can be run in Docker. Figure 9: The result from a test experiment on a ‘channel- only’ configuration. This is the verification gage. HydroDART is currently configured to assimilate streamflow from 115 gages in the Matthew domain and update the channel model state as well as estimate parameters reg- ulating the efficiency of the link response to fluxes. There are 50225 links/reaches in the domain with 115 gages that are currently being assimilated. Research areas include lo- calization, inflation, and parameter estimation. NWM ensemble DART NWM metadata SetupExperiment 1. SetupExperiment pulls together all the pieces for an experiment and creates ControlDir and RunDir 2. ControlDir is small. Has control scripts Submit, filter, and hydro. RunDir can be quite large. 3. Submit launches (in any order) dependent job filter and dependent job array hydro. 4. Hydro is run after successful completion of filter as determined by the queueing system. 5. Submit can stage multiple cycles of dependent jobs or it can create a ‘cookie’ file and the dependent jobs can submit the next job until the cookie is gone. RunDir hydro hydro Submit Dependent Job Array Dependent Job filter filter operates in RunDir, hydro operates in each underlying directories – one per ensemble member. Submit, filter and hydro are not the actual script names. Member 1 Member N Submit filter hydro ControlDir Requires a large filesystem. Figure 10: Schematic of the workflow of WRF-Hydro and DART. Separate jobs perform the assimilation and a Job Ar- ray is used to run the ensemble of WRF-Hydro instances. Each job can be tuned for an optimal configuration. Since each job runs for a very short time, the jobs may backfill. The queueing system takes care of the job dependencies and automatically resubmits the next job in the series. TR32 April 2018

Upload: others

Post on 04-Sep-2019

1 views

Category:

Documents


0 download

TRANSCRIPT

****

h

tktk+2

y y

hh

Ensemble Data Assimilation with the Community Land Modeland the US National Water Model

T. Hoar, M. Gharamti, J. McCreight, A. RafieeiNasab, A. Fox,J. Anderson, N. Collins, J. Hendricks, K. Raeder

National Center for Atmospheric ResearchBoulder, Colorado, USA

[email protected]

1. Ensemble Data Assimilation with DART

The Data Assimilation Research Testbed (DART) is an opensource community software facility for ensemble data as-similation developed at the National Center for AtmosphericResearch (NCAR). DART has been free and publically avail-able for more than 10 years. Building an interface betweenDART and a new model does not require an adjoint andgenerally requires no modifications to the model code.DART works with dozens of models of varying complexity,including :

• weather models, e.g. WRF, COAMPS, COSMO, MPASAtmosphere

• climate models components, e.g. CAM, POP, CLM,WACCM, MPAS Ocean, ROMS, FESOM, CICE5, WRF-Hydro, NOAH

• atmospheric chemistry models, e.g. CAM-CHEM, WRF-CHEM

• low-order and simple research models

DART assimilates a wide variety of observations :

• land observations such as snow cover fraction, groundwater depth, tower fluxes, cosmic ray neutron intensity,and microwave brightness temperature observations

• temperature, winds, moisture from NCEP, MADIS, andSSEC

• total precipitable water, radar observations, radio occul-tation observations from GPS satellites

• some observations in the World Ocean Database ...

DART provides both state-of-the-art ensemble data assimi-lation capabilities and an interactive educational platform toresearchers and students.

www.image.ucar.edu/DAReS/DART has infor-mation about how to download DART, the DARTeducational materials, and how to contact us.

2. DART ‘Manhattan’ Release Highlights

Much of the development effort has been to support largermodel states using less memory and improve performance,particularly I/O.

F Ability to handle much larger model states. Distributingthe model state across all tasks means no single taskmust store the entire state at any time.

F One-sided MPI communication allows tasks to requestremote data items from other tasks without interrupt-ing their execution or arranging which data items will beneeded in advance.

F Computing the forward operators for all ensemble mem-bers at the same time leads to better vector code.

F Native NetCDF support eliminates conversion betweenNetCDF model files and DART. This also reduces thehigh-water mark for disk requirements.

F Ensemble data can be read and distributed across alltasks on a variable-by-variable basis, reducing maximummemory requirements.

F Diagnostic files are now written in parallel, resulting infaster I/O and lower memory requirements.

F Supports externally-computed observation operators.

F Supports per-observation-type localization radii.

3. Acknowledgments

The National Center for Atmospheric Research issponsored by the National Science Foundation.Computational resources were provided by NCAR’sComputational and Information Systems Laboratory.

4. Surface Fluxes with CLM4.5

Special Thanks to Andy Fox (University of Arizona)CLM4.5 was used at the Niwot Ridge LTER site to explorethe impact of latent heat, sensible heat, and net ecosystemproduction observations on relevant CLM variables.

• 9.7 km east of the Continental Divide, in a Subalpine Forest

• Spun up for 1500 years with site-specific information.

• 64 ensemble members using an ensemble reanalysis forcing

• Assimilating tower fluxes of latent heat (LE), sensible heat (H), andnet ecosystem production (NEP).

• unobserved variables: LEAFC, LIVEROOTC, LIVESTEMC, DEAD-STEMC, LITR1C, LITR2C, SOIL1C, SOIL2C, SOILLIQ

The model states are being updated at about 8PM local time.

FYI: We are assimilating flux observations …

Sens

ible

hea

tLa

tent

hea

tNe

t Eco

syst

emPr

oduc

tion

Figure 1: The observations of the eddy covariance fluxesare available every 30 minutes. The free run of the model isshown in blue, the assimilation result is shown in red.

Figure 2: The free run is shown in blue. The assimilationresult is shown in red.

Carbon CycleWe have added a number of observations to the CLM-DARTworkflow, such as leaf area index (LAI) and abovegroundbiomass - key components of the terrestrial carbon cycle.DART’s adaptive inflation algorithm allows the ensemble tobecome consistent with the observations and then main-tain a useful ensemble spread. This effort will be moving toCLM5.0 to take advantage of the expanded treatment of thecarbon and nitrogen cycles.

Forecast

Biom

ass

LAI

DA Run

Obs

erva

tions

Free Run

Free Run

Figure 3: This experiment highlights a study in a semi-aridregion in New Mexico. Monthly LAI and annual biomass ob-servations are able to provide a much better representationof the sesaonal cycle and accurate initial conditions for fore-casting. The impact persists for years for different C pools.

5. Soil Moisture with CLM5

The Community Land Model 5.0 was released in February2018 and is the latest in a series of land models devel-oped through the Community Earth System Model (CESM).CLM4.5 biogeophysics and biogeochemistry can be runfrom this release code. A new river model (MOSART)is also included. This release was a land-only release.The Functionally Assembled Terrestrial Ecosystem Simula-tor (FATES) is available as a research option.

Direct solar

Absorbed solar

Diff

use

sola

r

Dow

nwel

ling

long

wav

e

Reflected solar

Emitt

ed

long

wav

e

Sens

ible

hea

t flu

x

Late

nt h

eat

flux

ua0

Momentum fluxWind speed

Ground heat flux

Evaporation

Melt

Sublimation

Throughfall

InfiltrationSurface runoff

Evaporation

Transpiration

Precipitation

Heterotrop.respiration

Photosynthesis

Autotrophicrespiration

Litterfall

Nuptake

Vegetation C/N

SoilC/N N mineralization

Fire

Surface energy fluxes Hydrology Biogeochemical cycles

Aerosol deposition

Soil (sand, clay, organic)Sub-surface

runoff

Phenology

BVOCs

Water tableSaturated Zone

Soil

Dust

Saturatedfraction

N depN fix

DenitrificationN leaching

CH4

Root litter

N2OSCF

Surfacewater

Bedrock

Glacier

Lake

River Routing

Runoff

River discharge

Urban

Land Use Change

Wood harvest

Disturbance

Vegetation Dynamics

Growth

CompetitionWetlandCropsIrrigationFlooding

CLM5.0

Impermeable Bedrock

Figure 4: The representation of the processes in CLM5.0which was released this February.

Gridcell

GlacierLake

Landunit

Column

Patch

UrbanVegetated

Soil

Crop

PFT1 PFT2 PFT3 PFT4 …

Unirrig Irrig Unirrig Irrig

Crop1 Crop1 Crop2 Crop2 …

Roof

Sun Wall

ShadeWall

Pervious

Impervious

TBD

MD

HD

Elevation classes

G

L

CUIrr1

VPFT4

VPFT3

VPFT1

VPFT2

CIrr1

UT,H,MC

Irr2C

UIrr2

Figure 5: The hierarchy of a single CLM5.0 gridcell. Theonly location information is at the gridcell level. We areworking on an approach to more appropriately relate obser-vation locations to specific Landunits, Columns or Patches.

0 45 90 135 180

-30

0

30

60

0

2

4

6

8

10

12

14

16

18

kg/m

2

Figure 6: SMAP locations from 23 July 2015 were the ba-sis for a short proof-of-concept experiment. The soil mois-ture (liquid and ice) from CLM at the SMAP locations wereused in an observing system simulation experiment.

01-Jul 02-Jul 03-Jul 04-Jul 05-Jul 06-Jul 07-JulJul.02,2001 00:00:00 start

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

rmse

and b

ias (m

odel

- obs

erva

tion)

SOIL_MOISTURErmse pr=0.44353, po=0.038084 bias pr=-0.0012196, po=-0.00047593

rmsebias

data file: /Users/thoar/svn/DART/cesm_clm/models/clm/matlab/obs_diag_output.nc

29150

29200

29250

29300

29350

29400

29450

29500

# of o

bs : o

=pos

sible,

$=as

simila

ted

Figure 7: The proof-of-concept result for soil moisture inthe first layer of CLM. This is a 40 member ensemble.

6. U.S. National Water Model

The NCAR-supported community Weather Re-search and Forecasting Hydrologic model(WRF-Hydro) is a modeling system and frame-work for hydrologic modeling and model cou-pling. In 2016 a configuration of WRF-Hydrowas implemented as the National Water Model(NWM) for the continental United States.

Figure 8: Left: The test domain ‘Matthew’ for WRF-Hydroand DART. The domain is approximately 100,000 km2. Thetriangles are at stream gage locations. The LSM runs at ≈1 km resolution. Right: A depiction of the Stream Order.

WRF-Hydro is configured to use the Noah-MP Land SurfaceModel (LSM) to simulate land surface processes. Separatewater routing modules perform diffusive wave surface rout-ing and saturated subsurface flow routing on a 250m grid,and Muskingum-Cunge channel routing down National Hy-drography Dataset (NHDPlusV2) stream reaches.

HydroDART can be run in multiple configura-tions, including a ‘channel-only’ configuration. Asmall case can be run in Docker.

Figure 9: The result from a test experiment on a ‘channel-only’ configuration. This is the verification gage.

HydroDART is currently configured to assimilate streamflowfrom 115 gages in the Matthew domain and update thechannel model state as well as estimate parameters reg-ulating the efficiency of the link response to fluxes. Thereare 50225 links/reaches in the domain with 115 gages thatare currently being assimilated. Research areas include lo-calization, inflation, and parameter estimation.

NWMensemble

DARTNWMmetadata

SetupExperiment

1. SetupExperiment pulls together all the pieces for an experiment and creates ControlDir and RunDir

2. ControlDir is small. Has control scripts Submit, filter, and hydro. RunDir can be quite large.

3. Submit launches (in any order) dependent job filter and dependent job array hydro.

4. Hydro is run after successful completion of filter as determined by the queueing system.

5. Submit can stage multiple cycles of dependent jobs or it can create a ‘cookie’ file and the dependent jobs can submit the next job until the cookie is gone.

RunDir

hydrohydro

SubmitDependent Job Array

Dependent Job

filter

filter operates in RunDir, hydro operates in each underlying directories – one per ensemble member.

Submit, filter and hydro are not the actual script names.

Mem

ber 1

Mem

ber N

Submit

filter hydro

ControlDir

Requires a large filesystem.

Figure 10: Schematic of the workflow of WRF-Hydro andDART. Separate jobs perform the assimilation and a Job Ar-ray is used to run the ensemble of WRF-Hydro instances.Each job can be tuned for an optimal configuration. Sinceeach job runs for a very short time, the jobs may backfill.The queueing system takes care of the job dependenciesand automatically resubmits the next job in the series.

TR32 April 2018