statistical properties of the cumulative frequency diagram (cfd)

32
Statistical Properties of the Cumulative Frequency Diagram (CFD). ERF Conference, Nov. 2007 Elgin S. Perry, Statistics Consultant Paul T. Jacobson, Langhei Ecology, LLC Session: SCI-060, Abstract ID:

Upload: mateja

Post on 11-Jan-2016

38 views

Category:

Documents


2 download

DESCRIPTION

Statistical Properties of the Cumulative Frequency Diagram (CFD). ERF Conference, Nov. 2007 Elgin S. Perry, Statistics Consultant Paul T. Jacobson, Langhei Ecology, LLC Session: SCI-060, Abstract ID: 2419. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Statistical Properties of the Cumulative Frequency

Diagram (CFD).

ERF Conference, Nov. 2007Elgin S. Perry, Statistics Consultant

Paul T. Jacobson, Langhei Ecology, LLCSession: SCI-060, Abstract ID: 2419

Page 2: Statistical Properties of the Cumulative Frequency Diagram (CFD)

The Cumulative Frequency Diagram (CFD) is a novel tool for assessing the Pass/Fail of water quality criteria over space and time.

Page 3: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Poster Objectives

• Motivate CFD• Define CFD

• Give simple numerical example

• Review CFD properties based on simple model of water quality

• Review unresolved statistical issues concerning CFD

Page 4: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Motivation for CFD Assessment tool

• Differentiate temporal and spatial variance in criteria assessment

• Allow spatially broad but ephemeral events.

• Allow spatially small but persistent events.

Page 5: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Steps to Compute a CFD

1. Set Compliance criteria, C, (C may be a collection of criteria that vary in time and space)

2. Collect k spatial samples over segment for m dates in assessment period.

3. For each date interpolate (e.g. Kriging) the k samples to n cells over segment.

Page 6: Statistical Properties of the Cumulative Frequency Diagram (CFD)

4. Compute Area-based exceedences For each date estimate proportion of segment

that does not comply with C.

)cI(xn

1ps ijij

n1ij

Page 7: Statistical Properties of the Cumulative Frequency Diagram (CFD)

5. Score Area-base exceedences across dates

• Rank from high to low the Area-based exceedences: psj.

• Score time dimension of each point proportional rank.

• Plot step ptj versus psj.

)rank(ps 1m

1pt j j

Page 8: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Simple CFD example using 5 stations and 3 dates.

Page 9: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Step 1. Collect data at known locations.

date2

1 1

3

1 1

date3

4 2

2

1 1

date1

3 4 5 3

4 4 5 2

3 3 4 1

2 3 3 1

Step 2. Interpolate data to grid cells.

1 2 3 1

2 2 3 2

1 3 2 1

1 1 1 1

4 3 2 2

3 3 2 1

2 2 1 1

1 1 1 1

3 1

5

2 1

Step 3. Determine status of each cell.

1 1 1 1

1 1 1 0

1 1 1 0

0 1 1 0

0 0 1 0

0 0 1 0

0 1 0 0

0 0 0 0

1 1 0 0

1 1 0 0

0 0 0 0

0 0 0 0

Page 10: Statistical Properties of the Cumulative Frequency Diagram (CFD)
Page 11: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Step 4: Percent compliance by date.

psj

date 1 75.00%

date 2 18.75%

date 3 25.00%

Step 5. Rank the percent of space values assign percent of time = (100*R/(M+1))

psj ptj

date 1 75.00% 25.00

date 3 25.00% 50.00

date 2 18.75% 75.00

Page 12: Statistical Properties of the Cumulative Frequency Diagram (CFD)

curve CFD

curve reference

Fraction of Space

Fra

ctio

n o

f T

ime

Simple Example Plotted

Page 13: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 1 Simple example of CFD showing excessive violations of water quality relative to the reference curve.

Page 14: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Establishing Reference Curves

• Use simple model to estimate curve based on 10% expected violations (theoretical curve).

• Empirically estimate curve based on water quality in a reference area (empirical curve).

Page 15: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Reference Curves

theoretical curvebased on 10% noncompliance

empirical curve for DOBased on benthic IBI > 3

Page 16: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig 2. Examples of theoretical reference curve and empirical reference curve.

Page 17: Statistical Properties of the Cumulative Frequency Diagram (CFD)

How is temporal and spatial variability of water quality reflected in shape of the CFD curve?

Using a simple model and probability theory we can compute the theoretical CFD.

Page 18: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Simple Water Quality Model

Xij = u + ai + bij

i = 1, 2, . . . M and j = 1, 2, . . . N.

a is temporal term, var =b is spatial term, var = b

a

Page 19: Statistical Properties of the Cumulative Frequency Diagram (CFD)

CFD as Mean Decreases Below C = 5.

Fraction of Space

Fra

ctio

n o

f T

ime

5

4

32

1

1b1a5C

Page 20: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 3. As the mean of water quality decreases from the criterion, the CFD retreats toward the origin of the Space-Time plane.

Page 21: Statistical Properties of the Cumulative Frequency Diagram (CFD)

CFD as Temporal Variance Increases.

3

5C

1a2a 3a 4a 5aF

ract

ion

of

Tim

e

Fraction of Space

1b

Page 22: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 4. As the temporal variance of water quality increases, the CFD deflects toward broad areas of space in noncompliance for a short time.

Page 23: Statistical Properties of the Cumulative Frequency Diagram (CFD)

CFD as Spatial Variance Increases.

1b

2b

3b

4b

5b

Fraction of Space

Fra

ctio

n o

f T

ime

1a3

5C

Page 24: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 5. As the spatial variance of water quality increases, the CFD deflects toward increasing areas of space persistently in noncompliance.

Page 25: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Statistical Issues

• Current simulation exercises show that the shape of the CFD is influenced by level of sampling. As a result the estimated CFD is biased for the true CFD.

• A confidence envelope for the CFD is analytically complex. This limits statistical inference.

Page 26: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fraction of space

Fra

ctio

n o

f ti

me

Page 27: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 6. Simulation shows that as sample size decreases, the estimated CFD deflects away from the true CFD. This is the result of increasing variability with decreasing sample size.

Page 28: Statistical Properties of the Cumulative Frequency Diagram (CFD)
Page 29: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Fig. 7. Conditional simulation based on Kriging shows promise of correcting the sample size bias and providing a statistical inference too. Confidence bounds were computed based on quantiles of fraction of space computed on conditionally simulated surface estimates using variogram estimates from data.

Page 30: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Conclusion

• The method show promise as a method for addressing temporally and spatially detailed criteria.

• Sampling Bias and Statistical Inference issues need to be resolved.

Page 32: Statistical Properties of the Cumulative Frequency Diagram (CFD)

Futher Details:

U.S.E.P.A. 2007.

Ambient Water Quality Criteria for Dissolved Oxygen, Water Clarity and Chlorophyll a for the Chesapeake Bay and its Tidal Tributaries. 2007 Addendum. EPA 903-R-07-003. Region III Chesapeake Bay Program Office, Annapolis, Md.

Secor, David, et al. (2006)

Review of the Chesapeake Bay Program CFD and Interpolator. Chesapeake Bay Program’s Scientific and Technical Advisory Committee, Chesapeake Research Consortium, Edgewater, Md. http://www.chesapeake.org/stac/stacpubs.html#RR