t5 types of data and insights for hra quantification. · mind (except method- specific data...

Insights for future HRA data collection based on research using HRA data

Katrina M. GrothUniversity of Maryland

Reminder on topic

My talk will center around some of my experiences from using HRA data, rather than discussing a specific data source. It will include a brief discussion of some data sources I've worked with (SACADA, HERA, and sources we gathered to help the IDHEAS expert elicitation) as well as the modeling pieces needed to use the data (e.g., PSFs, task breakdowns, causal models and factorization).

2

Conclusions: Data-informed insights for HRA Data collection1. New data sources enabled by international collaborations

offer great opportunity for HRA2. Models are essential for making data useful3. “Data” is not a synonym for “Numbers”4. Data quality > data quantity

Outline

Data sources I’ve used & Previous work Latest work: Insights from data use

4

Systems Risk and Reliability Analysis lab (SyRRA) Established 2017, Directed by Prof. Katrina Groth. Research into risk & reliability for complex systems

Human + system + environment + complex phenomena

5

Diverse, multi-source dataHuman decision makers

& engineering knowledgeEnhance safety,

reliability, resilience of complex systems

Causal & probabilistic models

Machine learning

Total Risk

0

0.001

0.002

0.003

0.004

0.005

0.006

1 2 3 4 5 6 7 8 9 10

Chart2

0.001

0.0015

0.00185

0.002

0.00175

0.00355

0.00305

0.0049

0.00425

0.0053

Total Risk

Sheet1

510

520

625

824

525

365

257

882

575

792

Sheet1

SI-1

SI-2

Sheet2

5100.00010.000050.001

5200.00010.000050.0015

6250.00010.000050.00185

8240.00010.000050.002

5250.00010.000050.00175

3650.00010.000050.00355

2570.00010.000050.00305

8820.00010.000050.0049

5750.00010.000050.00425

7920.00010.000050.0053

Sheet2

Total Risk

Sheet3

2004-2009 2011 2013 2015 2017 2019 …

Timeline of my HRA data work

2004-2009, UMDPhD on Data informed HRA

•2009 Dissertation on data-informed PSF modeling

•2012 Data-informed PIF hierarchy •2012 Data-informed BN for PIF dependency

2008, INLHERA data collection

2011, SNLU.S. HRA empirical

study

2012-2013, SNL Leveraging

Published Data to quantify IDHEAS

HEPs

2011-2014, SNLBayesian updating Bayesian Networks. Using SPAR-H + Halden data

2017, SNLCapturing

cognitive casual pathways in

HRA

2018, UMDSACADA data-use framework

•Submitted: A hybrid approach to HRA using simulator data, casual models, and cognitive science

2018-2019, UMD/UCLAPHOENIX + casual BNs

and quantification approach

HRA-relevant data sources I’ve explored Comprehensive simulator data w/detailed context & event descriptions

H2ERA (Halden HERA), SACADA, Briefly (intend to go deeper): OPERA & HuReX

Existing HRA models SPAR-H, IDAC, THERP, ASEP, CREAM, CBDT

Retrospective event data with detailed context HERA, HFIS, CORE-DATA, NUCLARR

Retrospective event data without detailed context (or limited to 1-2 factors) CORE-DATA, GRS data (or THERP-like tasks)

Single parameter studies & expert estimates (for a multitude of HRA quantities) Hundreds of published articles from a wide variety of high-consequence industries

(+Significant work aggregating dozens into comprehensive supporting documentation for the IDHEAS expert elicitation study)

Expert Survey for Zwirglmaier work; Engineering Data Compendium

Cognitive Models Dozens of cognitive models (NRC cognitive basis study; IDAC source documents;

Engineering Data Compendium;)

7

Insights from 15 years of HRA data use – Motivation for my work No HRA data sources were designed with causal modeling in

mind (except method-specific data collection, e.g. THERP). This makes modeling significantly harder.

No single HRA data set collects data on all of the factors, causes, and consequences relevant to modeling human performance.

90%+ of the work is in understanding the dataset and mapping dataset variables onto modeling variables. This has to be done for each dataset.

Many good HRA data collection projects have been started & stopped over the years – taking that data with them.

Thus, a “big picture” modeling paradigm is the only way to overcome the inherent limits of HRA data collection

Data-to-model process for “easy” reliability problems

9

Model Form(states & relationships)

Estimating Model Parameters (Quantification)

Final modelcapable of predicting reliabilityquantities of interest

Variables & definitions(For data collection & modeling)

Full scale data source 1

Repeat/update parameters (of final model at reasonable intervals)

Modernize at major intervals

Try to use the “easy” Data-to-model process is causing HRA to fail

10

Single data source (increasing in complexity over years, but few attempts to modernize)

No updating; instead we create a new model

50+ HRA methods exist, we’re not updating them, and we can’t use them together

Largely regression based; every model uses different variables; lacks causality

Data-to-model process for complex problems (like HRA)

11

Model Form(states & relationships)

Estimate Model Parameters (Quantification)

Final modelcapable of predictingHRA quantities of interest


Partial data source 3

Modeling variables &definitions




Modeling constructs (decomposition; causality)

Map of data collectionvariables & definitions onto modelingVariables &definitions

Repeat/update parameters of final model at reasonable intervals

Repeat/update data source designs at major intervals

Previous work created important pieces of model- and data-informed approach to HRA

Taxonomy of PIFs (Groth 2012) Application neutral, clearly defined, non-

overlapping set of factors for modeling

Bayesian Networks causal models (Groth et al 2009, 2013, 2014, 2017) To capture causal relationships &

uncertainty

Bayesian parameter updating algorithm (Groth et al 2014, 2017) To incorporate data into probability

assignments

Full hybrid BN + multi-data quantification approach: paper submitted to RESS in Nov 2018.

12

New: Identified requirements for data-informed HRA models Using HRA data to improve HRA models requires

new approaches Desirable characteristics of advanced HRA models

1. Using underlying causal model rooted in strong technical basis (combining psychological research, operating experience, simulator data)

2. Explicitly representing causal factors that affect performance (& are collected in data)

3. Support qualitative & quantitative HRA4. Framework should be both data-informed and model-informed.5. Flexibility to accommodate changes as our databases mature6. Ability to fuse information from multiple sources of data & models7. Generate detailed insights to improve human performance (beyond

quantifying)

13Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.

Result of 2018 UMD SACADA research – published in PSAM 2018

New: Key elements of the multi-data-informed HRA framework

Framework for data-informed

HRA

Performance influencing

factor taxonomy

Human-machine team failure modes

Bayesian Network causal

modelsData and Bayesian parameter updating

Human-machine task

sequences

14

And model

Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.

Result of 2018 UMD SACADA research – published in PSAM 2018

New: generalized set of HRA quantification elements Core types of variables:

Performance Influencing factors (PIFs)

Crew failure mechanisms Crew failure modes (proximate

causes) Macrocognitive functions

Core set of probabilistic relationships:

15

PIF1

FM1

PC1

MCF

⋯ PCNPC

FM2 FM3 FMNFMFMNFM−1⋯

PIF2 PIF3 PIFNPIFPIFNPIF−1⋯

Result of 2018 UMD SACADA research –draft submitted to RESS in Nov 2018

New: Full Algorithm for “A hybrid approach to HRA using simulator data, causal models, and cognitive science”

Block 1: Causal Factor Mapping (BN structure development) Create a causal map of the relationship between the PIFs, failure mechanisms, proximate causes of

failure, and MCFs by way of Bayesian Network. (Groth 2012; Ekanem &Mosleh 2013; Zwirglmaier, Straub, Groth 2017)

Simplify BN structure using node reduction (Zwirglmaier, Straub, Groth 2017) Block 2: Prior model quantification (BN parameterization)

Map existing HRA method PIFs to PIF taxonomy (Groth 2012) Use existing HRA method to get priors of the probability of MCF error Pr (MCFerror|PIFs) Expert elicit priors for PIFs, Pr (PIFs) (Groth, Swiler, 2014)

Block 3: Bayesian update model parameters Map simulation data source variables to PIFs Use simulation data source to update probability of MCF error Pr (MCFerror|PIFs) (Groth,

Swiler, Smith 2014) (Optional) Perform Bayesian update of the PIF probabilities Pr(PIFs) (Groth, Swiler, Smith

2014) Block 4: Use the BN for HRA activities

16

Result of 2018 UMD SACADA research –draft submitted to RESS in Nov 2018

New: Comprehensive causal maps for PHOENIX (w/ UCLA)

Full causal structures developed by Groth + UCLA using cognitive literature, IDAC, PIF hierarchy, PHOENIX CFMs, Zwirglmaier & Groth 2017 method I Phase CFMs (6 core CFMs, 3 mechanism-

like constructs, 20+ PIFs) D Phase CFMs ( 5 core CFMs; 2

mechanism-like constructs, 20+ PIFs) A Phase CFMs (3 core CFMs, 2 mechanism-

like constructs, 20+ PIFs) ~5 CFMs still need to be modeled. D phase &

Communications related.

Quantification is possible with future work (using algorithm in draft RESS paper) – needs customer

17

New: Draft mapping of SACADA elements to PIFs & FMs to enable quantification SACADA can be used to quantify: Pr(crew failure mode|PIFs)

See submitted RESS paper for full draft of the mapping

Need other data sources for for Pr(PIFs) UMD is now exploring how to use SACADA for modeling

dependency: Pr(HFE|HFE) and Pr(PIFs|PIFs)

18

Gaps & opportunities: Next steps for UMD Project: Develop the theoretical & mathematical foundations to

model “HRA dependency” and enable parameterization using multiple sources (starting with SACADA) Outcome 1: Terminology & definitions (both words and equations).

To resolve significant conceptual problems around:– “HRA dependency”– “Human failure mode / mechanism / proximate cause”– “Probability of Error”& uncertainty

» Overheard at PSAM “Probability of error is just number of failures out of number of opportunities. We just need to count things.”

– Conditional vs. marginal vs. joint probability

Outcome 2: Mathematical framework for modeling dependency (DBNs) Outcome 3: Mathematical framework for quantifying dependency using

simulator data.

19

NRC grant will support 1 year of research toward this broader objective; additional out-year support is needed.

Gaps & opportunities: Long-term UMD interests. Seeking partners & support: your data or model + UMD

algorithms Large projects envisioned

Extended work on HRA dependency (See previous slide) Multi-source data fusion algorithms for HRA

UMD algorithm to combine: SACADA + HuREX + EPRI + OSU + INL + literature + PIF studies +…

“Prior model” full quantification of causal BN version of PHOENIX.

20

Gaps, Opportunities, & Challenges for the HRA community 1. Strong need for consistent terminology & clear definitions of data fields

across data sources – can we start a pathway toward consensus? Implementation gaps

PIF terminology exists – needs to be implemented in data sources, either directly or via implementation of mapping

Clear definitions for each data entry field are needed – differentiates between high value databases and “Garbage In/Out” databases

Research gaps HRA still needs event/task decomposition terminology Need for assessment of types of data which exist & what part of the HRA modeling

problem they address.

2. We need to use all of the data: can we get data sharing agreements in place?

3. We need to discuss modeling & data together – we have models to handle HRA complexity, avoid the urge to oversimplify “and just count” Simulation, simplification, Bayesian methods, causal models, Bayesian Networks

Conclusions: Data-informed insights for HRA Data collection1. New data sources enabled by international collaborations

offer great opportunity for HRA.2. “Data” is not a synonym for “Numbers”

1. The best data sources must capture context & dependency; HFEs are not i.i.d events.

2. HRA also has other data: models, experts, literature;

3. Models are essential for making data useful1. The data sources should be designed to support modeling as much as

possible

4. Data quality >> data quantity We have methods for handling sparse data; but no method exists for the

“garbage in garbage out” problem Having a consistent HRA modeling framework >> getting the “right”

HEP for one single, repeatable simulator event

References1. Groth, Katrina M., Smith, Reuel, & Moradi, Ramin. A hybrid approach to HRA using simulator data, causal models, and

cognitive science. Submitted to RESS, November 2019. In Review.2. Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings

of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.3. Zwirglmaier, K., Straub, D., & Groth, K. M. (2017) Capturing cognitive causal paths in human reliability analysis with

Bayesian network models, Reliability Engineering & System Safety, 158, 117-129.4. Whaley, A.M. et al., “Cognitive Basis for Human Reliability Analysis,” US Nuclear Regulatory Commission, Washington

DC, NUREG-2114, Jan. 2016.5. Boring, R., Mandelli, D., Joe, J., Smith, C., & Groth, K. (2015). A Research Roadmap for Computation-Based Human

Reliability Analysis. Idaho National Laboratory, INL/EXT-15-36051.6. Chang, Y.J., Bley, D., Criscione L., Kirwan, B., Mosleh, A., Madary, T., Nowell, R., Richards, R., Roth, E. M., Sieben, S., &

Zoulis, A. (2014). The SACADA database for human reliability and human performance, Reliability Engineering & System Safety, 125, 117-133.

7. Groth, Katrina M., Smith, Curtis L., Swiler, Laura P., (2014). A Bayesian method for using simulator data to enhance human error probabilities assigned by existing HRA methods. Reliability Engineering & System Safety, 128, 32-40.

8. Groth & Swiler (2013). Bridging the gap between HRA research and HRA practice: A Bayesian Network version of SPAR-H. Reliability Engineering and System Safety, 115, 33-42.

9. Ekanem, N. J., & Mosleh, A. (2013). Human failure event dependency modeling and quantification: A Bayesian network approach, Proceedings of the European Society for Reliability Annual Meeting (ESREL 2013).

10. Groth, Katrina M., & Mosleh, Ali. (2012). Deriving causal Bayesian networks from human reliability analysis data: A methodology and example model. Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, 226, 361-379.

11. Mosleh, A., Shen, S.-H., Kelly, D. L., Oxstrand, J. H., & Groth, K. (2012). A Model-Based Human Reliability Analysis Methodology. Proceedings of the International Conference on Probabilistic Safety Assessment and Management (PSAM 11).

12. Groth & Mosleh (2012). A data-informed PIF hierarchy for model-based Human Reliability Analysis. Reliability Engineering and System Safety, 108, 154-174.

23

Insights for future HRA data collection based on research using HRA dataReminder on topicConclusions: Data-informed insights for HRA Data collectionOutlineSystems Risk and Reliability Analysis lab (SyRRA) Timeline of my HRA data workHRA-relevant data sources I’ve exploredInsights from 15 years of HRA data use – Motivation for my workData-to-model process for “easy” reliability problemsTry to use the “easy” Data-to-model process is causing HRA to failData-to-model process for complex problems (like HRA)Previous work created important pieces of model- and data-informed approach to HRANew: Identified requirements for data-informed HRA modelsNew: Key elements of the multi-data-informed HRA frameworkNew: generalized set of HRA quantification elementsNew: Full Algorithm for “A hybrid approach to HRA using simulator data, causal models, and cognitive science”New: Comprehensive causal maps for PHOENIX (w/ UCLA)New: Draft mapping of SACADA elements to PIFs & FMs to enable quantificationGaps & opportunities: Next steps for UMDGaps & opportunities: Long-term UMD interests.Gaps, Opportunities, & Challenges for the HRA communityConclusions: Data-informed insights for HRA Data collectionReferences

t5 types of data and insights for hra quantification. · mind (except method- specific data...

Documents