t5 types of data and insights for hra quantification. · mind (except method- specific data...

23
Insights for future HRA data collection based on research using HRA data Katrina M. Groth University of Maryland

Upload: others

Post on 11-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

  • Insights for future HRA data collection based on research using HRA data

    Katrina M. GrothUniversity of Maryland

  • Reminder on topic

    My talk will center around some of my experiences from using HRA data, rather than discussing a specific data source. It will include a brief discussion of some data sources I've worked with (SACADA, HERA, and sources we gathered to help the IDHEAS expert elicitation) as well as the modeling pieces needed to use the data (e.g., PSFs, task breakdowns, causal models and factorization).

    2

  • Conclusions: Data-informed insights for HRA Data collection1. New data sources enabled by international collaborations

    offer great opportunity for HRA2. Models are essential for making data useful3. “Data” is not a synonym for “Numbers”4. Data quality > data quantity

  • Outline

    Data sources I’ve used & Previous work Latest work: Insights from data use

    4

  • Systems Risk and Reliability Analysis lab (SyRRA) Established 2017, Directed by Prof. Katrina Groth. Research into risk & reliability for complex systems

    Human + system + environment + complex phenomena

    5

    Diverse, multi-source dataHuman decision makers

    & engineering knowledgeEnhance safety,

    reliability, resilience of complex systems

    Causal & probabilistic models

    Machine learning

    Total Risk

    0

    0.001

    0.002

    0.003

    0.004

    0.005

    0.006

    1 2 3 4 5 6 7 8 9 10

    Chart2

    0.001

    0.0015

    0.00185

    0.002

    0.00175

    0.00355

    0.00305

    0.0049

    0.00425

    0.0053

    Total Risk

    Sheet1

    510

    520

    625

    824

    525

    365

    257

    882

    575

    792

    Sheet1

    SI-1

    SI-2

    Sheet2

    5100.00010.000050.001

    5200.00010.000050.0015

    6250.00010.000050.00185

    8240.00010.000050.002

    5250.00010.000050.00175

    3650.00010.000050.00355

    2570.00010.000050.00305

    8820.00010.000050.0049

    5750.00010.000050.00425

    7920.00010.000050.0053

    Sheet2

    Total Risk

    Sheet3

  • 2004-2009 2011 2013 2015 2017 2019 …

    Timeline of my HRA data work

    2004-2009, UMDPhD on Data informed HRA

    •2009 Dissertation on data-informed PSF modeling

    •2012 Data-informed PIF hierarchy •2012 Data-informed BN for PIF dependency

    2008, INLHERA data collection

    2011, SNLU.S. HRA empirical

    study

    2012-2013, SNL Leveraging

    Published Data to quantify IDHEAS

    HEPs

    2011-2014, SNLBayesian updating Bayesian Networks. Using SPAR-H + Halden data

    2017, SNLCapturing

    cognitive casual pathways in

    HRA

    2018, UMDSACADA data-use framework

    •Submitted: A hybrid approach to HRA using simulator data, casual models, and cognitive science

    2018-2019, UMD/UCLAPHOENIX + casual BNs

    and quantification approach

  • HRA-relevant data sources I’ve explored Comprehensive simulator data w/detailed context & event descriptions

    H2ERA (Halden HERA), SACADA, Briefly (intend to go deeper): OPERA & HuReX

    Existing HRA models SPAR-H, IDAC, THERP, ASEP, CREAM, CBDT

    Retrospective event data with detailed context HERA, HFIS, CORE-DATA, NUCLARR

    Retrospective event data without detailed context (or limited to 1-2 factors) CORE-DATA, GRS data (or THERP-like tasks)

    Single parameter studies & expert estimates (for a multitude of HRA quantities) Hundreds of published articles from a wide variety of high-consequence industries

    (+Significant work aggregating dozens into comprehensive supporting documentation for the IDHEAS expert elicitation study)

    Expert Survey for Zwirglmaier work; Engineering Data Compendium

    Cognitive Models Dozens of cognitive models (NRC cognitive basis study; IDAC source documents;

    Engineering Data Compendium;)

    7

  • Insights from 15 years of HRA data use – Motivation for my work No HRA data sources were designed with causal modeling in

    mind (except method-specific data collection, e.g. THERP). This makes modeling significantly harder.

    No single HRA data set collects data on all of the factors, causes, and consequences relevant to modeling human performance.

    90%+ of the work is in understanding the dataset and mapping dataset variables onto modeling variables. This has to be done for each dataset.

    Many good HRA data collection projects have been started & stopped over the years – taking that data with them.

    Thus, a “big picture” modeling paradigm is the only way to overcome the inherent limits of HRA data collection

  • Data-to-model process for “easy” reliability problems

    9

    Model Form(states & relationships)

    Estimating Model Parameters (Quantification)

    Final modelcapable of predicting reliabilityquantities of interest

    Variables & definitions(For data collection & modeling)

    Full scale data source 1

    Repeat/update parameters (of final model at reasonable intervals)

    Modernize at major intervals

  • Try to use the “easy” Data-to-model process is causing HRA to fail

    10

    Single data source (increasing in complexity over years, but few attempts to modernize)

    No updating; instead we create a new model

    50+ HRA methods exist, we’re not updating them, and we can’t use them together

    Largely regression based; every model uses different variables; lacks causality

  • Data-to-model process for complex problems (like HRA)

    11

    Model Form(states & relationships)

    Estimate Model Parameters (Quantification)

    Final modelcapable of predictingHRA quantities of interest

    Full scale data source 2

    Partial data source 3

    Modeling variables &definitions

    Partial data source 2

    Partial data source 1

    Full scale data source 1

    Modeling constructs (decomposition; causality)

    Map of data collectionvariables & definitions onto modelingVariables &definitions

    Repeat/update parameters of final model at reasonable intervals

    Repeat/update data source designs at major intervals

  • Previous work created important pieces of model- and data-informed approach to HRA

    Taxonomy of PIFs (Groth 2012) Application neutral, clearly defined, non-

    overlapping set of factors for modeling

    Bayesian Networks causal models (Groth et al 2009, 2013, 2014, 2017) To capture causal relationships &

    uncertainty

    Bayesian parameter updating algorithm (Groth et al 2014, 2017) To incorporate data into probability

    assignments

    Full hybrid BN + multi-data quantification approach: paper submitted to RESS in Nov 2018.

    12

  • New: Identified requirements for data-informed HRA models Using HRA data to improve HRA models requires

    new approaches Desirable characteristics of advanced HRA models

    1. Using underlying causal model rooted in strong technical basis (combining psychological research, operating experience, simulator data)

    2. Explicitly representing causal factors that affect performance (& are collected in data)

    3. Support qualitative & quantitative HRA4. Framework should be both data-informed and model-informed.5. Flexibility to accommodate changes as our databases mature6. Ability to fuse information from multiple sources of data & models7. Generate detailed insights to improve human performance (beyond

    quantifying)

    13Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.

    Result of 2018 UMD SACADA research – published in PSAM 2018

  • New: Key elements of the multi-data-informed HRA framework

    Framework for data-informed

    HRA

    Performance influencing

    factor taxonomy

    Human-machine team failure modes

    Bayesian Network causal

    modelsData and Bayesian parameter updating

    Human-machine task

    sequences

    14

    And model

    Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.

    Result of 2018 UMD SACADA research – published in PSAM 2018

  • New: generalized set of HRA quantification elements Core types of variables:

    Performance Influencing factors (PIFs)

    Crew failure mechanisms Crew failure modes (proximate

    causes) Macrocognitive functions

    Core set of probabilistic relationships:

    15

    PIF1

    FM1

    PC1

    MCF

    ⋯ PCNPC

    FM2 FM3 FMNFMFMNFM−1⋯

    PIF2 PIF3 PIFNPIFPIFNPIF−1⋯

    Result of 2018 UMD SACADA research –draft submitted to RESS in Nov 2018

  • New: Full Algorithm for “A hybrid approach to HRA using simulator data, causal models, and cognitive science”

    Block 1: Causal Factor Mapping (BN structure development) Create a causal map of the relationship between the PIFs, failure mechanisms, proximate causes of

    failure, and MCFs by way of Bayesian Network. (Groth 2012; Ekanem &Mosleh 2013; Zwirglmaier, Straub, Groth 2017)

    Simplify BN structure using node reduction (Zwirglmaier, Straub, Groth 2017) Block 2: Prior model quantification (BN parameterization)

    Map existing HRA method PIFs to PIF taxonomy (Groth 2012) Use existing HRA method to get priors of the probability of MCF error Pr (MCFerror|PIFs) Expert elicit priors for PIFs, Pr (PIFs) (Groth, Swiler, 2014)

    Block 3: Bayesian update model parameters Map simulation data source variables to PIFs Use simulation data source to update probability of MCF error Pr (MCFerror|PIFs) (Groth,

    Swiler, Smith 2014) (Optional) Perform Bayesian update of the PIF probabilities Pr(PIFs) (Groth, Swiler, Smith

    2014) Block 4: Use the BN for HRA activities

    16

    Result of 2018 UMD SACADA research –draft submitted to RESS in Nov 2018

  • New: Comprehensive causal maps for PHOENIX (w/ UCLA)

    Full causal structures developed by Groth + UCLA using cognitive literature, IDAC, PIF hierarchy, PHOENIX CFMs, Zwirglmaier & Groth 2017 method I Phase CFMs (6 core CFMs, 3 mechanism-

    like constructs, 20+ PIFs) D Phase CFMs ( 5 core CFMs; 2

    mechanism-like constructs, 20+ PIFs) A Phase CFMs (3 core CFMs, 2 mechanism-

    like constructs, 20+ PIFs) ~5 CFMs still need to be modeled. D phase &

    Communications related.

    Quantification is possible with future work (using algorithm in draft RESS paper) – needs customer

    17

  • New: Draft mapping of SACADA elements to PIFs & FMs to enable quantification SACADA can be used to quantify: Pr(crew failure mode|PIFs)

    See submitted RESS paper for full draft of the mapping

    Need other data sources for for Pr(PIFs) UMD is now exploring how to use SACADA for modeling

    dependency: Pr(HFE|HFE) and Pr(PIFs|PIFs)

    18

  • Gaps & opportunities: Next steps for UMD Project: Develop the theoretical & mathematical foundations to

    model “HRA dependency” and enable parameterization using multiple sources (starting with SACADA) Outcome 1: Terminology & definitions (both words and equations).

    To resolve significant conceptual problems around:– “HRA dependency”– “Human failure mode / mechanism / proximate cause”– “Probability of Error”& uncertainty

    » Overheard at PSAM “Probability of error is just number of failures out of number of opportunities. We just need to count things.”

    – Conditional vs. marginal vs. joint probability

    Outcome 2: Mathematical framework for modeling dependency (DBNs) Outcome 3: Mathematical framework for quantifying dependency using

    simulator data.

    19

    NRC grant will support 1 year of research toward this broader objective; additional out-year support is needed.

  • Gaps & opportunities: Long-term UMD interests. Seeking partners & support: your data or model + UMD

    algorithms Large projects envisioned

    Extended work on HRA dependency (See previous slide) Multi-source data fusion algorithms for HRA

    UMD algorithm to combine: SACADA + HuREX + EPRI + OSU + INL + literature + PIF studies +…

    “Prior model” full quantification of causal BN version of PHOENIX.

    20

  • Gaps, Opportunities, & Challenges for the HRA community 1. Strong need for consistent terminology & clear definitions of data fields

    across data sources – can we start a pathway toward consensus? Implementation gaps

    PIF terminology exists – needs to be implemented in data sources, either directly or via implementation of mapping

    Clear definitions for each data entry field are needed – differentiates between high value databases and “Garbage In/Out” databases

    Research gaps HRA still needs event/task decomposition terminology Need for assessment of types of data which exist & what part of the HRA modeling

    problem they address.

    2. We need to use all of the data: can we get data sharing agreements in place?

    3. We need to discuss modeling & data together – we have models to handle HRA complexity, avoid the urge to oversimplify “and just count” Simulation, simplification, Bayesian methods, causal models, Bayesian Networks

  • Conclusions: Data-informed insights for HRA Data collection1. New data sources enabled by international collaborations

    offer great opportunity for HRA.2. “Data” is not a synonym for “Numbers”

    1. The best data sources must capture context & dependency; HFEs are not i.i.d events.

    2. HRA also has other data: models, experts, literature;

    3. Models are essential for making data useful1. The data sources should be designed to support modeling as much as

    possible

    4. Data quality >> data quantity We have methods for handling sparse data; but no method exists for the

    “garbage in garbage out” problem Having a consistent HRA modeling framework >> getting the “right”

    HEP for one single, repeatable simulator event

  • References1. Groth, Katrina M., Smith, Reuel, & Moradi, Ramin. A hybrid approach to HRA using simulator data, causal models, and

    cognitive science. Submitted to RESS, November 2019. In Review.2. Groth, K. M (2018). A framework for using SACADA to enhance the qualitative and quantitative basis of HRA. Proceedings

    of the 14th Probabilistic Safety Assessment and Management Conference (PSAM 14), Los Angeles, CA.3. Zwirglmaier, K., Straub, D., & Groth, K. M. (2017) Capturing cognitive causal paths in human reliability analysis with

    Bayesian network models, Reliability Engineering & System Safety, 158, 117-129.4. Whaley, A.M. et al., “Cognitive Basis for Human Reliability Analysis,” US Nuclear Regulatory Commission, Washington

    DC, NUREG-2114, Jan. 2016.5. Boring, R., Mandelli, D., Joe, J., Smith, C., & Groth, K. (2015). A Research Roadmap for Computation-Based Human

    Reliability Analysis. Idaho National Laboratory, INL/EXT-15-36051.6. Chang, Y.J., Bley, D., Criscione L., Kirwan, B., Mosleh, A., Madary, T., Nowell, R., Richards, R., Roth, E. M., Sieben, S., &

    Zoulis, A. (2014). The SACADA database for human reliability and human performance, Reliability Engineering & System Safety, 125, 117-133.

    7. Groth, Katrina M., Smith, Curtis L., Swiler, Laura P., (2014). A Bayesian method for using simulator data to enhance human error probabilities assigned by existing HRA methods. Reliability Engineering & System Safety, 128, 32-40.

    8. Groth & Swiler (2013). Bridging the gap between HRA research and HRA practice: A Bayesian Network version of SPAR-H. Reliability Engineering and System Safety, 115, 33-42.

    9. Ekanem, N. J., & Mosleh, A. (2013). Human failure event dependency modeling and quantification: A Bayesian network approach, Proceedings of the European Society for Reliability Annual Meeting (ESREL 2013).

    10. Groth, Katrina M., & Mosleh, Ali. (2012). Deriving causal Bayesian networks from human reliability analysis data: A methodology and example model. Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, 226, 361-379.

    11. Mosleh, A., Shen, S.-H., Kelly, D. L., Oxstrand, J. H., & Groth, K. (2012). A Model-Based Human Reliability Analysis Methodology. Proceedings of the International Conference on Probabilistic Safety Assessment and Management (PSAM 11).

    12. Groth & Mosleh (2012). A data-informed PIF hierarchy for model-based Human Reliability Analysis. Reliability Engineering and System Safety, 108, 154-174.

    23

    Insights for future HRA data collection based on research using HRA dataReminder on topicConclusions: Data-informed insights for HRA Data collectionOutlineSystems Risk and Reliability Analysis lab (SyRRA) Timeline of my HRA data workHRA-relevant data sources I’ve exploredInsights from 15 years of HRA data use – Motivation for my workData-to-model process for “easy” reliability problemsTry to use the “easy” Data-to-model process is causing HRA to failData-to-model process for complex problems (like HRA)Previous work created important pieces of model- and data-informed approach to HRANew: Identified requirements for data-informed HRA modelsNew: Key elements of the multi-data-informed HRA frameworkNew: generalized set of HRA quantification elementsNew: Full Algorithm for “A hybrid approach to HRA using simulator data, causal models, and cognitive science”New: Comprehensive causal maps for PHOENIX (w/ UCLA)New: Draft mapping of SACADA elements to PIFs & FMs to enable quantificationGaps & opportunities: Next steps for UMDGaps & opportunities: Long-term UMD interests.Gaps, Opportunities, & Challenges for the HRA communityConclusions: Data-informed insights for HRA Data collectionReferences