system safety: a systematic processes

47
System Safety: System Safety: A systematic A systematic processes processes 1.H azard Identification 2.Risk A ssessm ent 3.A nalyze R isk C ontrol M easures 4.Risk C ontrols 5.Implem ent R isk C ontrols 6.Follow Through & Review 1.H azard Identification 2.Risk A ssessm ent 3.A nalyze R isk C ontrol M easures 4.Risk C ontrols 5.Implem ent R isk C ontrols 6.Follow Through & Review

Upload: matia

Post on 12-Feb-2016

27 views

Category:

Documents


0 download

DESCRIPTION

System Safety: A systematic processes. 1. Identify the Hazards. 6. Supervise and Review. 5. Implement Risk Controls. 3. Analyze Risk Control Measures. 4. Make Control Decisions. 2. Assess the Risks. Risk Assessment An evaluation of threats in terms of severity and probability. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: System Safety:  A systematic processes

System Safety: System Safety: A systematic processesA systematic processes

1. HazardIdentification

2. RiskAssessment

3. Analyze Risk Control

Measures

4. Risk Controls

5. ImplementRisk Controls

6. Follow Through &

Review

1. HazardIdentification

2. RiskAssessment

3. Analyze Risk Control

Measures

4. Risk Controls

5. ImplementRisk Controls

6. Follow Through &

Review

Page 2: System Safety:  A systematic processes

Risk AssessmentRisk Assessment An evaluation of threats in terms of severity

and probability

1. Identifythe Hazards

4. MakeControl

Decisions

3. Analyze Risk Control

Measures

6. Superviseand Review

2. Assessthe Risks5. Implement

Risk Controls

Page 3: System Safety:  A systematic processes

MISSION FOCUSMISSION FOCUS((HAZARD VERSUS RISK)HAZARD VERSUS RISK)

HAZARDID & Analysis

Identifying and analyzing an existing or potential condition that can impair mission accomplishment (No discussion of mission significance)

RISKAssessment & Mgmt

A hazard for which we haveestimated the severity,probability, and scope with which it can impact our mission and accepted it

Page 4: System Safety:  A systematic processes

Hazard Identification and Analysis Hazard Identification and Analysis during the Life Cycle of a systemduring the Life Cycle of a system

Concept Definition Development Production Deployment Termination

Page 5: System Safety:  A systematic processes

Threat assessment processThreat assessment process

ID HazardousCondition

Complete Risk

Assessment

Q/QAssess Probability

Q/QAssess Severity

Page 6: System Safety:  A systematic processes

THE RISK ASSESSMENT MATRIXTHE RISK ASSESSMENT MATRIX

Probability Frequent Likely Occasional Seldom Unlikely

I

II

III

IV

Catastrophic

Critical

Moderate

Negligible

A B C D ESEVERITY

HighMedium

High

Risk Levels

ExtremelyHigh

Low

Page 7: System Safety:  A systematic processes

A thorough risk assessment process might help you better understand a hazard you have been

exposed to many times before without incident*

* No beavers were assaulted in production of this slide

Page 8: System Safety:  A systematic processes

Hazard SeverityHazard Severity

• What impact will this threat have on people?

• What impact on environment, equipment or facilities?

• What impact on mission?

Page 9: System Safety:  A systematic processes

Severity Categories

• A key factor in establishing a common understanding of a safety programs goal

• MIL-STD 882 uses four categories– Cat 1: Catastrophic– Cat 2: Critical– Cat 3: Marginal– Cat 4: Negligible

Page 10: System Safety:  A systematic processes

Severity Severity QualifiedQualified

CATASTROPHIC - Complete mission failure, death, or loss of system

CRITICAL - Major mission degradation, severe injury, occupational illness, or major system damage

MODERATE - Minor mission degradation, injury, minor occupational illness, or minor system damage

NEGLIGIBLE - Less than minor mission degradation, injury, occupational illness or minor system damage

Page 11: System Safety:  A systematic processes

Severity Severity QuantifiedQuantifiedCATASTROPHIC - Complete mission failure, death, or loss of system and/or costs exceeding $1B

CRITICAL - Major mission degradation, severe injury, occupational illness, or major system damage and/ or costs exceeding $1M

MODERATE - Minor mission degradation, injury, minor occupational illness, or minor system damage and/or costs exceeding $100,000

NEGLIGIBLE - Less than minor mission degradation, injury, occupational illness or minor system damage and/or costs exceeding $10,000

Page 12: System Safety:  A systematic processes

ProbabilityProbabilityExpressed in terms of time, occurrence, proximity, Expressed in terms of time, occurrence, proximity,

etcetc

• Use data to substantiate your assessment• Use descriptive or quantitative terms• Use the cumulative probability of all factors• Examine experientially derived or anecdotal

information from operators• Acknowledge uncertainty – There are no

guarantees

Page 13: System Safety:  A systematic processes

THE RISK ASSESSMENT MATRIXTHE RISK ASSESSMENT MATRIX

Probability Frequent Likely Occasional Seldom Unlikely

I

II

III

IV

Catastrophic

Critical

Moderate

Negligible

A B C D ESEVERITY

HighMedium

High

Risk Levels

ExtremelyHigh

Low

Page 14: System Safety:  A systematic processes

FREQUENT Individual piece of equipment - Occurs often in the life of the system Individual - Occurs often in career Fleet or inventory - Continuously experienced All Personnel exposed - continuously experiencedLIKELY Individual piece of equipment - Occurs several times in the life of the system Individual - Occurs several times in a career Fleet or Inventory - Occurs often All Personnel exposed - Occurs oftenOCCASIONAL Individual piece of equipment - Will occur in the life of the system Individual - Will occur in a career Fleet or Inventory - Occurs several times in the life of the system All Personnel exposed - Occurs sporadically

Qualified Probability CategoriesQualified Probability Categories

Page 15: System Safety:  A systematic processes

SELDOM

Individual piece of equipment - Could occur in the life of the system Individual person - Could occur in a career Fleet or Inventory - Can be expected to occur in the life of the system All Personnel exposed - Seldom occurs

UNLIKELY

Individual piece of equipment - You assume it will not occur in the system lifecycle Individual person - So unlikely you assume it will not occur in a career Fleet or Inventory - Unlikely but could occur in the life of the system All Personnel exposed - Occurs very rarely

Qualified Probability (cont)Qualified Probability (cont)

Page 16: System Safety:  A systematic processes

Probabilities QuantifiedProbabilities Quantified(In terms of failure or exposure rates)(In terms of failure or exposure rates)

• Unlikely: 1 failure in 1,000,000,000 events instead of assuming it will not occur

• Seldom: 1 failure in 500 million exposures instead of it could occur

• Occasional: 1 failure in 1 million exposures instead of it will occur

• Likely: 1 failure in 500,000 exposures instead of it occurs several times

• Frequent:1 failure in 100,000 events instead of it occurs often

Page 17: System Safety:  A systematic processes

Qualitative AssessmentQualitative AssessmentAC 25.1309-1AAC 25.1309-1A

• Design Appraisal• Installation Appraisal• Failure Modes and Effects Analysis• Fault Tree Analysis• Probability Assessment

Page 18: System Safety:  A systematic processes

Quantitative AssessmentQuantitative AssessmentAC 25.1309-1AAC 25.1309-1A

• Probability Analysis (PRA)• Quantitative Probability Terms (QRA)

Page 19: System Safety:  A systematic processes

FAA Fail-Safe Design ConceptAC 25.1309-1A

• The fail-safe design concept considers the effects of failures and combinations of failures in defining a safe design

• The following basic objectives apply:– In any system or subsystem, the failure of any single element,

component, or connection during any one flight should be assumed. Such single failure should not prevent continued safe flight and landing

– Subsequent failures during the same flight, whether detected or latent, should also be assumed unless their joint probability with the first failure is demonstrated to be extremely improbable

Page 20: System Safety:  A systematic processes

Fail-Safe Design ConceptFail-Safe Design Concept• Fail-Safe designs use the following design principals

– A combination of two or more are usually needed to provide a fail-safe design– Redundant or backup systems– Isolation of systems, components and elements– Demonstrated reliability / Periodic inspection– Failure warning and indication– Flight crew procedures– Designed failure effect limits– Designed failure path– Increased margins or factors of safety– Error-tolerant design

Page 21: System Safety:  A systematic processes

Operational and Maintenance ConsiderationsAC 25.1309-1A

• Flight crew action• Ground crew action• Certification check requirements• Flight with inoperative equipment

Page 22: System Safety:  A systematic processes

Quantifying or Qualifying Risk?Quantifying or Qualifying Risk?

Remember Murphy’s Law for Management: Remember Murphy’s Law for Management: “Technology is dominated by those who manage “Technology is dominated by those who manage

what they don’t understand”what they don’t understand”

Page 23: System Safety:  A systematic processes

Risk Acceptance CodesRisk Acceptance Codes

• RAC 1 – Unacceptable• RAC 2 – Undesirable• RAC 3 – Acceptable with controls• RAC 4 - Acceptable

Page 24: System Safety:  A systematic processes

Risk Assessment Shortcomings

• Deficiencies in RACs represent one of the major problems facing the system safety effort

• Quantitative severity and probabilities scales in most RAC matrices are too subjective

• The RAC is a main driver of system safety efforts– This code prioritizes the management emphasis

given to a particular problem

Page 25: System Safety:  A systematic processes

11 22 66 88 121233 44 77 1111 1515

55 99 1010 1414 1616

1313 1717 1818 1919 2020

THE THE “ENHANCED” “ENHANCED” RISK ASSESSMENT RISK ASSESSMENT MATRIXMATRIX

- Numeric Code is used to prioritize hazards and determine their acceptability using a quantitative methodology

ProbabilityFrequent Likely Occasional Seldom Unlikely

I

II

III

IV

Catastrophic

Critical

Moderate

Negligible

A B C D E

SEVERITY

Risk Levels

Page 26: System Safety:  A systematic processes

THE RISK PRIORITY LISTTHE RISK PRIORITY LIST

Highest Risk

Lowest RiskWarranting action

By ranking the hazards, we address them on a “worst-first” basis

Safety dedicated resources are always limited and should be directed at the highest risk

Page 27: System Safety:  A systematic processes

ASSESSMENT CHALLENGESASSESSMENT CHALLENGES

• Over optimism• Over pessimism• Misrepresentation/Misunderstanding• Alarmism / “Accident du Jour”• Indiscrimination• Bias• Inaccuracy

Page 28: System Safety:  A systematic processes

Total Risk Exposure CodesTotal Risk Exposure Codes

• Expanded scale• Probability expressed in Exposure• Severity expressed in Cost• Combined determination expressed in

quantifiable terms $$$$* (Now you are talking a language the bean counters understand)

Page 29: System Safety:  A systematic processes

Verification & ValidationVerification & Validation

• Quality of data establishes process credibility– Avoid GIGO syndrome

• Verify and Validate initial estimates with updated data– Failure rates– Exposure rates– Project lifecycle changes– Number of units in the system

Page 30: System Safety:  A systematic processes

THE PRIORITY LISTTHE PRIORITY LISTWhat does it accomplish?What does it accomplish?

Traditional Risk Management -Traditional Risk Management - Personnel can’t name or prioritize hazards -- can only identify general threats

ORM -ORM - Personnel can name and prioritize RISKS that impact them and their mission In a mature “NORMalNORMal” world, every individual personally benefits by adapting the knowledge of prioritized hazards that exist in their life -- (Due diligence is demonstrated when managers see that their subordinates possess this knowledge)

Page 31: System Safety:  A systematic processes

System Safety PrecedenceSystem Safety PrecedenceA systematic approach to Hazard ID – Risk Assess and ControlA systematic approach to Hazard ID – Risk Assess and Control

• Design to minimize hazards• Robust & Redundant systems, assemblies,

components, etc• Install physical barriers

• Isolate known threatening conditions or environments• Use Warning devices

• Alerts to prevent or reduce unwanted event• Develop Procedures and Training

• Most commonly used & abused hazard control

Page 32: System Safety:  A systematic processes

Risk Analysis

1. Identifythe Hazards

2. Assessthe Risks

3. Analyze Risk Control

Measures

4. MakeControl

Decisions

5. ImplementRisk Controls

6. Superviseand Review

Page 33: System Safety:  A systematic processes

Assessing Risk Controls

Identify controloptions

Determine control effects

Prioritize riskcontrol measures

Page 34: System Safety:  A systematic processes

2 Major Risk Control Approaches

• Employ Macro Risk Control Option(s)– Reject – Avoid – Delay –Transfer –Spread –

Compensate – Reduce

• Implement System Safety Precedence Control Option(s) – Engineer – Guard – Improve Design – Limit

Exposure – Personnel Selection – Train – Warn – Motivate – Reduce Effect - Rehabilitate

Page 35: System Safety:  A systematic processes

““Swiss Cheese” Model of DefensesSwiss Cheese” Model of Defenses

James Reason: “Managing the Risks of Organizational Accidents”

Potential losses (people and assets)

The reality

The ideal

Hazards

Page 36: System Safety:  A systematic processes

““Swiss Cheese” Model of DefensesSwiss Cheese” Model of Defenses

James Reason: “Managing the Risks of Organizational Accidents”

Other ‘holes’ due to latent conditions

Defenses in depth

Some ‘holes’ due to active

failures

Page 37: System Safety:  A systematic processes

Macro OptionsMacro Options

• REJECT– Risk outweighs benefit

• AVOID– Go around the risk, do it in a different way

• DELAY– Maybe the problem will be resolved by time

• If delay is an acceptable option consider if operation is needed at all

• TRANSFER– Better qualified system, i.e.,“Pro’s From Dover”

Page 38: System Safety:  A systematic processes

Macro Options (cont)Macro Options (cont)• SPREAD

– Modular or separate Hazardous Operations

• COMPENSATE– Design parallel and redundant systems

• REDUCE– Design for minimum risk– Incorporate Safety Devices– Provide Warning Devices– Develop SOPs & Train

Page 39: System Safety:  A systematic processes

The Risk Control Macro Option ListThe Risk Control Macro Option List

• Reject• Avoid• Delay• Transfer• Spread• Compensate• Reduce• QUESTION: Why isn’t eliminate on this list?

Page 40: System Safety:  A systematic processes

Determine Risk Control EffectsDetermine Risk Control Effects

• How will this effect probability?• How will this effect severity?• How will this impact other sub-systems?

– Some controls support other sub-systems– Some controls may hinder other sub-systems

• What are the costs vs. benefits? – Direct Costs– Indirect Costs

Page 41: System Safety:  A systematic processes

Direct vs. Indirect CostsDirect vs. Indirect Costs

“As a rule of thumb, it is generally acceptable to calculate indirect costs of

a mishap to be 7 times greater than those costs which can directly be accounted for in the incident or

accident”

Page 42: System Safety:  A systematic processes

Risk Control ROT’sRisk Control ROT’s

• Use the System Safety Precedence order• Choose the most mission supportive

combinations• Use Integrated Product Teams • Look for synergistic enhancements

– Man – Machine – Medium – Mission - Management

Page 43: System Safety:  A systematic processes

Use the 5 M model as you look for Use the 5 M model as you look for systemic issuessystemic issues

• Man: – Doesn’t know– Doesn’t care– Can’t physically accomplish

• Machine:– Poor design– Faulty maintenance– SOP’s

Page 44: System Safety:  A systematic processes

5 M systemic issues (cont)5 M systemic issues (cont)• Medium

– Weak design considerations– Lack of provisions for natural “phenomena”

• Management:– Inadequate procedures– Inadequate policy– Inadequate standards & controls

• Mission:– Poorly thought out– Poorly executed– Weak understanding – Incompatibilities

Page 45: System Safety:  A systematic processes

Providing Management Risk Providing Management Risk Control OptionsControl Options

• Program Manager looking for optimum combinations– Mission supportive

• Some Risk Controls are incompatible– Evaluate full cost versus full benefit

• Be prepared for numbers game

• Some Controls reinforce one another– Win-Win option

• Redundancy = Robustness– Is it needed?

• Can you afford it? i.e., $$$, #’s, real estate

Page 46: System Safety:  A systematic processes

Aid to Decision MakingAid to Decision Making

• Be prepared to assist decisions at the right time– Don’t rush – Make them as late as possible without

negative impact on timeline• Insure decisions are made at the right level

– It should be establish who makes the tough calls– Use RAC or TREC to quantify who, what, when

• Provide Mission supportive options– Use the Macro Option list as a starting point– Be prepared to offer sound advice

Page 47: System Safety:  A systematic processes

Don’t be one who says, “ …data or Don’t be one who says, “ …data or information was not available and our information was not available and our

department could not prove it was unsafe to department could not prove it was unsafe to allow the operation.”allow the operation.”