is the cost of reliability, maintainability, and...

18
© The Aerospace Corporation 2014 Is the Cost of Reliability, Maintainability, and Availability Affordable for Software Intensive Systems? Presented to IEEE Software Technology Conference Long Beach, CA Myron Hecht April, 2014 1

Upload: buicong

Post on 29-May-2018

228 views

Category:

Documents


7 download

TRANSCRIPT

Page 1: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

© The Aerospace Corporation 2014

Is the Cost of Reliability, Maintainability, and Availability Affordable for Software IntensiveSystems?

Presented toIEEE Software Technology ConferenceLong Beach, CA

Myron HechtApril, 2014

1

Page 2: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Outline

• Consideration of RMA in Systems Acquisition is Required by Law• New Policies Have Been Created• Investment in RMA Pays Off• Reliability Best Practices Have Been Established• Methods for Calculating Operation and Sustainment Costs Have Been

Created• Tools for Reliability Analysis, Evaluation, and Tracking Exist

– And advanced tools are being developed• So What’s the Problem?• Conclusions

2

Page 3: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Consideration of RMA in Systems Acquisition is Required by Law• Public Law 111-123 “Weapons System Acquisition Reform Act of

2009,” Section 102 (codified as 10 USC 4 Section 139d)– (b) Director of Systems Engineering-– (1) APPOINTMENT- There is a Director of Systems Engineering, who shall be

appointed by the Secretary of Defense from among individuals with an expertise in systems engineering and development planning.

– . . .– (5) DUTIES- The Director shall—– ``(A) develop policies and guidance for—– ``(i) the use of systems engineering principles and best practices,

generally;– ``(ii) the use of systems engineering approaches to enhance reliability,

availability, and maintainability on major defense acquisition programs;– ``(iii) the development of systems engineering master plans for major

defense acquisition programs including systems engineering considerations in support of lifecycle management and sustainability; and

– ``(iv) the inclusion of provisions relating to systems engineering and reliability growth in requests for proposals;

Page 4: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

New Policies Have Been Created• CJCSI 3170.01, “Operation Of The Joint Capabilities Integration And

Development System,” May, 2007– Materiel Availability and Operational Availability as Key Performance

Parameters (KPPs)– Reliability and Ownership Costs as Key System Attributes

• DoDI 5000.02 interim*, “Operation of the Defense Acquisition System,” November, 2013

– PMs for all programs shall formulate a viable Reliability, Availability, and Maintainability (RAM) strategy….. RAM • shall be integrated within the Systems Engineering processes, • documented in the program’s Systems Engineering Plan (SEP) and

Life-Cycle Sustainment Plan (LCSP), and • assessed during technical reviews, test and evaluation (T&E), and

Program Support Reviews (PSRs).

*superseding 2008 version4

Page 5: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Growth of Software Content in DoD Weapons Systems

5

Source: Scott Lucero, Office of the Deputy Undersecretary of Defense (Acquisition and Technology), “Software Sustainment Challenges in Defense Acquisition”, AIAA Engineering Conference, April 2009, http://www.acq.osd.mil/se/webinars/2009-04-21-SECIE-SW-Sustainment-Lucero-brief.pdf

Page 6: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Growth of Software Importance in USAF Aircraft

Platform Year% Functions Controlled by

SoftwareF-4 1960 8%A-7 1964 10%F-111 1970 20%F-15 1975 35%F-16 1982 45%B-2 1990 65%F-22 2000 80%

6

Source: Scott Lucero, Office of the Deputy Undersecretary of Defense (Acquisition and Technology), “Software Sustainment Challenges in Defense Acquisition”, AIAA Engineering Conference, April 2009, available online at http://www.acq.osd.mil/se/webinars/2009-04-21-SECIE-SW-Sustainment-Lucero-brief.pdf

Page 7: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Investment in RMA Pays Off

• Operations and Support make up 65-80% of the lifecycle cost

• (Predator, Global Hawk, FBCB2) cited as showing Return on Investment of Reliability Improvement of 5:1 to 128:1

• DLA Study of 10-year returns of 15.5:1

7

Source: Ernest Seglie, (Science Advisor, DOT&E), “Investing in Reliability, Availability, and Maintainability and the Effect on Logistics, Operational Support, and Lifecycle Cost”, http://www.gardenstatesole.org/events/symposium08/seglie.pdf

Page 8: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

8

Reliability Best Practices Have Been Established

Program plan

Reliability Modeling,Allocation,

Assessment FRACAS

MethodologyData Contents

Database DescriptionCalculation of rates

Relationship with FMEA/CA

MetholodogiesRelationship with FRACAS,

FMEA/CA

FailureData FMEA/CA

PredictedFailureModesActualFailureModes

Reliability Test Plan

PredictedFailureModes

Priorities

Procedures, Methods

Test Data

Failure Data

MethodologiesTest Strategies

Interrelationship with other tasks

Unanticipated Failure Modes discovered during test

SecuritySystems

Engineering

SecuritySystems

Engineering

SecuritySystem Safety

SecuritySystem Safety

Reliability TestReport

Source: Myron Hecht, Karen Owens, and Joanne Tagami, “Reliability-Related Requirements in Software-Intensive Systems”, Proc. 2007 Reliability and Maintainability Symposium

Page 9: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

9

Methods for Calculating Operation and Sustainment Costs Have Been Created

Expected number of failed items

i

ci

c

i

ic

io μ

λci)!i!(n

n!ic!μ

λi)!i!(n

n!i)(cpL 11 1

0

1

0

Time waiting to start repair

)(q

q LnL

W

ic

ioq μ

λ)!i!(n

n!i)(cpcLL1

0 1

qunsched

q

WMTTRMTBOWMTTR

U

Unavailability

Source: Myron Hecht, “A Model for Determining the Cost of National Airspace System Maintenance Service Levels”, 2005 AIAA Aviation Technology, Integration, and Operations Conference

L= number of items waiting to be fixedWq = waiting time for items to be fixedpo = probability of an empty queueU = unavailabilityMTTR = mean time to repairMTBO = mean time between outages

Number of items awaiting repair

Page 10: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Tools for Reliability Analysis, Evaluation, and Tracking Exist

• General Math / Statistics • Maintainability

– Design • Human Factors

– Maintenance Concept – Not Otherwise Categorized – Other – Prediction

• Detection • MTTR • Not Otherwise

Categorized • Other

– Reliability Centered Maintenance

• Condition Monitoring • Other

– Safety – Testing

• BIT • Other • Testability

• Reliability – Availability – Components

• Applications – Mechanical

• Data – Discrete – Microcircuit – Other – Passive

– Design / Analysis • De-rating • Failure Analysis • Finite Element

Analysis • FME(C)A

– Design FME(C)A

– Process FME(C)A

• FRACAS • FTA • Markov • Mechanical • Modeling /

Simulation – RBD

• Other • Parts Control 10

• Quality – ISO-9000 – Manufacturing – Not Otherwise

Categorized – Other – Statistical Process

Control – TQM

• Risk • Supportability

– Life Cycle Cost – Logistics

• Data • Maintenance

Concept • Other • Spares

– Referral – Warranty

• Thermal Analysis

• Weibull– EOS / ESD

• Susceptibility – Non-operating – Prediction

• Bayesian • CNET • Mechanical • MIL-HDBK-217 • Non-operating • Other • 217 • Telcordia

(Bellcore) – Screening

• Burn-in • ESS • Other

– Software Reliability – System

• Data – Testing

• Accelerated Life

–Source: The Reliability Information Analysis Center,Software Tools Pagehttp://www.theriac.org/informationresources/softwaretools.html

Page 11: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

…and advanced tool sets are being developedModel Driven Design and Analysis Tool Set (Aerospace Corp)

11

Architecture and Error Models

Stochastic Network Model

Results

Page 12: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

…and advanced tool sets are being developed ASSERT* (European Space Agency)

• Source– European Space

Agency and Collaborating Tool Developers

• ESA Autonomous Transfer Vehicle Control System Development Example

– Specify arbitration logic

– Define finite state machine (using SDL)

– Verify (check for exceptions, deadlocks, stop conditions)

– Autogenerate code– Deploy on runtime

environment

System Design Language

Finite State Machine

E. Conquet, M. Perrotin, C. Colombo, M.-A.Esteve, “Results of ASSERT and follow-up studies: technical status and exploitation activities”, AADL User Day, 27 May 2009http://www.assert-project.net 12

* Automated proof-based System and Software Engineering for Real-Time applications

Page 13: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

So What’s the Problem?

• Programs not managing to requirements• Lack of incentives

– RMA Efforts have costs• Impact on design• Impact on cost and schedule• Addition of uncertainty – particularly if RMA group is less competent

than Government oversight– No discernible benefit

• Insufficient EVM and cost accounting techniques• Programmatic Attention Elsewhere• Poor Systems Engineering

13

*Source: Ernest Seglie, (Science Advisor, DOT&E), “Investing in Reliability, Availability, and Maintainability and the Effect on Logistics, Operational Support, and Lifecycle Cost”, http://www.gardenstatesole.org/events/symposium08/seglie.pdf

Page 14: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

What to do

• Accurate Valuation Of RMA-related Benefits As Well As Costs In DoD Affordability Initiatives

• Effective Earned Value Tracking Of RMA Engineering Tasks• Early and Frequent Feedback From RMA Into The Design And

Implementation Activities

14

Page 15: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Valuation Of RMA-related Benefits As Well As Costs In DoD Affordability Initiatives

• Cost: – NRE in engineering and analysis

• Benefits – Greater operational or mission effectiveness– Fewer units required (fleet availability is higher when unit availability is

higher)– Smaller sustainment organization– Lower spares inventory and logistics costs

15

Page 16: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Accurate Earned Value Tracking Of RMA Engineering Tasks

TaskProgram Milestone

ReviewProportion of

Earned Value*

Reliability Program Plan System Requirements Review (SRR) 5%

Reliability/Availability Allocations System Design Review (SDR) 5%

Preliminary Reliability Predictions and Failure Modes and Effects Analysis (FMEA)

Preliminary Design Review (PDR) 5%

Final Reliability Models Predictions and FMEA

Critical Design Review (CDR) 20%

Failure Reporting and Corrective Action System (FRACAS)

Test Readiness Review (TRR) 10%

Life, Stability and Failure Recovery testing(may be part of Day in the Life Testing) and Maintainability TestingComplete

Functional Configuration Audit (FCA) 30%

Requirements Verification Complete Physical Configuration Audit (PCA) 25%

16

*notional values, actual values should be program specific

Page 17: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

Recommended RMA Activities in a Software Intensive Development Program

17

Definition of Degraded Modes, Acceptable Outage Durations,

Maintenance Concepts

R&M Testing (incl. SW)

Redundancy, Failure Containment, Degraded Modes, FMEA* (initial)

R&M Requirements Allocation (incl. SW)

Failure/Recovery Testing

FD&R** Design, FMEA, predictions, FMEA (detailed)

FD&R Implementation, Static Analysis (coding standard conformance), FMEA revisions

DR Data tracking, ID of unanticipated failure modes,

FD&R verification* Failure Modes and Effects Analysis** Failure Detection and Recovery

Page 18: Is the Cost of Reliability, Maintainability, and ...conferences.computer.org/stc/2014/papers/5034a059.pdfIs the Cost of Reliability, Maintainability, and Availability ... defense acquisition

18

Next Steps• Valuation techniques to link between RMA engineering to affordability

– Methods of measurement and accounting of operational suitability, effectiveness, and sustainment engineering for acquisition managers

– Is the link really as strong as congress, the GAO, and RMA advocates believe?

• Methods to assess the RMA-related Key Performance Parameters (KPPs) and Key System Attributes (KSAs) during development

– Metrics– Models

• Methods to identify operations and sustainment costs– Access and analysis of operations and maintenance databases– Methods to improve data recording and collection

• Policies and metrics linked to award fees for incentivizing contractors to invest in RMA engineering

– Linking operational and sustainment costs to award fees and past performance ratings