Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Fayssal Safie/MSFC
August 1, 2000
An Overview of Quantitative Risk Assessment Methods Definitions Qualitative and Quantitative FMEA FMECA Qualitative and Quantitative Fault Tree Analysis (FTA) Probabilistic Risk Assessment (PRA) Reliability Allocation Reliability Prediction Reliability Demonstration Trend Analysis Probabilistic Structural Analysis Design of Experiments (DOE) Statistical Process Control (SPC) Manufacturing Process Capability
2
Definitions
Probability: The chance or the likelihood of occurrence of an event. Risk: The chance of occurrence of an undesired event and the severity of the resulting consequences. Risk Assessment: The process of qualitative risk categorization or quantitative risk estimation. Risk Management: The process of risk identification, risk assessment, risk disposition, and risk tracking and control.
Definitions
Reliability: The probability that an item will perform its intended function for a specified mission profile. Safety: The freedom of injury, damage, or loss of resources. Hazard: The condition that can result in or contribute to a mishap. Mishap: An unintended event that can cause injuries, damage, or loss of resources.
FMEA is an inductive (bottom-up) engineering analysis method. It is intended to analyze system hardware, processes, or functions for failure modes, causes, and effects. Its primary objective is to identify critical and catastrophic failure modes and to assure that potential failures do not result in an adverse effect on safety and system operation. It is an integral part of the design process. It is performed in a timely manner to facilitate a prompt action by design organization and project management.
20-01-44 Turbine Exhaust Duct Assembly P/N: 10206-0002-102 Ref. Des.: None 2 Required Vents HPU turbine exhaust gas to atmosphere outside of the aft skirt. Exhaust Duct Assembly includes: Upper Exhaust Assembly (three bellows) 10206-0003-101 Middle Exhaust Assembly 10206-0007-101 Alt. 10206-0031-851 Alt. 10206-0044-851 Alt. 10206-0045-851 Lower Exhaust Assembly 10206-0010-101 FM Code A01 External leakage of hot exhaust gas (System A and/or B) caused by: Bellows fracture/ fatigue Flange/duct fracture Seal failure Seal surface defect Improper torque Contamination during assembly Improperly lockwired. A,B. Actual loss Loss of containment of hot exhaust gases. A,B. Probable Loss A,B. Probable Loss Fire and explosion. Fire and explosion will lead to loss of the mission, vehicle, and crew. Reaction Time: Seconds C,D,E. No Effect Failure mode not applicable to these phases. C,D,E. No Effect Failure mode not applicable to these phases. C,D,E. No Effect Failure mode not applicable to these phases. a) N/A b) N/A 3 a) None b) N/A Correcting Action: None Timeframe: N/A 1
MIL-STD-1629A, Procedures for Performing a FMECA, discusses the FMECA as a two-step process:
Failure Modes and Effects Analysis (FMEA). Criticality Analysis (CA).
Criticality analysis can be done quantitatively using failure rates or qualitatively using a Risk Priority rating Number (RPN). CA using failure rates requires extensive amount of information and failure data. A RPN is relatively simple measure which combines relative weights for severity, frequency, and detectability of the failure. It is used for ranking high risk items.
9
Effects
Fire and Explosion
Fire and Explosion Fire and Explosion Fire and Explosion Fire and Explosion
Turbine Exhaust External Duct Assembly leakage of hot exhaust P/N gas (System 10206-0002-102 A and/or B)
6. Contamination Fire and during assembly Explosion 7. Improperly lockwired Fire and explosion
10
A FTA is a deductive (top-down) approach that graphically and logically represents events at a lower level which can lead to a top undesirable event. It is a tool that systematically can answer the question of what can go wrong by identifying failure scenarios. It is an excellent tool for analyzing complex systems. Qualitative FTA is predominately a Safety tool.
11
Charging Connector
External Pow er
18 HP var
PT
Pump Battery 1
Flight Computer
18 HP var
PT
Pump Battery 2
PT
Pump Battery 3
Cooling Plate
FWD Manifold
12
MTR-1-PWR Page X
MTR-CTRL-1-OFF
MTR-CTRL-1-PWR
Pump Package 1 Motor Controller Fails Off / Low (Component Failure) MTR-CTRL-1-FOF
Pump Package 1 Motor Controller Commanded Off / Low (Software / Pressure Transducer Error) MTR-1-CTRL-CMD-OFF
PMP-PKG-1-REL-OFF
Page XX Pump Package 1 Relay Fails Off Pump Package 1 Relay Commanded to "Off" Position
PKG-1-REL-FOF
PMP-PKG-1-CMD-OFF
13
14
16
17
System Description: Methane loading system - The methane is stored in a tank in a liquid form and then vaporized and loaded as a gas. This example terminated at valve failure.
18
NO-LOAD-CH4
VLV-1557-OP
VLV-1537-CL
6.50E-06
3.90E-04
6.50E-06
3.90E-04
19
20
22
MLD identifies all significant basic/ initiating events that could lead to loss of vehicle.
Blade Failure
Scenario Number 1 2
QUANTIFICATION OF ESD INITIATING & PIVOTAL EVENTS Event Sequence Diagram (ESD)
Turbine Blade Porosity Inspection Not Effective Porosity Present in Critical Location Porosity in Critical Location Leads to Crack in <4300 sec
3 4 5
Mission Success
Mission Success
Mission Success
Products
1. System Risk 2. Element Risk 3. Subsystem Risk 4. Risk Ranking 5. Sensitivity Analysis etc..
Blade Failure
Mission Success
23
Emergency Coolant P2
P1 D
Normal Coolant
A Coolant System
P1 and P2 are electrically driven pumps, D is a flow detector, and EP (not shown) is the electric power Initiating event is a break in the normal coolant pipe Full system success (S) requires both pumps operating, the detection system, and the electrical power operating One pump operating results in partial success (P) Two pumps failing or failure of electrical power (EP) results in system failure (F) 25
P(EP)
5-F
Q(EP)
6-F
P(.) - Probability of Component Success Q(.) - Probability of Component Failure S - Full System Success P - Partial System Success F - System Failure
26
NSD
S&A
CDF1
CDF2
0.9971161
0.9998843
0.9965403
0.9996991
0.9996991
LSC
NSD
S&A
CDF1
CDF2
NSD - NASA Standard Detonator S&A - Safe and Arm CDF - Confined Detonating Fuse LSC - Linear Shaped Charge
27
Reliability Allocation
Reliability allocation is the top-down process of subdividing a system reliability requirement into subsystem and component requirements. Reliability allocation is performed in order to translate the system reliability requirement into more manageable, lower level requirements.
28
Reliability Allocation
Example
0.999
SSME Reliability
0.99975
0.99975
0.99980
0.99985
HPFTP
0.99987 0.99987
Chamber
0.99985
Nozzle
Turbine Assy
0.999961
0.999909
Housing Assy
0.999945
Rotor Assy
0.999964
Blades
Retainers
29
Reliability Allocation
Benefits: Reliability allocation allows design trade-off studies to be performed in order to achieve the optimum combination of subsystems which meets the system reliability requirement.
30
Reliability Prediction
Reliability prediction is the process of quantitatively estimating the reliability of a system. Reliability prediction is performed to the lowest level for which data is available. The sub-level reliabilities are then combined to derive the system level prediction. Reliability prediction during design is used as a benchmark for subsequent reliability assessments. Predictions provide managers and designers a rational basis for design decisions.
31
Reliability Prediction
Reliability prediction techniques are dependent on the degree of the design definition and the availability of historical data. Similarity analysis techniques: Reliability of a new design is predicted using reliability of similar parts. Probabilistic design techniques: Reliability is predicted using engineering failure models. Techniques that utilize generic failure rates such as MILHDBK 217, Reliability Prediction of Electronic Equipment.
32
Reliability Prediction
Similarity Analysis Example
Fuel Turbo Pump
Assume a Fuel Turbo Pump (FTP) has a historical failure rate of: 50 per 100k firings Assume also the failure mode break down is:
Cracked/Fractured Blades Turbine bearing Failure 35% 25% 20% 10% 10% 100%
33
Reliability Prediction
Similarity Analysis Example
Fuel Turbo Pump
Then the Thermal Stress Failure Rate is: 0.57 X 17.5 = 10/100k firings
34
Reliability Prediction
Similarity Analysis Example
Fuel Turbo Pump
Failure Rate Adjustments established through: Test Results Preliminary Analyses Integrated Product Team (IPT) Input Address "high hitters" - Using Thermal Stress failure rate of 10.0/100k firing Design changes to improve reliability Cum Percent Failure Rate Improvement Reduction Lower Operating Temperatures 20% 2.00 (Test) Hollow Blades 30% (additional) 4.40 (Analysis, Expert Opinion) Material Change 20% (additional) 5.52 35 (Analysis)
Reliability Prediction
Similarity Analysis Example
Fuel Turbo Pump
36
Reliability Prediction
Benefits: Provides a early quantitative evaluation of design Identifies problem areas Identifies parts and components with highest potential reliability improvements Makes full use of lessons learned
37
Reliability Demonstration
Reliability Demonstration is a reliability estimation method that primarily uses test data (objective data) and statistical formulas to calculate demonstrated reliability or to demonstrate numerical reliability goal with some statistical confidence. Models and techniques used in reliability demonstration include Binomial, Exponential, Weibull models. Reliability growth techniques, such as the U.S. Army Material Systems Analysis Activity (AMSAA) and Duane models can also be used to calculate demonstrated reliability. Historically, some military and space programs employed this method to demonstrate reliability goals. For example, a reliability goal of .99 at 95% confidence level is demonstrated by conducting 298 successful tests.
38
Reliability Demonstration
Reliability Calculation through Demonstrated Tests By Using Binomial Statistical Formula
500
(.998)
200
Typical Case: To demonstrate .99 reliability with 95% confidence, it takes 298 successful tests
400
500
600
700
800
900
1000
Reliability Demonstration
Benefits: It provides a way to validate numerical reliability requirement. It provides a way to calculate the reliability that has been demonstrated so far by the item under consideration. It eliminates the subjectivity that is usually embedded in other reliability estimation methods. Through rigorous reliability demonstration test program, design weakness and failures can be revealed and corrective actions can be taken to significantly improve reliability. Limitations: It is very expensive and time-consuming to run through a reliability demonstration program. Data quantity sensitive.
40
Trend Analysis
Problem/performance trending is a statistical characterization of problem/performance data using graphical/descriptive techniques. Performance trending is done using control-type charts. The simplest and most powerful trending tool is the Pareto Chart for problem trending. In general, problem trending involves: Extracting related problem data from a historical problem database. Normalizing raw problem counts into problem rate of occurrence based on prime parameter (starts, seconds of run time). Plotting normalized data to establish a frequency chart. Fitting a trend curve to the frequency plot. Analyzing the fitted curve for trends.
41
Tu r bo
1000 1200 1400 1600 1800 2000 200 400 600 800
m
0
ac hi ne ry
C om bu st io n In s tru m en ta tio n
Pl u m bi ng
En gi ne H ar ne ss es Pr o pe lla nt Vl vs ite rs
Ig n H yd r au lic s In te r co nn ec ts lle Pn e um at ic
Problem Trending
on
tro
s G SE So ftw ar
42
Count
Trend Analysis
Benefits: Performance trending
Helps in identifying potential problems with a performance parameter before it occurs.
Problem trending
Identifies major problem areas for optimum allocation of resources. Evaluates effectiveness of past recurrence control actions. Predicts future failure rates in a given area. Points to desirable and undesirable effects of hardware processing changes. Communicates in simple, logical, visual, and easily understandable presentation.
Limitations: Significant engineering evaluation may be required to isolate appropriate set of problems. Rationale for frequency changes may not be obvious.
43
44
FRACTURE LOCATION
During rig testing the AT/HPFTP Bearing experienced several cracked races. Summary of 440C race fractures / tests: 3 of 4 Fractured
45
Failure Region
46
47
Tolerance fits of rig test bearing Inner race hoop stress contribution at given conditions Shaft and sleeve hoop stress contribution at given conditions.
At Test
3 of 4 failed
Race Configuration
440C w/ actual manufacturing stresses (ie ideal + abusive grinding) 440C w/no manf. stresses 440C w/ideal manf. stresses 9310 w/ ideal manf stresses
It is estimated that 50% of the through ring fractures would result in an engine shutdown. The shutdown 9310 HPFTP Roller Bearing Inner Race Failure Rate is 49 then: 0.50 X 10/100k = 5 fail/100k firings
Benefits:
Used to understand the uncertainty of the design and identify high risk areas. Used to perform sensitivity analysis and trade studies for reliability optimization. Used in identifying areas for further testing.
50
51
Goal: Determine if the weld process is sensitive to cover pass oscillation parameters.
Factors examined included width, dwell and speed, each with three levels: Width - how far does it oscillate : 0.03, 0.10, 0.17 inches Dwell - how long do you pause at the ends of the oscillation : 0.35, 0.52, 0.70 sec Speed - how fast do you oscillate : 10.0, 27.5, 45.0 inches per minute
Responses : Room Temperature and Cryo Tensile strengths Model : Response Surface Model (Box-Behnken) generated and analyzed using ECHIP Software 52 Total number of tests : 16
RT UTS
Dwell = 0.00 50
Cryo UTS
Dwell = 0.00 60 55
45
40
50 45
35
53
54
55
Benefits: Provides a tool to understand variability in design and manufacturing. Reduces time to establish mature design and manufacturing processes. Saves time and money by optimizing the experiment input and output. Reduces potential of nonconformances.
56
57
X-bar
36 34 32 30 28
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Subgroup
58
Range
9 6 3 0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Subgroup
59
2.50 P e r c 2.00 e n t R e s 1.50 Lower Spec Limit Lower Control Limit X bar Percent Residual Volatiles Upper Control Limit Upper Spec Limit
V 1.00 o l s
0.50
Vendor Change Made Vendor Change Corrected
19
37
55
10
13
16
22
25
28
31
34
40
43
46
49
52
58
61
64
67
Sample Number
70
0.00
61
In simple terms, the manufacturing process capability is defined as the ratio of the engineering specification width to the process width (3sigma for one-sided, 6-sigma for two-sided). This ratio is called the process capability index (Cpk). As a rule of thumb: Cpk > 1.33 Capable Cpk = 1.00-1.33 Capable with tight control Cpk < 1.00 Incapable Manufacturing process capability is essential to evaluate the suitability of the process to meet the spec. Manufacturing process capability data are one of essential data sources to support design feasibility and reliability trade study.
63
lox post
ID OD
64
Tolerance boundaries were established as +/- .0005 for both OD and ID.
Results indicate the process capability is feasible to support design and reliability requirement.
65
Nominal
USL
Mean = -.0000095 sigma = .000076 Cpk = 2.14
0 -5 -3 -1 1 3 5 (X 0.0001)
67
Conclusions/Recommendations
QRA is a well-established technology that involves methods and techniques beyond conducting classical PRA studies. QRA is essential to understanding uncertainty and controlling our critical processes. Implementation and use of QRA could be enhanced if
QRA is incorporated as part of the system management process QRA methods and techniques are viewed as part of the system engineering effectiveness tools
QRA is extremely important for the Space Shuttle Program to understand and control risk. QRA techniques are wellestablished, however, the application of the techniques on a larger scale will require careful planning, extensive training, and strong commitment by Shuttle Program management to pursue long term plans.
68