Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
The
Reliability
HANDBOOK
From downtime
to uptime
– in no time!
John D. Campbell, editor
Global Leader, Physical Asset Management
PricewaterhouseCoopers LLP.
The
Reliability
HANDBOOK
CONTENTS
INTRODUCTION CHAPTER THREE CHAPTER SIX
5 Reliability: past,
present, future 23 Is RCM the right
tool for you? 57 Optimizing condition
based maintenance
by John D. Campbell by Jim V. Picknell by Murray Wiseman
Establishing the historical and theor
etical Detem
r ining your reliability needs Getting the most out of your equipment
framework of RCM ■ The 7-step RCM process befoer repair time
■ The RCM “product” ■ Step 1: Data preparation
CHAPTER ONE ■ What can RCM achieve? ■ Events and inspections data
9 Take stock of
your operation
■ RCM decision checklist ■ Step 3: Testing the PHM
■ Step 4: The transition probability model
by Leonard Middleton and Ben Stevens CHAPTER FOUR ■ Discussion of transition probability
Measuring and benchmarking
your plant’serliability 39 The problem of uncertainty
by Murray Wiseman
■ Step 5: The optimal decision
■ Step 6 Sensitivity Analysis
■ Benefits of benchmarking What to do when your reliability ■ Conclusion
■ What to measure? plans aren’t looking soerliable
■ Relating maintenance performance ■ The four basic functions APPENDIX
measures to the business objectives
■ Excess capacity (cost-constrained)
■ Summary
■ Typical distributions
71 Searching the Web for
reliability information
■ Capacity-constrained businesses by Paul Challen
■ An example
■ Compliance with requirements Looking for useful Internet sites?
Heres’
■ Real life considerations — the data problem
■ How well are you performing? where to start
■ Censored data or suspensions
■ Findings and sharing results ■ Reliability Analysis Center
■ External sources of data CHAPTER FIVE ■ Book and print material sources
■ Researching secondary data
■ General benchmarking considerations 49 Optimizing time based
maintenance
■ Professional organizations
■ General information
■ Internal benchmarking by Andrew K.S. Jardine
■ Industry-wide benchmarking Tools for devising a replacement system for
■ Benchmarking with comparable industries your critical components
■ Enhancing reliability through
preventive replacement
■ Block replacement policies
■ Statement of problem
■ Result
■ Age-based replacement policies
■ When to use block replacement
over age replacement
■ Setting time based maintenance policies
JAMES PICKNELL is a principal in PricewaterhouseCoopers Mainte- MURRAY WISEMAN is a principal consultant with Pricewaterhouse-
nance Management Consulting Center of Excellence. He has more than Coopers, and has been in the maintenance field for more than 18 years.
twenty-one years of engineering and maintenance experience including He has been a maintenance engineer in an aluminum smelting opera-
international consulting in plant and facility maintenance management, tion, and maintenance superintendent at a large brewery. He also found-
strategy development and implementation, reliability engineering, ed a commercial oil analysis laboratory where he developed a
spares inventories, life cycle costing and analysis, benchmarking for best Web-enabled Failure Modes and Effects Criticality Analysis system, incor-
practices, maintenance process redesign and implementation of Com- porating an expert system and links to two failure rate/ mode distribution
puterized Maintenance Management Systems (CMMS). You can reach databases at the Reliability Analysis Center. You can contact him at 416-
him at 416-941-8360 or by e-mail at James.V.Picknell@ca.pwcglobal.com. 815-5170 or by e-mail at murray.z.wiseman@ca.pwcglobal.com.
P
PLANT ENGINEERING AND MAINTENANCE
A CLIFFORD/ELLIOT PUBLICATION
Volume 23, Issue 6 December 1999
C
MRO Handbook is published by Clifford/Elliot Ltd., 209 –3228 South Service Road., Burlington, Ontario, L7N 3H8. Telephone (905) 634-2100.
Fax 1-800-268-7977. Canada Post – Canadian Publications Mail Product Sales Agreement 112534. International Standard Serial Number (ISSN) 0710-362X.
Plant Engineering and Maintenance assumes no responsibility for the validity of the claims in items reported.
*Goods & Services Tax Registration Number R101006989.
c a
The Reliability Handbook 3
i n t ro d u c t i o n
Reliability: past,
present, future
Establishing the historical and
theoretical framework of RCM
by John D. Campbell
Not all that long ago, equipment design and production cycles created an environment in which
equipment maintenance was far less important than run-to-failure operation. Today, however, condition
monitoring and the emergence of Reliability Centred Maintenance have changed the rules of the game.
A
t the dawn of the new millenni- sure of the frequency of downtime. ment perf o rmance is, in some ways,
um, it is fitting that this edition Let’s look at an example. In case 1, like dealing with people. If, for the
of P E M’s annual h a n d b o o k your injection moulding machine is past three generations, your fore f a-
should be a discussion on re l i a b i l i t y down for a 24-hour repair job in the thers lived to the ripe old age of 95,
management. A half a millennium ago, middle of what was supposed to be a then there is a strong likelihood you
Galileo figured that the earth revolved solid five-day run. Its availability is will live into your 90s, too. But there
around the sun. A thousand years ago, t h e re f o re 80 percent (120hr- a re a lot of random events that will
efficient wheels were made to revolve 24hr/120hr). The reliability of the ma- happen between now and then. De-
around axles on chariots. Four thou- chine is 96 hours (96hr/1 failure). spite this rando mness, w e can use
sand years ago, the loom was the latest In case 2, your machine is down statistics to help us know what tasks to
engineering marvel. A little closer to 24 times for one hour each time. do (and not to do) to maximize our
the present day, maintenance manage- Its avail ab i lity i s sti ll 80 p erc e n t life span, and when to do them. Do ex-
ment has evolved tremendously over (120hr-24x1hr/120hr), but its reliabili- e rcise three times a week and don’t
the past century. ty is only four hours (96/24 failures)! smoke — and by analogy, do vibration
The maintenance function wasn’t The two measures are, however, monitoring monthly and don’t re-
even contemplated by early equipment closely related: build yearly.
designers, probably because of the un- When we put cost into this equation,
complicated and robust nature of the Availability =
Reliability we start to enter the realm of optimiza-
machinery. But as we’ve moved to built- (Reliability + Maintainability) tion. For this, there are several model-
in obsolescence, we have seen a pro- ing techniques that have proven quite
gression from preventive and planned where maintainability is the mean time useful in balancing run-to-failure, time
maintenance after WWII, to condition to repair. based replacement and co ndition
monitoring, computerization and life One of the most robust approaches based maintenance. Our objective is to
cycle management in the 1990s. Today, for managing reliability is adopting Re- minimize costs and maximize availabili-
evolving equipment characteristics are liability Centred Maintenance. The re- ty and reliability.
dictating maintenance practices, with cently-issued SAE standard for RCM is a Our Physical Asset Management
p redominant tactics changing fro m good place to start to see if you are team at PricewaterhouseCoopers is
run-to-failure, to prevention and now to ready to embrace this methodology. Al- pleased to provide this introduction to
prediction. We’ve come a long way! though SAE defines RCM as a “logical reliability management and mainte-
Reliability management is often mis- technical process...to achieve design re- nance optimization. If you are inter-
understood. Reliability is very specific liability,” it requires the input of every- ested in a broader discussion on these
— it is the process of managing the in- one associated with the equipment to topics, be sure to read our new book
terval between failures. If availability is make the resulting maintenance pro- Mainten ance Excellence: Optimizing
a measure of the equipment uptime, or gram work. Even at that, we are still Equipment Life C ycle Decisions , pub-
conversely the duration of downtime, dealing with a lot of uncertainty. lished by Marcel Dekker, New Yo r k
reliability can be thought of as a mea- Dealing with uncertainty in equip- (Spring 2000). e
The evolution
of reliability
How RCM developed as a
viable maintenance approach
by Andrew K.S. Jardine
Reliability mathematics and engineering really developed during the Second World War, through the
design, development and use of missiles. During this period, the concept that a chain was as strong as
its weakest link was clearly not applicable to systems that only functioned correctly if a number of sub-
systems must first function. This resulted in “the product law for series systems,” which demonstrates
that a highly reliable system requires very highly reliable sub-systems.
T
o illustrate this, consider a system
that consists of three sub-systems
(A, B, and C) that must work in se-
ries for the complete system to func-
tion, as in Figure 1.
In the system, where subsytems have
reliability values of 97 percent, 95 and
98 percent, respectively, then the com-
Figure 1: Series system reliability
plete system has a reliability of 90 per-
cent. (This is obtained from 0.97 x 0.95 of Defense coordinating the reliability The next millennium will see RCM
x 0.98). This is not 95 percent, which analysis of electronic systems by estab- continuing to play a significant role in
would be the case under the case under lishing AGREE (Advisory Group on establishing maintenance programs, but
the weakest-link theory. Reliability of Electronic Equipment). a new feature will be the focus by mainte-
C l e a r l y, for real complex systems The 1960s saw great interest in nance professionals of closely examining
where the design configuration is dra- aerospace applications, and this result- the plans that result from an RCM analy-
matically more complex than Figure 1, ed in the development of re l i a b i l i t y sis, and using procedures which enable
the calculation of system reliability is block diagrams, such as Figure 1, for these plans to be optimized.
more complex. Towards the end of the the analysis of systems. Keen interest in Other sections (Chapter Five,“Opti-
1940s, efforts to improve the reliability system reliability continued into the mizing time-based maintenance”, and
of systems focused on better engineer- 1970s, primarily fuelled by the nuclear Chapter Six,“Optimizing condition
ing design, stronger materials, and industry. Also in the 1960s, there grew based maintenance”) of this handbook
harder and smoother wearing surfaces. an interest in system reliability in the address procedures already available to
For example, General Motors ex- civil aviation industry, out of which re- assist in this thrust of maintenance opti-
tended the useful life of traction motors sulted Reliability Centred Maintenance mization. Both of them cover tools,
used in locomotives from 250,000 miles (RCM), which is covered in Chapter 3 policies, analysis methods, and data
to one million miles by the use of better of this handbook. preparation techniques to further the
insulation, high temperature testing, Subsequent to the early success of implementation of an RCM strategy.
and improved tapered-spherical roller RCM in the aviation industry RCM has
bearings. become the predominant methodology References
In the 1950s, there was extensive de- within enterprises (military, industrial Reliability Engineering and Risk Assess -
velopment of reliability mathematics by etc.) in the 1970s, 80s and 90s to estab- ment, Henley and Komamoto, Prentice-
statisticians, with the U.S. Department lish reliable plant operations. Hall, 1981. e
Take stock of
your operation
Measuring and benchmarking
your plant’s reliability
by Leonard G. Middleton and Ben Stevens
Does your business operate using “cash-box accounting”? That’s where you count your money at the
beginning and end of each month. If you have more when month’s end rolls around, then that’s good. If
not, well you’ll try to do better next month — or at least until the money runs out. No truly successful
large business operates using this model, yet maintenance is often organized and performed without
proper measures to determine its impact on the business’s success. The use of performance measurement
is rapidly increasing in maintenance departments around the world. This stems from a very simple
understanding: You can’t manage what you don’t measure. Performance measurement is therefore a core
element of maintenance management. The methodologies for capturing performance are of vital
importance, since unreliable measurements lead to unreliable conclusions, which lead to faulty actions.
The benefits of benchmarking
B
enchmarking reinforces positive be-
haviour and resource commitment. It
enables faster progress towards goals
by providing experience, through the ex-
ample of the participant and improved
“buy-in.” By using external standards of
performance, the organization can stay
competitive by reducing the risk of being
surpassed by competitors’ performance,
and by customers’ requirements.
Most import a n t l y, benchmarking
meets the information needs of stake-
holders and management by focusing
on critical, value-adding pro c e s s e s ,
while achieving an integrated view of
the business. Benchmarking is a pro-
cess that makes demands of all its par-
ticipants, but it is one of the best ways to
find practices that work well within
Figure 2: Inputs, outputs, and measures of process effectiveness
comparable circumstances.
measurement — the trick is in how to conditions to be in place — including
What should you measure? use the results to achieve the actions that consistent and reliable data, high quality
There is no mystique about performance are needed. This requires a number of analysis, a clear and persuasive presenta-
Internal benchmarking
The principal difficulty in benchmark-
ing is getting the critical data needed
for comparison. Internal benchmark-
ing addresses this issue by comparing
data from other organisations and divi-
sions within the company. Information
can be freely exchanged since it does
not provide an advantage to a competi-
tor. Data exchanged through internal
benchmarking programs is typically
quantitative because qualitative data re-
Figure 6: Acting on the results of a benchmarking study
quires considerably more analysis.
re s e a rching for secondar y data (de- could be government-generated re- The limitation (of internal bench-
scribed below) and benchmarking. ports, industry association reports, or marking) is that knowledge of one’s per-
Benchmarking may be one or more of annual reports of publicly-traded com- f o rmance relative to the extern a l
the following: panies. The data has a number of lim- companies remains unknown. Without
■ 1. internal benchmarking; itations. It is likely to be quantitative outside comparisons, a company is un-
■ 2. benchmarking within the industry; with little information available to un- able to determine whether it is perform-
■ 3. benchmarking with comparable or- derstand the context. It may not be ing at its highest possible capability.
ganisations not in the industry. c u rrent. C omparable per f o rm a n c e
After obtaining it, a critical issue m e a s u res may not be available be- Industry-wide benchmarking
with external data is how comparable it cause their underlying data were not In some industries, there is sharing of
is. Financial data is particularly difficult collected. data through a third party. It may be
to compare, because both financial ac-
counting (including GAAP restrictions)
and management cost accounting prac-
tices vary according to the org a n i s a-
tion’s objectives.
Questions you should ask when ana-
lyzing accounting data include: Are
MRO materials and outside services in-
cluded in the maintenance budget or
purchasing budget? Are replacement-
in-kind projects and turnarounds in-
cluded in the maintenance budget or in
the capital budget? In calculating
equipment availability, does it consider
all downtime, or just unscheduled
downtime? The answers will depend
upon what the organization is trying to
achieve with the calculation.
T
he recently issued SAE Standard, count of different operating environ- pressor installed at a sub-arctic location
JA1011, “Evaluation Criteria for ments to the extent that they can. For ex- may have the same manual and dew
Reliability-Centered Maintenance ample an automobile manual will specify point specifications as one installed in a
(RCM) Processes” outlines a set of crite- different lubricants and anti-freeze den- humid tropical climate. RCM is a
ria with which any process must comply sities that vary with ambient operating method for looking out for your own
to be called RCM. While this new stan- temperature. But they don’t often speci- destiny with respect to fleet, facility and
dard is intended for use in determining fy different maintenance actions based plant maintenance.
if a process qualifies as RCM, it does not on driving style — say, aggressive vs. de-
specify the process itself. The standard fensive — or based on use of the vehicle The 7-step RCM process
does present seven questions that the — like a taxi or fleet versus weekly drives RCM has seven basic steps:
process must answer. This chapter de- to church or to visit grandchildren. In an ■ 1. Identify the equipment / system to
scribes that process and several varia- industrial setting, manuals are not often be analyzed;
tions. You can check the SAE standard tailored to your particular operating en- ■ 2. Determine its functions;
for a comprehensive understanding of vironment. Your instrument air com- ■ 3. Determine what constitutes a fail-
the complete RCM criteria.
We cannot achieve reliability greater
than that designed into systems by their
designers. Each component has its own
unique combination of failure modes,
with their own failure rates. Each combi-
nation of components is unique and fail-
ures in one component may well lead to
the failure of others. Each “system” oper-
ates in a unique environment consisting
of location, altitude, depth, atmosphere,
pressure, temperature, humidity, salini-
ty, exposure to process fluids or prod-
ucts, speed, acceleration, etc. Each of
these factors can influence failure
modes making some more dominant
than others. For example, a level switch
in a lube oil tank will suffer less from cor-
rosion than the same switch in a salt
water tank. And an aircraft operating in
a temperate maritime environment is
likely to suffer more from corro s i o n
than one operating in an arid desert.
Technical manuals often recommend
a maintenance program for equipment
and systems. They sometimes take ac- Figure 12: RCM process overview
The problem
of uncertainty
What to do when your reliability
plans aren’t looking so reliable
by Murray Wiseman
When faced with uncertainty, our instinctive, human reaction is often indecision and distress. We would
all prefer that the timing and outcome of our actions be known with certainty. Put another way, we
would like all problems and their solutions to be deterministic. Problems where the timing and outcome
of an action depend on chance are said to be probabilistic or stochastic. In maintenance, however, we
cannot reject the latter mode, since uncertainty is unavoidable. Rather, our goal is to quantify the
uncertainties associated with significant maintenance decisions to reveal the course of action most likely
to attain our objective. The methods described in this chapter will not only help you deal with
uncertainty but may, we hope, persuade you to treat it as an ally, rather than a foe.
H
ow many of us have been told
since childhood that “failure is
the mother of success” and that
“a fall in the pit is a gain in the wit”.
Nowhere is this folk wisdom more val-
ued than in a maintenance depart-
ment employing the tools of reliability
engineering. In such an enlightened
environment, failures — an imperson-
al fact of life — can be leveraged and
converted to valuable knowledge fol- Figure 18: Monthly failure ages for 48 in-service items
lowed up with productive action. To Function; and 4) the Hazard Function. April, that is, 14, and dividing that by
achieve this lofty but attainable goal in Assume that a population of 48 the total population of items, 48, we
o ur o wn operation s w e re q u i r e a items purchased and placed in service may estimate th at th e cumu lative
sound quantitative approach to main- at the beginning of the year all fail by probability of the item failing in the
tenance uncert a i n t y. So, let’s begin November. first quarter of the year is 14/48. The
our ascent on solid ground — the eas- List the failures in order of their probability that all of the items will fail
ily conceptualized relative frequency f a i l u re ages as in Figure 18. Gro u p before November is 48/48 or 1.
histogram of past failures. them in convenient time segments, in By transforming the numbers of
this case by month. Plot the number failures into probabilities in this way,
The four basic functions of failures in each time segment as in the relative frequency histogram may
In this section we discover the Relative Figure 19 (on page 40). The high bars be converted into a mathematically
F requency Histogram and the four in the centre of Figure19, re p re s e n t more useful form called the probabili-
basic functions: 1) the Probability Den- the most “popular” (or most pro b a- ty density function (PDF). To do so,
sity Function; 2) the Cumulative Distri- ble) failure times. By adding the num- the data are replotted such that the
bution Function; 3) the Reliability b er of failures o ccurrin g prior to a rea under the cur ve re p resents the
Rearranging
Optimizing
time based
maintenance
Tools for devising a replacement
system for your critical components
by Andrew K.S. Jardine
The goal of this chapter is to introduce tools that can be used to derive optimal maintenance and
replacement decisions. Particular attention is placed on establishing the optimal replacement time for
critical components (also known as line-replaceable units, or LRUs) within a system.
Block replacement policies
W
e’ll also take a look at age and and the optimization is to obtain the
block replacement strategies The block replacement policy is some- best balance between the investment in
on the LRU level. This enables times termed the group or constant preventive replacements and the con-
the software package RelCode to be in- i nt e r val policy since preventive re- sequences of failure. This conflicting
troduced as a tool that can be used to placement occurs at fixed intervals of case is illustrated in Figure 23 for a cost
assist maintenance managers optimize time with failure replacements occur- minimization criterion where C(tp) is
their LRU maintenance decisions. We ring whenever necessary. The policy is the total cost per week associated with a
will also take a look at the optimizing illustrated in Figure 22 where Cp and policy of preventively replacing the
criteria of cost minimization, availabili- Cf are the total costs associated with component at fixed intervals of length
ty maximization, and safety re q u i re- preventive and failure replacement re- tp, with failure replacements occurring
ments. spectively. tp is the fixed interval be- whenever necessary. The equation of
tween preventive replacements. In the the total cost curve is provided in sev-
Enhancing reliability through f i g u re, you can see that for the first eral text books including the one by
preventive replacement cycle there is no failure, while there are D u ffuaa, Raouff and Campbell [see
The reliability of equipment can be en- two in the second cycle and none in the ref. 1] . The fo llow ing problem is
hanced by preventively replacing critical third or fourth. As the interval between solved using the software package Rel-
components within the equipment at ap- preventive replacements is increased Code, which incorporates the cost
propriate times. Just what the best time is there will be more failures occurring model, to establish the best preventive
depends on the overall objective, such as between the preventive replacements replacement interval.
cost minimization or availability maxi-
mization. While the best preventive re-
placement time may be the same for both
cost minimization and availability maxi-
mization, this is not necessarily the case.
Data first needs to be obtained and ana-
lyzed before it is possible to identify the
best preventive replacement time. In this
chapter several optimizing procedures
will be presented that can be used with
ease to establish optimal preventive re-
placement times for critical components. Figure 22: Block replacement policy
Result
Figure 24 shows a screen capture from
RelCode from which it is seen that the
optimal preventive replacement time is
4,140 kilometers. In addition the Figure
provides much additional information
that could be valuable to the mainte-
nance planner. For example:
■ The cost saving compared to a run-to-
failure policy is: $ 0.1035/km (55.11%)
■ The cost per kilometer associated
with the best policy is: $ 0.0843/km
Statement of problem
A sugar refinery centrifuge is a complex
machine composed of many parts and
Figure 24: RelCode output: block replacement
subject to sudden failure. A particular
component, the plough-setting blade, is
considered to be a candidate for preven-
tive replacement. The policy to be consid-
e red is the age-based policy with
preventive replacements occurring when
the setting blade reaches a specified age.
What is the optimal policy to so that total
cost per hour is minimized?
To solve the problem the following
data have been acquired:
■ 1. The labour and material cost asso-
Figure 25: Age-based replacement policy
ciated with a preventive or failure re-
Statement of problem times as expensive as a preventive re- placement is $2000;
Bearing failure in the blower used in placement. We need to determine the ■ 2. The value of production losses as-
diesel engines has been determined as optimal preventive replacement interval sociated with a preventive re p l a c e-
occurring according to a Weibull distri- (or block policy) to minimize total cost men t is $ 1000 wh ile fo r a fai lu re
bution with a mean life of 10,000 km and per kilometer. What is the expected cost replacement it is $7000;
a standard deviation of 4,500 km. Failure saving associated with the optimal policy ■ 3. The failure distribution of the set-
in service of the bearing is expensive over a run-to-failure replacement policy? ting blade can be described adequately
and, in total, a failure replacement is 10 Given that the cost of a failure is by a Weibull distribution with a mean
Optimizing
condition based
maintenance
Getting the most out of your
equipment before repair time
by Murray Wiseman
Condition Based Maintenance (CBM) is an obviously good idea. It stems from the logical assumption
that preventive repair or replacements of machinery and their components will be optimally timed if they
were to occur just prior to the onset of failure. Our objective is to obtain the maximum useful life from
each physical asset before taking it out of service for preventive repair.
T
ranslating this idea into an effec- build a model describing the mainte- been an increasing number of re f e r-
tive monitoring program is imped- nance costs and reliability of an item. In ences which include applications to ma-
ed by two difficulties. Chapter 5 we dealt uniquely with situa- rine gas turbines, motor generator sets,
The first is: how to select fro m tions in which the lifetimes of compo- nuclear reactors, aircraft engines, and
among the multitude of monitoring pa- nents were considered independent disk brakes on high-speed trains. In
rameters, those which are most likely to random variables, meaning that no 1995 A.K.S. Jardine and V. Makis at the
indicate the machine’s state of health? other information, other than equip- University of To ronto initiated the
And the second is: how to interpret and ment age, was to be used in scheduling CBM Consortium Lab [ref. 2] whose
quantify the influence of the measure- p reventive maintenance. CBM intro- mission was to develop general-purpose
ments on the remaining useful life duces new information, called covari- software for proportional hazards mod-
(RUL) of the machinery? We’ll address ates, which influence the probability of els analysis. The software was designed
both of these problems in this chapter. failure at time t. Consequently the mod- to be integrated into the operation of a
The essential questions posed when els of Chapter 5 will be extended to in- plant’s maintenance information sys-
implementing a CBM program are: clude the impact of these measure d tem for the purpose of optimizing its
■ 1. Why monitor? covariates (for example, the parts per CBM activities. The result in 1997 was a
■ 2. What equipment components to million of iron in an oil sample, the am- program called EXAKT (produced by
monitor? plitude of vibration at 2 x rpm, etc.) on Oliver Interactive) which, at the time of
■ 3. How (what parameters) to moni- the remaining useful life of machinery this writing, is in its second version and
tor? or their components. The extended rapidly earning attention as a CBM op-
■ 4. When (how often) to monitor? modelling method we introduce in this timizing methodology. The example
■ 5. How to interpret and act upon the chapter, which takes measured data into and associated graphs and calculations
results of condition monitoring? account, is known as Proportional Haz- given in this chapter have been worked
Reliability Centred Maintenance ards Modelling (PHM). using the EXAKT program.
(RCM) as described in Chapter 3 assists Since D.R. Cox’s [ref. 1] 1972 pio- Readers who wish to work through
us in answering questions 1 and 2. Addi- neering paper on the subject of PHM, the examples using the program may
tional optimizing methods are required the vast majority of re p o rted uses of do so by obtaining a demonstration ver-
to handle questions 3, 4, and 5. We Proportional Hazards Modelling have sion from the author [ref 3].
learned in Chapters 4 and 5 that the way been for the analysis of survival data in We, as maintenance engineers, plan-
to approach these types of problems is to the medical field. Since 1985 there have ners, and managers try to perform Con-
Data transformations
The modeller needs to have at his fin-
gertips, not only the actual data, but
also any combinations (transform a-
tions) of that data which he feels may
be influential covariates.
One obvious transformed data field
of interest is the lubricating oil’s age, Figure 30: Cross graph of lead vs iron ppm
which is usually not directly available in iron or lead to be low just after an oil of oil age and iron negates this theory
the database, but can be calculated change and increase linearly as the oil except for very low ages of the lubricat-
knowing the dates of the oil change ages and as particles accumulate and ing oil. Hence the modeller may con-
(OC) events. In the current example stay resident in the oil circulating sys- clude that oil age, in this system, is not
one would expect the “wear metals” tem. A cross graph (Figure 31, page 62) a significant covariate.
Equation 1:
ß ß-1
h(t) = _
η ( tη_ ) e γ1Z1 (t)+γ2Z 2(t)+…+γ nZ n(t)
Discussion of transition
probability
Once the modeller has established the
covariate bands, the software calcu-
lates the Markov Chain Model Transi-
tion Probability Matrix (Figure 33),
Figure 32: The transition probability model
which is a table that shows the proba-
bilities of going from one state (for ex- 0 13.013 Above
ample, light contamination, medium IRON to 13.013 to 26.949 25.949
contamination, heavy contamination) 0 to 13.013 0.864448 0.11918 0.0163723
to another, between inspection inter- 13.013 to 25.949 0.139931 0.657121 0.202947
vals. Or more formally stated “The Above 25.949 0 0 1
table provides a quantitative estimate
Figure 33: The Markov chain model transition probability matrix
of the probability that the equipment
will be found in a particular state at the Step 5: The optimal decision variate, Z — a weighted sum of those co-
next inspection, given its state today.” In this step we “tell” the model (consist- variates having been statistically deter-
It means that given the present value ing thus far of the PHM and the Transi- mined to influence the probability of
range of a variab le we can pre d i c t tion Probability models) the respective failure. Each covariate measured at the
(based on history) with a certain proba- costs of a preventive and failure instigat- most recent inspection will have con-
bility, its value range at the next inspec- ed repair or replacement. tributed its value to Z. That contribu-
tion moment. For example, if the last The replacement decision graph, tion will have been weighted according
inspection showed an iron level less F i g u re 34 (page 66), is the culmina- to its degree of influence on the risk of
than 13 parts per million (ppm) we tion of the entire modelling exercise failure in the next inspection interval.
may, in this example, predict that the to date. It combines the results of the The advantage is evident. On a single
next record will be between 13 and 26 proportional hazards model, the tran- graph one has obtained the distilled in-
ppm with a probability of 12 percent sition probability model, and the cost formation upon which to base a replace-
and more than 26 ppm, with probability function to display the best decision ment decision. The alternative would
1.6 percent. Probabilities such as the 12 policy re g a rding the component or have been to examine trend graphs of
percent and 1.6 percent are called tran- system in question. dozens of condition parameters and
sition probabilities. The ordinate is the composite co- “guess” at whether to repair or replace
or 800-263-3993
Dial One Wolfedale Electric Ltd. 10 52 905-564-8999 or 800-303-7568 905-564-5677 sales@dialonewolfedale.com www.dialonewolfedale.com
Duracell Division
or 800-565-6990
F O RY O U RI N F O R M AT I O N
H OW TO C O N N E C T W I T H A DV E RT I S E R S I N T H I S I S S U E
Toyota Canada Inc., Industrial 47 30-31 416-438-6320 Ext 2327 800-592-3851 forklift@toyota.ca www.forklift.toyota.ca
Equipment Division
L I T E R AT U R E R E Q U E S T F O R M PEM 12/99
2 10 18 26 34 42 50 58 66 74 82 Co. name:
3 11 19 27 35 43 51 59 67 75 83 Address:
5 13 21 29 37 45 53 61 69 77 85 Phone:
6 14 22 30 38 46 54 62 70 78 86 Fax:
Searching the
Web for reliability
information
Looking for useful Internet sites?
Here’s where to start
by Paul Challen
In the preceding pages of The Reliability Handbook, we’ve seen a full discussion about ways in which
maintenance professionals can increase uptime in their plants. Part of this discussion hinges on the
need to use technology to garner the kind of information needed to implement reliability decisions.
Along with the various software packages and databases our authors have recommended, there are
a myriad of other resources available to people searching for reliability information — and one of the
most useful places to look is on the Internet.
T
he following is a by-no-means-com- The RAC web site contains a num- print material on the subject, there is a
prehensive listing of some of the ber of useful information re s o u rc e s , comprehensive general list of books lo-
reliability resources you’ll find on including a bibliographic database of cated at http://www.quality.org/Book-
the Net. We encourage readers to ex- books, standards, journal art i c l e s , store. For books on reliability that you
plore these sites and to follow the many symposium papers, and other docu- can order on-line through the Ama-
reliability links emanating from them. ments on re l i a b i l i t y, maintainability, zon site (www.amazon.com) on the
q u a l i t y, and support a b i l i t y. It also s ub j ect of re l i a b i l i t y, you can tr y :
Reliability Analysis Center maintains a calendar of upcoming http://www.quality.org/Bookstore/Re-
Located at http://rac.iitri.org, this events throughout the industr y, in- liability.htm.
site bills itself as the “center of relia- f o r mat io n ab o u t l in ks t o ot her There is also a complete bibliogra-
bility and maintainability excellence databases, a “data sharing consor- phy of U.S. government reliability doc-
for over 30 years.” The RAC is operat- tium” with information on non elec- uments at http://www. i n c o s e . o rg /
ed by the IIT Research Institute in the t ronic parts reliability data, failure lib/sebib5.html that covers a wide
U.S., and provides information to in- mode distributions, and electrostatic range of technical topics.
d u s t r y via data bases, methodology discharge susceptibility data.
handbooks, state-of-the-art technolo- The RAC has also compiled two im- Professional organizations
gy reviews, training courses and con- p o rtant lists, one of frequently used The RAC site also contains a huge list of
sul t in g ser vices. I ts mi ssi on is t o acronyms in the reliability world, and relevant professional organizations, at
provide technical expertise and infor- another (at http://rac.iitri.org / c g i http://rac.iitri.org/cgi-rac/sites?00013.
mation in the engineering disciplines rac/sites?0) of navigable links to other One of the most useful of these, from
of reliability, maintainability, support- Internet sites on related topics. a reliability standpoint, is the Society for
ability and quality and to facilitate Maintenance & Reliability Professionals
their cost-effective implementation Book and print material sources (SMRP) site at http://www. s m r p . o rg .
throughout all phases of the product For those reliability professionals who This organization is an independent,
or system life cycle. are searching for good old- fashioned n o n - p rofit society devoted to practi-