Professional Documents
Culture Documents
www.elsevier.com/locate/ress
Abstract
The semi-Markov decision model is a powerful tool in analyzing sequential decision processes with random decision epochs. In this paper,
we have built the semi-Markov decision process (SMDP) for the maintenance policy optimization of condition-based preventive
maintenance problems, and have presented the approach for joint optimization of inspection rate and maintenance policy. Through numerical
examples, the improvement of this method is compared with the scheme, which optimizes only over the inspection rate. We also find that
under a special case when the deterioration rate at each failure stage is the same, the optimal policy obtained by SMDP algorithm is a
dynamic threshold-type scheme with threshold value depending on the inspection rate.
q 2004 Elsevier Ltd. All rights reserved.
the threshold-type policy for the maintenance action. No g minimal preventive maintenance threshold
maintenance action is taken if the device deterioration stage is b major preventive maintenance threshold
found to be smaller than the minimal maintenance threshold; li failure rate at stage i
minimal maintenance is performed if the device deterioration Fin time to inspection trigger interval distribution
stage is found to be between the minimal maintenance Fd distribution of the time to carry out an inspection
threshold and the major maintenance threshold; and major Fm preventive minimal maintenance duration distribution
maintenance is carried out if the device deterioration stage is FM preventive major maintenance duration distribution
larger than the major maintenance threshold. This model was FR failure repair time distribution
captured by a stochastic Petri net, and its optimal inspection F system failure state
interval to maximize the system availability is presented.
Closed-form results for such threshold-type condition-based Consider the epoch at which the system fails and enter
maintenance problems are reported in [1]. state F as the fictitious decision epoch. The only action at
Above works have only considered the optimization of this epoch is to repair the system to as good as new state. Let
system parameters. We notice that another important factor in i2IZ{0,1,.,kC1} define a system state where the system
determining the overall system availability is the maintenance is found to be in deterioration stage i with 0%i%k, or failure
policy, i.e. the action to be taken at each deterioration stage. state F represented by iZkC1, during an inspection.1 For
Improperly designed policies may considerably hamper the simplification of representation, in the following of our
ability of the system to meet certain design objectives, even paper we call state 0%i%k inspection state and state iZkC
with carefully chosen parameters. For this reason, in this paper 1 failure state.
we aim at joint optimization of both system parameters and In each state, a decision needs to be made on the action to
system maintenance policy. perform according to the system deterioration stage. The
One possible approach to tackle this problem is by solving action should include both the parameter(s) to determine the
the system model and search for the optimal combination next inspection time q, and the maintenance action (do
exhaustively. However, when the number of failure stages is nothing, perform minimal maintenance, or perform major
large, this work is cumbersome. For this reason, we maintenance). We represent the action at the nth decision
formulate the semi-Markov decision process (SMDP) epoch as a two-tuple (An, q) where An is the maintenance
model [6] for the condition-based maintenance problem, action to be performed and q is the mean time to next
based on the assumption that the system behavior may be inspection. Then, for states iZ0,.,k the possible action
captured by Markovian models. Besides deterioration fail- An is
ures, in our model, we have also considered Poisson failures,
8
which are defined as the type of failures under which the > 0; no action is taken
<
system fails abruptly rather than gradually, as with the
An Z 1; minimal maintenance is performed (1)
deterioration failures. For Poisson failures, we assume >
:
the repair action is to restore the system to the operable 2; major maintenance is performed
state it was just in before the failure.
The joint optimization of system parameters (the inspec- and in state iZkC1 the only action is to repair the system to
tion rate in our case) and the maintenance policy is as good as new state with q as the next inspection time
performed by taking the inspection rate as input parameter distribution parameter.
to the SMDP model. For each individual inspection rate the Let Yn, n2NZ{0,1,.} be the system state at the nth
SMDP model is solved for the optimal policy, based on decision epoch. Then, the state transition probability could
which the system CTMC model is constructed and solved. be derived.
From these results, we may thus obtain the best combinations For the transition probability from one inspection state
of system inspection rate and system maintenance policy. with deterioration stage i to the next inspection state with
This paper is organized as follows: In Section 1 we deterioration stage j we have
introduce the notation, and the Markov model and the SPN
model for the preventive maintenance problem. In Section 2 PðYnC1 Z jjYn Z i; An Z a; qn Z qÞ
we formulate the MDP problem for determining the optimal 8 ðN
>
> j
maintenance policy, and in Section 3 numerical results are > P~ i ðtÞ dFin ðt; qÞ
> a Z0
>
>
presented. Finally, Section 4 concludes the paper. < ðN0
j
Z P~ iK1 ðtÞ dFin ðt; qÞ a Z 1 (2)
>
> ðN
0
2. Notation and the SMDP formulation >
>
>
> j
: P~ 0 ðtÞ dFin ðt; qÞ a Z2
0
In this paper, the following notations are used:
j
where 0%i%kC1, 0%j%k, and P~ i ðtÞ is the probability that and find the stationary policy R(n) that minimize the
the system state changes from i to j without any inspection above equation.
j
event. Computation of P~ i ðtÞ is shown in Appendix A. Step 2. Compute the bounds mn Z minx2I ðVn ðxÞK VnK1 ðxÞÞ
The transition probability from an inspection/failure state and Mn Z maxx2I ðVn ðxÞK VnK1 ðxÞÞ:
to failure state F is If 0%(MnKmn)%3mn, where 3 is a pre-specified
PðYnC1 Z k C 1jYn Z i; An Z a; qn Z qÞ accuracy number, the algorithm is stopped with the
policy R(n). Otherwise, go to step 3.
ðN
kC1 Step 3. nZnC1 and go to step 1.
Z ð1 K Fin ðt; qÞÞdðP~ i ðtÞÞ (3)
0
where 0%i%k.
The expected time to the occurrence of next decision
epoch, given current state i and action a is
8 ðN ðN
>
> ~ kC1
>
> ð1 K P i ðtÞÞð1 K F in ðtÞÞdt C Fd ðtÞ dt 0% i% k; a Z 0
>
> ðN
0 ð0N
>
>
>
> kC1
< ð1 K P~ iK1 ðtÞÞð1 K Fin ðtÞÞ dt C ðFd ðtÞ C Fm ðtÞÞ dt 0% i% k; a Z 1
tði; aÞ Z ð0
N ðN0
(4)
>
> ~ kC1
>
> ð1 K P0 ðtÞÞð1 K Fin ðtÞÞ dt C ðFd ðtÞ C FM ðtÞÞ dt 0% i% k; a Z 2
>
> ðN
0 ðN0
>
>
>
> ~ kC1
: ð1 K P0 ðtÞÞð1 K Fin ðtÞÞ dt C FR ðtÞ dt i Z k C1
0 0
The value iteration algorithm is summarized as follows: and the parameters are chosen as follows
Step 0. Choose V0(x) such that 0%V0(x)%mina k Z 10; td Z 0:5; tm Z 0:5; tM Z 0:5; tR Z 100:
(c(x,a)/t(x,a)) for all x. Let nZ1. and we assume the deterioration rate li at each stage is the
Step 1. Compute the function Vn(x), x2I, from same with a value of 0.03.
" The cost for each state is chosen in such a way that the
cðx; aÞ t X total cost represents the system steady-state availability, i.e.
Vn ðxÞ Z min ; C P V ðyÞ
a2AðxÞ;q tðx; aÞ tðx; aÞ y2I xay nK1 cdZ1, cmZ1 with all other costs equal to zero.
Fig. 1 shows a comparison between the MDP scheme and
t the scheme with fixed minimal and major maintenance
C 1K V ðxÞ; x 2I; ð6Þ
tðx; aÞ nK1 threshold. With MDP scheme, maximal availability of 0.962
28 D. Chen, K.S. Trivedi / Reliability Engineering and System Safety 90 (2005) 25–29
« (A4)
j
dP~ i ðtÞ j jK1
Z Klj P~ i ðtÞ C ljK1 P~ i ðtÞ; i! j% k (A5)
dt
kC1
X
k
m
P~ i ðtÞ Z 1 K P~ i ðtÞ (A6)
mZi
Fig. 2. Thresholds for the optimal maintenance policy.
D. Chen, K.S. Trivedi / Reliability Engineering and System Safety 90 (2005) 25–29 29
kC1
X
k
m
P~ kC1 ðtÞ Z 1 K P~ kC1 ðtÞ (A9)
mZ0
References
j j
and thus P~ kC1 ðtÞZ P~ 0 ðtÞ; 0% j% kC 1:.
Performing Laplace transform, for 0%i%k we have [1] Chen D-Y, Trivedi KS. Closed-form analytical results for condition-
based maintenance. Reliab Eng Syst Saf 2002;(76):43–51.
i i
sP~ i ðsÞ Z 1 K li P~ i ðsÞ (A10) [2] Hosseini MM, Kerr RM, Randall RB. An inspection model with
minimal and major maintenance for a system with deterioration and
j j jK1 Poisson failures. IEEE Trans Reliab 2000;49(1):88–98.
sP~ i ðsÞ Z Klj P~ i ðsÞ C ljK1 P~ i ðsÞ; i! j% k (A11) [3] Legat V, Zaludova AH, Cervenka V, Jurca V. Contribution to
optimization of preventive maintenance. Reliab Eng Syst Saf 1996;
kC1
X
k
m
51:259–66.
P~ i ðsÞ Z 1 K P~ i ðsÞ: (A12) [4] Sheu S-H, Yeh RH, Lin Y-B, Juang M-G. A Bayesian approach to an
mZi adaptive preventive maintenance model. Reliab Eng Syst Saf 2001;71:
j j 33–44.
and P~ kC1 ðsÞZ P~ 0 ðsÞ; 0% j% kC 1. [5] Sim SH, Endrenyi J. Optimal preventive maintenance with repair.
i0
Therefore, the Laplace transform of P~ i ðtÞ may be written IEEE Trans Reliab 1988;37(1):92–6.
as [6] Tijms HC. Stochastic modelling and analysis: a computational
approach. New York: Wiley; 1986.
QjK1
j l [7] Vaurio JK. On time-dependent availability and maintenance optimiz-
P~ i ðsÞ Z Qj mZi m ; i% j% k (A13) ation of standby units under various maintenance policies. Reliab Eng
mZi ðs C lm Þ Syst Saf 1997;56:79–89.