Professional Documents
Culture Documents
A new method for gear crack detection is presented. It consists of the coupling of
adaptive demodulation with an abrupt change detector. The adaptive algorithm is intended
to account for the slow variations of the signal. Two methods for this adaptive
demodulation are proposed: a RLS approach based on a linear version of the signal model
and a LMS approach which directly estimates the physical parameters (amplitude and
phase) for each harmonic considered. Their respective advantages and disadvantages are
discussed and the superiority over the Hilbert transform approach is shown. The crack
detection is formulated as an abrupt change detection problem solved by the sequential
monitoring of the prediction error. Its effectiveness is demonstrated through the application
of the demodulation detection algorithm to gearbox under reliability tests.
7 1997 Academic Press Limited
1. INTRODUCTION
Perfect toothed-gear vibration signals (identical teeth with constant spacing) are periodical
with period Teng = 1/feng where feng is the meshing frequency (product of the number of teeth
and the shaft rotation frequency fr ). Due to non-linearities in the meshing process, the
spectrum of the signal contains not only the fundamental, but also harmonics of the
meshing frequency [1]. Nevertheless, real gears are not perfect. Non-constant teeth spacing
results in a contact point not strictly on the pitch circle and may cause frequency
modulations around its nominal value feng , while irregularities of the teeth contact surfaces
cause mechanical load variations which show up in the vibration signal as amplitude
modulations [3]. Such amplitude and phase modulations are also observed with wear
(increase) or grinding (decrease).
A localised fault, like a gear crack, also involves those two kinds of modulation, but
while the first kind of fault leads to slow variations, a gear crack results in abrupt variations
of the amplitude and phase at the wheel cycle scale. This transitory event is 1/fr periodical
where fr is the rotation frequency of the faulty wheel. This periodical non-stationarity may
be seen as a cyclostationarity [4], depending on the observation scale.
The model corresponding to these phenomena is given by:
m
x(t) = s ak(t) cos (2pkfeng t + 8k(t) ) + e(t) . (1a)
k=1
where m is the number of harmonics considered and e(t) is an additive white noise. The
complex signal y(t) associated to x(t) is the following:
m
y(t) = s ak(t) ej(2pfeng t + 8k (t )) + e(t) . (1b)
k=1
149
0888–3270/97/010149 + 19 $25.00/0/mg960 7 1997 Academic Press Limited
150 . .
where, in this case, e(t) stands for a complex additive white noise, and ak(t) , 8k(t) are the
amplitude and phase modulation laws. These laws are 1/fr periodical, but at one wheel
rotation scale, they have to be considered as random (see [2, 4]). As a consequence, the
adequate framework for studying such phenomena is the class of random amplitude and
phase modulation processes.
The detection of gear cracks has motivated many works [5, 6, 7]. Among the numerous
possible aproaches, one of the most promising is to monitor the amplitude and phase
modulations. To do this, the classical approach is based upon the Hilbert transform of
the real bandpass filtered signal, where instantaneous amplitude and phase are estimated
as the instantaneous amplitude and phase of the analytic signal and the detection is
performed by comparison of these instantaneous values to an adequate threshold.
The approach proposed here is quite different. Indeed, non-stationarities expressed in
model (1a, b) through time-varying amplitude and phase modulations are of two kinds:
slow non-stationarities induced by distributed faults (wear, shaft misalignment); and
abrupt non-stationarities representative of localised faults (cracks). In this framework, an
adaptive algorithm is used to estimate the slowly time-varying amplitude and phase. A
sequential abrupt change detection algorithm is then coupled to the adaptive algorithm
to decide whether a crack is present or not.
The organisation of the paper is as follows. In the next section, we present results about
random amplitude and phase modulations. More particularly, we focus on the equivalent
spectral bandwidth of such processes. In Section 3, two different approaches for the
adaptive demodulation are presented. They are based respectively on a recursive least
squares and a least mean squares algorithm, each depending on a different discrete version
of model (1a). The choice of the design variables is addressed and the results of Section
2 are used to discuss the adequation of the different possible methods including the Hilbert
transform. Then, in Section 4, the question of crack detection is discussed. It is treated
as an abrupt change detection problem which is solved by the sequential monitoring of
the prediction error provided by the adaptive algorithm. Finally, in Section 5, the good
performances achieved with the proposed demodulation/detection algorithm are
demonstrated on a real process, consisting of a car gearbox under reliability tests. Its
application to optimal sensor location is then considered.
g
+a
v 2 =Y(v) = 2 dv
−a
Dvy2 = (5)
g
+a
=Y(v) =2 dv
−a
where for notation simplification the mean frequency is taken equal to 0 (this is always
the case for real modulation laws). Elementary calculations lead to:
Dvy = Dva (6)
The spectral bandwidth of {y(t) } is the same as that of {a(t) } and it does not depend on the
amplitude modulation rate. The latter only acts on the power of the random process.
= A 2 e−a =Z(v − v 0) =2
2
(9)
g
+a
−a
Under the supplementary assumption that the ACF of {8(t) } is also gaussian, the equivalent
spectral bandwidth of {y(t) } is simply given by:
Dvy2 = a 2 Dv82 (10)
Dvy = a Dv8
The interpretation of this result is that, in the case of a gaussian ACF, the spectral
bandwidth increases linearly with the modulation rate. However, the latter does not change
the power of the random process {y(t) }.
152 . .
2.3.
The extension of the previous results to simultaneous amplitude and phase modulations
seems to be tractable, but has not been treated yet. However, according to the previous
results, an important, but not often mentioned, limitation of the Hilbert transform
approach may be pointed out. Indeed it only makes use of a single filter, while two different
width filters could be used to obtain more accurate estimates (one for the amplitude and
one for the phase). We will return to this point in Section 3.3.
m
xn = s ak,n cos (2pkfeng n + 8k,n ) + en . (11)
k=1
where for notation simplicity, it is assumed that the sample frequency equals 1, that is to
say that feng is the normalised meshing frequency. As pointed out before, two types of
non-stationarities are included in model (11): slow non-stationarities are intended to
account for the so-called distributed faults, while abrupt non-stationarities can account for
localised faults, say gear cracks. In this section, we focus on the adaptive estimation of
the slowly varying amplitude and phase. Model (11) may be exploited in two ways. A first
possible approach is to work with the linear version of (11) for which classical recursive
least squares (RLS) techniques are available. A second possible approach directly estimates
the parameters of model (11), namely amplitudes and phases. For this non-linear
estimation, a stochastic gradient (or least mean squares) method is suggested.
xn = HnTun + en
HnT = [cos (2pfeng n) sin (2pfeng n) ··· (cos (2pmfeng n) sin (2pmfeng n)]T. (12)
unT = [a1,n cos 81,n a1,n sin 81,n ··· am,n cos 8m,n am,n sin 8m,n ]T
where un is the parameter vector to estimate. The model being linear with respect to the
parameter, the estimation may be achieved by the RLS algorithm with forgetting factor [8]:
Kn = (l + HnTPn − 1 Hn )Pn − 1 Hn
en = xn − HnTu
n
(13)
u
n = u
n − 1 + Kn en
The forgetting factor l is introduced to enable the tracking of the time-variable parameters.
Its choice will be discussed in Section 3.3. The main advantage of this approach lies in
the linearity of the model used, while its main drawback is that it does not give a direct
access to the physical parameters, namely the amplitudes and the phases of the different
components of the signal under study. Particularly, unwrapping techniques have to be
employed to obtain the phase. This can make the analysis more difficult.
153
3.2.
To handle with the aforementioned limitations, we may perform the direct estimation
of the amplitude and phase. Indeed, the vector parameter:
unT = [a1,n 81,n ··· am,n 8m,n ]T. (14)
may be estimated as the one which minimises the LMS criterion defined by:
J(un ) = E{en2 } (15)
m
en = xn − s ak,n − 1 cos (2pkfeng n + 8k,n − 1 )
k=1
u
n = arg min J(un )
un
The criterion being non-linear with respect to the parameter, the application of the RLS
technique is not possible and non-linear optimisation techniques have to be used.
Derivating the criterion J with respect to u gives:
1 1
J(un ) = (E{en2 }) = −2E{en 9n }
1un 1un
K
G cos (2pfeng n + 81,n − 1 ) L
G
G −a1,n − 1 sin (2pfengn + 81,n − 1) G
9n =G ..
.
G
G G
G cos (2pmfeng n + 8m,n − 1 )
k−am,n − 1 sin (2pmfengn + 8m,n − 1) G
l
Estimating the gradient E{en 9nT } by its instantaneous value, leads to the stochastic gradient
(or LMS) algorithm:
u
n = u
n − 1 + gGen 9
n (16)
m
en = yn − s âk,n − 1 cos (2pkfeng n + 8̂k,n − 1 )
k=1
K
G cos (2pfeng n + 8̂1,n − 1 ) L
G
G −â1,n − 1 sin (2pfengn + 8̂1,n − 1) G
9
n = G ..
.
G
G G
G cos (2pmfeng n + 8̂m,n − 1 ) G
k−âm,n − 1 sin (2pmfengn + 8̂m,n − 1) l
The matrix adaptation gain gG is the design variable which enables to control the tracking
behaviour of the adaptive algorithm. Its choice will be discussed in Section 3.3.
It should be noted that the criterion has an infinite number of minima. As an example,
in the case of a monocomponent carrier, the set of parameter minimising the criterion may
be expressed as:
6$ % 7 6$ % 7
a −a
U= , k $Z * , k $Z
8 + 2kp 8 + (2k + 1)p
The error performance surface corresponding to the LMS criterion has the shape shown
on Fig. 1. Depending on the initial value of the parameter, each local minimum may be
154 . .
reached by the adaptive algorithm. Moreover, the adaptation gain should be chosen small
enough to ensure the stability of the algorithm. In other words, a small gain value ensures
that it remains in the same local minimum.
which corresponds to independent variations of the amplitudes and phases. This gain
matrix does not provide the optimal search direction as defined in [10], but practical results
achieved with it are very satisfactory. These diagonal elements may be fixed by successive
trials. The access to the non-diagonal elements is more complicated because they
Figure 2. Gearbox.
156 . .
describe the dependency between the variations of each parameter. The individual control
of the adaptation gain for each parameter is the main advantage of the LMS approach.
Indeed, as pointed out before, acting on the adaptation gain is equivalent to varying the
filter bandwidth. As a consequence, having an independent control on each gain is
equivalent to choosing filters with bandwidths well-adapted to the variations of each
parameter. This is certainly the most efficient approach because, according to the results
of Section 2, amplitude and phase modulations result in different spectral bandwidths of
the modulated processes.
The RLS algorithm does not offer such a possibility. However, it should be noted that
the recursive propagation of the matrix Pn may be interpreted as an effective procedure
to automatically determine the optimal adaptation matrix gain. Even so, it does not ensure
the optimal search direction since it corresponds to a particular model of the parameter
variations (see [10] for details). The main limitation of the RLS approach is that it does
not enable a direct control of the physical parameter variations. Thus, both of the proposed
methods handle with the limitation of the Hilbert transform method mentioned in Section
2.3.
As a conclusion, Table 1 summarises the respective advantages and disadvantages of the
three different approaches (LMS, RLS, Hilbert), in terms of accuracy of estimation, choice
simplicity of the design variables, numerical complexity and possibility of a sequential data
processing for real time applications. Note that it is possible to further reduce the
numerical complexity of recursive algorithms using faster versions, but this will always be
done to the detriment of their numerical stability.
157
4. SEQUENTIAL DETECTION OF GEAR CRACKS
The crack detection problem may be seen as an abrupt change detection problem which
can be stated as the following hypotheses testing problem:
H0 : un = u
n (un actual parameter, u
n estimated parameter)
\r E N tq :u = un if n Q r
H1 :
u = un + du if n e r
H0 hypothesis simply states that the actual and estimated parameters are (approximately)
equal, while H1 hypothesis states that an abrupt change has occurred at the (unknown)
time r. A classical approach to solve this test is the sequential monitoring of the prediction
error (residues) variance. According to the classification proposed in [11], this is a
one-model approach, because it does not use an estimation of the parameter under H1
hypothesis as in the generalised likelihood ratio test. It also has similarities with the method
proposed in [6], the main difference being the way the residues are obtained (subtraction
of the measured and averaged signals). Assuming that, under H0 , the prediction error is
normally distributed with variance s 2, the statistic Tn = en2 /s 2 is x 2 distributed with 1 dof.
It follows immediately that:
EH0 (Tn − 1) = 0
Moreover, it may be shown [12] that:
EH1 (Tn − 1/Hn ) = entropy under H1 − entropy under H0 + Kullback information
As a consequence, a change will result in an increase of the prediction error energy if, and
only if, EH1 (Tn − 1/Hn ) q 0. The Kullback information is always positive. Thus, a sufficient
condition is that the entropy of the stochastic process under H1 is greater than or equal
to the entropy under H0 . Under the simplifying assumption that the parameters are
constant before and after change, a sufficient condition is that the variance of the additive
noise does not decrease after change.
Considering the statistic Tn − 1 normally distributed with zero mean, the previous
abrupt change detection problem is equivalent to the sequential detection of positive jumps
in a gaussian sequence. This may be done in an optimal manner by the Page–Hinkley test
(see [11] for example), which results in the following stopping rule:
na = inf (n/gn e s) (17)
0 1
n
gn = max gn − 1 + Tn − 1 − , 0 (18)
2
The stopping time na is the first time at which the statistic gn is greater than the threshold
s. The latter has to be chosen in order to achieve the performances desired. The parameter
n represents the minimum jump magnitude which has to be detected. Finally, the change
time may be estimated as the last time at which gn was equal to 0.
To end this section, we discuss some advantages of the proposed approach over the
Hilbert Transform approach. In the latter, the decision is made directly on the estimated
amplitude and phase. However, as pointed out in Section 3, these estimates may be biased.
This is particularly true in the case of localised faults due to their large bandwidths. As
a consequence, decisions based upon this approach may lead to erroneous results. For that
reason, we don’t work directly with the estimated parameters provided by the adaptive
algorithm. On the contrary, the proposed approach tries to measure a distance between
158 . .
Figure 4. Averaged signal without (left) and with (right) fault (First gear, first sensor position).
a reference (time-variable) and the actual model’s by looking at the statistical properties
of the residual signal. The coupling of one of the adaptive algorithms with the abrupt
change detection method is also very important because it will result in a robust detection
method in the sense that it will be insensitive to the slow variations of amplitude and phase.
Finally, an other advantage of this two-step method lies in the fact that it is a sequential
procedure which enables a real-time implementation.
159
vibration signal recorded on the casing represents the response of the mechanical structure
to forces introduced on the one hand by gear meshing and bearings and on the other hand
by the circulation of the lubricant. Faults were induced successively on each wheel of the
driven shaft.
6. CONCLUSIONS
A new method for gear crack detection is proposed. The approach used consists of the
coupling of adaptive demodulation with an abrupt change detector. The adaptive
algorithm is intended to account for the slow variations of the signal. Two methods for
this adaptive demodulation are proposed: a RLS approach based on a linear version of
the signal model and a LMS approach which directly estimates the physical parameters
amplitude and phase for each harmonic considered. Their respective advantages and
disadvantages are discussed and the superiority over the Hilbert Transform approach is
shown. The crack-detection is formulated as an abrupt change detection problem solved
by the sequential monitoring of the prediction error.
The application of the demodulation detection algorithm to gearbox under reliability
tests demonstrates its effectiveness.
The proposed algorithm exhibits good properties in the following points:
the coupling of the two stages ensures good robustness against slow amplitude and phase
variations not representative for crack faults;
the design variables are easy to choose (RLS approach);
the numerical complexity is lower than that of the Hilbert transform approach;
the algorithm is sequential allowing its real time implementation.
162
. .
Figure 8. Maximum of the detection statistic for the 12 sensor locations The optimal sensor location for gears 1 to 5 are respectively: 2, 1, 11, 5 and 5..
163
At the present time, the algorithm is implemented on a DSP board and will be part of a
monitoring tool for gearbox reliability tests.
REFERENCES
1. R. B. R 1981 Journal of Mechanical Design 1, 259–268. A new method of modelling gear
faults.
2. W. D. M 1978 Journal of the Acoustic Society of America 63, 1409–1429. Analysis of the
vibratory excitation of gear systems: basic theory.
3. C. F and P. P 1992 Proceedings of Colloque Progrès Récents des Méthodes de
Surveillance Acoustiques et Vibratoires, 639–649. Surveillance et diagnostic des engrenages.
4. C. C, M. S and J. L. L 1995 Proceedings of the 2nd International
Symposium Acoustical and Vibratory Surveillance Methods and Diagnostic Techniques 1,
391–401. Apport de la théorie des processus cyclostationnaires à l’analyse et au diagnostic des
engrenages.
5. P. D. MF 1986 Journal of Vibration, Acoustics, Stress and Reliability in Design 108,
165–170. Detecting fatigue cracks in gears by amplitude and phase demodulation of the meshing
vibrations.
6. P. D. MF and J. D. S 1985 Proceedings of the Institution of Mechanical Engineers 199,
287–292. A signal processing technique for detecting local defects in a gear from the signal average
of the vibration.
7. D. G and M. S 1990 Mécanique Matériaux Electricité 435, 61–64. Méthodes
avancées de traitement du signal en vue de la surveillance des machines.
8. L. L and T. S¨¨ 1983 Theory and Practice of Recursive Identification. Prentice Hall.
9. O. M 1989 Traitement du Signal 6, 335–345. Adaptatif et non-stationnaire.
10. A. B, P. Ḿ and P. P 1986 Algorithmes Adaptatifs et Approximations
Stochastiques. Masson.
11. M. B and I. N 1993 Detection of Abrupt Changes—Theory and applications.
Prentice Hall.
12. J. S and A. C. S 1980 IEEE Transactions on Information Theory 26, 249–255.
Detecting changes in a time series.
13. V. B, M. T, D. B, A. R, M. C and M. G 1994 Compte Rendu
d’Avancement de Travaux CRAN/PSA. Diagnostic de défauts sur boı̂tes de vitesses par analyse
vibratoire.
14. H. O, D. B, V. B and M. T 1995 Proceedings of the UK Symposium
on Applications of Time–frequency and Time-scale Methods (TFTS 95) , University of Warwick,
Coventry, UK 128–135. A method for analysing gearbox failures using time–frequency
representations.
15. H. O, D. B, V. B and M. T 1995 Proceedings of International
Conference on Acoustics, Speech, and Signal Processing, Detroit, 128–135. Examination of
gearbox cracks using time-frequency distribution.
16. H. Aı¨ 1974 IEEE Transactions on Automatic Control 19, 716–723. A new look at the
statistical model identification.
17. D. B, M. T, T. C and H. N 1993 Proceedings of the IMAC/IFAC, 2nd
International Symposium on Mathematical and Intelligent Model in System Simulation, Brussels
1, 402–406. Application of local test statistics to order estimation.
Let {a(t) } be a random process, which is assumed to be zero mean. Consider the new
random process defined by:
(s) }
Ry(t,s) = E{y(t) y*
where y*(t) denotes the complex conjugate of y(t) . Ry(t, s) will be referred to as the
autocorrelation function (ACF) of {y(t) }. In the stationary case, we have:
Moreover, Ry(t, s) have the hermitian symmetry property. This may be directly verified from
the definition (3):
Ry(t,s) = R*
y(s,t) (A4)
The aim of this appendix is to determine the ACF of {y(t) } as a function of the ACF of
{a(t) }. It will be used to determine the power spectral density (PSD) and the equivalent
spectral bandwidth of {y(t) }.
(s) }
Ry(t,s) = E{y(t) y*
= E{(a0 + aa(t) )(a0 + aa(s) ) ejv0(t − s)}
where Ra(t,s) = Ra(t) is the autocorrelation function of {a(t) }. (stationary case) The PSD is
then obtained as the Fourier transform of the autocorrelation function:
g
+a
(A6)
=A(v) =2 = FT(Ra(t) ) = Ra(t) e−jvt dt
−a
g
+a
v 2 =Y(v) =2 dv
−a
Dvy2 = (A7)
g
+a
=Y(v) = dv
2
−a
g
+a
g
+a
g
+a
g
+a
= A 2 e−a =Z(v − v 0) =2
2
(B5)
It is clear from equation (9) that =Z(v) = play a key role in evaluating =Y(v) = . In particular,
2 2
the study of the bandwidth of =Z(v) =2 with respect to that of =F(v) =2 may give some insight
into the behaviour of =Y(v) =2. In the general case, this is a very difficult problem and we
will only address it for the special case where the process {8(t) } has a gaussian ACF. The
reader should keep in mind that this supplementary assumption only concerns the shape
of the autocorrelation function. However, the pdf of {8(t) } still remains gaussian. So, we
have:
2 2
R8(t) = e−(1/2)t /st
(B6)
=F(v) =2 = z2pst2 e−(1/2)st v
2 2
From now on, we want to determine the spectral bandwidth Dvy of {y(t) }, defined as:
g
+a
v 2=Y(v) =2 dv
−a
Dvy2 =
g
+a
=Y(v) =2 dv
−a
where, for simplicity, the mean frequency is assumed to be equal to zero. So the first step
to determine Dvy is to calculate =Y(v) =2. Recalling the series expansion of ea R8 (t ):
2
a
(a 2R8(t) )n
ea R8 (t ) = s
2
(B7)
n=0
n!
167
From (B6) and (B7), we have:
a
(a 2)n(R8(t) )n
ea R8 (t ) = s
2
n=0
n!
2 2
(R8(t) )n = e−(t /2(st /zn) )
We finally obtain:
$ %
a
(a 2)n
=Y(v) =2 = A 2 e−a 1 + s =Fn(v − v 0) =2
2
(B8)
k=1
n!
where:
=Fn(v) =2 = n
R8(t) e−jvt dt = 2p e (B9)
−a
n
An important remark is that assuming R8(0) = 1, we also have (R8(0) )n = 1 and due to the
Wiener–Kintchine theorem, we have: f =Fn(v) =2 dv = 1. Then the power of {y(t) } calculates
to:
g 0 1
+a a
(a 2)n
=Y(v) =2 dv = A 2 e−a s
2
= A2 (B10)
−a k=0
n!
n=1
n!
with:
g
+a