You are on page 1of 4

Proceedings of the 7th Asian Conference on Refrigeration and Air Conditioning

ACRA2014
May 18-21, 2014, Jeju, KOREA

ACRA2014-XXXXX

ACRA2014
MODEL-BASED PREDICTIVE MAINTENANCE OF CHILLERS
Yan Ke1), Timothy Mulumba1), Wen Shen1), and Afshin Afshari1,2)
1)
Department of Engineering Systems & Management, Masdar Institute of Science and Technology, PO Box 54224, Abu
Dhabi, United Arab Emirates
2)
Corresponding author: aafshari@masdar.ac.ae

ABSTRACT: This paper investigates the advantages of time series modeling for chiller Fault Detection &
Diagnosis (FDD), in comparison to standard data-based or residual-based approaches. Specifically, we study
a hybrid method incorporating a dynamic model and machine learning techniques. Via recursive estimation of
the model parameters, we convert the process measured data into a parameter vector—updated at each
sampling time step. We then apply the Support Vector Machines (SVM) technique to sub-divide the
parameter space, enabling fault classification. The proposed approach is validated using real data measured
during normal and faulty chiller operation, produced by an ASHRAE research project. The results of this
investigation indicate the superiority of the proposed hybrid method over conventional approaches with
higher prediction accuracy and lower false alarm rates. The dynamic model produces a better fit with more
statistically significant coefficients than the conventional, linear regression equivalents. It is therefore easier
to detect a parameter change when a fault occurs.

1. INTRODUCTION multiple linear regression model with diagnosis table,


PCA model with diagnosis table, and linear discriminate
In large Heating Ventilation & Air-Conditioning analysis. Wang et al separated the FDD for system faults
(HVAC) plants, the chiller is the most expensive piece of and sensor faults [7]. They used a reference regression
equipment. Lack of timely maintenance/repair can lead model to validate the performance indices computed
to premature failure of the chiller and incur significant from measurements. Zhao et al proposed a pattern
cost and discomfort. Therefore, Fault Detection & recognition based chiller fault detection method using
Diagnosis (FDD) should be an integral part of a modern support vector data description [8]. Qin et al first
Building Automation System (BAS). proposed a hybrid approach to solve the FDD problem
for VAV terminals [9]. Han et al suggested another
Grimmelius et al developed an empirical fault diagnostic hybrid approach utilizing SVM to implement FDD for
system for a chiller, combining fault detection and chillers [10]. A data set containing normal and faulty
diagnostics in a single step [1]. A reference linear operational data samples is divided into two parts. The
regression was modeled with data from a normally first part with known fault types is trained using SVM.
operating chiller. Peitsman and Bakker used a black box The second part is treated as testing data set to validate
model for fault detection and compared diagnostic the system. This method achieves extremely high
performance of an auto-regressive moving-average accuracy when 10-second data is used. The accuracy
model with exogenous variables (ARMAX) and an drops when the time interval increases. Liang & Du
artificial neural network (ANN) model [2]. ANN models proposed a hybrid method combining a physical model
had slightly better performance than ARMAX models in with SVM [11]. First the measured data is converted to
detecting faults. Tuip et al developed a prototype residuals. Then the residuals are input into SVM for
procedure for an on-line, self-learning fault detection classification.
tool on building level [3]. By taking passive user
behavior into account, the tool aimed to distinguish real 2. PROBLEM DESCRIPTION AND PROPOSED
faults from unexpected user behavior. Jia & Reddy APPROACH
proposed an online model based FDD method for
medium to large chillers [4]. Six process faults were In this paper, we study a hybrid method
identified based on five features developed from fifteen incorporating a dynamic model and machine learning
monitored variables. Kourti applied the principal techniques. Via recursive estimation of the model
component analysis (PCA) method [5]. Reddy proposed parameters, we convert the measured data into a
a general methodology for evaluating FDD methods parameter vector--updated at each sampling time step.
using steady state data and evaluated four multivariate We then apply machine learning techniques, such as
model-based FDD methods against laboratory chiller support vector machines to sub-divide the parameter
performance data [6]. All four methods were data-driven space, enabling fault classification. In comparison to
methods: model-free fault detection with diagnosis table, traditional model-based or data-driven methods, the
combination of time series modeling and machine a realistic implementation, we may be short of sensors to
learning techniques is shown to produce higher capture such comprehensive data set or some sensors
detection/diagnosis accuracy and lower false alarm rates (e.g., pressure sensors) may be difficult to retrofit onto
in FDD for chillers. an existing plant. A feature selection technique is then
crucial to select the most relevant and accessible
Our approach is validated using real data for variables. The top ten most significant dependent
normal and faulty operation, generated in the course of variables are shown in Table 1. From those, we choose
the ASHRAE research project 1043-RP [12]. We use only kW, TCO-TCI and TEO-TEI as benchmark
only 6 features--far fewer than in comparable studies. variables to be modeled as ARX processes (“dependent”
Applicable algorithms for implementing the strategy are variables). As for the exogenous (“independent”)
formulated. After training for generic chiller types, these variables, the most convenient and informative are TCI,
algorithms can be integrated in advanced building TEO and Evap-Tons, respectively, condenser input water
management systems to accurately detect and diagnose temperature, evaporator output water temperature and
the faults commonly observed in chillers. tons of cooling delivered by the evaporator coil (a
calculated variable).
We develop a robust strategy that is suitable for
FDD application to chillers, where both sensor faults and
chiller faults may co-exist. Auto-regressive modeling
with exogenous inputs (ARX) and support vector
machine techniques (SVM) are combined to construct a
new hybrid model for FDD of chillers. The proposed
chiller FDD scheme effectively and accurately (accuracy
> 90% and false alarm rates < 1%) identifies different
types of chiller faults.

3. METHOD
The measured data set is pre-processed by Table 1: Ten most important features/variables selected
estimating the parameters of a dynamic ARX model at from the original data set.
each time step: At each time step, a point in the
parameter space is identified and replaces the original Pre-whitening is applied to the dependent variables
data point for training and testing with SVM. The (KW, TCO TCI and TEO TEI) to determine the suitable
structure of the ARX model is defined by pre-whitening lags for each parameter. In practice, although the system
of the original data. At the validation stage, we may be non-stationary, the data may be transformed to
implement and evaluate three approaches: 1) a pure data- white noise by replacement with the residuals from a
driven approach which applies SVM with different fitted auto-regressive integrated moving-average
kernels directly to the original data set; this is similar to (ARIMA) model. For a quick preliminary analysis,
the approach proposed by Han et al [10]; 2) a hybrid approximate pre-whitening can be done easily by first
approach which pre-processes the data set using a linear differentiating the data (if needed) and then fitting an
regression model and applies SVM on residuals; this is approximate AR model with the order determined by
similar to the approach proposed by Liang & Du [11]; 3) minimizing the Akaike Information Criterion (AIC) [13].
a hybrid approach which pre-processes the data set using The specification of which lags of the independent
an ARX model and applies SVM on parameters. The first variable enter into the model is often done by inspecting
two methods assume that the underlying system is static. the sample cross-correlation function based on the pre-
The third (hybrid) method explicitly accounts for the whitened data.
dynamic nature of the process.
The ARX models produce a better fit with more
The ASHRAE data set contains both normal and statistically significant coefficients than the conventional,
faulty data for a 90-ton (316 kW) chiller. Based on the linear regression equivalents. Additionally, tests on the
results of chiller fault survey [12] and our understanding residuals from these models show that they do not
of the most common faults in the region, five typical violate any of the Gauss-Markov assumptions. We have
faults were investigated in this study: validated the model using out-of-sample data, i.e., data
not used to estimate the parameters but corresponding to
• Reduced condenser water flow (F1), similar operating conditions. The model’s prediction
• Reduced evaporator water flow (F2), performance is excellent with a Root Mean Square Error
• Condenser fouling (F3), (RMSE) of 1.98 (about 2.5% of the peak).
• Non-condensables in refrigerant (F4),
• Refrigerant leak (F5). 3.1 Support Vector Machine (SVM)
In this work, we use the reduced data set, logged The SVM technique is a supervised learning
every two minutes, in order to be closer to a realistic model that categorizes data and recognizes patterns. For
low-cost implementation. For each severity level of each a given a set of training data with known categories,
fault, we have a data file of approximately 432 samples. SVM assigns to each data sample a point in a high
dimensional space. Next, the data is classified by sub-
The original data set consists of 65 monitored dividing the high dimensional space using hyper-planes.
variables which may create heavy computational load for Ideally, the hyper-planes separate the training samples as
models and classifiers if they were all to be analyzed. In widely as possible to unambiguously distinguish new test
sample categories. rates are reported. The proposed hybrid method
outperforms both existing methods. Prediction accuracy
3.2 An Overview of Proposed Hybrid Method and false alarm rates are improved significantly.
A hybrid approach combining ARX model and Direct application of SVM on the data results in an
SVM is desirable to improve the accuracy of the FDD overall (all faults and levels combined) fault detection
analysis. In the proposed hybrid approach, we pre- accuracy of 72.1% while the false alarm rate exceeds
process the data using ARX models to remove auto- and 20%. The overall fault diagnosis accuracy is 83.6%.
cross-correlations between variables. Then, we use Application of SVM on regression model residuals
parameters instead of the original data to do the fault produces an overall fault detection accuracy of 82.6%
detection and diagnosis in SVM. The proposed method while the false alarm rate is 7.4%. As for diagnosis the
includes three separate phases: overall accuracy of this approach is 87.8%.
(1) Transform the original data set (training and With the hybrid approach (ARX and SVM),
testing subsets) into a set of ARX model parameters; overall detection accuracy is better than 90% with a false
alarm rate of about 0.1%. Overall diagnosis accuracy is
(2) Train the SVM classifier for detection (using also enhanced, exceeding 90%.
binary SVM) and diagnosis (using multi-class SVM)
using the parameter data set; 7. CONCLUSION
(3) Apply the trained SVM classifiers on the We compared the proposed hybrid approach with
testing data and compute prediction accuracy and false existing FDD approaches applied to the chiller data set
alarm rates. from ASHRAE project RP-1043. The results show that
our approach outperforms both of the conventional
4. RESULTS methods with higher prediction accuracy and lower false
alarm rates. Furthermore, it is important to point out that
The proposed hybrid method is validated with the we use far fewer dependent variables compared to the
data from ASHRAE Project 1043-RP. We divide the other two approaches (three versus ten), which saves the
reduced data set (two minutes interval) into two subsets cost of sensors and makes practical large-scale
with a ratio of 3:1. The first subset is the “training” data implementation more realistic.
set, where all data samples are assigned to fault labels. A
fault label can be Normal, Fault 1, Fault 2, Fault 3, Fault After training for generic chiller types, these
4 or Fault 5, which corresponds to normal data, reduced algorithms can be integrated in advanced building
condenser water flow, reduced evaporator water flow, management systems to accurately detect and diagnose
condenser fouling, non-condensables in refrigerant and the 5 faults commonly observed in chiller systems.
refrigerant leak, respectively. The second subset (3 times Future work will extend the application of the hybrid
smaller) is the “testing” (or validation) data set, where all method to AHUs. We are also working on a Kalman
data samples have unknown fault label. The fault labels filter based estimation algorithm, which is better suited
of the testing data set are only revealed after the fault to on-line parameter identification.
diagnosis in order to calculate the classification accuracy
and false alarm rates. To assess the performance of the ACKNOWLEDGEMENT
proposed hybrid method, we also implement two
conventional FDD approaches: This research was supported by the Executive Affairs
Authority (EAA) of Abu Dhabi, United Arab Emirates as
(1) The first approach applies SVM directly to the part of the project Predictive Maintenance: Fault
chiller data, which is similar to the method proposed by Detection and Diagnosis of AC Equipment. The authors
Han et al [10]. would also like to thank Mr Michael Vaughn and Ms
Donna Daniel of ASHRAE for their assistance in
(2) The second approach adopts an approach providing the chiller data of the ASHRAE Project RP-
similar to that of Liang and Du [11]. We construct a 1043.
static regression model similar to the one suggested by
the authors of the report of ASHRAE Project 1043-RP. REFERENCES
The difference between the simulation results and the
actual data, i.e. the residuals of the regression model, are [1] H.T. Grimmelius, J.K. Woud, and G. Been. On-line
used for training and testing in SVM. failure diagnosis for compression refrigerant plants.
International Journal of Refrigeration 18(1), pages 31–
41, 1995.
5. DISCUSSION
[2] H.C. Peitsman and V. Bakker. Application of black-
Although the hybrid model uses only the first three box models to HVAC systems for fault detection.
dependent variables from Table 1, we provide all 10 ASHRAE Transactions 102(1), pages 628–640, 1996.
variables to the alternative conventional approaches.
Therefore, the hybrid model is severely handicapped in [3] B. G. C. C. Tuip, M. A. Houten, M. Trcka, and J. L.
the comparison. Nonetheless, it manages to outperform M. Hensen. Occupancy based fault detection on building
the conventional approaches. To facilitate comparison, level - a feasibility study. Proceedings of the 10th
we separate the steps of fault detection and fault International Conference Enhanced Building Operations
diagnosis. Both the prediction accuracy and false alarm ICEBO, Kuwait, page 6, 2010.
support vector data description. Applied Energy, 2013.
[4] Y. Jia and T. Reddy. Characteristic physical parameter
approach to modeling chillers suitable for fault detection, [9] J. Qin and S. Wang. A fault detection and diagnosis
diagnosis, and evaluation. Journal of Solar Energy strategy of VAV air-conditioning systems for improved
Engineering, Transactions of the ASME, v 125, n 3, energy and control performances. Energy and Buildings
Emerging Trends in building Design, Diagnostics, and 37 (10), pages 1035–1048, 2005.
Operations, pages 258–265, 2003.
[10] H. Han, B. Gua, J. Kang, and Z.R. Li. Study on a
[5] T. Kourti. Application of latent variable methods to hybrid SVM model for chiller fdd applications. Applied
process control and multivariate statistical process Thermal Engineering, 2010.
control in industry. Journal of Adaptive Control and
Signal Processing, 2004. [11] J. Liang and R. Du. Model-based fault detection and
diagnosis of HVAC systems using support vector
[6] T.A. Reddy. Application of a generic evaluation machine method. International Journal of
methodology to assess four different chiller FDD Refrigeration, pages 1104–1114, 2007.
methods (rp-1275). HVAC & R Research, v 13, n 5.,
pages 711–729, 2007. [12] M. C. Comstock and J. E. Braun. Experimental data
from fault detection and diagnostic studies on a
[7] S. Wang, Q. Zhou, and F. Xiao. A system-level fault centrifugal chiller, Herrick Laboratories. Purdue
detection and diagnosis strategy for hvac systems University, West Lafayette, IN, HL 99-18, 1999.
involving sensor faults. Energy and Buildings 42 (4),
pages 477–490, 2010. [13] Yosiyuki Sakamoto, Makio Ishiguro, and Genshiro
Kitagawa. Akaike information criterion statistics.
[8] Yang Zhao, Shengwei Wang, and Fu Xiao. Pattern Dordrecht, The Netherlands: D. Reidel, 1986
recognition-based chillers fault detection method using

You might also like