You are on page 1of 10

Computers in Biology and Medicine 43 (2013) 20532062

Contents lists available at ScienceDirect

Computers in Biology and Medicine


journal homepage: www.elsevier.com/locate/cbm

Support vector machine algorithms in the search of KIR gene


associations with disease
Juan C. Cuevas Tello a, Daniel Hernndez-Ramrez b, Christian A. Garca-Seplveda b,n
a
Facultad de Ingeniera, Universidad Autnoma de San Luis Potos. Av. Dr. Manuel Nava No. 8, Zona Universitaria, ZC 78290, San Luis Potos, Mxico
b
Laboratorio de Genmica Viral y Humana, Facultad de Medicina, Universidad Autnoma de San Luis Potos, Avenida Venustiano Carranza #2405, Colonia
Filtros las Lomas, CP 78210, San Luis Potos, Mxico

art ic l e i nf o a b s t r a c t

Article history: Killer-cell immunoglobulin-like receptors (KIR) are membrane proteins expressed by natural killer cells
Received 7 May 2013 and CD8 lymphocytes. The KIR system consists of 17 genes and 614 alleles, some of which bind human
Accepted 30 September 2013 leukocyte antigens (HLA). Both KIR and HLA modulate susceptibility to haematological malignancies,
viral infections and autoimmune diseases. Molecular epidemiology studies employ traditional statistical
Keywords: methods to identify links between KIR genes and disease. Here we describe our results at applying
KIR articial intelligence algorithms (support vector machines) to identify associations between KIR genes
Computational biology and disease. We demonstrate that these algorithms are capable of classifying samples into healthy and
NK cells diseased groups based solely on KIR genotype with potential use in clinical decision support systems.
Articial intelligence
& 2013 Elsevier Ltd. All rights reserved.
Immunogenetics

1. Introduction (L) signals upon binding their cognate ligands. The balance and
integration of these signals modulate NK cell cytotoxicity and
Killer-cell immunoglobulin-like receptors (KIR) are membrane cytokine release [6,7]. Some KIR proteins recognise HLA class I
bound proteins expressed by natural killer (NK) cells and a small molecules and by doing so modulate the activity of NK cells [8,9].
subset of CD8 lymphocytes. NK cells are peripherally circulating KIR proteins interact with classical HLA class I ligands such as HLA-
lymphocytes and key participants of innate immune responses to A (KIR3DL2), HLA-B (KIR3DL1 and  3DS1) and HLA-C (KIR2DL1,
viral infections and tumours. In contrast to the rearranging  2DL2,  2DL3 and  2DS2) as well as with non-classical HLA
antigen-specic receptors expressed by lymphocytes of the adap- ligands such as HLA-G (KIR2DL4) [811]. The ligands of the
tive immune system, the KIR gene repertoire is genetically remaining KIR proteins have yet to be determined.
determined and remains unchanged throughout life [1,2]. To date Virus-infected and malignant cells downregulate their expression
17 KIR genes have been shown to exist, all of which exhibit allelic of HLA in an attempt to evade recognition by the adaptive immune
polymorphism. KIR genes are organised within human chromo- system. NK cells sense this loss (loss of self) through KIR and respond
some 19q13.4 as physically contiguous strings known as haplo- vigorously by directly killing the infected or malignant target cells or
types [3]. A KIR haplotype is composed of centromeric and by releasing cytokines which recruit other immune cells [12,13].
telomeric halves known as haplotype motifs. All known KIR Some KIR genes have been associated with increased susceptibility or
haplotypes contain the four framework genes KIR3DL3,  3DP1, resistance to infectious, autoimmune and metabolic diseases [1419].
2DL4 and  3DL2 but exhibit variation in the content of other NK cells were initially identied by their ability to spontaneously kill
KIR genes (KIR2DL1,  2DL2, 2DL3,  2DL5A,  2DL5B,  2DS1, tumour cells without prior sensitisation [2022]. Historical studies of
2DS2,  2DS3,  2DS4, 2DS5,  2DP1,  3DL1 and  3DS1) [4]. the immunogenetic factors that determine clinical outcome in
KIR gene content diversity has recently been shown to arise from a patients subjected to haemopoietic stem cell transplantation (HSCT)
limited number of haplotype motif combinations (four centro- for haematological malignancies were the rst to highlight the
meric and two telomeric motifs) [5]. clinical relevance of KIR genes in antitumour responses [23].
KIR genes encode for two (2D) or three (3D) extracellular The rst study to suggest such an association described a potent
domain proteins which can transduce activating (S) or inhibitory graft-vs.-leukaemia effect arising from predicted NK cell alloreactivity
in the graft-vs.-host direction amongst patients subjected to HSCT for
leukaemias [24]. Many other studies published since describe KIR
n
Corresponding author. Tel.: 52 444 1135 860.
gene associations with anti-tumour effects and post-transplant
E-mail addresses: ca.garcia.s@gmail.com, clinical endpoints [2531]. Differences in patient demographics,
christian.garcia@uaslp.mx (C.A. Garca-Seplveda). clinical management, the preferred transplant modality and KIR

0010-4825/$ - see front matter & 2013 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.compbiomed.2013.09.027
2054 J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062

typing method have largely contributed to the heterogeneity of the Table 1


ndings. In addition, NK cell anti-tumour activity has been well Clinical data for the haematological cohort.
demonstrated to occur in vitro against a wide variety of haematolo-
n %
gical malignancies [23,32]. In all, these ndings support the notion
that KIR enable NK cells to play a crucial role at conferring resistance Gender Male 23 53
to certain haematological tumours [27,33,34]. Female 20 46
A proper assessment of the functional relevance of KIR genes in Diagnosis Chronic myeloid leukaemia 25 58
Hodgkin's lymphoma 18 42
human diseases should also take into account the allelic diversity of B symptoms Present 30 70
both KIR and HLA and the different afnities of their interaction [35 Absent 13 30
37]. Therefore, the daunting task of identifying associations between ECOG 0 3 7
KIR genes and disease, and of devising clinical algorithms that consider 1 16 37
2 20 46
the high biological complexity of KIR and HLA genes, is more easily
3 3 7
dealt with through computers and articial intelligence algorithms. 4 1 2
Support Vector Machines (SVM) were introduced by Cortes and
Vapnik as support-vector networks and the term widely dissemi-
nated by Cristianini and Shawe-Taylor [38,39]. SVM arose from the
machine-learning community's efforts towards developing articial Table 2
intelligence strategies for complex problems. Authors involved in Data encoding matrix exemplifying the distribution of articial-data-1.
statistical learning and kernel machine development have previously
g1 g2 g3 g4 g5 g6 g7 g8 ... g23 d1
described SVM's with greater detail [40,41]. SVM applications have
mainly focused on classication and regression problems. In classi- p1 0 0 1 0 0 1 1 0 ... 1 1
cation the goal is to produce a generalisable mathematical function p2 0 1 0 0 1 0 0 1 ... 0 1
capable of distinguishing between different classes. This function is p3 0 0 0 0 1 1 1 1 ... 1 1
... ... ... ... ... ... ... ... ... ... ... ...
inferred from available examples (training data) and then put to use p343 1 0 0 0 0 0 0 0 ... 1 1
on untested examples (test data). Regression applications, on the
other hand, produce a mathematical model which best describes the
behaviour of certain datasets so as to infer future trends (i.e., accordance with state and national ethics regulations and lacking
population growth prediction). SVM's excel over other classication personal identifying information so as to ensure patient/donor
algorithms in large part due to their capacity to perform multivariate condentiality.
classications in a non-linear manner and in n-dimensional space
(see below). This advantage allows several layers of complexity to be 2.2. KIR genotyping and encoding
taken into account, such as those typical of KIR, HLA and other
complex genetic systems. SVM are currently regarded state-of-the- KIR gene content was determined using a locally developed
art classication algorithms [40,41]. sequence specic priming polymerase chain reaction (SSP-PCR)
This paper summarises our experience at developing and apply- genotyping technique capable of detecting the presence or absence
ing SVM articial intelligence algorithms to identify associations of of each of the 17 genes [43]. Our SSP-PCR approach does not
KIR genes with disease in a cohort of patients with haematological currently enable us to distinguish between KIR2DL5A and B, the
malignancies. Our innovative use of genetic algorithms for the position of genes within the haplotype or gene copy number
optimisation of the SVM parameters allows for greater freedom from variations. PCR amplicons were resolved in 1.5% agarose gels and
human interference to be achieved. We consider that this innovative digitally documented after ethidium bromide staining. KIR haplotype
approach will prove valuable at identifying KIR gene associations in motifs (cA01, cB01, cB02, cB03, tA01 and tB01) were deterministically
other molecular epidemiological studies and aid in the development inferred from genotyping results as described with greater detail in
of clinical decision support systems (DSS). the original publication [5]. KIR genotyping information was encoded
for each sample as strings of binary values indicating presence (1) or
absence (0) of the 16 genes, a KIR2DS3/S5 gene combination and the
2. Materials and methods six haplotype-motifs for each of the 343 samples. These strings were
,
consolidated on a tab delimited text le as g g1, g2, g3 g23. In this
2.1. Study population text le KIR data for each of the samples p1 to p343 was arranged in
columns g1 through g23. In addition each data entry included a health
Three hundred unrelated blood-donor samples comprising the status variable dened as d1 which was1 in samples obtained from
Mexican Genomic DNA Reference Collection (MGDC-REF) were individuals having a haematological malignancy and  1 in healthy
used as healthy controls for this study. This Mexican mestizo donors, as shown in Table 2.
reference population included 135 (45%) males and 165 (55%)
females aged between 19 and 38 yr (median of 24) of which 75% 2.3. Articial data
were residents of the city of San Luis Potos and 25% were
residents of rural areas of this Mexican state. DNA samples were Two different articial datasets were used to assess the perfor-
extracted from blood-bank discarded leucocyte concentrates mance and establish baseline classication error plots for the SVM.
referred to us by Hospital Central Dr. Ignacio Morones Prieto The rst dataset (Articial-Data-1) was composed of 343 randomly
according to previously published protocols [42]. A more detailed generated KIR genetic trait strings which include KIR genotypes
description of the KIR features present in this reference population which were neither biologically relevant nor subjected to any
is given in the original publication [43]. In addition, 43 DNA haplotypic constraint. This dataset was used to evaluate the cap-
samples obtained from patients with haematological malignancies ability of the SVM at identifying gene associations where none would
(25 with leukaemia and 18 with lymphomas) referred to us by the be expected. KIR genotypic data was randomly generated (simulated)
Haematology Department of Hospital Central Dr. Ignacio Morones to produce an articial population of 343 samples and 23 variables
Prieto were included. Clinical information for the haematological (so as to maintain a similar size to that of the real dataset). To further
cohort is shown in Table 1. All samples were provided to us in help with dataset comparisons, the number of healthy articial
J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062 2055

individuals in this dataset were limited to 300 and the total number extended to linearly non-separable data through a soft margin (C), a
of diseased samples was kept to 43. One hundred of these Articial- trade-off between maximising the margin and misclassifying data
Data-1 sets were assessed through SVM to generate average classi- points. To generate this soft margin the SVM rely on a kernel trick,
cation error, sensitivity and specicity plots. which together with margin maximisation, allows for the classication
A second data set (Articial-Data-2) was used to evaluate the of non-linear data [39,41,46]. The kernel trick allows non-linear data
performance of the SVM at identifying KIR gene associations on distributions to be linearly separable by transforming them to a high-
biologically sound KIR genotypes in which the health status variable dimensional feature space. Fig. 2 graphically demonstrates the perfor-
(d1) was randomly assigned. For this second dataset, the KIR mance of a SVM non-linear classication algorithm at solving such
genotypes present in MGDC-REF were used to produce an articial XOR problem.
,
population of 343, 300 of which were randomly assigned a healthy In our approach, the mapping of g into required the use of a
d1 variable and 43 a diseased variable (to maintain real dataset kernel mapping function. We employed a Gaussian kernel known as
proportions). Again, 100 different articial datasets were used to a radial basis function (RBF) kernel. However, the use of the RBF
generate average classication error, sensitivity and specicity plots. kernel requires two parameters to be optimised and set: the width of
Gaussian functions (s) and the trade-off variable (C) that controls the
2.4. Support vector machine and auxiliary algorithms trade-off between misclassication and the size of the SVM margin.
Traditionally SVM optimisation relies on evaluating classication
All SVM simulations were carried out using Matlab Support Vector performance with different s and C value combinations through a
Machines toolbox for Classication and Regression [44]. In Fig. 1A, a labour intensive, empirical, trial and error approach.
linear function separates four data points into two clearly distinct
classes (dark and open circles). This gure represents a simple
problem in which the data is linearly separable, a task that is easily
resolved by a linear classication algorithm like the Perceptron [45].
The line representing this function is located a distance between the
lower left dark circle and the lower right open circle, a distance that is
referred to as the classication margin [46]. In linear classication
problems, margin optimisation is essential to allow for the
proper separation of data points. However, when the classical XOR
problem is faced with linearly non-separable data points, the classi-
cation task becomes impossible for simple linear classication
algorithms (Fig. 1B left panel). One way in which this XOR problem
can be tackled is through SVM, where each data point is mathema-
tically transformed into a feature space () so as to allow for linear Fig. 2. Non-linear classication using SVM for linearly non-separable data points
classication (Fig. 1B right panel). In SVM, classication margins are representing the XOR problem.

Fig. 1. (A) Classication is possible using a linear algorithm for these four data points (where black circles can be seen as representing diseased patients and open circles a
healthy individual). (B) Kernel trick. Non-linearly distributed input data is easily linearly classied when transformed to a high-dimensional feature space.
2056 J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062

These optimisation procedures are explained with greater 3. Results and discussion
detail in previous publications [46]. Instead of taking this
approach, we chose to search for the optimal s and C values by The diversity and frequency of KIR gene features amongst the
applying a genetic algorithm to the task. For this purpose we healthy study population is described with greater detail in a
employed the Genetic Algorithm Toolbox for MATLAB (The Math- previous publication [43]. The KIR genetic features of the haema-
Works, Inc. Massachusetts, USA)[47]. This approach allowed us to tological cohort were similar to those of the healthy population.
achieve near-ideal classication errors (as close to 0 as possible) However, statistically signicant differences in the carrier frequen-
with greater precision and efcacy. Genetic algorithms achieve cies of KIR2DL2, Group A haplotypes and A,B haplotypes were
this through articial evolution sub-algorithms, which are a observed (see below). KIR haplotypes and haplotype motifs could
stochastic global search and optimisation method based on the be resolved through our deterministic encoding approach for more
principles of biological evolution [48]. Briey, the genetic algo- than 98% of our healthy control's and 100% of our patients
!
rithm begins with an initial generation P 1 having 50 random (allowing for the inherent ambiguities mentioned in the original
chromosomes (pairs of s and C values) as dened by publication) [5].
! For SVM to classify data, they rst have to be taught to
P 1 s1 ; C 1 ; s2 ; C 2 ; s3 ; C 3 ; sm ; C m  recognise the genetic patterns associated with disease using a
small subset of the real data, which is known as training data.
This rst generation is then subjected to articial genetic
Training data is normally a small fraction of the main dataset,
operators such as selection, crossover, mutation and re-insertion
! ! ! whereas the remaining part of the main dataset that is scrutinised
(elitist strategy) to generate P 2 ; P 3 ; P n generations. The
with the SVM is known as the test data. The magnitude of the
tness (f) of each generation is tested by the SVM in the search of
! classication error produced by a SVM on test data after training
the P n producing the lowest classication error and highest
varies depending on the complexity of the underlying pattern that
sensitivity and specicity (minimum f). This process leads to the
is being searched for. Classication error is also inuenced by the
optimisation of SVM parameters through articial evolution. Those
size and proportion of the main dataset that is being used for
generations producing the smallest f were selected for further use
training. In general terms, the larger the training dataset is, the
on real data. The ID3 algorithm was originally introduced by
smaller the classication error will be on test data. Nevertheless,
Quinlan in 1983 and used for rule generation in expert systems
the use of large training datasets (Z95%) leads to undesirable
[49]. We employed the ID3 algorithm as a supplementary data-
function over-tting. Optimal SVM performance is achieved using
mining tool to generate decision trees from our real data set.
training dataset sizes ranging between 50% and 80% [40,45]. Initial
While SVM's are useful to model the relationship that exists
SVM simulations employed an input matrix composed of 35 KIR
between input and output data they are not capable of indicating
variables: 16 genes, 6 classical haplotype combinations (A/A, A/B,
which variables are more relevant during the classication pro-
B/B, all having A, all having B and Bx), 6 haplotype motifs and
cess. By using the ID3 algorithm we were able to visualise an
7 extended haplotypes. However, subsequent trials using different
approximation of the underlying genetic pattern being recognised
combinations of these variables in the matrix demonstrated
by the SVM in the form of a decision tree by using information
differences in SVM performance (data not shown). Final simula-
entropy. Improved versions of ID3 include C4 and C5 algorithms.
tions and those discussed herein were carried out using only the
A plot of the SVM classication error on test data was generated
limited matrix variables mentioned in the KIR genotyping and
at different training-data sizes in 5% increments for the three
encoding subsection of Materials and methods.
datasets. Plots of the sensitivity and specicity achieved by the
We evaluated the classication performance of our SVM algo-
SVM on test-data at different training-data sizes were also
rithm at classifying samples into healthy and diseased groups
produced. ID3 classier models for decision trees were generated
using three different datasets: (1) Articial-Data-1, (2) Articial-
with 5-fold cross-validation for three different training data sizes:
Data-2 and (3) Real Data. The Articial-Data-1 matrix was used to
30%, 50% and 75%. Overall specicity [true negatives/(true nega-
assess baseline performance of the SVM at identifying associations
tives false positives)] of the SVM algorithm was calculated for
between genes and disease where none should in theory exist.
each training dataset size increment along with overall sensitivity
Less than 5% of the 34,300 articial genotypes that were randomly
[true positives/(true positives false negatives)]. Receiver Operat-
generated using this dataset matched actual KIR haplotypes seen
ing Characteristic (ROC) curves were generated by plotting true
in human populations. The Articial-Data-2 matrix allowed us to
positive rates (TPR) against false positive rates (FPR), i.e. sensitivity
evaluate the performance of the SVM at dealing with biologically
vs. 1specicity, achieved at different training dataset sizes for
sound KIR haplotypes or genetic traits. However, these genetic
each of the three datasets [50]. When a discrete classier is
traits were randomly assigned a health status variable and as such
applied to a test set it yields a single confusion matrix (true
do not represent actual KIR haplotypes present in real patients.
positive rate vs. false positive rate) which is represented as a single
The optimal s and C values for each of these datasets were
ROC point [51]. Each ROC point evaluates the performance of the
calculated independently and using the entire datasets. A genetic
SVM on test data at different training data sizes.
algorithm enabled us to quickly determine the best s and C
parameter combination for this task. For Articial-Data-1 the best
2.5. Traditional statistical tests parameters were shown to be C 6.0144 and s 0.047435, while
for both the Articial-Data-2 and Real Dataset these were C
Carrier (phenotypic) frequencies for KIR genes, haplotypes, 0.087604 and s 0.47026. As expected, the classication error
motifs and genotypes were calculated by counting the number of plots shown in Fig. 3 (average classication error produced from
individuals bearing the trait and the percentage that they repre- 100 iterations for each dataset) followed a logical stratication of
sent in either the reference or study populations. KIR gene and results. Randomly generated KIR genotypes (Articial-Data-1)
haplotype carrier frequency comparisons between healthy con- produced high overall classication errors and real data produced
trols and diseased patients employed a two-sided Pearson's 2 or the lowest. The overall classication error plot for Articial-Data-2
Fisher's exact test along with multivariate logistic regression tests was located between the former two. SVM misclassication is
for independence using the SPSS Production Facility (version 16, dened as an incorrect prediction of the health status of a test
SPSS, Inc. Chicago, IL, USA), signicance being established at genotype based on the rules established from training data. These
p o0.05. could be test genotypes predicted to be healthy when they were
J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062 2057

Plotting sensitivity and specicity against training data size


allows the performance of the SVM classier to be further
assessed, see Fig. 4. Overall specicity of the SVM algorithm at
optimal performance range for Articial-Data-1, Articial-Data-2
and Real Data was of 100%, 97% and 96%, respectively. On the other
hand, overall sensitivity of the SVM algorithm at optimal perfor-
mance range for these same datasets was of 8.5%, 13.4% and 58.6%,
respectively. In all cases, SVM performance on real data was better
than that observed with Articial-Data-2. The authors consider
that the high overall specicity seen in the three datasets results
mainly from the abundance of healthy subjects. However, the fact
that sensitivity was much higher in the real data set in comparison
to the two articial datasets reects the algorithms ability to
correctly identify a genetic pattern associated with disease (see
Fig. 4 left panels). ROC curves generated from the specicity and
sensitivity plots for each dataset (at different training dataset
sizes) further illustrate this (Fig. 4, panels on the right). In these
plots the articial data sets produce ROC curves which are closer
to the reference diagonal dotted line that represents a random
Fig. 3. Classication error plot on test-data for both articial and real data as a guess strategy. Calculating area under the curve (AUC) for these
function of different training-data set sizes. plots shows that higher values are present in the real dataset
(0.73570.073, mean 7SD) followed by the Articial-Data-2
actually diseased or vice versa. The mean classication error of our (0.540 70.039) and Articial-Data-1 (0. 543 7 0.052).
SVM algorithm at optimal training dataset sizes (between 50% and Figs. 3 and 4 show the performance of the optimised SVM
80% as mentioned previously) was of 12.1% (ranging from 11.7% to classier and demonstrate the feasibility of applying this approach
12.5%) for Articial-Data-1, 9.96% (ranging from 9.1% to 10.7%) for to predict clinical outcomes based solely on KIR genotyping data
Articial-Data-2 and of 6.5% (ranging from 5.8% to 7.3%) for the Real for diseases that have shown to be associated with these genes. It
Data. Within the optimal training dataset size region, the SVM is clear that the optimal performance of the SVM on Real Data is
algorithm manages to classify KIR genotypes correctly in 87.9% of achieved when using training data sizes that range between 30%
the Articial-Data-1, 90.1% of the Articial-Data-2 and in 93.5% of the and 60%. Within this size range classication error is lowest (5.8%)
Real Dataset. For the three datasets, a statistical comparison of mean and sensitivity is highest (64.5%). A summary of the classication
classication errors revealed pr0.0001; using students-t test errors, sensitivity, specicity, healthy subject number, true positive
throughout all comparisons. One immediate inference arising from and true negative rates achieved on real data is given in Table 3.
this distribution of classication error plots is that the SVM is capable Comparison of the ROC curves and AUC calculations reveals that
of identifying genetic patterns associated with health status. This can those derived from articial dataset were closer to the random-
only arise from a true association of KIR genes with susceptibility to guess diagonal (AUC 0.5) than those derived from real data.
haematological malignancies, as the SVM manages to achieve lower The SVM manages to classify Real Data samples by using n-
classication errors in the real data in comparison to the articial dimensional transformations of KIR genotyping data to generate a
datasets. If KIR genes were not associated with susceptibility or linear plane capable of clearly distinguishing between healthy and
protection to haematological diseases then the classication plot for diseased status. As the visualisation of this n-dimensional feature
the real dataset would be expected to be higher and similar to that of space escapes current graphical means of representation, the ID3
the Articial-Data-2 plot. algorithm allowed us to indirectly generate binary decision trees
The SVM fails to achieve as low classication error on Articial- that highlight some of the most important KIR patterns being
Data-1 given the random nature of the KIR genotype strings used and recognised by the SVM algorithm. ID3 classier models for
,
the absence of a true association between g and d1. The Articial- decision trees were generated on real data using 5-fold cross-
Data-1 plot therefore represents the baseline performance of our validation for three different training data sizes: 30%, 50% and 75%
algorithm and its capacity of nding associations between genetic (see Fig. 5). As expected, our results show that ID3 decision trees
patterns and health status where none are present. This is merely a tend to become more complex as the training data size increases.
reection of the capability of SVM algorithms at identifying patterns The decision tree generated using 30% of training data (panel A in
and, for simplicity, could be regarded as the maximum SVM Fig. 5) requires only 6 decisions and 7 KIR genetic traits to classify
performance on random data and the curve to which all other data the remaining 70% of the dataset (test data) as healthy or diseased
sets should be compared. As one would expect, the SVM algorithm with less than 5.8% classication error, a sensitivity of 61.3% and a
performs even better when real KIR genotypes are used in spite of specicity of 99%. However, at a training dataset of 75% and at a
the randomness with which the health status variable was assigned. point where SVM performance is optimum, the decision tree
With the real dataset however, a true underlying biological pattern (panel C in Fig. 5) requires 8 decisions, 16 KIR variables and a
exists and associates KIR genotype with outcome. This nding alone more complex tree topology to achieve the same, albeit with a
provides further support to a critical role for KIR in early anti-tumour higher specicity (100%). Interestingly, the decision tree generated
responses. That NK cells play a crucial role in immune responses at mid range (panel B in Fig. 5) exhibits features present in both
against haematological malignancies has been well established [52 the rst and third tree. Traditional statistical comparison of KIR
57]. However, little evidence exists of the mechanisms by which KIR carrier frequencies (contingency tables using Pearson's or Fishers'
genes provide NK cells (or other lymphocytes) the capacity to exact test) revealed that KIR2DL2 was more frequent amongst
eradicate or tolerate malignant cells [58]. That classication error patients with haematological malignancy in comparison to the
reaches near-articial-data levels at training-data sizes below 20% healthy donors (77.8% vs. 40.3%, respectively; p r0.0001), Group A
and beyond 80% is only a reection of a disproportionate distribution homozygosity was less frequent (11.1% vs. 32%, respectively;
of healthy to diseased cases at these dataset ratios (4:30 and 30:4, pr 0.0044) and A,B heterozygous haplotypes were more frequent
respectively). (86.7% vs. 58.7%, respectively; pr 0.0002). Although decision trees
2058 J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062

Fig. 4. Sensitivity and specicity plots (left panels) and ROC curve plots (panels on the right) for the two articial datasets and the real dataset as a function of different
training-data set sizes.

generated at 50% and 75% training data did support the impor- training data size. KIR genes having HLA-Bw4 specicity proved to
tance of KIR2DL2, classical KIR haplotype groups showed no be relevant only after three of more HLA-C decisions had taken
relevance. On the other hand, the importance of the centromeric place, which highlights the importance of this specicity in this
haplotype motif cB03 seen in these same trees did not prove to be particular group of haematological diseases. Although our geno-
statistically signicant through traditional analysis. Also evident typing technique was not capable of distinguishing between
from the three ID3 trees is that most of the decisions involved KIR2DL5A and -B, the ID3 tree generated at 75% training size
either KIR genes or haplotype motifs rich in KIR genes having HLA- highlights the importance of this gene as suggested by other
C specicity: 85% of the decisions made at 30% training data, 80% authors [30,59]. A previous report had associated the KIR3DS1
of those at 50% training data and 62% of those made at 75% 2DL52DS52DS1 genotype with protection from Hodgkin's
J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062 2059

Table 3
Support vector machine performance summary on real data.

Training data (%) CE (%) Sens (%) Spec (%) Healthy (n) Diseased (n) True negative True positive

5 12.9 17.1 97.2 285 41 277 7


10 9.4 38.5 98.1 270 39 265 15
15 11 37.8 96.5 255 37 246 14
20 7.3 60 97.5 240 35 234 21
25 6.6 54.5 99.1 225 33 223 18
30 5.8 61.3 99 210 31 208 19
35 6.7 57.1 98.5 195 28 192 16
40 6.8 57.7 98.3 180 26 177 15
45 6.3 58.3 98.8 165 24 163 14
50 5.8 63.6 98.7 150 22 148 14
55 6.5 60 98.5 135 20 133 12
60 6.5 61.1 98.3 120 18 118 11
65 7.4 56.3 98.1 105 16 103 9
70 6.8 53.8 98.9 90 13 89 7
75 5.8 54.5 100 75 11 75 6
80 7.2 44.4 100 60 9 60 4
85 9.6 28.6 100 45 7 45 2
90 11.4 20 100 30 5 30 1
95 11.1 33.3 100 15 3 15 1

CE classication error, Sens sensitivity, Spec specicity.

Fig. 5. ID3 decision trees generated at three different training dataset sizes: (A) 30%, (B) 50% and (C) 75%.

Lymphoma [60]. Our description of a protective effect arising from nding. Given that KIR2DP1 is a pseudogene and that KIR2DL1 and
the presence of the cB03 motif (which harbours KIR2DL3,  2DL5,  2DL3 have such a high carrier frequency in our population, we
2DS5,  2DP1 and  2DL1 genes) is in agreement with this think that this protective effect might depend mainly from the
2060 J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062

presence of KIR2DL5 and 2DS3/S5 locus, an effect that has also been associations with disease susceptibility and progression in future
inferred by previous studies [59,60]The protective effect provided by applications. Current efforts are being focused at our laboratory on
KIR3DS1 that has been reported by other authors was also partially applying this same approach in the search of KIR gene associations
supported by our data [30,60]. However, in our results the 3DS1 gene in larger and more homogeneous disease groups such as HIV/AIDS,
only provided a protective effect when observed in the absence of CMV and A(H1N1) pandemic inuenza.
2DL2 or 2DL5 genes (see Fig. 5, panel c). In a similar manner, we
found that KIR2DS1 gene had a protective effect when present in
genotypes not harbouring KIR3DL1 but also having KIR genes with Conict of interest statement
HLA-C2 specicity such as KIR2DL2,  2DS2 and  2DL3. These
results are in partial agreement with those published previously None declared.
[30,6062]. However, our results did not conrm previously pub-
lished ndings relating KIR2DS3 or the number of activating KIR
genes to haematological malignancy [59,63,64]. Summary
Based on these ndings we believe that KIR genes with HLA-C
specicity render either NK cells or KIR bearing T cells more capable Killer-cell immunoglobulin-like receptors (KIR) are membrane
of detecting and eradicating malignant cells by providing them with bound proteins expressed by natural killer (NK) cells and CD8
an activation-prone repertoire or by enhancing the detection of lymphocytes. To date 17 KIR genes and 614 alleles of these have
higher levels of peptide-MHC class I complexes as has been been shown to exist. The highly polymorphic genetic system o
suggested for EBV associated Hodgkin's Lymphoma [60]. Finally, it Human Leucocyte Antigens (HLA) are ligands for some KIR proteins,
is the authors' belief that the protective effect identied by ID3 in the to date nearly 6000 alleles of HLA genes are known to exist. Tumour
form of the haplotype motif cB03 arises mainly from the KIR2DL3 cells such as those of leukaemia and lymphomas, downregulate their
and KIR2DS3S5 combination. This assumption is based on the fact expression of HLA in an attempt to evade recognition by the adaptive
that the ligand for KIR2DL5 has yet to be resolved. This together with immune system (i.e., CD8 lymphocytes). NK cells (part of the innate
the fact that some KIR2DL5 alleles are not expressed reduces the immune system) sense this loss of HLA expression through KIR and
likelihood of an important role for the KIR2DL3 and KIR2DL5 respond vigorously by killing the malignant target cells or by
combination. In contrast, the extracellular immunoglobulin-like releasing cytokines which recruit other immune cells to destroy
domains of KIR2DL3 and KIR2DS3S5 exhibit high homology [65]. them. The daunting task of identifying epidemiologic associations
This notion is further supported by the fact (1) that neither KIR2DL1 between diverse gene systems and neoplastic diseases is more easily
nor  2DL3 are on their own important steps in the ID3 decision dealt with through computers and articial intelligence algorithms.
processes, (2) that KIR2DL1,  2DL3,  2DL5 and  2DS3S5 are also In this manuscript we describe our results at applying support vector
present in other haplotype motifs which were not identied as being machine articial intelligence algorithms (SVMs) to identify KIR gene
important such as cA01 for KIR2DL1 and  2DL3 combination, cB01 associations in a cohort of patients with haematological malignan-
for the KIR2DL1,  2Dl5 and  2DS3S5 combination. This could be cies. A total of 300 clinically healthy unrelated blood donors (used as
interpreted as resulting in a redundant protective KIR repertoire that controls) and 43 samples obtained from patients with haematologi-
renders KIR-bearing lymphocytes capable of both detecting the cal malignancies (25 with leukaemia and 18 with lymphomas) were
absence of HLA-C1 allotypes resulting from tumour immune evasion genotyped for the 17 KIR genes using an in-house DNA-based PCR-
strategies or the presence of up-regulated HLA-C1 allotypes resulting SSP approach. KIR genotyping strings and clinical outcome (healthy
from increased MHC expression as has been suggested for certain or diseased) were encoded into a matrix le which was then
Hodgkin's lymphomas [66]. Although the molecular genotyping subjected to analysis through a SVM algorithm and the algorithm's
method that we employed did not allow us to include information performance at classifying samples into healthy and diseased eval-
on KIR gene number variations, high-resolution allele typing's, HLA uated and compared to that achieved on an entirely random data
data and the position of genes within specic haplotype motifs, we matrix (set 1) as well as to an additional articial data matrix (set 2)
would not expect the SVM to encounter any difculties at tackling generated using real KIR genotyping strings randomly assigned
the information once made available. healthy or ill status. As expected, the classication error plots
followed a logical stratication of results where the randomly
generated matrix (set 1) produced high overall classication errors
4. Conclusions and real data produced the lowest, with the second articial dataset
lying in between these two. Our results provide further support
While the importance of KIR genes having HLA-C specicity has linking KIR genes with susceptibility to malignant diseases and an
also been suggested by the aforementioned leukaemia suscept- alternative state of the art AI approach to the analysis of complex
ibility studies, our results provide a novel insight into the different genetic systems in epidemiology. We feel that similar approaches
functional arrangements of genes that might be involved. will aid future molecular epidemiology studies as well as in devel-
Although the size and heterogeneity of our study cohort together oping clinical decision support systems.
with the lack of HLA typing data limits the clinical inferences that
can be made from our results, it sets an example for a different
way of analysing the clinical and functional relevance of complex Acknowledgements
genetic systems. Furthermore, our results highlight the possibili-
ties of applying computational supervised learning methods to The authors wish to thank Dr. Oscar Prez Ramrez and Dr.
analyse data and recognise patterns capable of being used in Arturo Snchez Arriaga of the Haematology Service and Blood
clinical decision-making. The capabilities of articial neural net- Bank of Hospital Central Dr. Ignacio Morones Prieto for referring
works at predicting acute graft-vs.-host disease in patients with the patient samples that made this work possible. To Dr. Daniel E.
thalassaemia subjected to unrelated donor HSCT have been pre- Noyola of the Virology Laboratory, Facultad de Medicina Universi-
viously demonstrated [67]. As such we have developed an alter- dad Autnoma de San Luis Potos for proofreading the manuscript.
native and state of the art SVM approach to analyse KIR patterns This work was funded by Grants provided from Universidad
and provide a simple clinical DSS that incorporates such data. We Autnoma de San Luis Potos (P/PIFI2009-24MSU0011E-12) and
consider that this SVM approach will reveal previously unknown Fondo Sectorial de Investigacin en Salud y Seguridad Social
J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062 2061

(SALUD-2009-01-115226). The study sponsor had no role in study [23] R.T. Costello, C. Fauriat, S. Sivori, E. Marcenaro, D. Olive, NK cells: innate
design, collection, analysis and interpretation of data. immunity against hematological malignancies? Trends in Immunology 25
(2004) 328.
[24] L. Ruggeri, M. Capanni, M. Casucci, I. Volpi, A. Tosti, K. Perruccio, E. Urbani, R.
S. Negrin, M.F. Martelli, A. Velardi, Role of natural killer cell alloreactivity in HLA-
mismatched hematopoietic stem cell transplantation, Blood 94 (1999) 333.
[25] S. Cooley, E. Trachtenberg, T.L. Bergemann, K. Saeteurn, J. Klein, C.T. Le, S.
References G. Marsh, L.A. Guethlein, P. Parham, J.S. Miller, D.J. Weisdorf, Donors with
group B KIR haplotypes improve relapse-free survival after unrelated
[1] V. Litwin, J. Gumperz, P. Parham, J.H. Phillips, L.L. Lanier, Specicity of HLA hematopoietic cell transplantation for acute myelogenous leukemia, Blood
class I antigen recognition by human NK clones: evidence for clonal hetero- 113 (2009) 726.
geneity, protection by self and non-self alleles, and inuence of the target cell [26] S.M. Davies, L. Ruggieri, T. DeFor, J.E. Wagner, D.J. Weisdorf, J.S. Miller,
type, Journal of Experimental Medicine 178 (1993) 1321. A. Velardi, B.R. Blazar, Evaluation of KIR ligand incompatibility in mismatched
[2] A. Moretta, C. Bottino, D. Pende, G. Tripodi, G. Tambussi, O. Viale, A. Orengo, unrelated donor hematopoietic transplants. Killer immunoglobulin-like recep-
M. Barbaresi, A. Merli, E. Ciccone, Identication of four subsets of human CD3- tor, Blood 100 (2002) 3825.
CD16 natural killer (NK) cells by the expression of clonally distributed [27] B. Gansuvd, M. Hagihara, Y. Yu, H. Inoue, Y. Ueda, T. Tsuchiya, A. Masui,
functional surface molecules: correlation between subset assignment of NK K. Ando, Y. Nakamura, N. Munkhtuvshin, S. Kato, J.M. Thomas, T. Hotta, Human
clones and ability to mediate specic alloantigen recognition, Journal of umbilical cord blood NK T cells kill tumors by multiple cytotoxic mechanisms,
Experimental Medicine 172 (1990) 1589. Human Immunology 63 (2002) 164.
[3] J. Trowsdale, Genetic and functional relationships between MHC and NK [28] S. Giebel, F. Locatelli, T. Lamparelli, A. Velardi, S. Davies, G. Frumento,
receptor genes, Immunity 15 (2001) 363. R. Maccario, F. Bonetti, J. Wojnar, M. Martinetti, F. Frassoni, G. Giorgiani,
[4] K.C. Hsu, X.R. Liu, A. Selvakumar, E. Mickelson, R.J. O'Reilly, B. Dupont, Killer Ig- A. Bacigalupo, J. Holowiecki, Survival advantage with KIR ligand incompat-
like receptor haplotype analysis by gene content: evidence for genomic ibility in hematopoietic stem cell transplantation from unrelated donors,
diversity with a minimum of six basic framework haplotypes, each with Blood 102 (2003) 814.
multiple subsets, Journal of Immunology 169 (2002) 5118. [29] K.C. Hsu, T. Gooley, M. Malkki, C. Pinto-Agnello, B. Dupont, J.D. Bignon,
[5] C.-W. Pyo, L.A. Guethlein, Q. Vu, R. Wang, L. Abi-Rached, P.J. Norman, S.G. M. Bornhauser, F. Christiansen, A. Gratwohl, Y. Morishima, M. Oudshoorn,
E. Marsh, J.S. Miller, P. Parham, D.E. Geraghty, Different patterns of evolution in O. Ringden, J.J. van Rood, E. Petersdorf, G. International Histocompatibility
the centromeric and telomeric regions of group A and B haplotypes of the Working, KIR ligands and prediction of relapse after unrelated donor
human killer cell Ig-like receptor locus, PLoS One 5 (2010) e15115. hematopoietic cell transplantation for hematologic malignancy, Blood and
[6] K.S. Campbell, M. Colonna, Human natural killer cell receptors and signal Marrow Transplantation 12 (2006) 828.
transduction, International Reviews of Immunology 20 (2001) 333. [30] K. Stringaris, S. Adams, M. Uribe, R. Eniafe, C.O. Wu, B.N. Savani, A.J. Barrett, K.I.
[7] V. Renard, A. Cambiaggi, F. Vely, M. Blery, L. Olcese, S. Olivero, M. Bouchet, R. Donor, Genes 2DL5A, 2DS1 and 3DS1 are associated with a reduced rate of
E. Vivier, Transduction of cytotoxic signals in natural killer cells: a general leukemia relapse after HLA-identical sibling stem cell transplantation for
model of ne tuning between activatory and inhibitory pathways in lympho- acute myeloid leukemia but not other hematologic malignancies, Blood and
cytes, Immunological Reviews 155 (1997) 205. Marrow Transplantation 16 (2010) 1257.
[8] C. Cantoni, M. Falco, A. Pessino, A. Moretta, L. Moretta, R. Biassoni, P49, a [31] H.J. Symons, M.S. Leffell, N.D. Rossiter, M. Zahurak, R.J. Jones, E.J. Fuchs,
putative HLA-G1 specic inhibitory NK receptor belonging to the immunoglo- Improved survival with inhibitory killer immunoglobulin receptor (KIR) gene
bulin superfamily, Journal of Reproductive Immunology 43 (1999) 157. mismatches and KIR haplotype B donors after nonmyeloablative, HLA-
[9] C. Dohring, D. Scheidegger, J. Samaridis, M. Cella, M. Colonna, A human killer haploidentical bone marrow transplantation, Blood and Marrow Transplanta-
inhibitory receptor specic for HLA-A1,2, Journal of Immunology 156 (1996) tion 16 (2010) 533.
3098. [32] L. Ruggeri, M. Capanni, E. Urbani, K. Perruccio, W.D. Shlomchik, A. Tosti,
[10] J.E. Gumperz, J.C. Paterson, V. Litwin, N. Valiante, L.L. Lanier, P. Parham, A. S. Posati, D. Rogaia, F. Frassoni, F. Aversa, M.F. Martelli, A. Velardi, Effectiveness
M. Little, Specicity of two anti-class I HLA monoclonal antibodies that block of donor natural killer cell alloreactivity in mismatched hematopoietic
class I recognition by the NKB1 killer cell inhibitory receptor, Tissue Antigens transplants, Science 295 (2002) 2097.
48 (1996) 278. [33] P.H. Basse, T.L. Whiteside, W. Chambers, R.B. Herberman, Therapeutic activity of
[11] A. Moretta, M. Vitale, C. Bottino, A.M. Orengo, L. Morelli, R. Augugliaro, NK cells against tumors, International Reviews of Immunology 20 (2001) 439.
M. Barbaresi, E. Ciccone, L. Moretta, P58 molecules as putative receptors for [34] M.J. Smyth, K.Y. Thia, S.E. Street, E. Cretney, J.A. Trapani, M. Taniguchi,
major histocompatibility complex (MHC) class I molecules in human natural T. Kawano, S.B. Pelikan, N.Y. Crowe, D.I. Godfrey, Differential tumor surveil-
killer (NK) cells. Anti-p58 antibodies reconstitute lysis of MHC class I- lance by natural killer (NK) and NKT cells, Journal of Experimental Medicine
protected cells in NK clones displaying different specicities, Journal of 191 (2000) 661.
Experimental Medicine 178 (1993) 597. [35] R. Biassoni, A. Pessino, A. Malaspina, C. Cantoni, C. Bottino, S. Sivori, L. Moretta,
[12] K. Krre, How to recognize a foreign submarine, Immunological Reviews 155 A. Moretta, Role of amino acid position 70 in the binding afnity of p50.1 and
(1997) 5. p58.1 receptors for HLA-Cw4 molecules, European Journal of Immunology 27
[13] L.L. Lanier, J.H. Phillips, Inhibitory MHC class I receptors on NK cells and T cells, (1997) 3095.
Immunology Today 17 (1996) 86. [36] M. Vales-Gomez, H.T. Reyburn, M. Mandelboim, J.L. Strominger, Kinetics of
[14] G. Alter, M.P. Martin, N. Teigen, W.H. Carr, T.J. Suscovich, A. Schneidewind, interaction of HLA-C ligands with natural killer cell inhibitory receptors,
H. Streeck, M. Waring, A. Meier, C. Brander, J.D. Lifson, T.M. Allen, Immunity 9 (1998) 337.
M. Carrington, M. Altfeld, Differential natural killer cell-mediated inhibition [37] C.C. Winter, E.O. Long, A single amino acid in the p58 killer cell inhibitory
of HIV-1 replication based on distinct KIR/HLA subtypes, Journal of Experi- receptor controls the ability of natural killer cells to discriminate between the
mental Medicine 204 (2007) 3027. two groups of HLA-C allotypes, Journal of Immunology 158 (1997) 4026.
[15] M. Carrington, M.P. Martin, J. van Bergen, KIR-HLA intercourse in HIV disease, [38] C. Cortes, V. Vapnik, Machine Learning Support Vector Networks, AT&T Labs
Trends in Microbiology 16 (2008) 620. Research, USA (1995) 1995; 273.
[16] S.J. Holm, K. Sakuraba, L. Mallbris, K. Wolk, M. Stahle, F.O. Sanchez, Distinct [39] N. Cristianini, J. Shawe-Taylor, Support Vector Machines and Other Kernel-
HLA-C/KIR genotype prole associates with guttate psoriasis, Journal of Based Learning Methods, Cambridge University Press, UK, 2000.
Investigative Dermatology 125 (2005) 721. [40] T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning: Data
[17] D.P. Lowe, M.A. Cook, S.J. Bowman, D.C. Briggs, U.K. Sjgren's, Interest group, Mining, Inference, and Prediction. Springer Series in Statitstics, Springer,
association of killer cell immunoglobulin-like receptors with primary Sjo- Germany, 2001.
gren's syndrome, Rheumatology (Oxford) 48 (2009) 359. [41] J. Shawe-Taylor, N. Cristianini, Kernel Methods for Pattern AnalysisCambridge
[18] D. Middleton, I. Halfpenny, A. Meenagh, F. Williams, J. Sivula, E. Tuomilehto- University Press, UK, 2004.
Wolf, Investigation of KIR gene frequencies in type 1 diabetes mellitus, Human [42] C.A. Garcia-Sepulveda, E. Carrillo-Acuna, S.E. Guerra-Palomares, M. Barriga-
Immunology 67 (2006) 986. Moreno, Maxiprep genomic DNA extractions for molecular epidemiology
[19] J. Zuniga, V. Romero, J. Azocar, J.N.H. Stern, O. Clavijo, I. Almeciga, L. Encinales, studies and biorepositories, Molecular Biology Reports 37 (2010) 1883.
A. Avendano, M. Fridkis-Hareli, J.P. Pandey, E.J. Yunis, Interaction of KIR genes [43] D.L. Alvarado-Hernandez, D. Hernandez-Ramirez, D.E. Noyola, C.A. Garcia-
and G1M immunoglobulin allotypes confer susceptibility to type 2 diabetes in Sepulveda, KIR gene diversity in Mexican mestizos of San Luis Potosi,
Puerto Rican Americans, Human Immunology 67 (2006) 907. Immunogenetics 63 (2011) 561.
[20] R.B. Herberman, M.E. Nunn, H.T. Holden, D.H. Lavrin, Natural cytotoxic [44] S.R. Gunn, Support Vector Machines for Classication and Regression, Uni-
reactivity of mouse lymphoid cells against syngeneic and allogeneic tumors. versity of Southampton, UK, 1998.
II. Characterization of effector cells, International Journal of Cancer 16 (1975) [45] S. Haykin, Neural Networks: A Comprehensive Foundation, MacMillan Pub-
230. lishing Company, UK, 1999.
[21] R. Kiessling, E. Klein, H. Pross, H. Wigzell, Natural killer cells in the mouse. II. [46] T. Fletcher, Support Vector Machines Explained, University College London,
Cytotoxic cells with specicity for mouse Moloney leukemia cells. Character- UK, 2009.
istics of the killer cell, European Journal of Immunology 5 (1975) 117. [47] A.J. Chippereld, P.J. Fleming, H. Pohlheim, C.M. Fonseca, Genetic Algorithm
[22] R. Kiessling, E. Klein, H. Wigzell, Natural killer cells in the mouse. I. Cytotoxic Toolbox for use with MATLAB, in, Automatic Control and Systems Engineering,
cells with specicity for mouse Moloney leukemia cells. Specicity and University of Shefeld, 1996.
distribution according to genotype, European Journal of Immunology 5 [48] D.E. Goldberg, Genetic Algorithms in Search, Optimisation, and Machine
(1975) 112. Learning, Addison-Wesley, USA, 1989.
2062 J.C. Cuevas Tello et al. / Computers in Biology and Medicine 43 (2013) 20532062

[49] J.P. Ignizio, Introduction to Expert Systems, Mc Graw Hill, USA, 1991. [63] F. Shahsavar, N. Tajik, K.Z. Entezami, M. Fallah Radjabzadeh, B. Asadifar,
[50] M.H. Zweig, G. Campbell, Receiver-operating characteristic (ROC) plots: a K. Alimoghaddam, M. Ostadali Dahaghi, A. Jalali, A. Ghashghaie,
fundamental evaluation tool in clinical medicine, Clinical Chemistry 39 (1993) A. Ghavamzadeh, KIR2DS3 is associated with protection against acute myeloid
561. leukemia, Iranian Journal of Immunology 7 (2010) 8.
[51] T. Fawcett, in: I.E.T. Laboratory (Ed.), ROC Graphs: Notes and Practical [64] G.Q. Wu, Y.M. Zhao, X.Y. Lai, Y. Luo, Y.M. Tan, J.M. Shi, L. Li, W.Y. Zheng, J. Zhang,
Considerations for Researchers, Hewlet-Packard Laboratories, Palo Alto, Cali- X.R. Hu, A.Y. Jin, J.S. He, W.Z. Xie, X.J. Ye, Z. Cai, M.F. Lin, H. Huang, The
fornia, 2004. benecial impact of missing KIR ligands and absence of donor KIR2DS3 gene
[52] M.A. Barber, T. Zhang, B.A. Gagne, C.L. Sentman, NK cells negatively regulate on outcome following unrelated hematopoietic SCT for myeloid leukemia in
antigen presentation and tumor-specic CTLs in a syngeneic lymphoma the Chinese population, Bone Marrow Transplantation 45 (2010) 1514.
model, Journal of Immunology 178 (2007) 6140. [65] D. Ordonez, A. Meenagh, N. Gomez-Lozano, J. Castano, D. Middleton,
[53] M. Bernal, P. Garrido, P. Jimenez, R. Carretero, M. Almagro, P. Lopez, P. Navarro, C. Vilches, Duplication, mutation and recombination of the human orphan
F. Garrido, F. Ruiz-Cabello, Changes in activatory and inhibitory natural killer gene KIR2DS3 contribute to the diversity of KIR haplotypes, Genes and
(NK) receptors may induce progression to multiple myeloma: implications for Immunity 9 (2008) 431.
tumor evasion of T and NK cells, Human Immunology 70 (2009) 854. [66] M.K. Gandhi, J.T. Tellam, R. Khanna, Epstein-Barr virus-associated Hodgkin's
[54] C.D. Brenner, S. King, M. Przewoznik, I. Wolters, C. Adam, G.W. Bornkamm, D. lymphoma, British Journal of Haematology 125 (2004) 267.
H. Busch, M. Rocken, R. Mocikat, Requirements for control of B-cell lymphoma [67] G. Caocci, R. Baccoli, A. Vacca, A. Mastronuzzi, A. Bertaina, E. Piras, R. Littera,
by NK cells, European Journal of Immunology 40 (2010) 494. F. Locatelli, C. Carcassi, G. La Nasa, Comparison between an articial neural
[55] L. Fischer, O. Penack, C. Gentilini, A. Nogai, A. Muessig, E. Thiel, L. Uharek, The network and logistic regression in predicting acute graft-vs-host disease after
anti-lymphoma effect of antibody-mediated immunotherapy is based on an unrelated donor hematopoietic stem cell transplantation in thalassemia
increased degranulation of peripheral blood natural killer (NK) cells, Experi- patients, Experimental Hematology 38 (2010) 426.
mental Hematology 34 (2006) 753.
[56] S. Palmer, C.A. Hanson, C.S. Zent, L.F. Porrata, B. Laplant, S.M. Geyer, S.
N. Markovic, T.G. Call, D.A. Bowen, D.F. Jelinek, N.E. Kay, T.D. Shanafelt,
Prognostic importance of T and NK-cells in a consecutive series of newly
diagnosed patients with chronic lymphocytic leukaemia, British Journal of
Juan Carlos Cuevas received a Masters degree in Computer Sciences (Articial
Haematology 141 (2008) 607.
Intelligence) from Universidad Nacional Autnoma de Mxico and a PhD degree on
[57] A. Uchida, M. Yagita, H. Sugiyama, T. Hoshino, M. Moore, Strong natural killer
Computer Sciences and Articial Intelligence from Birmingham University, United
(NK) cell activity in bone marrow of myeloma patients: accelerated matura-
Kingdom. He is currently full-time research professor at the Faculty of Engineering
tion of bone marrow NK cells and their interaction with other bone marrow
Universidad Autnoma de San Luis Potosi.
cells, International Journal of Cancer 34 (1984) 375.
[58] J.R. Passweg, B. Huard, J.-M. Tiercy, E. Roosnek, HLA and KIR polymorphisms
affect NK-cell anti-tumor activity, Trends in Immunology 28 (2007) 437.
[59] L. Karabon, A. Jedynak, S. Giebel, D. Woowiec, M. Kielbinski, D. Woszczyk,
K. Kapelko-Slowik, K. Kuliczkowski, I. Frydecka, KIR/HLA gene combinations
inuence susceptibility to B-cell chronic lymphocytic leukemia and the Daniel Hernndez-Ramrez is currently working towards his PhD degree at the
clinical course of disease, Tissue Antigens 78 (2011) 129. Human and Viral Genomics Laboratory of the Facultad de Medicina Universidad
[60] C. Besson, S. Roetynck, F. Williams, L. Orsi, C. Amiel, C. Lependeven, G. Antoni, Autnoma de San Luis Potosi, in San Luis Potosi, Mxico. His project relates to
O. Hermine, P. Brice, C. Ferme, P. Carde, D. Canioni, J. Briere, M. Raphael, J. dening the clinical relevance of HLA and KIR diversity on HIV infections amongst
C. Nicolas, J. Clavel, D. Middleton, E. Vivier, L. Abel, Association of killer cell Mexican mestizos.
immunoglobulin-like receptor genes with Hodgkin's lymphoma in a familial
study, PLoS One 2 (2007) e406.
[61] E. Marcenaro, S. Carlomagno, S. Pesce, M. Della Chiesa, A. Moretta, S. Sivori,
Role of alloreactive KIR2DS1( ) NK cells in haploidentical hematopoietic stem
cell transplantation, Journal of Leucocyte Biology 90 (2011) 661.
[62] D. Marin, I.H. Gabriel, S. Ahmad, L. Foroni, H. de Lavallade, R. Clark, S. O'Brien, Christian A. Garca-Seplveda received a Medical degree from Facultad de
R. Sergeant, C. Hedgley, D. Milojkovic, J.S. Khorashad, M. Bua, A. Alsuliman, Medicina, Universidad Autnoma de Coahuila, Mxico and a PhD degree from
A. Khoder, K. Stringaris, N. Cooper, J. Davis, J.M. Goldman, J.F. Apperley, University College London. He is currently a full-time research/professor and head
K. Rezvani, KIR2DS1 genotype predicts for complete cytogenetic response and researcher at the Human and Viral Genomics Laboratory of the Facultad de
survival in newly diagnosed chronic myeloid leukemia patients treated with Medicina Universidad Autnoma de San Luis Potosi, in San Luis Potosi, Mxico.
imatinib, Leukemia 26 (2012) 296.

You might also like