Professional Documents
Culture Documents
com/naturemedicine
LETTERS
1Satoris, Inc., 2686 Middlefield Road, Suite E, Redwood City, California 94063, USA. 2Department of Neurology and Neurological Sciences, Stanford University
School of Medicine, 300 Pasteur Drive, Stanford, California 94305-5235, USA. 3Department of Neurology, Memory and Aging Center, 350 Parnassus Avenue,
Suite 706, San Francisco, California 94117, USA. 4Institute of Clinical Neuroscience, Department of Experimental Neuroscience, Sahlgrenska University Hospital,
University of Goteborg, Bla straket 15, 431 80 Molndal, Sweden. 5Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine,
300 Pasteur Drive, Stanford, California 94305-5550, USA. 6Department of Neurosciences, University of California, San Diego, 9500 Gilman Drive #9127, La Jolla,
California 92093-9127, USA. 7Department of Internal Medicine and Allergology, Wroclaw Medical University, Traugutta 57, 50-417 Wroclaw, Poland. 8Layton Aging &
Alzheimers Disease Center, Oregon Health Sciences University, 3181 Southwest Sam Jackson Park Road, CR131, Portland, Oregon 97201-3098, USA. 9Department
of Psychiatry, Wroclaw Medical University, Pasteura 10, 51-622 Wroclaw, Poland. 10Clinical Memory Research Unit, Department of Clinical Sciences Malmo, Lund
University, Universitetssjukhuset MAS, Ingang 56 plan 7, SE-205 02 Malmo, Sweden. 11Division of Immunology and Rheumatology, Stanford University School
of Medicine, 300 Pasteur Drive, Stanford, California 94305-5166, USA. 12Sun Health Research Institute, 10515 West Santa Fe Drive, Sun City, Arizona 85351, USA.
13Department of Neurosciences, Ophthalmology, and Genetics, University of Genoa, Via A. De Toni 5, 16132 Genoa, Italy. 14Department of Health Research and Policy,
Stanford University School of Medicine, Stanford, 300 Pasteur Drive, California 94305-5405, USA. 15Geriatric Research, Education and Clinical Center, Veterans
Affairs Palo Alto Health Care System, 3801 Miranda Avenue, Palo Alto, California 94304, USA. 16These authors contributed equally to this work. Correspondence
should be addressed to T.W.-C. (twc@stanford.edu).
NATURE MEDICINE
VOLUME 13
NUMBER 11
NOVEMBER 2007
1359
Predictor discovery
in training set
80
Minimal number of predictors
Training set
(43 AD, 40 NDC)
Test set AD
clinically diagnosed AD
(42 AD, 39 NDC, 11 OD)
SAM
Cluster
analysis
Classifier
training
18 predictors
60
120
108
96
88
74
65
52
46
40
33
30
22
18
16
13
12
10
7
7
5
5
4
4
2
2
1
1
1
1
0
LETTERS
40
20
Ten-fold
cross-validation
Training
Cross-validation
Classify
AD and NDC
Classify
diagnosed AD
as AD
MCI
Classify
AD converters
as AD
0.5
AD cluster
1.0
1.5
2.0
Threshold
2.5
3.0
3.5
NDC cluster
CCL18
ANG-2
IL-11
G-CSF
IGFBP-6
ICAM-1
CXCL8
TRAIL-R4
CCL5
PDGF-BB
EGF
GDNF
TNF-
CCL7
CCL15
M-CSF
CCL22
IL-3
IL-1
Clinical diagnosis of
plasma sample donor
AD
NDC
Figure 1 Study outline, clustering of training set, and predictor discovery. (a) Informed consent was obtained from all human subjects according to the ethics
committee guidelines at the respective academic centers. A total of 223 plasma samples were separated into a training set and two test sets as indicated
(Supplementary Table 1). Changes in relative signaling protein concentrations were initially analyzed with SAM, and this was followed by cluster analysis (1).
To discover predictors for classification, we analyzed the training set by PAM (2), then used the PAM predictors to classify the samples in the independent
test set AD (3). Class prediction of presymptomatic Alzheimers disease (AD) was performed on samples from individuals who were diagnosed with MCI at
the date of blood draw (test set MCI, 4). None of the samples from the test sets was used for any part of the predictor discovery process. (b) We analyzed
normalized array measurements of 120 plasma signaling proteins in the training set with SAM to discover differences in protein abundance between samples
of subjects with Alzheimers disease and those of NDC subjects. The nineteen proteins that obtained a significant d-score (q r 3.4%) are presented in a
heat map generated with an unsupervised cluster algorithm. Samples are arranged in columns, proteins in rows. Red shades, increased expression in
Alzheimers disease samples as compared to NDC samples; blue shades, reduced expression; white, median expression. Samples are clustered into AD and
NDC categories as indicated by the first-order branches of the dendrogram (two black bars at the top). (c) Predictor discovery by PAM was performed with
normalized array measurements of 120 signaling proteins in the training set. In training (gray line) and internal cross-validation (black line), decreasing the
centroid threshold (lower x-axis) resulted in an increase in the number of markers (inserted upper x-axis) that were used for classification and calculation of
the classification error (y-axis). This led to the discovery of a minimal set of 18 predictors with lowest possible classification error.
1360
VOLUME 13
NUMBER 11
NOVEMBER 2007
NATURE MEDICINE
LETTERS
Clinical
diagnosis
Training set
n = 83
AD
NDC
n=
43
N
eu
C rop
e r
N reb ote
eu ro c
H rot va tion
e r s
N ma op cul
e t h a
Ph uro opo ic a r fu
a de ie ct nc
In goc ge sis ion tio
n
n
f
Enlam yto era
er ma sis tio
n
Ex gy h tion
pr om
e
Tr sse eo
an d sta
C spo in C sis
ha rt
n
a NS
C ge cro
ha s
ng wit ss B
ed h a B
in gin B
AD g
40
PAM
CCL18
ANG-2
Classified
as AD
41
Classified
as non-AD
33
95%
83%
IGFBP-6
CXCL8
ICAM-1
IL-11
Percent
positive negative
agreement (8499%) (6793%)
with clinical
89%
diagnosis
overall
TRAIL-R4
CCL15
CCL7
P < 0.001
b
Clinical
diagnosis
n=
Test set AD
n = 92
AD
NDC OD
EGF
42
G-CSF
39
CCL5
GDNF
11
M-CSF
PAM
TNF-
IL-3
Classified
as AD
38
Classified
as non-AD
34
10
IL-1
PDGF-BB
3.5
3.0
2.5
2.0
90%
88%
Percent
positive
negative
agreement (7797%)
(7695%)
with clinical
89%
diagnosis
1.0
1.5
2.0
2.5
No PubMed
reports
d-score
overall
P < 0.001
c
Time 0:
diagnosis and
blood draw
47
Percent agreement
PAM
class prediction with follow-up diagnosis
AD Non-AD
91%
positive
20
2
Follow-up
diagnosis
.6
29.6
14
27.8 1.6
AD
22
(7299%)
81%
OD
8
overall
P < 0.001
8
72%
NA
negative
(5188%)
MCI
17
10
NATURE MEDICINE
VOLUME 13
NUMBER 11
NOVEMBER 2007
1361
LETTERS
Table 1 Eighteen plasma signaling proteins that predict clinical
Alzheimers diagnosis
Predictors
d-score
q-value (%)
ANG-2
CCL5
2.1
2.9
r0.05
r0.05
CCL7
CCL15
1.7
1.6
r0.05
r0.05
CCL18
CXCL8
1.9
1.7
3.1
3.1
EGF
G-CSF
2.7
1.9
r0.05
r0.05
GDNF
ICAM-1
1.8
2.2
r0.05
r0.05
IGFBP-6
IL-1a
1.5
2.9
3.1
r0.05
IL-3
IL-11
2.0
2.1
r0.05
r0.05
M-CSF
2.4
r0.05
PDGF-BB
TNF-a
3.4
2.6
r0.05
r0.05
TRAIL-R4
1.8
3.1
In the training set, predictor discovery by PAM identified 18 predictors from the
normalized array measurements of 120 signaling proteins. SAM was used to calculate
d-scores indicating the relative positive (increased) and negative (decreased) changes
in concentration of these proteins in plasma of subjects with Alzheimers disease in
comparison to NDC subjects. SAM calculates a minimal false discovery rate (q-value) for
significance. ANG-2, angiopoietin-2. CCL, chemokine that contains a C-C motif; CXCL,
chemokine that contains a C-X-C motif; G-CSF, granulocyte-colony stimulating factor;
GDNF, glial-derived neurotrophic factor; ICAM-1, intercellular adhesion molecule-1;
IGFBP-1, insulin-like growth factorbinding protein-6; IL, interleukin; PDGF-BB,
platelet-derived growth factor BB; TRAIL-R4, TNF-related apoptosis-inducing
ligand receptor-4.
1362
VOLUME 13
NUMBER 11
NOVEMBER 2007
NATURE MEDICINE