You are on page 1of 34

ITM Assignment 1:

SPSS:Validating a Medical
Database

Group Members:

Abhik Mondal M1701


Achin M1702
Amartya Das M1705
Ankur Kunti M1708
Kuldeep Shrivastava M1715
Nayan Nagar M1717
Saubhik Mondal M1724
Yash Kumar M1738
Yogesh Kumar M1739

Your temporary usage period for IBM SPSS Statistics will expire in
9 days.

GET

FILE='C:\PROGRA~1\IBM\SPSS\STATIS~1\SUBSCR~1\Samples\English\strok
e_invalid.sav'.
DATASET NAME DataSet1 WINDOW=FRONT.
* Validate Data.
VALIDATEDATA VARIABLES=hospsize age agecat gender active obesity
diabetes bp af smoker choles
angina mi nitro anticlot tia time doa rankin0 catscan clotsolv
dhosp result surgery rehab los_rehab
cost rankin1 rankin2 rankin3 barthel1 barthel2 barthel3
recbart1 recbart2 recbart3 ID=hospid patid
physid
/VARCHECKS STATUS=ON PCTMISSING=70 PCTEQUAL=95 PCTUNEQUAL=90
CV=0.001 STDDEV=0
/IDCHECKS INCOMPLETE DUPLICATE
/CASECHECKS REPORTEMPTY=YES SCOPE=ALLVARS
/CASEREPORT DISPLAY=NO.

Validate Data

Notes
Output Created 09-NOV-2017 21:07:45
Comments
Input Data C:\PROGRA~1\IBM\SPSS\S
TATIS~1\SUBSCR~1\Sampl
es\English\stroke_invalid.sav
Active Dataset DataSet1
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working Data 1183
File
Syntax VALIDATEDATA
VARIABLES=hospsize age
agecat gender active obesity
diabetes bp af smoker choles
angina mi nitro anticlot tia
time doa rankin0 catscan
clotsolv dhosp result surgery
rehab los_rehab
cost rankin1 rankin2
rankin3 barthel1 barthel2
barthel3 recbart1 recbart2
recbart3 ID=hospid patid
physid
/VARCHECKS
STATUS=ON
PCTMISSING=70
PCTEQUAL=95
PCTUNEQUAL=90
CV=0.001 STDDEV=0
/IDCHECKS INCOMPLETE
DUPLICATE
/CASECHECKS
REPORTEMPTY=YES
SCOPE=ALLVARS
/CASEREPORT
DISPLAY=NO.
Resources Processor Time 00:00:00.09
Elapsed Time 00:00:00.09

[DataSet1]
C:\PROGRA~1\IBM\SPSS\STATIS~1\SUBSCR~1\Samples\English\stroke_inva
lid.sav

Warnings
Some or all requested output is not displayed because all cases,
variables, or data values passed the requested checks.

Identifier Checks

Incomplete Identifiers
Identifier
Attending
Case Hospital ID Patient ID physician ID
288 OZN 125304
573 6137798782 790697
774 2322241867 176466

Duplicate Identifiers
Cases with Identifier
Number of Duplicate Attending
Duplicate Identifiers Group Duplicates Identifiers Hospital ID Patient ID physician ID
1 2 10, 11 PBW 1406462419 355184
2 2 14, 15 PBW 2191527525 355184
3 2 21, 22 PBW 7237535360 616528
4 2 28, 29 NHV 4592215163 942982
5 2 30, 31 NHV 7628592330 371884
6 2 64, 65 NHV 0300750006 371884
7 2 83, 84 QWS 4590625286 215041
8 2 86, 87 QWS 6272818258 817329
9 2 96, 97 QWS 1959349605 215041
10 3 100, 101, 102 QWS 5856145337 817329
11 3 104, 105, 106 QWS 1543897849 817329
12 2 122, 123 QWS 9535631975 215041
13 2 144, 145 RLD 0052710039 560175
14 2 151, 152 RLD 5058356558 560175
15 2 156, 157 RLD 7779910241 695521
16 2 164, 165 OZN 2970608839 139142
17 2 168, 169 OZN 0165873576 125304
18 2 184, 185 OZN 9355198642 076538
19 2 186, 187 OZN 7080054743 543537
20 2 193, 194 OZN 4458028382 076538
21 2 195, 196 OZN 0240860711 064705
22 2 204, 205 OZN 9604580145 330764
23 2 215, 216 OZN 7063542003 330764
24 2 223, 224 OZN 3887804774 125304
25 2 226, 227 OZN 1263468815 076538
26 3 247, 248, 249 OZN 2232387897 543537
27 2 250, 251 OZN 1238314265 125304
28 2 265, 266 OZN 8298141345 064705
29 2 267, 268 OZN 9301391334 103254
30 2 273, 274 OZN 1038840465 103254
31 2 278, 279 OZN 0816869249 883285
32 3 283, 284, 285 OZN 6075306060 064705
33 2 293, 294 OZN 7778090176 103254
34 2 297, 298 OZN 9587618445 883285
35 2 316, 317 OZN 4261884692 064705
36 3 318, 319, 320 OZN 0309845691 568887
37 2 331, 332 WPA 7944046807 435959
38 2 340, 341 WPA 8943404305 723384
39 2 393, 394 WPA 5159262749 584261
40 3 402, 403, 404 WPA 7173763350 723384
41 3 416, 417, 418 WPA 8014448130 462020
42 2 427, 428 WPA 7195468517 519548
43 3 433, 434, 435 WPA 2247641363 652070
44 2 438, 439 WPA 3451187548 993921
45 2 443, 444 WPA 0900960756 652070
46 2 453, 454 WPA 7844608495 723384
47 2 468, 469 WPA 7975852705 462020
48 2 499, 500 NSR 5479016810 037350
49 2 501, 502 NSR 5116931652 891605
50 2 533, 534 NSR 2630060495 891605
51 3 550, 551, 552 SLB 1052452463 782916
52 2 556, 557 SLB 0050512328 506108
53 3 562, 563, 564 SLB 6461046805 991211
54 2 571, 572 SLB 3777715659 782916
55 3 585, 586, 587 GFG 1087536215 001448
56 2 594, 595 GFG 8883599310 618069
57 2 596, 597 GFG 3789467505 618069
58 2 606, 607 GFG 9523517563 828754
59 2 609, 610 GFG 4650288430 001448
60 2 612, 613 GFG 1915386172 327422
61 2 617, 618 GFG 2729367693 384787
62 2 621, 622 GFG 7974556228 877354
63 3 626, 627, 628 GFG 4544033081 384787
64 2 629, 630 GFG 3397891610 539412
65 2 635, 636 GFG 5253343659 749432
66 2 646, 647 GFG 3185981133 749432
67 2 675, 676 GFG 0247616654 255663
68 2 677, 678 GFG 1299881501 539412
69 2 683, 684 GFG 8045662694 384787
70 3 689, 690, 691 GFG 9575642563 539412
71 3 697, 698, 699 GFG 4030164769 877354
72 3 708, 709, 710 GFG 6553808330 176466
73 2 723, 724 GFG 5439750367 327422
74 2 734, 735 GFG 4306101109 539412
75 2 740, 741 GFG 4103977868 176466
76 3 743, 744, 745 GFG 4332119819 749432
77 2 753, 754 GFG 7338746805 327422
78 2 755, 756 GFG 8242846541 539412
79 2 768, 769 GFG 4327606527 001448
80 2 776, 777 GFG 0450597917 255663
81 2 787, 788 WWL 0826392081 842368
82 2 793, 794 WWL 8349313732 348920
83 2 796, 797 WWL 4939931258 139809
84 2 820, 821 WWL 0825993537 139809
85 2 823, 824 WWL 8879358705 574275
86 2 832, 833 WWL 4045084280 995409
87 2 852, 853 WWL 5030747136 139809
88 2 860, 861 WWL 6321524756 574275
89 2 878, 879 WWL 7056065019 751568
90 2 896, 897 EFX 2644234053 866015
91 3 912, 913, 914 EFX 4391632997 501318
92 2 925, 926 IZO 8005601131 493939
93 3 948, 949, 950 YYH 0437454972 086215
94 2 954, 955 YYH 7840326167 611470
95 2 956, 957 YYH 9138809953 611470
96 2 963, 964 YYH 3087535807 249058
97 2 975, 976 YYH 9746101913 249058
98 3 993, 994, 995 BLA 5582038692 846629
99 2 998, 999 BLA 7796142141 185703
100 3 1005, 1006, BLA 6044244232 657638
1007
101 3 1020, 1021, BLA 3383946667 846629
1022
102 2 1026, 1027 BLA 3796244473 185703
103 2 1035, 1036 ALK 1275385980 346348
104 2 1042, 1043 ALK 8548603108 346348
105 2 1052, 1053 ALK 0614590399 267830
106 2 1059, 1060 ALK 9581070966 700152
107 2 1072, 1073 ALK 6900285569 346348
108 2 1077, 1078 ALK 4510532534 392622
109 2 1097, 1098 ALK 0213143436 185787
110 2 1103, 1104 ALK 0819507927 392622
111 2 1152, 1153 ALK 0172987095 267830
112 2 1159, 1160 ALK 1840044155 700152
113 2 1168, 1169 ALK 9104785006 840883
114 2 1179, 1180 ALK 1147626734 700152

APPLY DICTIONARY
/FROM 'C:\Program
Files\IBM\SPSS\Statistics\Subscription\Samples\English\patient_los.sav'
/SOURCE VARIABLES=age agecat gender diabetes bp smoker choles active
obesity angina mi nitro anticlot time doa clotsolv result cost
/FILEINFO ATTRIBUTES=REPLACE
/VARINFO ATTRIBUTES=REPLACE.
* Delete existing single-variable validation rules.
DATAFILE ATTRIBUTE DELETE=$VD.SRule.
* Delete existing links between variables and rules.
VARIABLE ATTRIBUTE VARIABLES=ALL DELETE=$VD.SRuleRef.
* (Re)define single-variable validation rules.
DATAFILE ATTRIBUTE ATTRIBUTE=
$VD.SRule[1]("Label='0 to 1 Dichotomy', Type='Numeric', Domain='List',
FlagUserMissing='No', "+
"FlagSystemMissing='Yes', FlagBlank='No', CaseSensitive='No',List='0'
'1' ")
$VD.SRule[2]("Label='0 to 2 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='0' "+
"'1' '2' ")
$VD.SRule[3]("Label='0 to 3 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='0' "+
"'1' '2' '3' ")
$VD.SRule[4]("Label='1 to 4 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='1' "+
"'2' '3' '4' ")
$VD.SRule[5]("Label='Nonnegative integer', Type='Numeric',
Domain='Range', Minimum='0', "+
"Maximum='', FlagUserMissing='No', FlagSystemMissing='Yes',
FlagBlank='No', "+
"FlagNoninteger='Yes', FlagUnlabeled='No' ")
$VD.SRule[6]("Label='Nonnegative number', Type='Numeric',
Domain='Range', Minimum='0', "+
"Maximum='', FlagUserMissing='No', FlagSystemMissing='Yes',
FlagBlank='No', "+
"FlagNoninteger='No', FlagUnlabeled='No' ").
* (Re)define links between variables and rules.
VARIABLE ATTRIBUTE
VARIABLES=age ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_age'")
/VARIABLES=agecat ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_agecat'")
/VARIABLES=gender ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_gender'")
/VARIABLES=active ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_active'")
/VARIABLES=obesity ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_obesity'")
/VARIABLES=diabetes ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_diabetes'")
/VARIABLES=bp ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[2]',OutcomeVar='@0to2Categorical_bp'")
/VARIABLES=af ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_af'")
/VARIABLES=smoker ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_smoker'")
/VARIABLES=choles ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_choles'")
/VARIABLES=angina ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_angina'")
/VARIABLES=mi ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_mi'")
/VARIABLES=nitro ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_nitro'")
/VARIABLES=anticlot ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_anticlot'")
/VARIABLES=tia ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_tia'")
/VARIABLES=time ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_time'")
/VARIABLES=doa ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_doa'")
/VARIABLES=catscan ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_catscan'")
/VARIABLES=clotsolv ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_clotsolv'")
/VARIABLES=dhosp ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_dhosp'")
/VARIABLES=result ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_result'")
/VARIABLES=surgery ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[2]',OutcomeVar='@0to2Categorical_surgery'")
/VARIABLES=rehab ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_rehab'")
/VARIABLES=los_rehab ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_los_rehab'")
/VARIABLES=cost ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[6]',OutcomeVar='Nonnegativenumber_cost'")
/VARIABLES=recbart1 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart1'")
/VARIABLES=recbart2 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart2'")
/VARIABLES=recbart3 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart3'").
* 0 to 1 Dichotomy.
DO REPEAT #OV= @0to1Dichotomy_smoker @0to1Dichotomy_tia
@0to1Dichotomy_active @0to1Dichotomy_choles
@0to1Dichotomy_dhosp @0to1Dichotomy_gender @0to1Dichotomy_catscan
@0to1Dichotomy_obesity
@0to1Dichotomy_angina @0to1Dichotomy_nitro @0to1Dichotomy_doa
@0to1Dichotomy_diabetes
@0to1Dichotomy_mi @0to1Dichotomy_af
/#IV= smoker tia active choles dhosp gender catscan obesity angina nitro
doa diabetes mi af.
COMPUTE #OV=NOT(ANY(VALUE(#IV),0,1) OR (MISSING(#IV) AND
NOT(SYSMIS(#IV)))) OR SYSMIS(#IV).
END REPEAT.
* 0 to 2 Categorical.
COMPUTE @0to2Categorical_bp=NOT(ANY(VALUE(bp),0,1,2) OR (MISSING(bp) AND
NOT(SYSMIS(bp)))) OR
SYSMIS(bp).
COMPUTE @0to2Categorical_surgery=NOT(ANY(VALUE(surgery),0,1,2) OR
(MISSING(surgery) AND
NOT(SYSMIS(surgery)))) OR SYSMIS(surgery).
* 0 to 3 Categorical.
COMPUTE @0to3Categorical_rehab=NOT(ANY(VALUE(rehab),0,1,2,3) OR
(MISSING(rehab) AND
NOT(SYSMIS(rehab)))) OR SYSMIS(rehab).
COMPUTE @0to3Categorical_clotsolv=NOT(ANY(VALUE(clotsolv),0,1,2,3) OR
(MISSING(clotsolv) AND
NOT(SYSMIS(clotsolv)))) OR SYSMIS(clotsolv).
COMPUTE @0to3Categorical_anticlot=NOT(ANY(VALUE(anticlot),0,1,2,3) OR
(MISSING(anticlot) AND
NOT(SYSMIS(anticlot)))) OR SYSMIS(anticlot).
* 1 to 4 Categorical.
DO REPEAT #OV= @1to4Categorical_result @1to4Categorical_recbart2
@1to4Categorical_recbart3
@1to4Categorical_recbart1 @1to4Categorical_agecat
/#IV= result recbart2 recbart3 recbart1 agecat.
COMPUTE #OV=NOT(ANY(VALUE(#IV),1,2,3,4) OR (MISSING(#IV) AND
NOT(SYSMIS(#IV)))) OR SYSMIS(#IV).
END REPEAT.
* Nonnegative integer.
COMPUTE Nonnegativeinteger_age=NOT(VALUE(age)>=0 AND
VALUE(age)=TRUNC(VALUE(age)) OR (MISSING(age)
AND NOT(SYSMIS(age)))) OR SYSMIS(age).
COMPUTE Nonnegativeinteger_time=NOT(VALUE(time)>=0 AND
VALUE(time)=TRUNC(VALUE(time)) OR
(MISSING(time) AND NOT(SYSMIS(time)))) OR SYSMIS(time).
COMPUTE Nonnegativeinteger_los_rehab=NOT(VALUE(los_rehab)>=0 AND
VALUE(los_rehab)=TRUNC(VALUE(los_rehab)) OR (MISSING(los_rehab) AND
NOT(SYSMIS(los_rehab)))) OR
SYSMIS(los_rehab).
* Nonnegative number.
COMPUTE Nonnegativenumber_cost=NOT(VALUE(cost)>=0 OR (MISSING(cost) AND
NOT(SYSMIS(cost)))) OR
SYSMIS(cost).
* Mark rule outcome variables as such in SPSS Statistics data dictionary.
VARIABLE ATTRIBUTE VARIABLES=@0to1Dichotomy_smoker TO
Nonnegativenumber_cost
ATTRIBUTE=$VD.RuleOutcomeVar("Yes").
VARIABLE LABELS Nonnegativeinteger_age 'Nonnegative integer:Age in
years'.
VARIABLE LABELS @1to4Categorical_agecat '1 to 4 Categorical:Age
category'.
VARIABLE LABELS @0to1Dichotomy_gender '0 to 1 Dichotomy:Gender'.
VARIABLE LABELS @0to1Dichotomy_active '0 to 1 Dichotomy:Physically
active'.
VARIABLE LABELS @0to1Dichotomy_obesity '0 to 1 Dichotomy:Obesity'.
VARIABLE LABELS @0to1Dichotomy_diabetes '0 to 1 Dichotomy:History of
diabetes'.
VARIABLE LABELS @0to2Categorical_bp '0 to 2 Categorical:Blood pressure'.
VARIABLE LABELS @0to1Dichotomy_af '0 to 1 Dichotomy:Atrial fibrillation'.
VARIABLE LABELS @0to1Dichotomy_smoker '0 to 1 Dichotomy:Smoker'.
VARIABLE LABELS @0to1Dichotomy_choles '0 to 1 Dichotomy:Cholesterol'.
VARIABLE LABELS @0to1Dichotomy_angina '0 to 1 Dichotomy:History of
angina'.
VARIABLE LABELS @0to1Dichotomy_mi '0 to 1 Dichotomy:History of myocardial
infarction'.
VARIABLE LABELS @0to1Dichotomy_nitro '0 to 1 Dichotomy:Prescribed
nitroglycerin'.
VARIABLE LABELS @0to3Categorical_anticlot '0 to 3 Categorical:Taking
anti-clotting drugs'.
VARIABLE LABELS @0to1Dichotomy_tia '0 to 1 Dichotomy:History of transient
ischemic attack'.
VARIABLE LABELS Nonnegativeinteger_time 'Nonnegative integer:Time to
hospital'.
VARIABLE LABELS @0to1Dichotomy_doa '0 to 1 Dichotomy:Dead on arrival'.
VARIABLE LABELS @0to1Dichotomy_catscan '0 to 1 Dichotomy:CAT scan
result'.
VARIABLE LABELS @0to3Categorical_clotsolv '0 to 3 Categorical:Clot-
dissolving drugs'.
VARIABLE LABELS @0to1Dichotomy_dhosp '0 to 1 Dichotomy:Died in hospital'.
VARIABLE LABELS @1to4Categorical_result '1 to 4 Categorical:Treatment
result'.
VARIABLE LABELS @0to2Categorical_surgery '0 to 2 Categorical:Post-event
preventative surgery'.
VARIABLE LABELS @0to3Categorical_rehab '0 to 3 Categorical:Post-event
rehabilitation'.
VARIABLE LABELS Nonnegativeinteger_los_rehab 'Nonnegative integer:Length
of stay for '+
'rehabilitation'.
VARIABLE LABELS Nonnegativenumber_cost 'Nonnegative number:Total
treatment and rehabilitation '+
'costs in thousands'.
VARIABLE LABELS @1to4Categorical_recbart1 '1 to 4 Categorical:Recoded
Barthel index at 1 month'.
VARIABLE LABELS @1to4Categorical_recbart2 '1 to 4 Categorical:Recoded
Barthel index at 3 months'.
VARIABLE LABELS @1to4Categorical_recbart3 '1 to 4 Categorical:Recoded
Barthel index at 6 months'.
VALUE LABELS @0to1Dichotomy_smoker TO Nonnegativenumber_cost 1 'Invalid' 0
'Valid'.
FORMAT @0to1Dichotomy_smoker TO Nonnegativenumber_cost (F1.0).
VARIABLE WIDTH @0to1Dichotomy_smoker TO Nonnegativenumber_cost (4).
VARIABLE LEVEL @0to1Dichotomy_smoker TO Nonnegativenumber_cost (NOMINAL).
* Validate Data.
VALIDATEDATA VARIABLES=hospsize age agecat gender active obesity diabetes
bp af smoker choles
angina mi nitro anticlot tia time doa rankin0 catscan clotsolv dhosp
result surgery rehab los_rehab
cost rankin1 rankin2 rankin3 barthel1 barthel2 barthel3 recbart1
recbart2 recbart3 ID=hospid patid
physid
/VARCHECKS STATUS=ON PCTMISSING=70 PCTEQUAL=95 PCTUNEQUAL=90 CV=0.001
STDDEV=0
/IDCHECKS INCOMPLETE DUPLICATE
/CASECHECKS REPORTEMPTY=YES SCOPE=ALLVARS
/CASEREPORT DISPLAY=YES MINVIOLATIONS=1 CASELIMIT=FIRSTN(100)
/RULESUMMARIES BYVARIABLE.

Validate Data
Notes
Output Created 09-NOV-2017 21:11:19
Comments
Input Data C:\PROGRA~1\IBM\SPSS\S
TATIS~1\SUBSCR~1\Sampl
es\English\stroke_invalid.sav
Active Dataset DataSet1
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working Data 1183
File
Syntax VALIDATEDATA
VARIABLES=hospsize age
agecat gender active obesity
diabetes bp af smoker choles
angina mi nitro anticlot tia
time doa rankin0 catscan
clotsolv dhosp result surgery
rehab los_rehab
cost rankin1 rankin2
rankin3 barthel1 barthel2
barthel3 recbart1 recbart2
recbart3 ID=hospid patid
physid
/VARCHECKS
STATUS=ON
PCTMISSING=70
PCTEQUAL=95
PCTUNEQUAL=90
CV=0.001 STDDEV=0
/IDCHECKS INCOMPLETE
DUPLICATE
/CASECHECKS
REPORTEMPTY=YES
SCOPE=ALLVARS
/CASEREPORT
DISPLAY=YES
MINVIOLATIONS=1
CASELIMIT=FIRSTN(100)
/RULESUMMARIES
BYVARIABLE.
Resources Processor Time 00:00:00.45
Elapsed Time 00:00:00.45

Warnings
Some or all requested output is not displayed because all cases,
variables, or data values passed the requested checks.
Identifier Checks

Incomplete Identifiers
Identifier
Attending
Case Hospital ID Patient ID physician ID
288 OZN 125304
573 6137798782 790697
774 2322241867 176466

Duplicate Identifiers
Cases with Identifier
Number of Duplicate Attending
Duplicate Identifiers Group Duplicates Identifiers Hospital ID Patient ID physician ID
1 2 10, 11 PBW 1406462419 355184
2 2 14, 15 PBW 2191527525 355184
3 2 21, 22 PBW 7237535360 616528
4 2 28, 29 NHV 4592215163 942982
5 2 30, 31 NHV 7628592330 371884
6 2 64, 65 NHV 0300750006 371884
7 2 83, 84 QWS 4590625286 215041
8 2 86, 87 QWS 6272818258 817329
9 2 96, 97 QWS 1959349605 215041
10 3 100, 101, 102 QWS 5856145337 817329
11 3 104, 105, 106 QWS 1543897849 817329
12 2 122, 123 QWS 9535631975 215041
13 2 144, 145 RLD 0052710039 560175
14 2 151, 152 RLD 5058356558 560175
15 2 156, 157 RLD 7779910241 695521
16 2 164, 165 OZN 2970608839 139142
17 2 168, 169 OZN 0165873576 125304
18 2 184, 185 OZN 9355198642 076538
19 2 186, 187 OZN 7080054743 543537
20 2 193, 194 OZN 4458028382 076538
21 2 195, 196 OZN 0240860711 064705
22 2 204, 205 OZN 9604580145 330764
23 2 215, 216 OZN 7063542003 330764
24 2 223, 224 OZN 3887804774 125304
25 2 226, 227 OZN 1263468815 076538
26 3 247, 248, 249 OZN 2232387897 543537
27 2 250, 251 OZN 1238314265 125304
28 2 265, 266 OZN 8298141345 064705
29 2 267, 268 OZN 9301391334 103254
30 2 273, 274 OZN 1038840465 103254
31 2 278, 279 OZN 0816869249 883285
32 3 283, 284, 285 OZN 6075306060 064705
33 2 293, 294 OZN 7778090176 103254
34 2 297, 298 OZN 9587618445 883285
35 2 316, 317 OZN 4261884692 064705
36 3 318, 319, 320 OZN 0309845691 568887
37 2 331, 332 WPA 7944046807 435959
38 2 340, 341 WPA 8943404305 723384
39 2 393, 394 WPA 5159262749 584261
40 3 402, 403, 404 WPA 7173763350 723384
41 3 416, 417, 418 WPA 8014448130 462020
42 2 427, 428 WPA 7195468517 519548
43 3 433, 434, 435 WPA 2247641363 652070
44 2 438, 439 WPA 3451187548 993921
45 2 443, 444 WPA 0900960756 652070
46 2 453, 454 WPA 7844608495 723384
47 2 468, 469 WPA 7975852705 462020
48 2 499, 500 NSR 5479016810 037350
49 2 501, 502 NSR 5116931652 891605
50 2 533, 534 NSR 2630060495 891605
51 3 550, 551, 552 SLB 1052452463 782916
52 2 556, 557 SLB 0050512328 506108
53 3 562, 563, 564 SLB 6461046805 991211
54 2 571, 572 SLB 3777715659 782916
55 3 585, 586, 587 GFG 1087536215 001448
56 2 594, 595 GFG 8883599310 618069
57 2 596, 597 GFG 3789467505 618069
58 2 606, 607 GFG 9523517563 828754
59 2 609, 610 GFG 4650288430 001448
60 2 612, 613 GFG 1915386172 327422
61 2 617, 618 GFG 2729367693 384787
62 2 621, 622 GFG 7974556228 877354
63 3 626, 627, 628 GFG 4544033081 384787
64 2 629, 630 GFG 3397891610 539412
65 2 635, 636 GFG 5253343659 749432
66 2 646, 647 GFG 3185981133 749432
67 2 675, 676 GFG 0247616654 255663
68 2 677, 678 GFG 1299881501 539412
69 2 683, 684 GFG 8045662694 384787
70 3 689, 690, 691 GFG 9575642563 539412
71 3 697, 698, 699 GFG 4030164769 877354
72 3 708, 709, 710 GFG 6553808330 176466
73 2 723, 724 GFG 5439750367 327422
74 2 734, 735 GFG 4306101109 539412
75 2 740, 741 GFG 4103977868 176466
76 3 743, 744, 745 GFG 4332119819 749432
77 2 753, 754 GFG 7338746805 327422
78 2 755, 756 GFG 8242846541 539412
79 2 768, 769 GFG 4327606527 001448
80 2 776, 777 GFG 0450597917 255663
81 2 787, 788 WWL 0826392081 842368
82 2 793, 794 WWL 8349313732 348920
83 2 796, 797 WWL 4939931258 139809
84 2 820, 821 WWL 0825993537 139809
85 2 823, 824 WWL 8879358705 574275
86 2 832, 833 WWL 4045084280 995409
87 2 852, 853 WWL 5030747136 139809
88 2 860, 861 WWL 6321524756 574275
89 2 878, 879 WWL 7056065019 751568
90 2 896, 897 EFX 2644234053 866015
91 3 912, 913, 914 EFX 4391632997 501318
92 2 925, 926 IZO 8005601131 493939
93 3 948, 949, 950 YYH 0437454972 086215
94 2 954, 955 YYH 7840326167 611470
95 2 956, 957 YYH 9138809953 611470
96 2 963, 964 YYH 3087535807 249058
97 2 975, 976 YYH 9746101913 249058
98 3 993, 994, 995 BLA 5582038692 846629
99 2 998, 999 BLA 7796142141 185703
100 3 1005, 1006, BLA 6044244232 657638
1007
101 3 1020, 1021, BLA 3383946667 846629
1022
102 2 1026, 1027 BLA 3796244473 185703
103 2 1035, 1036 ALK 1275385980 346348
104 2 1042, 1043 ALK 8548603108 346348
105 2 1052, 1053 ALK 0614590399 267830
106 2 1059, 1060 ALK 9581070966 700152
107 2 1072, 1073 ALK 6900285569 346348
108 2 1077, 1078 ALK 4510532534 392622
109 2 1097, 1098 ALK 0213143436 185787
110 2 1103, 1104 ALK 0819507927 392622
111 2 1152, 1153 ALK 0172987095 267830
112 2 1159, 1160 ALK 1840044155 700152
113 2 1168, 1169 ALK 9104785006 840883
114 2 1179, 1180 ALK 1147626734 700152

Single-Variable Rules

Rule Descriptions
Rule Description
Nonnegative integer Type: Numeric
Domain: Range
Flag user-missing values:
No
Flag system-missing
values: Yes
Minimum: 0
Flag unlabeled values
within range: No
Flag noninteger values
within range: Yes
$VD.SRule[5]: Rule
0 to 1 Dichotomy Type: Numeric
Domain: List
Flag user-missing values:
No
Flag system-missing
values: Yes
List: 0, 1
$VD.SRule[1]: Rule
1 to 4 Categorical Type: Numeric
Domain: List
Flag user-missing values:
No
Flag system-missing
values: Yes
List: 1, 2, 3, 4
$VD.SRule[4]: Rule

Rules violated at least once are displayed.

Variable Summary
Number of
Rule Violations
Age category 1 to 4 Categorical 1
Total 1
Gender 0 to 1 Dichotomy 1
Total 1
History of angina 0 to 1 Dichotomy 1
Total 1
Time to hospital Nonnegative integer 2
Total 2
Dead on arrival 0 to 1 Dichotomy 1
Total 1

Case Report
Validation Rule
Violations Identifier
Attending physician
a
Case Single-Variable Hospital ID Patient ID ID
175 0 to 1 Dichotomy (1) OZN 0333204686 883285
274 0 to 1 Dichotomy (1) OZN 1038840465 103254
310 Nonnegative integer OZN 2090290204 883285
(1)
437 0 to 1 Dichotomy (1) WPA 2349729006 723384
752 Nonnegative integer GFG 4993307441 828754
(1)
1173 1 to 4 Categorical (1) ALK 8737661990 185787

a. The number of variables that violated the rule follows each rule.

DELETE VARIABLES Nonnegativeinteger_age @1to4Categorical_agecat


@0to1Dichotomy_gender
@0to1Dichotomy_active @0to1Dichotomy_obesity @0to1Dichotomy_diabetes
@0to2Categorical_bp
@0to1Dichotomy_af @0to1Dichotomy_smoker @0to1Dichotomy_choles
@0to1Dichotomy_angina
@0to1Dichotomy_mi @0to1Dichotomy_nitro @0to3Categorical_anticlot
@0to1Dichotomy_tia
Nonnegativeinteger_time @0to1Dichotomy_doa @0to1Dichotomy_catscan
@0to3Categorical_clotsolv
@0to1Dichotomy_dhosp @1to4Categorical_result @0to2Categorical_surgery
@0to3Categorical_rehab
Nonnegativeinteger_los_rehab Nonnegativenumber_cost
@1to4Categorical_recbart1
@1to4Categorical_recbart2 @1to4Categorical_recbart3.
* (Re)define cross-variable validation rules.
DATAFILE ATTRIBUTE ATTRIBUTE=
$VD.CRule[1]("Label='DiedTwice',OutcomeVar='DiedTwice',Expression='doa
= 1 & dhosp = 1'").
* Delete existing single-variable validation rules.
DATAFILE ATTRIBUTE DELETE=$VD.SRule.
* Delete existing links between variables and rules.
VARIABLE ATTRIBUTE VARIABLES=ALL DELETE=$VD.SRuleRef.
* (Re)define single-variable validation rules.
DATAFILE ATTRIBUTE ATTRIBUTE=
$VD.SRule[1]("Label='0 to 1 Dichotomy', Type='Numeric', Domain='List',
FlagUserMissing='No', "+
"FlagSystemMissing='Yes', FlagBlank='No', CaseSensitive='No',List='0'
'1' ")
$VD.SRule[2]("Label='0 to 2 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='0' "+
"'1' '2' ")
$VD.SRule[3]("Label='0 to 3 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='0' "+
"'1' '2' '3' ")
$VD.SRule[4]("Label='1 to 4 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='1' "+
"'2' '3' '4' ")
$VD.SRule[5]("Label='Nonnegative integer', Type='Numeric',
Domain='Range', Minimum='0', "+
"Maximum='', FlagUserMissing='No', FlagSystemMissing='Yes',
FlagBlank='No', "+
"FlagNoninteger='Yes', FlagUnlabeled='No' ")
$VD.SRule[6]("Label='Nonnegative number', Type='Numeric',
Domain='Range', Minimum='0', "+
"Maximum='', FlagUserMissing='No', FlagSystemMissing='Yes',
FlagBlank='No', "+
"FlagNoninteger='No', FlagUnlabeled='No' ")
$VD.SRule[7]("Label='1 to 3 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='1' "+
"'2' '3' ")
$VD.SRule[8]("Label='0 to 5 Categorical', Type='Numeric',
Domain='List', "+
"FlagUserMissing='No', FlagSystemMissing='Yes', FlagBlank='No',
CaseSensitive='No',List='0' "+
"'1' '2' '3' '4' '5' ")
$VD.SRule[9]("Label='0 to 100 by 5', Type='Numeric', Domain='List',
FlagUserMissing='No', "+
"FlagSystemMissing='Yes', FlagBlank='No', CaseSensitive='No',List='0'
'5' '10' '15' '20' '25' "+
"'30' '35' '40' '45' '50' '55' '60' '65' '70' '75' '80' '85' '90' '95'
'100' ").
* (Re)define links between variables and rules.
VARIABLE ATTRIBUTE
VARIABLES=hospsize ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[7]',OutcomeVar='@1to3Categorical_hospsize'")
/VARIABLES=age ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_age1_A'")
/VARIABLES=agecat ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_agecat1_A'")
/VARIABLES=gender ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_gender1_A'")
/VARIABLES=active ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_active1_A'")
/VARIABLES=obesity ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_obesity1_A'")
/VARIABLES=diabetes ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_diabetes1_A'")
/VARIABLES=bp ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[2]',OutcomeVar='@0to2Categorical_bp1_A'")
/VARIABLES=af ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_af1_A'")
/VARIABLES=smoker ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_smoker1_A'")
/VARIABLES=choles ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_choles1_A'")
/VARIABLES=angina ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_angina1_A'")
/VARIABLES=mi ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_mi1_A'")
/VARIABLES=nitro ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_nitro1_A'")
/VARIABLES=anticlot ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_anticlot1_A'")
/VARIABLES=tia ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_tia1_A'")
/VARIABLES=time ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_time1_A'")
/VARIABLES=doa ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_doa1_A'")
/VARIABLES=rankin0 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[8]',OutcomeVar='@0to5Categorical_rankin0'")
/VARIABLES=catscan ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_catscan1_A'")
/VARIABLES=clotsolv ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_clotsolv1_A'")
/VARIABLES=dhosp ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[1]',OutcomeVar='@0to1Dichotomy_dhosp1_A'")
/VARIABLES=result ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_result1_A'")
/VARIABLES=surgery ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[2]',OutcomeVar='@0to2Categorical_surgery1_A'")
/VARIABLES=rehab ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[3]',OutcomeVar='@0to3Categorical_rehab1_A'")
/VARIABLES=los_rehab ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[5]',OutcomeVar='Nonnegativeinteger_los_rehab1_A'")
/VARIABLES=cost ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[6]',OutcomeVar='Nonnegativenumber_cost1_A'")
/VARIABLES=rankin1 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[8]',OutcomeVar='@0to5Categorical_rankin1'")
/VARIABLES=rankin2 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[8]',OutcomeVar='@0to5Categorical_rankin2'")
/VARIABLES=rankin3 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[8]',OutcomeVar='@0to5Categorical_rankin3'")
/VARIABLES=barthel1 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[9]',OutcomeVar='@0to100by5_barthel1'")
/VARIABLES=barthel2 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[9]',OutcomeVar='@0to100by5_barthel2'")
/VARIABLES=barthel3 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[9]',OutcomeVar='@0to100by5_barthel3'")
/VARIABLES=recbart1 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart11_A'")
/VARIABLES=recbart2 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart21_A'")
/VARIABLES=recbart3 ATTRIBUTE=
$VD.SRuleRef[1]
("Rule='$VD.SRule[4]',OutcomeVar='@1to4Categorical_recbart31_A'").
* 0 to 1 Dichotomy.
DO REPEAT #OV= @0to1Dichotomy_dhosp1_A @0to1Dichotomy_nitro1_A
@0to1Dichotomy_catscan1_A
@0to1Dichotomy_af1_A @0to1Dichotomy_choles1_A @0to1Dichotomy_active1_A
@0to1Dichotomy_smoker1_A
@0to1Dichotomy_doa1_A @0to1Dichotomy_obesity1_A
@0to1Dichotomy_angina1_A @0to1Dichotomy_tia1_A
@0to1Dichotomy_mi1_A @0to1Dichotomy_gender1_A
@0to1Dichotomy_diabetes1_A
/#IV= dhosp nitro catscan af choles active smoker doa obesity angina tia
mi gender diabetes.
COMPUTE #OV=NOT(ANY(VALUE(#IV),0,1) OR (MISSING(#IV) AND
NOT(SYSMIS(#IV)))) OR SYSMIS(#IV).
END REPEAT.
* 0 to 100 by 5.
COMPUTE
@0to100by5_barthel3=NOT(ANY(VALUE(barthel3),0,5,10,15,20,25,30,35,40,45,50
,55,60,65,70,75,
80,85,90,95,100) OR (MISSING(barthel3) AND NOT(SYSMIS(barthel3)))) OR
SYSMIS(barthel3).
COMPUTE
@0to100by5_barthel2=NOT(ANY(VALUE(barthel2),0,5,10,15,20,25,30,35,40,45,50
,55,60,65,70,75,
80,85,90,95,100) OR (MISSING(barthel2) AND NOT(SYSMIS(barthel2)))) OR
SYSMIS(barthel2).
COMPUTE
@0to100by5_barthel1=NOT(ANY(VALUE(barthel1),0,5,10,15,20,25,30,35,40,45,50
,55,60,65,70,75,
80,85,90,95,100) OR (MISSING(barthel1) AND NOT(SYSMIS(barthel1)))) OR
SYSMIS(barthel1).
* 0 to 2 Categorical.
COMPUTE @0to2Categorical_bp1_A=NOT(ANY(VALUE(bp),0,1,2) OR (MISSING(bp)
AND NOT(SYSMIS(bp)))) OR
SYSMIS(bp).
COMPUTE @0to2Categorical_surgery1_A=NOT(ANY(VALUE(surgery),0,1,2) OR
(MISSING(surgery) AND
NOT(SYSMIS(surgery)))) OR SYSMIS(surgery).
* 0 to 3 Categorical.
COMPUTE @0to3Categorical_clotsolv1_A=NOT(ANY(VALUE(clotsolv),0,1,2,3) OR
(MISSING(clotsolv) AND
NOT(SYSMIS(clotsolv)))) OR SYSMIS(clotsolv).
COMPUTE @0to3Categorical_rehab1_A=NOT(ANY(VALUE(rehab),0,1,2,3) OR
(MISSING(rehab) AND
NOT(SYSMIS(rehab)))) OR SYSMIS(rehab).
COMPUTE @0to3Categorical_anticlot1_A=NOT(ANY(VALUE(anticlot),0,1,2,3) OR
(MISSING(anticlot) AND
NOT(SYSMIS(anticlot)))) OR SYSMIS(anticlot).
* 0 to 5 Categorical.
DO REPEAT #OV= @0to5Categorical_rankin2 @0to5Categorical_rankin3
@0to5Categorical_rankin0
@0to5Categorical_rankin1
/#IV= rankin2 rankin3 rankin0 rankin1.
COMPUTE #OV=NOT(ANY(VALUE(#IV),0,1,2,3,4,5) OR (MISSING(#IV) AND
NOT(SYSMIS(#IV)))) OR SYSMIS(#IV).
END REPEAT.
* 1 to 3 Categorical.
COMPUTE @1to3Categorical_hospsize=NOT(ANY(VALUE(hospsize),1,2,3) OR
(MISSING(hospsize) AND
NOT(SYSMIS(hospsize)))) OR SYSMIS(hospsize).
* 1 to 4 Categorical.
DO REPEAT #OV= @1to4Categorical_agecat1_A @1to4Categorical_recbart31_A
@1to4Categorical_recbart21_A
@1to4Categorical_recbart11_A @1to4Categorical_result1_A
/#IV= agecat recbart3 recbart2 recbart1 result.
COMPUTE #OV=NOT(ANY(VALUE(#IV),1,2,3,4) OR (MISSING(#IV) AND
NOT(SYSMIS(#IV)))) OR SYSMIS(#IV).
END REPEAT.
* Nonnegative integer.
COMPUTE Nonnegativeinteger_age1_A=NOT(VALUE(age)>=0 AND
VALUE(age)=TRUNC(VALUE(age)) OR
(MISSING(age) AND NOT(SYSMIS(age)))) OR SYSMIS(age).
COMPUTE Nonnegativeinteger_time1_A=NOT(VALUE(time)>=0 AND
VALUE(time)=TRUNC(VALUE(time)) OR
(MISSING(time) AND NOT(SYSMIS(time)))) OR SYSMIS(time).
COMPUTE Nonnegativeinteger_los_rehab1_A=NOT(VALUE(los_rehab)>=0 AND
VALUE(los_rehab)=TRUNC(VALUE(los_rehab)) OR (MISSING(los_rehab) AND
NOT(SYSMIS(los_rehab)))) OR
SYSMIS(los_rehab).
* Nonnegative number.
COMPUTE Nonnegativenumber_cost1_A=NOT(VALUE(cost)>=0 OR (MISSING(cost) AND
NOT(SYSMIS(cost)))) OR
SYSMIS(cost).
* DiedTwice.
COMPUTE DiedTwice=doa = 1 & dhosp = 1.
* Mark rule outcome variables as such in SPSS Statistics data dictionary.
VARIABLE ATTRIBUTE VARIABLES=@0to1Dichotomy_dhosp1_A TO DiedTwice
ATTRIBUTE=$VD.RuleOutcomeVar("Yes").
VARIABLE LABELS @1to3Categorical_hospsize '1 to 3 Categorical:Hospital
size'.
VARIABLE LABELS Nonnegativeinteger_age1_A 'Nonnegative integer:Age in
years'.
VARIABLE LABELS @1to4Categorical_agecat1_A '1 to 4 Categorical:Age
category'.
VARIABLE LABELS @0to1Dichotomy_gender1_A '0 to 1 Dichotomy:Gender'.
VARIABLE LABELS @0to1Dichotomy_active1_A '0 to 1 Dichotomy:Physically
active'.
VARIABLE LABELS @0to1Dichotomy_obesity1_A '0 to 1 Dichotomy:Obesity'.
VARIABLE LABELS @0to1Dichotomy_diabetes1_A '0 to 1 Dichotomy:History of
diabetes'.
VARIABLE LABELS @0to2Categorical_bp1_A '0 to 2 Categorical:Blood
pressure'.
VARIABLE LABELS @0to1Dichotomy_af1_A '0 to 1 Dichotomy:Atrial
fibrillation'.
VARIABLE LABELS @0to1Dichotomy_smoker1_A '0 to 1 Dichotomy:Smoker'.
VARIABLE LABELS @0to1Dichotomy_choles1_A '0 to 1 Dichotomy:Cholesterol'.
VARIABLE LABELS @0to1Dichotomy_angina1_A '0 to 1 Dichotomy:History of
angina'.
VARIABLE LABELS @0to1Dichotomy_mi1_A '0 to 1 Dichotomy:History of
myocardial infarction'.
VARIABLE LABELS @0to1Dichotomy_nitro1_A '0 to 1 Dichotomy:Prescribed
nitroglycerin'.
VARIABLE LABELS @0to3Categorical_anticlot1_A '0 to 3 Categorical:Taking
anti-clotting drugs'.
VARIABLE LABELS @0to1Dichotomy_tia1_A '0 to 1 Dichotomy:History of
transient ischemic attack'.
VARIABLE LABELS Nonnegativeinteger_time1_A 'Nonnegative integer:Time to
hospital'.
VARIABLE LABELS @0to1Dichotomy_doa1_A '0 to 1 Dichotomy:Dead on arrival'.
VARIABLE LABELS @0to5Categorical_rankin0 '0 to 5 Categorical:Initial
Rankin score'.
VARIABLE LABELS @0to1Dichotomy_catscan1_A '0 to 1 Dichotomy:CAT scan
result'.
VARIABLE LABELS @0to3Categorical_clotsolv1_A '0 to 3 Categorical:Clot-
dissolving drugs'.
VARIABLE LABELS @0to1Dichotomy_dhosp1_A '0 to 1 Dichotomy:Died in
hospital'.
VARIABLE LABELS @1to4Categorical_result1_A '1 to 4 Categorical:Treatment
result'.
VARIABLE LABELS @0to2Categorical_surgery1_A '0 to 2 Categorical:Post-
event preventative surgery'.
VARIABLE LABELS @0to3Categorical_rehab1_A '0 to 3 Categorical:Post-event
rehabilitation'.
VARIABLE LABELS Nonnegativeinteger_los_rehab1_A 'Nonnegative
integer:Length of stay for '+
'rehabilitation'.
VARIABLE LABELS Nonnegativenumber_cost1_A 'Nonnegative number:Total
treatment and '+
'rehabilitation costs in thousands'.
VARIABLE LABELS @0to5Categorical_rankin1 '0 to 5 Categorical:Rankin score
at 1 month'.
VARIABLE LABELS @0to5Categorical_rankin2 '0 to 5 Categorical:Rankin score
at 3 months'.
VARIABLE LABELS @0to5Categorical_rankin3 '0 to 5 Categorical:Rankin score
at 6 months'.
VARIABLE LABELS @0to100by5_barthel1 '0 to 100 by 5:Barthel index at 1
month'.
VARIABLE LABELS @0to100by5_barthel2 '0 to 100 by 5:Barthel index at 3
months'.
VARIABLE LABELS @0to100by5_barthel3 '0 to 100 by 5:Barthel index at 6
months'.
VARIABLE LABELS @1to4Categorical_recbart11_A '1 to 4 Categorical:Recoded
Barthel index at 1 month'.
VARIABLE LABELS @1to4Categorical_recbart21_A '1 to 4 Categorical:Recoded
Barthel index at 3 '+
'months'.
VARIABLE LABELS @1to4Categorical_recbart31_A '1 to 4 Categorical:Recoded
Barthel index at 6 '+
'months'.
VARIABLE LABELS DiedTwice 'DiedTwice:doa = 1 & dhosp = 1'.
VALUE LABELS @0to1Dichotomy_dhosp1_A TO DiedTwice 1 'Invalid' 0 'Valid'.
FORMAT @0to1Dichotomy_dhosp1_A TO DiedTwice (F1.0).
VARIABLE WIDTH @0to1Dichotomy_dhosp1_A TO DiedTwice (4).
VARIABLE LEVEL @0to1Dichotomy_dhosp1_A TO DiedTwice (NOMINAL).
* Validate Data.
VALIDATEDATA VARIABLES=hospsize age agecat gender active obesity diabetes
bp af smoker choles
angina mi nitro anticlot tia time doa rankin0 catscan clotsolv dhosp
result surgery rehab los_rehab
cost rankin1 rankin2 rankin3 barthel1 barthel2 barthel3 recbart1
recbart2 recbart3 ID=hospid patid
physid CROSSVARRULES=$VD.CRule[1]
/VARCHECKS STATUS=ON PCTMISSING=70 PCTEQUAL=95 PCTUNEQUAL=90 CV=0.001
STDDEV=0
/IDCHECKS INCOMPLETE DUPLICATE
/CASECHECKS REPORTEMPTY=YES SCOPE=ALLVARS
/CASEREPORT DISPLAY=YES MINVIOLATIONS=1 CASELIMIT=FIRSTN(100)
/RULESUMMARIES BYVARIABLE.

Validate Data

Notes
Output Created 09-NOV-2017 21:22:21
Comments
Input Data C:\PROGRA~1\IBM\SPSS\S
TATIS~1\SUBSCR~1\Sampl
es\English\stroke_invalid.sav
Active Dataset DataSet1
Filter <none>
Weight <none>
Split File <none>
N of Rows in Working Data 1183
File
Syntax VALIDATEDATA
VARIABLES=hospsize age
agecat gender active obesity
diabetes bp af smoker choles
angina mi nitro anticlot tia
time doa rankin0 catscan
clotsolv dhosp result surgery
rehab los_rehab
cost rankin1 rankin2
rankin3 barthel1 barthel2
barthel3 recbart1 recbart2
recbart3 ID=hospid patid
physid
CROSSVARRULES=$VD.C
Rule[1]
/VARCHECKS
STATUS=ON
PCTMISSING=70
PCTEQUAL=95
PCTUNEQUAL=90
CV=0.001 STDDEV=0
/IDCHECKS INCOMPLETE
DUPLICATE
/CASECHECKS
REPORTEMPTY=YES
SCOPE=ALLVARS
/CASEREPORT
DISPLAY=YES
MINVIOLATIONS=1
CASELIMIT=FIRSTN(100)
/RULESUMMARIES
BYVARIABLE.
Resources Processor Time 00:00:00.62
Elapsed Time 00:00:00.61

Warnings
Some or all requested output is not displayed because all cases,
variables, or data values passed the requested checks.
Identifier Checks

Incomplete Identifiers
Identifier
Attending
Case Hospital ID Patient ID physician ID
288 OZN 125304
573 6137798782 790697
774 2322241867 176466

Duplicate Identifiers
Cases with Identifier
Number of Duplicate Attending
Duplicate Identifiers Group Duplicates Identifiers Hospital ID Patient ID physician ID
1 2 10, 11 PBW 1406462419 355184
2 2 14, 15 PBW 2191527525 355184
3 2 21, 22 PBW 7237535360 616528
4 2 28, 29 NHV 4592215163 942982
5 2 30, 31 NHV 7628592330 371884
6 2 64, 65 NHV 0300750006 371884
7 2 83, 84 QWS 4590625286 215041
8 2 86, 87 QWS 6272818258 817329
9 2 96, 97 QWS 1959349605 215041
10 3 100, 101, 102 QWS 5856145337 817329
11 3 104, 105, 106 QWS 1543897849 817329
12 2 122, 123 QWS 9535631975 215041
13 2 144, 145 RLD 0052710039 560175
14 2 151, 152 RLD 5058356558 560175
15 2 156, 157 RLD 7779910241 695521
16 2 164, 165 OZN 2970608839 139142
17 2 168, 169 OZN 0165873576 125304
18 2 184, 185 OZN 9355198642 076538
19 2 186, 187 OZN 7080054743 543537
20 2 193, 194 OZN 4458028382 076538
21 2 195, 196 OZN 0240860711 064705
22 2 204, 205 OZN 9604580145 330764
23 2 215, 216 OZN 7063542003 330764
24 2 223, 224 OZN 3887804774 125304
25 2 226, 227 OZN 1263468815 076538
26 3 247, 248, 249 OZN 2232387897 543537
27 2 250, 251 OZN 1238314265 125304
28 2 265, 266 OZN 8298141345 064705
29 2 267, 268 OZN 9301391334 103254
30 2 273, 274 OZN 1038840465 103254
31 2 278, 279 OZN 0816869249 883285
32 3 283, 284, 285 OZN 6075306060 064705
33 2 293, 294 OZN 7778090176 103254
34 2 297, 298 OZN 9587618445 883285
35 2 316, 317 OZN 4261884692 064705
36 3 318, 319, 320 OZN 0309845691 568887
37 2 331, 332 WPA 7944046807 435959
38 2 340, 341 WPA 8943404305 723384
39 2 393, 394 WPA 5159262749 584261
40 3 402, 403, 404 WPA 7173763350 723384
41 3 416, 417, 418 WPA 8014448130 462020
42 2 427, 428 WPA 7195468517 519548
43 3 433, 434, 435 WPA 2247641363 652070
44 2 438, 439 WPA 3451187548 993921
45 2 443, 444 WPA 0900960756 652070
46 2 453, 454 WPA 7844608495 723384
47 2 468, 469 WPA 7975852705 462020
48 2 499, 500 NSR 5479016810 037350
49 2 501, 502 NSR 5116931652 891605
50 2 533, 534 NSR 2630060495 891605
51 3 550, 551, 552 SLB 1052452463 782916
52 2 556, 557 SLB 0050512328 506108
53 3 562, 563, 564 SLB 6461046805 991211
54 2 571, 572 SLB 3777715659 782916
55 3 585, 586, 587 GFG 1087536215 001448
56 2 594, 595 GFG 8883599310 618069
57 2 596, 597 GFG 3789467505 618069
58 2 606, 607 GFG 9523517563 828754
59 2 609, 610 GFG 4650288430 001448
60 2 612, 613 GFG 1915386172 327422
61 2 617, 618 GFG 2729367693 384787
62 2 621, 622 GFG 7974556228 877354
63 3 626, 627, 628 GFG 4544033081 384787
64 2 629, 630 GFG 3397891610 539412
65 2 635, 636 GFG 5253343659 749432
66 2 646, 647 GFG 3185981133 749432
67 2 675, 676 GFG 0247616654 255663
68 2 677, 678 GFG 1299881501 539412
69 2 683, 684 GFG 8045662694 384787
70 3 689, 690, 691 GFG 9575642563 539412
71 3 697, 698, 699 GFG 4030164769 877354
72 3 708, 709, 710 GFG 6553808330 176466
73 2 723, 724 GFG 5439750367 327422
74 2 734, 735 GFG 4306101109 539412
75 2 740, 741 GFG 4103977868 176466
76 3 743, 744, 745 GFG 4332119819 749432
77 2 753, 754 GFG 7338746805 327422
78 2 755, 756 GFG 8242846541 539412
79 2 768, 769 GFG 4327606527 001448
80 2 776, 777 GFG 0450597917 255663
81 2 787, 788 WWL 0826392081 842368
82 2 793, 794 WWL 8349313732 348920
83 2 796, 797 WWL 4939931258 139809
84 2 820, 821 WWL 0825993537 139809
85 2 823, 824 WWL 8879358705 574275
86 2 832, 833 WWL 4045084280 995409
87 2 852, 853 WWL 5030747136 139809
88 2 860, 861 WWL 6321524756 574275
89 2 878, 879 WWL 7056065019 751568
90 2 896, 897 EFX 2644234053 866015
91 3 912, 913, 914 EFX 4391632997 501318
92 2 925, 926 IZO 8005601131 493939
93 3 948, 949, 950 YYH 0437454972 086215
94 2 954, 955 YYH 7840326167 611470
95 2 956, 957 YYH 9138809953 611470
96 2 963, 964 YYH 3087535807 249058
97 2 975, 976 YYH 9746101913 249058
98 3 993, 994, 995 BLA 5582038692 846629
99 2 998, 999 BLA 7796142141 185703
100 3 1005, 1006, BLA 6044244232 657638
1007
101 3 1020, 1021, BLA 3383946667 846629
1022
102 2 1026, 1027 BLA 3796244473 185703
103 2 1035, 1036 ALK 1275385980 346348
104 2 1042, 1043 ALK 8548603108 346348
105 2 1052, 1053 ALK 0614590399 267830
106 2 1059, 1060 ALK 9581070966 700152
107 2 1072, 1073 ALK 6900285569 346348
108 2 1077, 1078 ALK 4510532534 392622
109 2 1097, 1098 ALK 0213143436 185787
110 2 1103, 1104 ALK 0819507927 392622
111 2 1152, 1153 ALK 0172987095 267830
112 2 1159, 1160 ALK 1840044155 700152
113 2 1168, 1169 ALK 9104785006 840883
114 2 1179, 1180 ALK 1147626734 700152

Single-Variable Rules

Rule Descriptions
Rule Description
Nonnegative integer Type: Numeric
Domain: Range
Flag user-missing values:
No
Flag system-missing
values: Yes
Minimum: 0
Flag unlabeled values
within range: No
Flag noninteger values
within range: Yes
$VD.SRule[5]: Rule
1 to 4 Categorical Type: Numeric
Domain: List
Flag user-missing values:
No
Flag system-missing
values: Yes
List: 1, 2, 3, 4
$VD.SRule[4]: Rule

Rules violated at least once are displayed.

Variable Summary
Number of
Rule Violations
Age category 1 to 4 Categorical 1
Total 1
Time to hospital Nonnegative integer 2
Total 2

Cross-Variable Rules
Rule Number of Violations Rule Expression
DiedTwice 27 doa = 1 & dhosp = 1
Case Report
Validation Rule Violations Identifier
a
Case Single-Variable Cross-Variable Hospital ID Patient ID
20 DiedTwice PBW 1192970826
49 DiedTwice NHV 8717862852
129 DiedTwice QWS 6901932085
138 DiedTwice RLD 1205005069
162 DiedTwice OZN 5546809538
310 Nonnegative integer OZN 2090290204
(1)
414 DiedTwice WPA 3351107142
447 DiedTwice WPA 7163481282
458 DiedTwice WPA 9159094175
462 DiedTwice WPA 2137520354
537 DiedTwice SLB 5246122506
544 DiedTwice SLB 1605957462
620 DiedTwice GFG 8141858966
629 DiedTwice GFG 3397891610
630 DiedTwice GFG 3397891610
639 DiedTwice GFG 3962622031
644 DiedTwice GFG 4271782383
649 DiedTwice GFG 0950686750
653 DiedTwice GFG 0663642766
722 DiedTwice GFG 0418125590
748 DiedTwice GFG 8744721380
752 Nonnegative integer GFG 4993307441
(1)
868 DiedTwice WWL 9714672452
881 DiedTwice WWL 6613279456
915 DiedTwice EFX 2575793702
933 DiedTwice IZO 2807437472
1010 DiedTwice BLA 5284009939
1028 DiedTwice BLA 8021997463
1054 DiedTwice ALK 0950897644
1173 1 to 4 Categorical (1) ALK 8737661990

Case Report
Identifier
Case Attending physician ID
20 355184
49 237418
129 215041
138 695521
162 125304
310 883285
414 462020
447 519548
458 652070
462 723384
537 928076
544 506108
620 828754
629 539412
630 539412
639 327422
644 749432
649 618069
653 001448
722 877354
748 539412
752 828754
868 237547
881 574275
915 501318
933 680253
1010 657638
1028 185703
1054 267830
1173 185787

a. The number of variables that violated the rule follows each rule.