Professional Documents
Culture Documents
Cox Regression
[2]
What if you want to know for how long the customers will remain
with your telecommunications company?
What if you want to know when your customers will cancel their
credit card?
What if you want to know when students are likely to leave the
University?
What if you want to know when students are likely to be employed?
Cox Regression
[2]
Survival Rate:
Chances that the subject will stay at time t
Hazard Rate:
Risk of failure
Time-dependent
Probability of the event happening at time t given that the
individual is at risk at time t
Cox Regression
[2]
Censored Data
These are samples that cannot be tracked anymore.
Ex. We do not know if they found a job.
We do not know if they survived their battle against cancer.
Why?
They did not reply to the survey anymore.
They could no longer be contacted.
Do you still use this data?
Yes it is still data while they are still in the study.
Survival Analysis
[2]
[2]
n.d.(2010). Predictive Modeling with IBM SPSS Modeler Student Guide. IBM Corporation Inc.
Censored Data
Cancer
Death
Censored
Explanation
Event:
Censored:Subjects who survived until the end of the study but we do not kn
what
happened after the studyc
Subjects who dropped out from the study while the study was on
going
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Time
Censored if 0
Age
Time
Censored if 0
Age
59
72
744
50
115
74
769
59
156
66
770
57
268
74
803
39
329
43
855
43
353
63
1040
38
365
64
1106
44
377
58
1129
53
421
53
1206
44
431
50
1227
59
448
56
464
56
475
59
477
64
563
55
638
56
Notations:
Hazard Function:
tj time
nj number at risk
dj number of events
Sj number of censored
j hazard function
(tj ) cumulative hazard function
S(tj ) Survival function
Recall:
nj number at risk
dj number of events
Recall:
Hazard Function
Survival Function
Exponential
Weibull
Gompertz
Log-logistic
[2]
19
[2]
[2]
n.d.(2010). Predictive Modeling with IBM SPSS Modeler Student Guide. IBM Corporation Inc.
21
Where:
= hazard function
t = time
= covariates
x = predictors
22
Interpreting Coefficients
If Coefficient is positive,
Lower duration, higher hazard rates (more likely to
happen)
As an independent variable increases, time-toevent decreases (the sooner the event will happen)
If Coefficient is negative,
Higher duration, lower hazard rate (less likely to
happen)
As an independent variable increases, time-toevent increases
23
QUESTIONS?
24