Michael Conerly,
Winston Choi,
Michael Hardin
University of Alabama
Outline of Presentation
Goodness of Fit (GOF) Procedures
Why are these necessary?
Review of Hosmer and Lemeshow
Adapting H-L to data mining
Selection of groups for GOF
Application of New test to real data
What if test fails?
Group   Observed   Probability   Expected
1       Obs1       π1            n×π1
2       Obs2       π2            n×π2
3       Obs3       π3            n×π3
…
k       Obsk       πk            n×πk
Total   n          1.0           n
Chi-square statistic
The statistic
$$\chi^2 = \sum_{i=1}^{k} \frac{(\mathrm{Obs}_i - \mathrm{Exp}_i)^2}{n\pi_i(1 - \pi_i)}$$
is approximately Chi-square with k-2
degrees of freedom.
Large values indicate little or no
agreement between observed and
predicted counts
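As a rough illustration of the statistic on this slide, the following Python sketch computes it from group counts and probabilities. The function name `gof_statistic` and the three-group data are hypothetical and chosen only for illustration. The sketch takes Exp_i = n×π_i, as in the table of observed and expected counts, and uses the denominator nπ_i(1 − π_i) exactly as the statistic is written here.

```python
def gof_statistic(observed, probs, n):
    """Return (chi-square statistic, degrees of freedom) for k groups.

    observed: observed count per group (Obs_i)
    probs:    probability per group (pi_i), each strictly between 0 and 1
    n:        total number of observations
    """
    if len(observed) != len(probs):
        raise ValueError("observed and probs must have the same length")

    stat = 0.0
    for obs, p in zip(observed, probs):
        expected = n * p  # Exp_i = n * pi_i
        stat += (obs - expected) ** 2 / (n * p * (1 - p))

    df = len(observed) - 2  # k - 2 degrees of freedom
    return stat, df


# Hypothetical data: 3 groups, n = 100 (not from the presentation)
stat, df = gof_statistic([30, 50, 20], [0.25, 0.50, 0.25], 100)
print(round(stat, 4), df)
```

For these made-up counts the expected values are 25, 50 and 25, so only the first and third groups contribute to the sum. A larger result would point to weaker agreement between observed and predicted counts.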
Examples
To be added later
Questions?