You are on page 1of 2

u09d2 Classification in Discriminant Analysis

Discuss the concept of classification as regards to discriminant analysis. How do the results of
the classification table tell us if we have a good or poor model for our data?

Response Guidelines

Respond to at least one other posting by offering an example that illustrates the point(s) made in
the posting

The concept of classification in discriminant analysis (DA) involves determining into which
group each case is best aligned based upon the highest classification score. The critical question
in classifying data points in DA is: How well did the DA prediction model predict which group
each case belongs? Addressing this essential question of reliability is central to determining how
well a discriminant analysis prediction model fits the data.

The PASW DA procedure engages this question by producing detailed statistical output data
regarding group differences. When the DA prediction model and derived discriminant functions
are finalized, PASW produces a classification table (similar to a contingency table) which
provides summary data information regarding the number of participants who were correctly
classified as members of each group and the amount of misclassifications (Warner, 2008).

The table facilitates a visual comparative analysis of predicted outcomes with actual data to
determine the reliability of the prediction model. When predicted group membership for an
individual case does not correspond to actual group membership a misclassification or
classification error has occurred. Classification errors are primarily Type I and Type II errors and
can be extremely consequential in clinical environments given the ominous potential of
rendering a false diagnosis.

Noteworthy is that under critical medical conditions, Type II errors or a false negative can indeed
be life threatening, therefore a clinician should endeavor to be diligent in avoiding Type II errors.
Type II errors in DA are best avoided by lowering the cutoff boundaries for group classifications
(Warner, 2008). However, a researcher should keep in mind that lowering the cutoff boundaries
also increases the risk of a Type I error.

Precise post hoc comparative analysis between the prediction model and actual group
membership will assist a researcher in demonstrating the reliability of the prediction model and
the utility of the discriminant functions. In terms of accuracy, post hoc evaluation of the
reliability of the DA discriminant functions with actual data is recommended over a priori
estimations which are typically problematic. Moreover, a researcher may choose to perform a
logistic regression analysis on the same data to compare the two procedures, since both produce
statistical output that includes a classification table. There is some speculation by researchers
that under certain data conditions, logistic regression analysis tends to accurately classify a
higher proportion of cases (Warner, 2008).

Anthony Rhodes
General Psychology PhD

References

Warner, R.M. (2008). Applied statistics: From Bivariate Through Multivariate Techniques.
Thousand Oaks, CA: Sage Publications. ISBN: 9780761927723.

You might also like