Professional Documents
Culture Documents
Chapter 11
Applications of DA
DA is especially useful to understand the
differences and factors leading consumers to make
different choices allowing them to develop
marketing strategies which take into proper
account the role of the predictors.
Examples
Determinants of customer loyalty
Shopper profiling and segmentation
Determinants of purchase and non-purchase
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
Discriminant analysis(DA)
Two groups only, thus a single discriminating value
(discriminating score)
For each respondent a score is computed using the
estimated linear combination of the predictors (the
discriminant function)
Respondents with a score above the discriminating value
are expected to belong to one group, those below to the
other group.
When the discriminant score is standardized to have zero
mean and unity variance it is called Z score
DA also provides information about the discriminating
power of each of the original predictors
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
z 0 1 x1 2 x2 3 x3 4 x4
Discriminant score
Predictors
weekly expenditure on chicken
age
safety of butchers chicken
trust in supermarkets
10
Estimation
The first step is the estimation of the
coefficients, also termed as discriminant
coefficients or weights
Estimation is similar to factor analysis or PCA, as
the coefficients are those which maximize the
variability between groups
In MDA the first discriminating function is the one
with the highest between-group variability, the
second discriminating function is independent from
the first and maximizes the remaining betweengroup variability and so on
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
11
3.Select
the
predictors
12
Coefficient estimates
Additional statistics and diagnostics
Fishers and
standardized
estimates of the
discriminant function
coefficients need to
be asked for
13
Classification options
14
Save classification
15
Function
1
In a typical week how
much do you spend
on fresh or frozen
chicken (Euro)?
From the butcher
Supermarkets
Age
(Constant)
.095
.454
-.297
.025
-2.515
Function
1
In a typical week how
much do you spend
on fresh or frozen
chicken (Euro)?
From the butcher
Supermarkets
Age
Unstandardized coefficients
Unstandardized coefficients
depend on the measurement unit
Standardized
coefficients do not
depend on the
measurement unit
.378
.748
-.453
.394
Most important
predictor
Trust in
supermarkets
has a sign
(thus it reduces
the discriminant
score)
16
Centroids
Functions at Group Centroids
Butcher
no
yes
Funct ion
1
-.307
.594
Butcher
no
yes
Total
Prior
.660
.340
1.000
17
Original
Count
Butcher
no
yes
Ungrouped cases
no
yes
Ungrouped cases
Predicted Group
Membership
no
yes
244
33
88
55
1
1
88.1
11.9
61.5
38.5
50.0
50.0
Total
277
143
2
100.0
100.0
100.0
18
Diagnostics (1)
Boxs M test. This tests whether covariances are equal
across groups
Wilks Lambda (or U statistic) tests discrimination between
groups. It is related to analysis of variance.
Individual WilksLambda for each of the predictors in a discriminant
function; univariate ANOVA (are there significant differences in the
predictors means between the groups?), p-value from the F distribution.
Wilks Lambda for the function as a whole. Are there significant
differences in the group means for the discriminant function p-value from
the Chi-square distribution?
19
Diagnostics (2)
DA returns one eigenvalue (or more eigenvalues for
MDA) of the discriminant function.
These can be interpreted as in principal component
analysis
In MDA (more than one discriminant function)
eigenvalues are exploited to compute how each
function contributes to explain variability
The canonical correlation measures the intensity of
the relationship between the groups and the single
discriminant function
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
20
P-value
Box's M statistic
37.3
0.000
0.85
0.000
Expenditure
0.98
0.002
Age
0.97
0.001
0.91
0.000
Trust in Supermarket
0.98
0.002
Eigenvalue
0.18
Canonical correlation
0.39
% of correct predictions
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
71.2%
MDA
To run MDA in SPSS the only
difference is that the range
has more than two
categories
22
Predictors
Tests of Equality of Group Means
Age
Tasty food
Value for money
Animal welfare
Please indicate your
gross annual household
income range
Wilks'
Lambda
.981
.971
.960
.982
F
1.798
2.761
3.878
1.679
.919
8.272
df1
3
3
3
3
df2
282
282
282
282
Sig.
.148
.042
.010
.172
282
.000
Test Results
Box's M
F
Approx.
df1
df2
Sig.
65.212
1.382
45
53286.386
.045
Discriminant functions
Three discriminant functions (four groups minus one) can
be estimated
Eigenvalues
Function
1
2
3
Eigenvalue % of Variance
.102a
61.0
.051a
30.8
.014a
8.2
Cumulative %
61.0
91.8
100.0
Canonical
Correlation
.304
.221
.116
Wilks'
Lambda
.851
.938
.986
Chi-square
45.098
17.904
3.818
df
15
8
3
Sig.
.000
.022
.282
24
Coefficients
Income is
very relevant
for the first
function
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
25
Structure matrix
Structure Matrix
Function
2
1
Please indicate y our
gross annual hous ehold
income range
Animal welfare
Value for money
Tasty food
Age
.929*
.390*
Income
-.010
.241
-.217
-.021
Value
and
taste
-.206
.891*
.660*
-.204
3
.078
.125
.168
Age
.273
.944*
26
Centroids
Functions at Group Centroids
In a typical week, what
type of fresh or frozen
chicken
do you buy for
'Value' chicken
your household's
'Standard' chicken
home consumption?
'Organic' chicken
'Luxury' chicken
1
-.673
.058
.525
.003
Function
2
-.262
.156
-.470
.052
3
-.040
-.065
-.030
.242
28
29
30
Prediction results
Classification Resultsa
Original
Count
Total
41
157
35
53
54
100.0
100.0
100.0
100.0
100.0
31
32
2.
3.
4.
5.
A one-way ANOVA test is run on each of the predictors, where the target
grouping variable determines the treatment levels. The ANOVA test provides
a criterion value and tests statistics (usually the Wilks Lambda). According to
the criterion value, it is possible to identify the predictor which is most
relevant in discriminating between the groups
The predictor with the lowest Wilks Lambda (or which meets an alternative
optimality criterion) enters the discriminating function, provided the p-value
is below the set threshold (for example 5%).
An ANCOVA test is run on the remaining predictors, where the covariates are
the target grouping variables and the predictors that have already entered
the model. The Wilks Lambda is computed for each of the ANCOVA options.
Again, the criteria and the p-value determine which variable (if any) enter
the discriminating function (and possibly whether some of the entered
variables should leave the model).
The procedure goes back to step 3 and continues until none of the excluded
variables have a p-value below the threshold and none of the entered
variables have a p-value above the threshold (the stopping rule is met).
33
Alternative criteria
Unexplained variance
Smallest F ratio
Mahalanobis distance
Raos V
34
In SPSS
35
Wilks'
Lambda
Tolerance
F to Remove
1.000
8.272
1.000
8.241
.960
1.000
3.863
.919
Age
Tasty food
Value for money
Animal welfare
Please indicate your
gross annual household
income range
Age
Tasty food
Value for money
Animal welfare
Age
Tasty food
Animal welfare
Tolerance
1.000
1.000
1.000
1.000
Min.
Tolerance
1.000
1.000
1.000
1.000
F to Enter
1.798
2.761
3.878
1.679
Wilks'
Lambda
.981
.971
.960
.982
1.000
1.000
8.272
.919
.988
.991
1.000
.992
.987
.821
.992
.988
.991
1.000
.992
.987
.821
.992
1.507
2.437
3.863
1.052
1.549
.793
1.057
.905
.896
.883
.909
.868
.875
.873
36