SlideShare a Scribd company logo
1 of 39
DISCRIMINANT
ANALYSIS
CAN
W
E
ACCURATELY CLASSIFY CONSUM
ERS
FROM
SURVEY RESULTS?
NAMES PRN
AMIT 12020841123
ADVAIT 12020841116
ANANT 12020841119
SANGRAM 12020841120
SHARAD 12020841117
GROUP NO. 4
DISCRIMINANT ANALYSIS
• Discriminant analysis is a statistical procedure which
allows us to classify cases in separate categories to
which they belong on the basis of a set of
characteristic independent variables called predictors
or discriminant variables
• The target variable (the one determining allocation
into groups) is a qualitative (nominal or ordinal) one,
while the characteristics are measured by quantitative
variables.
• DA looks at the discrimination between two groups
• Multiple discriminant analysis (MDA) allows for
classification into three or more groups.
APPLICATIONS OF DA
DA is especially useful to understand the differences and
factors leading consumers to make different choices
allowing them to develop marketing strategies which
take into proper account the role of the predictors.
Examples:
• Determinants of customer loyalty
• Shopper profiling and segmentation
• Determinants of purchase and non-purchase
EXAMPLE ON THE TRUST DATA-SET
• Purchasers of Chicken at the Butcher’s Shop
• Respondents may belong to one of two
groups
• Those who purchase chicken at the butcher’s shop
• Those who do not
• Discrimination between these groups through
a set of consumer characteristics
• Expenditure on chicken in a standard week
• Age of the respondent
• Whether respondents agree (on a seven-point ranking scale) that butchers sell safe chicken
• Trust (on a seven-point ranking scale) towards supermarkets
• Does a linear combination of these four
characteristics allow one to discriminate
between those who buy chicken at the
butcher’s and those who do not?
DISCRIMINANT ANALYSIS (DA)
• Two groups only, thus a single discriminating value
(discriminating score)
• For each respondent a score is computed using the
estimated linear combination of the predictors (the
discriminant function)
• Respondents with a score above the discriminating
value are expected to belong to one group, those below
to the other group.
• When the discriminant score is standardized to have
zero mean and unity variance it is called Z score
• DA also provides information about the discriminating
power of each of the original predictors
MULTIPLE DISCRIMINANT ANALYSIS (MDA) (1)
Discriminant analysis may involve more than
two groups, in which case it is termed
multiple discriminant analysis (MDA).
Example from the Trust data-set
• Dependent variable: Type of chicken purchased ‘in a
typical week’, choosing among four categories: value
(good value for money), standard, organic and luxury
• Predictors: age , stated relevance of taste , value for
money and animal welfare , plus an indicator of
income
MULTIPLE DISCRIMINANT ANALYSIS (2)
• In this case there will be more than one discriminant
function.
• The exact number of discriminant functions is equal to
either (g-1), where g is the number of categories in
classification or to k, the number of independent
variables, whichever is the smaller
• Trust example: four groups and five explanatory
variables, the number of discriminant functions is
three (that is g-1 which is smaller than k=5).
THE OUTPUT OF MDS
Similarities with factor (principal component)
analysis
• the first discriminant function is the most relevant for
discriminating across groups, the second is the second
most relevant, etc.
• the discriminant functions are also independent, which
means that the resulting scores are non-correlated.
• Once the coefficients of the discriminant functions are
estimated and standardized, they are interpreted in a
similar fashion to the factor loadings.
• The larger the standardised coefficients (in absolute
terms), the more relevant the respective variables to
discriminating between groups
There is no single discriminant score in MDA
• group means are computed (centroids) for each of the
discriminant functions to have a clearer view of the
classification rule
RUNNING DISCRIMINANT ANALYSIS
(2 GROUPS)
9
0 1 1 2 2 3 3 4 4z x x x xÎą Îą Îą Îą Îą= + + + +
Discriminant function
(Target variable: purchasers of chicken at the
butcher’s shop)
Discriminant score Predictors
• weekly expenditure on chicken
• age
• safety of butcher’s chicken
• trust in supermarkets
The Îą discriminant coefficients
need to be estimated
FISHER’S LINEAR DISCRIMINANT ANALYSIS
The discriminate function is the starting point.
Two key assumptions behind linear DA
(a) the predictors are normally distributed;
(b) the covariance matrices for the predictors within each of the groups are
equal.
Departure from condition (a) should suggest use of
alternative methods.
Departure from condition (b) requires the use of
different discriminant techniques (usually quadratic
discriminant functions).
In most empirical cases, the use of linear DA is
appropriate.
10
ESTIMATION
The first step is the estimation of the Îą coefficients,
also termed as discriminant coefficients or
weights.
Estimation is similar to factor analysis or PCA, as the
coefficients are those which maximize the
variability between groups
In MDA, the first discriminating function is the one
with the highest between-group variability, the
second discriminating function is independent
from the first and maximizes the remaining
between-group variability and so on
11
SPSS – TWO GROUPS CASE
12
1. Choose the
target variable
2. Define the range of the
dependent variable
3.Select the
predictors
COEFFICIENT ESTIMATES
13
Fisher’s and
standardized estimates
of the discriminant
function coefficients
need to be asked for
Additional statistics and diagnostics
CLASSIFICATION OPTIONS
14
Decide whether prior
probabilities are equal
across groups or group
sizes reflect different
allocation probabilities
These are diagnostic
indicators to evaluate
how well the
discriminant function
predict the groups
SAVE CLASSIFICATION
15
Create new variables in
the data-set,
containing the
predicted group
membership and/or
the discriminant score
for each case and each
function
OUTPUT – COEFFICIENT ESTIMATES
16
Canonical Discriminant Function Coefficients
.095
.454
-.297
.025
-2.515
In a typical week how
much do you spend
on fresh or frozen
chicken (Euro)?
From the butcher
Supermarkets
Age
(Constant)
1
Function
Unstandardized coefficients
Standardized Canonical Discriminant Function Coefficients
.378
.748
-.453
.394
In a typical week how
much do you spend
on fresh or frozen
chicken (Euro)?
From the butcher
Supermarkets
Age
1
Function
Unstandardized coefficients depend
on the measurement unit
Standardized
coefficients do not
depend on the
measurement unit
Most important
predictor
Trust in
supermarkets
has a – sign (thus
it reduces the
discriminant
score)
CENTROIDS
17
Prior Probabilities for Groups
.660 277 277.000
.340 143 143.000
1.000 420 420.000
Butcher
no
yes
Total
Prior Unweighted Weighted
Cases Used in Analysis
Functions at Group Centroids
-.307
.594
Butcher
no
yes
1
Function
Unstandardized canonical discriminant
functions evaluated at group means
These are the means of the
discriminant score for each of the
two groups
Thus, the group of those not
purchasing chicken at the butcher’s
shop have a negative centroid
With two groups, the discriminating score is zero
This can be computed by weighting the centroids with the initial probabilities
From these prior probabilities it follows
that the discriminating score is -0.307 x
0.66 + 0.594 x 0.34 = 0
OUTPUT – CLASSIFICATION SUCCESS
18
Classification Resultsa
244 33 277
88 55 143
1 1 2
88.1 11.9 100.0
61.5 38.5 100.0
50.0 50.0 100.0
Butcher
no
yes
Ungrouped cases
no
yes
Ungrouped cases
Count
%
Original
no yes
Predicted Group
Membership
Total
71.2% of original grouped cases correctly classified.a.
Using the discriminant function, it is possible to correctly classify 71.2% of
original cases (244 no-no + 55 yes-yes)/420
DIAGNOSTICS (1)
• Box’s M test. This tests whether covariances are equal
across groups
• Wilks’ Lambda (or U statistic) tests discrimination
between groups. It is related to analysis of variance.
• Individual Wilks’Lambda for each of the predictors in a discriminant function; univariate ANOVA
(are there significant differences in the predictor’s means between the groups?), p-value from the
F distribution.
• Wilks’ Lambda for the function as a whole. Are there significant differences in the group means for
the discriminant function p-value from the Chi-square distribution?
• The overall Wilks’ Lambda is especially helpful in
multiple discriminant analysis as it allows one to discard
those functions which do not contribute towards
explaining differences between groups.
19
DIAGNOSTICS (2)
DA returns one eigenvalue (or more eigenvalues for
MDA) of the discriminant function.
These can be interpreted as in principal component
analysis
In MDA (more than one discriminant function)
eigenvalues are exploited to compute how each
function contributes to explain variability
The canonical correlation measures the intensity of the
relationship between the groups and the single
discriminant function
20
TRUST EXAMPLE: DIAGNOSTICS
21
Statistic P-value
Box's M statistic 37.3 0.000
Overall Wilks' Lambda 0.85 0.000
Wilks Lambda for
Expenditure 0.98 0.002
Age 0.97 0.001
Safer for Butcher 0.91 0.000
Trust in Supermarket 0.98 0.002
Eigenvalue 0.18
Canonical correlation 0.39
% OF CORRECT
PREDICTIONS
71.2%
Covariance matrices are not
equal
The overall discriminating
power of the DF is good
All of the predictors are
relevant to discriminating
between the two groups
The eigenvalue is the ratio
between variances between
and variance within groups (the
larger the better)
Square root of the ratio between variability
between and total variability
MDA
22
To run MDA in SPSS the only
difference is that the range has
more than two categories
PREDICTORS
23
Test Results
65.212
1.382
45
53286.386
.045
Box's M
Approx.
df1
df2
Sig.
F
Tests null hypothesis of equal population covariance matrices.
Tests of Equality of Group Means
.981 1.798 3 282 .148
.971 2.761 3 282 .042
.960 3.878 3 282 .010
.982 1.679 3 282 .172
.919 8.272 3 282 .000
Age
Tasty food
Value for money
Animal welfare
Please indicate your
gross annual household
income range
Wilks'
Lambda F df1 df2 Sig.
Three predictors only appear to be relevant in
discriminating among preferred types of
chicken
Null rejected at 95% c.l.,
but not at 99% c.l.
DISCRIMINANT FUNCTIONS
24
Eigenvalues
.102a 61.0 61.0 .304
.051a 30.8 91.8 .221
.014a 8.2 100.0 .116
Function
1
2
3
Eigenvalue % of Variance Cumulative %
Canonical
Correlation
First 3 canonical discriminant functions were used in the
analysis.
a.
Three discriminant functions (four groups minus one) can be
estimated
Wilks' Lambda
.851 45.098 15 .000
.938 17.904 8 .022
.986 3.818 3 .282
Test of Function(s)
1 through 3
2 through 3
3
Wilks'
Lambda Chi-square df Sig.
The first two discriminant
functions have a
significant discriminating
power.
COEFFICIENTS
25
Discriminant functions’ coefficients
Unstandardized Standardized
1 2 1 2
Value for money -.043 .603 -.053 .746
Age -.009 -.013 -.148 -.208
Tasty food .169 .416 .152 .374
Animal welfare .186 -.132 .313 -.222
Please indicate your gross
annual household income
range
.652 -.033 .870 -.044
(Constant) -2.298 -4.868
Income is very
relevant for
the first
function
Value for money is
very relevant for the
second function
STRUCTURE MATRIX
26
Structure Matrix
.929* -.021 .078
.390* -.206 .125
-.010 .891* .168
.241 .660* .273
-.217 -.204 .944*
Please indicate your
gross annual household
income range
Animal welfare
Value for money
Tasty food
Age
1 2 3
Function
Pooled within-groups correlations between discriminating
variables and standardized canonical discriminant functions
Variables ordered by absolute size of correlation within function.
Largest absolute correlation between each variable and
any discriminant function
*.
The values in the structure
matrix are the correlations
between the individual
predictors and the scores
computed on the discriminant
functions.
For example, the income
variable has a strong
correlation with the scores of
the first function
The structure matrix help
interpreting the functions
Income
Value and
taste
Age
CENTROIDS
27
Functions at Group Centroids
-.673 -.262 -.040
.058 .156 -.065
.525 -.470 -.030
.003 .052 .242
In a typical week, what
type of fresh or frozen
chicken do you buy for
your household's
home consumption?
'Value' chicken
'Standard' chicken
'Organic' chicken
'Luxury' chicken
1 2 3
Function
Unstandardized canonical discriminant functions evaluated at
group means
The first function discriminates
well between value and
organic (income matters to
organic buyers)
The second allows some discrimination
standard-organic, value-standard,
organic-luxury (taste and value matter)
PLOT OF TWO FUNCTIONS
28
The ‘territorial map’ shows the
scores for the first two functions
considering all groups
Tick ‘separate-groups’ to show
graphs of the first two functions
for each individual group
PLOTS: INDIVIDUAL GROUPS
29
Example: organic chicken
Most cases tend to be relatively
high on function 1 (income)
Example: organic chicken
Most cases tend to be relatively
high on function 1 (income)
PLOTS – ALL GROUPS
30
PREDICTION RESULTS
31
Classification Resultsa
3 38 0 0 41
2 154 1 0 157
1 30 4 0 35
1 51 1 0 53
0 51 3 0 54
7.3 92.7 .0 .0 100.0
1.3 98.1 .6 .0 100.0
2.9 85.7 11.4 .0 100.0
1.9 96.2 1.9 .0 100.0
.0 94.4 5.6 .0 100.0
In a typical week, what
type of fresh or frozen
chicken do you buy for
your household's
home consumption?
'Value' chicken
'Standard' chicken
'Organic' chicken
'Luxury' chicken
Ungrouped cases
'Value' chicken
'Standard' chicken
'Organic' chicken
'Luxury' chicken
Ungrouped cases
Count
%
Original
'Value'
chicken
'Standard'
chicken
'Organic'
chicken
'Luxury'
chicken
Predicted Group Membership
Total
56.3% of original grouped cases correctly classified.a.
The functions do not predict well; most
units are allocated to standard chicken –
on average only 56.3% of the cases are
allocated correctly
STEPWISE DISCRIMINANT ANALYSIS
As for linear regression it is possible to decide whether all predictors
should appear in the equation regardless of their role in
discriminating (the Enter option) or a sub-set of predictors is chosen
on the basis of their contribution to discriminating between groups
(the Stepwise method)
32
THE STEP-WISE METHOD
1. A one-way ANOVA test is run on each of the predictors, where the
target grouping variable determines the treatment levels. The ANOVA
test provides a criterion value and tests statistics (usually the Wilks
Lambda). According to the criterion value, it is possible to identify
the predictor which is most relevant in discriminating between the
groups
2. The predictor with the lowest Wilks Lambda (or which meets an
alternative optimality criterion) enters the discriminating function,
provided the p-value is below the set threshold (for example 5%).
3. An ANCOVA test is run on the remaining predictors, where the
covariates are the target grouping variables and the predictors that
have already entered the model. The Wilks Lambda is computed for
each of the ANCOVA options.
4. Again, the criteria and the p-value determine which variable (if any)
enter the discriminating function (and possibly whether some of the
entered variables should leave the model).
5. The procedure goes back to step 3 and continues until none of the
excluded variables have a p-value below the threshold and none of
the entered variables have a p-value above the threshold (the
stopping rule is met).
33
ALTERNATIVE CRITERIA
Unexplained variance
Smallest F ratio
Mahalanobis distance
Rao’s V
34
IN SPSS
35
The step-wise method allows
selection of relevant
predictors
OUTPUT OF THE STEP-WISE METHOD
36
Variables in the Analysis
1.000 8.272
1.000 8.241 .960
1.000 3.863 .919
Please indicate your
gross annual household
income range
Please indicate your
gross annual household
income range
Value for money
Step
1
2
Tolerance F to Remove
Wilks'
Lambda
Variables Not in the Analysis
1.000 1.000 1.798 .981
1.000 1.000 2.761 .971
1.000 1.000 3.878 .960
1.000 1.000 1.679 .982
1.000 1.000 8.272 .919
.988 .988 1.507 .905
.991 .991 2.437 .896
1.000 1.000 3.863 .883
.992 .992 1.052 .909
.987 .987 1.549 .868
.821 .821 .793 .875
.992 .992 1.057 .873
Age
Tasty food
Value for money
Animal welfare
Please indicate your
gross annual household
income range
Age
Tasty food
Value for money
Animal welfare
Age
Tasty food
Animal welfare
Step
0
1
2
Tolerance
Min.
Tolerance F to Enter
Wilks'
Lambda
Only two predictors are
kept in the model
APPLICATIONS IN MARKETING:
After getting to know the Technical Aspect of this
useful concept,
we can conclude that DA has the following applications
in the field
of Marketing:
• Discriminate analysis, a multivariate technique used
for market segmentation and predicting group
membership is often used for this type of problem
because of its ability to classify individuals or
experimental units into two or more uniquely defined
populations.
• Product research – Distinguish between heavy,
medium, and light users of a product in terms of
their consumption habits and lifestyles.
• Perception/Image research – Distinguish between
customers who exhibit favorable perceptions of a
store or company and those who do not.
• Advertising research – Identify how market segments
differ in media consumption habits.
• Direct marketing – Identify the characteristics of
consumers who will respond to a direct marketing
campaign and those who will not.
THANK
YOU

More Related Content

What's hot

Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysisWansuklangk
 
The chi square test of indep of categorical variables
The chi square test of indep of categorical variablesThe chi square test of indep of categorical variables
The chi square test of indep of categorical variablesRegent University
 
Two way ANOVA
Two way ANOVATwo way ANOVA
Two way ANOVAD-kay Verma
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of varianceRavi Rohilla
 
Discriminant function analysis (DFA)
Discriminant function analysis (DFA)Discriminant function analysis (DFA)
Discriminant function analysis (DFA)benazeer fathima
 
Multidimensional scaling1
Multidimensional scaling1Multidimensional scaling1
Multidimensional scaling1Carlo Magno
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IJames Neill
 
Analysis of variance (ANOVA) everything you need to know
Analysis of variance (ANOVA) everything you need to knowAnalysis of variance (ANOVA) everything you need to know
Analysis of variance (ANOVA) everything you need to knowStat Analytica
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSSLNIPE
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Tesfamichael Getu
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis Dr Athar Khan
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysisRabin BK
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independencejasondroesch
 
Introduction to ANOVAs
Introduction to ANOVAsIntroduction to ANOVAs
Introduction to ANOVAsflorentinoz
 

What's hot (20)

Shovan anova main
Shovan anova mainShovan anova main
Shovan anova main
 
Discriment analysis
Discriment analysisDiscriment analysis
Discriment analysis
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
The chi square test of indep of categorical variables
The chi square test of indep of categorical variablesThe chi square test of indep of categorical variables
The chi square test of indep of categorical variables
 
Two way ANOVA
Two way ANOVATwo way ANOVA
Two way ANOVA
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of variance
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Discriminant function analysis (DFA)
Discriminant function analysis (DFA)Discriminant function analysis (DFA)
Discriminant function analysis (DFA)
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
Multidimensional scaling1
Multidimensional scaling1Multidimensional scaling1
Multidimensional scaling1
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA I
 
Analysis of variance (ANOVA) everything you need to know
Analysis of variance (ANOVA) everything you need to knowAnalysis of variance (ANOVA) everything you need to know
Analysis of variance (ANOVA) everything you need to know
 
Chi square test
Chi square testChi square test
Chi square test
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSS
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
 
Introduction to ANOVAs
Introduction to ANOVAsIntroduction to ANOVAs
Introduction to ANOVAs
 

Similar to Discriminant analysis group no. 4

Discriminant analysis tehnik analisa Statistika Chap11.ppt
Discriminant analysis tehnik analisa Statistika  Chap11.pptDiscriminant analysis tehnik analisa Statistika  Chap11.ppt
Discriminant analysis tehnik analisa Statistika Chap11.pptwisnuragil2
 
Discriminant analysis-Business Intelligence tool
Discriminant analysis-Business Intelligence toolDiscriminant analysis-Business Intelligence tool
Discriminant analysis-Business Intelligence toolRenuLamba8
 
discriminantfunctionanalysisdfa-200926121304.pptx
discriminantfunctionanalysisdfa-200926121304.pptxdiscriminantfunctionanalysisdfa-200926121304.pptx
discriminantfunctionanalysisdfa-200926121304.pptxADVENTUREARASAN
 
discriminantfunctionanalysisdfa-200926121304(1).pptx
discriminantfunctionanalysisdfa-200926121304(1).pptxdiscriminantfunctionanalysisdfa-200926121304(1).pptx
discriminantfunctionanalysisdfa-200926121304(1).pptxADVENTUREARASAN
 
Discriminant analysis using spss
Discriminant analysis using spssDiscriminant analysis using spss
Discriminant analysis using spssDr Nisha Arora
 
T18 discriminant analysis
T18 discriminant analysisT18 discriminant analysis
T18 discriminant analysiskompellark
 
diiscriminant analysis1.pptx
diiscriminant analysis1.pptxdiiscriminant analysis1.pptx
diiscriminant analysis1.pptxSharumathiR1
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysisDr. Ravneet Kaur
 
discriminant analysis.pdf
discriminant analysis.pdfdiscriminant analysis.pdf
discriminant analysis.pdfYashwanth Rm
 
Multiple discriminant analysis
Multiple discriminant analysisMultiple discriminant analysis
Multiple discriminant analysisMUHAMMAD HASRATH
 
Discrimination Analysis and working on SPSS assignment
Discrimination Analysis and working on SPSS assignmentDiscrimination Analysis and working on SPSS assignment
Discrimination Analysis and working on SPSS assignmentbasitktr
 
Discriminant Analysis.pptx
Discriminant Analysis.pptxDiscriminant Analysis.pptx
Discriminant Analysis.pptxGedaSheko
 
Discriminant Analysis in Sports
Discriminant Analysis in SportsDiscriminant Analysis in Sports
Discriminant Analysis in SportsJ P Verma
 
Discriminate analysis
Discriminate analysisDiscriminate analysis
Discriminate analysisRaja Adapa
 
A Statistical Framework for Cluster Health Assessment and Its Application in ...
A Statistical Framework for Cluster Health Assessment and Its Application in ...A Statistical Framework for Cluster Health Assessment and Its Application in ...
A Statistical Framework for Cluster Health Assessment and Its Application in ...Cognizant
 

Similar to Discriminant analysis group no. 4 (20)

Discriminant analysis tehnik analisa Statistika Chap11.ppt
Discriminant analysis tehnik analisa Statistika  Chap11.pptDiscriminant analysis tehnik analisa Statistika  Chap11.ppt
Discriminant analysis tehnik analisa Statistika Chap11.ppt
 
Discriminant analysis-Business Intelligence tool
Discriminant analysis-Business Intelligence toolDiscriminant analysis-Business Intelligence tool
Discriminant analysis-Business Intelligence tool
 
discriminantfunctionanalysisdfa-200926121304.pptx
discriminantfunctionanalysisdfa-200926121304.pptxdiscriminantfunctionanalysisdfa-200926121304.pptx
discriminantfunctionanalysisdfa-200926121304.pptx
 
discriminantfunctionanalysisdfa-200926121304(1).pptx
discriminantfunctionanalysisdfa-200926121304(1).pptxdiscriminantfunctionanalysisdfa-200926121304(1).pptx
discriminantfunctionanalysisdfa-200926121304(1).pptx
 
Discriminant analysis using spss
Discriminant analysis using spssDiscriminant analysis using spss
Discriminant analysis using spss
 
T18 discriminant analysis
T18 discriminant analysisT18 discriminant analysis
T18 discriminant analysis
 
diiscriminant analysis1.pptx
diiscriminant analysis1.pptxdiiscriminant analysis1.pptx
diiscriminant analysis1.pptx
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
Discriminant analysis.pptx
Discriminant analysis.pptxDiscriminant analysis.pptx
Discriminant analysis.pptx
 
DBMS4.ppt
DBMS4.pptDBMS4.ppt
DBMS4.ppt
 
discriminant analysis.pdf
discriminant analysis.pdfdiscriminant analysis.pdf
discriminant analysis.pdf
 
Multiple discriminant analysis
Multiple discriminant analysisMultiple discriminant analysis
Multiple discriminant analysis
 
Discriminant Analysis
Discriminant AnalysisDiscriminant Analysis
Discriminant Analysis
 
Discrimination Analysis and working on SPSS assignment
Discrimination Analysis and working on SPSS assignmentDiscrimination Analysis and working on SPSS assignment
Discrimination Analysis and working on SPSS assignment
 
UNIT IV.pptx
UNIT IV.pptxUNIT IV.pptx
UNIT IV.pptx
 
Discriminant Analysis.pptx
Discriminant Analysis.pptxDiscriminant Analysis.pptx
Discriminant Analysis.pptx
 
Malhotra18
Malhotra18Malhotra18
Malhotra18
 
Discriminant Analysis in Sports
Discriminant Analysis in SportsDiscriminant Analysis in Sports
Discriminant Analysis in Sports
 
Discriminate analysis
Discriminate analysisDiscriminate analysis
Discriminate analysis
 
A Statistical Framework for Cluster Health Assessment and Its Application in ...
A Statistical Framework for Cluster Health Assessment and Its Application in ...A Statistical Framework for Cluster Health Assessment and Its Application in ...
A Statistical Framework for Cluster Health Assessment and Its Application in ...
 

More from Advait Bhobe

B2 b group 6
B2 b group 6B2 b group 6
B2 b group 6Advait Bhobe
 
Im final (2)
Im final (2)Im final (2)
Im final (2)Advait Bhobe
 
Advait bhobe prn 12020841116
Advait bhobe prn 12020841116Advait bhobe prn 12020841116
Advait bhobe prn 12020841116Advait Bhobe
 
Jubilant foodworks group7_marketing
Jubilant foodworks group7_marketingJubilant foodworks group7_marketing
Jubilant foodworks group7_marketingAdvait Bhobe
 
Intel group7 marketing
Intel group7 marketingIntel group7 marketing
Intel group7 marketingAdvait Bhobe
 
Radio advertising
Radio advertisingRadio advertising
Radio advertisingAdvait Bhobe
 
Case study group 6 retailing in india-the impact of hypermarkets
Case study group 6 retailing in india-the impact of hypermarketsCase study group 6 retailing in india-the impact of hypermarkets
Case study group 6 retailing in india-the impact of hypermarketsAdvait Bhobe
 
Food retail group 6 biscuit industry
Food retail group 6 biscuit industryFood retail group 6 biscuit industry
Food retail group 6 biscuit industryAdvait Bhobe
 
It industry tcs group 7_ssm assignment
It industry tcs group 7_ssm assignmentIt industry tcs group 7_ssm assignment
It industry tcs group 7_ssm assignmentAdvait Bhobe
 
Current indian automobile insurance industry
Current indian automobile insurance industryCurrent indian automobile insurance industry
Current indian automobile insurance industryAdvait Bhobe
 
Case presentation marico limited
Case presentation  marico limitedCase presentation  marico limited
Case presentation marico limitedAdvait Bhobe
 
Industry sector lifestyle group-3
Industry sector lifestyle group-3Industry sector lifestyle group-3
Industry sector lifestyle group-3Advait Bhobe
 
Section b group8 (1)
Section b group8 (1)Section b group8 (1)
Section b group8 (1)Advait Bhobe
 
Group 3 section b
Group 3 section bGroup 3 section b
Group 3 section bAdvait Bhobe
 
Section b group 3 1
Section b group 3    1Section b group 3    1
Section b group 3 1Advait Bhobe
 
Hrm as a source of shareholder value
Hrm as a source of shareholder valueHrm as a source of shareholder value
Hrm as a source of shareholder valueAdvait Bhobe
 
Section bgroup8
Section bgroup8Section bgroup8
Section bgroup8Advait Bhobe
 

More from Advait Bhobe (20)

B2 b group 6
B2 b group 6B2 b group 6
B2 b group 6
 
Im final (2)
Im final (2)Im final (2)
Im final (2)
 
Crm6
Crm6Crm6
Crm6
 
Advait bhobe prn 12020841116
Advait bhobe prn 12020841116Advait bhobe prn 12020841116
Advait bhobe prn 12020841116
 
Jubilant foodworks group7_marketing
Jubilant foodworks group7_marketingJubilant foodworks group7_marketing
Jubilant foodworks group7_marketing
 
Intel group7 marketing
Intel group7 marketingIntel group7 marketing
Intel group7 marketing
 
Radio advertising
Radio advertisingRadio advertising
Radio advertising
 
Case study group 6 retailing in india-the impact of hypermarkets
Case study group 6 retailing in india-the impact of hypermarketsCase study group 6 retailing in india-the impact of hypermarkets
Case study group 6 retailing in india-the impact of hypermarkets
 
Food retail group 6 biscuit industry
Food retail group 6 biscuit industryFood retail group 6 biscuit industry
Food retail group 6 biscuit industry
 
It industry tcs group 7_ssm assignment
It industry tcs group 7_ssm assignmentIt industry tcs group 7_ssm assignment
It industry tcs group 7_ssm assignment
 
Current indian automobile insurance industry
Current indian automobile insurance industryCurrent indian automobile insurance industry
Current indian automobile insurance industry
 
Sm final
Sm finalSm final
Sm final
 
Case presentation marico limited
Case presentation  marico limitedCase presentation  marico limited
Case presentation marico limited
 
Industry sector lifestyle group-3
Industry sector lifestyle group-3Industry sector lifestyle group-3
Industry sector lifestyle group-3
 
S&m (1)
S&m (1)S&m (1)
S&m (1)
 
Section b group8 (1)
Section b group8 (1)Section b group8 (1)
Section b group8 (1)
 
Group 3 section b
Group 3 section bGroup 3 section b
Group 3 section b
 
Section b group 3 1
Section b group 3    1Section b group 3    1
Section b group 3 1
 
Hrm as a source of shareholder value
Hrm as a source of shareholder valueHrm as a source of shareholder value
Hrm as a source of shareholder value
 
Section bgroup8
Section bgroup8Section bgroup8
Section bgroup8
 

Recently uploaded

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Discriminant analysis group no. 4

  • 1. DISCRIMINANT ANALYSIS CAN W E ACCURATELY CLASSIFY CONSUM ERS FROM SURVEY RESULTS? NAMES PRN AMIT 12020841123 ADVAIT 12020841116 ANANT 12020841119 SANGRAM 12020841120 SHARAD 12020841117 GROUP NO. 4
  • 2. DISCRIMINANT ANALYSIS • Discriminant analysis is a statistical procedure which allows us to classify cases in separate categories to which they belong on the basis of a set of characteristic independent variables called predictors or discriminant variables • The target variable (the one determining allocation into groups) is a qualitative (nominal or ordinal) one, while the characteristics are measured by quantitative variables. • DA looks at the discrimination between two groups • Multiple discriminant analysis (MDA) allows for classification into three or more groups.
  • 3. APPLICATIONS OF DA DA is especially useful to understand the differences and factors leading consumers to make different choices allowing them to develop marketing strategies which take into proper account the role of the predictors. Examples: • Determinants of customer loyalty • Shopper profiling and segmentation • Determinants of purchase and non-purchase
  • 4. EXAMPLE ON THE TRUST DATA-SET • Purchasers of Chicken at the Butcher’s Shop • Respondents may belong to one of two groups • Those who purchase chicken at the butcher’s shop • Those who do not • Discrimination between these groups through a set of consumer characteristics • Expenditure on chicken in a standard week • Age of the respondent • Whether respondents agree (on a seven-point ranking scale) that butchers sell safe chicken • Trust (on a seven-point ranking scale) towards supermarkets • Does a linear combination of these four characteristics allow one to discriminate between those who buy chicken at the butcher’s and those who do not?
  • 5. DISCRIMINANT ANALYSIS (DA) • Two groups only, thus a single discriminating value (discriminating score) • For each respondent a score is computed using the estimated linear combination of the predictors (the discriminant function) • Respondents with a score above the discriminating value are expected to belong to one group, those below to the other group. • When the discriminant score is standardized to have zero mean and unity variance it is called Z score • DA also provides information about the discriminating power of each of the original predictors
  • 6. MULTIPLE DISCRIMINANT ANALYSIS (MDA) (1) Discriminant analysis may involve more than two groups, in which case it is termed multiple discriminant analysis (MDA). Example from the Trust data-set • Dependent variable: Type of chicken purchased ‘in a typical week’, choosing among four categories: value (good value for money), standard, organic and luxury • Predictors: age , stated relevance of taste , value for money and animal welfare , plus an indicator of income
  • 7. MULTIPLE DISCRIMINANT ANALYSIS (2) • In this case there will be more than one discriminant function. • The exact number of discriminant functions is equal to either (g-1), where g is the number of categories in classification or to k, the number of independent variables, whichever is the smaller • Trust example: four groups and five explanatory variables, the number of discriminant functions is three (that is g-1 which is smaller than k=5).
  • 8. THE OUTPUT OF MDS Similarities with factor (principal component) analysis • the first discriminant function is the most relevant for discriminating across groups, the second is the second most relevant, etc. • the discriminant functions are also independent, which means that the resulting scores are non-correlated. • Once the coefficients of the discriminant functions are estimated and standardized, they are interpreted in a similar fashion to the factor loadings. • The larger the standardised coefficients (in absolute terms), the more relevant the respective variables to discriminating between groups There is no single discriminant score in MDA • group means are computed (centroids) for each of the discriminant functions to have a clearer view of the classification rule
  • 9. RUNNING DISCRIMINANT ANALYSIS (2 GROUPS) 9 0 1 1 2 2 3 3 4 4z x x x xÎą Îą Îą Îą Îą= + + + + Discriminant function (Target variable: purchasers of chicken at the butcher’s shop) Discriminant score Predictors • weekly expenditure on chicken • age • safety of butcher’s chicken • trust in supermarkets The Îą discriminant coefficients need to be estimated
  • 10. FISHER’S LINEAR DISCRIMINANT ANALYSIS The discriminate function is the starting point. Two key assumptions behind linear DA (a) the predictors are normally distributed; (b) the covariance matrices for the predictors within each of the groups are equal. Departure from condition (a) should suggest use of alternative methods. Departure from condition (b) requires the use of different discriminant techniques (usually quadratic discriminant functions). In most empirical cases, the use of linear DA is appropriate. 10
  • 11. ESTIMATION The first step is the estimation of the Îą coefficients, also termed as discriminant coefficients or weights. Estimation is similar to factor analysis or PCA, as the coefficients are those which maximize the variability between groups In MDA, the first discriminating function is the one with the highest between-group variability, the second discriminating function is independent from the first and maximizes the remaining between-group variability and so on 11
  • 12. SPSS – TWO GROUPS CASE 12 1. Choose the target variable 2. Define the range of the dependent variable 3.Select the predictors
  • 13. COEFFICIENT ESTIMATES 13 Fisher’s and standardized estimates of the discriminant function coefficients need to be asked for Additional statistics and diagnostics
  • 14. CLASSIFICATION OPTIONS 14 Decide whether prior probabilities are equal across groups or group sizes reflect different allocation probabilities These are diagnostic indicators to evaluate how well the discriminant function predict the groups
  • 15. SAVE CLASSIFICATION 15 Create new variables in the data-set, containing the predicted group membership and/or the discriminant score for each case and each function
  • 16. OUTPUT – COEFFICIENT ESTIMATES 16 Canonical Discriminant Function Coefficients .095 .454 -.297 .025 -2.515 In a typical week how much do you spend on fresh or frozen chicken (Euro)? From the butcher Supermarkets Age (Constant) 1 Function Unstandardized coefficients Standardized Canonical Discriminant Function Coefficients .378 .748 -.453 .394 In a typical week how much do you spend on fresh or frozen chicken (Euro)? From the butcher Supermarkets Age 1 Function Unstandardized coefficients depend on the measurement unit Standardized coefficients do not depend on the measurement unit Most important predictor Trust in supermarkets has a – sign (thus it reduces the discriminant score)
  • 17. CENTROIDS 17 Prior Probabilities for Groups .660 277 277.000 .340 143 143.000 1.000 420 420.000 Butcher no yes Total Prior Unweighted Weighted Cases Used in Analysis Functions at Group Centroids -.307 .594 Butcher no yes 1 Function Unstandardized canonical discriminant functions evaluated at group means These are the means of the discriminant score for each of the two groups Thus, the group of those not purchasing chicken at the butcher’s shop have a negative centroid With two groups, the discriminating score is zero This can be computed by weighting the centroids with the initial probabilities From these prior probabilities it follows that the discriminating score is -0.307 x 0.66 + 0.594 x 0.34 = 0
  • 18. OUTPUT – CLASSIFICATION SUCCESS 18 Classification Resultsa 244 33 277 88 55 143 1 1 2 88.1 11.9 100.0 61.5 38.5 100.0 50.0 50.0 100.0 Butcher no yes Ungrouped cases no yes Ungrouped cases Count % Original no yes Predicted Group Membership Total 71.2% of original grouped cases correctly classified.a. Using the discriminant function, it is possible to correctly classify 71.2% of original cases (244 no-no + 55 yes-yes)/420
  • 19. DIAGNOSTICS (1) • Box’s M test. This tests whether covariances are equal across groups • Wilks’ Lambda (or U statistic) tests discrimination between groups. It is related to analysis of variance. • Individual Wilks’Lambda for each of the predictors in a discriminant function; univariate ANOVA (are there significant differences in the predictor’s means between the groups?), p-value from the F distribution. • Wilks’ Lambda for the function as a whole. Are there significant differences in the group means for the discriminant function p-value from the Chi-square distribution? • The overall Wilks’ Lambda is especially helpful in multiple discriminant analysis as it allows one to discard those functions which do not contribute towards explaining differences between groups. 19
  • 20. DIAGNOSTICS (2) DA returns one eigenvalue (or more eigenvalues for MDA) of the discriminant function. These can be interpreted as in principal component analysis In MDA (more than one discriminant function) eigenvalues are exploited to compute how each function contributes to explain variability The canonical correlation measures the intensity of the relationship between the groups and the single discriminant function 20
  • 21. TRUST EXAMPLE: DIAGNOSTICS 21 Statistic P-value Box's M statistic 37.3 0.000 Overall Wilks' Lambda 0.85 0.000 Wilks Lambda for Expenditure 0.98 0.002 Age 0.97 0.001 Safer for Butcher 0.91 0.000 Trust in Supermarket 0.98 0.002 Eigenvalue 0.18 Canonical correlation 0.39 % OF CORRECT PREDICTIONS 71.2% Covariance matrices are not equal The overall discriminating power of the DF is good All of the predictors are relevant to discriminating between the two groups The eigenvalue is the ratio between variances between and variance within groups (the larger the better) Square root of the ratio between variability between and total variability
  • 22. MDA 22 To run MDA in SPSS the only difference is that the range has more than two categories
  • 23. PREDICTORS 23 Test Results 65.212 1.382 45 53286.386 .045 Box's M Approx. df1 df2 Sig. F Tests null hypothesis of equal population covariance matrices. Tests of Equality of Group Means .981 1.798 3 282 .148 .971 2.761 3 282 .042 .960 3.878 3 282 .010 .982 1.679 3 282 .172 .919 8.272 3 282 .000 Age Tasty food Value for money Animal welfare Please indicate your gross annual household income range Wilks' Lambda F df1 df2 Sig. Three predictors only appear to be relevant in discriminating among preferred types of chicken Null rejected at 95% c.l., but not at 99% c.l.
  • 24. DISCRIMINANT FUNCTIONS 24 Eigenvalues .102a 61.0 61.0 .304 .051a 30.8 91.8 .221 .014a 8.2 100.0 .116 Function 1 2 3 Eigenvalue % of Variance Cumulative % Canonical Correlation First 3 canonical discriminant functions were used in the analysis. a. Three discriminant functions (four groups minus one) can be estimated Wilks' Lambda .851 45.098 15 .000 .938 17.904 8 .022 .986 3.818 3 .282 Test of Function(s) 1 through 3 2 through 3 3 Wilks' Lambda Chi-square df Sig. The first two discriminant functions have a significant discriminating power.
  • 25. COEFFICIENTS 25 Discriminant functions’ coefficients Unstandardized Standardized 1 2 1 2 Value for money -.043 .603 -.053 .746 Age -.009 -.013 -.148 -.208 Tasty food .169 .416 .152 .374 Animal welfare .186 -.132 .313 -.222 Please indicate your gross annual household income range .652 -.033 .870 -.044 (Constant) -2.298 -4.868 Income is very relevant for the first function Value for money is very relevant for the second function
  • 26. STRUCTURE MATRIX 26 Structure Matrix .929* -.021 .078 .390* -.206 .125 -.010 .891* .168 .241 .660* .273 -.217 -.204 .944* Please indicate your gross annual household income range Animal welfare Value for money Tasty food Age 1 2 3 Function Pooled within-groups correlations between discriminating variables and standardized canonical discriminant functions Variables ordered by absolute size of correlation within function. Largest absolute correlation between each variable and any discriminant function *. The values in the structure matrix are the correlations between the individual predictors and the scores computed on the discriminant functions. For example, the income variable has a strong correlation with the scores of the first function The structure matrix help interpreting the functions Income Value and taste Age
  • 27. CENTROIDS 27 Functions at Group Centroids -.673 -.262 -.040 .058 .156 -.065 .525 -.470 -.030 .003 .052 .242 In a typical week, what type of fresh or frozen chicken do you buy for your household's home consumption? 'Value' chicken 'Standard' chicken 'Organic' chicken 'Luxury' chicken 1 2 3 Function Unstandardized canonical discriminant functions evaluated at group means The first function discriminates well between value and organic (income matters to organic buyers) The second allows some discrimination standard-organic, value-standard, organic-luxury (taste and value matter)
  • 28. PLOT OF TWO FUNCTIONS 28 The ‘territorial map’ shows the scores for the first two functions considering all groups Tick ‘separate-groups’ to show graphs of the first two functions for each individual group
  • 29. PLOTS: INDIVIDUAL GROUPS 29 Example: organic chicken Most cases tend to be relatively high on function 1 (income) Example: organic chicken Most cases tend to be relatively high on function 1 (income)
  • 30. PLOTS – ALL GROUPS 30
  • 31. PREDICTION RESULTS 31 Classification Resultsa 3 38 0 0 41 2 154 1 0 157 1 30 4 0 35 1 51 1 0 53 0 51 3 0 54 7.3 92.7 .0 .0 100.0 1.3 98.1 .6 .0 100.0 2.9 85.7 11.4 .0 100.0 1.9 96.2 1.9 .0 100.0 .0 94.4 5.6 .0 100.0 In a typical week, what type of fresh or frozen chicken do you buy for your household's home consumption? 'Value' chicken 'Standard' chicken 'Organic' chicken 'Luxury' chicken Ungrouped cases 'Value' chicken 'Standard' chicken 'Organic' chicken 'Luxury' chicken Ungrouped cases Count % Original 'Value' chicken 'Standard' chicken 'Organic' chicken 'Luxury' chicken Predicted Group Membership Total 56.3% of original grouped cases correctly classified.a. The functions do not predict well; most units are allocated to standard chicken – on average only 56.3% of the cases are allocated correctly
  • 32. STEPWISE DISCRIMINANT ANALYSIS As for linear regression it is possible to decide whether all predictors should appear in the equation regardless of their role in discriminating (the Enter option) or a sub-set of predictors is chosen on the basis of their contribution to discriminating between groups (the Stepwise method) 32
  • 33. THE STEP-WISE METHOD 1. A one-way ANOVA test is run on each of the predictors, where the target grouping variable determines the treatment levels. The ANOVA test provides a criterion value and tests statistics (usually the Wilks Lambda). According to the criterion value, it is possible to identify the predictor which is most relevant in discriminating between the groups 2. The predictor with the lowest Wilks Lambda (or which meets an alternative optimality criterion) enters the discriminating function, provided the p-value is below the set threshold (for example 5%). 3. An ANCOVA test is run on the remaining predictors, where the covariates are the target grouping variables and the predictors that have already entered the model. The Wilks Lambda is computed for each of the ANCOVA options. 4. Again, the criteria and the p-value determine which variable (if any) enter the discriminating function (and possibly whether some of the entered variables should leave the model). 5. The procedure goes back to step 3 and continues until none of the excluded variables have a p-value below the threshold and none of the entered variables have a p-value above the threshold (the stopping rule is met). 33
  • 34. ALTERNATIVE CRITERIA Unexplained variance Smallest F ratio Mahalanobis distance Rao’s V 34
  • 35. IN SPSS 35 The step-wise method allows selection of relevant predictors
  • 36. OUTPUT OF THE STEP-WISE METHOD 36 Variables in the Analysis 1.000 8.272 1.000 8.241 .960 1.000 3.863 .919 Please indicate your gross annual household income range Please indicate your gross annual household income range Value for money Step 1 2 Tolerance F to Remove Wilks' Lambda Variables Not in the Analysis 1.000 1.000 1.798 .981 1.000 1.000 2.761 .971 1.000 1.000 3.878 .960 1.000 1.000 1.679 .982 1.000 1.000 8.272 .919 .988 .988 1.507 .905 .991 .991 2.437 .896 1.000 1.000 3.863 .883 .992 .992 1.052 .909 .987 .987 1.549 .868 .821 .821 .793 .875 .992 .992 1.057 .873 Age Tasty food Value for money Animal welfare Please indicate your gross annual household income range Age Tasty food Value for money Animal welfare Age Tasty food Animal welfare Step 0 1 2 Tolerance Min. Tolerance F to Enter Wilks' Lambda Only two predictors are kept in the model
  • 37. APPLICATIONS IN MARKETING: After getting to know the Technical Aspect of this useful concept, we can conclude that DA has the following applications in the field of Marketing: • Discriminate analysis, a multivariate technique used for market segmentation and predicting group membership is often used for this type of problem because of its ability to classify individuals or experimental units into two or more uniquely defined populations.
  • 38. • Product research – Distinguish between heavy, medium, and light users of a product in terms of their consumption habits and lifestyles. • Perception/Image research – Distinguish between customers who exhibit favorable perceptions of a store or company and those who do not. • Advertising research – Identify how market segments differ in media consumption habits. • Direct marketing – Identify the characteristics of consumers who will respond to a direct marketing campaign and those who will not.