Professional Documents
Culture Documents
Kelly Page
Cardiff Business School
E: pagekl@cardiff.ac.uk
T: @drkellypage
T: @caseinsights
FB: kelly@caseinsights.com
Lecture Objectives
Get an overview of the data analysis procedure;
Develop an understanding of the importance and nature
of quality control checks;
Understand the data entry process and data entry
alternatives;
Learn how data are tabulated;
Learn how to set up and interpret cross tabulations;
Comprehend the basic techniques of statistical analysis.
(cc) Kelly Page
Validation &
Editing
Coding
Data
Entry
Machine
Cleaning
of Data
Tabulation &
Statistical
Analysis
Examples
Account Balance: a variable describing how much money you have in the bank
203.45 The Value of your Account Balance at 11.32 a.m. on Friday 13th
February
Broadcast Media a variable which denotes the type media channel that
someone owns
Sony TV the specific value according to which set they own
Customer Satisfaction (CS) a variable which denotes how satisfied a
customers experience with X was
Positive = the values of positive or negative, high or low could be the value of
customer satisfaction and is dependent on how we measured it
(cc) Kelly Page
Considerations
Research objectives
Type of data (e.g., Nominal, Ordinal, Interval,
Ratio)
Sample size (e.g., min=100)
Sampling method (e.g., non-probability)
Graphical Presentation
How to visually display the descriptive profile of the data
88
66
44
44
33
Grand
GrandTotal
Total
22
00
No
No
Yes
Yes
Female
Female
8
Multi-variate cross-tabulation:
Additional filtering criteria - Veteran
Status - Now filtering three items.
Race/Ethnicity
(All)
Are You a Veteran?
Yes
You Liked the Chamber's Services (All)
Count of Respondent
Business Category
Computers/Technology
Construction
Manufacturing
Other
Professional
Grand Total
Gender
Female Male
Grand Total
1
3
4
1
1
5
5
3
2
5
1
1
9
7
16
Gender
Female Male
Grand Total
5
7
12
2
4
6
1
1
13
6
19
1
4
5
15
11
26
1
3
4
4
4
8
1
1
2
1
1
42
42
84
Years in Business
Measures of Central
Tendency!
Mean
Median
Mode
Mean
Standard Error
Median
Mode
Standard Deviation
Sample Variance
Kurtosis
Skewness
Range
Minimum
Maximum
Sum
Count
22.4
2.6
15.0
5.0
23.1
534.5
3.8
2.1
98.0
2.0
100.0
1770.5
79.0
Measures of Dispersion!
Variance
Range
Standard Deviation
Skewness
10
15
12
10
7
0
Female
Yes
5
3
2
4
3
No
Male
Grand Total
Grand Total
Female
Male
Grand Total
6
2
Did You Like the Movie?
14
12
12
10
6
4
No
8
4
5
3
Yes
Grand Total
11
0
Female
Male
Grand Total
Measures of Difference
Differences between two groups (e.g., T-statistic (t-test)
males and females)
F-statistic (Anova)
Measure of difference: T-statistic
Significance of
Difference (p>0.01)
ANOVA
Differences between two or more
groups (e.g., age groups)
Measure of difference: F-statistic
E.g., T-test
13
E.g., ANOVA
14
3. I want to test if a
relationship exists between
2
or more variables
Measures of Association
Correlation
2 x variables
If interval or ratio data pearson
If ordinal data = spearman
E.g., Correlation
16
17
YY
XX
18
19
Factor Analysis
Group data to most important related to criterion (11 items = 2
dimensions of satisfaction)
Perceptual Mapping
Visual representation of perceptions by groups (brand
associations)
Conjoint Analysis
Value of peoples rankings of important product attributes
(consumer choice > price, quality, location)
(cc) Kelly Page
20
Cluster Analysis
The general term for statistical procedures that classify objects or people into
some number of mutually exclusive and exhaustive groups on the basis of two
or more classification variables.
Cluster 1: Men
Cluster 2: Women
21
22
23
Factor Analysis
Procedure for grouping & simplifying data by reducing a large set of values/items
to a smaller set of factors/dimension of a variable by identifying dimensions in the
data .
24
25
26
Perceptual Mapping
Procedure of producing visual representations of consumer perceptions of
products, brands, companies, or other objects / issues.
Expensive
Men
Women
Well Designed
Setting Markers
(cc) Kelly Page
Poorly Designed
Inexpensive
27
Conjoint Analysis
Procedure use to
quantify the value
consumers
associate with
different levels of
product/service
attributes or
features.
28
Other Considerations:
How much missing data?
How big is sample size?
How was data collected random or non-random?
(cc) Kelly Page
29
The content of this work is of shared interest between the author, Kelly
Page and other parties who have contributed and/or provided support
for the generation of the content detailed within.
This work is licensed under a Creative Commons
Attribution-NonCommercial-Share Alike 2.0 UK: England & Wales.
http://creativecommons.org/
Kelly Page (cc)
30