Professional Documents
Culture Documents
1. A company wishing to launch a new brand of digital camera would like to assess if there is a relationship
between manufacturer and camera price, maximum picture capacity and the estimated battery life.
Data collected for 20 digital camera models were used for comparison and the inter-correlation output is
below. Provide a summary analysis with reasons.
Correlations
camera maximum
manufact price in us picture estimated
urer dollars capacity battery life
manufacturer Pearson Correlation 1.000 .119 -.107 .089
Sig. (2-tailed) . .618 .654 .708
N 20 20 20 20
camera price in us dollars Pearson Correlation .119 1.000 .232 -.064
Sig. (2-tailed) .618 . .326 .789
N 20 20 20 20
maximum picture capacity Pearson Correlation -.107 .232 1.000 .106
Sig. (2-tailed) .654 .326 . .656
N 20 20 20 20
estimated battery life Pearson Correlation .089 -.064 .106 1.000
Sig. (2-tailed) .708 .789 .656 .
N 20 20 20 20
2. The data below provides the average rating of a set of consumers on certain brands of Notebooks. 100 is
the highest rating and a rating of 75 is considered average.
Using SPSS, answer the following questions appending the data output sheets as appropriate to support your
answers-
1. What is the average rating and deviation of ratings of the notebooks included in the survey?
2. What is the average rating of notebooks of various companies whose notebooks were included in this
survey?
3. Which is the most and least preferred notebook supplier?
4. Is there a relationship between supplier companies and the ratings received?
3. The marketing team of a company has tracked the number of commercials that were released over a 10 week
period with the sales volumes (in units) achieved in each of the respective weeks.
4. An agribusiness company wishes to determine if its strategies to de-risk its revenues from the rainfalls have
resulted in significant progress over the years. Data on the companys sales revenues and Met department data
on Number of districts receiving normal or excess rain and Percentage of annual rainfall compared to the Long
Period Average is below
2001-02 2759 67 92
2002-03 1913 39 81
3
5. The owner of a newly set up gymnasium and health spa conducted a study of preferences of target
buyers located midtown/downtown, in the suburbs and in the countryside. He gathered their response
ratings on the importance to them for regular exercise (1 = not at all important, 7 = very important) and
the importance to them for an outdoor lifestyle ((1 = not at all important, 7 = very important). The
following output sheet was generated by the statistical data analysis tool:
outdoor lifestyle Exercising regularly * Location of residence
outdoor Exercising
Location of residence lifestyle regularly
Midtown/Downtown Mean 4.1000 3.3000
N 10 10
Std. Deviation 2.2336 1.7670
Suburbs Mean 4.3000 3.9000
N 10 10
Std. Deviation 2.2136 1.6633
Countryside Mean 3.7000 3.6000
N 10 10
Std. Deviation 1.4944 1.7127
Total Mean 4.0333 3.6000
N 30 30
Std. Deviation 1.9561 1.6733
4
outdoor Exercising
Gender lifestyle regularly
Females Mean 4.0667 3.2667
N 15 15
Std. Deviation 2.2509 1.7099
Males Mean 4.0000 3.9333
N 15 15
Std. Deviation 1.6903 1.6242
Total Mean 4.0333 3.6000
N 30 30
Std. Deviation 1.9561 1.6733
One-Sample Test
Test Value = 0
95% Confidence
Interval of the
Sig. Mean Difference
t df (2-tailed) Difference Lower Upper
Location of residence 13.191 29 .000 2.0000 1.6899 2.3101
Exercising regularly 11.784 29 .000 3.6000 2.9752 4.2248
Correlations
1. Is there a relationship between location of the residence of a potential gymnasium user and their habit
of wanting to exercise regularly?
2. Are people of a particular gender more likely to want to exercise regularly?
3. Is there a difference in the attitude ratings of people towards regular exercise in different parts of the
city? Explain what testing procedure you used to arrive at this conclusion.
4. Based on the response ratings received, people located in which part of town are having a habit of
exercising regularly? Would this hence be your preferred choice for locating the new gymnasium and
health spa?
5
5. Who would be a better target for the gymnasium and health spa men or women? Base your opinion
on the ratings received for regular exercise. Explain how you arrived at this decision?
6. The territories in a companys eastern and western regions were rated for sales potential based on the
companys evaluation system. A sales manager wishes to conduct the appropriate statistical test to
determine if there is a difference between the two regions. Relevant data is provided to you on an excel
sheet.
You are required to develop the research hypothesis and follow a step by step procedure to perform this
analysis. Results are required to be submitted in the form of a brief managerial report
7. A Sales Manager is analyzing the performance of 17 territories that come under his jurisdiction. Using
SPSS/EXCEL, analyse the data below and answer the questions that follow
All figures are Rupees in lakhs
1. Are the Sales officers in the 16 territories providing accurate projections compared to actual sales in the
months of August and September?
2. If territories 1 to 7 are in the Northern Region, 8 to 12 are in Central Region and the rest are in the
Southern Region, which area has performed better in terms of aggregate sales in each of the past two
months?
3. Which Region (North, Central or South) has higher average third quarter (Oct to Dec) monthly sales?
4. Which of the months in the third quarter has the highest sales plan?
5. Is there a relationship between the projections of October and December? Explain in detail.
6. Based on the annual target, October projections and sales of September and August, is it possible to
predict October sales for each territory? Present a step-by-step process.
7. Based on the annual target, and projections of September and August, is it possible to predict August sales
for a new territory? Present a step-by-step process.
7
1. A group of 20 customers of sneakers were interviewed in order to ascertain their preference for the
product and their assessment of the sneakers comfort, styling and durability. Their responses are
presented below and also provided to you on an excel sheet-
Using SPSS, generate the required output and answer the questions below
1. Can you develop a model to predict a customers preference rating for your brand of sneakers based on
the three attributes of styling, comfort and durability? How accurate is your model in making this
prediction?
2. Which is the most important attribute that decides a customers preference for the brand of sneakers?
3. Is there a relationship between the customers ranking for styling and comfort?
2. A consumer products company wishes to identify the key reasons why people prefer certain brands of
toothpaste. A survey of 30 consumers was conducted and responses to 6 statements were gathered. These are
summarized below and on an excel sheet
1. A bank has decided to use an appropriate statistical technique to study expenditure patterns of its
customers.
Past research has led to the selection of four criteria, namely, age, monthly income and years after
education and number of weekly transactions as assessment parameters for credit card users. The bank
has mined data for a sample of 25 credit card customers on the basis of these criteria. Past records have
also led to the classification of these25 customers as having proven to be high spenders (1) or mid-level
spenders (2) and low spenders (3).
Using SPSS and the data provided, answer the following questions :
10
4. Based on age and monthly income, can you predict number of weekly transactions that a customer is likely
to make using the credit card? Explain how.
(ii) Using the SPSS output, prepare the predictive model for the bank and interpret all relevant
information contained within it using a step by step procedure.
(iii) Develop the decision making criteria for the banks customer assessment team.
(iv) How would you judge the predictive accuracy of your framework? Give details based on the relevant
SPSS output.
2. A credit card issuing bank has decided to use an appropriate statistical technique to screen applications.
Past research has led to the selection of three criteria, namely, age, income and years since marriage, as
crucial risk assessment parameters for credit card users. The bank has mined data for a sample of 18 credit
card customers on the basis of these three criteria. Past payment records have also led to the classification
of these 18 customers as having proven to be high risk (1) or low risk (2).
This sample information is presented below :
(i) What would be the most appropriate statistical technique to be used for assessing risk levels of future
card applicants? Why do you believe this technique is appropriate?
(ii) Using the SPSS output, prepare the predictive model for the bank and interpret all
relevant information contained within it using a step by step procedure.
(iii) Develop the decision making criteria for the banks risk assessment team.
(iv) How would you judge the predictive accuracy of your framework? Give details based on the relevant
SPSS output.
3. The data below is drawn from a selection of Nike shoes users. The survey had 20 users (mix of males and
females) who were asked how often they use their Nike shoes and then classified as Heavy, Medium and Light
users of the product. They were also asked to rate on a scale of 1 to 7, certain parameters with respect to the
Nike brand.
No Usage Gender Awareness Attitude Preference Intent to buy Brand Loyalty
(Rating scale - 7 : Very Favourable to Nike ; 1 : Very unfavourable to Nike)
Using SPSS, answer the following questions appending the data output sheets as appropriate to support your
answers-
5. What is the usage level likely to be of a male buyer who assigns a 6 rating to awareness of the Nike brand,
a 7 on attitude towards Nike brand, a 5 rating on brand preference, 4 on intent to buy and 5 on loyalty to
Nike?
6. Is there a relationship between brand loyalty and the customers intention to buy?
4. Your company wishes to develop a model that is able to classify whether a consumer is likely to use the internet
for two specific services : shopping & banking. For this, a set of 30 persons was profiled on 5 criteria sex,
familiarity with the internet, number of hours of usage of the internet per week, attitude towards the internet and
attitude towards technology. Their existing usage of the internet services for shopping and banking was also
tracked. The data collected is below and also provided to you on an excel sheet -
1 1 7 14 7 6 1 1
2 2 2 2 3 3 2 2
3 2 3 3 4 3 1 2
4 2 3 3 7 5 1 2
5 1 7 13 7 7 1 1
6 2 4 6 5 4 1 2
7 2 2 2 4 5 2 2
8 2 3 6 5 4 2 2
9 2 3 6 6 4 1 2
10 1 9 15 7 6 1 2
11 2 4 3 4 3 2 2
12 2 5 4 6 4 2 2
13 1 6 9 6 5 2 1
14 1 6 8 3 2 2 2
15 1 6 5 5 4 1 2
16 2 4 3 4 3 2 2
17 1 6 9 5 3 1 1
18 1 4 4 5 4 1 2
19 1 7 14 6 6 1 1
20 2 6 6 6 4 2 2
21 1 6 9 4 2 2 2
22 1 5 5 5 4 2 1
23 2 3 2 4 2 2 2
24 1 7 15 6 6 1 1
25 2 6 6 5 3 1 2
26 1 6 13 6 6 1 1
27 2 5 4 5 5 1 1
28 2 4 2 3 2 2 2
29 1 4 4 5 3 1 2
13
30 1 3 3 7 5 1 2
1. Using SPSS, use the appropriate statistical technique to predict whether a consumer is likely to use the
internet for banking based on the classification criteria of sex, internet familiarity, internet usage and the
attitude ratings towards technology and the internet. Why do you believe that this model is appropriate
for use? How accurate is the predictive model expected to be?
2. When doing a similar study for predicting use of the internet for online shopping, which is the most
important criteria that decides whether a consumer is likely to be a user/non-user of internet shopping
services?
3. Are consumers using the internet for shopping more likely to be users of the internet for banking services
as well?
4. Do women users as a group seem to have a higher usage of the internet?
5. Are male users more familiar with the internet?
5.A restaurant owner has just received an Excellent rating. He wishes to assess what a consumer is willing to
pay for a meal at a restaurant with a similar Excellent rating. The data from 300 restaurants in the city was used
to help arrive at this decision. Output generated by SPSS is below.
Model Summary
Std. Error
Adjusted of the
Model R R Square R Square Estimate
1 .547a .299 .297 7.7882
a. Predictors: (Constant), Quality of meal
ANOVAb
Sum of Mean
Model Squares df Square F Sig.
1 Regression 7716.294 1 7716.294 127.214 .000a
Residual 18075.502 298 60.656
Total 25791.797 299
a. Predictors: (Constant), Quality of meal
b. Dependent Variable: MEALCOST
14
Coefficientsa
Standardi
zed
Unstandardized Coefficien
Coefficients ts
Model B Std. Error Beta t Sig.
1 (Constant) 12.025 1.310 9.183 .000
Quality of meal 7.150 .634 .547 11.279 .000
a. Dependent Variable: MEALCOST
NOTE :Quality of Meal Ratings 1 for Good, 2 for Very Good, 3 for Excellent,
1. In a large scale customer survey by Jet Airways, passengers were asked to indicate on a seven-point
scale (1 = completely agree, 7 = completely disagree), their agreement or disagreement with the set of
10 statements relating to their perceptions and attributes of the airline. The 10 statements were as
follows :
a. Which data analysis technique has been used by Jet Airways? Do you think the technique used is
appropriate? Why?
b. Are there any dependent and independent variables?
c. Determine how many factors can be extracted? What criterion is used to decide on which
variables are associated with which factor? Explain.
d. Interpret and name the factors after you determine the variables associated with each factor?
2. A researcher gathered data from a sample of 25 respondents. The data collected pertained to the level
of agreement that the individual had on seven lifestyle statements. Respondents had to rate their
agreement with each statement on a scale of 1 to 7 (1 signifying complete Disagreement with the
statement and 7 indicating complete agreement). The lifestyle statements were constructed with a
view to ascertain the core personality traits of people being surveyed. The SPSS output gathered during
the analysis is presented below.
Communalities
Initial Extraction
I would rather spend a
quiet evening at home 1.000 .818
rather than go out to party
I always check prices,
1.000 .796
even on small items
Magazines are more
1.000 .790
interesting than movies
I would not buy products
1.000 .800
advertised on billboards
I am a homebody 1.000 .805
I save and use cash
1.000 .841
coupons
Companies waste a lot of
1.000 .796
money advertising
Extraction Method: Principal Component Analysis.
Component Matrixa
Component
1 2 3
I would rather spend a
quiet evening at home .817 .378 8.694E-02
rather than go out to party
I always check prices,
.279 -.714 .457
even on small items
Magazines are more
.887 -2.70E-02 -4.34E-02
interesting than movies
I would not buy products
-.204 .634 .597
advertised on billboards
I am a homebody .664 .505 .329
I save and use cash
5.012E-02 -.604 .689
coupons
Companies waste a lot of
-.684 .383 .426
money advertising
Extraction Method: Principal Component Analysis.
a. 3 components extracted.
a
Rotated Component Matrix
Component
1 2 3
I would rather spend a
quiet evening at home .897 -8.25E-02 -7.57E-02
rather than go out to party
I always check prices,
4.855E-02 -.232 .860
even on small items
Magazines are more
.762 -.440 .125
interesting than movies
I would not buy products
.214 .867 -5.25E-02
advertised on billboards
I am a homebody .868 .224 -1.74E-02
I save and use cash
-5.69E-02 9.057E-02 .911
coupons
Companies waste a lot of
-.351 .817 -7.28E-02
money advertising
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.
a. Rotation converged in 4 iterations.
20
Component 1 2 3
1 .882 -.444 .154
2 .417 .586 -.695
3 .218 .678 .702
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.
3 . A group of potential buyers of Refrigerators were polled to ascertain their attitudes in order to decide on the
companys forthcoming ad campaign and target markets. 25 Respondents were polled using the following brief
questionnaire administered at the exit point of a popular mall.
Resp # v1 v2 v3 v4 v5 v6 v7
1 6 2 7 6 5 3 5
2 5 7 5 6 6 6 4
3 5 3 4 5 6 6 7
4 3 2 2 5 1 3 2
5 4 2 3 2 2 1 3
6 2 6 2 4 3 7 5
7 1 3 3 6 2 5 7
8 3 5 1 4 2 5 6
9 7 3 6 3 5 2 4
10 6 3 3 4 4 6 5
11 6 6 2 6 4 4 7
12 3 2 2 7 6 1 6
13 5 7 6 2 2 6 1
14 6 3 5 5 7 2 3
15 3 2 4 3 2 6 5
16 2 7 5 1 4 5 2
17 3 2 2 7 2 4 6
18 6 4 5 4 7 3 3
19 7 2 6 2 5 2 1
20 5 6 6 3 4 5 3
21 2 3 3 2 1 2 6
22 3 4 2 1 4 3 6
21
23 2 6 3 2 1 5 3
24 6 5 7 4 5 7 2
25 7 6 5 4 6 5 3
Using an appropriate statistical technique, can you identify the key factors that the company must bear in mind
based on the buyer profiling generated with the responses received. State a step by step procedure used by you to
make these conclusions.
4..A manufacturer of two wheelers is attempting to analyse the factors affecting demand for his brand. Look at
the analytical output below and by interpreting the data, answer the questions that follow :
1. What is the analytical method being used and why do you believe it is appropriate?
2. How many variables are being studied? Which are the dependent and independent variables?
3. Explain using a step by step procedure, how the Marketing Manager can arrive at the specific reasons why
people in this target market prefer to buy two wheelers?
Using this analysis, what in your view, would be the next steps that the marketing team of this two wheeler
company should take? Why
23
1. You are the sales manager for a consumer products company. You are keen to understand
whether the sales projections submitted by your team are accurate. To analyse this, you have
drawn up the following data based on the targets and sales achievement for July and August
2015. What would be your conclusions on the accuracy of the sales forecasts of your teams?
Give reasons to support your views. (5 marks)
Correlations
july sales july sales august sales august sales
plan actuals plan actuals
**
Pearson Correlation 1 .687 .276 .660**
july sales plan Sig. (2-tailed) .002 .283 .004
N 17 17 17 17
** *
Pearson Correlation .687 1 -.486 .231
july sales actuals Sig. (2-tailed) .002 .048 .373
N 17 17 17 17
Pearson Correlation .276 -.486* 1 .539*
august sales plan Sig. (2-tailed) .283 .048 .026
N 17 17 17 17
** *
Pearson Correlation .660 .231 .539 1
august sales actuals Sig. (2-tailed) .004 .373 .026
N 17 17 17 17
**. Correlation is significant at the 0.01 level (2-tailed).
*. Correlation is significant at the 0.05 level (2-tailed).
24
2. Based on your experience, you have identified three predictors for a consumers purchase
patterns at your mall. You have observed that the value of purchase (in rupees) of a family
depends on the monthly income of the head of the household and the spouse as well as the
age of their children. You have collected this information from a sample of 48 customers and
based on this you wish to prepare a model to predict potential future customer purchase (in
rupees).
Variables Entered/Removeda
a. Predictors: (Constant), Age of child, Monthly income of spouse, Monthly income of head of household
ANOVAa
Coefficientsa
Model Unstandardized Coefficients Standardized t Sig.
Coefficients
B Std. Error Beta
(Constant) 3067.122 2201.388 1.393 .171
Monthly income of head of
.174 .081 .455 2.146 .037
1 household
Monthly income of spouse .279 .119 .494 2.348 .023
Age of child -314.571 179.301 -.101 -1.754 .086
25
1. For a consumer household where the head of the household earns Rs 100000 per month and
their spouse earns Rs 50000 per month having a child aged 12 years old, what would you predict
as the likely value of purchase at the mall?
3. If past data was unavailable or unreliable, do you have any other alternative methodology to
make such a forecast? Explain briefly how or why not? (5 marks)