Professional Documents
Culture Documents
INSTRUCTIONS TO INVIGILATORS:
Question 1
The following data represent the number of cases of salad dressing purchased per
week by a local supermarket chain over a period of 30 weeks.
b. After examining the stem and leaf display prepared in a., comment on the
possible distribution of the number of cases purchased by the supermarket
chain.
(2 marks)
c. When the table of descriptive statistics was printed below, the ink in the
printer was running low and many of the important descriptive statistics were
missing. Find all these missing values.
Number of cases
Mean
Standard Error
Median
Mode
Standard Deviation
Sample Variance
Kurtosis 0.4838
Skewness 0.8075
Range 77
Minimum 56
Maximum 133
Sum 2641
Count 30
Confidence Level(95.0%) 7.0157
(10 marks)
e. If you had to make a prediction of the number of cases of salad dressing that
would be ordered next week, how many cases would you predict? Why?
(2 marks)
i. What is the probability that the consumer will buy the computer or the
software package or both?
(2 marks)
ii. Assuming the claim is true, what is the probability of observing two or
fewer doctors recommending the product?
(2 marks)
iii. Given the sampling results, do you believe the advertisement? Explain.
(2 marks)
i. If the pass mark was 50%, what proportion of the class failed?
(3 marks)
a. The taxation department wants to estimate to within $500, the average income
of all workers, with 95% confidence. How large a sample should the
department take if the standard deviation for incomes is known to be $3000?
(5 marks)
b. A sample of 25 male students from QBM 117 had their heights recorded. It
was found that the average and standard deviation of their heights was 180cm
and 5cm respectively.
ii. From historical data it was found that the average height of males in
QBM117 was 175cm. Is there reason to believe that there has been an
increase in males height? Use 5%.
(8 marks)
a. A television manufacturer claims that not more than 10% of its television sets
will need any repair during their first 2 years of operation. To test this claim, a
random sample of 100 TV sets are monitored for the first two years of their
operation and 14 are found to need repair. At the 5% level of significance, test
the manufacturers claim. [Use H 0 : p 0.1 H A : p 0.1 ]
(8 marks)
50000
40000
Salary ($)
30000
20000
10000
0
0 10 20 30 40 50
Experience (years)
Histogram of residuals
15
Frequency
10
5
0
0
0
00
00
00
00
20
90
20
50
19
36
53
70
-4
-3
-1
Residuals
8000
6000
4000
Residuals
2000
0
-2000
0 10 20 30 40 50
-4000
-6000
-8000
Experience (years)
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.80322468
R Square 0.64516989
Adj. R Square 0.63777759
Std Error 2520.28938
Observations 50
ANOVA
df SS MS F Significance F
Regression 1 554364839 554364839 87.2760049 2.239E-12
Residual 48 304889211 6351858.56
Total 49 859254050
iii. From the scatterplot, it appears that salary is linearly related to years of
experience. Test this relationship at a 5% level of significance.
(4 marks)
iv. With reference to the plots provided, does it appear that the linear
model fitted in Excel is appropriate?
(4 marks)
These questions are to be answered on the General Purpose Answer Sheet provided.
Use a 2B pencil only.
1. You are looking at the sales figures for 35 companies. The variable is
2. We have a set of data with 200 data values. The minimum value is 2.5 and the
maximum value is 80. Which set of class intervals should be used to prepare
the frequency distribution.
A.
> 0 up to and including 5
> 5 up to and including 10
> 10 up to and including 15
> 15 up to and including 20
> 20 up to and including 25
> 25 up to and including 30
> 30 up to and including 35
> 35 up to and including 40
> 40 up to and including 45
> 45 up to and including 50
> 50 up to and including 55
> 55 up to and including 60
> 60 up to and including 65
> 65 up to and including 70
> 70 up to and including 75
> 75 up to and including 80
B.
0 to 10
10 to 20
20 to 30
30 to 40
40 to 50
50 to 60
60 to 70
70 to 80
C.
> 2.5 up to and including 12.5
> 12.5 up to and including 22.5
> 22.5 up to and including 32.5
> 32.5 up to and including 42.5
> 42.5 up to and including 52.5
> 52.5 up to and including 62.5
> 62.5 up to and including 72.5
> 72.5 up to and including 82.5
5. The histogram following shows the length of hospital stay (in days) for a
sample of patients.
20
Freqeuncy
15
10
5
0
1 5 9 13 17 21 25 29 33
Length of stay (days)
A. positively skewed.
B. negatively skewed.
C. approximately normally distributed.
D. unimodal.
E. bimodal.
6. If a box of camera film has 10 rolls in it and 3 have already been used. What
is the probability of selecting 2 rolls and finding both are unused?
A. 0.067
B. 0.21
C. 0.467
D. 0.49
E. 0.933
A. 0.9893
B. 0.9890
C. 0.4893
D. 0.0107
E. 0.4890
8. find the probability that exactly 3 cars arrive in the next ten minutes.
A. 0.172
B. 0.117
C. 0.010
D. 0.003
E. 0.007
9. find the probability that fewer than 5 cars arrive in the next five minutes.
A. 0.067
B. 0.038
C. 0.616
D. 0.176
E. 0.440
A. lower.
B. higher.
C. remains the same.
D. depends on whether it is a one or two tailed test.
E. none of the above.
13. In a packet of 50 seeds, two did not germinate. Estimate a 95% confidence
interval for the population proportion of seeds that will not germinate.
0.04 0.96
A. 0.04 1.96
50
0.04 0.96
B. 0.04 1.645
50
0.96 0.04
C. 0.96 1.96
50
0.96 0.04
D. 0.96 1.645
50
E. ˆ and nq
none of the above since both np ˆ are not greater than 5.
H 0 : 100
H A : 100
15. If x was found to be equal to 102.3, given that 2 25 and n = 36, the
appropriate test statistic would be
102.3 100
A. t
5 / 36
102.3 100
B. z
5 / 36
102.3 100
C. t
5
102.3 100
D. z
5
102.3 100
E. z
25 / 36
16. The p-value for a certain hypothesis test was 0.15. The level of significance
used for this test was 0.05. With this information we should
A. reject H 0 .
B. do not reject H 0 .
C. accept H A .
D. do not reject H A .
E. none of the above.
17. The residuals formed when a regression line is fitted to a data set should
ideally
A. be normally distributed.
B. have an expected value of zero.
C. have a variance which is independent of the value of the independent
variable.
D. be independent from each other.
E. possess all the characteristics of A., B., C. and D.
Use the following information to answer questions 18., 19. and 20.
SUMMARY OUTPUT
Pric
Regression Statistics
Multiple R 0.422719 1
(per share)
Earnings
R Square 0.178691 0
Adj R Square 0.133063
-1 0
Std Error 0.443519
-2
Observations 20
ANOVA
df SS MS F Sig F
Regression 1 0.770359 0.770359 3.916237 0.06332857
Residual 18 3.540761 0.196709
Total 19 4.31112
Coeffs StdError t Stat P-value Lower 95% Upper 95% Lower 99.0% Upper 99.0%
Intercept -0.644023 0.173623 -3.709317 0.001605 -1.0087913 -0.279254 -1.14378624 -0.14425904
Price ($) 0.02382 0.012037 1.978948 0.063329 -0.0014682 0.04910918 -0.01082715 0.05846812
18. The correlation between the earnings per share and the closing stock price is
A. 0.179
B. –0.179
C. 0.423
D. –0.423
E. unable to be determined from the information provided.
A. -$1.14 to -$0.14
B. -$0.01 to $0.06
C. -$1.01 to -$0.28
D. -$0.00 to $0.05
E. unable to be determined from the information given
A. t 0.01,19
B. t 0.005,18
C. t 0.01,17
D. t 0.01,18
E. t 0.005,19