LIS397C
Version 3.5
LIS 397C
Doty
Practice Exercises 3.5
1.
s
2
s
N
11
13
10
Compute:
(a) n
(b) $\sum x$
(c) $\bar{x}$
(d) s
Copyright Philip Doty, University of Texas at Austin, August 2004
(e) $s^2$
(f) median
(g) mode
(h) range
(i) CV
3.
51
53
58
54
52
50
50
57
59
51
57
53
51
55
59
52
56
54
5.
6.
7.
positively skewed
validity
box plot
whiskers
8.
9.
Distribution 2
Distribution 3
10.
4
10
6
3
3
1
1
1
2
3
4
5
6
7
85
25
31
40
44
51
19
10
1
2
4
6
8
10
12
13
14
8
20
25
35
40
45
24
20
(d)
(e)
(f)
(g)
(h)
(i)
11.
Define:
error model
quartile
percentile
Q1
Q2
Q3
freq (x)
cf (x)
rel freq (x)
cum rel freq (x)
PR
z-scores
deviation score
s of z-scores
$\bar{x}$ of z-scores
centile
Interquartile Range (IQR)
non-response bias
self-selection
12.
13.
x    freq (x)
9    3
8    9
6    5
3    2
2    6
15.
16.
Define:
sampling distribution
Central Limit Theorem
Standard Error (SE)
ND($\mu$, $\sigma$)
decile
descriptive statistics
inferential statistics
effect size
confidence interval (C.I.) on $\mu$
Student's t
degrees of freedom (df)
random sampling
stratified random sample
18.
19.
20.
From previous research, we know that the standard deviation of the ages
of public library users is 3.9 years. If the "average" age of a sample of 90
public library users is 20.3 years, construct:
a. a 95% C.I. around $\mu$
b. a 90% C.I. around $\mu$
c. a 99% C.I. around $\mu$.
What does a 95% interval around $\mu$ mean?
What is our best estimate of $\mu$?
21.
The sample size in problem 20 was increased to 145, while the "average"
age of the sample remained at 20.3 years. Construct three confidence
intervals around $\mu$ with the same levels of confidence as in Question 20.
22.
23.
24.
x     freq(x)
43    6
42    11
39    3
36    7
35    7
34    15
Calculate:
(a) $\bar{x}$
(b) s
(c) SE = $\sigma_{\bar{x}}$
(d) E($\bar{x}$)
(e) $Q_1$, $Q_2$, and $Q_3$
(f) mode
(g) CV
(h) IQR
(i) a 95% C.I. on $\mu$
(j) the width of the confidence interval in part (i)
(k) a 99% C.I. on $\mu$
(l) the width of the confidence interval in part (k)
(m) PR (percentile rank) of x = 42
(n) our best estimates of $\mu$ and $\sigma$ from the data.
25.
26.
27.
               Novice   Intermediate   Expert
≤ 5            14       15             22
> 5, ≤ 10      20       16             11
> 10           19       9              2
28.
29.
Define:
statistical hypothesis
$H_0$
$H_1$
p
Type I error
Type II error
$\chi^2$
nonparametric
contingency table
statistically significant
E (expected value) in $\chi^2$
O (observed value) in $\chi^2$
30.
$\alpha$/p
df/R/C [in $\chi^2$ situation]
$H_0$/$H_1$
$\chi^2$/$\alpha$/df
$\alpha$/Type I error
E/O/$\chi^2$
Type I/Type II error
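(a) n = 10
(b) $\sum x = 78$
(c) $\bar{x} = \frac{\sum x}{n} = \frac{78}{10} = 7.8$
(d) $s = \sqrt{\frac{\sum x^2 - n\bar{x}^2}{n - 1}} = \sqrt{\frac{714 - 608.4}{9}} = \sqrt{11.73} = 3.43$
(e) $s^2 = 11.73$
(f) P(med) = $\frac{n + 1}{2}$th score = $\frac{10 + 1}{2}$th score = 5.5th score
    med = 5.5th score = $\frac{\text{5th score} + \text{6th score}}{2} = \frac{8 + 9}{2} = 8.5$
(g) mode = 9
(h) range = highest value - lowest value = 13 - 2 = 11
(i) coefficient of variation (CV) = $\frac{s}{\bar{x}} = \frac{3.43}{7.8} = 0.44$
4.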
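(a) n = 20
(b) $\sum x = 1079$
(c) $\bar{x} = \frac{\sum x}{n} = \frac{1079}{20} = 53.95$
(d) $s = \sqrt{\frac{\sum x^2 - n\bar{x}^2}{n - 1}} = \sqrt{\frac{58395 - 58212}{19}} = \sqrt{9.63} = 3.1$
(e) $s^2 = 9.63$
(f) P(med) = $\frac{n + 1}{2}$th score = $\frac{20 + 1}{2}$th score = 10.5th score
    median = $\frac{53 + 54}{2} = 53.5$
(i) CV = $\frac{s}{\bar{x}} = \frac{3.1}{53.95} = 0.06$
10.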
(d) mean = $\bar{x}$ =
$s = \sqrt{\frac{\sum x^2 - n\bar{x}^2}{n - 1}}$
$s_2$ =
$s_3$ =
CV = $\frac{s}{\bar{x}} = \frac{5.21}{27.7}, \frac{5.71}{4.1}, \frac{3.31}{9.5} = 0.19, 1.39, 0.35$
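The standard deviations in the first two answer sets above (n = 10 and n = 20) both come from the computational formula $s = \sqrt{(\sum x^2 - n\bar{x}^2)/(n - 1)}$. As a minimal sketch, assuming only the sums quoted in those answers (the raw scores are not all reproduced here), the arithmetic can be checked in Python. The helper name `summary_from_sums` is hypothetical and not part of the course materials.

```python
import math

def summary_from_sums(n, sum_x, sum_x2):
    """Return (mean, variance, sd, cv) from n, sum of x, and sum of x squared.

    Uses the sample (n - 1) computational formula quoted in the answers.
    """
    mean = sum_x / n
    variance = (sum_x2 - n * mean ** 2) / (n - 1)
    sd = math.sqrt(variance)
    cv = sd / mean
    return mean, variance, sd, cv

# Sums taken from the worked answers: n = 10 and n = 20.
first = summary_from_sums(10, 78, 714)
second = summary_from_sums(20, 1079, 58395)

for label, (mean, variance, sd, cv) in (("n = 10", first), ("n = 20", second)):
    print(f"{label}: mean={mean:.2f} s^2={variance:.2f} s={sd:.2f} CV={cv:.2f}")

# CV for the three distributions, using the s and mean pairs quoted above.
cvs = [round(s / m, 2) for s, m in ((5.21, 27.7), (5.71, 4.1), (3.31, 9.5))]
print(cvs)
```

Working from the sums rather than the raw scores mirrors how the answers are laid out by hand; with the raw data available, `statistics.stdev` would give the same s.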
13. (a)
x    freq (x)   cum freq (x)   rel freq (x)   cum rel freq (x)
9    3          25             0.12           1.00
8    9          22             0.36           0.88
6    5          13             0.20           0.52
3    2          8              0.08           0.32
2    6          6              0.24           0.24
     25                        1.00
(b) N = 25
(c) Range = H - L = 9 - 2 = 7
(d) P(median) = $\frac{N + 1}{2}$th position = 13th observation; median = 6
(e) mode = 8
(f)
(g) $\mu = \frac{\sum x}{N} = \frac{147}{25} = 5.88$
$\sigma = \sqrt{\frac{\sum x^2 - N\mu^2}{N}} = \sqrt{\frac{1041 - (25)(5.88)^2}{25}} = \sqrt{\frac{176.64}{25}} = \sqrt{7.07} = 2.66$
P($Q_1$) = $\frac{1 + P(Q_2)}{2} = \frac{1 + 13}{2}$ = 7th observation from the beginning; $Q_1$ = 3
P($Q_3$) = $\frac{1 + P(Q_2)}{2} = \frac{1 + 13}{2}$ = 7th observation from the end; $Q_3$ = 8
2.66
(l) $z = \frac{x - \mu}{\sigma}$ (for populations)
$z_6 = \frac{6 - 5.88}{2.66} = 0.05$; $z_8 = \frac{8 - 5.88}{2.66} = 0.80$; $z_2 = \frac{2 - 5.88}{2.66} = -1.46$
$z_3 = \frac{3 - 5.88}{2.66} = -1.08$; $z_9 = \frac{9 - 5.88}{2.66} = 1.17$
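The frequency-table work in answer 13 can be retraced step by step. The sketch below assumes the table as given (x = 9, 8, 6, 3, 2 with frequencies 3, 9, 5, 2, 6), treats the data as a population (divisor N), and uses the positional rule from the answer for the median and quartiles. Variable names are illustrative only.

```python
import math

# Frequency table from answer 13: score -> frequency.
freq = {9: 3, 8: 9, 6: 5, 3: 2, 2: 6}

# Expand the table into a sorted list of individual observations.
scores = sorted(x for x, f in freq.items() for _ in range(f))
N = len(scores)

# Population mean and standard deviation (divisor N, as in the answer).
mu = sum(scores) / N
sigma = math.sqrt(sum(x * x for x in scores) / N - mu ** 2)

# Positional rule used in the answer: the median sits at position (N + 1) / 2,
# and each quartile sits at position (1 + P(Q2)) / 2 counted from either end.
# This sketch assumes both positions are whole numbers, as they are here.
p_q2 = (N + 1) // 2
p_q = (1 + p_q2) // 2
median = scores[p_q2 - 1]
q1 = scores[p_q - 1]   # counted from the beginning
q3 = scores[-p_q]      # counted from the end

# z-score of each distinct score, rounded to two places.
z = {x: round((x - mu) / sigma, 2) for x in freq}

print(N, round(mu, 2), round(sigma, 2), median, q1, q3)
print(z)
```

Expanding the table into individual observations is wasteful for large N but keeps the positional rule easy to read; a cumulative-frequency lookup would do the same job without the expansion.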
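$z = \frac{x - \mu}{\sigma} = \frac{45 - 50}{2.5} = -2.00 \rightarrow 0.4772$
$z = \frac{52.6 - 50}{2.5} = 1.04$
$z = \frac{58 - 50}{2.5} = 3.2 \rightarrow 0.4993$
$z = -0.55 = \frac{x - 50}{2.5}$; $x - 50 = -1.375$; $x = 48.63$
$z = 1.27 = \frac{x - 50}{2.5}$; $x = 53.175$
(f) $z_{45} = \frac{45 - 50}{2.5} = -2.00$
$z_{49} = \frac{49 - 50}{2.5} = \frac{-1}{2.5} = -0.4 \rightarrow 0.1554$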
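The z-scores and table areas above can be checked without a printed normal table. This is a minimal sketch, assuming the mean of 50 and standard deviation of 2.5 that appear in the worked fractions; it relies on `statistics.NormalDist` from the Python standard library (3.8 or later). The function names are hypothetical helpers.

```python
from statistics import NormalDist

# Parameters read off the worked answers: mean 50, standard deviation 2.5.
dist = NormalDist(mu=50, sigma=2.5)
standard = NormalDist()  # standard normal: mean 0, sd 1

def z_score(x):
    """Convert a raw score to a z-score under dist."""
    return (x - dist.mean) / dist.stdev

def area_mean_to_z(z):
    """Area under the standard normal curve between the mean and z."""
    return abs(standard.cdf(z) - 0.5)

def raw_score(z):
    """Convert a z-score back to a raw score."""
    return dist.mean + z * dist.stdev

for x in (45, 49, 58):
    z = z_score(x)
    print(f"x = {x}: z = {z:.2f}, area = {area_mean_to_z(z):.4f}")

for z in (-0.55, 1.27):
    print(f"z = {z}: x = {raw_score(z):.3f}")
```

The area function returns the "mean to z" area that a printed z table lists, which is why 0.5 is subtracted from the cumulative probability.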
2, 5, 9, 5, 3, 6, 6, 3, 1, 13
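(a) E($\bar{x}$) = $\mu$ = 4.1
(b) SE = $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{2.93}{\sqrt{10}} = 0.93$
20.
SE = $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{3.9}{\sqrt{90}} = 0.41$
[$\frac{0.95}{2} = 0.4750 \rightarrow 1.96 = z$]
$\bar{x} - z\,\mathrm{SE} \le \mu \le \bar{x} + z\,\mathrm{SE}$
$z_{90\%} = 1.65$
$\bar{x} - z\,\mathrm{SE} \le \mu \le \bar{x} + z\,\mathrm{SE}$
$z_{99\%} = 2.58$
$\bar{x} - z\,\mathrm{SE} \le \mu \le \bar{x} + z\,\mathrm{SE}$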
$\bar{x}$, 20.3 years
21. N = 145,
$19.67 \le \mu \le 20.93$
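The intervals for problems 20 and 21 all follow the same pattern: the sample mean plus or minus z times the standard error. The sketch below assumes the values stated in the problems (mean 20.3, standard deviation 3.9, n = 90 and n = 145) and the rounded z values quoted in the answers. The function name `z_interval` is a hypothetical helper.

```python
import math

def z_interval(mean, sigma, n, z):
    """Return (lower, upper) for mean +/- z * sigma / sqrt(n)."""
    se = sigma / math.sqrt(n)
    return mean - z * se, mean + z * se

# z values as quoted in the answers (table values rounded to two places).
Z = {"90%": 1.65, "95%": 1.96, "99%": 2.58}

for n in (90, 145):  # sample sizes from problems 20 and 21
    for level, z in Z.items():
        lo, hi = z_interval(20.3, 3.9, n, z)
        print(f"n = {n}, {level} C.I.: {lo:.2f} <= mu <= {hi:.2f}")
```

Comparing the two sample sizes shows the point of problem 21: a larger n shrinks the standard error, so every interval narrows while the center stays at 20.3.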
0.05
df = n - 1
n = 20, 9, $\infty$, 1
t = 2.093, 2.306, 1.960, ?
0.01
df = n - 1
t = 2.861, 3.355, 2.576, ?
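24. (a) $\bar{x} = \frac{\sum x}{n} = \frac{1844}{49} = 37.63$
(b) $s = \sqrt{\frac{\sum x^2 - n\bar{x}^2}{n - 1}} = \sqrt{\frac{70048 - 69385}{48}} = 3.72$
(c) SE = $\sigma_{\bar{x}} = \frac{s}{\sqrt{n - 1}} = \frac{3.72}{6.93} = 0.537$
(d) E($\bar{x}$) = $\mu$
(e) P($Q_2$) = $\frac{n + 1}{2}$th position = 25th observation; $Q_2$ = 36
P($Q_1$) = $\frac{1 + P(Q_2)}{2} = \frac{1 + 25}{2}$ = 13th observation from the beginning; $Q_1$ = 34
P($Q_3$) = $\frac{1 + P(Q_2)}{2} = \frac{1 + 25}{2}$ = 13th observation from the end; $Q_3$ = 42
(f) mode = 34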
(g) CV = $\frac{s}{\bar{x}} = \frac{3.72}{37.63} = 0.10$
(h) IQR = $Q_3 - Q_1$ = 42 - 34 = 8
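The sums behind answer 24 can be rebuilt directly from the frequency table in problem 24. The sketch below is an illustration under that assumption, not the handout's own method. One detail worth noticing: the worked answer squares the rounded mean 37.63, which gives s = 3.72, whereas carrying the unrounded mean through gives s closer to 3.69. Both versions are computed so the difference is visible.

```python
import math

# Frequency table from problem 24: score -> frequency.
freq = {43: 6, 42: 11, 39: 3, 36: 7, 35: 7, 34: 15}

n = sum(freq.values())
sum_x = sum(x * f for x, f in freq.items())
sum_x2 = sum(x * x * f for x, f in freq.items())

mean = sum_x / n

# Sample standard deviation with the unrounded mean.
s = math.sqrt((sum_x2 - n * mean ** 2) / (n - 1))

# Same formula with the mean rounded to two places first, as in the answer.
mean_rounded = round(mean, 2)
s_rounded = math.sqrt((sum_x2 - n * mean_rounded ** 2) / (n - 1))

# The worked answer divides by sqrt(n - 1); many texts use sqrt(n) instead.
se = s_rounded / math.sqrt(n - 1)

cv = s / mean
mode = max(freq, key=freq.get)

print(n, sum_x, sum_x2)
print(round(mean, 2), round(s, 2), round(s_rounded, 2), round(cv, 2), mode)
```

The gap between 3.69 and 3.72 comes entirely from rounding the mean before squaring it; the computational formula subtracts two large, nearly equal numbers, so small rounding in the mean is magnified.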
0.05
27.
                             EXPERTISE
TIME (MINS)    Novice        Intermediate   Expert        Total
≤ 5            14 (21.12)    15 (15.94)     22 (13.95)    51
> 5, ≤ 10      20 (19.46)    16 (14.69)     11 (12.85)    47
> 10           19 (12.42)    9 (9.38)       2 (8.20)      30
Total          53            40             35            128

$E = \frac{R \times C}{n}$
$\chi^2 = \sum \frac{(O - E)^2}{E}$
$\chi^2 = \frac{(14 - 21.12)^2}{21.12} + \frac{(15 - 15.94)^2}{15.94} + \frac{(22 - 13.95)^2}{13.95} + \frac{(20 - 19.46)^2}{19.46} + \frac{(16 - 14.69)^2}{14.69} + \cdots$
$\chi^2 = 2.40 + 0.06 + 4.65 + 0.01 + 0.12 + 0.27 + 3.49 + 0.02 + 4.69 = 15.71$
df = (R - 1)(C - 1) = 2 x 2 = 4
$\chi^2$(4, 0.10) = 7.78
28. $\chi^2$(4, 0.05) = 9.49
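The expected counts and the chi-square sum in answer 27 can be reproduced from the observed table alone. This is a minimal sketch assuming the observed counts from problem 27 and the critical values quoted in the answers; it carries unrounded expected values, so the statistic comes out near 15.69 rather than the hand-rounded 15.71.

```python
# Observed counts from problem 27: rows are time bands, columns are expertise.
observed = [
    [14, 15, 22],  # <= 5 minutes
    [20, 16, 11],  # > 5 and <= 10 minutes
    [19, 9, 2],    # > 10 minutes
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected count for each cell: E = (row total x column total) / n.
expected = [[r * c / n for c in col_totals] for r in row_totals]

# Chi-square statistic: sum of (O - E)^2 / E over all cells.
chi_square = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(observed, expected)
    for o, e in zip(obs_row, exp_row)
)

df = (len(observed) - 1) * (len(observed[0]) - 1)

# Critical values quoted in the answers for df = 4.
critical = {0.10: 7.78, 0.05: 9.49}

print(f"chi-square = {chi_square:.2f}, df = {df}")
for alpha, cutoff in critical.items():
    print(f"alpha = {alpha}: statistic exceeds critical value = {chi_square > cutoff}")
```

With SciPy installed, `scipy.stats.chi2_contingency(observed)` should give the same statistic and also a p-value; the plain-Python version is kept here so each step matches the hand calculation.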