You are on page 1of 15

PENGENALAN ALAT-ALAT UJI STATISTIK

DALAM PENELITIAN SOSIAL


Tatang A Gumanti
2010
Choosing the right test: 3
DV is
Dichotomous Categorical Continuous
IV is/are:
Dichot-
omous
Chi-square Chi-square t-test
Cate-
gorical
Chi-square Chi-square ANOVA
Contin-
uous
Discriminant
function
analysis
Discriminant
function
analysis
Correlation or
regression
Type of Scale and
Appropriate Statistical Test
Type of Scale Measure of
Central Tendency
Measure of
Dispersion
Statistical Test
Nominal Mode None Chi-Square
Ordinal Median Percentile Chi-Square
Interval or Ratio Mean Standard
Deviation
T-test, ANOVA
Measurement scales
Nominal Scale
numbers assigned to the object serve as labels for
identification i.e. gender (male, female); store type;
accommodation type
(mode, frequency, percentage)

Ordinal Scale
a scale that arranges objects or alternatives
according to their magnitude in an ordered
relationship i.e. preference ranking for a product;
social class
(median, semi-interquartile range)
Measurement scales
Interval Scale
a scale that both arranges objects according to their
magnitude and also distinguishes this ordered
arrangements in units of equal intervals i.e. attitudes,
opinions (5 point likert scale)
(mean, standard deviation, variance, range)
Ratio Scale
a scale that has absolute rather than relative quantities
i.e. income, sales, costs, market share
possess an absolute zero point and interval properties
(mean, standard deviation, variance + all lower level
descriptive statistics)
Parametric versus non -
parametric statistics
Statistical techniques can be classified as -
Parametric statistics
the use is based on the assumption that the
population from which the sample is drawn is
normally distributed and data are collected on an
interval or ratio scale.
Non-Parametric statistics
makes no explicit assumptions regarding the
normality of distribution in the population (less
stringent requirements) and are used when the data
are collected on a nominal or ordinal scale.
Methods of scaling
Response scales
rating scales: estimates magnitude of a
characteristic
ranking scale: rank order preference
sorting scales: arrange or classify concepts
choice scales: selection of preferred
alternative
Testing Statistical Hypotheses
example
Suppose
Assume and population is normal, so sampling
distribution of means is known (to be normal).
Rejection region:
Region (N=25):


We get data

Conclusion: reject null.
75 : ; 75 :
1 0
= = H H
10 = o
3 2 1 0 -1 -2 -3
Z

Z

Z

1.96
-1.96
Don't reject Reject
Reject
Likely Outcome
If Null is True
79 ; 25 = = X N
92 . 78 08 . 71
25
10
96 . 1 75 =
X
Tests of Normality
.246 93 .000 .606 93 .000
TOTAL TIME SPENT
ON THE INTERNET
Stati sti c df Si g. Stati sti c df Si g.
Kolmogorov-Smi rnov
a
Shapi ro-Wi lk
Li ll i efors Si gni fi cance Correcti on
a.
The test of normality
Problem 1 asks about the results of the test of normality. Since the sample
size is larger than 50, we use the Kolmogorov-Smirnov test. If the sample
size were 50 or less, we would use the Shapiro-Wilk statistic instead.

The null hypothesis for the test of normality states that the actual
distribution of the variable is equal to the expected distribution, i.e., the
variable is normally distributed. Since the probability associated with the
test of normality is < 0.001 is less than or equal to the level of significance
(0.01), we reject the null hypothesis and conclude that total hours spent on
the Internet is not normally distributed. (Note: we report the probability as
<0.001 instead of .000 to be clear that the probability is not really zero.)

The answer to problem 1 is false.

Confidence intervals in z
For large samples (N>100) can use z.


Suppose

Then

If
M
M
est
y
z
o

.
) (
=
N
N
y y
N
s
est
y
M
1
) (
.
2

= =

o
200 ; 5 ; 10 : ; 10 :
1 0
= = = = N s H H
y

35 .
14 . 14
5
200
5
. = = = =
N
s
est
y
M
o
05 . 96 . 1 83 . 2 ; 83 . 2
35 .
) 10 11 (
11 < > =

= = p z y
Difference Between Means (2)
We can estimate the standard error of
the difference between means.

For large samples, can use z
2
2
2
1
. . .
M M diff
est est est o o o + =
diff
est
y y
diff
z
o

2 1 2 1
) (
=
3 ; 100 ; 12
2 ; 100 ; 10
0 : ; 0 :
2 2 2
1 1 1
2 1 1 2 1 0
= = =
= = =
= =
SD N y
SD N y
H H
36 .
100
13
100
9
100
4
. = = + =
diff
est o
05 . ; 56 . 5
36 .
2
36 .
0 ) 12 10 (
< =

=

= p z
diff
Independent Samples t (2)
(

+
+
+
=
2 1
2 1
2 1
2
2 2
2
1 1
2
) 1 ( ) 1 (
.
N N
N N
N N
s N s N
est
diff
o
diff
est
y y
diff
t
o

2 1 2 1
) (
=
7 ; 83 . 5 ; 20
5 ; 7 ; 18
0 : ; 0 :
2
2
2 2
1
2
1 1
2 1 1 2 1 0
= = =
= = =
= =
N s y
N s y
H H
47 . 1
35
12
2 7 5
) 83 . 5 ( 6 ) 7 ( 4
. =
(

+
+
=
diff
est o
. . ; 36 . 1
47 . 1
2
47 . 1
0 ) 20 18 (
s n t
diff
=

=

=
t
crit
= t(.05,10)=2.23
Confidence Intervals in t
With a small sample size, we compute the same numbers
as we did for z, but we compare them to the t distribution
instead of the z distribution.
25 ; 5 ; 10 : ; 10 :
1 0
= = = = N s H H
y

1
25
5
. = = =
N
s
est
y
M
o
1
1
) 10 11 (
11 =

= = t y
064 . 2 ) 24 , 05 (. = t
1<2.064, n.s.
Interval =
] 064 . 13 , 936 . 8 [ ) 1 ( 064 . 2 11

M
t y o
Interval is about 9 to 13 and contains 10, so n.s.
(c.f. z=1.96)
Rejection Regions (1)
1-tailed vs. 2-tailed tests.
The alternative hypothesis tells the tale
(determines the tails).
If
100 :
0
= H
100 :
1
= H
Nondirectional; 2-tails
100 :
1
> H 100 :
1
< H Directional; 1 tail
(need to adjust null for
these to be LE or GE).
In practice, most tests are two-tailed. When you see
a 1-tailed test, its usually because it wouldnt be
significant otherwise.
Rejection Regions (2)
1-tailed tests have better power on the
hypothesized side.
1-tailed tests have worse power on the
non-hypothesized side.
When in doubt, use the 2-tailed test.
It it legitimate but unconventional to use
the 1-tailed test.

You might also like