Professional Documents
Culture Documents
_________ RELIABILITY?
HOW DO WE DETERMINE
ESTIMATE
S A B C D E F G H I J X̅ σ
DAY 1 18 16 5 13 15 12 12 5 8 10 11.8 4.42
(X1)
𝑥1 = scores in one set 𝑥2 = scores in other set σ 𝑥1 𝑥2 = sum of each 𝑥1 score times its corresponding 𝑥2 score
𝑥ҧ1 = mean of the X scores 𝑥ҧ 2 = mean of the Y scores N = 10 (number of people in the group)
PROs
Easier to develop (unlike alternate-forms/parallel forms)
CONs
Maturation
Extraneous variables during the time interval
Carry over effects
DIFFERENCES:
PARALLEL FORMS
- Each form of the test, the means and the variances of observed test scores are equal
- Means correlate equally to the true score
- Correlate equally with other measures
ALTERNATE FORMS
- Simply different versions of a test that have been constructed so as to be parallel
- Do no meet the requirements for the legitimate designation parallel
- Typically designed to be equivalent with respect to variables such as content and level
of difficulty SARMIENTO, MICAH NICOLE V. 3PSY7
ALTERNATE-FORM RELIABILITY/PARALLEL-FORM RELIABILITY
DIFFERENCES:
PARALLEL FORMS
- Refers to an estimate of the extent to which item sampling and other
errors have affected test scores on versions of the same test when, for
each for of the test, the means and variances of observed test scores are equal
ALTERNATE FORMS
- Refers to an estimate of the extent to which these different forms of the
same test have been affected item sampling error, or other error
SIMILARITIES:
Used to assess the consistency of the results of two tests constructed in the
same way from the same content domain (similar form, content, and level
of difficulty)
To determine whether scores will generalize across different sets of items or
tasks
The two forms of the test are correlated to yield a coefficient of
equivalence
Determines how comparable are two different versions of the same measure.
KEYWORDS: TWO TESTS, EQUIVALENT
SARMIENTO, MICAH NICOLE V. 3PSY7
ALTERNATE-FORM RELIABILITY/PARALLEL-FORM RELIABILITY
S 1 2 3 4 5 6 7 8 9 10 X̅ σ
FORM 18 16 5 13 15 12 12 5 8 10 11.8 4.42
A (XA)
PROs
Solves the problem of carry over effects
CONs
Time consuming (to develop)
Difficult to develop
PROCEDURE:
Develop/use a single test
1. Divide the test into equivalent halves
2. Calculate a Pearson r between scores on the two halves of the
test
3. Adjust the half-test reliability using the Spearman-Brown
formula
SARMIENTO, MICAH NICOLE V. 3PSY7
SPLIT-HALF RELIABILITY
𝑛𝑟𝑥𝑥
𝑟𝑆𝐵 =
1 + 𝑛 − 1 𝑟𝑥𝑥
Spearman-Brown formula
2𝑟ℎℎ
𝑟𝑆𝐵 =
1 + 𝑟ℎℎ
SARMIENTO, MICAH NICOLE V. 3PSY7
SPLIT-HALF RELIABILITY
S 1 2 3 4 5 6 7 8 9 10 X̅ σ
FORM 18 16 5 13 15 16 12 5 8 10 11.8 4.42
A (XA)
𝑛𝑟𝑥𝑥
𝑟𝑆𝐵 =
1 + 𝑛 − 1 𝑟𝑥𝑥
Spearman-Brown formula
2𝑟ℎℎ
𝑟𝑆𝐵 =
1 + 𝑟ℎℎ
SARMIENTO, MICAH NICOLE V. 3PSY7
SPLIT-HALF RELIABILITY
Spearman-Brown formula
2𝑟ℎℎ
𝑟𝑆𝐵 =
1 + 𝑟ℎℎ
2𝑟ℎℎ
𝑟𝑆𝐵 =
Spearman-Brown formula
1 + 𝑟ℎℎ
2(.73)
𝑟𝑆𝐵 =
1 + (.73)
𝑟𝑆𝐵 = 0.846382
SARMIENTO, MICAH NICOLE V. 3PSY7
ESTIMATING RELIABILITY USING TWO SETS OF SCORES
𝑘 𝜎𝑖 2
𝑟𝑥𝑥 =𝛼= 1− 2
𝑘−1 𝜎 𝑥
Participants
A 3 4 4 3 5 19
B 4 3 4 3 3 17
C 2 3 3 2 3 13
D 4 4 5 3 4 20
E 3 2 4 3 3 15
F 3 2 3 2 3 13
𝑘 𝜎𝑖 2
𝑟𝑥𝑥 = 𝛼 = 1− 2
𝑘−1 𝜎 𝑥
5 2.4166
𝑟𝑥𝑥 = 𝛼 = (1 − )
4 7.4722
α = .85
A 3 4 2 4 1 2 3 2 4 2 27
B 2 1 2 4 2 1 2 3 1 3 21
C 4 3 1 2 4 5 1 4 2 3 29
D 1 3 2 3 3 4 2 5 3 4 30
E 5 4 3 5 1 2 2 2 2 2 28
F 2 5 4 5 2 3 2 3 3 5 34
G 3 2 4 3 2 5 2 1 2 5 29
H 4 1 2 2 1 1 2 4 1 1 19
I 2 3 3 1 3 3 2 5 3 3 28
J 1 5 2 2 3 2 2 2 2 2 23
mean= 2.7 3.1 2.5 3.1 2.2 2.8 2 3.1 2.3 3 26.8
total var 20.4
〖𝜎𝑖〗item var1.788889 2.1 0.944444 1.877778 1.066667 2.177778 0.222222 1.877778 0.9 1.777778 ∑ var i= 14.73333
CRONBACH’S COEFFICIENT ALPHA
𝑘 𝜎𝑖 2
𝑟𝑥𝑥 = 𝛼 = 1− 2
𝑘−1 𝜎 𝑥
10 14.73
𝑟𝑥𝑥 = 𝛼 = (1 − )
9 20.4
α = .3088
Kuder-Richardson Formula 20
𝑘 𝜎 2 𝑥 − σ 𝑝𝑖 𝑞𝑖
𝑟𝑥𝑥 = 𝐾𝑅 − 20 =
𝑘−1 𝜎 2𝑥
𝑘 = number of test questions
𝜎 2 𝑥 = the test variance
𝑝𝑖 = the proportion of test takers answering an item correctly (p)
𝑞𝑖 = the proportion of test takers answering an item incorrectly (1-p)
σ 𝑝𝑖 𝑞𝑖 = the sum of each item’s p value times its corresponding q value
Question 1 2 3 4 5 6 Total
Score (𝑥)ҧ
A Y Y Y Y N Y 5 𝑥ҧ = 2.8
B Y N N Y N Y 3 𝜎 2 = 1.76
C Y N N Y N N 2 𝜎 =1.3266
D N Y N N N N 1 𝑘=6
E N Y N N Y Y 3 N=5
𝑝𝑖 .6 .6 .2 .6 .2 .6 𝑥ҧ = 2.8
𝑞𝑖 .4 .4 .8 .4 .8 .4
(p)(q) .24 .24 .16 .24 .16 .24 σ 𝑝𝑖 𝑞𝑖 =
1.28
Kuder-Richardson Formula 20
𝑘 𝜎 2 𝑥 − σ 𝑝𝑖 𝑞𝑖
𝑟𝑥𝑥 = 𝐾𝑅 − 20 =
𝑘−1 𝜎 2𝑥
6 (1.76)−1.28
rxx = KR − 20 = = .3273 = .33
6−1 1.76
Kuder-Richardson Formula 21
𝑥(𝑘
ҧ − 𝑥)ҧ
𝑟𝑥𝑥 = 𝐾𝑅 − 21 = 1 −
𝑘 (𝜎 2 )
𝑘 = number of test questions
𝑥ҧ = the mean number of questions correct
𝜎 2 = total variance
Person A B C D E F G H I J 𝑥̅ 𝜎
Grade 8 6 2 3 7 6 2 3 8 6 5.1 2.2561
K=8
Kuder-Richardson Formula 21
𝑥ҧ 𝑘 − 𝑥ҧ
𝑟𝑥𝑥 = 𝐾𝑅 − 21 = 1 −
𝑘 (𝜎 2 )
5.1 8−5.1
rxx = KR − 21 = 1 − = .6367 = .64
8(5.09)
SARMIENTO, MICAH NICOLE V. 3PSY7
OTHER INTER-ITEM CORRELATIONS OR INTERNAL
CONSISTENCY/HOMOGENEITY CORRELATIONS
Absolute differences
Between 1 & 2 1
Between 1 & 3 2
Between 2 & 3 1
https://www.ijme.net/archive/2/cronbachs-alpha.pdf
http://www.proftesting.com/test_topics/pdfs/test_quality_reliability.pdf
https://testing.wisc.edu/Reliability.pdf
https://web.stanford.edu/dept/SUSE/SEAL/Reports_Papers/methods_papers/G%20Theory%20
Hdbk%20of%20Statistics.pdf
https://www.tutorialspoint.com/statistics/reliability_coefficient.htm
http://tx.liberal.ntu.edu.tw/~purplewoo/Literature/!DataAnalysis/Reliability%20Analysis.htm