Professional Documents
Culture Documents
10 Test Scores
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1. Differentiate between descriptive and inferential statistics;
2. Calculate various central tendency measures;
3. Explain the use of standard scores;
4. Calculate Z score and T score;
5. Describe the characteristics of the normal curve; and
6. Explain the role of norms in standardised tests.
INTRODUCTION
Do you know that all the data you have collected on the performance of
students have to be analysed? In this topic, we will focus on the analysis and
interpretation of the data you have collected about the knowledge, skills and
attitudes of your students. Information you have collected about your students
can be analysed and interpreted quantitatively and qualitatively. For the
quantitative analysis of data, various statistical tools are used. For example,
statistics are used to show the distribution of scores on a geography test and the
average score obtained by a group of students.
Even the use of percentages may not be meaningful. For example, getting
64 percent in the test may be considered good if the test was a difficult one. On
the other hand, if the test was easy, then 64 percent may be considered to be only
average. In other words, to get a more accurate picture of the scores obtained
by students on the test, the teacher should:
(a) Find out which student obtained the highest marks in the class and the
number of questions correctly answered;
(b) Find out which student obtained the lowest marks in the class and the number
of questions correctly answered; and
(c) Find out the number of questions correctly answered by all students in the
class.
This illustrates that the marks obtained by students in a test should be carefully
examined. It is not enough to just report the marks obtained. More information
should be given about the marks obtained, and to do this you have to use
statistics. Some teachers may be afraid of statistics while others may regard it as
too time-consuming. In fact, many of us often use statistics without being aware
of it. For example, when we talk about average rainfall, per capita income,
interest rates and percentage increase in our daily lives, we are using the
language of statistics. What is statistics?
(a) Mean
The mean is the most commonly used measure of central tendency. When
we talk about an average, we usually refer to the mean. The mean is
simply the sum of all the values (marks) divided by the total number of
items (students) in the set. The result is referred to as the arithmetic mean.
Using the data from Figure 10.1 and applying the formula below, you can
calculate the mean.
Mean
x 35 40 41 ............... 75 1863 53.23
N 35 35
(b) Median
The median is determined by sorting the score obtained from lowest to
highest values and taking the score that is in the middle of the sequence.
For the example in Figure 10.1, the median is 52. There are 17 students with
scores less than 52 and 17 students whose scores are greater than 52. If
there is an even number of students, there will not be a single point at the
middle. So, you calculate the median by taking the mean of the two middle
points i.e. divide the sum of the two scores by 2.
(c) Mode
The mode is the most frequently occurring score in the data set. Which
number appears most often in your data set? In Figure 10.1, the mode is 57
because seven students obtained that score. However, you can also have
more than one mode. If you have two modes, it is bimodal.
Figure 10.2: Graph showing the distribution of Bahasa Malaysia test scores
SELF-CHECK 10.1
10.2.2 Dispersion
Although the mean tells us about the groups average performance, it does not
tell us how close to the average or mean the students scored. For example, did
every student score 80 percent on the test or were the scores spread out from 0
to 100 percent? Dispersion is the distribution of the scores, and among the
measures used to describe spread are range and standard deviation.
(a) Range
The range of scores in a test refers to the lowest and highest scores obtained
in the test. The range is the distance between the extremes of a distribution.
x x
2
Marks x x x
35 35 39 = 4 (4)2 = 16
39 39 39 = 0 (0)2 = 0
45 45 39 = 6 (6)2 = 36
40 40 39 = 1 (1)2 = 1
32 32 39 = 7 (7)2 = 49
42 42 39 = 3 (3)2 = 9
37 37 39 = 2 (2)2 = 4
44 44 39 = 5 (5)2 = 25
36 36 39 = 3 (3)2 = 9
41 41 39 = 2 (2)2 = 4
Sum = 390
Mean ( x ) = 39 ( x x ) = 153
N = 10
x x 2 153
Standard Deviation N 1
9
17
4.12
(i) The first step is to find the mean, which is 390 divided by 10 = 39;
(ii) Next is to subtract the mean from each score in the column labelled
x x . Note that all numbers in this column are positive. The squared
differences are then summed and the square root calculated; and
(iii) The standard deviation is the positive square root of 153 divided by 9
and the result is 4.12.
Note that the smaller the standard deviations, the greater the scores tend to
bunch around the mean and vice-versa. Hence, it is not enough to just
examine the mean alone because the standard deviation tells us a lot about
the spread of the scores around the mean. Which class do you think
performed better? The mean does not tell us which class performed better.
Class C performed the best because approximately two-thirds of the
students scored between 38 and 40.
SELF-CHECK 10.2
Refer to Figure 10.5 which shows the distribution of the scores obtained by
38 students on a biology test. There is a positive skew because it has a
longer tail in the positive direction. What does it mean? It means that more
students were getting low scores in the biology test which indicates that the
test was too difficult. Alternatively, it could mean that the questions were
not clear or the teaching methods and materials did not bring about the
desired learning outcomes.
ACTIVITY 10.1
A teacher administered an English test to 10 students in her class. The
students earned the following marks: 14, 28, 48, 52, 77, 63, 84, 87, 90 and
98. For the distribution of marks, find the following:
(a) Mean;
(b) Median;
With these raw scores alone, what can you say about Zulindas performance on
these tests or her standing in the class? Actually, you cannot say very much.
Without knowing how these raw scores compare to the total distribution of raw
scores for each subject, it is difficult to draw any meaningful conclusions
regarding her relative performance in each of these tests.
Assume that the scores of all three tests are approximately normally distributed.
The mean and standard deviation of the three tests are as shown in Table 10.2.
Table 10.2: Mean and Standard Deviation for the Three Tests
Based on this additional information, what statements can you make regarding
Zulindas relative performance on each of these three tests? The following are
some conclusions you can make:
(a) Zulinda did best on the history test and her raw score of 72 falls at a point
one standard deviation above the mean;
(b) Her next best score is English and her raw score of 40 falls exactly on the
mean of the distribution of the scores; and
(c) Finally, even though her raw score for science was 80, it falls one standard
deviation below the mean.
Converting Zulindas raw scores into z scores, we can say that she achieved a:
10.3.1 Z Score
What is a z score? How do you calculate the z score? A z score is a type of
standard score. The term standard score is the general name for converting a
raw score to another scale using a predetermined mean and a predetermined
standard deviation. Z scores tell how many standard deviations away from the
mean the score is located. Z scores can be positive or negative. A positive z score
indicates that the value is above the mean while a negative z score indicates that
the value is below the mean. A z score is a raw score that has been transformed
or converted to a scale with a predetermined mean of 0 and a predetermined
standard deviation of 1. A z score of 6 means that the score is 6 standard
deviations below the mean.
The formula used for transforming a raw score into a z score involves subtracting
the mean from the raw score and dividing it by the standard deviation.
x x
z
SD
x x 52 70 18
z 2.4
SD 7.5 7.5
The z score calculated for the raw score of 52 is 2.4 which means that Kumars
score for the geography test is located 2.4 standard deviations below the mean.
Test 1 Test 2
Seng Huat 30 50
Mei Ling 45 35
Mean 42 47
Standard Deviation 7 8
The teacher could use the mean to determine who was better. But, both students
have the same mean. How does the teacher decide? Using a z score can tell the
teacher how far from the mean are the scores of the two students and thus who
performed better. Using the formula, the teacher calculates the z score shown
below:
Upon examination of the table above, the teacher finds that both Seng Huat and
Mei Ling have negative z scores for the total of both tests. However, Mei Ling has
a higher total z score (1.07) compared with Seng Huats total z score (1.34). In
other words, Mei Lings total score was closer to the mean and therefore the
teacher concludes that Mei Ling did better than Seng Huat.
Z scores are relatively simple to use but many educators are reluctant to use it,
especially when test scores are reported as negative numbers. How would you
like to have your mathematics score reported as 4? For these reasons, alternative
standard score methods are used such as the T score.
10.3.3 T Score
The T score was developed by W. McCall in the 1920s and is one of the many
standard scores currently used. T scores are widely used in psychology and
education, especially when reporting performance in standardised tests. The T
score is a standardised score with a mean of 50 and a standard deviation of 10.
The formula for calculating the T score is:
T = 10(z) + 50
For example, a student has a z score of 1.0 and after converting it to T score, you
get the following:
When converting z scores to T scores, you should be careful not to drop the
negatives. Dropping the negatives will result in a completely different score.
ACTIVITY 10.2
Convert the following z scores to T scores.
Z score T score
+1.0
2.4
+1.8
Why would you use T scores rather than z scores when reporting the
performance of students in the classroom?
Note that in Figure 10.6, z scores are indicated from +1 to +4 and 1 to 4 with
the mean as 0. Each interval is equal to one standard deviation. Similarly, T
scores are reported from 10 to 90 (interval of 10) with the mean set at 50. Each
interval of 10 is equal to one standard deviation.
10.5 NORMS
In norm-referenced assessment, an individuals performance is evaluated in
relation to other peoples performances. Norm-referenced tests are seldom used
in Malaysia but in the United States, standardised tests are widely used. Perhaps,
because of the decentralised education system in the United States, school-based
assessment is extensively practised. There are no national examinations like the
PMR or SPM in Malaysia. Hence, teachers in the United States who want to find
out how their students are performing compared with other students in the
country rely on norm-referenced tests to compare the performances of their
students with the performances of other students in the norm group.
Representativeness: When you compare your students with the rest of the
population, you want to ensure that the norm sample is representative. In other
words, the individuals tested in the norm sample must consist of the appropriate
age group, taking into consideration gender differences, geographic location and
cultural differences. For example, the eight-year-olds selected for the norm
sample should reflect eight-year-olds in the country according to gender (male
and female), geographic location (urban or rural) and cultural differences. For
example, the norm sample consists of 3,000 Malaysian primary school children
with 500 students for each age group (seven-year-olds = 500 students, eight-year-
olds = 500 and so forth). The norm sample should consist of children in all the
states of Malaysia, include all the ethnic groups in the country and be drawn
Score Percentile
50 96
49 90
48 84
47 78
46 70
47 66
46 58
45 50
44 45
Percentile ranks (percentiles) are used in standardised tests which allow teachers
to compare the performance of their students with the norm group. An eight-
year-old student who obtained a score of 48 on the test has a percentile rank of
84. It means that the student is reading at a level as well as, or better than,
84 percent of other eight-year-old students in the test. Similarly, an eight-year-old
who obtains a percentile rank of 45 is reading as well as, or better than,
45 percent of eight-year-olds in the norm sample.
To use norms effectively, you should be sure that the norm sample is appropriate
both for the purpose of testing and for the person being tested. If you recognise
that the test norms are inadequate, you should be cautious because you
may obtain misleading information about the abilities of your students. The
organisation responsible for developing the norms should clearly state the
groups tested because you want to ensure that the norm sample is similar to your
students. In other words, the norm sample should consist of the same type of
people in the same proportion as is found in the population of reference. The
norm sample should be large enough to be stable over time.
SELF-CHECK 10.3
1. List some characteristics of the normal curve.
The term central tendency refers to the middle value and is measured
using the mean, median and mode. It is an indication of the location of scores.
The mean is simply the sum of all the values (marks) divided by the total
number of items (students) in the set.
The range of scores in a test refers to the lowest and highest scores obtained
in the test.
A negative skew has a longer tail in the negative direction. A positive skew
has a longer tail in the positive direction.
The standard score refers to a raw score that has been converted from one
scale to another scale using the mean and standard deviation.
Z score tells how many standard deviations away from the mean the score is
located.
The normal curve (also called the bell curve) is a hypothetical curve that is
supposed to represent all natural occurring phenomena.