You are on page 1of 15

38

3.1 Subjects of the Study


3.1.1 Population
The tenth year students of SMAN 1 Cirebon in the academic year
2011/2012 were taken as population. In agreement with Muijs (2004: 15) defines
population is the group of people we want to generalize to, the writer needs to be
clear on what its population is. The total number of population was 280 students
divided into 9 classes. It means that each class had around 30-32 students. The
reason for choosing the tenth year students was because it was done to the fact
that procedure text was taught for the tenth year students and SMAN 1 Cirebon
has been applying genre-based approach to English language teaching. Moreover,
SMAN 1 is a pioneering international-standard schools are now integrated in the
learning technology in the classroom. Therefore, the writer decided to choose this
school based on this consideration.
3.1.2 Sample
Sometimes, the number of population is too big so it is out of the reach. In
this case, the research is conducted to the part of the whole population. This part
must have the characteristics that represent the whole population. According to
Fraenkel and Wallen (2005: 92) a sample in a research study is the group on
which information is obtained.

According to Fraenkel and Wallen (2005: 92) a sample in a research study is the

group on which information is obtained.

In selecting sample, the writer used probability / random sampling. There are

several types of probability samples; one of them is simple random samples. In short,

a selected sample selection technique is to use simple random sampling. According to

Cohen, Manion, and Morrison (2007: 110) states,

In simple random sampling, each member of the population under


study has an equal chance of being selected and the probability of
a member of the population being selected is unaffected by the
selection of other members of the population, i.e. each selection is
entirely independent of the next.
39

This study used one class as the sample. The sample was divided into two

groups consisting of 5 students in experimental group and 5 students in control

group. The simple random samples that purposefully chosen due to both classes are

the class which has similar schedule in English lesson compared to the other class as

Mackey and M. Gass (2005) states: “it refers to a sample that has been selected in

such a way that each member of a population has an equal chance of being selected”.

The reasons why the writer chose these one class which ten students were

because the averages of the English scores were not significantly different, and the

same English teacher taught. The writer did lottery to choose which one would be

experiment class and control class.

3.2 Variable

In order to carry out any sort of measurement, we need to think about variables; that

is, characteristics that vary from person to person, text to text, or object to object.

Simply put, variables are features or qualities that change (Mackey and M. Gass,

2005: 101). There are two main variable types:

Independent and Dependent.

1) Independent variable
40

The independent variable is the one that we believe may “cause” the results

(Mackey and M. Gass, 2005: 103). Based on the definition, the independent

variable of this study is the use of VoiceThread in teaching Procedure text.

2) Dependent variable

The dependent variable is the one we measure to see the effects the independent

variable has on it (Mackey and M. Gass, 2005: 103). Based on the definition, the

dependent of this study is students’ achievement. The students’ score of

understanding the Procedure text test indicates the students’ achievement.

3.3 Research Design

In this research the writer used true exprimental design, especially the pretest-

posttest equivalent groups design.

The pretest-posttest groups design figures:

R O1 X O2

R O3 C O4

X gain = O2-O1 O1 O3 = pretest X = experiment

C gain = O4-O3 O2 O4 = posttest C = control


41

This design is one of the most effective in minimizing the threats to

experimental validity. Pretests are administered before the application of the

experimental and control treatments and posttests at the end of the treatment period.

The experimental group will receive the ‘treatment’, the control group will not. Both

groups will receive a pre-test on whatever instrument is used to assess the effect of

the experiment (e.g. a test) before the treatment is given, and a posttest, usually on the

same instrument, after the treatment has been given (Muijs,2004: 18).

Based on this statement, the writer gave the pretest to both groups. The writer

gave a treatment, i.e. giving VoiceThread tutorial in understanding the Procedure text

twice a week, to the experiment group as an alternative way of teaching Procedure

text. Their own English teacher in case is replaced by the writer temporarily taught

the control group by using ordinary exercise as usual in written text.

3.4 Instrument

Fraenkel and Wallen (2005: 112) states that an instrument is the device (such

as a pencil-and-paper test, a questionnaire, or a rating scale) the researcher uses to

collect data. Before collecting the data, the writer made instrument such as a test and

a questionnaire.

1. Test
42

According to Arikunto (2006: 150), “Tes adalah serentetan pertanyaan

atau latihan serta alat lain yang digunakan untuk mengukur keterampilan,

pengetahuan intelegensi, kemampuan atau bakat yang dimiliki oleh individu atau

kelompok”. The forms of tests used in this study is a written test about the form of

essays (description) as much as 7 (seven) about the trial, which will be used to pre-

test and post-test. On the pretest aimed to measure the ability of neither the early

experimental group nor the control group. While on the post-test performed to

determine the learning outcomes of students in understanding the procedure text in

both groups after getting a different treatment. These results are to determine if there

is its influence on the learning outcomes of the procedure text using VoiceThread in

each class and find out which one is better.

To get a good research instrument, previously tested in advance about tests in

the class who had received the material and are outside the study sample. Instrument

testing was conducted with the aim to determine whether or not fulfilled about the

validity of the test, reliability of the test, discriminating power, and difficulty level.

2. Questionnaire

Brown (2001: 6) defined questionnaires (a subset of survey research as “any

written instruments that present respondents with a series of questions or statements

to which they are to react either by writing out their answers or selecting them among

existing answer”. After doing the post-test, the writer distributed questionnaire in the

last activity. The purpose was to answer the questions about the students’
43

achievement in receiving the material from the writer. A questionnaire that would be

used in this research was consist of ten statement on the subject of implementation

the VoiceThread’s use, to find out students’ attitude after following the learning

process using VoiceThread as a medium in understanding the Procedure text.

The questionnaire used in accordance with the Likert scale consisting two kinds of

statements that are favorable and unfavorable. According to Swetnam (2004: 61),

Likert scale assessment is:

1. Favorable; strongly agree= 4, agree= 3, disagree= 2, strongly disagree= 1

2. Unfavorable; strongly disagree= 4, disagree= 3, agree= 2, strongly agree=1


44

3.5 Condition of the Test

3.5.1 Validity of the Test

Validity refers to the appropriateness, meaningfulness, correctness, and

usefulness of the inferences a researcher makes. Validity is the most important idea to

consider when preparing or selecting an instrument for use (Fraenkel and Wallen,

2005: 150). In addition, according to Muijs (2004: 66), validity is probably the single

most important aspect of the design of any measurement instrument in educational

research. Validity is a standard criterion that shows whether the instrument is valid or

not.

The writer used empirical validity since this research deals with statistical. To

calculate the empirical validity of each item, the writer used Pearson Product Moment

proposed by Arikunto (2006: 170). The formula is:

Where :

rxy : the coeffecient of correlation between X and Y variable or

validity of each item

N : the number of students/subject participating in the test


45

X : the sum of X scores

Y : the sum of Y scores

ΣY : the sum of total score for each student

ΣX : the sum of score in each item

ΣXY : the sum of multiple score from each student with the total score

in each item

ΣX2 : the sum of the square score in each item

ΣY2 : the sum of square total score from each student

Because the result of the instrument is higher than the critical value, it is

considered that the instrument is valid. The computation of validity can be seen in

appendix B

Table 3.1
Classification of Validity Coefficient
No. rxy Scores Interpretation
1. 0.80 < rxy ≤ 1.00 Very High
2. 0.60 < rxy ≤ 0.80 High
3. 0.40 < rxy ≤ 0.60 Medium
4. 0.20 < rxy ≤ 0.40 Low
5. 0.00 < rxy ≤ 0.20 Very Low
6. rxy ≤ 0.00 Invalid
Source: Jihad & Haris (2009: 180)
46

Then to test the meaning of validity (correlation coefficient) about an essay is

used the t test statistic proposed by Sudjana (2005: 377) that,

Where: t = different power

When tcount > ttable then the question is valid, but when tcount ≤ ttable then the question is

invalid and not used for research instrument.

3.5.2 Reliability of the Test

Reliability in quantitative research is essentially a synonym for


dependability, consistency and replicability over time, over
instruments and over groups of respondents. It is concerned with
precision and accuracy; some features, e.g. height, can be measured
precisely, while others, e.g. musical ability, cannot.

(Cohen, Manion, and Morrison, 2007: 146)

Reliability shows whether the instrument is reliable and can be used as a

device to collect the data. Reliability means the stability of test scores when the test is

used. To measure the reliability of the test, the writer used the formula:
47

where:

r11 : index reliability

k : number of item

αb2 : item variance

αt2 : total variance

To find out the variance of each item, the formula is :

Then to find out the total variance, the formula is:

Because the result of the instrument is higher than the critical value, it is

considered that the instrument is reliable. The computation of reliability can be seen

in appendix B.
48

Table 3.2

Classification of Reliability Coefficient

No. r11 Scores Interpretation


1. r11 ≤ 0.20 Very Low
2. 0.20 ≤ r11 ≤ 0.40 Low
3. 0.40 ≤ r11 ≤ 0.60 Medium
4. 0.60 ≤ r11 ≤ 0.80 High
5. 0.80 ≤ r11 ≤ 1.00 Very High
Source: Jihad & Haris (2009: 181)

3.5.3 The Difficulty Level


Analysis of difficulty level is intended to select the items which have good

level of difficulty. An item considered having good level of difficulty if it is not too

difficult or too easy. To calculate the value of the level of difficulty, the following

formula according to Jihad and Haris (2009: 188) was used:

where:

IK : the difficulties of the index

SA : the total scores achieved by students of upper group


49

SB : the total scores achieved by students of lower group

n : number of students from the upper and lower groups

Max. : the maximum scores

If the index difficulty is high, an item considered easy, on the other hand, an

item is considered difficult if it is low. The computation of the difficulty level can be

seen in appendix B.

Table 3.3
Classification of Difficulty Level Coefficient
No. Difficulty Level (IK) Scores Interpretation
1. IK = 0.00 Very Difficult
2. 0.00 < IK ≤ 0.30 Difficult
3. 0.30 < IK ≤ 0.70 Medium
4. 0.70 < IK ≤ 1.00 Easy
5. IK = 1.00 Very Easy
Source: Jihad & Haris (2009)

3.5.4 Discriminating Power


Discriminating power from a question stated how far of the question ability is
able to distinguish between testy who know the answers properly with testy who
cannot answer that question (the answer wrong testy). To calculate the discriminating
power about the form of description used the formula according to Jihad and Haris
(2009: 189) is:

𝑆𝐴 −𝑆𝐵
DP = 1 where :
−𝑥 𝑛 𝑥 𝑀𝐴𝑋
2
50

DP : Discriminating Power
SA : the total scores achieved by students of upper group
SB : the total scores achieved by students of lower group
n : number of students from the upper and lower groups
Max. : the maximum scores

Table 3.4
Classification of Discriminating Power Coefficient
No. Discriminating Power (DP) Scores Interpretation
1. DP ≤ 0.00 Very Poor
2. 0.00 < DP ≤ 0.20 Poor
3. 0.20 < DP ≤ 0.40 Fair
4. 0.40 < DP ≤ 0.70 Good
5. 0.70 < DP ≤ 1.00 Very Good
Source: Jihad & Haris (2009)

3.6 Method of Data Analysis


After collecting the data, the writer analyzed them by using statistical

analysis. The writer transcribed the students’ test on paper and then she scored the

result of test.

To differentiate whether the students’ result of understanding the Procedure

text using VoiceThread and without using VoiceThread is significant or not the writer

used the t-test formula as follows:

𝑀𝑒 −𝑀𝑐
t=
𝑆𝑠𝑒+𝑆𝑠𝑐 1 1
√( )( + )
𝑁𝑒+𝑁𝑐−2 𝑁𝑒 𝑁𝑐
51

where:

t : t-test

Me : the mean difference of the experiment group

Mc : the mean difference of control group

Sse : sum of quadrate deviation of the experiment group

Ssc : sum of quadrate deviation of the control group

Ne : the number of experiment group

Nc : the number of control group

Before calculating the t-test the writer calculated mean, variance, and standard

deviation.

3.6.1 Mean

Mean is computed by using the formula:

where:

: mean
𝑋

ΣX : sum of scores

N : number of scores
52

3.6.2 Variance

The variance of a group of scores is computed by using the following

formula:

where:

S2 : variance

X : individual scores

N : number of scores in the group

3.6.3 Standard Deviation

The standard deviation of a group of scores is computed by using the following

formula:

where:

S : standard deviation

X : individual scores

N : number of scores in group

You might also like