Professional Documents
Culture Documents
Note that the failure to reject does not mean the null hypothesis is
true. There is no formal outcome that says “accept “. It only means we
do not have sufficient evidence to support .
Note will ALWAYS have an equal sign (and possibly a less than or greater
than symbol, depending on the alternative hypothesis). The alternative
hypothesis has a range of values that are alternatives to the one in .
Illustration:
: defendant is innocent;
: defendant is guilty.
Notes: Right – tailed and left – tailed tests are distinguished by the way the greater
than or less than points. It is the direction where alternative places the true mean.
1. A researcher thinks that if expectant mothers use vitamins, the birth weight of
the babies will increase. The average birth weight of the population is 8.6 pounds.
Statistical test – uses the date obtained from a sample to make a decision about
whether the null hypothesis should be rejected.
Test Value (test statistic) – the numerical value obtained from a statistical test.
Two types of Errors:
is True Is False
Reject Type I error Correct decision
Do not reject Correct decision Type II error
Critical or Rejection Region – the range of values for the test value that indicate
a significant difference that the null hypothesis should be rejected.
Non – critical or Non – rejection Region – the range of values for the test value
that indicates that the difference was probably due to no chance and that the
null hypothesis should be rejected.
Critical Value (CV) – separates the critical region from the non – critical region,
i.e., when we should reject from when we should not reject .
The location of the critical value depends on the inequality sign of the
alternative hypothesis.
Depending on the distribution of the test value, you will use different
tables to fine critical value.
One – tailed test – indicates that the null hypothesis should be rejected when the
test value is in the critical region on one side.
Left – tailed test – when the critical region is on the left side of the distribution
of the test value.
Right – tailed test – when the critical region is on the right side of the
distribution of the test value.
Two – tailed test – the null hypothesis should be rejected when the test value is in
either of two critical regions on either side of the distribution of the test value.
Step 5: Make the decision (reject or not reject the null hypothesis).
Example 1. Lodi Country High School seniors have an average NAT score of 1,020.
From a random sample of 144 Lodi High School students we find the average NAT
score to be 1,100 with a standard deviation of 12.4. We want to know if these high
school students are representative of the overall population. What are our
hypothesis?
2. A recent survey of college campuses across Batangas claims that students
spend an average of 2.7 hours a day using their cellphones. A random sample of
35 BatState-U students showed an average use of 2.9 hours a day, with a standard
deviation of 0.4 hours. Do BatState-U college students use their cellphones more
than the typical Batangas college student?
3. A package of gum claims that the flavor lasts more than 39 minutes. What
would be the null hypothesis of a test to determine the validity of the claim? What
sort of test is this?
4. An ice pack claims to stay cold between 35 and 65 minutes. What would be
the null hypothesis of a test to determine the validity of the claim? What sort of
test would it be?
5. Mrs. Dudley is a grade 9 English teacher who is marking 2 papers that are
strikingly similar. She is concerned that one of her students is cheating, but she is
not sure which one of the two is guilty. Mrs. Dudley meets with the two students
(Laura and Greg) who have similar papers, and suspects that Greg is probably
the one who is guilty. She decides that she must deal with the situation in the same
way as in a justice trial, in order to determine who is innocent and who is guilty.
Describe the type I and type II errors that may be committed in the statement
above.
5.2 TEST FOR MEAN (PowerPoint)
5. Draw a conclusion.
General Formula:
Statistic − Parameter
Test Statistic =
Standard Error
Example 1. The leader of the association of jeepney drivers claims that the
average daily take home pay of all jeepney drivers in Pasay City is Php. 400.00. A
random sample of 100 jeepney drivers in Pasay City was interviewed and the
average daily take home pay of all jeepney drivers in Pasay City is different from
Php. 400.00. Assume that the population variance is Php 92.00.
Example 2. According to a study done last year, the average monthly expenses
for cellphone loads of high school students in Batangas was Php 350.00. A statistics
student believes that this amount has increased since January of this year. Is there
a reason to believe that this amount has really increased if a random sample of
60 students has an average monthly expenses for cellphone loads of Php 380?
Use a 0.05 level of significance. Assume that the population standard deviation is
Php 77.00.
Exercise 1. The head of the Math department announced that the mean score of
Grade 9 students I the first periodic examination in Mathematics was 89 and the
standard deviation was 12. One students who believed that the mean score was
less than this, randomly selected 34 students and computed their mean score.
She obtained a mean score of 85. At 0.01 level of significance, test the student’s
belief.
Exercise 2.A company which produces batteries claims that the life expectancy
of their batteries is 90 hours. In order to test the claim, a consumer interest group
tested a random sample of 40 batteries. The test resulted to a mean life
expectancy of 87 hours. Using a 0.05 level of significance, can it be concluded
that the life expectancy of their batteries is less than 90 hours? Assume that the
population standard deviation is known to be 10 hours.
5.2 TEST FOR MEAN (Book)
Statistic − Parameter
Test Statistic =
Standard Error
When doing a hypothesis testing for a mean, the observed value refers to the
sample mean while the expected value of the population mean when the null
hypothesis is assumed to be true. The standard error of the mean is computed as
the number:
Remember that the Central Limit Theorem allows you to use the standard normal
distribution to approximate the distribution of sample means provided that n≥ 30
(that is, the sample size is large). Moreover, for large samples even when the value
of the population standard deviation 𝜎 is unknown, the standard error is
computed by using the formula:
We use the z test for a mean to conduct a statistical test for a population
mean when the population is normal and 𝜎 is known, or when the sample size
is n≥ 30. The test statistic is given by the formula
Level of
One-tailed Test Two-tailed Test
significance 𝛼
The kind of test that will certainly interest those who want to know the
percentage of the population who are in favor of a certain idea or
concept.
In testing for a Population Proportion p, use the z-test for a proportion to
conduct a statistical test for a proportion p on the assumption that np≥5
and nq≥5. The test statistic is defined by the equation: Formula:
Example:
1. Beefy burgee, a fast food restaurant claims that 85% of the burger
fanatics prefer to eat in their place. To test this claim, a random
sample of 90 burger customers are selected at random and asked
what they prefer. If 76 of the 90 burger fanatics said they prefer to
eat at Beefy Burger, what conclusion do we draw? Use a 0.005
level of significance.
2. Haus of Gaz claims that more than two-thirds of the houses in a
certain subdivision use their brand. Do we have reason to doubt
this claim if in a random sample of 40 houses in this subdivision, it is
found that 25 use the company’s brand. Use a 0.01 level of
significance.
3. A congressman is hoping that his bill is favored by his constituents
to increase his chance of getting reelected in the next election.
He asked his research office to conduct a survey on this matter to
verify whether or not he can get support from his constituents so
that if he gets a 90% support for the bill, he would certainly start
with his campaign. A random sample of 150 respondents yielded
128 who are in favor of his bill. Test the hypothesis that p=0.9
against the alternative p<0.9 at 0.05 level of significance.
6.1 CORRELATION
SCATTER PLOT – visual tool that can help us to analyze the relationship
between the two variables in a bivariate data.
The scatter plot in (A) shows a linear relationship between the variables, number
of hours of sleep x and the test scores y. It suggests that the longer hours of sleep
yields higher test scores. This type of relationship is said to be positive and linear.
Figure (B) also shows a linear relationship but it shows a negative relationship, that
is, as the number of hours of travel x increases, the corresponding speed of travel
y becomes slower. This type is a negative and linear relationship. Figure (C) does
not show linearity as you can see that the points that represent the data do not
follow a linear pattern. Instead, these points seem to follow the shape of a
quadratic curve. This kind of relationship is non-linear and yet we do not claim
that the independent and dependent variables are not related at all because
we still see a pattern from the graph. We can still describe the graph as when the
number of products sold x is less than 25, the profit y increases but when the
number of product x is more than 25, the profit y decreases. On the other hand,
(D) shows points that are just scattered in the plane and we see no pattern
showing the relationship between the independent variable (number of
hours of training) and the dependent variable (number of sales).
The correlation coefficient ranges between -1 and +1. A value close to +1 signifies
a strong positive linear relationship. A value close to -1 signifies a strong negative
linear relationship. When the variables concerned yield a coefficient value that is
close to 0 then there is no linear relationship between the variables.
Right-tailed Test:
Left-tailed Test:
Two-tailed Test:
No significant correlation
Significant correlation
Test Statistic:
Example 12. The following are test scores of 15 students in abstract reasoning x
and arithmetic y. Determine whether the two variables have a significant positive
x 100 95 78 85 74 79 89 90 100 88 85 50 54 75 65
y 80 90 80 70 55 40 50 88 50 72 100 74 30 90 88
n x y xy x² y²
1 100 80
2 95 90
3 78 80
4 85 70
5 74 55
6 79 40
7 89 50
8 90 88
9 100 50
10 88 72
11 85 100
12 50 74
13 54 30
14 75 90
15 65 88
To solve for the test-statistic:
Interpretation:
Example 18. In a Biology class, students experiment a number of cultures that are
grown in their laboratory. The number of bacteria y (in millions) and their ages x
(in days), are given below.
x 1 2 3 4 5 6 7 8
y 34 107 140 200 198 240 270 350
n x y xy x² y²
1 1 34 34 1 1,156
2 2 107 214 4 11,449
3 3 140 420 9 19,600
4 4 200 800 16 40,000
5 5 198 990 25 39,204
6 6 240 1,440 36 57,600
7 7 270 1,890 49 72,900
8 8 350 2,800 64 122,500
x = 36 y = 1,539 xy = 8,588 x² = 204 y² = 364,409
To solve for the test-statistic:
Interpretation:
Judge x 9 8 8 5 7 5
Judge y 10 8 9 6 9 10
n x y xy x² y²
1 9 10 90 81 100
2 8 8 64 64 64
3 8 9 72 64 81
4 5 6 30 25 36
5 7 9 63 49 81
6 5 10 50 25 100
42