Professional Documents
Culture Documents
11.1 Introduction
Although sampling is gaining greater acceptance in variety of fields, but still
there are some persons who are accustomed to deal with complete
enumeration and they are suspicious about the sampling results. Their
suspicious might have arisen because of the fact that in the past statistical
methods based on sampling were carelessly used or they were misused.
But the sampling methods are now much advanced as compared to the
past. Now a day, except in the census, where complete enumeration is
made, almost in all the other situations sampling methods are very
Sikkim Manipal University Page No.: 337
Probability and Statistics Unit 11
frequently used. Infact, sampling is a valuable tool for obtaining the data
quickly, accurately and, above all, cheaply.
In different fields of human activity, the decision making process is based on
the observations of few units which form a portion of the total population.
The process of studying only a portion of the population and making
decisions involves risk, the risk of making wrong decisions. This unit deals
with the various techniques of drawing samples from the population.
When sampling design is not done properly, the estimation or the inferences
drawn from the sample can go wrong and the managerial decisions taken
on the wrong conclusions may lead to loss of time, money and human
resources. This may badly affect the reputation of their organisation. Hence,
the risks involved in using the incorrect sampling design are of primary
concerns to investigators.
Objectives:
At the end of this unit, the student should be able to:
Describe the basic concept of sampling theory and types of sampling
Explain the statistical errors and its measurement
Determine the sample size
Population
Sample
he may experiment with more and more blood, causing the patient to die.
Also to test the life the electric bulbs, manufacturer surely will not burn out
all the bulbs.
In a properly design sample survey it is also possible to make a valid
estimate of error to determine before hand the accuracy or precision and
reliability of the sampling results. But in a complete census it is not possible
to know the margin of uncertainty about the conclusions drawn.
Disadvantage
Inspite of the above advantages, sample survey is not always preferred to
complete enumeration for example if information is require about each unit
of the population, no sampling method is suitable to give the desire
information. Only complete census can do this. Intact, the details of the
universe are scarified through sampling.
2 2
fX fX
S2
f f
1 1,2 1.5
2 1,3 2
3 1,4 2.5
4 1,5 3
5 2,3 2.5
6 2,4 3
7 2,5 3.5
8 3,4 3.5
9 3,5 4
10 4,5 4.5
This gives the means of sample size 2. We form a distribution of sample
means which can be represented in table below
Frequency table
X f 2
fx fx
Mean Frequency
2 2
2 fx fx 97.50
S (3)2
f f 10
S 0.7500 0.866
AE t a
AE t - a
RE
a a
It provides a degree of error for comparison purposes between different sets
of data.
but it greater than 29 we cannot choose that number. The bold numbers in
table 7.2 are the selected sample that is the numbers selected are 23, 08,
27, 1013, 05, 11, 12, 07, 08. The corresponding students constitute required
sample.
Stratified Random Sampling: This sampling design is most appropriate if
the population is heterogeneous with respect to characteristic under study or
the population distribution is highly skewed.
We subdivide the population into several groups or strata such that :
i) Units within each stratum is more homogeneous
ii) Units between strata are heterogeneous
iii) Strata do not overlap, in other words, every unit of population belongs
to one and only one stratum
The criteria used for stratification are geographical, sociological, age, sex,
income and so on. The population of size N is divided into K strata
relatively homogenous of size N1, N2.Nk such that N1 + N2
+ + Nk = N. Then, we draw a simple random sample from each
stratum either proportional to size of stratum or equal units from each
stratum.
Merits and demerits of stratified random sampling
Merits Demerits
1. Sample is more representative 1. Many times the stratification is not
effective
2. Provides more efficient estimate 2. Appropriate sample sizes are not
drawn from each of the stratum
Example 2
The items produced by factories located at three cities X, Y and Z are
200, 300 and 500 respectively. We wish to draw a sample of 20 items under
proportional stratified sampling. We number the unit from 0 to 999. Then
refer to random table and select the numbers as represented in table below
200
For Factory X 20 4
1000
300
For Factory Y 20 6
1000
500
For Factory Z 20 10
1000
Total = 20
For first factory sample units selected are 174, 192, 069, 156.
For second factory sample units selected are 287, 432, 444, 482, 302, 254.
For third factory sample units selected are 854, 772, 733, 741, 822, 853,
570, 802, 629, 525.
Systematic Sampling
This design is recommended if we have a complete list of sampling units
arranged in some systematic order such as geographical, chronological or
alphabetical order.
Suppose the population size is N. The population units are serially
numbered 1 to N in some systematic order and we wish to draw a sample
of n units. Then we divide units from 1 to N into K groups such that each
group has n units.
This implies nK = N or K = N/n. From the first group, we select a unit at
random. Suppose the unit selected is 6th unit, thereafter we select every 6 +
Kth units. If K is 20, n is 5 and N is 100 then units selected are 6, 26, 46,
66, 86.
Example: We want to select 1000 colleges from southern states. In the first
stages we may select any three states. In the second stage we may select
some districts in that state. In the 3rd stage, we may select the colleges in
each district. We may adopt any sampling technique at each stage.
Merits and demerits of multi stage sampling
Merits Demerits
Greater flexibility in sampling Estimates are less accurate
method
Existing division can be used Investigator should have knowledge of the
entire population that will be sampled
Convenience Sampling
The sample units are selected according to convenience of the investigator.
It is also called chunk which refers to the fraction of the population being
investigated which is selected neither by probability nor by judgment.
Moreover, a list or framework should be available for the selection of the
sample. It is used to make pilot studies. However, there is a high chance of
bias being introduced.
Quota Sampling
It is a type of judgment sampling. Under this design, quotas are set up
according to some specified characteristic such as age groups or income
groups. From each group a specified number of units are sampled
according to the quota allotted to the group. Within the group the selection
of sample units depends on personal judgment. It has a risk of personal
prejudice and bias entering the process. This method is often used in public
opinion studies.
P Ps
Z (For finite population )
N - n / N - 1 PQ / n
P Ps
Z (For infinite population )
PQ / n
z 2 PQ
n
e2
where,
Z = value correspond to the degree of confidence desired
P = Population proportion,
Ps = Sample proportion which implies P - Ps error we admit in the
result
Q=1P
n = Sample size.
3. The formula used for calculating the sample size for infinite population,
when population mean and sample mean are given, is:
Z s (For infinite population )
n
z 2 2
n
e2
where,
= Population mean
s = Sample mean
Then we have,
z 2 2 (1.41) 2 92 1.9881 81
n 2 40.25 40.
e 22 4
Hence the required sample size is 40.
Note: The formula used for calculating the sample size for finite population,
when population mean and sample mean are given, is:
Z s (For finite population )
N n
n N 1
z 2 2 N
n
( N 1)e 2 z 2 2
where,
= Population mean
s = Sample mean
z 2 2 N
n
( N 1)e 2 z 2 2
(2.07) 2 (8.3) 2 350
(350 1) (2.4) 2 (2.07) 2 (8.3) 2
103315.4
44.8 45.
2305.4
Hence the required sample size is 45.
Note: The formula used for calculating the sample size, when mean of
sample means is given, is:
x
n
where,
= Mean of sample means
x
= Population standard deviation
n = Sample size
x n
ix) The central limit theorem assures that the sampling distribution of
mean is always normal.
x) Stratified sampling is used when each group considered are more
homogenous within itself and heterogeneous between group.
11.10 Summary
There are two methods of studying the characteristics of population, census
and sampling. The various advantages of sampling and the various errors
that could prop up in using these methods were explained.
Mainly, there are two methods of sampling namely; probability sampling and
non-probability sampling. The merits and demerits of each sampling method
were explained. We discussed the procedure for determining sample size.
We concluded the chapter with the importance of central limit theorem.
11.12 Answers
Self Assessment Questions
1. i- T, ii- F, iii- T, iv- T, v- F, vi- T, vii- T, viii - T, ix- T, x- T, xi- F, xii- F
2. i- F, ii- T, iii- T, iv- F, v- T, vi- F, vii- F, viii - T, ix- T, x- T
T denotes True
F denotes False
Sikkim Manipal University Page No.: 358
Probability and Statistics Unit 11
Terminal Questions
1. Refer section 11.6
2. Refer section 11.7.1
3. Refer section 11.7.1
4. Refer section 11.7.2
5. Refer section 11.4
6. Refer section 11.5
7. Refer section 11.5
8. Refer section 11.7.1
9. The sample size is approximately 19.
10. The sample size is approximately 23.