Professional Documents
Culture Documents
INTRODUCTION TO STATISTICS
Objectives
The Objectives are to help to understand:
The meaning of Statistics
Origin of statistics
Growth of statistics
Function of statistics
Scope of statistics
Value of statistics
Basic concepts of statistics.
Structure
1.1 Introduction
1.2 Origin of statistics
1.3 Definitions of statistics
1.4 Functions of statistics
1.5 Scope of statistics
1.6 Trust of statistics
1.7 Population and sample
1.8 Scales on measurement
1.9 Quantitative data
1.10 Qualitative data
1.11 Methods of data Collection
1.12 Classification of data
1.13 Summary
1.14 Key words
1.15 Answer to check your progress
1.16 Further readings
1.1 INTRODUCTION
First of all, we welcome you for studying the science statistics. The most
successful man in life is one who makes the best decision, based on the available
information. Thus, you have embarked on the study which can help to take a decision
for bright future.
This course has been designed primarily to learn the subjects and thereby the
students can aspire to take up professions that demand skills and techniques in
statistics.
1.2 ORIGIN OF STATISTICS
Statistics is a tool of decision making in real life problems. It originated more
than 2000 years ago, but it was recognized as separate discipline from 1940 in India.
From then to till now, statistics is evolving as a versatile powerful and indispensable
instrument for investigation in all fields of real life problems. It provides a wide
Variety of analytical tools. We have reached a stage where no empirical Science can
afford to ignore the science of statistics since the diagnosis of pattern of recognition
can be achieved through the science of Statistics.
Statistics conveys a variety of meanings to people. Statistics is a method of
obtaining and analysing data in order to take decisions on them.
In India, during the period of Chandra Gupta Maurya there was an efficient
system of collecting official and administrative statistics. During Akbars reign
(1556-1605AD) maintained good records of land and agricultural
statistics.
Statistical surveys were also conducted during his reign.
In Germany, the systematic collection of official statistics originated towards
the end of the 18th century.
Sir Ronald A. Fisher known as Father of Statistics placed Statistics on a very
sound footing by applying it to various diversified fields. His contributions in
statistics led to a very responsible position of statistics among science.
Professor C.R. Rao is a living Indian legend, whose career spans the history of
modern statistics: He is considered by many to be the greatest living statistician in the
world to day. He has derived the most fruitful years of his life between India and
America.
Check your progress 1:
State the names of some Indian statisticians.
respondents, we could use a nominal scale of 1 for male and 2 for female But 1 and 2
in this case do not represent any order or distance. They are simply used as labels.
One can use the nominal scale to indicate categories of any variable which is not to be
given a numerical significance. For example, demographic variables such as religion,
education level, languages spoken, and other variables like magazines read, TV shows
watched can be nominally scaled.
1.8.2 Ordinal Scale:
The ordinal scale places the observations in order. The use of an ordinal scale
implies a statement of greater than or less than symbol (> or <) which may be used to
designate the qualitiative data. For example Rama is greater than Arun or Sita is less
than Ramya.
The difference between nominal and ordinal scale is that the nominal scale
incorporates the relation of equivalence only while the ordinal scale incorporates the
relation of equivalence as well as of greater than or lesser than.
1.8.3 Interval Scale:
Interval scale measures the observations with respect to the distance or degree
of difference between all the observations. In interval scale zero is the arbitrary
origin. For example temperature is measured on interval Scale. Celsius (Centigrade)
F 32
0 and Fahrenheit 0 are different for the two scales. Linear transformation C =
18
of valves of an interval scale remain unchanged. In Fahrenheit 0 degree is equal to
17.78 degree Celsius.
Check your progress 5:
Give below the differences between interval scale and ordinal scale.
10
workers in the city and obtain the desired information. The information thus obtained
is first hand or original in character.
1.11.4 Indirect oral Investigation:
In this method of collecting data, the investigator contacts the respondents or
witnesses capable of supplying the necessary information. This method is adopted in
those cases where the information obtained is of a complex nature and the
respondents are not inclined to respond if approached directly. The success of this
method of collecting the primay data depends on the following:
The ability of the interviewers to draw out the information from witnesses by
means of appropriate questions and cross-examination.
The honesty of the interviewers who are collecting the primary data.
1.11.5 Information from correspondents
Under this method, the investigator appoints local agents or correspondents in
different places to collect information. These correspondents collect and transmit the
information to the central office where the data are processed. News paper agencies
adopt this method. For example, in the construction of wholesale price index
numbers regular information is obtained from correspondents appointed in different
areas. The special advantage of this method is that it is cheap and appropriate for
extensive investigation. However, this method of collecting information may not
ensure accurate results because of the personal prejudice and bias of the
correspondents.
1.11.6 Mailed Questionnaire method
Mailed questionnaire method consists a list of questions pertaining to the
survey (known as questionnaire). The questions are prepared and sent to various
respondents by post or messengers. The questions have also space for answers. A
request is made to the respondents through a covering letter to fill up the
questionnaire and send it back within a specified time. The advantages of the method
of collecting data can be easily adopted where the field of investigation is very fast
and it is also cheap provided the respondents respond in time. The main setback of
this method is that it can be adopted only literate respondents. The other limitations
are:
It involves uncertainty about the response
Co-operation on the part of respondents may be difficult to presume.
The information supplied by the informants may not be correct.
It may be difficult to verify the accuracy.
1.11.7 Interview schedule
Interview schedule method of collecting information is that schedules are send
to respondents through interviewers. The interviewers contact the respondents, get
11
replies to the questions contained in a schedule and fill them in their own hand writing
in the questionnaire.
The main difference between the mailed questionnaire and the interview
schedule is that the questionnaire is sent to the respondents by post whereas in the
interview schedule method the interviewers carry the schedule directly to the
respondents. This method is free from most of the limitations of the mailed
questionnaire method except that there is a little non-response of prestige issues when
the interviewers go personally to obtain the information.
The main advantages of this method are:
It can be adopted in the case of illiterate people.
The information received under this method is more reliable as compared to
the other methods of collecting information.
The success of the method depends on the well trained expertised and skilled
inerviewers. This method is quite popularly used in practice because of the personal
contact of the interviewers.
1.11.8 Drafting the Questionnaire
Drafting the questionnaire is an art. The success depends largely on the skill
and insight with which the lists of questions are formulated. Questions must be
unequivocal and easily intelligible. The following considerations should be kept in
mind while drafting a questionnaire.
The person conducting the survey must introduce himself and state the
objective of the survey.
The precise number of questions to be included would depend on the object
and scope of the investigation.
The questions must be arranged in a logical order so as to get a spontaneous
reply from the respondents.
The questions should be short and simple to understand.
Personal questions should be avoided
The questionnaire should provide necessary instructions to the respondents.
Questions should be of objective types which obtain uniformity of answers
from the respondents.
Open questions should be minimised
A questionnaire should be made to look as attractive as possible
Questions requiring calculations should be avoided.
The questionnaire should be pre-tested with a group before mailing it out.
12
13
ii) Female
d) Your age: __________years
e) Educational Qualification
i) Illiterate
ii) upto higher secondary level
iii) upto UG degree level
iv) upto PG degree level
v) Professional level
f) Occupational status
i) House wife
ii) Student
iii) Daily wage
iv) Petty traders/Shop owners
v) Business / Industrialists
vi) Self Employed professional
vii) Service
viii) Former
g) Your monthly income: Rs._____________
h) Monthly income from the other sources: Rs.__________________________
i) Total number of members in your family
i) Male:___________
ii) Female:_____________
j) Type of your family
i) Joint family
ii) Nuclear family
k) Marital status
14
i) Single
ii) Married
l) Community: OC/BE/MBC/SC/ST
m) Religion: Hindu/Christian/Muslim
n) Nature of occupation
i) Traditional
ii) New
II Information Search
a) Why did you gather information?
Strongly Agree-SA, Agree-A, Undecided-UD, Disagree-DA, Strongly Disagree-SDA
(Please tick the appropriate box)
i) ________because it involves high investment
SA A
UD DA
SDA
UD
DA
SDA
UD
DA
SDA
UD
DA
SDA
UD
DA
SDA
UD
DA
SDA
15
x) _________television advertisement
1 2
3
1 2
xv) ________exhibition
16
17
Number of companies
Chennai
100
Coimbatore
75
Trichy
40
Salem
10
Population
(in crores)
2001
100
2002
102
2003
108
2004
110
Number of Students
Male
12000
Female
12050
Total
24050
18
When are attribute is studied, two classes are formed, one possessing the
attribute and the other not possessing the attribute. This type of classification is
known as simple classification.
Number of Students
90 100
100
100 110
300
110 -120
500
120 -130
600
130 140
750
140 150
850
150 160
200
160 170
100
Simple classification
19
We may classify population on the basis of sex, literacy employed and so on.
This type of classification is known as manifold classification.
Manifold Classification
1.13 SUMMARY
You have now some idea of what comes under the study of descriptive
statistics. The above illustrations have dealt with the basic concepts of descriptive
statistics. These basic concepts give the foundation to learn the science of statistics.
1.14 KEY WORDS:
Statistics:
The word statistics seems to have been derived from the Latin word
Status or the Italian word Statista or the German word Statistik each of which
means a Political State.
Data set: A collection of data is called data set.
Data point: A Single observation is known as data point.
Science: A system study of knowledge provided the facts are existing in the world.
1.15 ANSWER TO CHECK YOUR PROGRESS:
1. Sir Ronald A. Fisher, Prof. C.R. Rao.
2. Santhakumaran defines, Statistics is the science of winding and twisting
network connecting Mathematics, Scientific Philosophy, computer Software
and other intellectual sources of the Millennium.
3. In Insurance, the sampling techniques are commonly followed. For making
various insurance Techniques and Methods, so much precision is essential and
the life of every insured persons are generally valued on the basis of their.
Insurance policies claimed accordingly. Various Insurance issues and policies
are valued according to their incomes, ages, family etc., thus, it gives various
keys and ideas to get the clear solution based on Insurance Policies.
4. (i) Statistics does not deal with individual items.
20
21
22