Professional Documents
Culture Documents
industry
Highlighting the relevance of statistical methods in everyday applications, The Role of Statistics in
Business and Industry bridges the gap between the tools of statistics and their use in today's business
world. This one-of-a-kind resource encourages the proactive use of statistics in three well-organized
and succinct parts:
Setting the Stage provides an introduction to statistics, with a general overview of its uses in business
and industry
Manufactured Product Applications explains how statistical techniques assist in designing, building,
improving, and ensuring the reliability of a wide variety of manufactured products such as appliances,
plastic materials, aircraft engines, and locomotives
Other Applications describe the role of statistics in pharmaceuticals, finance, and business services, as
well as more specialized areas including the food, semiconductor, and communications industries
This book is truly unique in that it first describes case studies and key business problems, and then
shows how statistics is used to address them, while most literature on the topic does the reverse. This
approach provides a comprehensive understanding of common issues and the most effective methods
for their treatment. Each chapter concludes with general questions that allow the reader to test their
understanding of the presented statistical concepts as well as technical questions that raise more
complex issues. An extensive FTP site provides additional material, including solutions to some of the
applications.
With its accessible style and real-world examples, The Role ofStatistics in Business and Industry is a
valuable supplement for courses on applied statistics and statistical consulting at the upper-
undergraduate and graduate levels. It is also an ideal resource for early-career statisticians and
practitioners who would like to learn the value of applying statistics to their everyday work.
The Role of Statistics in Business and Industry provides insights, not available elsewhere, into what
business and industrial statistics is all about. It prepares students and aspiring statisticians for a career
in business and industry, broadens the perspective of those already practicing, and shows non-
statisticians the value of using statistics. It supplements other texts by describing key problems that
can be addressed by statistics--based on the internationally renowned authors' extensive experience--
and then demonstrates the tools to address them successfully.
Statistics plays a vital role in every fields of human activity. Statistics has important role in
determining the existing position of per capita income, unemployment, population growth rate,
housing, schooling medical facilities etc…in a country. Now statistics holds a central position in almost
every field like Industry, Commerce, Trade, Physics, Chemistry, Economics, Mathematics, Biology,
Botany, Psychology, Astronomy etc…, so application of statistics is very wide. Now we discuss some
important fields in which statistics is commonly applied.
(1) Business:
Statistics play an important role in business. A successful businessman must be very quick and
accurate in decision making. He knows that what his customers wants, he should therefore, know what
to produce and sell and in what quantities. Statistics helps businessman to plan production according
to the taste of the costumers, the quality of the products can also be checked more efficiently by using
statistical methods. So all the activities of the businessman based on statistical information. He can
make correct decision about the location of business, marketing of the products, financial resources
etc…
(2) In Economics:
Statistics play an important role in economics. Economics largely depends upon statistics.
National income accounts are multipurpose indicators for the economists and administrators.
Statistical methods are used for preparation of these accounts. In economics research statistical
methods are used for collecting and analysis the data and testing hypothesis. The relationship
between supply and demands is studies by statistical methods, the imports and exports, the inflation
rate, the per capita income are the problems which require good knowledge of statistics.
(3) In Mathematics:
Statistical plays a central role in almost all natural and social sciences. The methods of natural
sciences are most reliable but conclusions draw from them are only probable, because they are based
on incomplete evidence. Statistical helps in describing these measurements more precisely. Statistics
is branch of applied mathematics. The large number of statistical methods like probability averages,
dispersions, estimation etc… is used in mathematics and different techniques of pure mathematics like
integration, differentiation and algebra are used in statistics.
(4) In Banking:
Statistics play an important role in banking. The banks make use of statistics for a number of
purposes. The banks work on the principle that all the people who deposit their money with the banks
do not withdraw it at the same time. The bank earns profits out of these deposits by lending to others
on interest. The bankers use statistical approaches based on probability to estimate the numbers of
depositors and their claims for a certain day.
(8) In Astronomy:
Astronomy is one of the oldest branch of statistical study, it deals with the measurement of
distance, sizes, masses and densities of heavenly bodies by means of observations. During these
measurements errors are unavoidable so most probable measurements are founded by using
statistical methods.
Example: This distance of moon from the earth is measured. Since old days the astronomers have
been statistical methods like method of least squares for finding the movements of stars.
• Statistical methods for business intelligence
Innovation Matters
Business data exist in many different forms, including time series and longitudinal data, and may
contain various types of variables, such as time to event, occurrences, categorical and continuous
measurements. Statistical data analysis includes data management, modeling, segmentation and
application integration, and seeks to uncover useful information; forecast business processes at
various horizons; and detect hidden structural changes.
The challenge of business analytics is that one has to estimate the parameters of the model without
comprehensive information. For instance, IBM Research may be asked to develop a control center (or
dashboard) that will track a portfolio of large IT services customers and identify those that might
decide to terminate their contract, or renegotiate it to achieve a lower price, which would lead to a
significant loss of revenue to the company. In this case, as shown in Figure 1, the inputs to the model
are numerous variables that describe the following five risk factors: 1) financial health of client
companies; 2) previous relationships with the service provider; 3) price and competitiveness of the
offered service; 4) significant events in the client company that could have a potential impact on the
decision to cancel the service (e.g. change of CEO, merger, restructuring, etc.); and 5) previous history
of contract terminations or renegotiations. The output variable is the likelihood that a customer will
terminate its contract (or a part of it).
Conventional methods are limited to estimating the likelihood of termination, without providing
insights into which factor is most influential in the decision. Yet, knowing the impact of different factors
to the client's decision can help the service provider influence the outcome. For example, if the
decision to terminate is based on limited cash availability, the service provider might arrange different
financing. On the other hand, if the decision is based on the low satisfaction with the service, the
service provider can influence the outcome by improving service and mobilizing its sales and
marketing teams to save the relationship. Typically only the variables that influence the risk factors
are known, not the risk factors themselves, which means they are considered hidden states.
In traditional methods, such as logistic regression, the values of these hidden states are computed as
a byproduct of the model. However, in many applications, at least some of the relationships among
these factors (i.e. hidden variables) are known. For example, in the aforementioned problem of the
dashboard design, it is often possible to provide additional information in the form of "Company A has
been more satisfied than Company B" or "Company C has better financial health than Company D."
Because the traditional parameter estimation techniques, such as (Multidimensional Analysis Platform
(MAP) or iteratively re-weighted least squares, do not account for these relationships, the estimation of
hidden variables obtained with the standard parameter estimation procedures is not optimal. An
estimation procedure that captures such relationships in the data is needed.
At IBM Research, an algorithm was developed to compute the regression parameters, taking into
account the issue of estimating hidden variables with constraints. A recursive adaptation procedure
has also been developed to identify the most significant nonlinear relationships in the data and to
adapt the model by introducing corresponding higher order terms. The entire approach is embedded in
a Web-based platform and used to track the portfolio of large IT services clients in the IBM Global
Services division, and identify those who are likely to terminate or reduce the scope of their
engagement.
Ibexi Solut ions Page 1
Mode=630;
Median=554;
Average=522
This reveals that we sell 522
policies per month, on an
annual average; however, as
the median is 554, which is
greater than the average, it
indicates that the lowervolume months are pulling
down the performance more
sharply than can be made up
by the higher-selling
months.
Distribution of
Data, using
! Frequency
Distribution
! Histogram
What is the distribution of policies by product class, assured gender?
Illustration:
Product / Gender Company Female Male
Level Term Insurance 32,525 25,296 63,295
Pure Endowment 2,090 56,092 217,092
Universal Life 490 7,267 20,880
Term Insurance
27%
21%
52%
Endowment
1% 20%
79%
Universal life
2% 25%
73%
The distribution data
indicate a difference in the
purchase pattern for pure
term insurance among
genders, as opposed to
investment products like
endowment and universal
life.
IMRB, Census
Interest rates, Tax changes,
Regulatory changes,
Product launched by
compet it ion
Typically from databases
and transactional systems
Launch of new products by
the company, new
partnership agreements
Secondary data has the advantage of saving time and reducing data gathering costs. However,
there are limitations of secondary data. These include:
D
A
T
A
S
O
U
R
C
E
External
Internal
DATA STRUCTURE
Structured Unstructured
Examples of secondary data in different categories
The factor loading can be defined as the correlations between the factors and their underlying
variables. A factor-loading matrix is a key output of the factor analysis.
Factor X Factor Y Factor Z
Variable 1 0.7 0.6 0.6
Variable 2 0.6 0.6 0.8
Variable 3 0.9 0.7 0.8
Columnís Sum of
Squares
1.66 1.21 1.64
Each cell in the matrix represents correlation between the variable and the factor associated
with that cell. The square of this correlation represents the proportion of the variation in the
variable explained by the factor. The sum of the squares of the factor loadings in each column
is called an eigenvalue. An eigenvalue represents the amount of variance in the original
variables that is associated with that factor. The communality is the amount of the variable
variance explained by common factors.
5. Cross Selling
It costs 5 times more to acquire a new customer than retaining an existing customer.
Encouraging existing customers to spend more can have a dramatic effect on companyís