Professional Documents
Culture Documents
4.1
4.0
3.9
3.8
1 1
3.7
Subgroup 0 5 10 15 20 25
0.6
1
0.5
Sample StDev
0.4
0.3
0.2
0.1
S
0.0
Author
Anil Kumar Ammina
Reliability Engineering
BE Analytic Solutions
Key Terminologies
Descriptive Statistics :
Numerical index that describes some characteristics of the population. It
describes sample or finite population, gathers and describes data in terms of
tables, graphs etc. and it is subjective in nature
Variance σ s
Point Estimator:
The estimation of the population in terms of a single value. Specified by putting
hat on the parameter. If q the parameter, q^ is the point estimator
Interval Estimator:
The estimation of the population in terms of a range. Specified as Point Estimator
^ D is the interval estimator
± Error If q the parameter, q±
Types of Data
Variable data:
Data which can be quantified, e.g., temperature, number of countries etc.
Attribute data :
Data which cannot be quantified, e.g., fragrance of a flower, aesthetics of the
product, pleasing appearance, conduct of the students etc. Based on human
feelings e.g. taste of tea. Subjectivity in assessment.
Visualizing Your Data – Box Plot
Outlier
* any point outside the Lower or
Upper Limit.
Minimum Observation
that falls within the
Lower Limit = Q1 - 1.5 (Q3 - Q1)
Box Plot Exercise
3.5
1 2 3 4
Vendor
Visualizing Your Data – Histogram
When is it used? 15
Frequency
data.
5
Note
Number of classes,K, is user adjustable (under “options”)
Visualizing Your Data – Scatter Plot
When is it used?
Sample Size = n = 24
Mean (x)
Q1 Q3
Spread
Represents the variation in the process
P5 P95
Q1 Q3 Q1 Q3
Descriptive Statistics
Minitab Output
Normality Check
Histogram
Box Plot
Basic Statistical
Information
Confidence Intervals
Using Excel
45
40
35
30
25
20
15
10
5
0
0 2 4 6 8 10 12 14 16 18
Series1 0 0 0 0 4 8 37 42 7 2
0.08
sigma =5
Function
0
-50 -30 -10 10 30 50 70 90 110 130 150 170
x
Mean
The expected or mean value of a continuous random variable X with pdf f (x) is
∞
µX = E (X ) = ∫ x ⋅ f ( x ) dx For continuous data
−∞
n
xi
µX = ∑i=1 n
For discrete data
∫
2
σ X = V (x) = ( x − µ )2 ⋅ f ( x)dx For continuous data
−∞
= E [( X − µ )2 ]
n
( xi − x ) 2
S2 = ∑
i =1 n −1
For discrete data
The Variance is a measure of the average squared deviation from the mean µ
Z (Standardised Normal) Distribution
Put the following substitution in the normal distribution
z=
(x − µ )
σ
The probability density function 2is given as follows -
1 − z2
N (0,1) = N ( z ,0,1) = e for − ∞ ≤ z ≤ ∞
2π x = µ + zσ
•Symmetric about its mean z=0
•Basis for 6σ and z-score
•Binds the family of normal distributions into a single
distribution.
z Distribution
Z = 6 means
0.5
Z-score is 6
Probability Density
0.4
Function
0.3
0.2 and
0.1
0
-4 -3 -2 -1 0 1 2 3 4 The process is 6σ
z
Standardize the Normal Distribution
X–µ
Z=
Normal
σ Standardized
Distribution f (Z) Normal Distribution
f(X)
σz=1
σ
µ X µZ =0 Z
One table!
Standardized Normal Distribution
X −µ 6.2 − 5
Z= = = 0.12
σ 10
6.2 X 0.12 Z
µ =5 µZ = 0
Shaded Area Exaggerated
P(X > 8) = .3821
X −µ 8−5
Z= = = .30
σ 10
8 X .30 Z
µ =5 µZ = 0
Shaded Area Exaggerated
P(X > 8) = .3821 (continued)
Z = X − µ = 6 . 2 − 5 = 0 . 12
σ 10
Normal Standardized
Distribution Normal Distribution
σ = 10 σZ = 1
.0478
µ = 5 6.2 X µ = 0 .12 Z
Z
Shaded Area Exaggerated
Normal Distribution Thinking Challenge
X −µ 2400 − 2000
Z= = = 2.0
σ 200
Normal Standardized
Distribution Normal Distribution
σ = 200 σZ = 1
.4772
X −µ 1470 − 2000
Z= = = −2.65
σ 200
Normal Standardized
Distribution Normal Distribution
σ = 200 σZ = 1
.0040
1470 µ = 2000 X -2.65 µ Z= 0 Z
Finding Z Values for Known Probabilities
? X 0.31 Z
µ =5 µZ = 0
X = µ + Zσ = 5 + (.31)(10 ) = 8.1
Normal Distribution Example
The bolt has mean diameter of 10.5mm and its standard deviation is 0.8mm.
What is the probability that in a random sample, the diameter is between 8.8mm
and 12.5mm?
Solution :
µ = 10.5; σ = 0.8; xL = 8.8; xU = 12.5
xL − µ 8.8 − 10.5
zL = = = −2.125
σ 0.8
xU − µ 12.5 − 10.5
zU = = = 2.5
σ 0.8
Pr( xL ≤ x ≤ xU ) = Pr( z L ≤ z ≤ zU )
∴ Pr(8.8 ≤ x ≤ 12.5) = Pr(−2.125 ≤ z ≤ 2.5)
Normal Distribution Example
= −
Construct charts
For small- or moderate-sized data sets, do stem-and-leaf display and
box-and-whisker plot look symmetric?
For large data sets, does the histogram or polygon appear bell-shaped?
Do the mean, median and mode have similar values?
Check the P-Value
Central Limit Theorem
For majority of the distributions, the mean of the samples follow almost a normal
distribution as the sample size becomes at least 30.
When the inference is to be drawn about the sample mean E.g. Sachin
Tendulkar’s average batting performance in test series. It may be possible that his
batting statistics for all the tests is a non-normal distribution. In such situation, one
may like to consider a series average and use this data of means to analyze his
performance.
It is used as the basis to calculate confidence interval when the sample size is
greater than 30
equal probabilities under the normal and lognormal pdfs, incremental areas should also be equal
The mean of the lognormal distribution, µ, is
For values significantly greater than 1, the pdf rises very sharply in the
beginning, i.e. for very small values of T near zero, and essentially follows the
ordinate axis, peaks out early, and then decreases sharply like an exponential
pdf or a Weibull pdf with 0 < β < 1.
P[ X > 8] = 1 − P[ X ≤ 8]
Since ln(X) has a normal distribution with mean µ=3.2 and standard deviation σ=1,
ln(8) − 3.2
P[X ≤ 8] = Φ = Φ(−1.12) = 0.1314
1
Here, we use the Φ notation to denote the cumulative distribution function of the
standard normal distribution. As a result, the probability that the pollutant
concentration exceeds 8 parts per million is 0.1314.
Example 2
The Life, in thousands of miles, of a certain type of electronic control for locomotives
Has an lognormal distribution with µ = 5.149 and σ = 0.737. Find the 5th percentile of
the life of such locomotive.
Let X be the life of the locomotive, since ln(X) has a normal distribution.
= 3.937
Hence, X=51.265
This means that only 5% of the locomotives will have lifetime less than 51265 miles
Questions ?
Thank You