functions
Normal distribution
Lognormal distribution
Mean, median and mode
Tails
Extreme value distributions
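Normal (Gaussian) distribution
PDF: $f(x) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left[-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2\right]$
CDF: $F(x) = \int_{-\infty}^{x} f(t)\,dt$
The normal distribution is denoted $N(\mu, \sigma^2)$, with the square indicating that the second parameter is the variance.
Estimates of the mean and variance from a sample of size n:
$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$, $s^2 = \frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2$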
For example, for a sample of 5 from
a standard
normal distribution, the standard deviation will be
estimated on average as 0.94 (with standard
deviation of 0.34)
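The figures quoted above can be checked with a small Monte Carlo experiment. The sketch below is only an illustration: the function name `sample_std_stats`, the number of trials and the seed are assumptions chosen here, not part of the original slides. It draws many samples of size 5 from a standard normal distribution, computes the sample standard deviation s (with the n - 1 divisor) for each, and reports the average and spread of s.

```python
import random
import statistics


def sample_std_stats(n=5, trials=20000, seed=0):
    """Return (mean of s, standard deviation of s) over many samples of size n."""
    rng = random.Random(seed)
    s_values = [
        # statistics.stdev uses the n - 1 divisor
        statistics.stdev([rng.gauss(0.0, 1.0) for _ in range(n)])
        for _ in range(trials)
    ]
    return statistics.fmean(s_values), statistics.stdev(s_values)


mean_s, std_s = sample_std_stats()
print(f"average estimate of sigma: {mean_s:.3f}")
print(f"standard deviation of the estimate: {std_s:.3f}")
```

With enough trials the two printed numbers should come out close to 0.94 and 0.34, showing that s underestimates the true value of 1 on average for such a small sample.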
Lognormal distribution
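If ln(X) has a normal distribution, X has a lognormal distribution. That is, if X is normally distributed, exp(X) is lognormally distributed.
Notation: $\ln N(\mu, \sigma^2)$
PDF: $f(x) = \frac{1}{x\sigma\sqrt{2\pi}} \exp\left[-\frac{(\ln x - \mu)^2}{2\sigma^2}\right]$
Mean: $\mu_X = e^{\mu + \sigma^2/2}$
Variance: $\sigma_X^2 = \mathrm{Var}\,X = \left(e^{\sigma^2} - 1\right) e^{2\mu + \sigma^2}$
Mode (highest point) = $e^{\mu - \sigma^2}$
Median (50% of samples) = $e^{\mu}$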
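The closed-form lognormal mean, median, mode and variance can be compared against simulated data. The following is a minimal sketch: the helper name `lognormal_summary` and the parameter values mu = 0, sigma = 0.5 are illustrative assumptions, not values from the slides. Samples are generated as exp of a normal variate, which is the definition given above.

```python
import math
import random
import statistics


def lognormal_summary(mu, sigma):
    """Closed-form mean, median, mode and variance of exp(N(mu, sigma^2))."""
    return {
        "mean": math.exp(mu + sigma**2 / 2),
        "median": math.exp(mu),
        "mode": math.exp(mu - sigma**2),
        "variance": (math.exp(sigma**2) - 1) * math.exp(2 * mu + sigma**2),
    }


mu, sigma = 0.0, 0.5  # illustrative parameters (assumed)
rng = random.Random(1)
samples = [math.exp(rng.gauss(mu, sigma)) for _ in range(100_000)]

theory = lognormal_summary(mu, sigma)
sample_mean = statistics.fmean(samples)
sample_median = statistics.median(samples)
sample_variance = statistics.variance(samples)

print(f"mean:     formula {theory['mean']:.4f}, sample {sample_mean:.4f}")
print(f"median:   formula {theory['median']:.4f}, sample {sample_median:.4f}")
print(f"variance: formula {theory['variance']:.4f}, sample {sample_variance:.4f}")
print(f"mode:     formula {theory['mode']:.4f}")
```

For any sigma > 0 the formulas give mode < median < mean, which reflects the long right tail of the lognormal distribution.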
Figure for $\mu = 0$.
[Figure: CDF plot with curves labeled experimental, lognormal and normal.]
Maximum of normal samples
For a normal distribution, the maximum of a sample is more narrowly distributed than the original distribution.
[Figure: two histograms of the sample maximum.]
Max of 10 standard normal samples: mean 1.54, standard deviation 0.59.
Max of 100 standard normal samples: mean 2.50, standard deviation 0.43.
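The means and standard deviations quoted for the maximum of 10 and 100 standard normal samples can be reproduced approximately by simulation. This is a sketch under stated assumptions: the function name `max_stats`, the trial count and the seed are choices made here for illustration, and the histograms in the slides may have used a different number of repetitions.

```python
import random
import statistics


def max_stats(n, trials=10000, seed=0):
    """Return (mean, standard deviation) of the maximum of n standard normal samples."""
    rng = random.Random(seed)
    maxima = [max(rng.gauss(0.0, 1.0) for _ in range(n)) for _ in range(trials)]
    return statistics.fmean(maxima), statistics.stdev(maxima)


# Repeat the experiment for the two sample sizes discussed in the text
results = {n: max_stats(n) for n in (10, 100)}
for n, (mean_max, std_max) in results.items():
    print(f"max of {n:3d} samples: mean {mean_max:.2f}, standard deviation {std_max:.2f}")
```

Both standard deviations should come out below 1, the standard deviation of the parent distribution, and the spread should shrink as the sample size grows from 10 to 100.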
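Gumbel distribution
PDF: $f(x) = \frac{1}{\beta} \exp\left[-\left(z + e^{-z}\right)\right]$, $z = \frac{x - \mu}{\beta}$
CDF: $F(x) = \exp\left(-e^{-z}\right)$
Mean, median, mode and variance
Mean = $\mu + \gamma\beta$, with $\gamma \approx 0.5772$ (Euler's constant)
Median = $\mu - \beta \ln(\ln 2)$
Mode = $\mu$
Variance = $\pi^2 \beta^2 / 6$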
[Figure: two panels, each comparing a fitted EV1 (Gumbel) distribution with data; legends: "fitted ev1" and "-max10 data" in one panel, "fitted ev1" and "-max100 data" in the other.]
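One simple way to fit a Gumbel distribution is the method of moments, which inverts the mean and variance formulas. The sketch below is an illustration only: the slides do not say how the "fitted ev1" curves were obtained (a maximum likelihood fit is equally plausible), and the names `gumbel_sample`, `gumbel_moment_fit`, the location symbol mu, the scale symbol beta and the numerical values are all assumptions made here. Data are generated by inverting the CDF exp(-exp(-z)), and the fitted parameters are then used to check the median formula.

```python
import math
import random
import statistics

EULER_GAMMA = 0.5772156649015329


def gumbel_sample(mu, beta, rng):
    """Draw one Gumbel (maximum-type) value by inverting F(x) = exp(-exp(-z))."""
    u = rng.random()
    while u == 0.0:  # guard against log(0)
        u = rng.random()
    return mu - beta * math.log(-math.log(u))


def gumbel_moment_fit(data):
    """Method-of-moments estimates: variance = pi^2 beta^2 / 6, mean = mu + gamma * beta."""
    beta = statistics.stdev(data) * math.sqrt(6) / math.pi
    mu = statistics.fmean(data) - EULER_GAMMA * beta
    return mu, beta


rng = random.Random(2)
true_mu, true_beta = 2.3, 0.35  # illustrative values, not taken from the figures
data = [gumbel_sample(true_mu, true_beta, rng) for _ in range(50_000)]

mu_hat, beta_hat = gumbel_moment_fit(data)
median_formula = mu_hat - beta_hat * math.log(math.log(2))
median_sample = statistics.median(data)

print(f"fitted mu = {mu_hat:.3f}, fitted beta = {beta_hat:.3f}")
print(f"median from formula = {median_formula:.3f}, sample median = {median_sample:.3f}")
```

The same `gumbel_moment_fit` function could be applied to simulated maxima of normal samples, keeping in mind that the Gumbel form is only an approximation for finite sample sizes.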
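Weibull distribution
Probability distribution:
$f(x; \lambda, k) = \frac{k}{\lambda} \left(\frac{x}{\lambda}\right)^{k-1} e^{-(x/\lambda)^k}$, $x \ge 0$, $k > 0$, $\lambda > 0$
Its log has a Gumbel distribution.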
[Figure: distribution of the log of Weibull samples compared with an EV1 fit; legend: "log weibull" and "ev1 fit".]
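The link between the Weibull and Gumbel distributions can be checked numerically. If X is Weibull with scale lambda and shape k, then ln X follows the minimum-type Gumbel (EV1) distribution with location ln(lambda) and scale 1/k, so its mean is ln(lambda) - gamma/k and its standard deviation is pi/(k sqrt(6)). The sketch below compares these values with simulated data; the parameter values and variable names are illustrative assumptions.

```python
import math
import random
import statistics

EULER_GAMMA = 0.5772156649015329

lam, k = 2.0, 1.5  # illustrative scale and shape (assumed)
rng = random.Random(3)

# random.weibullvariate(alpha, beta): alpha is the scale, beta is the shape
draws = (rng.weibullvariate(lam, k) for _ in range(50_000))
logs = [math.log(x) for x in draws if x > 0.0]  # skip an exact zero, which has no log

# Moments of ln X predicted by the Gumbel (minimum-type) distribution
expected_mean = math.log(lam) - EULER_GAMMA / k
expected_std = math.pi / (k * math.sqrt(6))

log_mean = statistics.fmean(logs)
log_std = statistics.stdev(logs)

print(f"mean of ln X: predicted {expected_mean:.3f}, sample {log_mean:.3f}")
print(f"std of ln X:  predicted {expected_std:.3f}, sample {log_std:.3f}")
```

The standard deviation of ln X depends only on the shape k, which is why a plot of the log of Weibull data changes position but not width when the scale lambda changes.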
Exercises
Find how many samples of normally distributed numbers
you need in order to estimate the mean and standard
deviation with an error that will be less than 10% of the
true standard deviation most of the time.
Both the lognormal and Weibull distributions are used to
model strength. Find how closely you can approximate
data generated from a standard lognormal distribution by
fitting it with Weibull.
Take the introduction and preamble of the US Declaration
of Independence, and fit the distribution of word lengths
using the K-S criterion. What distribution fits best?
Compare the graphs of the CDFs. Compare to a more
contemporary text.