Professional Documents
Culture Documents
PROBABILITY REVIEW
PROBABILITY REVIEW
PROBABILITY REVIEW
PROBABILITY REVIEW PARAMETER ESTIMATION
PARAMETER ESTIMATION
PARAMETER ESTIMATION PARAMETER ESTIMATION HYPOTHESIS TESTING
HYPOTHESIS TESTING
HYPOTHESIS TESTING HYPOTHESIS TESTING
Probability Functions
Probability Functions
Probability Functions Estimator Quality
Estimator Quality
Estimator Quality Hypothesis Testing
Hypothesis Testing
Hypothesis Testing
&& &
Bias Bias
Bias Type I error: Reject
Type I error: Reject
Type I error: Reject even though
even though
even though is true
is true
is true
=== d dd
==Pr =PrPr
'('( '(
bias
biasbias
vv v ==E=EE Type II error: Accept
Type II error: Accept
Type II error: Accept even though
even though
even though RR is true
is true
R is true
==1=11 ==Pr =PrPr > >> An estimator is asymptotically unbiased if the bias =
An estimator is asymptotically unbiased if the bias
An estimator is asymptotically unbiased if the bias =Pr
=PrPr Type I error
Type I error
Type I error==significance level =significance level
significance level
of an estimator goes to 0 as the sample size goes to =
of an estimator goes to 0 as the sample size goes to
of an estimator goes to 0 as the sample size goes to =Pr
=PrPr Type II error
Type II error
Type II error
=== infinity.
infinity.
infinity. 111 = =Pr(=Pr( Pr(Reject
Reject
Reject RR is true
is true
R is true ==power =power
power
&& & Consistency
Consistency
Consistency -value: Probability of the observations if
-value: Probability of the observations if
-value: Probability of the observations if is true
is true
is true
=== d dd
== =ln lnln ; ;; ==='8
'8'8
&& & is consistent if:
is consistent if:
is consistent if: Confidence Intervals and Sample Size
Confidence Intervals and Sample Size
Confidence Intervals and Sample Size
'( '( '( ?? ? ?? ?
lim
lim lim
PrPrPrbb If ~(,
If ~(,
If ~(, ) and
) and
) and is known, then the
is known, then the
is known, then the 111
%
% %
Moments
Moments
Moments b
<<< ==1 for all
=1 for all
1 for all >>0 >0 0
b(
b( b(
(( ( confidence interval for is
confidence interval for is
confidence interval for is /? /? where
/? where
where
EEE === d
d d Sufficient but not necessary for consistency:
Sufficient but not necessary for consistency:
Sufficient but not necessary for consistency: bb b
bias
bias bias 0 and Var
0 and Var
0 and Var 0 as
0 as
0 as Pr (
Pr (
Pr ( <<</? /?
/? ))== ) /2.
=/2.
/2.
'('( '(
kk th thk raw moment:
raw moment:
th raw moment: == =
<<= =E=EE<< <
< Relative Efficiency
Relative Efficiency
Relative Efficiency Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals for Means
?? ?
kk th thk central moment:
central moment:
th central moment: = = E=E E
<< <
The relative efficiency of
The relative efficiency of
The relative efficiency of distribution
distribution
distribution
<< < RR with respect to
with respect to
R with respect to ?? ?
Variance:
Variance:
Variance: ?? ?
===??== ?E =E E
?? = ?
=E=EE?? ?
?? ? If
If
If ===iR ?? ?
i i, has a
, has a ?? ?
i , has a distribution. distribution.
distribution.
Var
Var Var ??? iR iR
=== For
For
For i ~
i ~i ~
?? ?
,,,
: : :
Coefficient of variation:
Coefficient of variation:
Coefficient of variation: === VarVarVar RRR
biR b b
iR iRi i
i
?? ???~ ? ?? ?
~ ~
SSS Mean Square Error
Mean Square Error
Mean Square Error bb b
?? ? ?? ? ?? ?
~
~ ~ 1 1 1
Skewness:
Skewness:
Skewness: = =
RR R SS S = ? ? ? ?? ? iR
iR iRi i i
MSE
MSE MSEvv v ==E=E E ==Var =Var Var
+++bias bias biasvv v Students t distribution
Students t distribution
Students t distribution
VVV
Kurtosis:
Kurtosis:
Kurtosis: ??= =
? =VV V Uniformly Minimum Variance Unbiased Estimator has a distribution if
Uniformly Minimum Variance Unbiased Estimator
Uniformly Minimum Variance Unbiased Estimator has a distribution if
has a distribution if === with
with
with
?? ?
Covariance: Cov
Covariance: Cov
Covariance: Cov ,
,, ==E=E E
YYY ZZZ (UMVUE)
(UMVUE)
(UMVUE) ~(0,1) and ~
~(0,1) and ~
~(0,1) and ~ ().
().
().
==E=E E E EEEEE A UMVUE is an unbiased estimator that has the
A UMVUE is an unbiased estimator that has the
A UMVUE is an unbiased estimator that has the Testing the mean of a normal population
Testing the mean of a normal population
Testing the mean of a normal population
Cov
Cov Cov ,,
,
lowest variance of any unbiased estimator
lowest variance of any unbiased estimator
lowest variance of any unbiased estimator For a sample of size from a normal population,
For a sample of size from a normal population,
For a sample of size from a normal population,
Correlation coefficient:
Correlation coefficient:
Correlation coefficient: YZYZYZ === regardless of the true value of the estimated
regardless of the true value of the estimated
regardless of the true value of the estimated the statistic to test if the mean is is:
the statistic to test if the mean is is:
the statistic to test if the mean is is:
YYZYZZ
parameter.
parameter.
parameter.
Percentiles
Percentiles
Percentiles ===
The 100p
The 100p
The 100p percentile,
percentile,
percentile, Maximum Likelihood
Maximum Likelihood
Maximum Likelihood
]], is the lowest point where
, is the lowest point where
] , is the lowest point where
thth th
Steps to Calculating MLE:
Steps to Calculating MLE:
Steps to Calculating MLE:
=== 1 1 1
]]]. If is continuous,
. If is continuous,
. If is continuous, ]] is a unique point
is a unique point
] is a unique point 1.1.1. === 3.3.Set
3.Set == =
Set ==0 =0 0
satisfying
satisfying
satisfying ]]]==. =. . Testing the mean of a Bernoulli population
Testing the mean of a Bernoulli population
Testing the mean of a Bernoulli population
2.2.2. ==ln =lnln 4.4.Solve for
4.Solve for
Solve for For a sample of size from a Bernoulli population
For a sample of size from a Bernoulli population
For a sample of size from a Bernoulli population
Mode
Mode Mode The maximum likelihood estimate equals the
The maximum likelihood estimate equals the
The maximum likelihood estimate equals the with mean , the statistic to test if
with mean , the statistic to test if
with mean , the statistic to test if === is:
is:
is:
Mode is x that maximizes
Mode is x that maximizes
Mode is x that maximizes .
. . method of moments estimate for:
method of moments estimate for:
method of moments estimate for:
Moment Generating Function
Moment Generating Function
Moment Generating Function Binomial with fixed
Binomial with fixed
Binomial with fixed ===
YY Y = =E=EEaYaY aY
Poisson
Poisson
Poisson 111
Y
bb b
==E=EEbb, where
b bb b : number of successes
: number of successes
: number of successes
Y Y00 0 , where
, where is the n
is the n
is the n derivative
derivative
th derivative
thth
Negative binomial with fixed
Negative binomial with fixed
Negative binomial with fixed
Conditional Probability
Conditional Probability
Conditional Probability Testing the difference of means from two
Testing the difference of means from two
Testing the difference of means from two
Exponential
Exponential
Exponential
PrPr
Pr populations
populations
populations
Gamma with fixed
Gamma with fixed
Gamma with fixed
PrPrPr ===
If the two populations and are normally
If the two populations and are normally
If the two populations and are normally
Pr PrPr Normal
Normal
Normal
,
,,
distributed with unknown variances that are
distributed with unknown variances that are
distributed with unknown variances that are
Variance of Maximum Likelihood Estimator
Variance of Maximum Likelihood Estimator
Variance of Maximum Likelihood Estimator
<'R = = R + ? + 1
The density function of the order statistic is: E R = R
i i ! 2
iR iR <'R b'<
Z = 1 Y Y () R + ? + 1
= 1 1 ! ! Y Var R = R ?
= number of groups
The expected value of the order statistic from a 12
When testing equality of distributions in two sample of from a uniform distribution on 0, is The Mann-Whitney statistic is the sum of the
dimensions, = 1 ( 1) where is the < counts of the number of items in the first sample
.
number of rows and is the number of columns. bR less than each item in the second sample. There
Sign Test should be ? terms in the sum.
Confidence Intervals for Variances
The confidence level that the 100pth percentile is The relationship between and the rank-sum
The unbiased sample variance from a normal
between the th and th order statistics from a statistic is
population, ? , has the distribution sample of size is equal to Pr < , where
? 1 , where ~ ? 1 . ? ? + 1
~Binomial , . = ?
Confidence intervals for the variances at the 2
To test if the median is equal to , subtract from R R + 1
100 1 % confidence interval are: each sample item to produce positive numbers = R ? + R
1. 0, ? 1 2
after subtraction. For large sample sizes, calculate
Algorithmic Efficiency between times and + is a Poisson random Birth and Death Processes
'R
variable with mean () d. A continuous Markov chain in which the states are
1
E = nonnegative integers and the only transitions
Thinning
possible are + 1 < and 1 < .
iR Given a Poisson process with intensity function
'i Rate of leaving state = < + < .
1 1 () and probability function () for a certain
Var = Expected time to stay in state = 1 < + <
1 type of an event:
Expected time starting in state k until process
iR That type of event follows a Poisson process R
Both E and Var approach ln j as . with intensity (). reaches state + 1 < = + <'R
Classification Of States They are independent of any other events For M/M/1 queuing system:
Absorbing: Cannot be exited. Homogeneous Poisson Processes Characteristics 1 <R
Accessible from state i to j: ib > 0 for some n. Given two Poisson processes with rate R and ? : ,
E < =
Two states communicate: each state is accessible The probability that the next event is from the + 1
, =
from the other. first process is
Class of states: A maximal set of states that Processes < <
The probability that k events from process 1
communicate with each other; an absorbing state Poisson 0
occur before l events from process 2 is
is a class by itself. <'R Yule 0
Irreducible chain: Only one class exists. + 1 i <'R'i M/M/1
1
Recurrent state: Probability of reentering the state , 1 <
i<
is 1. A finite Markov chain must have at least one M/M/s
where = R R + ? . ,
recurrent class.
is a non-decreasing function of t.
Test the significance of additional variables
Data Issues SSE SSE
Decreasing Failure Rate Distribution (DFR) ,b' ]R =
Censoring: Have information on the number of
is a decreasing function of t. SSE + 1
claims above or below a certain number, but dont ?
Increasing Failure On The Average (IFRA) ?
know their amounts. It results in under- =
/ is a non-decreasing function of t. ?
1 + 1
representation of large claims.
The monotone systems lifetime distribution Variance Inflation Factor (VIF)
Lack of Independence: Multiple exposures are
is IFRA if the lifetime of all components are Test collinearity of variable j.
correlated. Eg. Multiple claims from a single
IFRA. ? VIF ? VIF
accident.
Exposure Differences: Factors that may affect ? = ? = ?
& 1 b
exposures must be taken into account. iR i