Professional Documents
Culture Documents
Probability
Quantifying randomness
The context: An experiment that admits several
possible outcomes
Some outcome will occur
The observer is uncertain which (or what) before
the experiment takes place
Event space = the set of possible outcomes. (Also
called the sample space.)
Probability = a measure of likelihood attached to
the events in the event space. (Try to define
probability without using a word that means
probability.)
Rules (Axioms) of
Probability
An event E will occur or not occur
P(E) is a number that equals the probability
that E will occur.
By convention, 0 < P(E) < 1.
E' = the event that E does not occur
P(E') = the probability that E does not
occur.
Joint Events
Pairs (or groups) of events: A and B
One or the other occurs: A or B A B
Both events occur A and B A B
Independent events: Occurrence of A does
not affect the probability of B
An addition rule: P(A B) = P(A)+P(B)-P(A
B)
The product rule for independent events:
P(A B) = P(A)P(B)
Application
Survey of 27326 German Individuals over 5 years. Frequency in black,
sample proportion in red. E.g., .04186 = 1144/27326, .52123 =
14243/27326
Female
Male
Total
Female
Male
Total
Uninsured
1144
1979
3123
Uninsured
.04186
.07242
.11429
Insured
11939
12264
24203
Insured
.43691
.44880
.88571
Total
13083
14243
27326
Total
.47877
.52123
1.00000
Male
Total
Uninsured
.04186
.07242
.11429
Insured
.43691
.44880
.88571
Total
.47877
.52123
1.00000
Independent Events
Events are independent if the occurrence of one
does not affect probabilities related to the other.
Events A and B are independent if P(A|B) = P(A).
I.e., conditioning on B does not affect the
probability of A.
Using Conditional
Probabilities: Bayes
Theorem
P(A,B)
P(A | B)
P(B)
P(B | A)P(A)
P(B)
P(B | A)P(A)
P(A,B) P(notA,B)
P(B | A)P(A)
Target
Theorem
Def inition
Computation
Drug Testing
Notation
+ = test indicates disease, = indicates no disease
D = presence of disease, N = absence of disease
Known Data
P(Disease) = P(D) = .005 (Fairly rare) (Incidence)
P(Test correctly indicates disease) = P(+|D) = .98 (Sensitivity)
(Correct detection of the disease)
P(Test correctly indicates absence) = P(-|N) = . 95 (Specificity)
(Correct failure to detect the disease)
Objectives: Deduce
P(D|+) (Probability disease really is present | test positive)
P(N|) (Probability disease really is absent | test negative)
Note, P(D|+) = the probability that a patient actually has the disease
when the test says they do.
More Information
Deduce: Since P(+|D)=.98, we know P(|
D)=.02
because P(-|D)+P(+|D)=1
[P(|D) is the P(False negative).
Deduce: Since P(|N)=.95, we know P(+|
N)=.05
because P(-|N)+P(+|N)=1
[P(+|N) is the P(False positive).
Deduce: Since P(D)=.005,
P(N)=.995 because P(D)+P(N)=1.
P(D|+) =
=
P(+|D)P(D)
P(+|D)P(D)
=
P(+)
P(+|D)P(D) + P(+|N)P(N)
.98(.005)
= 0.08966 (Yikes!!)
.98(.005)+.05(.995)
Insurance policy
You pay premium = F
If you collect on the policy, the payout
=W
Probability they pay you = P
Expected profit to them is
E[Profit] = F - P x W > 0 if F/W
>P
When is insurance fair? E[Profit] = 0?
Applications
Automobile deductible
Consumer product warranties
Random Variable
Definition: A variable that will take a value
assigned to it by the outcome of a random
experiment.
Realization of a random variable: The
outcome of the experiment after it occurs.
The value that is assigned to the random
variable is the realization.
X = the variable, x = the realization
Use random variables to organize the
information about a random occurrence.
Probability Distribution
Range of the random variable = the
set of values it can take
Notation
Probability distribution =
probabilities assigned to outcomes.
P(X=x) or P(Y=y) is common.
Probability function = PX(x).
Sometimes called the density function
Cumulative probability is
Prob(X < x) for the specific X.
P(x) 1
Common Results
for Random Variables
Concentration of Probability
Independence
Measuring How
Variables Move
Together: Covariance
Cov(X,Y)
P(x,y)(x- )(y
values of X
values of Y
Covariance(x,y)
Aspect of Correlation
x y 2 xy
2
x
2
y
ax by a b 2abxy x y
2
2
x
2
y