You are on page 1of 40

Basic Probability

Probability
Probability theory is a mathematical framework that
allows us to describe and analyze random
phenomena in the world around us.
Random phenomena
Events or experiments whose outcomes we can't predict with
certainty.
Our knowledge about the outcome is limited, so we can't be
certain what will happen.

Example:
Flipping a fair coin.

Example of Randomness
Phenomena

Transmission of data from a cell phone to a cell tower.

The problem that communication engineers must consider is


that the transmission is always affected bynoise.

The noise in the transmission is a random phenomenon.

Review of Set Theory


A set is a collection of some items (elements).
We often use capital letters to denote a set.
To define a set we can simply list all the elements in curly
brackets, for example to define a set A that consist of the two
elements e and i, we write A = {e, i}.
The order of the elements in a set does not matter.
{e, i} = {i, e}
To say e belongs to A, we write e A.
To say f does not belong to A, we write f A.

Review of Set Theory (cont.)


The set with no elements, i.e., = {} is the null set or the
empty set. For any set A, A.
The universal set S is the set of all things that we could possibly
consider in the context we are studying.
In the language of probability theory, the universal set is called
the sample space.
Thus every set A is a subset of the universal set, thus A S.
For example:
If we are discussing rolling of a die, S={1,2,3,4,5,6}
If we are discussing tossing of a coin once, S={H,T}S={H,T}
(H for heads and T for tails).

Set Operations
Union
The union of two sets is a set containing all elements
that are in A or in B (possibly both).
For example, A = {1,2}; B = {2,3}; A B = {1,2,3}.

Set Operations (cont.)


Intersection
The intersection of two sets A and B, denoted by A B,
consists of all elements that are both in A and B.
For example, A = {1,2}; B = {2,3}; A B = {2}.

Set Operations (cont.)


Complement
The complement of a set A, denoted by Ac, is the set of
all elements that are in the universal set S but are not in
A.
Ac is shown by the shaded area using a Venn diagram.

Mutually Exclusive / Disjoint


Two sets A and B are mutually exclusive or disjoint if
they do not have any shared elements (their
intersection is the empty set, A B = ).
More generally, several sets are called disjoint if they
are pairwise disjoint (no two of them share a common
elements).
Example of 3 disjoint events:

Mutually Exclusive / Disjoint (cont.)


Events are said to be mutually exclusive if they have no
outcomes in common.
These are also called disjoint events.
Example:
One store carries six kinds of cookies. Three kinds are made by
Nabisco and three by Keebler.
The cookies made by Nabisco are not made by Keebler.
Observe the next person who comes into the store to buy
cookies. They choose one bag.
It can only be made by either Nabisco OR Keebler, thus the
probability that they choose one made by either company is zero.
These events are mutually exclusive.

Random Experiments
A random experiment is a process by which we observe
something uncertain, ex: rolling a die.
After the experiment, the result of the random experiment is
known outcome.
Thus in the context of a random experiment, the sample space
is our universal set.
Example:
Random experiment: toss a coin; sample space:
S={heads,tails} or as we usually write it, {H,T}.
Random experiment: roll a die; sample space: S={1,2,3,4,5,6}.

Random Experiments (cont.)


When we repeat a random experiment several times, we call
each one of them a trial.
We toss a coin three times and observe the sequence of
heads/tails. The sample space here may be defined as
S = {(H,H,H),(H,H,T),(H,T,H),(T,H,H),(H,T,T),(T,H,T),(T,T,H),
(T,T,T)}.

Random Experiments (cont.)


Our goal is to assign probability to certain events.
For example, suppose that we would like to know the probability
that the outcome of rolling a fair die is an even number. In this
case, our event is the set E = {2,4,6}.
If the result of our random experiment belongs to the set E, we
say that the event E has occurred.
Thus an event is a collection of possible outcomes.
In other words, an event is a subset of the sample space to which
we assign a probability.

Random Experiments (cont.)


If A and B are events, then A B and A B are also
events.
By remembering the definition of union and
intersection, we observe that:
A B occurs if A or B occur.
A B occurs if both A and B occur.

Probability
We assign a probability measure P(A) to an event A.
This is a value between 0 and 1 that shows how likely the event
is.
If P(A) is close to 0, it is very unlikely that the event A occurs.
On the other hand, if P(A) is close to 1, event A is very likely to
occur.

Axioms of Probability
Probability theory is based on some axioms that act as
the foundation for the theory.

Example
A adalah kejadian munculnya head (H) saat melempar koin.
S = {H,T}
A = {H}
P(A) = 1/2
axiom 1: 0 P(A) 1
P(S) = 2/2 = 1
axiom 2: P(S) = 1
B adalah kejadian munculnya tail (T) saat melempar koin.
B = {T}
kejadian A dan B adalah mutual eksklusif (disjoint)
P(B) = 1/2
P(A B) = P(A) + P(B) = 1/2 + 1/2 = 1
axiom 3

Example
A adalah kejadian munculnya bilangan ganjil saat melempar dadu.
S = {1,2,3,4,5,6}
A = {1,3,5}
P(A) = 3/6 = 1/2 axiom 1: 0 P(A) 1
P(S) = 6/6 = 1 axiom 2: P(S) = 1
B adalah kejadian munculnya bilangan genap saat melempar dadu.
B = {2,4,6}
kejadian A dan B adalah mutual eksklusif (disjoint)
P(B) = 3/6 =
P(A B) = P(A) + P(B) = 3/6 + 3/6 = 1

axiom 3

Inclusion-exclusion principle
As we have seen, when working with events, intersection means
"and", and union means "or".
The probability of intersection of A and B, P(AB), is sometimes
shown by P(A,B) or P(AB).
Inclusion-exclusion principle:
P(AB) = P(A) + P(B) P(AB) (A and B are not disjoint
events)
We can extend it to the union of three or more sets.
P(ABC) = P(A) + P(B) + P(C) P(AB) P(AC) P(BC) +
P(ABC)

Example
A adalah kejadian munculnya bilangan prima saat melempar dadu.
S = {1,2,3,4,5,6}
A = {2,3}
sehingga P(A) = 2/6 = 1/3
B adalah kejadian munculnya bilangan genap saat melempar dadu.
B = {2,4,6} sehingga P(B) = 3/6 = 1/2
(AB) = {2} sehingga P(AB) = 1/6
(AB) = {2,3,4,6} sehingga P(AB) = 4/6 = 2/3

P(AB) = P(A) + P(B) P(AB) A dan B tidak disjoint


= 2/6 + 3/6 1/6 = 4/6 = 2/3

Conditional Probability
P(A|B) , the conditional probability of A given that B has
occurred.
It is reasonable to assume that in this example, P(A|B)
should be larger than the original P(B), which is called
the prior probability of B.

Conditional Probability (cont.)


Example
I roll a fair die. Let A be the event that the outcome is an odd number, i.e.,
A={1,3,5}. Also let B be the event that the outcome is less than or equal to 3,
i.e., B={1,2,3}. What is the probability of A, P(A)? What is the probability of A
given B, P(A|B)?
Solution

Conditional Probability (cont.)


When we know that B has occurred, every outcome that is
outside B should be discarded. Thus, our sample space is
reduced to the set B.
Now the only way that A can happen is when the outcome
belongs to the set AB.
We divide P(AB) by P(B), so that the conditional probability of
the new sample space becomes 1, i.e., P(B|B) = P(BB)P(B) = 1.
Note that conditional probability of P(A|B) is undefined when
P(B)=0. That is okay because if P(B)=0, it means that the event B
never occurs so it does not make sense to talk about the
probability of A given B.

Conditional Probability (cont.)

Chain
Rule

sehingga

Chain

Contingency Table
X

Total of Rows

C
C
Total of
Column

The values inside the table are called interior


probabilities and represent the intersection of events.
The sum of the rows and colums are displayed as Totals
and called marginal probabilities because they lie on
the margin of the table.

This table shows hypothetical probablities of a disk crash


using brand X drive within a year.
Brand X

Not Brand X

Total of Rows

Crash C

0.6

0.1

0.7

No Crash C

0.2

0.1

0.3

Total of Colums

0.8

0.2

Tentukan:
Probabilitas disk crash saat disk yang digunakan adalah
Brand X.
Probabilitas Brand X saat disk yang digunakan crash.

Independent
Two events are independent if one does not convey any
information about the other.
Example:
A : an event that it rains tomorrow, and suppose that P(A) = 1/3.
B : an event that it lands heads up when flipping a coin, so we
have P(B) = 1/2.
A and B are two independent events, because the result of coin
toss does not have anything to do with tomorrow's weather.

Independent (cont.)
Thus,
if two events A and B are independent and P(B)0, then

P(A|B) = P(A)
Two events A and B are independent if
P(AB) = P(A).P(B)

We can say "independence means we can multiply the


probabilities of events to obtain the probability of their
intersection",
or equivalently, "independence means that conditional probability
of one event given another is the same as the original (prior)

Independent (cont.)
Example:
I pick a random number from {1,2,3,,10}, and call it N. Suppose
that all outcomes are equally likely. Let A be the event that N is
less than 7, and let B be the event that N is an even number. Are A
and B independent?

Disjoint vs. Independent


Lemma
Consider two events A and B, with P(A)0 and P(B)0.
If A and B are disjoint, then they are not independent.
Proof
Since A and B are disjoint, we have
P(AB) = 0 P(A)P(B)
Thus, A and B are not independent because P(AB) P(A)P(B).

Disjoint vs. Independent (cont.)


Concept

Meaning

Disjoint

AandBcannot occur at the same AB=,


time
P(AB)=P(A)+P(B)

Independe
nt

Adoes not give any information


aboutB

Formulas

P(A|B)=P(A),
P(B|A)=P(B)
P(AB)=P(A)P(B)

Law of Total Probability


In particular, if you want to find P(A), you can look at a partition of
S, and add the amount of probability of A that falls in each partition.
We have already seen the special case where the partition is B and
Bc.
We saw that for any two events A and B,

P(A) = P(AB) + P(ABc)


Using the definition of conditional probability,
P(AB) = P(A|B)P(B)
we can write
P(A) = P(A|B)P(B) + P(A|Bc)P(Bc).

Law of Total Probability (cont.)

Law of Total Probability (cont.)


Example:
I have three bags that each contain 100 marbles:
Bag 1 has 75 red and 25 blue marbles;
Bag 2 has 60 red and 40 blue marbles;
Bag 3 has 45 red and 55 blue marbles.
I choose one of the bags at random and then pick a marble from
the chosen bag, also at random. What is the probability that the
chosen marble is red?

Law of Total Probability (cont.)


Solution

Conditional Independence
Two events A and B are conditionally independent given
an event C with P(C)>0 if
P(AB|C) = P(A|C)P(B|C)
If A and B are conditionally independent given C, then
P(A|B,C) = P(A|C)

Conditional Independence (cont.)


Recall
that from the definition of conditional probability,

P(A|B) = P(AB)P(B), if P(B)>0.


By conditioning on CC, we obtain
P(A|B,C) = P(AB|C)P(B|C), if P(B|C),P(C)0.
If A and B are conditionally independent given C, we obtain

Conditional Independence (cont.)


Example:
A box contains two coins: a regular coin and one fake two-headed coin
(P(H)=1).
I choose a coin at random and toss it twice. Define the following events.
A= First coin toss results in an H.
B= Second coin toss results in an H.
C= Coin 1 (regular) has been selected.
We have P(A|C) = P(B|C) = 1/2
Also, given that Coin 1 is selected, we have P(AB|C) = (1/2).(1/2) = 1/4

Conditional Independence (cont.)


Consider rolling a die and let
A = {1,2} B = {2,4,6}

C = {1,4}

Then, we have
P(A) = 1/3 P(B) = 1/2
P(AB) = 1/6 = P(A)P(B)

thus, A and B are independent

But we have
P(A|C) = 1/2

P(B|C) = 1/2

P(AB|C) = P({2}|C)=0

thus P(AB|C) P(A|C)P(B|C)

which means A and B are not conditionally independent given C.

You might also like