Professional Documents
Culture Documents
Contents
1 Convergence of Sequence of Measurable Functions 1
• I’ll start with the simplest non-trivial probability space: (Ω1 , 2Ω1 , µ1 ), where
Ω1 = {H, T }, µ1 ({H}) = µ1 ({T }) = 12 .
My point: different random variables are only different ways to assign numbers
to events. They do not change the whole space nor the probability measure.
• Bernoulli random variable review: X ∼ Bernoulli(p) means P (X = 1) = p
and P (X = 0) = 1 − p. So it is more than just a coin tossing distribution: it
must send the two possible outcomes to numbers 0 and 1.
• The point: X̃n is just Xn defined for the product space so we don’t need to make
any distinction in practice.
• X̃n s are defined on the same probability spaces now, so they can be compared.
• Apparently Xn 6= Xm in general. There is only one exception: Xn (ωn ) =
Xm (ωm ) = const. for all ωn ∈ Ωn and ωm ∈ Ωm . It turns out to be the case
for the strong law of large numbers (SLLN).
• SLLN
Pn (without proof, just state the conclusion for a special case): Let Zn be
1
n i=1 Xi . Demonstrate the behavior of Zn up to n = 3.
• Back to the homework #6, problem 1. It asks you to prove a.s. convergence.
Without any handy theorems/tools, you must start from scratch, that is, proof
that for almost surely every ω, X1 (ω), X2 (ω), . . . as a sequence of real numbers
converges.
• Homework #7, problem 2. Limits are defined for ω ∈ Ω∞ . You must use count-
ably many set operations of rectangles (essentially finitely dimensional rectan-
gles) to defined those sets.
2
Convergence in measure/probability
µ
• fn → f iff ∀ > 0, µ({ω : |fn (ω) − f (ω)| ≥ ω}) → 0.
• Convergence in measure says that the measure of “not-convergent” points shrinks
to zero. Or in probability theory: the probability of seeing “outlier” (those ω such
that |fn (ω) − f (ω)| > ) decreases to zero.
• It looks awfully like a.e. convergence! Counter example: shrinking but bouncy
indicators.
Definition
w
Let P1 , P2 , . . . be probability measures on Ω. Pn → P iff any one of the following
equivalent conditions hold:
• Fn (x) → F (x) for all continuous points (including ±∞). (Durrett book defini-
tion)
• µn (A) → µ(A) for all continuity sets A of P , which are sets such that µ(∂A) =
0.
R R
• Ω f dµn → Ω f dµ, for all bounded, continuous functions.
• Several other criteria. See Thm 2.8.1. in Ash’s book.
• We say a sequence of r.v.s X1 , X2 , . . . converges weakly (converges in distri-
bution) to X∞ if the distribution functions Fn associated with Xn converges
weakly to that of X∞ .
• This topic will be re-studied in the CLT chapter.