Professional Documents
Culture Documents
Tang Yin-cai
yctang@stat.ecnu.edu.cn
— Noninformative
prior distributions
Introduction to (Reference: Gel-
man et. al.
Multiparameter Models (I) 2.9)
Noninformative
prior distributions
Proper and
— Noninformative prior distributions improper priors
Second example
(Reference: Gelman et. al. 2.9) binomial parameter
— Noninformative
prior distributions
(Reference: Gel-
man et. al.
2.9)
Noninformative
prior distributions
Proper and
improper priors
Second example
binomial parameter
Note: Introduction to
Multiparameter
Models (I)
p(σ 2 ) ∝ 1/σ 2 (improper) ⇒ σ 2 |y ∼ Inv-χ2 (n, ν).
— Noninformative
prior distributions
(Reference: Gel-
man et. al.
2.9)
Noninformative
prior distributions
Proper and
improper priors
Second example
binomial parameter
Note: Introduction to
Multiparameter
Models (I)
p(σ 2 ) ∝ 1/σ 2 (improper) ⇒ σ 2 |y ∼ Inv-χ2 (n, ν).
— Noninformative
prior distributions
■ The combination of an improper prior density and (Reference: Gel-
man et. al.
a normal likelihood does not define a proper joint 2.9)
Noninformative
probability model, p(y, θ). prior distributions
Proper and
■ But applying improper priors
Second example
binomial parameter
p(θ|y) ∝ p(y|θ)p(θ),
still gives a proper posterior density.
Note: Introduction to
Multiparameter
Models (I)
p(σ 2 ) ∝ 1/σ 2 (improper) ⇒ σ 2 |y ∼ Inv-χ2 (n, ν).
— Noninformative
prior distributions
■ The combination of an improper prior density and (Reference: Gel-
man et. al.
a normal likelihood does not define a proper joint 2.9)
Noninformative
probability model, p(y, θ). prior distributions
Proper and
■ But applying improper priors
Second example
binomial parameter
p(θ|y) ∝ p(y|θ)p(θ),
Note: Introduction to
Multiparameter
Models (I)
p(σ 2 ) ∝ 1/σ 2 (improper) ⇒ σ 2 |y ∼ Inv-χ2 (n, ν).
— Noninformative
prior distributions
■ The combination of an improper prior density and (Reference: Gel-
man et. al.
a normal likelihood does not define a proper joint 2.9)
Noninformative
probability model, p(y, θ). prior distributions
Proper and
■ But applying improper priors
Second example
binomial parameter
p(θ|y) ∝ p(y|θ)p(θ),
— Noninformative
prior distributions
(Reference: Gel-
man et. al.
2.9)
Noninformative
prior distributions
Proper and
improper priors
Second example
binomial parameter
— Noninformative
Noninformative prior: prior distributions
(Reference: Gel-
man et. al.
p(θ) ∝ Constant, θ ∈ (−∞, ∞). 2.9)
Noninformative
prior distributions
Proper and
improper priors
Second example
binomial parameter
— Noninformative
Noninformative prior: prior distributions
(Reference: Gel-
man et. al.
p(θ) ∝ Constant, θ ∈ (−∞, ∞). 2.9)
Noninformative
prior distributions
Proper and
2. θ is a scale parameter, that is, p(y/θ|θ) is free of improper priors
θ and y. Second example
binomial parameter
Noninformative prior:
1
p(θ) ∝ ∈ (−∞, ∞).
θ
p(log(θ)) ∝ 1.
p(θ2 ) ∝ 1/θ2 . SCHOOL OF FINANCE AND S TAT I S T I C S
— Noninformative
(Noninformative/improper) prior distributions can prior distributions
lead to improper and thus uninterpretable posterior (Reference: Gel-
man et. al.
distributions. 2.9)
Noninformative
prior distributions
Proper and
improper priors
Second example
binomial parameter
— Noninformative
(Noninformative/improper) prior distributions can prior distributions
lead to improper and thus uninterpretable posterior (Reference: Gel-
man et. al.
distributions. 2.9)
Noninformative
prior distributions
All noninformative prior specifications are arbitrary Proper and
improper priors
and if the results are sensitive to the particular Second example
binomial parameter
choice, then more effort in specifying genuine prior
information is required to justify any particular infer-
ence.
(Reference: Gel-
man et. al.
Multiparameter models: Introduction 3.1)
Multiparameter
models
(Reference: Gelman et. al. 3.1) The idea
Computation
Gibbs Sampling
(Reference: Gel-
man et. al.
3.1)
Multiparameter
models
The idea
Computation
Gibbs Sampling
model.
model.
■ At this point the simple conceptual framework of
the Bayesian approach reveals its principal
advantage over other forms of inference.
(Reference: Gel-
man et. al.
3.1)
Multiparameter
models
The idea
Computation
Gibbs Sampling
(Reference: Gel-
man et. al.
3.1)
Multiparameter
models
The idea
Computation
Gibbs Sampling
Alternatively, Multiparameter
models:
Z Introduction
Alternatively, Multiparameter
models:
Z Introduction
(Reference: Gel-
man et. al.
Multiparameter models: Normal mean 3.2)
Normal Dist’n
& variance
iid 2 Multiparameter
■ The data: y ∼ N (µ, σ ). models: Normal
mean & variance
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
iid 2 Multiparameter
■ The data: y ∼ N (µ, σ ). models: Normal
mean & variance
■ Prior distribution: noninformative (which is easily (Reference: Gel-
extended to informative priors). man et. al.
3.2)
Normal Dist’n
We assume prior independence of location and
scale parameters and take p(µ, σ 2 ) to be uniform
on (µ, log(σ 2 )):
p(µ, σ 2 ) ∝ (σ 2 )−1 .
iid 2 Multiparameter
■ The data: y ∼ N (µ, σ ). models: Normal
mean & variance
■ Prior distribution: noninformative (which is easily (Reference: Gel-
extended to informative priors). man et. al.
3.2)
Normal Dist’n
We assume prior independence of location and
scale parameters and take p(µ, σ 2 ) to be uniform
on (µ, log(σ 2 )):
p(µ, σ 2 ) ∝ (σ 2 )−1 .
2
Sufficient statistics:
S
ȳ and
CHOOL OFF
sINANCE AND S TAT I S T I C S
(Reference: Gel-
p(µ, σ 2 |y) = p(µ|σ 2 , y) × p(σ 2 |y) man et. al.
3.2)
Normal Dist’n
(Reference: Gel-
p(µ, σ 2 |y) = p(µ|σ 2 , y) × p(σ 2 |y) man et. al.
3.2)
Normal Dist’n
■ We can use a previous result for the mean of a
normal distribution with known variance.
µ|σ 2 , y ∼ N (ȳ, σ 2 /n). (3.3)
Thus,
2
2 2 −[ n−1 +1] (n − 1)s
p(σ |y) ∝ (σ ) 2 exp − 2
, (3.4)
2σ
Thus,
2
2 2 −[ n−1 +1] (n − 1)s
p(σ |y) ∝ (σ ) 2 exp − 2
, (3.4)
2σ
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
(Reference: Gel-
Bayes: man et. al.
2 3.2)
(n − 1)s 2 Normal Dist’n
2
y ∼ χ n−1 .
σ
(Reference: Gel-
Bayes: man et. al.
2 3.2)
(n − 1)s 2 Normal Dist’n
2
y ∼ χ n−1 .
σ
Frequentist:
2
(n − 1)s 2 2
2
µ, σ ∼ χ n−1 .
σ
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
(Reference: Gel-
man et. al.
Z 3.2)
∞ Normal Dist’n
p(µ|y) ∝ A−n/2 z (n−2)/2 exp(−z)dz
0
∝ [(n − 1)s2 + n(µ − ȳ)2 ]−n/2
2
n(µ − ȳ)
∝ 1+
(n − 1)s2
(Reference: Gel-
man et. al.
Z 3.2)
∞ Normal Dist’n
p(µ|y) ∝ A−n/2 z (n−2)/2 exp(−z)dz
0
∝ [(n − 1)s2 + n(µ − ȳ)2 ]−n/2
2
n(µ − ȳ)
∝ 1+
(n − 1)s2
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
(Reference: Gel-
man et. al.
3.2)
Normal Dist’n
— Example:
Bioassay
Introduction to experiment
References:
Multiparameter Models (II) (1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
— Example: Bioassay experiment Multiparameter
models
References: Bioassay example
Example Data
(1) Gelman et. al. 3.7 Sampling Model
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
■ Data analysis for such models is usually Bioassay
experiment
achieved with simulation (especially MCMC References:
(1) Gelman
methods). et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
■ Data analysis for such models is usually Bioassay
experiment
achieved with simulation (especially MCMC References:
(1) Gelman
methods). et. al. 3.7
(2) Jim
■ We will illustrate with a nonconjugate model for Albert, Chapter 4
Multiparameter
bioassay experiment using a two-parameter models
generalized linear model. Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
■ Various dose(Jþ) levels of the compound are Bioassay
experiment
administered() to batches of animals. References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
■ Various dose(Jþ) levels of the compound are Bioassay
experiment
administered() to batches of animals. References:
(1) Gelman
et. al. 3.7
■ Animals responses typically characterized by a (2) Jim
Albert, Chapter 4
binary outcome: alive or dead, tumor or no Multiparameter
tumor, response or no response etc. models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
Bioassay
Dose, xi Number of Number of experiment
References:
(log g/ml) animals, ni deaths, yi (1) Gelman
et. al. 3.7
-0.86 5 0 (2) Jim
Albert, Chapter 4
-0.30 5 1 Multiparameter
models
-0.05 5 3 Bioassay example
Example Data
0.73 5 5 Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
Bioassay
Dose, xi Number of Number of experiment
References:
(log g/ml) animals, ni deaths, yi (1) Gelman
et. al. 3.7
-0.86 5 0 (2) Jim
Albert, Chapter 4
-0.30 5 1 Multiparameter
models
-0.05 5 3 Bioassay example
Example Data
0.73 5 5 Sampling Model
Model across dose
levels
Source: Racine A, Grieve AP, Fluhler H, Smith AFM. (1986). Approximation
Approximation
Bayesian methods in practice: experiences in the pharmaceutical Posterior inference
Results
industry (with discussion). Applied Statistics 35, 93-150.
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
xi .s. — Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
xi .s. — Example:
Bioassay
experiment
■ Note:We start by assuming that each θi is an References:
(1) Gelman
independent parameter. We relax this et. al. 3.7
assumption when we develop hierarchical (2) Jim
Albert, Chapter 4
models (See Chapter 5). Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
xi .s. — Example:
Bioassay
experiment
■ Note:We start by assuming that each θi is an References:
(1) Gelman
independent parameter. We relax this et. al. 3.7
assumption when we develop hierarchical (2) Jim
Albert, Chapter 4
models (See Chapter 5). Multiparameter
models
Bioassay example
■ There are many possibilities for relating the θi ’s Example Data
Sampling Model
to the xi .s, but a popular and reasonable choice Model across dose
is a logistic regression model: levels
Approximation
Approximation
θi Posterior inference
logit(θi ) = log = α + βxi Results
1 − θi
SCHOOL OF FINANCE AND S TAT I S T I C S
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Figure 1: (a) Contour plot of the posterior density of the pa- Approximation
rameters α and β. (b) Scatterplotof 1000 draws from the pos- Approximation
Posterior inference
terior distribution Results
2-dimensions: — Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
2-dimensions: — Example:
Bioassay
experiment
1. We begin with a rough estimate of the References:
(1) Gelman
parameters. et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
2-dimensions: — Example:
Bioassay
experiment
1. We begin with a rough estimate of the References:
(1) Gelman
parameters. et. al. 3.7
(2) Jim
Albert, Chapter 4
■ Since Multiparameter
models
logit(E[(yi /ni )|α, β]) = α + βxi , Bioassay example
Example Data
Sampling Model
we obtain rough estimates of α and β using a Model across dose
linear regression of logit(yi /ni ) on xi . levels
Approximation
■ Set y1 = 0.5, y4 = 4.5 to enable calculation. Approximation
Posterior inference
Results
■ α̂ = 0.1, β̂ = 2.9 (standard errors 0.3 and 0.5).
SCHOOL OF FINANCE AND S TAT I S T I C S
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
P P (1) Gelman
4. Renormalize on grid so that α β p(α, β|y) = 1 et. al. 3.7
(2) Jim
(i.e., create discrete approx to posterior) Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
P P (1) Gelman
4. Renormalize on grid so that α β p(α, β|y) = 1 et. al. 3.7
(2) Jim
(i.e., create discrete approx to posterior) Albert, Chapter 4
Multiparameter
5. Sample from marginal Pdistribution of one models
Bioassay example
parameter: p(α|y) = β p(α, β|y) Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
P P (1) Gelman
4. Renormalize on grid so that α β p(α, β|y) = 1 et. al. 3.7
(2) Jim
(i.e., create discrete approx to posterior) Albert, Chapter 4
Multiparameter
5. Sample from marginal Pdistribution of one models
Bioassay example
parameter: p(α|y) = β p(α, β|y) Example Data
Sampling Model
6. Sample from conditional distribution of second Model across dose
levels
parameter: p(β|α, y) Approximation
Approximation
Posterior inference
Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
P P (1) Gelman
4. Renormalize on grid so that α β p(α, β|y) = 1 et. al. 3.7
(2) Jim
(i.e., create discrete approx to posterior) Albert, Chapter 4
Multiparameter
5. Sample from marginal Pdistribution of one models
Bioassay example
parameter: p(α|y) = β p(α, β|y) Example Data
Sampling Model
6. Sample from conditional distribution of second Model across dose
levels
parameter: p(β|α, y) Approximation
Approximation
7. We can improve sampling slightly by drawing Posterior inference
from linear interpolation between grid points. Results
— Example:
3. Use R to produce a contour plot (lines of equal Bioassay
experiment
posterior density). References:
P P (1) Gelman
4. Renormalize on grid so that α β p(α, β|y) = 1 et. al. 3.7
(2) Jim
(i.e., create discrete approx to posterior) Albert, Chapter 4
Multiparameter
5. Sample from marginal Pdistribution of one models
Bioassay example
parameter: p(α|y) = β p(α, β|y) Example Data
Sampling Model
6. Sample from conditional distribution of second Model across dose
levels
parameter: p(β|α, y) Approximation
Approximation
7. We can improve sampling slightly by drawing Posterior inference
from linear interpolation between grid points. Results
— Example:
Bioassay
experiment
References:
(1) Gelman
et. al. 3.7
(2) Jim
Albert, Chapter 4
Multiparameter
models
Bioassay example
Example Data
Sampling Model
Model across dose
levels
Approximation
Approximation
Posterior inference
Results
(Reference: Gel-
man et. al.
3.8)
Lessons from simple examples Lessons
five steps
Lessons from
The lack of multiparameter models with explicit simple examples
posterior distributions not necessarily a barrier to (Reference: Gel-
analysis. man et. al.
3.8)
Lessons
five steps
Lessons from
The lack of multiparameter models with explicit simple examples
posterior distributions not necessarily a barrier to (Reference: Gel-
analysis. man et. al.
3.8)
Lessons
We can use simulation, maybe after replacing five steps
(Reference: Gel-
man et. al.
3.8)
Lessons
five steps
(Reference: Gel-
man et. al. Chapter
APPENDIX: Large sample Bayesian 4)
Normal
inference approximations
Transformations
(Reference: Gel-
man et. al. Chapter
4)
Normal
approximations
Transformations
APPENDIX: Large
Transformations: in many cases convergence to sample Bayesian
normality of posterior distribution for θ can be dra- inference
APPENDIX: Large
Transformations: in many cases convergence to sample Bayesian
normality of posterior distribution for θ can be dra- inference
APPENDIX: Large
Transformations: in many cases convergence to sample Bayesian
normality of posterior distribution for θ can be dra- inference
(Reference: Gel-
man et. al. Chapter
4)
Normal
approximations
Transformations