Sample Size and Power Calculations

Sample size calculations
n = ($ available) / ($ per sample)
Power
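$X_1, \dots, X_n \sim \text{iid Normal}(\mu_A, \sigma_A)$ and $Y_1, \dots, Y_m \sim \text{iid Normal}(\mu_B, \sigma_B)$
Test $H_0: \mu_A = \mu_B$ vs $H_a: \mu_A \neq \mu_B$ at $\alpha = 0.05$.
Test statistic: $T = \dfrac{\bar{X} - \bar{Y}}{\widehat{\text{SD}}(\bar{X} - \bar{Y})}$.
−→ Critical value: C such that $\Pr(|T| > C \mid \mu_A = \mu_B) = \alpha$.
Power: $\Pr(|T| > C \mid \mu_A \neq \mu_B)$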

Power
[Figure: power illustrated on an axis marked at −C, 0, ∆, and C.]
Power depends on...
• The design of your experiment
• What test you’re doing
• Chosen significance level, α
• Sample size
• True difference, µA − µB
• Population SDs, σA and σB.

The case of known population SDs
Suppose σA and σB are known.

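Then $\bar{X} - \bar{Y} \sim \text{Normal}\left(\mu_A - \mu_B,\ \sqrt{\dfrac{\sigma_A^2}{n} + \dfrac{\sigma_B^2}{m}}\right)$
Test statistic: $\tilde{Z} = \dfrac{\bar{X} - \bar{Y}}{\sqrt{\dfrac{\sigma_A^2}{n} + \dfrac{\sigma_B^2}{m}}}$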
If H0 is true (i.e. µA = µB), we have Z̃ ∼ Normal(0,1).
−→ C = zα/2 so that Pr(|Z̃ | > C | µA = µB) = α.
For example, for α = 0.05, C = qnorm(0.975) = 1.96.
Power when the population SDs are known

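If $\mu_A - \mu_B = \Delta$, then $Z = \dfrac{(\bar{X} - \bar{Y}) - \Delta}{\sqrt{\dfrac{\sigma_A^2}{n} + \dfrac{\sigma_B^2}{m}}} \sim \text{Normal}(0,1)$
$$
\begin{aligned}
\Pr\left(\frac{|\bar{X} - \bar{Y}|}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}} > 1.96\right)
&= \Pr\left(\frac{\bar{X} - \bar{Y}}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}} > 1.96\right)
 + \Pr\left(\frac{\bar{X} - \bar{Y}}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}} < -1.96\right) \\
&= \Pr\left(\frac{\bar{X} - \bar{Y} - \Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}} > 1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
 + \Pr\left(\frac{\bar{X} - \bar{Y} - \Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}} < -1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right) \\
&= \Pr\left(Z > 1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
 + \Pr\left(Z < -1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
\end{aligned}
$$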
Calculations in R
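$$
\text{Power} = \Pr\left(Z > 1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
 + \Pr\left(Z < -1.96 - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
$$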
# Assumes delta, sigmaA, sigmaB, n, and m have already been defined.
C <- qnorm(0.975)
se <- sqrt( sigmaA^2/n + sigmaB^2/m )
power <- 1-pnorm(C-delta/se) + pnorm(-C-delta/se)
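As a minimal sketch, the three lines above can be wrapped in a function and evaluated over a grid of effects. The helper name power_known_sd and the choice σA = σB = 1 with n = m are assumptions made for illustration only; they are not part of the slides.

```r
# Hypothetical helper (name not from the slides): power of the two-sided
# z-test for known population SDs, using the formula above.
power_known_sd <- function(delta, n, m, sigmaA, sigmaB, alpha = 0.05) {
  C  <- qnorm(1 - alpha / 2)                # critical value; 1.96 when alpha = 0.05
  se <- sqrt(sigmaA^2 / n + sigmaB^2 / m)   # SD of Xbar - Ybar
  pnorm(C - delta / se, lower.tail = FALSE) + pnorm(-C - delta / se)
}

# Power at a grid of effects (in units of sigma = 1) for three sample sizes.
delta_grid <- seq(-2, 2, by = 0.5)
power_table <- sapply(c(5, 10, 20), function(n)
  power_known_sd(delta_grid, n = n, m = n, sigmaA = 1, sigmaB = 1))
dimnames(power_table) <- list(delta = delta_grid, n = c(5, 10, 20))
round(power_table, 3)
```

At ∆ = 0 the function returns α, as it should: under H0 the "power" is just the type I error rate.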
Power curves
[Figure: power (%, 0 to 100) versus ∆ from −2σ to 2σ, with curves for n = 5, n = 10, and n = 20.]
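Power depends on...
$$
\text{Power} = \Pr\left(Z > C - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
 + \Pr\left(Z < -C - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
$$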
• Choice of α (which affects C)

Larger α → less stringent → greater power.
• ∆ = µA − µB = the true “effect.”

Larger |∆| → greater power.
• Population SDs, σA and σB

Smaller σA and σB → greater power.
• Sample sizes, n and m

Larger n, m → greater power.
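The four effects listed above can be checked numerically. The following is a sketch under assumed baseline values (∆ = 1, σA = σB = 2, n = m = 10, α = 0.05), which are illustrative and not taken from the slides; power_z is a hypothetical helper implementing the known-SD power formula.

```r
# Hypothetical helper: two-sided z-test power for known population SDs.
power_z <- function(delta, n, m, sigmaA, sigmaB, alpha = 0.05) {
  C  <- qnorm(1 - alpha / 2)
  se <- sqrt(sigmaA^2 / n + sigmaB^2 / m)
  pnorm(C - delta / se, lower.tail = FALSE) + pnorm(-C - delta / se)
}

# Assumed baseline, then one change at a time.
baseline <- power_z(delta = 1, n = 10, m = 10, sigmaA = 2, sigmaB = 2)
variants <- c(
  larger_alpha = power_z(1, 10, 10, 2, 2, alpha = 0.10),
  larger_delta = power_z(2, 10, 10, 2, 2),
  smaller_sds  = power_z(1, 10, 10, 1, 1),
  larger_n_m   = power_z(1, 40, 40, 2, 2)
)
round(c(baseline = baseline, variants), 3)
```

Each variant should come out larger than the baseline, matching the direction of the arrows in the list.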
Choice of sample size
We mostly influence power via n and m.

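Power is greatest when $\dfrac{\sigma_A^2}{n} + \dfrac{\sigma_B^2}{m}$ is as small as possible.
Suppose the total sample size N = n + m is fixed.
−→ $\dfrac{\sigma_A^2}{n} + \dfrac{\sigma_B^2}{m}$ is minimized when $n = \dfrac{\sigma_A}{\sigma_A + \sigma_B} \times N$ and $m = \dfrac{\sigma_B}{\sigma_A + \sigma_B} \times N$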
For example:
• If σA = σB, we should choose n = m.

• If σA = 2 σB, we should choose n = 2 m.
That means, if σA = 4 and σB = 2, we might use n=20 and m=10.
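A short sketch of this allocation rule, with a brute-force check over all integer splits of N. The function name optimal_allocation and the total N = 30 are assumptions for illustration; σA = 4 and σB = 2 are the values from the example above.

```r
# Hypothetical helper: split a fixed total N in proportion to the SDs.
optimal_allocation <- function(N, sigmaA, sigmaB) {
  n <- N * sigmaA / (sigmaA + sigmaB)
  c(n = n, m = N - n)
}

alloc <- optimal_allocation(N = 30, sigmaA = 4, sigmaB = 2)
alloc

# Numerical check: variance of Xbar - Ybar for every split n + m = 30.
n_grid   <- 1:29
variance <- 4^2 / n_grid + 2^2 / (30 - n_grid)
best_n   <- n_grid[which.min(variance)]
best_n
```

In practice the formula rarely gives whole numbers, so the result would be rounded to the nearest feasible split.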
Calculating the sample size
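Suppose we seek 80% power to detect a particular value of µA − µB = ∆, in the case that σA and σB are known.
(For convenience here, let’s pretend that σA = σB = σ and that we plan to have equal sample sizes, n = m, for the two groups.)
Taking ∆ > 0, the lower-tail term of the power formula is negligible, so
$$
\text{Power} \approx \Pr\left(Z > C - \frac{\Delta}{\sqrt{\frac{\sigma_A^2}{n} + \frac{\sigma_B^2}{m}}}\right)
 = \Pr\left(Z > 1.96 - \frac{\Delta\sqrt{n}}{\sigma\sqrt{2}}\right)
$$
−→ Find n such that $\Pr\left(Z > 1.96 - \dfrac{\Delta\sqrt{n}}{\sigma\sqrt{2}}\right) = 80\%$.
Thus $1.96 - \dfrac{\Delta\sqrt{n}}{\sigma\sqrt{2}}$ = qnorm(0.2) = −0.842.
−→ $\sqrt{n} = \dfrac{\sigma}{\Delta}\,\{1.96 - (-0.842)\}\,\sqrt{2}$ −→ $n = 15.7 \times \left(\dfrac{\sigma}{\Delta}\right)^2$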
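The closed-form result can be sketched as a small function that generalizes the constants 1.96 and 0.842 to any α and target power. The name n_per_group is hypothetical, and ∆ = 5 with σ = 10 are illustrative inputs, chosen to match the values used in the power.t.test() examples elsewhere in these slides.

```r
# Hypothetical helper: per-group sample size for a two-sided z-test with
# known, equal SDs and equal group sizes (upper-tail approximation).
n_per_group <- function(delta, sigma, alpha = 0.05, power = 0.80) {
  z_alpha <- qnorm(1 - alpha / 2)   # 1.96 for alpha = 0.05
  z_power <- qnorm(power)           # 0.842 for 80% power; equals -qnorm(0.2)
  2 * (z_alpha + z_power)^2 * (sigma / delta)^2
}

n_exact <- n_per_group(delta = 5, sigma = 10)   # 15.7 * (10/5)^2
n_exact
ceiling(n_exact)                                # round up to a whole subject
```

This treats σ as known; a t-test with estimated SD needs slightly more subjects per group.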
Equal but unknown population SDs
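$X_1, \dots, X_n \sim \text{iid Normal}(\mu_A, \sigma)$ and $Y_1, \dots, Y_m \sim \text{iid Normal}(\mu_B, \sigma)$
$\hat{\sigma}_p = \sqrt{\dfrac{s_A^2 (n-1) + s_B^2 (m-1)}{n + m - 2}}$ and $\widehat{\text{SD}}(\bar{X} - \bar{Y}) = \hat{\sigma}_p \sqrt{\dfrac{1}{n} + \dfrac{1}{m}}$
Test statistic: $T = \dfrac{\bar{X} - \bar{Y}}{\widehat{\text{SD}}(\bar{X} - \bar{Y})}$.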
In the case µA = µB, T follows a t distribution with n + m – 2 d.f.
−→ Critical value: C = qt(0.975,n+m-2)
Power: equal but unknown pop’n SDs

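$$
\text{Power} = \Pr\left(\frac{|\bar{X} - \bar{Y}|}{\hat{\sigma}_p \sqrt{\frac{1}{n} + \frac{1}{m}}} > C\right)
$$
−→ In the case µA − µB = ∆, the statistic $\dfrac{\bar{X} - \bar{Y}}{\hat{\sigma}_p \sqrt{\frac{1}{n} + \frac{1}{m}}}$ follows a non-central t distribution.
This distribution has two parameters:
−→ The degrees of freedom (as before)
−→ The non-centrality parameter, $\dfrac{\Delta}{\sigma \sqrt{\frac{1}{n} + \frac{1}{m}}}$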
C <- qt(0.975, n + m - 2)
se <- sigma * sqrt( 1/n + 1/m )
power <- 1 - pt(C, n+m-2, ncp=delta/se) +
pt(-C, n+m-2, ncp=delta/se)
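As a rough check, the non-central t calculation above can be wrapped in a function and compared with R's built-in power.t.test(). The helper name power_t_equal_sd and the inputs (∆ = 5, σ = 10, n = m = 10) are assumptions for illustration. The comparison uses strict = TRUE because, by default, power.t.test() counts only the upper-tail rejection probability in the two-sided case.

```r
# Hypothetical helper: two-sided, equal-variance t-test power via the
# non-central t distribution.
power_t_equal_sd <- function(delta, sigma, n, m, alpha = 0.05) {
  df  <- n + m - 2
  C   <- qt(1 - alpha / 2, df)           # critical value
  se  <- sigma * sqrt(1 / n + 1 / m)
  ncp <- delta / se                      # non-centrality parameter
  pt(C, df, ncp = ncp, lower.tail = FALSE) + pt(-C, df, ncp = ncp)
}

by_hand  <- power_t_equal_sd(delta = 5, sigma = 10, n = 10, m = 10)
built_in <- power.t.test(n = 10, delta = 5, sd = 10, strict = TRUE)$power
c(by_hand = by_hand, built_in = built_in)
```

The two numbers should agree closely; without strict = TRUE the built-in value is slightly smaller because the lower tail is dropped.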
Power: equal population SDs
Power curves
[Figure: power (%, 0 to 100) versus ∆ from −2σ to 2σ for n = 5, n = 10, and n = 20, comparing known SDs with unknown SDs.]
A built-in function: power.t.test()
Calculate power (or determine the sample size) for the t-test when:
• Sample sizes equal
• Population SDs equal
Arguments:
• n = sample size
• delta = ∆ = µA − µB = true difference in means
• sd = σ = population SD
• sig.level = α = significance level
• power = the power
• type = type of data (two-sample, one-sample, paired)
• alternative = two-sided or one-sided test
Examples
A. n = 10 for each group; effect = ∆ = 5; pop’n SD = σ = 10

power.t.test(n=10, delta=5, sd=10)
−→ 18%
B. power = 80%; effect = ∆ = 5; pop’n SD = σ = 10

power.t.test(delta=5, sd=10, power=0.8)
−→ n = 63.8 −→ 64 for each group
C. power = 80%; effect = ∆ = 5; pop’n SD = σ = 10; one-sided

power.t.test(delta=5, sd=10, power=0.8,
alternative="one.sided")
−→ n = 50.2 −→ 51 for each group
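A sketch of how the rounding step in examples B and C might be scripted: power.t.test() returns a list, so the fractional n can be extracted, rounded up, and the achieved power rechecked. The variable names are illustrative.

```r
# Example B: solve for n, then round up to a whole number per group.
res_two_sided <- power.t.test(delta = 5, sd = 10, power = 0.8)
n_two_sided   <- ceiling(res_two_sided$n)

# Example C: the same target with a one-sided alternative.
res_one_sided <- power.t.test(delta = 5, sd = 10, power = 0.8,
                              alternative = "one.sided")
n_one_sided   <- ceiling(res_one_sided$n)

# Power actually achieved with the rounded-up two-sided sample size.
achieved <- power.t.test(n = n_two_sided, delta = 5, sd = 10)$power

c(n_two_sided = n_two_sided, n_one_sided = n_one_sided, achieved = achieved)
```

Rounding up rather than to the nearest integer keeps the achieved power at or above the 80% target.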
Unknown and different pop’n SDs
$X_1, \dots, X_n \sim \text{iid Normal}(\mu_A, \sigma_A)$ and $Y_1, \dots, Y_m \sim \text{iid Normal}(\mu_B, \sigma_B)$
Test statistic: $T = \dfrac{\bar{X} - \bar{Y}}{\sqrt{\dfrac{s_A^2}{n} + \dfrac{s_B^2}{m}}}$
To calculate the critical value for the test, we need the null distribution of T (that is, the distribution of T if µA = µB).
To calculate the power, we need the distribution of T given the value of ∆ = µA − µB.
We don’t really know either of these.

Power by computer simulation
• Specify n, m, σA, σB, ∆ = µA − µB, and the significance level, α.
• Simulate data under the model.
• Perform the proposed test and calculate the P-value.
• Repeat many times.
−→ Example:
n = 5, m = 10, σA = 1, σB = 2,
∆ = 0.0, 0.5, 1.0, 1.5, 2.0 or 2.5.
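The steps above can be sketched as follows for the example settings. Two things here are assumptions, not stated in the slides: the "proposed test" is taken to be Welch's unequal-variance t-test (t.test() with var.equal = FALSE), and 2000 replicates per ∆ is an arbitrary choice. The function name sim_power is hypothetical.

```r
# Hypothetical helper: estimate power as the fraction of simulated
# data sets whose P-value falls below alpha.
sim_power <- function(n, m, sigmaA, sigmaB, delta, alpha = 0.05, n_sim = 2000) {
  pvals <- replicate(n_sim, {
    x <- rnorm(n, mean = delta, sd = sigmaA)   # group A, shifted by delta
    y <- rnorm(m, mean = 0,     sd = sigmaB)   # group B
    t.test(x, y, var.equal = FALSE)$p.value    # assumed test: Welch t-test
  })
  mean(pvals < alpha)
}

set.seed(1)   # for reproducibility of this sketch
deltas <- c(0, 0.5, 1.0, 1.5, 2.0, 2.5)
est <- sapply(deltas, function(d)
  sim_power(n = 5, m = 10, sigmaA = 1, sigmaB = 2, delta = d))
names(est) <- deltas
est
```

The estimate at ∆ = 0 is the simulated type I error rate, which gives a check on whether the test holds its nominal level. More replicates reduce the Monte Carlo error in each estimate.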
Example
[Figure: six panels of simulated P-values on a 0 to 1 axis, one each for ∆ = 0, 0.5, 1.0, 1.5, 2.0, and 2.5.]
Example
[Figure: estimated power (%) versus ∆, for ∆ from 0.0 to 2.5.]
Determining sample size
The things you need to know:
• Structure of the experiment

• Method for analysis
• Chosen significance level, α (usually 5%)

• Desired power (usually 80%)
• Variability in the measurements
→ If necessary, perform a pilot study, or use data from prior experiments or publications.
• The smallest meaningful effect

Reducing sample size
• Reduce the number of treatment groups being compared.
• Find a more precise measurement (e.g., average survival time rather than proportion dead).
• Decrease the variability in the measurements.

◦ Make subjects more homogeneous.
◦ Use stratification.
◦ Control for other variables (e.g., weight).
◦ Average multiple measurements on each subject.
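To illustrate the last point, here is a sketch of how averaging k measurements per subject could shrink the required sample size. It rests on an assumed variance model that is not in the slides: each measurement has between-subject SD 8 and within-subject (measurement) SD 6, so the SD of a subject's average is sqrt(8^2 + 6^2 / k). All names and numbers are hypothetical; the sample size formula is the known-SD approximation n = 15.7 × (σ/∆)².

```r
# Assumed variance components (illustrative values only).
sigma_between <- 8    # subject-to-subject SD
sigma_within  <- 6    # SD of repeated measurements on one subject
delta         <- 5    # smallest meaningful effect

# Per-group n for 80% power at alpha = 0.05 (known-SD approximation).
n_required <- function(sigma, delta) {
  2 * (qnorm(0.975) + qnorm(0.80))^2 * (sigma / delta)^2
}

k <- c(1, 2, 4, 8)                                    # measurements per subject
sigma_avg <- sqrt(sigma_between^2 + sigma_within^2 / k)
n_by_k <- ceiling(n_required(sigma_avg, delta))
data.frame(k = k, sigma_avg = round(sigma_avg, 2), n_per_group = n_by_k)
```

Under this model the gain levels off as k grows, because averaging cannot remove the between-subject variability.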
