3. Tests of the Variance in the Normal Model

Preliminaries

Suppose that X₁, X₂, ..., X_n is a random sample from the normal distribution with mean μ and variance d². In this section we will construct hypothesis tests for d. The key tools that we will need are the sample mean and the sample variance, and special properties of these statistics when the sampling distribution is normal. This section parallels the section on Estimation of the Variance in the Normal Model in the chapter on Interval Estimation.

The mean μ will play the role of a nuisance parameter, in the sense that the test procedure is different, depending on whether μ is known or unknown.

We will first assume that the mean μ is known, even though this is usually not a realistic assumption in applications. In this case, the parameter space is {d: d > 0} and all hypotheses about d define subsets of this space. For a specified test value d₀ > 0 of the standard deviation, a natural test statistic is

V₀ = (1 / d₀²) Σ_{i = 1, ..., n} (X_i - μ)².

Note that W² = d₀² V₀ / n is the natural estimator of the variance when μ is known.

$Mathematical Exercise$ 1. Show that if d₀ = d, V₀ has the chi-square distribution with n degrees of freedom.

Consider now the more realistic case in which μ is also unknown. In this case, the underlying parameter space is {(μ, d): μ in R, d > 0}, and all hypotheses about d define subsets of this space. A natural test statistic is

V₀ = (1 / d₀²) Σ_{i = 1, ..., n} (X_i - M)²,

where M = (1 / n) Σ_{i = 1, ..., n} X_i is the sample mean. Note that S² = d₀² V₀ / (n - 1) is the sample variance.

$Mathematical Exercise$ 2. Show that if d₀ = d, V₀ has the chi-square distribution with n - 1 degrees of freedom.
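Both versions of the test statistic can be computed directly from a sample. The following is a minimal Python sketch; the function names `v0_known_mean` and `v0_unknown_mean` and the five data values are hypothetical, chosen only for illustration.

```python
def v0_known_mean(xs, mu, d0):
    """V0 = (1 / d0^2) * sum of (x_i - mu)^2, used when the mean mu is known."""
    return sum((x - mu) ** 2 for x in xs) / d0 ** 2


def v0_unknown_mean(xs, d0):
    """V0 = (1 / d0^2) * sum of (x_i - M)^2, where M is the sample mean."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / d0 ** 2


# Hypothetical sample and test value, for illustration only.
xs = [4.1, 5.3, 4.8, 6.0, 5.2]
n = len(xs)
d0 = 1.0

v_known = v0_known_mean(xs, mu=5.0, d0=d0)  # mean assumed known and equal to 5.0
v_unknown = v0_unknown_mean(xs, d0=d0)      # mean replaced by the sample mean M

# The estimators W^2 and S^2 are recovered from the statistics as in the text.
w2 = d0 ** 2 * v_known / n
s2 = d0 ** 2 * v_unknown / (n - 1)
print(v_known, v_unknown, w2, s2)
```

Since M minimizes the sum of squared deviations, the statistic computed about M is never larger than the one computed about a fixed value of the mean.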

Hypothesis Tests

Hypothesis tests for d work the same way, whether μ is known or unknown; the only difference is the definition of the test statistic V₀ and the number of degrees of freedom in the chi-square distribution. We will let v_{k, p} denote the quantile of order p for the chi-square distribution with k degrees of freedom. If μ is known, we let k = n; if μ is unknown, we let k = n - 1. For selected values of k and p, v_{k, p} can be obtained from the table of the chi-square distribution.
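When a table is not at hand, the quantiles v_{k, p} can also be approximated numerically. The sketch below uses only the Python standard library: it evaluates the chi-square distribution function through the series for the regularized incomplete gamma function and inverts it by bisection. The helper names `chi2_cdf` and `chi2_quantile` are assumptions made for this example, not functions of any library.

```python
import math


def chi2_cdf(x, k):
    """Chi-square distribution function F_k(x), via the incomplete gamma series."""
    if x <= 0:
        return 0.0
    a, z = k / 2.0, x / 2.0
    term = total = 1.0 / a
    n = 0
    while term > 1e-16 * total:
        n += 1
        term *= z / (a + n)
        total += term
    return total * math.exp(-z + a * math.log(z) - math.lgamma(a))


def chi2_quantile(p, k):
    """Quantile v_{k, p}: the x with F_k(x) = p, found by bisection (0 < p < 1)."""
    lo, hi = 0.0, 1.0
    while chi2_cdf(hi, k) < p:  # grow the bracket until it contains the quantile
        hi *= 2.0
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if chi2_cdf(mid, k) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Quantiles needed for a two-sided test at level 0.1 with n = 10.
for k in (10, 9):  # k = n if the mean is known, k = n - 1 otherwise
    print(k, chi2_quantile(0.05, k), chi2_quantile(0.95, k))
```

The printed values should agree with the usual table entries, for example v_{10, 0.95} ≈ 18.307 and v_{9, 0.05} ≈ 3.325.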

$Mathematical Exercise$ 3. Show that for H₀: d = d₀ versus H₁: d ≠ d₀, the following test has significance level r:

Reject H₀ if and only if V₀ > v_{k, 1 - r/2} or V₀ < v_{k, r/2}.

$Mathematical Exercise$ 4. Show that for H₀: d ≤ d₀ versus H₁: d > d₀, the following test has significance level r:

Reject H₀ if and only if V₀ > v_{k, 1 - r}.

$Mathematical Exercise$ 5. Show that for H₀: d ≥ d₀ versus H₁: d < d₀, the following test has significance level r:

Reject H₀ if and only if V₀ < v_{k, r}.
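The three rejection rules in Exercises 3-5 can be collected into one small decision function. This is only a sketch: the function name `reject_h0`, its argument names, and the observed value 15.2 are hypothetical, and the critical values are standard table entries for k = 9.

```python
def reject_h0(v0, alternative, v_low=None, v_high=None):
    """Decide whether to reject H0 from the statistic V0 and chi-square quantiles.

    alternative: "two-sided" (needs v_low = v_{k, r/2} and v_high = v_{k, 1 - r/2}),
                 "greater"   (needs v_high = v_{k, 1 - r}),
                 "less"      (needs v_low = v_{k, r}).
    """
    if alternative == "two-sided":
        return v0 > v_high or v0 < v_low
    if alternative == "greater":
        return v0 > v_high
    if alternative == "less":
        return v0 < v_low
    raise ValueError("unknown alternative: " + alternative)


# Illustration with k = 9 degrees of freedom (n = 10, mean unknown) and r = 0.1.
# Table values: v_{9, 0.05} = 3.325, v_{9, 0.95} = 16.919,
#               v_{9, 0.10} = 4.168, v_{9, 0.90} = 14.684.
v0 = 15.2  # hypothetical observed value of the test statistic
print(reject_h0(v0, "two-sided", v_low=3.325, v_high=16.919))  # False
print(reject_h0(v0, "greater", v_high=14.684))                 # True
print(reject_h0(v0, "less", v_low=4.168))                      # False
```

The same observed value can lead to different decisions because the one-sided tests place all of the rejection probability r in a single tail.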

$Mathematical Exercise$ 6. Show that for the tests in Exercises 3-5, we fail to reject H₀ at significance level r if and only if the test variance d₀² is in the corresponding 1 - r confidence interval.

Of course, the result in Exercise 6 is a special case of the general equivalence between hypothesis testing and interval estimation that was discussed in the introduction.
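For the two-sided test, the equivalence can be seen by solving the acceptance condition for d₀². In the sketch below, T = d₀² V₀ denotes the sum of squared deviations (about μ if the mean is known, about M otherwise). The symbol T is introduced here only for convenience; the point is that T does not depend on d₀.

```latex
\begin{align*}
\text{fail to reject } H_0
  &\iff v_{k,\, r/2} \le V_0 \le v_{k,\, 1 - r/2} \\
  &\iff v_{k,\, r/2} \le \frac{T}{d_0^2} \le v_{k,\, 1 - r/2} \\
  &\iff \frac{T}{v_{k,\, 1 - r/2}} \le d_0^2 \le \frac{T}{v_{k,\, r/2}}
\end{align*}
```

Assuming the interval from the estimation section is the standard equal-tailed one, the last line says exactly that d₀² lies in the 1 - r confidence interval for the variance. The one-sided tests lead in the same way to the confidence lower and upper bounds.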

Power Curves

Recall that the power function for a test of d is Q(d) = P(Reject H₀ | d). For the tests above, we can compute the power functions explicitly in terms of the distribution function F_k of the chi-square distribution with k degrees of freedom. Again, k = n if μ is known and k = n - 1 if μ is unknown.
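The explicit formulas rest on one observation: when the true standard deviation is d, the variable d₀² V₀ / d² has the chi-square distribution with k degrees of freedom (this is the result of Exercises 1 and 2, applied with d in place of d₀). As a sketch, for the right-tailed test that rejects H₀ when V₀ > v_{k, 1 - r}:

```latex
\begin{align*}
Q(d) &= P\left(V_0 > v_{k,\,1-r} \mid d\right) \\
     &= P\left(\frac{d_0^2 V_0}{d^2} > \frac{d_0^2\, v_{k,\,1-r}}{d^2} \,\middle|\, d\right) \\
     &= 1 - F_k\!\left(\frac{d_0^2\, v_{k,\,1-r}}{d^2}\right)
\end{align*}
```

The other two tests are handled the same way, with the inequality reversed or with both tails included.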

$Mathematical Exercise$ 7. For the test H₀: d = d₀ versus H₁: d ≠ d₀ at significance level r, show the following results and sketch the graph of Q:

(a) Q(d) = 1 - F_k[d₀² v_{k, 1 - r/2} / d²] + F_k[d₀² v_{k, r/2} / d²]
(b) Q(d) decreases for d < d₀ and increases for d > d₀.
(c) Q(d₀) = r.
(d) Q(d) → 1 as d → 0⁺ and Q(d) → 1 as d → ∞.

$Mathematical Exercise$ 8. For the test H₀: d ≤ d₀ versus H₁: d > d₀ at significance level r, show the following results and sketch the graph of Q:

(a) Q(d) = 1 - F_k[d₀² v_{k, 1 - r} / d²]
(b) Q(d) increases for d > 0.
(c) Q(d₀) = r.
(d) Q(d) → 0 as d → 0⁺ and Q(d) → 1 as d → ∞.

$Mathematical Exercise$ 9. For the test H₀: d ≥ d₀ versus H₁: d < d₀ at significance level r, show the following results and sketch the graph of Q:

(a) Q(d) = F_k[d₀² v_{k, r} / d²]
(b) Q(d) decreases for d > 0.
(c) Q(d₀) = r.
(d) Q(d) → 1 as d → 0⁺ and Q(d) → 0 as d → ∞.

$Mathematical Exercise$ 10. Show that in each case, the test of d when μ is known is more powerful than the test of d when μ is unknown.
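The power formulas are easy to evaluate numerically. The following sketch does so for the right-tailed test only, with n = 10, d₀ = 1, and r = 0.05, comparing k = n (mean known) with k = n - 1 (mean unknown). The helper names `chi2_cdf` and `power_right_tailed` are assumptions made for this example, and the critical values are table entries. A numerical comparison at a few points illustrates Exercise 10 but does not prove it.

```python
import math


def chi2_cdf(x, k):
    """Chi-square distribution function F_k(x), via the incomplete gamma series."""
    if x <= 0:
        return 0.0
    a, z = k / 2.0, x / 2.0
    term = total = 1.0 / a
    n = 0
    while term > 1e-16 * total:
        n += 1
        term *= z / (a + n)
        total += term
    return total * math.exp(-z + a * math.log(z) - math.lgamma(a))


def power_right_tailed(d, d0, k, v_crit):
    """Q(d) = 1 - F_k[d0^2 v_{k, 1 - r} / d^2] for H0: d <= d0 versus H1: d > d0."""
    return 1.0 - chi2_cdf(d0 ** 2 * v_crit / d ** 2, k)


# Table values: v_{10, 0.95} = 18.307 (mean known), v_{9, 0.95} = 16.919 (mean unknown).
for d in [1.0, 1.2, 1.5, 2.0]:
    q_known = power_right_tailed(d, 1.0, 10, 18.307)
    q_unknown = power_right_tailed(d, 1.0, 9, 16.919)
    print(f"d = {d:.1f}  mean known: {q_known:.3f}  mean unknown: {q_unknown:.3f}")
```

At d = d₀ both tests have power equal to the significance level, up to the rounding of the table values; the comparison of interest is at the values d > d₀.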

Simulation Exercises

11. In the variance test experiment, select the normal distribution with mean 0, the two-sided test at significance level 0.1, sample size n = 10, and test standard deviation 1.0.

(a) For each of the values of the true standard deviation 0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3, run the experiment 1000 times, updating every 10 runs, and note the relative frequency of rejecting H₀.
(b) When the true standard deviation is 1.0, compare the relative frequency of rejecting H₀ with the significance level.
(c) Using the relative frequencies in (a), plot the empirical power curve.
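If the variance test experiment is not available, a similar simulation can be sketched in a few lines of Python. This is not the experiment itself: the sketch assumes that the mean is treated as unknown (so k = n - 1 = 9), uses the table values v_{9, 0.05} = 3.325 and v_{9, 0.95} = 16.919, and the function name `rejection_frequency`, the number of runs, and the seed are arbitrary choices.

```python
import random

N = 10                         # sample size; the critical values below are for k = N - 1 = 9
V_LOW, V_HIGH = 3.325, 16.919  # table values v_{9, 0.05} and v_{9, 0.95}


def rejection_frequency(true_sd, test_sd=1.0, runs=2000, seed=0):
    """Relative frequency of rejecting H0: d = test_sd in the two-sided test at level 0.1."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(runs):
        xs = [rng.gauss(0.0, true_sd) for _ in range(N)]
        m = sum(xs) / N
        v0 = sum((x - m) ** 2 for x in xs) / test_sd ** 2
        if v0 < V_LOW or v0 > V_HIGH:
            rejections += 1
    return rejections / runs


# Empirical power curve over the true standard deviations of the exercise.
for sd in [0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3]:
    print(sd, rejection_frequency(sd))
```

Plotting the printed pairs gives an empirical power curve that can be compared with the graph of Q from Exercise 7.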

12. In the variance test experiment, repeat Exercise 11 with the left-tailed test.

13. In the variance test experiment, repeat Exercise 11 with the right-tailed test.

14. In the variance estimate experiment, select the normal distribution with μ = 0 and standard deviation 2, the two-sided interval at confidence level 0.90, and sample size n = 10. Run the experiment 20 times, updating after each run. State the corresponding hypotheses and significance level, and for each run, give the set of test standard deviations for which the null hypothesis would be rejected.

15. In the variance estimate experiment, repeat Exercise 14 with the confidence lower bound.

16. In the variance estimate experiment, repeat Exercise 14 with the confidence upper bound.

Non-Normal Distributions

Even when the underlying distribution is not normal, the procedure of this section is still used to perform approximate tests for the variance. You will see in the simulation exercises below that this procedure is not nearly as robust as that of testing for the mean. Nonetheless, if the distribution is not too far from normal, the procedure usually works well.

17. In the variance test experiment, select the gamma distribution with shape parameter 1 and scale parameter 1 (thus, the true standard deviation is 1). Select the two-sided test at significance level 0.1 and sample size n = 10.

(a) For each of the test standard deviations 0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3, run the simulation 1000 times, updating every 10 runs, and note the relative frequency of rejecting H₀.
(b) When the test standard deviation is 1.0, compare the relative frequency in (a) with the significance level.
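The lack of robustness can also be explored outside the experiment. The sketch below applies the normal-theory two-sided test to samples from the gamma distribution with shape parameter 1 and scale parameter 1. As assumptions of the sketch, the mean is treated as unknown (k = n - 1 = 9), the critical values are the table values v_{9, 0.05} = 3.325 and v_{9, 0.95} = 16.919, and the function name, number of runs, and seed are arbitrary.

```python
import random

N = 10                         # sample size; the critical values below are for k = N - 1 = 9
V_LOW, V_HIGH = 3.325, 16.919  # table values v_{9, 0.05} and v_{9, 0.95}


def rejection_frequency_gamma(test_sd, shape=1.0, scale=1.0, runs=4000, seed=0):
    """Relative frequency of rejecting H0: d = test_sd when sampling from a gamma distribution."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(runs):
        xs = [rng.gammavariate(shape, scale) for _ in range(N)]
        m = sum(xs) / N
        v0 = sum((x - m) ** 2 for x in xs) / test_sd ** 2
        if v0 < V_LOW or v0 > V_HIGH:
            rejections += 1
    return rejections / runs


# The true standard deviation is 1, so H0 is true when test_sd = 1.0.
print(rejection_frequency_gamma(1.0))
```

Because this gamma distribution has much heavier tails than the normal distribution, the frequency of rejecting a true H₀ can be expected to come out noticeably above the nominal level 0.1.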

18. In the variance test experiment, repeat Exercise 17 with sample size n = 20.

19. In the variance test experiment, select the gamma distribution with shape parameter 4 and scale parameter 1 (thus, the true standard deviation is 2). Select the two-sided test at significance level 0.1 and sample size n = 10.

(a) For each of the test standard deviations 1.6, 1.8, 2.0, 2.2, 2.4, run the simulation 1000 times, updating every 10 runs, and note the relative frequency of rejecting H₀.
(b) When the test standard deviation is 2.0, compare the relative frequency in (a) with the significance level.

20. In the variance test experiment, select the uniform distribution on (0, 4) (thus, the true standard deviation is about 1.15). Select the two-sided test at significance level 0.1 and sample size n = 10.

(a) For each of the test standard deviations 0.69, 0.92, 1.15, 1.39, 1.62, run the simulation 1000 times, updating every 10 runs, and note the relative frequency of rejecting H₀.
(b) When the test standard deviation is 1.15, compare the relative frequency in (a) with the significance level.

Data Analysis Exercises

21. Using Michelson's data, test to see if the standard deviation of the velocity of light measurements is less than 80 km/sec, at the 0.1 significance level.

(a) Assume that μ is the "true value."
(b) Assume that μ is unknown.

22. Using Cavendish's data, test to see if the standard deviation of density measurements is greater than 0.2, at the 0.05 significance level.

(a) Assume that μ is the "true value."
(b) Assume that μ is unknown.

23. Using Short's data, test to see if the standard deviation of parallax measurements differs from 0.7 seconds of a degree, at the 0.1 significance level.

(a) Assume that μ is the "true value."
(b) Assume that μ is unknown.

24. Using Fisher's iris data, perform the following tests, at the 0.1 level:

(a) The standard deviation of the petal length of Setosa irises differs from 2 mm.
(b) The standard deviation of the petal length of Virginica irises is greater than 5 mm.
(c) The standard deviation of the petal length of Versicolor irises is less than 5.5 mm.