Tests in the Bernoulli Model

4. Tests in the Bernoulli Model

Preliminaries

Suppose that I₁, I₂, ..., I_nis a random sample from the Bernoulli distribution with unknown parameter p in (0, 1). Thus, these are independent indicator variables taking the values 1 and 0 with probabilities p and 1 - p respectively. Usually, this model arises in one of the following contexts:

There is an event of interest in a basic experiment, with unknown probability p. We replicate the experiment n times and define I_i = 1 if and only if the event occurred on the i'th run.
We have a population of objects of several different types; p is the unknown proportion of objects of a particular type of interest. We select n objects at random from the population and let I_i = 1 if and only if the i'th object is of the type of interest. When the sampling is with replacement, these variables really do form a random sample from the Bernoulli distribution. When the sampling is without replacement, the variables are dependent, but the Bernoulli model may still be approximately valid. For more on these points, see the chapter on Finite Sampling Models.

In this section, we will construct hypothesis tests for the parameter p. This section parallels the section on Estimation in the Bernoulli Model in the Chapter on Interval Estimation.

Tests of `p`

The parameter space is {p: 0 < p < 1}, and all hypotheses define subsets of this space. Recall that

N = I₁ + I₂ + ЗЗЗ + I_n

has the binomial distribution with parameters n and p and has mean and variance given by

E(N) = np, var(N) = np(1 - p).

Moreover, N is sufficient for p, so it is natural to construct a test statistic from N. For r in (0, 1), let b_r(n, p) denote the quantile of order r for the binomial distribution with parameters n and p. Since the binomial distribution is discrete, only certain (exact) quantiles are possible.

$Mathematical Exercise$ 1. Show that the following tests have significance level r:

Reject H₀: p = p₀ versus H₁: p p₀ if and only if N < b_r_/2(n, p₀) or N > b_{1
-}_r_/2(n, p₀).
Reject H₀: p p₀ versus H₁: p > p₀ if and only if N > b_{1 -}_r(n, p₀).
Reject H₀: p p₀ versus H₁: p < p₀ if and only if N < b_r(n, p₀).

When n is large, the distribution of N is approximately normal, by the central limit theorem. Thus, an approximate normal test can be constructed using the test statistic

Z₀ = (N - np₀) / [np₀(1 - p₀)]^1/2.

Note that Z₀ is the standard score of N, under the null hypothesis. As usual, for r in (0, 1), let z_r denote the quantile of order r for the standard normal distribution.

$Mathematical Exercise$ 2. Show that if n is large, the following tests have approximate significance level r:

Reject H₀: p = p₀ versus H₁: p p₀ if and only if Z₀ > z_{1 -} _r_/2 or Z₀ < -z_{1
-}_r_/2.
Reject H₀: p p₀ versus H₁: p > p₀ if and only if Z₀ > z_{1 -}_r.
Reject H₀: p p₀ versus H₁: p < p₀ if and only if Z₀ < -z_{1 -}_r.

3. In the proportion test experiment, set H₀: p = p₀, n = 10, significance level 0.1, and p₀ = 0.5.

For each p = 0.1, 0.2, ..., 0.9, run the experiment 1000 times, updating every 10 runs, and then note the relative frequency of rejecting H₀ for each value of p.
When p = 0.5, compare the relative frequency with the significance level.
Based on these relative frequencies, sketch the empirical power curve.

4. In the proportion test experiment, repeat the previous exercise with n = 20.

5. In the proportion test experiment, set H₀: p p₀, n = 15, significance level 0.05, and p₀ = 0.3.

For each p = 0.1, 0.2, ..., 0.9, run the experiment 1000 times, updating every 10 runs, and then note the relative frequency of rejecting H₀ for each value of p.
When p = 0.3, compare the relative frequency with the significance level.
Based on these relative frequencies, sketch the imperial power curve.

6. In the proportion test experiment, repeat the previous exercise with n = 30.

7. In the proportion test experiment, set H₀: p p₀, n = 20, significance level 0.01, and p₀ = 0.6.

For each p = 0.1, 0.2, ..., 0.9, run the experiment 1000 times, updating every 10 runs, and then note the relative frequency of rejecting H₀ for each value of p.
When p = 0.6, compare the relative frequency with the significance level.
Based on these relative frequencies, sketch the imperial power curve.

8. In the proportion test experiment, repeat the previous exercise with n = 50.

The Sign Test

Suppose now that we have a basic random experiment with a random variable X of interest. We assume that X has a continuous distribution. Let p₀ be a specified number in (0, 1), and let m denote quantile of order p₀ for the distribution of X. Thus, by definition,

p₀ = P(X < m).

Suppose that m is unknown and that we want to construct hypothesis tests for m. For a given test value m₀, let

p = P(X < m₀).

$Mathematical Exercise$ 9. Show that

m = m₀ if and only if p = p₀.
m < m₀ if and only if p > p₀.
m > m₀ if and only if p < p₀.

As usual, we repeat the basic experiment n times to generate a random sample of size n from the distribution of X:

X₁, X₂, ..., X_n.

Let I_i be the indicator variable of the event {X_i < m₀} for i = 1, 2, ..., n.

$Mathematical Exercise$ 10. Show that I₁, I₂, ..., I_n is a random sample of size n from the Bernoulli distribution with parameter p.

From Exercises 9 and 10, tests of the unknown quantile m can be converted to tests of the Bernoulli parameter p, and thus the tests developed in the previous subsections apply. This procedure is known as the sign test, because essentially, only the sign of X_i - m₀ is recorded for each i. This procedure is also an example of a nonparametric test, because no assumptions about the distribution of X are made (except for continuity). In particular, we do not need to assume that the distribution of X belongs to a particular parametric family.

The most important special case of the sign test is the case where p₀ = 1/2; this is the sign test of the median. If the distribution of X is known to be symmetric, the median and the mean agree. In this case, sign tests of the median are also tests of the mean.

11. In the sign test experiment, set the sampling distribution to normal with mean 0 and standard deviation 2. Set the sample size to 10 and the significance level to 0.1. For each of the 9 values of m₀, run the simulation 1000 times, updating every 10 runs.

When m₀ = m, give the empirical estimate of the significance level of the test and compare with 0.1.
In the other cases, give the empirical estimate of the power of the test.

12. In the sign test experiment, set the sampling distribution to uniform on the interval [0, 5]. Set the sample size to 20 and the significance level to 0.05. For each of the 9 values of m₀, run the simulation 1000 times, updating every 10 runs.

When m₀ = m, give the empirical estimate of the significance level of the test and compare with 0.05.
In the other cases, give the empirical estimate of the power of the test.

13. In the sign test experiment, set the sampling distribution to gamma with shape parameter a = 2 and scale parameter r = 1 . Set the sample size to 30 and the significance level to 0.025. For each of the 9 values of m₀, run the simulation 1000 times, updating every 10 runs.

When m₀ = m, give the empirical estimate of the significance level of the test and compare with 0.025.
In the other cases, give the empirical estimate of the power of the test.

Computational Exercises

$Mathematical Exercise$ 14. In a pole of 1000 registered voters in a certain district, 427 prefer candidate X. At the 0.1 level, is the evidence sufficient to conclude that more that 40% of the registered voters prefer X?

$Mathematical Exercise$ 15. A coin is tossed 500 times and results in 302 heads. At the 0.05 level, test to see if the coin is unfair.

$Mathematical Exercise$ 16. A sample of 400 memory chips from a production line are tested, and 30 are defective. At the 0.05 level, test to see if the proportion of defective chips is less than 0.1.

$Mathematical Exercise$ 17. A new drug is administered to 50 patients and the drug is effective in 42 cases. At the 0.1 level, test to see if the success rate for the new drug is greater that 0.8.

18. Using the M&M data, test the following alternative hypotheses at the 0.1 significance level:

The proportion of red M&Ms differs from 1/6.
The proportion of green M&Ms is less than 1/6
The proportion of yellow M&M is greater than 1/6

19. Using the M&M data, test to see if the median weight exceeds 47.9 grams, at the 0.1 level.

20. Using Fisher's iris data, perform the following tests, at the 0.1 level:

The median petal length of Setosa irises differs from 15 mm.
The median petal length of Verginica irises is greater than 52 mm.
The median petal length of Versicolor irises is less than 42 mm.

4. Tests in the Bernoulli Model

Preliminaries

Tests of p

The Sign Test

Computational Exercises

Tests of `p`