
2. The Sample Mean and the Law of Large Numbers


The Sample Mean

As usual, we start with a random experiment that has a sample space and a probability measure P. Suppose that X is a real-valued random variable. We will denote the mean and standard deviation of X by µ and d respectively.

Now suppose we perform independent replications of the basic experiment. This defines a new, compound experiment with a sequence of independent random variables, each with the same distribution as X:

X1, X2, ...

Recall that in statistical terms, (X1, X2, ..., Xn) is a random sample of size n from the distribution of X for each n. The sample mean is simply the average of the variables in the sample:

Mn = (X1 + X2 + ··· + Xn) / n.

The sample mean is a real-valued function of the random sample and thus is a statistic. Like any statistic, the sample mean is itself a random variable with a distribution, mean, and variance of its own. Often the distribution mean is unknown, and the sample mean is used as an estimator of it.
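
As a quick illustration, the sample mean of a simulated random sample can be computed directly. This is a minimal sketch in Python; the choice of NumPy and of a fair six-sided die as the distribution of X are illustrative assumptions, not part of the text:

```python
import numpy as np

rng = np.random.default_rng(seed=42)

# Simulate a random sample of size n from the distribution of X
# (here X is a fair six-sided die, so mu = 3.5).
n = 100
sample = rng.integers(1, 7, size=n)  # X1, X2, ..., Xn

# The sample mean Mn = (X1 + X2 + ... + Xn) / n
m_n = sample.sum() / n
print(m_n)
```

Since Mn is itself a random variable, a different seed would produce a different value of m_n; the exercises below explore how these values are distributed.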

Simulation Exercise 1. In the dice experiment, select the average random variable. For each die distribution, start with n = 1 die and increase the number of dice by one until you get to n = 20 dice. Note the shape and location of the density function at each stage. With 20 dice, run the simulation 1000 times with an update frequency of 10. Note the apparent convergence of the empirical density function to the true density function.

Properties of the Sample Mean

Mathematical Exercise 2. Show that E(Mn) = µ.

Exercise 2 shows that Mn is an unbiased estimator of µ. Therefore, the variance of the sample mean is the mean square error when the sample mean is used as an estimator of the distribution mean.

Mathematical Exercise 3. Show that var(Mn) = d^2 / n.

From Exercise 3, the variance of the sample mean is an increasing function of the distribution variance and a decreasing function of the sample size. Both of these make intuitive sense if we think of the sample mean as an estimator of the distribution mean.
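
These two properties are easy to check by simulation. The following sketch (Python with NumPy; the fair-die distribution is again an illustrative assumption) estimates the mean and variance of Mn over many independent samples and compares them with µ and d^2 / n:

```python
import numpy as np

rng = np.random.default_rng(seed=0)

n = 25          # sample size
reps = 200_000  # number of independent random samples

# X uniform on {1, ..., 6}: mu = 3.5 and d^2 = 35/12
mu, var_x = 3.5, 35 / 12

samples = rng.integers(1, 7, size=(reps, n))  # each row is one random sample
m = samples.mean(axis=1)                      # one value of Mn per row

print(m.mean())  # estimate of E(Mn); should be close to mu
print(m.var())   # estimate of var(Mn); should be close to d^2 / n
```

Doubling n (from 25 to 50, say) should roughly halve the second number while leaving the first unchanged, matching Exercises 2 and 3.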

Simulation Exercise 4. In the dice experiment, select the average random variable. For each die distribution, start with n = 1 die and increase the number of dice by one until you get to n = 20 dice. Note that the mean of the sample mean stays the same, but the standard deviation of the sample mean decreases (as we now know, in inverse proportion to the square root of the sample size). Run the simulation 1000 times, updating every 10 runs. Note the apparent convergence of the empirical moments of the sample mean to the true moments.

Data Analysis Exercise 5. Compute the sample mean of the petal width variable for the following cases in Fisher's iris data. Compare the results.

  1. All cases
  2. Setosa only
  3. Versicolor only
  4. Virginica only
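
One way to carry out this computation is sketched below, assuming the copy of Fisher's iris data bundled with scikit-learn is an acceptable stand-in for the data set referenced here (the particular library and column ordering are assumptions, not part of the text):

```python
import numpy as np
from sklearn.datasets import load_iris

iris = load_iris()
petal_width = iris.data[:, 3]  # petal width (cm) is the fourth column
species = iris.target          # 0 = setosa, 1 = versicolor, 2 = virginica

print("all cases:", round(petal_width.mean(), 3))

# Sample mean of petal width within each species
means = {}
for code, name in enumerate(iris.target_names):
    means[name] = petal_width[species == code].mean()
    print(name + ":", round(means[name], 3))
```

The comparison is striking: the three species have well-separated petal-width means, so the overall sample mean describes none of the individual species well.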

The Weak Law of Large Numbers

By Exercise 3, note that var(Mn) converges to 0 as n converges to infinity. This means that Mn converges to µ in mean square as n converges to infinity.

Mathematical Exercise 6. Use Chebyshev's inequality to show that

P[|Mn - µ| > r] converges to 0 as n converges to infinity, for any r > 0.

This result is known as the weak law of large numbers, and states that the sample mean converges to the mean of the distribution in probability. Recall that in general, convergence in mean square implies convergence in probability.
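
The weak law can be watched numerically. The sketch below (Python with NumPy; once more a fair die, so µ = 3.5 and d^2 = 35/12) estimates P[|Mn - µ| > r] for increasing n and compares it with the Chebyshev bound d^2 / (n r^2) from Exercise 6:

```python
import numpy as np

rng = np.random.default_rng(seed=1)

mu, var_x = 3.5, 35 / 12  # fair die: E(X) = 3.5, var(X) = 35/12
r = 0.25
reps = 5_000              # independent samples per sample size

results = []
for n in (10, 100, 1000):
    # One row per replication; one sample mean Mn per row
    m = rng.integers(1, 7, size=(reps, n)).mean(axis=1)
    empirical = np.mean(np.abs(m - mu) > r)          # estimate of P[|Mn - mu| > r]
    bound = min(var_x / (n * r**2), 1.0)             # Chebyshev bound, capped at 1
    results.append((n, empirical, bound))
    print(n, empirical, bound)
```

The empirical probabilities shrink toward 0 as n grows, always staying below the Chebyshev bound; the bound itself is quite loose, which is typical of Chebyshev's inequality.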

The Strong Law of Large Numbers

The strong law of large numbers states that the sample mean Mn converges to the distribution mean µ with probability 1:

P(Mn converges to µ as n converges to infinity) = 1.
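
In contrast to the weak law, the strong law concerns a single sample path. The sketch below (Python with NumPy; the fair die is an illustrative assumption) computes the running sample mean along one path and shows it settling down near µ = 3.5:

```python
import numpy as np

rng = np.random.default_rng(seed=7)

# A single sample path: one long sequence of die rolls,
# with the running sample mean Mn computed at every n.
x = rng.integers(1, 7, size=100_000)
running_mean = np.cumsum(x) / np.arange(1, x.size + 1)

# Mn at n = 10, n = 1000, and n = 100000 along this one path
print(running_mean[9], running_mean[999], running_mean[-1])
```

The strong law says that, with probability 1, the entire sequence of running means converges; a simulation can only display a truncated path, but the drift toward 3.5 is already visible.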

As the name suggests, this is a much stronger result than the weak law. We will construct a fairly simple proof under the assumption that the fourth central moment is finite:

b4 = E[(X - µ)^4] < ∞.

However, there are better proofs that do not need this assumption--see for example, the book Probability and Measure by Patrick Billingsley.

Mathematical Exercise 7. Let Yi = Xi - µ and let Wn = Y1 + Y2 + ··· + Yn. Show that

  1. Y1, Y2, ..., Yn are independent and identically distributed.
  2. E(Yi) = 0.
  3. E(Yi^2) = d^2.
  4. E(Yi^4) = b4.
  5. Mn converges to µ as n converges to infinity if and only if Wn / n converges to 0 as n converges to infinity.

By Exercise 7, we want to show that with probability 1, Wn / n converges to 0 as n converges to infinity.

Mathematical Exercise 8. Show that Wn / n does not converge to 0 if and only if there exists a rational number r > 0 such that |Wn / n| > r for infinitely many n.

Thus, we need to show that the event described in Exercise 8 has probability 0.

Mathematical Exercise 9. Show that Wn^4 is the sum of YiYjYkYl over all i, j, k, l in {1, 2, ..., n}.

Mathematical Exercise 10. Show that

  1. E(YiYjYkYl) = 0 if one index differs from the other three.
  2. E(Yi^2 Yj^2) = d^4 if i and j are distinct, and there are 3n(n - 1) such terms in E(Wn^4).
  3. E(Yi^4) = b4, and there are n such terms in E(Wn^4).
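
The counting in Exercise 10 can be verified by brute force for small n: classify all n^4 quadruples (i, j, k, l) by how often each index appears. This Python sketch uses only the standard library:

```python
from itertools import product
from collections import Counter

n = 5
fourth_terms = 0  # quadruples of the form (i, i, i, i): contribute b4 each
pair_terms = 0    # two distinct indices, each appearing twice: contribute d^4 each
zero_terms = 0    # some index appears an odd number of times: expectation 0

for quad in product(range(n), repeat=4):
    counts = sorted(Counter(quad).values())
    if counts == [4]:
        fourth_terms += 1
    elif counts == [2, 2]:
        pair_terms += 1
    else:
        zero_terms += 1

print(fourth_terms, pair_terms, zero_terms)  # n, 3n(n - 1), and the rest
```

Running this for several values of n confirms the term counts, so that E(Wn^4) = n b4 + 3n(n - 1) d^4, which is the identity behind Exercise 11.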

Mathematical Exercise 11. Use the results in Exercise 10 to show that E(Wn^4) <= C n^2 for some constant C (independent of n).

Mathematical Exercise 12. Use Markov's inequality and the result of Exercise 11 to show that for r > 0,

P(|Wn / n| > r) = P(Wn^4 > r^4 n^4) <= C / (r^4 n^2).

Mathematical Exercise 13. Use the first Borel-Cantelli lemma to show that

P(|Wn / n| > r for infinitely many n) = 0.

Mathematical Exercise 14. Finally, show that

P(there exists rational r > 0 such that |Wn / n| > r for infinitely many n) = 0.

Simulation Exercises

Simulation Exercise 15. In the dice experiment, select the average random variable. For each die distribution, start with n = 1 die and increase the number of dice by one until you get to n = 20 dice. Note how the distribution of the sample mean begins to resemble a point mass distribution. Run the simulation 1000 times, updating every 10 runs. Note the apparent convergence of the empirical density of the sample mean to the true density.

Many of the applets in this project are simulations of experiments with a basic random variable of interest. When you run the simulation, you are performing independent replications of the experiment. In most cases, the applet displays the mean of the distribution numerically in a table and graphically as the center of the blue horizontal bar in the graph box. When you run the simulation, the sample mean is also displayed numerically in the table and graphically as the center of the red horizontal bar in the graph box.

Simulation Exercise 16. In the simulation of the binomial coin experiment, the random variable is the number of heads. Run the simulation 1000 times updating every 10 runs and note the apparent convergence of the sample mean to the distribution mean.

Simulation Exercise 17. In the simulation of the matching experiment, the random variable is the number of matches. Run the simulation 1000 times updating every 10 runs and note the apparent convergence of the sample mean to the distribution mean.

Simulation Exercise 18. Run the simulation of the exponential experiment 1000 times with an update frequency of 10. Note the apparent convergence of the sample mean to the distribution mean.