Statistics/Interval Estimation
Introduction
Previously, we have discussed point estimation, which gives us a point estimator $\hat\theta$ for the value of an unknown parameter $\theta$. Now, suppose we want to know the size of the error of the point estimator $\hat\theta$, i.e. the difference between $\hat\theta$ and the unknown parameter $\theta$. Of course, we can make use of the value of the mean squared error of $\hat\theta$, $\operatorname{MSE}(\hat\theta)$, or other things.
However, what if we only know about one specific point estimate? We cannot calculate the mean squared error of its corresponding point estimator with only this point estimate, right? So, how do we know the possible size of error of this point estimate? Indeed, it is impossible to tell, since we are only given a particular estimated value of the parameter $\theta$, and of course we do not know the value of the unknown parameter $\theta$, thus the difference between this point estimate and $\theta$ is also unknown.
To illustrate this, consider the following example: suppose we take a random sample of 10 students from one particular course in a university to estimate the mean score of the students in the final exam of that course, denoted by $\mu$ (assume the score is normally distributed), and the observed value of the sample mean is $\bar x = 60$. Then, what is the difference between this point estimate and the true unknown parameter $\mu$? Can we be "confident" that this sample mean is close to $\mu$?
It is possible that $\mu$ is, say, 90, and somehow the students in the sample are the ones with very poor performance. On the other hand, it is also possible that $\mu$ is, say, 30, and somehow the students in the sample are the ones who perform (relatively) well. Of course, it is also possible that $\mu$ is quite close to 60, say 59. From this example, we can see that a particular value does not tell us the possible size of the error: the error can be very large, and it can also be very small.
In this chapter, we will introduce interval estimation, where we use interval estimators that can describe the size of the error by providing the probability for the random interval (i.e. an interval with at least one of its bounds being a random variable) given by the interval estimator to contain the unknown parameter $\theta$; this probability measures the "accuracy" of the interval estimator of $\theta$, and hence the size of the error.
As suggested by the name interval estimator, the estimator involves some sort of interval. Also, as one may expect, an interval estimator is also based on a random sample: Template:Colored definition Of course, we would like the probability for the unknown parameter $\theta$ to lie in the interval to be close to 1, so that the interval estimator is very accurate. However, a very accurate interval estimator may have a very bad "precision", i.e. the interval covers "too many" plausible values of the unknown parameter, and therefore even if we know that $\theta$ is very likely to be one of such values, there are too many different possibilities. Hence, such an interval estimator is not very "useful". To illustrate this, suppose the interval concerned is the whole parameter space of $\theta$. Then, of course the probability for $\theta$ to lie in the interval is 1 (so the "confidence" is high), since $\theta$ must lie in its parameter space. However, such an interval has basically "zero precision", and is quite "useless", since the "plausible values" of $\theta$ in the interval are essentially all possible values of $\theta$.
From this, we can observe the need for "precision" of the interval, that is, we also want the width of the interval to be small, so that we can have some idea about the "location" of $\theta$. However, as the interval becomes smaller, it is more likely that the interval misses $\theta$, i.e. does not cover the actual value of $\theta$, and therefore the probability for $\theta$ to lie in that interval becomes smaller, i.e. the interval becomes less "accurate". To illustrate this, let us consider the extreme case: the interval is so small that it becomes an interval containing a single point (the two end-points of the interval coincide). Then, the "interval estimator" basically becomes a "point estimator" in some sense, and we know that it is very unlikely that the true value of $\theta$ equals the value of the point estimator ($\theta$ lying in that "interval" is equivalent to $\theta$ equalling the value of the point estimator in this case). Indeed, if the distribution of the point estimator is continuous, then the probability that it equals $\theta$ exactly is zero.
As we can see from above, although we want the interval to have a very high "confidence" and also be "very precise" (i.e. the interval is very narrow), we cannot have both of them, since an increase in confidence causes a decrease in "precision", and an increase in "precision" causes a decrease in confidence. Therefore, we need to make some compromise between them, and pick an interval that gives a sufficiently high confidence and is also quite precise. In other words, we would like to have a reasonably narrow interval that will cover $\theta$ with a reasonably high probability.
Terminologies
Now, let us formally define some terminologies related to interval estimation. Template:Colored definition Template:Colored remark Template:Colored definition Template:Colored example Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark Template:Colored example Template:Colored exercise
Construction of confidence intervals
After understanding what a confidence interval is, we would like to know how to construct one. A main way for such construction is using a pivotal quantity, which is defined below. Template:Colored definition Template:Colored remark After having such a pivotal quantity $Q = Q(X_1,\dots,X_n;\theta)$, we can construct a confidence interval for $\theta$ by the following steps (a small numerical sketch in code is given after the steps):
- For that value of $\alpha$, find constants $a$ and $b$ such that $P(a \le Q(X_1,\dots,X_n;\theta) \le b) = 1-\alpha$.[1] (This probability does not involve $\theta$ since $Q$ is a pivotal quantity.)
- After that, we can transform the inequalities $a \le Q(X_1,\dots,X_n;\theta) \le b$ into $L(X_1,\dots,X_n) \le \theta \le U(X_1,\dots,X_n)$, since the expression of $Q$ involves $\theta$, as we have assumed (the resulting inequalities should be equivalent to the original inequalities, that is, $a \le Q \le b \iff L \le \theta \le U$, so that $P(L \le \theta \le U) = 1-\alpha$ as well).
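To make these two steps concrete, here is a minimal numerical sketch in Python, assuming a normal random sample with known standard deviation and using the pivotal quantity $\frac{\bar X - \mu}{\sigma/\sqrt n}$ (the data, the value of $\sigma$, and the 95% level are made up for illustration; this case is treated formally in a later section):

```python
import numpy as np
from scipy import stats

# Illustrative data: assume a random sample from a normal distribution
# with KNOWN standard deviation sigma (an assumption for this sketch).
x = np.array([58.2, 61.5, 59.9, 63.1, 57.4, 60.8, 62.0, 58.7, 61.2, 59.3])
sigma = 2.5          # assumed known population standard deviation
alpha = 0.05         # confidence level 1 - alpha = 95%

n = len(x)
x_bar = x.mean()

# Step 1: for the pivotal quantity Q = (X_bar - mu) / (sigma / sqrt(n)) ~ N(0, 1),
# find a, b with P(a <= Q <= b) = 1 - alpha; by symmetry take b = -a = z_{alpha/2}.
z = stats.norm.ppf(1 - alpha / 2)

# Step 2: invert a <= Q <= b into an interval for mu.
lower = x_bar - z * sigma / np.sqrt(n)
upper = x_bar + z * sigma / np.sqrt(n)
print(f"{100 * (1 - alpha):.0f}% confidence interval for mu: [{lower:.3f}, {upper:.3f}]")
```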
Template:Colored example Template:Colored exercise Template:Colored example Template:Colored example
Confidence intervals for means of normal distributions
In the following, we will use the concept of pivotal quantity to construct confidence intervals for means and variances of normal distributions. After that, because of the central limit theorem, we can construct approximated confidence intervals for means of other types of distributions that are not normal.
Mean of a normal distribution
Before discussing this confidence interval, let us first introduce a notation:
- $z_{\alpha}$ is the upper percentile of $\mathcal N(0,1)$ at level $\alpha$, i.e. it satisfies $P(Z > z_{\alpha}) = \alpha$ where $Z \sim \mathcal N(0,1)$.
We can find (or calculate) the values of $z_{\alpha}$ for different $\alpha$ from the standard normal table (a small code sketch for computing these values is given after the figure below). Template:Colored theorem Template:Colored remark
Proof. Let $Z = \frac{\bar X - \mu}{\sigma/\sqrt n} \sim \mathcal N(0,1)$. Since $Z$ is a pivotal quantity (its distribution does not depend on $\mu$), we set $1-\alpha = P\left(-z_{\alpha/2} \le Z \le z_{\alpha/2}\right)$ where $z_{\alpha/2}$ is a constant (and does not involve $\mu$). Then, we have $1-\alpha = P\left(\bar X - z_{\alpha/2}\frac{\sigma}{\sqrt n} \le \mu \le \bar X + z_{\alpha/2}\frac{\sigma}{\sqrt n}\right)$. The result follows.
The following graph illustrates $z_{\alpha/2}$:
|
*-|-*
/##|##\
/###|###\ <----- area 1-a
/####|####\
/#####|#####\
/######|######\
/|######|######|\
area --*.|######|######|.*--
a/2 --> ....|######|######|.... <--- area a/2
------------*---------------
-z_{a/2} z_{a/2}
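If a standard normal table is not at hand, the values of $z_{\alpha}$ can also be computed numerically. A minimal sketch in Python (the particular levels chosen below are arbitrary):

```python
from scipy import stats

# Upper percentiles z_alpha of the standard normal distribution:
# P(Z > z_alpha) = alpha, i.e. z_alpha is the (1 - alpha)-quantile.
for alpha in (0.10, 0.05, 0.025, 0.01, 0.005):
    z_alpha = stats.norm.ppf(1 - alpha)
    print(f"z_{alpha} = {z_alpha:.4f}")
```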
Template:Colored example Template:Colored example We have previously discussed a way to construct a confidence interval for the mean when the variance is known. However, this is not always the case in practice. We may not know the variance, right? Then, we cannot use the $\sigma$ in the confidence interval from the previous theorem.
Intuitively, one may think that we can use the sample variance $S^2$ to "replace" the $\sigma^2$, according to the weak law of large numbers. Then, we can simply replace the unknown $\sigma$ in the confidence interval by the known $S$ (or its realization $s$ for the interval estimate). However, the flaw in this argument is that the sample size may not be large enough to apply the weak law of large numbers for approximation. Template:Colored remark So, you may now ask: when the sample size is large enough, can we do such a "replacement" for approximation? The answer is yes, and we will discuss this in the last section about approximated confidence intervals.
Before that section, the confidence intervals discussed are exact in the sense that no approximation is used to construct them. Therefore, the confidence intervals constructed "work" for any sample size, no matter how large or how small it is (they work even if the sample size is 1, although such a confidence interval may not be very "nice", in the sense that the width of the interval may be quite large).
Before discussing how to construct a confidence interval for the mean when the variance is unknown, we first give some results that are useful for deriving such a confidence interval.
Proof.
(i) One may use Basu's theorem to prove this, but the details about Basu's theorem and the proof are omitted here, since they are a bit complicated.
(ii) We will use the following definition of the chi-squared distribution $\chi^2_n$: if $Z_1,\dots,Z_n$ are independent $\mathcal N(0,1)$ random variables, then $Z_1^2 + \cdots + Z_n^2 \sim \chi^2_n$. Also, we will use the fact that the mgf of $\chi^2_n$ is $(1-2t)^{-n/2}$ for $t < 1/2$.
Now, first let $W = \sum_{i=1}^n \left(\frac{X_i - \mu}{\sigma}\right)^2$, which follows $\chi^2_n$ since $\frac{X_1-\mu}{\sigma},\dots,\frac{X_n-\mu}{\sigma}$ are independent $\mathcal N(0,1)$ random variables. Then, we write $W$ as $W = \sum_{i=1}^n \left(\frac{(X_i - \bar X) + (\bar X - \mu)}{\sigma}\right)^2 = \frac{(n-1)S^2}{\sigma^2} + \left(\frac{\bar X - \mu}{\sigma/\sqrt n}\right)^2$ (the cross term vanishes since $\sum_{i=1}^n (X_i - \bar X) = 0$). Applying the definition of the chi-squared distribution, we have $\left(\frac{\bar X - \mu}{\sigma/\sqrt n}\right)^2 \sim \chi^2_1$.
By (i), $\bar X$ and $S^2$ are independent. Thus, $\left(\frac{\bar X - \mu}{\sigma/\sqrt n}\right)^2$ (a function of $\bar X$) is independent from $\frac{(n-1)S^2}{\sigma^2}$ (a function of $S^2$). Now, let $U = \frac{(n-1)S^2}{\sigma^2}$ and $V = \left(\frac{\bar X - \mu}{\sigma/\sqrt n}\right)^2$. Since $U$ and $V$ are independent, and also we have $W = U + V$ from the above derivation, the mgf $M_W(t) = M_U(t)\,M_V(t)$. Since $W \sim \chi^2_n$ and $V \sim \chi^2_1$, we can further write $(1-2t)^{-n/2} = M_U(t)\,(1-2t)^{-1/2}$, which implies that the mgf of $U$ is $(1-2t)^{-(n-1)/2}$, which is exactly the mgf of $\chi^2_{n-1}$. Hence, $\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}$.
(iii) We will use the following definition of the $t$-distribution $t_n$: $\frac{Z}{\sqrt{W/n}} \sim t_n$ where $Z \sim \mathcal N(0,1)$, $W \sim \chi^2_n$, and $Z$ and $W$ are independent.
Using this definition, it is easy to prove (iii) with (ii), as follows: By (ii), $\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}$. Also, we know that $\frac{\bar X - \mu}{\sigma/\sqrt n} \sim \mathcal N(0,1)$ and $\frac{(n-1)S^2}{\sigma^2}$ are independent since $\bar X$ and $S^2$ are independent by (i). Then, it follows by the above definition that $\frac{\bar X - \mu}{S/\sqrt n} = \frac{(\bar X - \mu)/(\sigma/\sqrt n)}{\sqrt{\frac{(n-1)S^2/\sigma^2}{n-1}}} \sim t_{n-1}$.
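As a quick sanity check (not a proof) of (ii) and (iii), one may simulate many normal samples and compare the empirical quantiles of $\frac{(n-1)S^2}{\sigma^2}$ and $\frac{\bar X - \mu}{S/\sqrt n}$ with those of $\chi^2_{n-1}$ and $t_{n-1}$; the sample size, parameters and number of replications below are arbitrary choices:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, mu, sigma = 8, 5.0, 2.0
reps = 100_000

samples = rng.normal(mu, sigma, size=(reps, n))
x_bar = samples.mean(axis=1)
s2 = samples.var(axis=1, ddof=1)          # sample variance S^2

u = (n - 1) * s2 / sigma**2               # should behave like chi^2_{n-1}
t = (x_bar - mu) / np.sqrt(s2 / n)        # should behave like t_{n-1}

# Compare a few empirical quantiles with the theoretical ones.
for q in (0.05, 0.5, 0.95):
    print(f"q={q}: chi2 empirical {np.quantile(u, q):.3f} "
          f"vs theoretical {stats.chi2.ppf(q, n - 1):.3f}; "
          f"t empirical {np.quantile(t, q):.3f} "
          f"vs theoretical {stats.t.ppf(q, n - 1):.3f}")
```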
Using this proposition, we can prove the following theorem. Again, before discussing this confidence interval, let us introduce a notation:
- $t_{n,\alpha}$ is the upper percentile of $t_n$ at level $\alpha$, i.e. it satisfies $P(T > t_{n,\alpha}) = \alpha$ where $T \sim t_n$.
Template:Colored theorem Template:Colored remark
Proof. By (iii) in the previous proposition, we have $T = \frac{\bar X - \mu}{S/\sqrt n} \sim t_{n-1}$. Since the distribution of $T$ does not depend on $\mu$, $T$ is a pivotal quantity of $\mu$. Hence, we set $1-\alpha = P\left(-t_{n-1,\alpha/2} \le T \le t_{n-1,\alpha/2}\right)$ where $t_{n-1,\alpha/2}$ is a constant (the $t$-distribution is symmetric about $0$, so we have $-t_{n-1,\alpha/2} = t_{n-1,1-\alpha/2}$). It follows that $1-\alpha = P\left(\bar X - t_{n-1,\alpha/2}\frac{S}{\sqrt n} \le \mu \le \bar X + t_{n-1,\alpha/2}\frac{S}{\sqrt n}\right)$. The result follows.
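A minimal sketch in Python of the resulting interval $\bar x \pm t_{n-1,\alpha/2}\,\frac{s}{\sqrt n}$ (the data are made up for illustration):

```python
import numpy as np
from scipy import stats

x = np.array([12.1, 9.8, 11.4, 10.7, 13.0, 10.2, 11.9, 12.5])  # illustrative data
alpha = 0.05

n = len(x)
x_bar = x.mean()
s = x.std(ddof=1)                               # sample standard deviation
t_crit = stats.t.ppf(1 - alpha / 2, df=n - 1)   # t_{n-1, alpha/2}

lower = x_bar - t_crit * s / np.sqrt(n)
upper = x_bar + t_crit * s / np.sqrt(n)
print(f"95% confidence interval for mu (sigma unknown): [{lower:.3f}, {upper:.3f}]")
```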
Template:Colored example Template:Colored example
Difference in means of two normal distributions
Sometimes, apart from estimating the mean of a single normal distribution, we would like to estimate the difference in means of two normal distributions for making comparisons. For example, apart from estimating the mean amount of time (lifetime) for a bulb until it burns out, we are often interested in estimating the difference between the mean lifetimes of two different bulbs, so that we know which of the bulbs lasts longer on average, and hence which bulb has a higher "quality".
First, let us discuss the case where the two normal distributions are independent.
Now, the problem is how we should construct a confidence interval for the difference in the two means. It seems that we can just construct a confidence interval for each of the two means respectively, say $[L_1, U_1]$ for $\mu_1$ and $[L_2, U_2]$ for $\mu_2$, each with confidence level $1-\alpha$. Then, a natural guess for the confidence interval for $\mu_1 - \mu_2$ is $[L_1 - U_2,\ U_1 - L_2]$. However, this is indeed incorrect since, when we have $P(L_1 \le \mu_1 \le U_1) = 1-\alpha$ and $P(L_2 \le \mu_2 \le U_2) = 1-\alpha$, it does not mean that $P(L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2) = 1-\alpha$ (there are no results in probability that justify this).
On the other hand, it seems that since the events $\{L_1 \le \mu_1 \le U_1\}$ and $\{L_2 \le \mu_2 \le U_2\}$ are independent (since the normal distributions we are considering are independent), then we have $P(L_1 \le \mu_1 \le U_1 \text{ and } L_2 \le \mu_2 \le U_2) = P(L_1 \le \mu_1 \le U_1)\,P(L_2 \le \mu_2 \le U_2) = (1-\alpha)^2$. Then, when $L_1 \le \mu_1 \le U_1$ and $L_2 \le \mu_2 \le U_2$, we have $L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2$, so it seems that $P(L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2) = (1-\alpha)^2$, which would mean $[L_1 - U_2,\ U_1 - L_2]$ is a $100(1-\alpha)^2\%$ confidence interval.
However, this is actually also incorrect. The flaw is that "when $L_1 \le \mu_1 \le U_1$ and $L_2 \le \mu_2 \le U_2$, we have $L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2$" only means $\{L_1 \le \mu_1 \le U_1\} \cap \{L_2 \le \mu_2 \le U_2\} \subseteq \{L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2\}$ (we do not have the reverse subset inclusion in general). This in turn means $P(L_1 - U_2 \le \mu_1 - \mu_2 \le U_1 - L_2) \ge (1-\alpha)^2$, where the inequality may be strict. So, $[L_1 - U_2,\ U_1 - L_2]$ is actually not a $100(1-\alpha)^2\%$ confidence interval (in general).
So, the above two "methods" to construct confidence intervals for the difference in means of two independent normal distributions actually do not work. Indeed, we do not use the confidence interval for each of the two means, constructed previously, to construct a confidence interval for the difference in the two means. Instead, we consider a pivotal quantity of the difference in the two means, which is the standard way for constructing confidence intervals.
Template:Colored theorem Template:Colored remark Template:Colored exercise Now, we will prove the above theorem based on the result shown in the previous exercise:
Proof. Let $Z = \frac{(\bar X - \bar Y) - (\mu_1 - \mu_2)}{\sqrt{\sigma_1^2/n + \sigma_2^2/m}} \sim \mathcal N(0,1)$ (from the previous exercise). Then, $Z$ is a pivotal quantity of $\mu_1 - \mu_2$. Hence, we have $1-\alpha = P\left((\bar X - \bar Y) - z_{\alpha/2}\sqrt{\tfrac{\sigma_1^2}{n} + \tfrac{\sigma_2^2}{m}} \le \mu_1 - \mu_2 \le (\bar X - \bar Y) + z_{\alpha/2}\sqrt{\tfrac{\sigma_1^2}{n} + \tfrac{\sigma_2^2}{m}}\right).$
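A minimal sketch in Python of the resulting interval $(\bar x - \bar y) \pm z_{\alpha/2}\sqrt{\sigma_1^2/n + \sigma_2^2/m}$ (the lifetimes and the "known" variances below are made-up illustrative values):

```python
import numpy as np
from scipy import stats

# Illustrative lifetimes (hours) of two bulb brands; variances assumed KNOWN.
x = np.array([1020., 980., 1010., 995., 1030., 1005.])   # brand 1, n = 6
y = np.array([990., 975., 985., 1000., 970.])             # brand 2, m = 5
sigma1_sq, sigma2_sq = 400.0, 350.0                        # assumed known variances
alpha = 0.05

n, m = len(x), len(y)
diff = x.mean() - y.mean()
half_width = stats.norm.ppf(1 - alpha / 2) * np.sqrt(sigma1_sq / n + sigma2_sq / m)
print(f"95% CI for mu1 - mu2 (known variances): "
      f"[{diff - half_width:.2f}, {diff + half_width:.2f}]")
```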
Template:Colored example Now, we will consider the case where the variances are unknown. In this case, the construction of the confidence interval for the difference in means is more complicated, and even more complicated when $\sigma_1^2 \ne \sigma_2^2$. Thus, we will only discuss the case where the common variance $\sigma^2 = \sigma_1^2 = \sigma_2^2$ is unknown. As you may expect, we will also use some results mentioned previously for constructing a confidence interval for $\mu$ when $\sigma^2$ is unknown in this case. Template:Colored theorem Template:Colored remark
Proof. Let $Z = \frac{(\bar X - \bar Y) - (\mu_1 - \mu_2)}{\sigma\sqrt{1/n + 1/m}} \sim \mathcal N(0,1)$ (the reason for this is shown in a previous exercise). From a previous result, we know that $\frac{(n-1)S_1^2}{\sigma^2} \sim \chi^2_{n-1}$ and $\frac{(m-1)S_2^2}{\sigma^2} \sim \chi^2_{m-1}$. Then, we know that the mgf of $\frac{(n-1)S_1^2}{\sigma^2}$ is $(1-2t)^{-(n-1)/2}$ and the mgf of $\frac{(m-1)S_2^2}{\sigma^2}$ is $(1-2t)^{-(m-1)/2}$. Since the two random samples are independent, the mgf of $\frac{(n-1)S_1^2 + (m-1)S_2^2}{\sigma^2}$ is $(1-2t)^{-(n+m-2)/2}$. Hence, $\frac{(n-1)S_1^2 + (m-1)S_2^2}{\sigma^2} \sim \chi^2_{n+m-2}$.
By the independence of sample mean and sample variance ($\bar X$ and $S_1^2$ are independent, $\bar Y$ and $S_2^2$ are independent), we can deduce that $Z$ and $\frac{(n-1)S_1^2 + (m-1)S_2^2}{\sigma^2}$ are independent. Thus, by the definition of the $t$-distribution, $T = \frac{(\bar X - \bar Y) - (\mu_1 - \mu_2)}{S_p\sqrt{1/n + 1/m}}$ follows $t_{n+m-2}$, where $S_p^2 = \frac{(n-1)S_1^2 + (m-1)S_2^2}{n+m-2}$ is the pooled sample variance. Therefore, $T$ is a pivotal quantity of $\mu_1 - \mu_2$. Hence, we have $1-\alpha = P\left((\bar X - \bar Y) - t_{n+m-2,\alpha/2}\,S_p\sqrt{\tfrac{1}{n} + \tfrac{1}{m}} \le \mu_1 - \mu_2 \le (\bar X - \bar Y) + t_{n+m-2,\alpha/2}\,S_p\sqrt{\tfrac{1}{n} + \tfrac{1}{m}}\right)$. The result follows.
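A minimal sketch in Python of the pooled interval $(\bar x - \bar y) \pm t_{n+m-2,\alpha/2}\,s_p\sqrt{1/n + 1/m}$ (the data are made up, and a common unknown variance is assumed):

```python
import numpy as np
from scipy import stats

# Illustrative data; both populations are assumed normal with a COMMON unknown variance.
x = np.array([23.1, 25.4, 24.8, 22.9, 26.0, 24.1])
y = np.array([21.7, 23.3, 22.5, 24.0, 22.1])
alpha = 0.05

n, m = len(x), len(y)
s1_sq = x.var(ddof=1)
s2_sq = y.var(ddof=1)
sp_sq = ((n - 1) * s1_sq + (m - 1) * s2_sq) / (n + m - 2)   # pooled sample variance
t_crit = stats.t.ppf(1 - alpha / 2, df=n + m - 2)

diff = x.mean() - y.mean()
half_width = t_crit * np.sqrt(sp_sq * (1 / n + 1 / m))
print(f"95% CI for mu1 - mu2 (pooled): [{diff - half_width:.2f}, {diff + half_width:.2f}]")
```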
Now, what if the two normal distributions concerned are dependent? Clearly, we cannot use the above results anymore, and we need to develop a new method to construct a confidence interval for the difference in means in this case. In this case, we need to consider the notion of pairing.
Template:Colored proposition Template:Colored remark
Proof.
1. Independence of $D_1,\dots,D_n$:
Since the pairs $(X_1,Y_1),\dots,(X_n,Y_n)$ are independent, and each $D_i = X_i - Y_i$ is a function of the $i$th pair only, it follows that $D_1,\dots,D_n$ are independent, which is what we want to show.
2. $D_i$ follows a normal distribution with the stated mean and variance:
- To show that the $D_i$ still follow a normal distribution, we can consider the pdf of $D_i$ for each $i$. The pdf can be obtained using the transformation of random variables formula: e.g., let $U = X_i - Y_i$ and $V = Y_i$, obtain the joint pdf of $(U, V)$ from the joint pdf of $(X_i, Y_i)$, and then integrate out $V$. The resulting marginal pdf of $U$, which is the pdf of $D_i$, should be in the form of a normal distribution.
- However, since the actual derivation process is somewhat complicated, it is omitted here.
- Of course, the mean and variance of $D_i$ can be read from the pdf of $D_i$ determined previously. Alternatively, before determining the pdf of $D_i$, we can also see that the mean of $D_i$ is $\mu_1 - \mu_2$ (we use the linearity of expectation here, which does not require any independence assumption), and the variance of $D_i$ is $\operatorname{Var}(X_i) + \operatorname{Var}(Y_i) - 2\operatorname{Cov}(X_i, Y_i)$.
Template:Colored corollary Template:Colored remark
Proof. From the previous proposition, we know that $D_1,\dots,D_n$ is a random sample from a normal distribution with mean $\mu_1 - \mu_2$ and variance $\sigma_D^2$ (say). Since $\sigma_D^2$ is known, it follows from a previous theorem that a $100(1-\alpha)\%$ confidence interval for $\mu_1 - \mu_2$ is $\left[\bar D - z_{\alpha/2}\frac{\sigma_D}{\sqrt n},\ \bar D + z_{\alpha/2}\frac{\sigma_D}{\sqrt n}\right]$.
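A minimal sketch in Python of this paired approach (the measurements and the "known" value of $\sigma_D$ are made-up illustrative values; if $\sigma_D$ were unknown, one would instead apply the one-sample $t$ interval to the differences):

```python
import numpy as np
from scipy import stats

# Illustrative paired measurements (e.g. the same subjects measured twice).
x = np.array([7.2, 6.8, 7.5, 7.0, 6.9, 7.3, 7.1])
y = np.array([6.9, 6.5, 7.4, 6.8, 6.7, 7.0, 6.8])
d = x - y                       # paired differences D_i = X_i - Y_i
alpha = 0.05
n = len(d)

sigma_d = 0.2                   # assumed KNOWN standard deviation of the differences
z = stats.norm.ppf(1 - alpha / 2)
lower = d.mean() - z * sigma_d / np.sqrt(n)
upper = d.mean() + z * sigma_d / np.sqrt(n)
print(f"95% CI for mu1 - mu2 (paired, sigma_D known): [{lower:.3f}, {upper:.3f}]")

# If sigma_D were unknown, one would instead use the one-sample t interval
# on the differences: d.mean() +/- t_{n-1, alpha/2} * d.std(ddof=1) / sqrt(n).
```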
Template:Colored corollary Template:Colored remark Template:Colored exercise Template:Colored example
Confidence intervals for variances of normal distributions
Variance of a normal distribution
After discussing the confidence intervals for means of normal distributions, let us consider the confidence intervals for variances of normal distributions. Similarly, we need to consider a pivotal quantity of $\sigma^2$. Can you suggest a pivotal quantity of $\sigma^2$, based on a previous result discussed?
Recall that we have $\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}$ under some suitable assumptions, and its distribution does not depend on $\sigma^2$. Thus, this result gives us a pivotal quantity of $\sigma^2$, namely $\frac{(n-1)S^2}{\sigma^2}$. Before discussing the theorem for constructing a confidence interval for $\sigma^2$, let us introduce a notation:
- $\chi^2_{n,\alpha}$ is the upper percentile of $\chi^2_n$ at level $\alpha$, i.e. it satisfies $P(X > \chi^2_{n,\alpha}) = \alpha$ where $X \sim \chi^2_n$.
Some values of $\chi^2_{n,\alpha}$ can be found in the chi-squared table.
- To find the value of $\chi^2_{n,\alpha}$, locate the row for $n$ degrees of freedom and the column for "probability content" $\alpha$.
Template:Colored theorem Template:Colored remark
Proof. Since $\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}$, set $1-\alpha = P\left(\chi^2_{n-1,1-\alpha/2} \le \frac{(n-1)S^2}{\sigma^2} \le \chi^2_{n-1,\alpha/2}\right)$.[2] Then, we have $1-\alpha = P\left(\frac{(n-1)S^2}{\chi^2_{n-1,\alpha/2}} \le \sigma^2 \le \frac{(n-1)S^2}{\chi^2_{n-1,1-\alpha/2}}\right)$. The result follows.
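A minimal sketch in Python of the interval $\left[\frac{(n-1)s^2}{\chi^2_{n-1,\alpha/2}},\ \frac{(n-1)s^2}{\chi^2_{n-1,1-\alpha/2}}\right]$ (the data are made up for illustration):

```python
import numpy as np
from scipy import stats

x = np.array([4.9, 5.3, 5.1, 4.7, 5.6, 5.0, 5.2, 4.8])   # illustrative data
alpha = 0.05

n = len(x)
s2 = x.var(ddof=1)
chi2_upper = stats.chi2.ppf(1 - alpha / 2, df=n - 1)   # chi^2_{n-1, alpha/2}
chi2_lower = stats.chi2.ppf(alpha / 2, df=n - 1)       # chi^2_{n-1, 1-alpha/2}

lower = (n - 1) * s2 / chi2_upper
upper = (n - 1) * s2 / chi2_lower
print(f"95% CI for sigma^2: [{lower:.4f}, {upper:.4f}]")
print(f"95% CI for sigma:   [{np.sqrt(lower):.4f}, {np.sqrt(upper):.4f}]")
```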
Template:Colored example Template:Colored remark
Ratio of variances of two independent normal distributions
Similar to the case for means, we would also sometimes like to compare the variances of two normal distributions. One may naturally expect that we should construct a confidence interval for the difference in variances, similar to the case for means. However, there is no simple way to do this, since we do not have results that help with such a construction. Therefore, we need to consider an alternative way to compare the variances, without using the difference in variances. Can you suggest a way?
Recall the definition of relative efficiency in point estimation. Relative efficiency gives us a nice way to compare two variances without considering their difference: the ratio of the two variances is considered instead. Fortunately, we have some results that help us to construct a confidence interval for the ratio of two variances.
Recall the definition of the $F$-distribution: if $U \sim \chi^2_m$ and $V \sim \chi^2_n$ are independent, then $\frac{U/m}{V/n}$ follows the $F$-distribution with $m$ and $n$ degrees of freedom, denoted by $F_{m,n}$. From the definition of the $F$-distribution, we can see that it involves a ratio of two independent chi-squared random variables. How can it be linked to the ratio of two variances?
Recall that we have $\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}$ under some suitable assumptions. This connects the variance with a chi-squared random variable, and thus we can use this property together with the definition of the $F$-distribution to construct a pivotal quantity, and hence a confidence interval.
Let us introduce a notation before discussing the construction of the confidence interval:
- $F_{m,n,\alpha}$ is the upper percentile of $F_{m,n}$ at level $\alpha$, i.e. it satisfies $P(X > F_{m,n,\alpha}) = \alpha$ where $X \sim F_{m,n}$.
Some values of $F_{m,n,\alpha}$ can be found in $F$-tables (there are different $F$-tables for different values of $\alpha$, and the row and column of each table indicate the first and second degrees of freedom respectively). Also, using the property that $F_{m,n,1-\alpha} = \frac{1}{F_{n,m,\alpha}}$, we can obtain some more values which are not included in the $F$-tables. Template:Colored theorem Template:Colored remark
Proof. By the assumptions, we have $\frac{(n-1)S_1^2}{\sigma_1^2} \sim \chi^2_{n-1}$ and $\frac{(m-1)S_2^2}{\sigma_2^2} \sim \chi^2_{m-1}$, and they are independent. Thus, by the definition of the $F$-distribution, we have $\frac{S_2^2/\sigma_2^2}{S_1^2/\sigma_1^2} \sim F_{m-1,n-1}$, which is a pivotal quantity of $\sigma_1^2/\sigma_2^2$. Hence, we have $1-\alpha = P\left(F_{m-1,n-1,1-\alpha/2} \le \frac{S_2^2/\sigma_2^2}{S_1^2/\sigma_1^2} \le F_{m-1,n-1,\alpha/2}\right) = P\left(\frac{S_1^2}{S_2^2}F_{m-1,n-1,1-\alpha/2} \le \frac{\sigma_1^2}{\sigma_2^2} \le \frac{S_1^2}{S_2^2}F_{m-1,n-1,\alpha/2}\right)$, as desired.
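A minimal sketch in Python of the interval $\left[\frac{s_1^2}{s_2^2}F_{m-1,n-1,1-\alpha/2},\ \frac{s_1^2}{s_2^2}F_{m-1,n-1,\alpha/2}\right]$, following the quantile convention used in the proof above (the data are made up for illustration):

```python
import numpy as np
from scipy import stats

x = np.array([10.2, 9.8, 10.5, 10.1, 9.9, 10.4, 10.0])   # sample 1 (size n)
y = np.array([11.0, 10.3, 11.4, 10.8, 10.6, 11.1])        # sample 2 (size m)
alpha = 0.05

n, m = len(x), len(y)
ratio_hat = x.var(ddof=1) / y.var(ddof=1)                  # S1^2 / S2^2

# Upper percentiles of F_{m-1, n-1}: P(X > f) = level.
f_upper = stats.f.ppf(1 - alpha / 2, m - 1, n - 1)         # F_{m-1, n-1, alpha/2}
f_lower = stats.f.ppf(alpha / 2, m - 1, n - 1)             # F_{m-1, n-1, 1-alpha/2}

lower = ratio_hat * f_lower
upper = ratio_hat * f_upper
print(f"95% CI for sigma1^2 / sigma2^2: [{lower:.3f}, {upper:.3f}]")
```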
Apart from using this confidence interval to compare variances (or standard deviations), it can also be useful to check some assumptions about variances (e.g. the equal-variance assumption used earlier). Let us illustrate these two usages in the following examples. Template:Colored example Template:Colored example Template:Colored remark
Approximated confidence intervals for means
Previously, the population distributions were assumed to be normal, but the distributions are often not normal in reality. So, does it mean our previous discussions are meaningless in reality? No. The discussions are indeed still quite meaningful, since we can use the central limit theorem to "connect" the distributions in reality (which are usually not normal) to the normal distribution. Through this, we can construct approximated confidence intervals, since we use the central limit theorem for approximation.
To be more precise, recall that the central limit theorem suggests that $\frac{\bar X - \mu}{\sigma/\sqrt n} \overset{d}{\to} \mathcal N(0,1)$ under some suitable assumptions. Therefore, if the sample size is large enough (a rule of thumb: at least 30), then $\frac{\bar X - \mu}{\sigma/\sqrt n}$ follows approximately the standard normal distribution. Hence, it is a pivotal quantity (approximately). Recall from the property of the normal distribution that if $X_1,\dots,X_n$ is a random sample from $\mathcal N(\mu,\sigma^2)$, then we have $\frac{\bar X - \mu}{\sigma/\sqrt n} \sim \mathcal N(0,1)$ exactly (not approximately), and we have used this for the pivotal quantity for the confidence interval for the mean when the variance is known, and also the confidence interval for $\mu_1 - \mu_2$ when the variances are known. Therefore, we can just use basically the same confidence intervals in these cases, but we need to notice that such confidence intervals are approximated, not exact, since we have used the central limit theorem for constructing the pivotal quantity.
Now, how about the other confidence intervals where the pivotal quantity is "not in this form"? In the confidence interval for the difference in means when the variance is unknown, the pivotal quantity is similar in some sense (see the corresponding theorem for the meaning of the notation involved). Can we use the central limit theorem to conclude that, when the distributions involved are not normal (but are still independent), and the sample sizes $n$ and $m$ are both large enough, this pivotal quantity also approximately follows the standard normal distribution? The answer is yes. For the proof, see the following exercise. Template:Colored exercise As a result, we know that we can again just use basically the same confidence interval in this case, but of course such a confidence interval is approximated.
There are still some confidence intervals that have not been considered yet. Let us first consider the confidence interval for the mean when the variance is unknown.
Recall that we have mentioned that we can simply replace the "$\sigma$" by "$S$" according to the weak law of large numbers, which is quite intuitive. But why can we do this? Consider the following theorem. Template:Colored theorem Template:Colored remark
Proof. Under the assumption that the random sample has finite mean and variance, applying the weak law of large numbers gives $S \overset{p}{\to} \sigma$ (we have shown that $S^2 \overset{p}{\to} \sigma^2$, then we can just apply the continuous mapping theorem to get this). Hence, $\frac{\sigma}{S} \overset{p}{\to} 1$ by properties of convergence in probability.
By the central limit theorem, we have $\frac{\bar X - \mu}{\sigma/\sqrt n} \overset{d}{\to} \mathcal N(0,1)$. Thus, $\frac{\bar X - \mu}{S/\sqrt n} = \frac{\sigma}{S}\cdot\frac{\bar X - \mu}{\sigma/\sqrt n} \overset{d}{\to} \mathcal N(0,1)$ by Slutsky's theorem.
Therefore, $\frac{\bar X - \mu}{S/\sqrt n}$ is a pivotal quantity, which approximately follows $\mathcal N(0,1)$. Notice that its approximate distribution, $\mathcal N(0,1)$, is the same as that of the pivotal quantity for the confidence interval for $\mu$ when $\sigma^2$ is known, namely $\frac{\bar X - \mu}{\sigma/\sqrt n}$. As a result, we can use similar steps to obtain the approximated confidence interval, where "$\sigma$" is replaced by "$S$".
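A minimal sketch in Python of the approximated interval $\bar x \pm z_{\alpha/2}\,\frac{s}{\sqrt n}$ for a non-normal population (the data are simulated from an exponential distribution purely for illustration):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.exponential(scale=3.0, size=200)   # non-normal population, "large" sample
alpha = 0.05

n = len(x)
s = x.std(ddof=1)
z = stats.norm.ppf(1 - alpha / 2)
lower = x.mean() - z * s / np.sqrt(n)
upper = x.mean() + z * s / np.sqrt(n)
print(f"Approximate 95% CI for the mean (true mean is 3.0): [{lower:.3f}, {upper:.3f}]")
```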
So far, we have not discussed how to construct approximated confidence intervals for some remaining cases, for instance approximated confidence intervals for variances. Since the pivotal quantities used there are constructed from results that are exclusive to normal distributions, they do not work when the distributions involved are not normal. Therefore, there are no simple ways to perform such constructions.
The following table summarizes the approximated confidence intervals in different cases: Template:Colored remark Template:Colored example Let us consider an application of the approximated confidence intervals. Template:Colored proposition Template:Colored remark
Proof. Since $X_i \sim \operatorname{Ber}(p)$, by the fundamental bridge between probability and expectation, we have $p = P(X_i = 1) = \mathbb E[X_i] = \mu$.
Applying the result for constructing an approximated confidence interval for the mean when the variance is unknown (the variance of $X_i$ is $p(1-p)$ since $X_i$ actually follows the Bernoulli distribution), an approximated $100(1-\alpha)\%$ confidence interval for $p$ is $\left[\hat p - z_{\alpha/2}\sqrt{\frac{\hat p(1-\hat p)}{n}},\ \hat p + z_{\alpha/2}\sqrt{\frac{\hat p(1-\hat p)}{n}}\right]$, where $\hat p = \bar X$ is the sample proportion.
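A minimal sketch in Python of this approximated confidence interval for a proportion (the counts are made up for illustration):

```python
import numpy as np
from scipy import stats

successes, n = 270, 450          # illustrative counts: 270 "successes" out of 450 trials
alpha = 0.05

p_hat = successes / n
z = stats.norm.ppf(1 - alpha / 2)
half_width = z * np.sqrt(p_hat * (1 - p_hat) / n)
print(f"Approximate 95% CI for p: [{p_hat - half_width:.3f}, {p_hat + half_width:.3f}]")
```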
Template:Colored example Template:Colored example
- ↑ Usually, we choose $a$ and $b$ such that $P(Q < a) = \alpha/2$ and $P(Q > b) = \alpha/2$ because of convenience (if the pdf of $Q$ is symmetric about $0$, then we know that $a = -b$).
- ↑ We need to do this since the chi-squared distribution is not symmetric about its mean. Graphically, it looks like
      |           area: 1-a
      |      #        |
      |     | #....#   v
      |     # .......#
      |     # |..........#
      |    #  |..........|  #
      *---*----------*------
        chi^2        chi^2
         1-a/2        a/2