Statistics/Preliminaries

From testwiki
Revision as of 05:44, 9 December 2022 by imported>LeoChiukl

Template:Nav This chapter discusses some preliminary knowledge (related to statistics) for the following chapters in the advanced part.

Empirical distribution

Template:Colored definition Template:Colored remark Since all these n random variables follow the same cdf as X, we may expect their distribution to be somewhat similar to the distribution of X, and indeed this is true. Before showing how this is true, we need to define "the distribution of these n random variables" more precisely, as follows: Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark Template:Colored theorem Template:Colored remark We have mentioned how we can approximate the cdf, and now we would like to estimate the Template:Colored em/Template:Colored em. Let us first discuss how to estimate the pmf.

For the discrete random variable <math>X</math>, from the empirical cdf, we know that each of <math>X_1,\dots,X_n</math> is "assigned" the probability <math>1/n</math>. Also, considering the previous example, the empirical pmf is <math>f_n(x)=\frac{\sum_{k=1}^n \mathbf{1}\{X_k=x\}}{n}</math>. Template:Colored remark To discuss the estimation of the pdf of a continuous random variable, we need to define Template:Colored em first. Template:Colored definition For the continuous random variable <math>X</math>, construct class intervals for <math>X</math> which form a non-overlapping partition of the interval <math>[X_{\min},X_{\max}]</math>, in which <math>X_{\min}</math> and <math>X_{\max}</math> are the minimum and maximum values in the sample. Then, the pdf <math>f(x)\approx\frac{F(c_j)-F(c_{j-1})}{c_j-c_{j-1}}</math> for <math>x\in(c_{j-1},c_j]</math> and <math>j=1,2,\dots,i</math>, when <math>c_{j-1}</math> and <math>c_j</math> are close, i.e. the length of each class interval is small. (Although the union of the above class intervals is <math>(c_0,c_i]</math> and thus the value <math>c_0</math> is not included in the interval, it does not matter since the value of the pdf at <math>c_0</math> does not affect the calculation of probability.) Here, <math>c_0</math> is <math>X_{\min}</math> and <math>c_i</math> is <math>X_{\max}</math>.
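As a quick illustration, the empirical cdf and empirical pmf can be computed directly from a sample. The following is a minimal sketch; the Binomial sample, the seed, and the function names are hypothetical choices, not from the text:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical discrete sample: 200 draws from a Binomial(5, 0.4) distribution.
sample = rng.binomial(5, 0.4, size=200)
n = len(sample)

def empirical_cdf(sample, x):
    """F_n(x) = (1/n) * sum over k of 1{X_k <= x}."""
    return np.mean(sample <= x)

def empirical_pmf(sample, x):
    """f_n(x) = (1/n) * sum over k of 1{X_k = x}."""
    return np.mean(sample == x)

# The empirical cdf is a step function increasing from 0 to 1.
print(empirical_cdf(sample, -1))  # 0.0: no observation is <= -1
print(empirical_cdf(sample, 5))   # 1.0: every observation is <= 5
# The empirical pmf values over the support sum to 1.
print(sum(empirical_pmf(sample, x) for x in range(6)))
```

Each observed value is "assigned" probability <math>1/n</math>, so summing the empirical pmf over the support always gives exactly 1.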

Since <math>F(c_j)-F(c_{j-1})=\mathbb{P}(X\in(c_{j-1},c_j])\approx\frac{\sum_{k=1}^n \mathbf{1}\{X_k\in(c_{j-1},c_j]\}}{n}</math> is the relative frequency of occurrences of the event <math>\{X_k\in(c_{j-1},c_j]\}</math>, we can rewrite the above expression as <math>f(x)\approx h_n(x)=\frac{\sum_{k=1}^n \mathbf{1}\{X_k\in(c_{j-1},c_j]\}}{n(c_j-c_{j-1})},\quad x\in(c_{j-1},c_j]\text{ and }j=1,2,\dots,i,</math> in which <math>h_n(x)</math> is called the Template:Colored em.

Since there are many possible ways to construct the class intervals, the value of <math>h_n(x)</math> can differ even with the same <math>n</math> and <math>x</math>. When <math>n</math> is Template:Colored em and the length of each class interval is Template:Colored em, we expect <math>h_n(x)</math> to be a good estimate of <math>f(x)</math> (the theoretical pdf).

There are some properties related to the relative frequency histogram, as follows: Template:Colored proposition

Proof.

(i) Since the indicator function is nonnegative (its value is either 0 or 1), <math>n</math> is positive, and <math>c_j>c_{j-1}</math> so <math>c_j-c_{j-1}</math> is positive, we have <math>h_n(x)\ge 0</math> by definition.

(ii) <math>\int_{c_0}^{c_i}h_n(x)\,dx=\int_{c_0}^{c_1}h_n(x)\,dx+\int_{c_1}^{c_2}h_n(x)\,dx+\dots+\int_{c_{i-1}}^{c_i}h_n(x)\,dx
=\frac{1}{n}\left(\int_{c_0}^{c_1}\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_0,c_1]\}}{c_1-c_0}\,dx+\int_{c_1}^{c_2}\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_1,c_2]\}}{c_2-c_1}\,dx+\dots+\int_{c_{i-1}}^{c_i}\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_{i-1},c_i]\}}{c_i-c_{i-1}}\,dx\right)
=\frac{1}{n}\left(\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_0,c_1]\}}{c_1-c_0}(c_1-c_0)+\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_1,c_2]\}}{c_2-c_1}(c_2-c_1)+\dots+\frac{\sum_{k=1}^n\mathbf{1}\{X_k\in(c_{i-1},c_i]\}}{c_i-c_{i-1}}(c_i-c_{i-1})\right)
=\frac{1}{n}\left(\sum_{k=1}^n\mathbf{1}\{X_k\in(c_0,c_1]\}+\sum_{k=1}^n\mathbf{1}\{X_k\in(c_1,c_2]\}+\dots+\sum_{k=1}^n\mathbf{1}\{X_k\in(c_{i-1},c_i]\}\right)
=\frac{1}{n}\sum_{k=1}^n\mathbf{1}\{X_k\in(c_0,c_1]\cup(c_1,c_2]\cup\dots\cup(c_{i-1},c_i]\}
=\frac{1}{n}\sum_{k=1}^n\mathbf{1}\{X_k\in\underbrace{(c_0,c_i]}_{\text{sample space of }X}\}
=\frac{1}{n}\sum_{k=1}^n 1=\frac{1}{n}\cdot n=1.</math> Here, <math>c_0</math> is <math>X_{\min}</math> and <math>c_i</math> is <math>X_{\max}</math>.

(iii) We can "split" the integral in a similar way as in (ii), and then eventually the integral equals <math>\frac{1}{n}\sum_{k=1}^n\mathbf{1}\{X_k\in A\}</math>, and it can approximate <math>\mathbb{P}(A)</math> since it is the relative frequency of occurrences of the event <math>\{X_k\in A\}</math>.
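The three properties can be checked numerically on a concrete sample. A minimal sketch, assuming a hypothetical Exp(1) sample and 40 equal-length class intervals (all choices illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
sample = rng.exponential(scale=1.0, size=5000)  # hypothetical continuous sample
n = len(sample)

# Class intervals (c_0, c_1], ..., (c_{i-1}, c_i] partitioning [min, max].
edges = np.linspace(sample.min(), sample.max(), 41)  # 40 equal-length intervals
counts, _ = np.histogram(sample, bins=edges)
widths = np.diff(edges)
h = counts / (n * widths)  # the relative frequency histogram h_n on each interval

# (i) nonnegativity and (ii) total integral equal to one
assert np.all(h >= 0)
print(np.sum(h * widths))  # 1.0 up to floating point

# (iii) the integral over A = (edges[0], edges[5]] approximates P(X in A)
approx = np.sum(h[:5] * widths[:5])
exact = np.exp(-edges[0]) - np.exp(-edges[5])  # P(A) for the Exp(1) distribution
print(abs(approx - exact))  # small
```

Property (ii) holds up to rounding for any sample, since every observation falls in exactly one class interval; property (iii) improves as <math>n</math> grows.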

Expectation

In this section, we will discuss some results about expectation, which involve various inequalities. Let <math>a</math> and <math>b</math> be constants. Also, let <math>\Omega</math> be the sample space of <math>X</math>.

Template:Colored proposition

Proof. Assume <math>\mathbb{P}(a<X\le b)=1</math>.

Case 1: X is discrete.

By definition of expectation, <math>\mathbb{E}[X]=\sum_{x\in\Omega}x f(x)</math>. Then, we have <math>\sum_{x\in\Omega}a f(x)<\sum_{x\in\Omega}x f(x)\le\sum_{x\in\Omega}b f(x)\implies a\sum_{x\in\Omega}f(x)<\mathbb{E}[X]\le b\sum_{x\in\Omega}f(x)\implies a<\mathbb{E}[X]\le b,</math> since <math>\sum_{x\in\Omega}f(x)=1</math> because of the condition <math>\mathbb{P}(a<X\le b)=1</math>.

Case 2: X is continuous.

Similarly, we have <math>\int_{\Omega}a f(x)\,dx<\int_{\Omega}x f(x)\,dx\le\int_{\Omega}b f(x)\,dx\implies a<\mathbb{E}[X]\le b,</math> since <math>\int_{\Omega}f(x)\,dx=1</math> because of the condition <math>\mathbb{P}(a<X\le b)=1</math>.

Template:Colored remark Template:Colored proposition

Proof. <math>\frac{\mathbb{E}[X]}{a}=\frac{1}{a}\int_{0}^{\infty}x f(x)\,dx\ge\frac{1}{a}\int_{a}^{\infty}x f(x)\,dx\ge\frac{1}{a}\int_{a}^{\infty}a f(x)\,dx=\int_{a}^{\infty}f(x)\,dx=\mathbb{P}(X\ge a),</math> as desired.

Template:Colored corollary

Proof. First, observe that <math>X^2</math> is a nonnegative random variable. Then, by Markov's inequality, for the (positive) value <math>a'=a^2</math>, we have <math>\mathbb{P}(X^2\ge a')\le\frac{\mathbb{E}[X^2]}{a'}\implies\mathbb{P}(X^2\ge a^2)\le\frac{\mathbb{E}[X^2]}{a^2}\implies\mathbb{P}(|X|\ge a)\le\frac{\mathbb{E}[X^2]}{a^2},</math> since <math>a</math> is positive (so <math>X^2\ge a^2\iff|X|\ge a</math>).
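Both bounds can be verified on a sample, since the same argument applies to the empirical distribution. A minimal sketch; the Exp(1) sample and the threshold <math>a=2</math> are hypothetical choices:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical nonnegative sample: Exp(1), so E[X] = 1 and E[X^2] = 2.
x = rng.exponential(1.0, size=100_000)
a = 2.0

markov_bound = x.mean() / a             # sample analogue of E[X]/a
tail = np.mean(x >= a)                  # sample analogue of P(X >= a)
print(tail <= markov_bound)             # Markov's inequality: True

chebyshev_bound = np.mean(x**2) / a**2  # sample analogue of E[X^2]/a^2
tail_abs = np.mean(np.abs(x) >= a)      # sample analogue of P(|X| >= a)
print(tail_abs <= chebyshev_bound)      # the corollary: True
```

Note that both comparisons hold deterministically for any sample, not just approximately: they are Markov's inequality applied to the empirical distribution.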

Template:Colored proposition

Proof. Let <math>L(x)=a+bx</math> be the tangent line to the function <math>g(x)</math> at <math>x=\mathbb{E}[X]</math>. Then, since <math>g</math> is convex, we have <math>g(x)\ge L(x)</math> for each <math>x</math> (informally, we can observe this graphically). As a result, we have <math>\int_{\Omega}g(x)f(x)\,dx\ge\int_{\Omega}L(x)f(x)\,dx\implies\mathbb{E}[g(X)]\ge\mathbb{E}[L(X)]=\mathbb{E}[a+bX]=a+b\,\mathbb{E}[X]=L(\mathbb{E}[X])=g(\mathbb{E}[X]),</math> since <math>L(x)</math> is tangent to <math>g(x)</math> at <math>x=\mathbb{E}[X]</math>, as desired.
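Jensen's inequality also holds for the empirical distribution of any sample, so it can be illustrated directly. A minimal sketch, using the convex function <math>g(x)=e^x</math> and a hypothetical N(0, 1) sample:

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.normal(0.0, 1.0, size=100_000)  # hypothetical sample

# g(x) = exp(x) is convex, so E[g(X)] >= g(E[X]) by Jensen's inequality.
lhs = np.exp(x).mean()   # estimates E[e^X] = e^{1/2} ≈ 1.65 for N(0, 1)
rhs = np.exp(x.mean())   # g applied to the sample mean, ≈ g(E[X]) = 1
print(lhs >= rhs)        # True for any sample
```

The gap between the two sides reflects how "curved" <math>g</math> is over the range where <math>X</math> puts its probability.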

Template:Colored theorem

Proof. <math>0\le\mathbb{E}\big[(X\,\mathbb{E}[Y^2]-Y\,\mathbb{E}[XY])^2\big]=\mathbb{E}\big[X^2\underbrace{(\mathbb{E}[Y^2])^2}_{\text{constant}}-2XY\underbrace{\mathbb{E}[Y^2]\mathbb{E}[XY]}_{\text{constant}}+Y^2\underbrace{(\mathbb{E}[XY])^2}_{\text{constant}}\big]=(\mathbb{E}[Y^2])^2\mathbb{E}[X^2]-2\mathbb{E}[Y^2]\mathbb{E}[XY]\mathbb{E}[XY]+(\mathbb{E}[XY])^2\mathbb{E}[Y^2]=\mathbb{E}[Y^2]\big(\mathbb{E}[X^2]\mathbb{E}[Y^2]-2(\mathbb{E}[XY])^2+(\mathbb{E}[XY])^2\big)=\mathbb{E}[Y^2]\big(\mathbb{E}[X^2]\mathbb{E}[Y^2]-(\mathbb{E}[XY])^2\big).</math> Since <math>\mathbb{E}[Y^2]\ge 0</math>, we must have <math>\mathbb{E}[X^2]\mathbb{E}[Y^2]-(\mathbb{E}[XY])^2\ge 0\implies(\mathbb{E}[XY])^2\le\mathbb{E}[X^2]\mathbb{E}[Y^2]</math> (when <math>\mathbb{E}[Y^2]=0</math>, we have <math>Y=0</math> almost surely and the inequality holds trivially).
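The Cauchy-Schwarz inequality holds exactly for the sample moments of any pair of samples. A minimal sketch with a hypothetical correlated pair (the coefficients are illustrative):

```python
import numpy as np

rng = np.random.default_rng(4)

# Hypothetical correlated pair (X, Y): Y = 0.5 X + noise.
x = rng.normal(size=100_000)
y = 0.5 * x + rng.normal(size=100_000)

# Sample analogue of (E[XY])^2 <= E[X^2] E[Y^2].
lhs = np.mean(x * y) ** 2
rhs = np.mean(x**2) * np.mean(y**2)
print(lhs <= rhs)  # True for any pair of samples
```

Equality would occur only if one sample were an exact scalar multiple of the other, mirroring the equality condition <math>X=cY</math> almost surely.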

Template:Colored example

Convergence

Before discussing convergence, we will define some terms that will be used later. Template:Colored definition Template:Colored remark In a Template:Colored em, say <math>x_1,\dots,x_n</math>, we observe Template:Colored em values of their sample mean, <math>\bar{x}=\frac{\sum_{i=1}^n x_i}{n}</math>, and sample variance, <math>s^2=\frac{\sum_{i=1}^n(x_i-\bar{x})^2}{n}</math>. Template:Colored em, each of the values is only Template:Colored em realization of the respective random variables <math>\bar{X}</math> and <math>S^2</math>. We should notice the difference between these definite values (not random variables) and the statistics (random variables).

To explain the definitions of the sample mean <math>\bar{X}</math> and sample variance <math>S^2</math> more intuitively, consider the following.

Recall that the empirical cdf <math>F_n(x)</math> assigns probability <math>\frac{1}{n}</math> to each of the random sample <math>X_1,\dots,X_n</math>. Thus, by the definition of mean and variance, the Template:Colored em of a random variable, say <math>Y</math>, with this cdf <math>F_n(x)</math> (and hence with the corresponding pmf <math>f_n(x)</math>) is <math>\sum_{i=1}^n\left(X_i\cdot\frac{1}{n}\right)=\bar{X}</math>. Similarly, the Template:Colored em of <math>Y</math> is <math>\sum_{i=1}^n\left((X_i-\bar{X})^2\cdot\frac{1}{n}\right)=S^2</math>. In other words, the Template:Colored em and Template:Colored em of the empirical distribution, which corresponds to the Template:Colored em, are the Template:Colored em <math>\bar{X}</math> and the Template:Colored em <math>S^2</math> respectively, which is quite natural, right? Template:Colored remark Also, recall that the empirical cdf <math>F_n(x)</math> can well approximate the cdf of <math>X</math>, <math>F(x)</math>, when <math>n</math> is large. Since <math>\bar{X}</math> and <math>S^2</math> are the mean and variance of a random variable with cdf <math>F_n(x)</math>, it is natural to expect that <math>\bar{X}</math> and <math>S^2</math> can well approximate the mean and variance of <math>X</math>.
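The identification of the sample mean and sample variance with the mean and variance of the empirical distribution can be checked on a small realization. A minimal sketch (the five values are a hypothetical realization):

```python
import numpy as np

x = np.array([2.0, 3.0, 5.0, 7.0, 11.0])  # a hypothetical realization
n = len(x)

# Mean of the empirical distribution: each observed value carries probability 1/n.
mean_emp = np.sum(x * (1 / n))
# Variance of the empirical distribution (note the divisor n, not n - 1).
var_emp = np.sum((x - mean_emp) ** 2 * (1 / n))

print(np.isclose(mean_emp, x.mean()))      # agrees with the sample mean
print(np.isclose(var_emp, x.var(ddof=0)))  # agrees with the sample variance (divisor n)
```

Note that the sample variance defined here uses divisor <math>n</math>, matching the variance of the empirical distribution; the common divisor-<math>n-1</math> version would correspond to `x.var(ddof=1)` instead.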

Convergence in probability

Template:Colored definition Template:Colored remark The following theorem, namely Template:Colored em, is an important theorem which is related to convergence in probability. Template:Colored theorem

Proof. We use <math>S_n</math> to denote <math>\sum_{i=1}^n X_i</math>.

By definition, <math>\bar{X}\xrightarrow{p}\mu</math> as <math>n\to\infty</math> is equivalent to <math>\mathbb{P}\left(\left|\frac{S_n}{n}-\mu\right|>\varepsilon\right)\to 0</math> as <math>n\to\infty</math> for every <math>\varepsilon>0</math>.

By Chebyshev's inequality, we have <math>\mathbb{P}\left(\left|\frac{S_n}{n}-\mu\right|>\varepsilon\right)\le\frac{1}{\varepsilon^2}\mathbb{E}\left[\left(\frac{S_n}{n}-\mu\right)^2\right]=\frac{1}{\varepsilon^2}\mathbb{E}\left[\left(\frac{S_n-n\mu}{n}\right)^2\right]=\frac{1}{n^2\varepsilon^2}\mathbb{E}\left[(S_n-n\mu)^2\right]=\frac{1}{n^2\varepsilon^2}\mathbb{E}\left[\left(\sum_{i=1}^n(X_i-\mu)\right)^2\right]=\frac{1}{n^2\varepsilon^2}\mathbb{E}\left[\sum_{i=1}^n\sum_{j=1}^n(X_i-\mu)(X_j-\mu)\right]=\frac{1}{n^2\varepsilon^2}\left(\mathbb{E}\left[\sum_{i=1}^n(X_i-\mu)^2\right]+\mathbb{E}\left[\sum_{i=1}^n\sum_{j\ne i}(X_i-\mu)(X_j-\mu)\right]\right).</math>

Since <math>X_1,X_2,\dots</math> are Template:Colored em (and hence functions of them are also independent) and the expectation is multiplicative under independence, <math>\frac{1}{n^2\varepsilon^2}\left(\mathbb{E}\left[\sum_{i=1}^n(X_i-\mu)^2\right]+\mathbb{E}\left[\sum_{i=1}^n\sum_{j\ne i}(X_i-\mu)(X_j-\mu)\right]\right)=\frac{1}{n^2\varepsilon^2}\left(\mathbb{E}\left[\sum_{i=1}^n(X_i-\mu)^2\right]+\sum_{i=1}^n\sum_{j\ne i}\underbrace{\mathbb{E}[X_i-\mu]}_{=\,\mu-\mu\,=\,0}\underbrace{\mathbb{E}[X_j-\mu]}_{=\,\mu-\mu\,=\,0}\right)=\frac{1}{n^2\varepsilon^2}\sum_{i=1}^n\underbrace{\mathbb{E}[(X_i-\mu)^2]}_{=\,\sigma^2}=\frac{n\sigma^2}{n^2\varepsilon^2}=\frac{\sigma^2}{n\varepsilon^2}\to 0\quad\text{as }n\to\infty.</math> So, the probability <math>\mathbb{P}\left(\left|\frac{S_n}{n}-\mu\right|>\varepsilon\right)</math> is Template:Colored em an expression that tends to 0 as <math>n\to\infty</math>. Since the probability is nonnegative (<math>\ge 0</math>), it follows that the probability also tends to 0 as <math>n\to\infty</math>.
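The weak law can be visualized by estimating <math>\mathbb{P}(|\bar{X}_n-\mu|>\varepsilon)</math> by Monte Carlo for growing <math>n</math>. A minimal sketch; the Uniform(0, 1) distribution, the tolerance, and the repetition count are all hypothetical choices:

```python
import numpy as np

rng = np.random.default_rng(5)
mu, eps = 0.5, 0.1  # Uniform(0, 1) has mean 0.5; eps plays the role of epsilon

# Monte Carlo estimate of P(|X_bar_n - mu| > eps), 2000 repetitions per sample size.
probs = {}
for n in (10, 100, 1000):
    means = rng.uniform(0, 1, size=(2000, n)).mean(axis=1)
    probs[n] = np.mean(np.abs(means - mu) > eps)
    print(n, probs[n])
```

The estimated probabilities shrink toward 0 as <math>n</math> grows, in line with the bound <math>\sigma^2/(n\varepsilon^2)</math> derived above.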

Template:Colored remark There are also some properties of convergence in probability that help us determine what a complex expression converges to. Template:Colored proposition

Proof. Template:Colored em: Assume <math>X_n\xrightarrow{p}X</math> and <math>Y_n\xrightarrow{p}Y</math>. The continuous mapping theorem is proven first so that we can use it in the proof of the other properties (the proof is omitted here). Also, it can be shown that <math>(X_n,Y_n)\xrightarrow{p}(X,Y)</math> (joint convergence in probability; the definition is similar, except that the random variables become ordered pairs, so the interpretation of "<math>|Z_n-Z|</math>" becomes the Template:Colored em between the two points in the Cartesian coordinate system which are represented by the ordered pairs).

After that, we define <math>g(z_1,z_2)=az_1+bz_2</math>, <math>g(z_1,z_2)=z_1z_2</math>, and <math>g(z_1,z_2)=z_1/z_2</math> respectively, where each of these functions is continuous and <math>a,b</math> are constants. Then, applying the continuous mapping theorem using each of these functions gives us the first three results.

Convergence in distribution

Template:Colored definition Template:Colored remark A very important theorem in statistics which is related to convergence in distribution is Template:Colored em. Template:Colored theorem

Proof. A (lengthy) proof can be found in Probability/Transformation of Random Variables#Central limit theorem.
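The central limit theorem can be illustrated by simulation: standardized means of even a markedly skewed distribution look approximately N(0, 1) when <math>n</math> is large. A minimal sketch; the Exp(1) distribution and the sizes are hypothetical choices:

```python
import numpy as np

rng = np.random.default_rng(6)

# Standardized means of Exp(1) samples (mu = sigma = 1) for n = 200.
n, reps = 200, 50_000
samples = rng.exponential(1.0, size=(reps, n))
z = (samples.mean(axis=1) - 1.0) / (1.0 / np.sqrt(n))

print(z.mean())            # near 0
print(z.std())             # near 1
print(np.mean(z <= 1.96))  # near Phi(1.96) ≈ 0.975, the N(0, 1) cdf at 1.96
```

Despite the strong skewness of Exp(1), the distribution of the standardized mean is already close to standard normal at <math>n=200</math>.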

There are some properties of convergence in distribution, but they are a bit different from the properties of convergence in probability. These properties are given by Template:Colored em, and also by the continuous mapping theorem. Template:Colored theorem

Proof. Omitted.

Template:Colored theorem

Proof. Template:Colored em: Assume <math>X_n\xrightarrow{d}X</math> and <math>Y_n\xrightarrow{p}c</math>. Then, it can be shown that <math>(X_n,Y_n)\xrightarrow{d}(X,c)</math> (joint convergence in distribution; the definition is similar, except that the cdf's become joint cdf's of ordered pairs). After that, we define <math>g(z_1,z_2)=z_1+z_2</math>, <math>g(z_1,z_2)=z_1z_2</math>, and <math>g(z_1,z_2)=z_1/z_2</math> respectively, where each of these functions is continuous, and then applying the continuous mapping theorem using each of these functions gives us the three desired results.
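A simulation sketch of the product case: if <math>X_n\xrightarrow{d}N(0,1)</math> (here via the CLT for Uniform(0, 1) means) and <math>Y_n\xrightarrow{p}2</math>, then <math>X_nY_n\xrightarrow{d}N(0,4)</math>. All distributions and sizes below are hypothetical choices:

```python
import numpy as np

rng = np.random.default_rng(7)
reps, n = 50_000, 500

# X_n: standardized Uniform(0, 1) sample means, approximately N(0, 1) by the CLT.
x_n = np.sqrt(n) * (rng.uniform(0, 1, size=(reps, n)).mean(axis=1) - 0.5) / np.sqrt(1 / 12)
# Y_n: a sequence collapsing onto the constant 2 (converges in probability to 2).
y_n = 2 + rng.normal(0, 1 / np.sqrt(n), size=reps)

prod = x_n * y_n
print(prod.std())  # near 2, the standard deviation of the limit N(0, 4)
```

The empirical standard deviation of the product is close to 2, consistent with the limit distribution <math>N(0,4)</math>.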

Template:Colored remark

Resampling

By Template:Colored em, we mean creating new samples based on an existing sample. Now, let us consider the following for a general overview of the procedure of resampling.

Suppose <math>X_1,\dots,X_n</math> is a Template:Colored em from a distribution of a random variable <math>X</math> with cdf <math>F(x)</math>. Let <math>x_1,\dots,x_n</math> be a corresponding Template:Colored em of the random sample <math>X_1,\dots,X_n</math>. Based on this realization, we also have a Template:Colored em of the empirical cdf: <math>\frac{1}{n}\sum_{k=1}^n\mathbf{1}\{x_k\le x\}</math> [1]. Since this is a realization of the empirical cdf, by the Glivenko-Cantelli theorem, it is a good estimate of the cdf <math>F(x)</math> when <math>n</math> is large [2]. In other words, if we denote by <math>X^*</math> the random variable with the same cdf as that Template:Colored em of the empirical cdf, <math>X^*</math> and <math>X</math> have similar distributions when <math>n</math> is large.

Notice that a realization of the empirical cdf is a Template:Colored em cdf (since the support <math>x_1,\dots,x_n</math> is countable). We now draw a Template:Colored em (called the bootstrap (or resampling) random sample) with sample size <math>B</math> (called the Template:Colored em), <math>X_1^*,\dots,X_B^*</math>, from the distribution of the random variable <math>X^*</math> (<math>X^*</math> comes from Template:Colored em from <math>X</math>, so the behaviour of sampling from <math>X^*</math> is called Template:Colored em).

Then, the relative frequency histogram of <math>X_1^*,\dots,X_B^*</math> should be close to the corresponding Template:Colored em of the empirical pmf of <math>X^*</math> (found from the realization of the empirical cdf of <math>X^*</math>), which is close to the pdf <math>f(x)</math> of <math>X</math>. This means the relative frequency histogram of <math>X_1^*,\dots,X_B^*</math> is close to the pdf <math>f(x)</math> of <math>X</math>.

In particular, since the cdf of <math>X^*</math>, <math>F_n(x)</math>, assigns probability <math>1/n</math> to each of <math>x_1,\dots,x_n</math> [3], the pmf of <math>X^*</math> is <math>\mathbb{P}(X^*=x_i)=\frac{1}{n},\quad i=1,2,\dots,n.</math> Notice that this pmf is quite simple, and therefore it makes the related calculations simpler. For example, in the following, we want to know the distribution of <math>T^*=g(X_1^*,\dots,X_n^*)</math>, and this simple pmf makes the resulting distribution also quite simple.

Template:Colored remark In the following, we will discuss an application of the bootstrap method (or Template:Colored em) mentioned above, namely using the bootstrap method to Template:Colored em the distribution of a statistic <math>T=g(X_1,X_2,\dots,X_n)</math> (the inputs of the function are random variables and <math>g</math> is a function). The reason for approximating, rather than finding the distribution exactly, is that the latter is usually infeasible (or may be too complicated).

To do this, consider the "bootstrapped statistic" <math>T^*=g(X_1^*,X_2^*,\dots,X_n^*)</math> and the statistic <math>T=g(X_1,X_2,\dots,X_n)</math>, where <math>X_1^*,X_2^*,\dots,X_n^*</math> is the bootstrap random sample (with bootstrap sample size <math>n</math>) from the distribution of <math>X^*</math>, and <math>X_1,X_2,\dots,X_n</math> is the random sample from the distribution of <math>X</math>. When <math>n</math> is large, since the distribution of <math>X^*</math> is similar to that of <math>X</math>, the bootstrap random sample <math>X_1^*,X_2^*,\dots,X_n^*</math> and the random sample <math>X_1,X_2,\dots,X_n</math> are also similar. It follows that <math>T^*</math> and <math>T</math> are similar as well, or to be more precise, the Template:Colored em of <math>T^*</math> and <math>T</math> are close. As a result, we can utilize the distribution of <math>T^*</math> (which is easier to find and simpler, since the pmf of <math>X^*</math> is simple as above) to approximate the distribution of <math>T</math>. A procedure to do this is as follows:

  1. Generate a Template:Colored em <math>x_1^*,x_2^*,\dots,x_n^*</math> from the Template:Colored em <math>X_1^*,X_2^*,\dots,X_n^*</math>, which is from the distribution of <math>X^*</math>.
  2. Calculate a realization of the bootstrapped statistic <math>T^*</math>: <math>t^*=g(x_1^*,x_2^*,\dots,x_n^*)</math>.
  3. Repeat steps 1. to 2. <math>j</math> times to get a sequence of <math>j</math> realizations of <math>T^*</math>: <math>t_1^*,t_2^*,\dots,t_j^*</math>.
  4. Plot the relative frequency histogram of the <math>j</math> realizations <math>t_1^*,t_2^*,\dots,t_j^*</math>.
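The steps above can be sketched in code. Drawing from the realized empirical cdf is the same as sampling the observed values with replacement, probability <math>1/n</math> each; the Exp(2) data, the choice <math>g=\text{mean}</math>, and the repetition count are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(8)

# Hypothetical observed realization x_1, ..., x_n of the random sample.
x = rng.exponential(2.0, size=50)
n = len(x)

# Bootstrap the distribution of T = the sample mean (g = mean, as an example).
j = 5000  # number of repetitions of steps 1-2
t_star = np.empty(j)
for b in range(j):
    # Step 1: draw x_1*, ..., x_n* from the realized empirical cdf,
    # i.e. sample the observed values with replacement, probability 1/n each.
    resample = rng.choice(x, size=n, replace=True)
    # Step 2: compute a realization of the bootstrapped statistic T*.
    t_star[b] = resample.mean()
# Steps 3-4: t_star now holds j realizations of T*; its relative frequency
# histogram approximates the distribution of T* and hence of T.

print(t_star.mean())  # close to the observed sample mean
print(t_star.std())   # a bootstrap estimate of the standard error of T
```

In practice the spread of the <math>t^*</math> values (their standard deviation) is the quantity of interest, since it estimates the sampling variability of <math>T</math> without knowing <math>F(x)</math>.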

This histogram of the <math>j</math> realizations (which are a realization of a random sample from <math>T^*</math> with sample size <math>j</math>) is close to the pmf of <math>T^*</math> [4], and thus close to the pmf of <math>T</math>. Template:Nav Template:BookCat

  1. ↑ This is different from the empirical cdf <math>\frac{1}{n}\sum_{k=1}^n\mathbf{1}\{X_k\le x\}</math>.
  2. ↑ By the Glivenko-Cantelli theorem, the empirical cdf is a good estimate of the cdf <math>F(x)</math> regardless of what the actual values (realization) of the random sample are, i.e. each realization of the empirical cdf is a good estimate of the cdf <math>F(x)</math> when <math>n</math> is large.
  3. ↑ That is, for a realization of the random sample <math>X_1,X_2,\dots,X_n</math>, say <math>x_1,x_2,\dots,x_n</math>, the probability for <math>X^*</math> to equal each of <math>x_1,x_2,\dots,x_n</math> (which correspond to the realizations of <math>X_1,X_2,\dots,X_n</math> respectively) is <math>1/n</math>.
  4. ↑ The reason is similar to that mentioned above: the histogram should be close to the pmf of <math>T^*</math> since the cdf corresponding to the histogram (i.e. the realization of the empirical cdf of the random sample <math>T_1^*,T_2^*,\dots,T_j^*</math>) is close to the cdf of <math>T^*</math>.