Probability/Properties of Distributions


Template:Nav


Introduction

Recall that the pdf (or cdf) describes the random behaviour of a random variable Template:Colored em. However, we may sometimes find the pdf (or cdf) too complicated, and only want to know some Template:Colored em about the random variable. In view of this, we study some properties of distributions in this chapter, which provide Template:Colored em descriptions of the random behaviour of the random variable.

Some examples of such partial descriptions include

  • location (e.g. is the pdf 'located' to the left or to the right?),
  • dispersion (e.g. is the pdf 'sharp' or 'flat'?),
  • skewness (e.g. is the pdf symmetric, skewed to the left, or skewed to the right?), and
  • tail property (e.g. does the pdf have 'light' or 'heavy' tails?).

We can Template:Colored em describe them, but such descriptions are quite subjective and inaccurate. To give a more objective and accurate measure to such descriptions, we evaluate them Template:Colored em using some quantitative measures derived from the pdf (or cdf) of the random variable.

We will discuss some of these quantitative measures in this chapter. Among them, the Template:Colored em is the most important one, since many of the other properties are based upon the concept of Template:Colored em.

Expectation

Expectation has several alternative names, e.g. expected value and mean. Template:Colored definition Template:Colored remark Template:Colored example Template:Colored example Template:Colored exercise In the following, we introduce a useful result that gives the relationship between expectation and probability; using this result, we can employ expectation to ease the computation of probabilities. Template:Colored proposition

Proof. Let X=๐Ÿ{E}. Since X=๐Ÿ{E}Ber(โ„™(E)) (which is a discrete random variable), ๐”ผ[X]=0[โ„™(X=0)]+1[โ„™(X=1)]=โ„™(๐Ÿ{E}=1)=โ„™(E).

When there are multiple random variables involved, we may derive the joint pmf or pdf first to compute the expectation, but doing so can be quite difficult and complicated. In practice, we use the following theorem more often. Template:Colored theorem Template:Colored remark The proof is quite complicated, and hence we skip it. In the following, we introduce several properties of expectation that can help us simplify computations of expectations.

Template:Colored proposition

Proof.

Template:Colored em:

for continuous random variables $X,Y$,
\[
\mathbb{E}[\alpha X+\beta Y+\gamma]
=\iint(\alpha x+\beta y+\gamma)\underbrace{f(x,y)}_{\text{joint pdf}}\,dx\,dy
=\alpha\int x\underbrace{\int f(x,y)\,dy}_{\text{marginal pdf of }X}dx
+\beta\int y\underbrace{\int f(x,y)\,dx}_{\text{marginal pdf of }Y}dy
+\gamma\underbrace{\iint f(x,y)\,dx\,dy}_{=1}
=\alpha\underbrace{\int xf_X(x)\,dx}_{\mathbb{E}[X]}+\beta\underbrace{\int yf_Y(y)\,dy}_{\mathbb{E}[Y]}+\gamma
=\alpha\mathbb{E}[X]+\beta\mathbb{E}[Y]+\gamma.
\]
Similarly, for discrete random variables $X,Y$,
\[
\mathbb{E}[\alpha X+\beta Y+\gamma]
=\sum_x\sum_y(\alpha x+\beta y+\gamma)f(x,y)
=\alpha\sum_x x\sum_y f(x,y)+\beta\sum_y y\sum_x f(x,y)+\gamma\underbrace{\sum_x\sum_y f(x,y)}_{=1}
=\alpha\sum_x xf_X(x)+\beta\sum_y yf_Y(y)+\gamma
=\alpha\mathbb{E}[X]+\beta\mathbb{E}[Y]+\gamma.
\]
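Linearity can be checked exactly on a small example: with a finite joint pmf, both sides of the identity are finite sums. The following Python sketch uses an arbitrary, hypothetical joint pmf table $f(x,y)$.

```python
# Exact check of E[aX + bY + c] = a E[X] + b E[Y] + c on a small,
# arbitrary joint pmf f(x, y) (the table and constants are made up).
f = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}
a, b, c = 2.0, -3.0, 5.0

E_X = sum(x * p for (x, y), p in f.items())
E_Y = sum(y * p for (x, y), p in f.items())
E_lin = sum((a * x + b * y + c) * p for (x, y), p in f.items())

assert abs(E_lin - (a * E_X + b * E_Y + c)) < 1e-12
```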

Template:Colored em:

For a continuous random variable $X$, \[X\ge 0\implies\mathbb{E}[X]=\int_0^\infty xf_X(x)\,dx\ge 0.\] Similarly, for a discrete random variable $X$, \[X\ge 0\implies\mathbb{E}[X]=\sum_{x\ge 0}xf_X(x)\ge 0.\]

Template:Colored em:

For random variables $X,Y$ that are either both discrete or both continuous, \[X\ge Y\implies X-Y\ge 0\implies\mathbb{E}[X]-\mathbb{E}[Y]\overset{\text{linearity}}{=}\mathbb{E}[X-Y]\overset{\text{nonnegativity}}{\ge}0.\]

Template:Colored em:

\[-|X|\le X\le|X|\overset{\text{monotonicity}}{\implies}-\mathbb{E}[|X|]\le\mathbb{E}[X]\le\mathbb{E}[|X|]\implies|\mathbb{E}[X]|\le\mathbb{E}[|X|].\]

Template:Colored em:

For independent continuous random variables $X,Y$,
\[
\mathbb{E}[XY]=\iint xy\underbrace{f(x,y)}_{\text{joint pdf}}\,dx\,dy
=\iint xy\underbrace{f_X(x)f_Y(y)}_{\text{marginal pdfs}}\,dx\,dy
=\int yf_Y(y)\underbrace{\int xf_X(x)\,dx}_{\text{independent of }y}dy
=\int xf_X(x)\,dx\int yf_Y(y)\,dy
=\mathbb{E}[X]\mathbb{E}[Y].
\]
Similarly, for independent discrete random variables $X,Y$,
\[
\mathbb{E}[XY]=\sum_x\sum_y xy\underbrace{f(x,y)}_{\text{joint pmf}}
=\sum_y\sum_x xy\underbrace{f_X(x)f_Y(y)}_{\text{marginal pmfs}}
=\Big(\sum_x xf_X(x)\Big)\Big(\sum_y yf_Y(y)\Big)
=\mathbb{E}[X]\mathbb{E}[Y].
\]

Template:Colored remark

Mean of some distributions of a discrete random variable

Template:Colored proposition

Proof.

  • $\mathbb{E}[X]=\underbrace{0\cdot\mathbb{P}(X=0)}_{=0}+\underbrace{1\cdot\mathbb{P}(X=1)}_{=p}=p$.
  • Since $Y=X_1+\cdots+X_n$, in which $X_1,\dots,X_n$ are i.i.d. and follow $\operatorname{Ber}(p)$ [1],
  • $\mathbb{E}[Y]=\mathbb{E}[X_1+\cdots+X_n]\overset{\text{linearity}}{=}\mathbb{E}[X_1]+\cdots+\mathbb{E}[X_n]=\underbrace{p+\cdots+p}_{n\text{ times}}=np$.

Template:Colored proposition

Proof. ๐”ผ[X]=k=0k(λkeλk!)โ„™(X=k)=λ(0+k=1k1=0k(λk1eλk(k1)!)โ„™(X=k1))=λ(0+1)=λ.

Template:Colored proposition

Proof.

  • Since

\[
\mathbb{E}[X]=\sum_{k=0}^\infty k\underbrace{(1-p)^kp}_{\mathbb{P}(X=k)}
=\sum_{k=0}^\infty(k-1)(1-p)^kp+\underbrace{\sum_{k=0}^\infty(1-p)^kp}_{=1}
=\underbrace{(0-1)(1-p)^0p}_{=-p}+(1-p)\sum_{j=0}^\infty j(1-p)^{j}p+1
=-p+(1-p)\mathbb{E}[X]+1,
\]

  • it follows that $p\,\mathbb{E}[X]=1-p\implies\mathbb{E}[X]=\dfrac{1-p}{p}$.
  • Since $Y=X_1+\cdots+X_k$, in which $X_1,\dots,X_k$ are i.i.d. and follow $\operatorname{Geo}(p)$ [2],
  • $\mathbb{E}[Y]=\mathbb{E}[X_1]+\cdots+\mathbb{E}[X_k]=\underbrace{\frac{1-p}{p}+\cdots+\frac{1-p}{p}}_{k\text{ times}}=\frac{k(1-p)}{p}$.

Template:Colored proposition

Proof.

  • Since $X=X_1+\cdots+X_n$, in which $X_1,\dots,X_n\sim\operatorname{Ber}(K/N)$ (each of the Bernoulli r.v.'s indicates whether the corresponding draw is a type-1 ball, which has probability $K/N$ without knowing the results of the other draws [3], since each draw is equally likely to be any of the $N$ balls) [4],
  • it follows that $\mathbb{E}[X]=\mathbb{E}[X_1]+\cdots+\mathbb{E}[X_n]=\underbrace{\frac{K}{N}+\cdots+\frac{K}{N}}_{n\text{ times}}=\frac{nK}{N}$.
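A simulation sketch of the hypergeometric mean (Python, arbitrary parameters): draw $n$ balls without replacement from $N$ balls of which $K$ are of type 1, and average the type-1 counts.

```python
import random

# Simulate hypergeometric draws: sample n balls without replacement
# from N balls (K of them of "type 1") and count type-1 balls.
random.seed(2)

N, K, n, trials = 20, 8, 5, 40_000
balls = [1] * K + [0] * (N - K)

sample_mean = sum(sum(random.sample(balls, n)) for _ in range(trials)) / trials
# sample_mean should be close to n*K/N = 2.0
```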


Mean of some distributions of a continuous random variable

We will introduce the formulas for the means of some distributions of a Template:Colored em random variable, which are relatively simple. Template:Colored proposition

Proof. ๐”ผ[X]=abxbadx=12(ba)(b2a2)=(ba)(b+a)2(ba).

Template:Colored proposition

Proof.

  • It suffices to prove the formula for mean of gamma r.v.'s, since exponential and chi-squared r.v.'s are essentially special cases of gamma r.v.'s, and thus we can simply substitute some values into the formula for mean of gamma r.v.'s to obtain the formulas for them.
  • \[\mathbb{E}[X]=\int_0^\infty x\cdot\frac{\lambda^\alpha x^{\alpha-1}e^{-\lambda x}}{\Gamma(\alpha)}\,dx=\frac{\alpha}{\lambda}\underbrace{\int_0^\infty\frac{\lambda^{\alpha+1}x^{(\alpha+1)-1}e^{-\lambda x}}{\Gamma(\alpha+1)}\,dx}_{=F(\infty)=1,\ F\text{ is the cdf of }\operatorname{Gamma}(\alpha+1,\lambda)}=\frac{\alpha}{\lambda}.\]
  • Since $\operatorname{Exp}(\lambda)=\operatorname{Gamma}(1,\lambda)$, $\mathbb{E}[Y]=1/\lambda$ by substituting $\alpha=1$.
  • Since $\chi_\nu^2=\operatorname{Gamma}(\nu/2,1/2)$, $\mathbb{E}[Z]=\frac{\nu/2}{1/2}=\nu$ by substituting $\alpha=\nu/2$ and $\lambda=1/2$.
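A simulation sketch of the gamma mean $\alpha/\lambda$ (Python, arbitrary parameters). Note that Python's `random.gammavariate` is parametrized by shape and *scale*, so the rate $\lambda$ enters as $1/\lambda$.

```python
import random

# Simulate Gamma(alpha, lam) draws (rate parametrization) via
# random.gammavariate(shape, scale) with scale = 1/lam.
random.seed(3)

alpha, lam, trials = 3.0, 2.0, 50_000
sample_mean = sum(random.gammavariate(alpha, 1 / lam) for _ in range(trials)) / trials
# sample_mean should be close to alpha/lam = 1.5
```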

Template:Colored proposition

Proof.

  • We use an approach similar to that of the previous proof.

\[\mathbb{E}[X]=\int_0^1 x\cdot\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}x^{\alpha-1}(1-x)^{\beta-1}\,dx=\frac{\alpha}{\alpha+\beta}\underbrace{\int_0^1\frac{\Gamma(\alpha+\beta+1)}{\Gamma(\alpha+1)\Gamma(\beta)}x^{(\alpha+1)-1}(1-x)^{\beta-1}\,dx}_{=F(1)=1,\ F\text{ is the cdf of }\operatorname{Beta}(\alpha+1,\beta)}=\frac{\alpha}{\alpha+\beta}.\]

Template:Colored proposition

Proof. ๐”ผ[X]=๐”ผ[Xθ]+θby linearity,=θ+1π(xθ)11+(xθ)2dx=θ+1πu1+u2du,let u=xθdu=dx,=θ+1π(0u1+u2du+0u1+u2du)=θ+1π(12[ln(1+u2)]u=u=0+12[ln(1+u2)]u=0u=)=θ+1π(+undefined).

Template:Colored proposition

Proof.

  • Let $Z=\frac{X-\mu}{\sigma}\sim\mathcal{N}(0,1)$.
  • \[\mathbb{E}[Z]=\int_{-\infty}^\infty x\varphi(x)\,dx=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^\infty xe^{-x^2/2}\,dx=\frac{1}{\sqrt{2\pi}}\Big[-e^{-x^2/2}\Big]_{x=-\infty}^{x=\infty}=\frac{1}{\sqrt{2\pi}}(0-0)=0.\]
  • It follows that $\mathbb{E}[X]=\mathbb{E}[\sigma Z+\mu]=\sigma\underbrace{\mathbb{E}[Z]}_{=0}+\mu=\mu$.
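A simulation sketch of the normal mean via the standardization $X=\sigma Z+\mu$ used in the proof (Python; $\mu$ and $\sigma$ are arbitrary).

```python
import random

# Simulate X = sigma*Z + mu with Z ~ N(0, 1) and compare the
# sample mean against mu. Parameters are arbitrary.
random.seed(4)

mu, sigma, trials = -1.5, 2.0, 50_000
sample_mean = sum(sigma * random.gauss(0, 1) + mu for _ in range(trials)) / trials
# sample_mean should be close to mu = -1.5
```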

Examples

Template:Colored example Template:Colored exercise Let us illustrate the usefulness of the fundamental bridge between probability and expectation by giving a proof of the inclusion-exclusion principle using this bridge. Template:Colored example

Probability generating functions

An application of expectation is Template:Colored em. As suggested by its name, it can Template:Colored em probabilities in some sense. Template:Colored definition Template:Colored remark

Variance (and standard deviation)

Indeed, Template:Colored em is a special case of Template:Colored em, and is related to Template:Colored em in some sense. Template:Colored definition Template:Colored definition Template:Colored definition Since $(X-\mathbb{E}[X])^2$ is the squared deviation of the value of $X$ from its mean, in view of the definition of variance, we can see that variance measures the Template:Colored em (or Template:Colored em) of a distribution, since it is what we would Template:Colored em of the squared deviation if we were to take an observation of the random variable.

Another closely related term is Template:Colored em. Template:Colored definition Template:Colored remark Template:Colored proposition

Proof.

  • alternative expression for variance:
Let μ=๐”ผ[X] for clearer expression.

๐”ผ[(Xμ)2]=๐”ผ[X22Xμ+μ2]=๐”ผ[X2]2μ๐”ผ[X]μ+μ2=๐”ผ[X2]μ2, and the result follows.

  • invariance under change in location parameter:

Var(X+a)=๐”ผ[(X+a๐”ผ[X+a]๐”ผ[X]+a)2]=๐”ผ[(X๐”ผ[X])2]=Var(X).

  • nonnegativity: it follows from $(X-\mathbb{E}[X])^2\ge 0$ and the nonnegativity of expectation.
  • zero variance implies non-randomness:
Let μ=๐”ผ[X] for clearer expression. Consider the event En={|Xμ|n1}, in which n is a positive integer.
Since 0=Var(X)=๐”ผ[(Xμ)2]๐”ผ[(Xμ)2๐Ÿ{En}1]=๐”ผ[|Xa|2๐Ÿ{En}]๐”ผ[n2constant๐Ÿ{En}]=n20โ„™(En)00,
we have 0n2โ„™(En)00โ„™(En)0โ„™(En)=0.
Thus,

โ„™(|Xμ|>0Xμ)=โ„™(n=1En)=a lemmalimnโ„™(En)0=0โ„™(X=μ)=1โ„™(Xμ)0=1

  • additivity under independence:
For random variables $X$ and $Y$ that are independent with means $\mu,\nu$ respectively,

\[
\operatorname{Var}(X+Y)=\mathbb{E}[(X+Y-\mathbb{E}[X+Y])^2]
\overset{\text{linearity}}{=}\mathbb{E}[(X-\mu+Y-\nu)^2]
=\underbrace{\mathbb{E}[(X-\mu)^2]}_{\operatorname{Var}(X)}+\underbrace{\mathbb{E}[(Y-\nu)^2]}_{\operatorname{Var}(Y)}+2\,\mathbb{E}[(X-\mu)(Y-\nu)]
\overset{\text{linearity}}{=}\operatorname{Var}(X)+\operatorname{Var}(Y)+2\big(\mathbb{E}[XY]-\nu\,\mathbb{E}[X]-\mu\,\mathbb{E}[Y]+\mu\nu\big)
\overset{\text{independence of }X,Y}{=}\operatorname{Var}(X)+\operatorname{Var}(Y)+2\big(\underbrace{\mathbb{E}[X]\mathbb{E}[Y]}_{\mu\nu}-\nu\mu-\mu\nu+\mu\nu\big)
=\operatorname{Var}(X)+\operatorname{Var}(Y).
\] Thus, inductively, $\operatorname{Var}(X_1+\cdots+X_n)=\operatorname{Var}(X_1)+\operatorname{Var}(X_2+\cdots+X_n)=\cdots=\operatorname{Var}(X_1)+\cdots+\operatorname{Var}(X_n)$ if $X_1,\dots,X_n$ are independent.

Variance of some distributions of a discrete random variable

Template:Colored proposition

Proof.

  • $\mathbb{E}[X^2]=0\cdot\mathbb{P}(X=0)+1\cdot\underbrace{\mathbb{P}(X^2=1)}_{\mathbb{P}(X=1)}=p$, since $X$ is nonnegative.
  • It follows that $\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2=p-p^2=p(1-p)$.
  • Similar to the proof for the means of the Bernoulli and binomial r.v.'s, $Y=X_1+\cdots+X_n$, in which $X_1,\dots,X_n$ are i.i.d. and follow $\operatorname{Ber}(p)$.
  • Because of the Template:Colored em (from the i.i.d. property), $\operatorname{Var}(Y)=\underbrace{\operatorname{Var}(X_1)+\cdots+\operatorname{Var}(X_n)}_{n\text{ times}}=np(1-p)$.

Template:Colored proposition

Proof.

  • \[\mathbb{E}[X^2]=\sum_{k=0}^\infty k^2\underbrace{\left(\frac{\lambda^ke^{-\lambda}}{k!}\right)}_{\mathbb{P}(X=k)}=\lambda\Bigg(0+\sum_{k=1}^\infty\frac{k\,\lambda^{k-1}e^{-\lambda}}{(k-1)!}\Bigg)=\lambda\Bigg(\underbrace{\sum_{j=0}^\infty j\cdot\frac{e^{-\lambda}\lambda^{j}}{j!}}_{\mathbb{E}[X]=\lambda}+\underbrace{\sum_{j=0}^\infty\frac{e^{-\lambda}\lambda^{j}}{j!}}_{=1}\Bigg)=\lambda(\lambda+1),\] where we reindex with $j=k-1$ (so that $k=j+1$).
  • Hence, $\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2=\lambda(\lambda+1)-\lambda^2=\lambda$.

Template:Colored proposition

Proof.

  • Since

\[
\mathbb{E}[X^2]=\sum_{k=0}^\infty k^2\underbrace{(1-p)^kp}_{\mathbb{P}(X=k)}
=\sum_{k=0}^\infty((k-1)+1)^2(1-p)^kp
=\sum_{k=0}^\infty(k-1)^2(1-p)^kp+\sum_{k=0}^\infty 2(k-1)(1-p)^kp+\underbrace{\sum_{k=0}^\infty(1-p)^kp}_{=1}
=\underbrace{(0-1)^2(1-p)^0p}_{=p}+(1-p)\sum_{j=0}^\infty j^2(1-p)^{j}p+\underbrace{2(0-1)(1-p)^0p}_{=-2p}+2(1-p)\sum_{j=0}^\infty j(1-p)^{j}p+1
=p+(1-p)\mathbb{E}[X^2]-2p+2(1-p)\underbrace{\mathbb{E}[X]}_{(1-p)/p}+1
=(1-p)\mathbb{E}[X^2]+\frac{2(1-p)^2}{p}+1-p,
\]

  • it follows that $p\,\mathbb{E}[X^2]=\frac{2(1-p)^2}{p}+1-p\implies\mathbb{E}[X^2]=\frac{2(1-p)^2+p(1-p)}{p^2}$.
  • Hence, $\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2=\frac{2(1-p)^2+p(1-p)}{p^2}-\frac{(1-p)^2}{p^2}=\frac{(1-p)^2+p(1-p)}{p^2}=\frac{(1-p)(1-p+p)}{p^2}=\frac{1-p}{p^2}$.
  • Similarly, $Y=X_1+\cdots+X_k$, in which $X_1,\dots,X_k$ are i.i.d. and follow $\operatorname{Geo}(p)$ [5].
  • Because of the independence, $\operatorname{Var}(Y)=\operatorname{Var}(X_1)+\cdots+\operatorname{Var}(X_k)=\underbrace{\frac{1-p}{p^2}+\cdots+\frac{1-p}{p^2}}_{k\text{ times}}=\frac{k(1-p)}{p^2}$.

Variance of some distributions of a continuous random variable

Template:Colored proposition

Proof. Var(X)=๐”ผ[X2](๐”ผ[X])2=abx2badx(b+a2)2=1ba(b3/3a3/3)(a+b2)2=13(ba)(b3a3)(a+b2)2=13(ba)(ba)(b2+ba+a2)a2+2ab+b24=4b2+4ab+4a23b262ab3a212=b22ab+a212=(ba)212.

Template:Colored proposition

Proof.

  • Similarly, it suffices to prove the formula for variance of gamma r.v.'s.
  • \[\mathbb{E}[X^2]=\int_0^\infty x^2\cdot\frac{\lambda^\alpha x^{\alpha-1}e^{-\lambda x}}{\Gamma(\alpha)}\,dx=\frac{(\alpha+1)\alpha}{\lambda^2}\underbrace{\int_0^\infty\frac{\lambda^{\alpha+2}x^{(\alpha+2)-1}e^{-\lambda x}}{\Gamma(\alpha+2)}\,dx}_{=F(\infty)=1,\ F\text{ is the cdf of }\operatorname{Gamma}(\alpha+2,\lambda)}=\frac{(\alpha+1)\alpha}{\lambda^2}.\]
  • It follows that $\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2=\frac{(\alpha+1)\alpha}{\lambda^2}-\frac{\alpha^2}{\lambda^2}=\frac{\alpha}{\lambda^2}$.
  • Since $\operatorname{Exp}(\lambda)=\operatorname{Gamma}(1,\lambda)$, $\operatorname{Var}(Y)=1/\lambda^2$ by substituting $\alpha=1$.
  • Since $\chi_\nu^2=\operatorname{Gamma}(\nu/2,1/2)$, $\operatorname{Var}(Z)=\frac{\nu/2}{(1/2)^2}=2\nu$ by substituting $\alpha=\nu/2$ and $\lambda=1/2$.

Template:Colored proposition

Proof.

  • \[\mathbb{E}[X^2]=\int_0^1 x^2\cdot\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}x^{\alpha-1}(1-x)^{\beta-1}\,dx=\frac{(\alpha+1)\alpha}{(\alpha+\beta+1)(\alpha+\beta)}\underbrace{\int_0^1\frac{\Gamma(\alpha+\beta+2)}{\Gamma(\alpha+2)\Gamma(\beta)}x^{(\alpha+2)-1}(1-x)^{\beta-1}\,dx}_{=F(1)=1,\ F\text{ is the cdf of }\operatorname{Beta}(\alpha+2,\beta)}=\frac{(\alpha+1)\alpha}{(\alpha+\beta+1)(\alpha+\beta)}.\]
  • It follows that \[\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2=\frac{(\alpha+1)\alpha}{(\alpha+\beta+1)(\alpha+\beta)}-\frac{\alpha^2}{(\alpha+\beta)^2}=\frac{(\alpha+1)\alpha(\alpha+\beta)-\alpha^2(\alpha+\beta+1)}{(\alpha+\beta)^2(\alpha+\beta+1)}=\frac{\alpha(\alpha^2+\alpha\beta+\alpha+\beta-\alpha^2-\alpha\beta-\alpha)}{(\alpha+\beta)^2(\alpha+\beta+1)}=\frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}.\]

Template:Colored proposition

Proof. It follows from the proposition about the undefined mean of Cauchy r.v.'s and the formula $\operatorname{Var}(X)=\mathbb{E}[X^2]-(\mathbb{E}[X])^2$ (a term minus an undefined term is undefined).

Template:Colored proposition

Proof.

  • Let $Z=\frac{X-\mu}{\sigma}\sim\mathcal{N}(0,1)$.
  • \[\mathbb{E}[Z^2]=\int_{-\infty}^\infty x^2\varphi(x)\,dx=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^\infty x^2e^{-x^2/2}\,dx=-\frac{1}{\sqrt{2\pi}}\int_{-\infty}^\infty x\,d\big(e^{-x^2/2}\big)=-\frac{1}{\sqrt{2\pi}}\left(\Big[xe^{-x^2/2}\Big]_{-\infty}^{\infty}-\int_{-\infty}^\infty e^{-x^2/2}\,dx\right)\text{ by integration by parts}=-\frac{1}{\sqrt{2\pi}}\left(\underbrace{0-0}_{\substack{\text{since the exponential decays much faster}\\ \text{than linear growth, or by L'Hôpital's rule}}}-\int_{-\infty}^\infty e^{-x^2/2}\,dx\right)=\underbrace{\int_{-\infty}^\infty\varphi(x)\,dx}_{=\Phi(\infty)=1}=1.\]
  • It follows that $\operatorname{Var}(Z)=\mathbb{E}[Z^2]-(\mathbb{E}[Z])^2=1-0=1$.
  • Hence, $\operatorname{Var}(X)=\operatorname{Var}(\sigma Z+\mu)=\sigma^2\operatorname{Var}(Z)=\sigma^2$.

Template:Colored exercise

Coefficient of variation

Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark

Quantile

Then, we will discuss Template:Colored em. In particular, Template:Colored em and Template:Colored em range are quite related to Template:Colored em. Template:Colored definition Template:Colored remark The following are some terminologies related to Template:Colored em. Template:Colored definition Template:Colored example Template:Colored definition Template:Colored definition Template:Colored example Template:Colored definition Template:Colored em and Template:Colored em measure centrality and dispersion respectively. Recall that Template:Colored em and Template:Colored em measure the same things respectively. One advantage of Template:Colored em and Template:Colored em is Template:Colored em, since they are always defined, while Template:Colored em and Template:Colored em can be infinite, and they fail to measure centrality and dispersion on those occasions. However, Template:Colored em and Template:Colored em also have some disadvantages, e.g. they may be more difficult to compute, and may not be very accurate. Template:Colored example Template:Colored exercise

Mode

Mode is another measure of centrality. Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark

Covariance and correlation coefficients

In this section, we will discuss two important properties of Template:Colored em distributions, namely Template:Colored em and Template:Colored em. As we will see, covariance is related to variance in some sense, and the correlation coefficient is closely related to correlation. Template:Colored definition Template:Colored definition Both Template:Colored em and Template:Colored em measure Template:Colored em between $X$ and $Y$. As we will see, $\rho(X,Y)\in[-1,1]$, $X$ and $Y$ are more highly correlated as $|\rho(X,Y)|$ increases, and $X$ has a linear relationship with $Y$ if $|\rho(X,Y)|=1$.

Template:Colored proposition

Proof.

(i) \[\operatorname{Cov}(X,Y)=\mathbb{E}[(X-\mathbb{E}[X])(Y-\mathbb{E}[Y])]=\mathbb{E}[(Y-\mathbb{E}[Y])(X-\mathbb{E}[X])]=\operatorname{Cov}(Y,X)\] (ii) \[\operatorname{Cov}(X,X)=\mathbb{E}[(X-\mathbb{E}[X])(X-\mathbb{E}[X])]=\mathbb{E}[(X-\mathbb{E}[X])^2]=\operatorname{Var}(X)\] (iii) \[\operatorname{Cov}(X,Y)=\mathbb{E}[(X-\mathbb{E}[X])(Y-\mathbb{E}[Y])]=\mathbb{E}[XY-X\mathbb{E}[Y]-Y\mathbb{E}[X]+\mathbb{E}[X]\mathbb{E}[Y]]\overset{\text{linearity}}{=}\mathbb{E}[XY]-\mathbb{E}[Y]\mathbb{E}[X]-\mathbb{E}[X]\mathbb{E}[Y]+\mathbb{E}[X]\mathbb{E}[Y]=\mathbb{E}[XY]-\mathbb{E}[X]\mathbb{E}[Y]\] (iv) \[\operatorname{Cov}\Big(\sum_{i=1}^n(a_iX_i+c),\sum_{j=1}^m(b_jY_j+d)\Big)=\mathbb{E}\Big[\Big(\sum_{i=1}^n(a_iX_i+c)-\sum_{i=1}^n\mathbb{E}[a_iX_i+c]\Big)\Big(\sum_{j=1}^m(b_jY_j+d)-\sum_{j=1}^m\mathbb{E}[b_jY_j+d]\Big)\Big]=\mathbb{E}\Big[\sum_{i=1}^n(a_iX_i-\mathbb{E}[a_iX_i])\sum_{j=1}^m(b_jY_j-\mathbb{E}[b_jY_j])\Big]=\mathbb{E}\Big[\sum_{i=1}^n\sum_{j=1}^m(a_iX_i-\mathbb{E}[a_iX_i])(b_jY_j-\mathbb{E}[b_jY_j])\Big]\overset{\text{linearity}}{=}\sum_{i=1}^n\sum_{j=1}^m\mathbb{E}[(a_iX_i-a_i\mathbb{E}[X_i])(b_jY_j-b_j\mathbb{E}[Y_j])]=\sum_{i=1}^n\sum_{j=1}^m a_ib_j\,\mathbb{E}[(X_i-\mathbb{E}[X_i])(Y_j-\mathbb{E}[Y_j])]=\sum_{i=1}^n\sum_{j=1}^m a_ib_j\operatorname{Cov}(X_i,Y_j)\] (v) \[\operatorname{Var}\Big(\sum_{i=1}^nX_i\Big)\overset{\text{(ii)}}{=}\operatorname{Cov}\Big(\sum_{i=1}^nX_i,\sum_{j=1}^nX_j\Big)\overset{\text{(iv)}}{=}\sum_{i=1}^n\sum_{j=1}^n\operatorname{Cov}(X_i,X_j)=\sum_{1\le i=j\le n}\operatorname{Cov}(X_i,X_j)+\sum_{1\le i\ne j\le n}\operatorname{Cov}(X_i,X_j)\overset{\text{(ii)}}{=}\sum_{i=1}^n\operatorname{Var}(X_i)+\sum_{1\le i<j\le n}\operatorname{Cov}(X_i,X_j)+\sum_{1\le j<i\le n}\operatorname{Cov}(X_i,X_j)\overset{\text{(i)}}{=}\sum_{i=1}^n\operatorname{Var}(X_i)+2\sum_{1\le i<j\le n}\operatorname{Cov}(X_i,X_j)\]
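Property (iii), $\operatorname{Cov}(X,Y)=\mathbb{E}[XY]-\mathbb{E}[X]\mathbb{E}[Y]$, can be checked exactly on a small example; the joint pmf below is arbitrary.

```python
# Exact check of Cov(X, Y) = E[XY] - E[X]E[Y] on a small,
# arbitrary joint pmf f(x, y).
f = {(0, 1): 0.25, (1, 1): 0.25, (0, 2): 0.10, (1, 2): 0.40}

E_X = sum(x * p for (x, y), p in f.items())
E_Y = sum(y * p for (x, y), p in f.items())
E_XY = sum(x * y * p for (x, y), p in f.items())
cov_def = sum((x - E_X) * (y - E_Y) * p for (x, y), p in f.items())

assert abs(cov_def - (E_XY - E_X * E_Y)) < 1e-12
```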

Then, we will discuss Template:Colored em. The following is the definition of the Template:Colored em between two random variables. Template:Colored definition Template:Colored remark Covariance and the correlation coefficient are Template:Colored em, but they have differences. In particular, $\operatorname{Cov}(X,Y)$ depends on Template:Colored em of $X$ and $Y$, not just their relationship. Thus, this number is affected by the variances, and does not measure their relationship accurately. On the other hand, $\rho(X,Y)$ Template:Colored em for Template:Colored em of $X$ and $Y$, and therefore measures their relationship more Template:Colored em.

The following is one of the most important properties of correlation coefficient. Template:Colored proposition

Proof. For random variables $X$ and $Y$:

Template:Colored em: we prove that \[-1\le\rho(X,Y)\le 1\iff\left|\frac{\operatorname{Cov}(X,Y)}{\sqrt{\operatorname{Var}(X)\operatorname{Var}(Y)}}\right|\le 1.\] To get rid of the square root and make the proof neater, we square both sides of the inequality, and get \[\frac{\operatorname{Cov}(X,Y)^2}{\operatorname{Var}(X)\operatorname{Var}(Y)}\le 1\iff\operatorname{Cov}(X,Y)^2\le\operatorname{Var}(X)\operatorname{Var}(Y)\iff\operatorname{Var}(X)-\frac{\operatorname{Cov}(X,Y)^2}{\operatorname{Var}(Y)}\ge 0.\]

Recall that $\operatorname{Var}(\cdot)\ge 0$. So, one way to prove the rightmost inequality is to express its left side as $\operatorname{Var}(\cdot)$, as follows: \[\operatorname{Var}(X)-\frac{\operatorname{Cov}(X,Y)^2}{\operatorname{Var}(Y)}=\operatorname{Var}(X)+\left(\frac{\operatorname{Cov}(X,Y)}{\operatorname{Var}(Y)}\right)^2\operatorname{Var}(Y)-2\cdot\frac{\operatorname{Cov}(X,Y)}{\operatorname{Var}(Y)}\cdot\operatorname{Cov}(X,Y)\overset{\text{(iv),(v)}}{=}\operatorname{Var}\left(X-\frac{\operatorname{Cov}(X,Y)}{\operatorname{Var}(Y)}\,Y\right)\ge 0.\] Thus, the result follows.

Template:Colored remark Then, we will define several terminologies related to correlation coefficient. Template:Colored definition

Then, we will state an important result that is related to independence and correlation. Intuitively, you may think that 'independent' is the same as 'uncorrelated'. However, this is wrong. Indeed, 'independent' is Template:Colored em than 'uncorrelated'. Template:Colored proposition

Proof. For independent random variables $X,Y$ with means $\mu,\nu$ respectively, \[\operatorname{Cov}(X,Y)=\mathbb{E}[(X-\mu)(Y-\nu)]\overset{\text{independence}}{=}\mathbb{E}[X-\mu]\,\mathbb{E}[Y-\nu]\overset{\text{linearity}}{=}(\underbrace{\mathbb{E}[X]}_{\mu}-\mu)(\underbrace{\mathbb{E}[Y]}_{\nu}-\nu)=0.\]

However, the converse is Template:Colored em true, as we will see in the following example. Template:Colored example Template:Colored exercise Template:Nav
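One standard example of an uncorrelated-but-dependent pair (stated here as an illustration; it need not coincide with the example above) is $X$ uniform on $\{-1,0,1\}$ with $Y=X^2$: $Y$ is a function of $X$, yet $\operatorname{Cov}(X,Y)=\mathbb{E}[X^3]-\mathbb{E}[X]\mathbb{E}[X^2]=0$. A short Python sketch with exact arithmetic:

```python
from fractions import Fraction

# X uniform on {-1, 0, 1}, Y = X^2: clearly dependent (Y is a
# function of X), yet Cov(X, Y) = E[X^3] - E[X] E[X^2] = 0.
p = Fraction(1, 3)
support = [-1, 0, 1]

E_X = sum(x * p for x in support)           # = 0
E_Y = sum(x * x * p for x in support)       # = 2/3
E_XY = sum(x * x * x * p for x in support)  # E[X * X^2] = E[X^3] = 0

cov = E_XY - E_X * E_Y
assert cov == 0
# Dependence: P(Y = 0 | X = 0) = 1, while P(Y = 0) = 1/3.
```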

  1. ↑ Each of the Bernoulli r.v.'s acts as an indicator for the success of the corresponding trial. Since there are n independent Bernoulli trials, there are n such indicators.
  2. ↑ Each geometric r.v. counts the number of failures before the corresponding success.
  3. ↑ This probability is unconditional; hence the corresponding mean is also unconditional, so that their sum is also an unconditional mean (as in the proposition).
  4. ↑ $X_1,\dots,X_n$ are Template:Colored em, but we can still use the linearity of expectation, since it does not require independence.
  5. ↑ Each geometric r.v. counts the number of failures before the corresponding success.