Probability/Transformation of Random Variables

Transformation of random variables

Underlying principle

Let $X_{1}, \dots, X_{n}$ be $n$ random variables, $Y_{1}, \dots, Y_{n}$ be another $n$ random variables, and $𝐗 = (X_{1}, \dots, X_{n})^{T}, 𝐘 = (Y_{1}, \dots, Y_{n})^{T}$ be random (column) vectors.

Suppose the vector-valued function^[1] $𝐠 : supp (𝐗) \to supp (𝐘)$ is bijective (it is also called one-to-one correspondence in this case). Then, its inverse $𝐠^{- 1} : supp (𝐘) \to supp (𝐗)$ exists.

After that, we can transform $𝐗$ to $𝐘$ by applying the transformation $𝐠$ , i.e. by $𝐘 = 𝐠 (𝐗)$ , and transform $𝐘$ to $𝐗$ by applying the inverse transformation $𝐠^{- 1}$ , i.e. by $𝐗 = 𝐠^{- 1} (𝐘)$ .

We are often interested in deriving the joint probability function $f_{𝐘} (𝐲)$ of $𝐘$ , given the joint probability function $f_{𝐗} (𝐱)$ of $𝐗$ . We will examine the discrete and continuous cases one by one in the following.

Transformation of discrete random variables

Template:Colored proposition

Proof. Considering the original pmf $f_{𝐘} (𝐲)$ , we have $f_{𝐘} (𝐲) \overset{def}{=} ℙ (𝐘 = 𝐲) = ℙ (𝐠^{- 1} (𝐘) = 𝐠^{- 1} (𝐲)) = ℙ (𝐗 = 𝐠^{- 1} (𝐲)) \overset{def}{=} f_{𝐗} (𝐠^{- 1} (𝐲)), 𝐲 \in supp (𝐘) .$ In particular, the inverse $𝐠^{- 1}$ exists since $𝐠$ is bijective.

$◻$
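As a small illustration of the discrete case, the sketch below takes a hypothetical example that is not in the original text: $X$ is a fair die roll and $g (x) = 2 x + 1$, which is bijective from $supp (X)$ onto $supp (Y) = \{3, 5, \dots, 13\}$. The function names are assumptions chosen for this example only.

```python
from fractions import Fraction

# Hypothetical example: X is a fair die roll, and g(x) = 2x + 1.
support_x = range(1, 7)

def pmf_x(x):
    """pmf of X: uniform on {1, ..., 6}, zero elsewhere."""
    return Fraction(1, 6) if x in support_x else Fraction(0)

def g(x):
    return 2 * x + 1

def g_inv(y):
    """Inverse of g on supp(Y); returns None when y is not in supp(Y)."""
    x, r = divmod(y - 1, 2)
    return x if r == 0 and x in support_x else None

def pmf_y(y):
    """pmf of Y = g(X), computed as f_X(g^{-1}(y))."""
    x = g_inv(y)
    return pmf_x(x) if x is not None else Fraction(0)

support_y = [g(x) for x in support_x]
for y in support_y:
    print(y, pmf_y(y))
```

Each value in $supp (Y)$ inherits the probability of its unique preimage, so the pmf of $Y$ still sums to one.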

Transformation of continuous random variables

For continuous random variables, the situation is more complicated.

Let us investigate the case for univariate pdf, which is simpler. Template:Colored theorem

Proof. Under the assumption that $g$ is differentiable and strictly monotone, the cdf of $Y$ is $F_{Y} (y) = ℙ (g (X) \leq y) = \begin{cases} ℙ (X \leq g^{- 1} (y)) = F_{X} (g^{- 1} (y)), & g^{- 1} \text{ is increasing}; \\ ℙ (X \geq g^{- 1} (y)) = 1 - F_{X} (g^{- 1} (y)), & g^{- 1} \text{ is decreasing}. \end{cases}$ ( $g^{- 1}$ exists since $g$ is strictly monotone.) Differentiating both sides of the above equation (assuming the cdf's involved are differentiable) gives $f_{Y} (y) = \begin{cases} f_{X} (g^{- 1} (y)) \frac{d g^{- 1} (y)}{d y}, & g^{- 1} \text{ is increasing}; \\ - f_{X} (g^{- 1} (y)) \frac{d g^{- 1} (y)}{d y}, & g^{- 1} \text{ is decreasing}. \end{cases}$ Since $x = g^{- 1} (y)$ , we can write $\frac{d g^{- 1} (y)}{d y}$ as $\frac{d x}{d y}$ . This derivative is nonnegative when $g^{- 1}$ is increasing and nonpositive when $g^{- 1}$ is decreasing, so the two cases can be summarized in a single expression using the absolute value: $f_{Y} (y) = f_{X} (g^{- 1} (y)) \left| \frac{d x}{d y} \right|,$ where the absolute value is applied only to $\frac{d x}{d y}$ , since the pdf $f_{X}$ is already nonnegative.

$◻$
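The formula $f_{Y} (y) = f_{X} (g^{- 1} (y)) | \frac{d x}{d y} |$ can be checked numerically. The sketch below assumes a hypothetical example, $X \sim Exp (λ)$ with $λ = 2$ and $g (x) = \sqrt{x}$ (strictly increasing on $(0, \infty)$), and compares the formula with a finite-difference derivative of the cdf $F_{Y} (y) = F_{X} (g^{- 1} (y))$. The names and the choice of distribution are illustrative assumptions.

```python
import math

LAM = 2.0  # rate of X ~ Exp(LAM); an arbitrary choice for illustration

def pdf_x(x):
    return LAM * math.exp(-LAM * x) if x > 0 else 0.0

def cdf_x(x):
    return 1.0 - math.exp(-LAM * x) if x > 0 else 0.0

def g_inv(y):
    return y * y          # inverse of g(x) = sqrt(x) on (0, inf)

def dg_inv(y):
    return 2.0 * y        # dx/dy

def pdf_y(y):
    """f_Y(y) = f_X(g^{-1}(y)) |dx/dy| for y > 0."""
    return pdf_x(g_inv(y)) * abs(dg_inv(y)) if y > 0 else 0.0

def pdf_y_numeric(y, h=1e-6):
    """Central difference of F_Y(y) = F_X(g^{-1}(y)), as an independent check."""
    return (cdf_x(g_inv(y + h)) - cdf_x(g_inv(y - h))) / (2 * h)

for y in (0.3, 0.7, 1.2):
    print(y, pdf_y(y), pdf_y_numeric(y))
```

The two columns should agree up to the finite-difference error.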

Template:Colored remark Let us define Template:Colored em, and introduce several notations in the definition. Template:Colored definition Template:Colored remark Template:Colored example

Template:Colored theorem

Proof. Template:Colored em: Assume $𝐠$ is differentiable and bijective.

First, for any set $S \subseteq supp (𝐘)$ , $ℙ (𝐘 \in S) = \int \dots \int_{S} f_{𝐘} (𝐲) \, d y_{1} \dots d y_{n} . \quad (1)$

On the other hand, we have $ℙ (𝐘 \in S) = ℙ (𝐗 = 𝐠^{- 1} (𝐘) \in 𝐠^{- 1} (S)) = \int \dots \int_{𝐠^{- 1} (S)} f_{𝐗} (𝐱) \, d x_{1} \dots d x_{n},$ where $𝐠^{- 1} (S) = \{𝐱 \in supp (𝐗) : 𝐠 (𝐱) \in S\}$ is the preimage of the set $S$ under $𝐠$ .

Applying the change of variable formula to this integral (whose proof is advanced and uses our assumptions), we get $\int \dots \int_{𝐠^{- 1} (S)} f_{𝐗} (𝐱) \, d x_{1} \dots d x_{n} = \int \dots \int_{S} f_{𝐗} (𝐠^{- 1} (𝐲)) \left| \det \frac{\partial 𝐱}{\partial 𝐲} \right| d y_{1} \dots d y_{n} . \quad (2)$ Since $(1)$ and $(2)$ are equal for every such set $S$ , comparing the integrands gives the desired result.

$◻$
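To make the determinant factor concrete, the sketch below works through a hypothetical bivariate example that is not in the original text: $X_{1}, X_{2}$ are independent $Exp (1)$ variables and $𝐠 (x_{1}, x_{2}) = (x_{1} + x_{2}, x_{2})$, so that $x_{1} = y_{1} - y_{2}$ and $x_{2} = y_{2}$. Integrating the resulting joint pdf over $y_{2}$ should recover the $Gamma (2, 1)$ pdf $y_{1} e^{- y_{1}}$ for $Y_{1} = X_{1} + X_{2}$. All names are assumptions made for this illustration.

```python
import math

def pdf_x(x1, x2):
    """Joint pdf of independent X1, X2 ~ Exp(1)."""
    return math.exp(-x1 - x2) if x1 > 0 and x2 > 0 else 0.0

def g_inv(y1, y2):
    """Inverse of g(x1, x2) = (x1 + x2, x2)."""
    return y1 - y2, y2

# Matrix of partial derivatives of (x1, x2) with respect to (y1, y2); constant here.
JACOBIAN = ((1.0, -1.0),
            (0.0, 1.0))

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def pdf_y(y1, y2):
    """f_Y(y) = f_X(g^{-1}(y)) |det dx/dy|."""
    return pdf_x(*g_inv(y1, y2)) * abs(det2(JACOBIAN))

def marginal_y1(y1, steps=20000):
    """Midpoint-rule integral of f_Y(y1, y2) over y2 in (0, y1)."""
    h = y1 / steps
    return sum(pdf_y(y1, (k + 0.5) * h) for k in range(steps)) * h

y1 = 1.5
print(marginal_y1(y1), y1 * math.exp(-y1))  # second value: Gamma(2, 1) pdf at y1
```

Outside the region $0 < y_{2} < y_{1}$ the inverse image leaves $supp (𝐗)$, so the joint pdf of $𝐘$ is zero there.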

Moment generating function

Template:Colored definition Template:Colored remark Template:Colored proposition

Proof.

Since

$M_{X} (t) = 𝔼 [e^{t X}] = 𝔼 [1 + t X + \frac{t^{2} X^{2}}{2!} + \dots] \overset{linearity}{=} 1 + t 𝔼 [X] + \frac{t^{2}}{2!} 𝔼 [X^{2}] + \dots,$

we have, for each positive integer $n$ ,

$\frac{d^{n}}{d t^{n}} M_{X} (t) |_{t = 0} = \frac{d^{n}}{d t^{n}} (1 + t 𝔼 [X] + \frac{t^{2}}{2!} 𝔼 [X^{2}] + \dots) |_{t = 0} = 𝔼 [X] \frac{d^{n}}{d t^{n}} t |_{t = 0} + \frac{𝔼 [X^{2}]}{2!} \frac{d^{n}}{d t^{n}} t^{2} |_{t = 0} + \dots .$

The result follows from simplifying the above expression by $\frac{d^{n}}{d t^{n}} t^{m} |_{t = 0} = 𝟏 \{m = n\} n!$ : only the term with $m = n$ survives, and it equals $\frac{𝔼 [X^{n}]}{n!} \cdot n! = 𝔼 [X^{n}]$ .

$◻$
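The relation between derivatives of the mgf at $t = 0$ and moments can be checked numerically. The sketch below assumes $X \sim Exp (λ)$ with $λ = 2$, uses the mgf $\frac{λ}{λ - t}$ derived later in this chapter, and approximates the first two derivatives at zero by central differences. The step size and function names are illustrative assumptions.

```python
LAM = 2.0  # rate of X ~ Exp(LAM); an arbitrary choice for illustration

def mgf(t):
    """mgf of Exp(LAM), valid for t < LAM."""
    return LAM / (LAM - t)

def first_moment(h=1e-4):
    """Central-difference approximation of M'(0)."""
    return (mgf(h) - mgf(-h)) / (2 * h)

def second_moment(h=1e-4):
    """Central-difference approximation of M''(0)."""
    return (mgf(h) - 2 * mgf(0.0) + mgf(-h)) / (h * h)

print(first_moment(), 1 / LAM)         # E[X] = 1/LAM
print(second_moment(), 2 / LAM ** 2)   # E[X^2] = 2/LAM^2
```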

Template:Colored proposition

Proof. $M_{X Y} (t) = 𝔼 [e^{t X Y}] \overset{lote}{=} 𝔼_{X} [𝔼_{Y} [e^{t X Y} | X]] = 𝔼_{X} [M_{Y} (t X)] .$ Similarly, $M_{X Y} (t) = 𝔼 [e^{t X Y}] \overset{lote}{=} 𝔼_{Y} [𝔼_{X} [e^{t X Y} | Y]] = 𝔼_{Y} [M_{X} (t Y)] .$

lote: law of total expectation

$◻$

Template:Colored remark

Joint moment generating function

In the following, we will use $𝐗$ to denote $(X_{1}, \dots, X_{n})^{T}$ . Template:Colored definition Template:Colored remark Template:Colored proposition

Proof. 'only if' part: Assume $X_{1}, \dots, X_{n}$ are independent. Then, $M_{𝐗} (𝐭) = 𝔼 [e^{𝐭 \cdot 𝐗}] = 𝔼 [e^{t_{1} X_{1}} \dots e^{t_{n} X_{n}}] \overset{independence}{=} 𝔼 [e^{t_{1} X_{1}}] \dots 𝔼 [e^{t_{n} X_{n}}] = M_{X_{1}} (t_{1}) \dots M_{X_{n}} (t_{n}) .$ Proof for 'if' part is quite complicated, and thus is omitted.

$◻$

Analogously, we have Template:Colored em mgf. Template:Colored definition Template:Colored proposition

Proof. $M_{𝐚 \cdot 𝐗 + b} (t) = 𝔼 [e^{t 𝐚 \cdot 𝐗 + b t}] = e^{b t} 𝔼 [e^{t 𝐚 \cdot 𝐗}] = e^{b t} M_{𝐗} (t 𝐚) = e^{b t} M_{𝐗} (t a_{1}, \dots, t a_{n}) .$

$◻$

Template:Colored remark

Moment generating function of some important distributions

Template:Colored proposition

Proof. $M_{X} (t) = \sum_{k = 0}^{n} e^{t k} \underbrace{\binom{n}{k} p^{k} (1 - p)^{n - k}}_{\text{pmf of } Binom (n, p)} = \sum_{k = 0}^{n} \binom{n}{k} (p e^{t})^{k} (1 - p)^{n - k} = (p e^{t} + 1 - p)^{n}$ by the binomial theorem.

$◻$
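As a sanity check on the closed form $(p e^{t} + 1 - p)^{n}$, the sketch below evaluates the defining sum directly and compares it with the closed form. The parameter values and function names are arbitrary assumptions for illustration.

```python
import math

def binom_mgf_sum(t, n, p):
    """E[e^{tX}] for X ~ Binom(n, p), summed term by term over the pmf."""
    return sum(math.exp(t * k) * math.comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(n + 1))

def binom_mgf_closed(t, n, p):
    """Closed form (p e^t + 1 - p)^n."""
    return (p * math.exp(t) + 1 - p) ** n

for t in (-0.5, 0.0, 0.3):
    print(t, binom_mgf_sum(t, 10, 0.25), binom_mgf_closed(t, 10, 0.25))
```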

Template:Colored proposition

Proof. $M_{X} (t) \overset{def}{=} 𝔼 [e^{t X}] \overset{LOTUS}{=} \sum_{k = 0}^{\infty} e^{t k} \cdot \underbrace{\frac{e^{- λ} λ^{k}}{k!}}_{\text{pmf of } Pois (λ)} = e^{λ (e^{t} - 1)} \overbrace{\sum_{k = 0}^{\infty} \underbrace{\frac{e^{- λ e^{t}} (λ e^{t})^{k}}{k!}}_{\text{pmf of } Pois (λ e^{t})}}^{= 1} = e^{λ (e^{t} - 1)} .$

$◻$
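The Poisson mgf can be checked in the same spirit by truncating the infinite series. The sketch below is a minimal check under assumed parameter values; the truncation length of 200 terms is an arbitrary choice that is ample for the small $λ$ and $t$ used here.

```python
import math

def pois_mgf_series(t, lam, terms=200):
    """Truncated series for E[e^{tX}], X ~ Pois(lam)."""
    total = 0.0
    term = math.exp(-lam)              # e^{-lam} (lam e^t)^k / k! at k = 0
    for k in range(terms):
        total += term
        term *= lam * math.exp(t) / (k + 1)   # move from index k to k + 1
    return total

def pois_mgf_closed(t, lam):
    """Closed form exp(lam (e^t - 1))."""
    return math.exp(lam * (math.exp(t) - 1))

for t in (-1.0, 0.0, 0.5):
    print(t, pois_mgf_series(t, 3.0), pois_mgf_closed(t, 3.0))
```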

Template:Colored proposition

Proof.

$M_{X} (t) = 𝔼 [e^{t X}] = λ \int_{0}^{\infty} e^{t x} e^{- λ x} d x = λ \int_{0}^{\infty} e^{- (λ - t) x} d x = \frac{λ}{λ - t} \overbrace{\int_{0}^{\infty} \underbrace{(λ - t) e^{- (λ - t) x}}_{\text{pdf of } Exp (λ - t)} d x}^{= 1}, \quad \underbrace{λ - t > 0}_{\text{ensuring valid rate parameter}} \Leftrightarrow t < λ .$
The result follows.

$◻$
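The restriction $t < λ$ matters in practice: the defining integral only converges in that range. The sketch below approximates the integral with a midpoint rule on a truncated interval and compares it with $\frac{λ}{λ - t}$. The truncation point, step count, and function names are assumptions made for this illustration.

```python
import math

def exp_mgf_numeric(t, lam, steps=200_000):
    """Midpoint-rule approximation of E[e^{tX}] for X ~ Exp(lam), t < lam."""
    if t >= lam:
        raise ValueError("the integral diverges unless t < lam")
    upper = 60.0 / (lam - t)   # truncate where the integrand has decayed by e^{-60}
    h = upper / steps
    return sum(lam * math.exp(-(lam - t) * (k + 0.5) * h)
               for k in range(steps)) * h

def exp_mgf_closed(t, lam):
    """Closed form lam / (lam - t), valid for t < lam."""
    return lam / (lam - t)

lam = 2.0
for t in (-1.0, 0.0, 0.5, 1.5):
    print(t, exp_mgf_numeric(t, lam), exp_mgf_closed(t, lam))
```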

Template:Colored proposition

Proof.

We use a proof technique similar to the one used for the mgf of the exponential distribution.

$M_{X} (t) = 𝔼 [e^{t X}] = \frac{λ^{α}}{Γ (α)} \int_{0}^{\infty} e^{t x} x^{α - 1} e^{- λ x} d x = \frac{λ^{α}}{Γ (α)} \int_{0}^{\infty} e^{- (λ - t) x} x^{α - 1} d x = \frac{λ^{α}}{(λ - t)^{α}} \overbrace{\int_{0}^{\infty} \underbrace{\frac{(λ - t)^{α}}{Γ (α)} e^{- (λ - t) x} x^{α - 1}}_{\text{pdf of } Gamma (α, λ - t)} d x}^{= 1}, \quad \underbrace{λ - t > 0}_{\text{ensuring valid rate parameter}} \Leftrightarrow t < λ .$

The result follows.

$◻$

Template:Colored proposition

Proof.

Let $Z = \frac{X - μ}{σ} \sim 𝒩 (0, 1)$ . Then, $X = σ Z + μ$ .
First, consider the mgf of $Z$ :

$M_{Z} (t) \overset{def}{=} 𝔼 [e^{t Z}] = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} \underbrace{e^{t x} e^{- x^{2} / 2}}_{= e^{- (x^{2} - 2 t x) / 2}} d x = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} \exp ( \overbrace{- (x^{2} - 2 t x + t^{2})}^{= - (x - t)^{2}} / 2 + t^{2} / 2 ) d x = e^{t^{2} / 2} \overbrace{\int_{- \infty}^{\infty} \underbrace{\frac{1}{\sqrt{2 π}} \cdot e^{- (x - t)^{2} / 2}}_{\text{pdf of } 𝒩 (t, 1)} d x}^{= 1} = e^{t^{2} / 2} .$

It follows that the mgf of $X$ is

$M_{X} (t) = M_{σ Z + μ} (t) = e^{μ t} M_{Z} (σ t) = e^{μ t} e^{σ^{2} t^{2} / 2} .$

The result follows.

$◻$

Distribution of linear transformation of random variables

We will prove some propositions about distributions of linear transformations of random variables using mgf. Some of them are mentioned in previous chapters. As we will see, proving these propositions using mgf is quite simple. Template:Colored proposition

Proof.

The mgf of $a X + b$ is

$M_{a X + b} (t) = e^{b t} M_{X} (a t) = e^{b t} (\exp (a μ t + (a σ)^{2} t^{2} / 2)) = \exp ((a μ + b) t + a^{2} σ^{2} t^{2} / 2),$

which is the mgf of $𝒩 (a μ + b, a^{2} σ^{2})$ , and the result follows since the mgf identifies a distribution uniquely.

$◻$

Sum of independent random variables

Template:Colored proposition

Proof.

The mgf of $X_{1} + \dots + X_{m}$ is

$M_{X_{1} + \dots + X_{m}} (t) = M_{X_{1}} (t) \dots M_{X_{m}} (t) = (p e^{t} + 1 - p)^{n_{1}} \dots (p e^{t} + 1 - p)^{n_{m}} = (p e^{t} + 1 - p)^{n_{1} + \dots + n_{m}},$

which is the mgf of $Binom (n_{1} + \dots + n_{m}, p)$ , as desired.

$◻$
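The same conclusion can be confirmed without mgf's by convolving pmf's directly. The sketch below assumes two independent binomial variables with a common $p$ (the sizes 4 and 7 and $p = 0.3$ are arbitrary choices) and compares the convolution with the pmf of $Binom (4 + 7, p)$. The helper names are assumptions for this example.

```python
import math

def binom_pmf(k, n, p):
    """pmf of Binom(n, p) at k, zero outside 0..n."""
    return math.comb(n, k) * p**k * (1 - p)**(n - k) if 0 <= k <= n else 0.0

def convolve(pmf_a, pmf_b):
    """pmf of the sum of two independent variables; lists are indexed by value."""
    out = [0.0] * (len(pmf_a) + len(pmf_b) - 1)
    for i, a in enumerate(pmf_a):
        for j, b in enumerate(pmf_b):
            out[i + j] += a * b
    return out

n1, n2, p = 4, 7, 0.3
pmf_sum = convolve([binom_pmf(k, n1, p) for k in range(n1 + 1)],
                   [binom_pmf(k, n2, p) for k in range(n2 + 1)])
pmf_direct = [binom_pmf(k, n1 + n2, p) for k in range(n1 + n2 + 1)]

for k, (a, b) in enumerate(zip(pmf_sum, pmf_direct)):
    print(k, a, b)
```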

Template:Colored proposition

Proof.

The mgf of $X_{1} + \dots + X_{n}$ is

$M_{X_{1} + \dots + X_{n}} (t) = M_{X_{1}} (t) \dots M_{X_{n}} (t) = e^{λ_{1} (e^{t} - 1)} \dots e^{λ_{n} (e^{t} - 1)} = e^{(λ_{1} + \dots + λ_{n}) (e^{t} - 1)},$

which is the mgf of $Pois (λ_{1} + \dots + λ_{n})$ , as desired.

$◻$

Template:Colored proposition

Proof.

The mgf of $X_{1} + \dots + X_{n}$ is

$M_{X_{1} + \dots + X_{n}} (t) = M_{X_{1}} (t) \dots M_{X_{n}} (t) = {(\frac{λ}{λ - t})}^{n},$

which is the mgf of $Gamma (n, λ)$ , as desired.

$◻$

Template:Colored proposition

Proof.

The mgf of $X_{1} + \dots + X_{n}$ is

$M_{X_{1} + \dots + X_{n}} (t) = M_{X_{1}} (t) \dots M_{X_{n}} (t) = {(\frac{λ}{λ - t})}^{α_{1}} \dots {(\frac{λ}{λ - t})}^{α_{n}} = {(\frac{λ}{λ - t})}^{α_{1} + \dots + α_{n}},$

which is the mgf of $Gamma (α_{1} + \dots + α_{n}, λ)$ , as desired.

$◻$

Template:Colored proposition

Proof.

The mgf of $X_{1} + \dots + X_{n}$ (in which they are independent) is

$M_{X_{1} + \dots + X_{n}} (t) = M_{X_{1}} (t) \dots M_{X_{n}} (t) = \exp (μ_{1} t + σ_{1}^{2} t^{2} / 2) \dots \exp (μ_{n} t + σ_{n}^{2} t^{2} / 2) = \exp ((μ_{1} + \dots + μ_{n}) t + (σ_{1}^{2} + \dots + σ_{n}^{2}) t^{2} / 2),$

which is the mgf of $𝒩 (μ_{1} + \dots + μ_{n}, σ_{1}^{2} + \dots + σ_{n}^{2})$ , as desired.

$◻$

Central limit theorem

We will provide a proof of the central limit theorem (CLT) using mgf here. Template:Colored theorem

Proof.

Define $T_{n} = \frac{\sqrt{n} ({\overline{X}}_{n} - μ)}{σ}$ . Then, we have

$T_{n} = \frac{\sqrt{n} ((X_{1} + \dots + X_{n}) / n - μ)}{σ} = \frac{X_{1} + \dots + X_{n}}{σ \sqrt{n}} - \frac{\sqrt{n} μ}{σ},$

which is in the form of $𝐚 \cdot 𝐗 + b$ , where $𝐚 = {(\frac{1}{σ \sqrt{n}}, \dots, \frac{1}{σ \sqrt{n}})}^{T}$ and $b = - \frac{\sqrt{n} μ}{σ}$ .
Therefore,

$\begin{aligned}
M_{T_{n}} (t) &= e^{- \sqrt{n} μ t / σ} M_{X_{1}} (\frac{t}{σ \sqrt{n}}) \dots M_{X_{n}} (\frac{t}{σ \sqrt{n}}) && \text{since } X_{1}, \dots, X_{n} \text{ are independent} \\
&= e^{- \sqrt{n} μ t / σ} {(M_{X_{1}} (\frac{t}{σ \sqrt{n}}))}^{n} && \text{since } X_{1}, \dots, X_{n} \text{ are identically distributed, and hence have the same mgf} \\
\Rightarrow \ln M_{T_{n}} (t) &= - \frac{\sqrt{n} μ t}{σ} + n \ln 𝔼 [e^{t X_{1} / (σ \sqrt{n})}] \\
&= - \frac{\sqrt{n} μ t}{σ} + n \ln 𝔼 [1 + \frac{t}{σ \sqrt{n}} X_{1} + \frac{1}{2!} \frac{t^{2}}{σ^{2} n} X_{1}^{2} + \dots] && \text{since } e^{x} = 1 + x + \frac{x^{2}}{2!} + \dots \\
&= - \frac{\sqrt{n} μ t}{σ} + n \ln (1 + \frac{t}{σ \sqrt{n}} 𝔼 [X_{1}] + \frac{1}{2} \frac{t^{2}}{σ^{2} n} \underbrace{𝔼 [X_{1}^{2}]}_{Var (X_{1}) + (𝔼 [X_{1}])^{2}} + \text{terms of order smaller than } n^{- 1}) \\
&= - \frac{\sqrt{n} μ t}{σ} + n \ln (1 + \frac{t μ}{σ \sqrt{n}} + \frac{t^{2} (σ^{2} + μ^{2})}{2 σ^{2} n} + \text{terms of order smaller than } n^{- 1}) \\
&= - \frac{\sqrt{n} μ t}{σ} + n [\frac{t μ}{σ \sqrt{n}} + \frac{t^{2} (σ^{2} + μ^{2})}{2 σ^{2} n} - \frac{1}{2} {(\frac{t μ}{σ \sqrt{n}})}^{2} + \text{terms of order smaller than } n^{- 1}] && \text{since } \ln (1 + x) = x - \frac{x^{2}}{2} + \dots \\
&= - \frac{\sqrt{n} μ t}{σ} + \frac{\sqrt{n} μ t}{σ} + \frac{t^{2} (σ^{2} + μ^{2})}{2 σ^{2}} - \frac{μ^{2} t^{2}}{2 σ^{2}} + \text{terms of order smaller than } n^{0} \\
&= \frac{t^{2}}{2} + \frac{μ^{2} t^{2}}{2 σ^{2}} - \frac{μ^{2} t^{2}}{2 σ^{2}} + \text{terms of order smaller than } n^{0} \\
&= \frac{t^{2}}{2} + \underbrace{\text{terms of order smaller than } n^{0}}_{\to 0 \text{ as } n \to \infty} \\
\Rightarrow \lim_{n \to \infty} M_{T_{n}} (t) &= \underbrace{e^{t^{2} / 2}}_{\text{mgf of } 𝒩 (0, 1)},
\end{aligned}$ and the result follows from the property that the mgf identifies a distribution uniquely.

$◻$
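The limit in the proof can be observed numerically for a specific distribution. The sketch below assumes $X_{i} \sim Exp (1)$, so that $μ = σ = 1$, and evaluates $\ln M_{T_{n}} (t) = - \sqrt{n} μ t / σ + n \ln M_{X_{1}} (t / (σ \sqrt{n}))$ for increasing $n$. The choice of distribution, the value of $t$, and the function names are illustrative assumptions.

```python
import math

LAM = 1.0            # X_i ~ Exp(LAM); an arbitrary choice for illustration
MU = 1.0 / LAM       # mean of Exp(LAM)
SIGMA = 1.0 / LAM    # standard deviation of Exp(LAM)

def mgf_x(t):
    """mgf of Exp(LAM), valid for t < LAM."""
    if t >= LAM:
        raise ValueError("mgf of Exp(LAM) is only finite for t < LAM")
    return LAM / (LAM - t)

def log_mgf_t_n(t, n):
    """ln M_{T_n}(t) = -sqrt(n) mu t / sigma + n ln M_X(t / (sigma sqrt(n)))."""
    root_n = math.sqrt(n)
    return -root_n * MU * t / SIGMA + n * math.log(mgf_x(t / (SIGMA * root_n)))

t = 0.8
for n in (10, 100, 10_000, 1_000_000):
    print(n, log_mgf_t_n(t, n), t * t / 2)
```

The printed values should approach $t^{2} / 2$ as $n$ grows, with the gap shrinking roughly like $n^{- 1 / 2}$ for this skewed distribution.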

Template:Colored remark A special case of using the CLT as an approximation is using the normal distribution to approximate a discrete distribution. To improve accuracy, we should ideally apply a continuity correction, as explained in the following. Template:Colored proposition Template:Colored remark Illustration of continuity correction:

| 
|              /
|             /
|            /
|           /|
|          /#|
|         *##|
|        /|##|
|       /#|##|   
|      /##|##|   
|     /|##|##|   
|    / |##|##|   
|   /  |##|##|
|  /   |##|##|
| /    |##|##|
*------*--*--*---------------------
    i-1/2 i i+1/2

| 
|              /
|             /
|            /
|           / 
|          /  
|         *   
|        /|   
|       /#|      
|      /##|      
|     /###|      
|    /####|      
|   /#####|   
|  /|#####|   
| / |#####|   
*---*-----*------------------------
   i-1    i      

| 
|              /|
|             /#|
|            /##|
|           /###|
|          /####|
|         *#####|
|        /|#####|
|       / |#####|
|      /  |#####|
|     /   |#####|
|    /    |#####|
|   /     |#####| 
|  /      |#####|
| /       |#####|
*---------*-----*------------------
          i     i+1
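The effect of the correction can be seen in a small numerical comparison. The sketch below assumes the usual form of the correction for a cdf, approximating $ℙ (X \leq i)$ by the normal cdf evaluated at $i + 1 / 2$ rather than at $i$, with $X \sim Binom (30, 0.4)$ and $i = 10$ as arbitrary illustrative values. The function names are assumptions for this example.

```python
import math

def binom_cdf(i, n, p):
    """Exact P(X <= i) for X ~ Binom(n, p)."""
    return sum(math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(i + 1))

def normal_cdf(x, mean, sd):
    """cdf of N(mean, sd^2), written in terms of the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

n, p, i = 30, 0.4, 10
mean, sd = n * p, math.sqrt(n * p * (1 - p))

exact = binom_cdf(i, n, p)
plain = normal_cdf(i, mean, sd)            # no continuity correction
corrected = normal_cdf(i + 0.5, mean, sd)  # with continuity correction

print("exact    ", exact)
print("plain    ", plain, "error", abs(plain - exact))
print("corrected", corrected, "error", abs(corrected - exact))
```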


[1] or equivalently, a transformation between the supports of $𝐗$ and $𝐘$
