Probability/Conditional Distributions
Motivation
Suppose there is an earthquake. Let be the number of casualties and be the Richter scale of the earthquake.

(a) Without given anything, what is the distribution of ?
(b) Given that , what is the distribution of ?
(c) Given that , what is the distribution of ?
Template:Colored remark Are your answers to (a),(b),(c) different?
In (b) and (c), we have the Template:Colored em distribution of given , and the Template:Colored em distribution of given respectively.
In general, we have Template:Colored em of given (Template:Colored em observing the value of ), or given (Template:Colored em observing the value of ).
Conditional distributions
Recall the definition of Template:Colored em: in which are events, with . Applying this definition to Template:Colored em , we have where is the joint pmf of and , and is the marginal pmf of . It is natural to call such conditional probability as Template:Colored em, right? We will denote such conditional probability as . Then, this is basically the definition of Template:Colored em pmf: Template:Colored em pmf of given is the conditional probability . Naturally, we will expect that Template:Colored em is defined similarly. This is indeed the case: Template:Colored definition Template:Colored remark To understand the definition more intuitively for the continuous case, consider the following diagram.
Top view:
|
|
*---------------*
| |
| |
fixed y *===============* <--- corresponding interval
| |
| |
*---------------*
|
*---------------- x
Side view:
*
/ \
*\ * /
/|#\ \
| / |##\ / *---------*
| * |###\ /\
| |\ |##/#\----------/--\
| | \|#/###*--------* /
| | \/############/#\ /
| |y *\===========/===*
| | / *---------* /
| |/ \ /
| *----------------*
|/
*------------------------- x
Front view:
|
|
|
*\
|#\
|##\
|###\
|####\ <------ Area: f_Y(y)
|#####*--------*
|###############\
*================*-------------- x
*---*
|###| : corresponding cross section from joint pdf
*---*
We can see that when we are conditioning , we take a "slice" out from the region under joint pdf, and the area of the "whole slice" is the area between the Template:Colored em joint pdf with fixed and variable , and the -axis. Since the area is given by , while according to the probability axioms, the area should equal 1. Hence, we scale down the area of "slice" by a factor of , by dividing the univariate joint pdf by . After that, the curve at the top of scaled "slice" is the graph of the conditional pdf .
Now, we have discussed the case where both random variables are discrete or continuous. How about the case where one of them is discrete and another one is continuous? In this case, there is no "joint probability function" of these two random variables, since one is discrete and another is continuous! But, we can still define the conditional probability function in some other ways. To motivate the following definition, let be the conditional probability . Then, differentiating with respect to should yield the conditional pdf . So, we have Thus, it is natural to have the following definition. Template:Colored definition Now, how about the case where is discrete and is continuous? In this case, let us use the above definition for the motivation of definition. However, we should interchange and so that the assumptions are still satisfied. Then, we get In this case, is discrete, so it is natural to define the conditional pmf of given as in the expression. Now, after rearranging the terms, we get Thus, we have the following definition. Template:Colored definition Based on the definitions of conditional probability functions, it is natural to define the Template:Colored em cdf as follows. Template:Colored definition Template:Colored remark Graphical illustration of the definition (continuous random variables):
Top view:
|
|
*---------------*
| |
| |
fixed y *=========@=====* <--- corresponding interval
| x |
| |
*---------------*
|
*----------------
Side view:
*
/ \
*\ * /
/|#\ \
| / |##\ / *---------*
| * |###\ /\
| |\ |##/#\----------/--\
| | \|#/###*--------* /
| | \/######### / \ /
| |y *\========@==/===*
| | / *-------x-* /
| |/ \ /
| *----------------*
|/
*------------------------- x
Front view:
|
|
|
*\
|#\
|##\
|###\
|####\ <------------- Area: f_Y(y)
|#####*--------*
|########### \
*==========@=====*--------------
x
*---*
|###| : the desired region from the cross section from joint pdf, whose area is the probability from the cdf
*---*
If for some event , we have some special notations for simplicity:
- the conditional probability function of given becomes
- the conditional cdf of given becomes
Proof. Recall the definition of independence between two random variables:
- are independent if
- for each .
Since for each , we have the desired result.
We can extend the definition of conditional probability function and cdf to groups of random variables, for joint cdf's and joint probability functions, as follows: Template:Colored definition Then, we also have a similar proposition for determining independence of two random vectors. Template:Colored proposition
Proof. The definition of independence between two random vectors is
- are independent if
- for each .
Since for each , we have the desired result.
Conditional distributions of bivariate normal distribution
Recall from the [[../Important Distributions]] chapter that the joint pdf of is , and and in this case. in which and are positive. Template:Colored proposition
Proof.
- First, the conditional pdf
- Then, we can see that ,
- and by symmetry (interchanging and , and also interchanging and ), .
Conditional version of concepts
We can obtain Template:Colored em version of concepts previously established for 'unconditional' distributions analogously for Template:Colored em distributions by substituting 'unconditional' cdf, pdf or pmf, i.e. or , by their Template:Colored em counterparts, i.e. or .
Conditional independence
Template:Colored definition Template:Colored remark Template:Colored example Template:Colored example
Conditional expectation
Template:Colored definition Template:Colored remark Similarly, we have conditional version of law of the unconscious statistician. Template:Colored proposition Template:Colored proposition
Proof.
Template:Colored remark Template:Colored example The properties of still hold for conditional expectations , with Template:Colored em 'unconditional' expectation replaced by Template:Colored em expectation and some suitable modifications, as follows: Template:Colored proposition
Proof. The proof is similar to the one for 'unconditional' expectations.
Template:Colored remark The following theorem about conditional expectation is quite important. Template:Colored theorem
Proof.
Template:Colored remark Template:Colored corollary
Proof.
- First,
- Then, using law of total expectation,
Template:Colored remark Template:Colored corollary
Proof. Define if occurs, in which is a positive integer. Then,
Template:Colored remark Template:Colored example Template:Colored corollary
Proof. By the formula of expectation computed by weighted average of conditional expectations, and the result follows if .
Template:Colored remark After defining Template:Colored em expectation, we can also have Template:Colored em variance, covariance and correlation coefficient, since variance, covariance, and correlation coefficient are built upon expectation.
Conditional expectations of bivariate normal distribution
Proof.
- The result follows from the proposition about conditional distributions of bivariate normal distribution readily.
Conditional variance
Template:Colored definition Similarly, we have properties of Template:Colored em variance which are similar to that of variance. Template:Colored proposition
Proof. The proof is similar to the one for properties of variance.
Beside law of total expectation, we also have law of total variance, as follows: Template:Colored proposition
Proof.
Conditional variances of bivariate normal distribution
Proof.
- The result follows from he proposition about conditional distributions of bivariate normal distribution readily.
Conditional covariance
Template:Colored definition Template:Colored proposition
Conditional correlation coefficient
Template:Colored definition Template:Colored remark