Fundamental Actuarial Mathematics/Mortality Models
Learning objectives
The Candidate will understand key concepts concerning parametric and non-parametric mortality models for individual lives.
Learning outcomes
The Candidate will be able to:
- Understand parametric survival models, life tables, and the relationships between them.
- Given a parametric survival model, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable.
- Identify and apply standard actuarial notation for future lifetime distributions and moments, including select and ultimate functions.
- Given a life table, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable, using appropriate fractional age assumptions where necessary.
- Understand and apply select life tables.
- Identify common features of population mortality curves.
Survival distributions
In the model discussed in this chapter, it describes the length of survival (or time until death) of an individual. Thus, the Template:Colored em will be the basic building block.
Age-at-death random variable
In this section, we will discuss a special case for the time-until-death random variable, in which the time until death is applicable to newborn (i.e. people aged zero). We denote this kind of random variable by . We can observe that also represents the age at death, since the age is counted starting from the beginning of life.
Since time-until-death random variable is describing time, it is a continuous random variable. Also, time is nonnegative, so the support (or "domain") of time-until-death random variable is .
To describe the time until death for newborn, we need to determine the distribution of completely. There are several ways to do this.
- cumulative distribution function (cdf):
- probability density function (pdf): if is differentiable.
- Template:Colored em
- Template:Colored em
You should have learnt about cdf and pdf when learning about probability, but may not have learnt about survival function and force of mortality. Thus, we will discuss them here.
Survival function
As suggested by the name "Template:Colored em function", we may guess that this function is somewhat related to survival. This is actually true. Take the time-until-death random variable as an example, when the newborn survives for, say, units of time, what is its probability? It is (or , but since is continuous, it does not matter). This probability corresponding to the input is actually the survival function, which is defined below: Template:Colored definition Template:Colored remark Template:Colored example
Force of mortality
In financial mathematics, you should have learnt about Template:Colored em, which can be interpreted as the Template:Colored em, and it is given by in which the notation has its usual meaning in financial mathematics. Why do we call it force of Template:Colored em? This is because the interest refers to an increase (or positive change) in the amount.
We may guess that the force of Template:Colored em is defined similarly, in the sense that it can also be interpreted as the relative rate of change of something. We know that the interest refers to change in amount, but what change does mortality refer to? Since mortality means the state of being susceptible to death, it refers to the Template:Colored em (or negative change) in survival rate, and the "higher" the mortality, the larger decrease in survival rate. Recall that the Template:Colored em is related to survival rate (probability of surviving for a certain time) in some sense. So, we can make use of survival function to define mortality.
However, there is a difference between force of Template:Colored em and force of Template:Colored em, namely for force of interest, interest refers to an Template:Colored em in amount, while for force of mortality, mortality refers to a Template:Colored em in survival rate. So the changes are in opposite direction, and thus if we define force of mortality in an exact analogous manner, its value will be negative (the relative rate of change will be negative). To make the force of mortality positive, we can define the Template:Colored em as follows: Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark Template:Colored exercise After that, we will introduce some propositions related to the Template:Colored em and Template:Colored em. Template:Colored proposition
Proof. as desired.
Proof. For simplicity of presentation, we will have some abuse of notations (the infinity is in limit sense) in the following, but the reasoning is still understandable.
Future lifetime of a life aged Template:Math
Now, we Template:Colored em our discussion from future lifetime of a life aged zero (a newborn) to a life aged (). For simplicity of presentation, we denote a life aged by . Template:Colored remark Similarly, we denote the future lifetime of by (recall that we denote the future lifetime of (newborn) by ). We Template:Colored em the distribution of mathematically (and quite naturally) as the Template:Colored em of , Template:Colored em .
To understand this, consider the following reasoning: Refer to the following timeline:
death
x T_x |
|---------|-------v
-------------------------
0 x t
|-----------------|
T_0
We can observe that if (or , but since is continuous, it does not matter). So, if , then .
On the other hand, if , we have the following timeline:
death
x |
|-------v-|
-------------------------
0 x t
|-------|
T_0
In this case, does not exist, since the person does not survive for years, and thus will never be age , so there is not , and therefore there is not , future lifetime of . This shows the necessity of the condition .
From this definition, we have , , etc.. This is quite important since it is the basis for the calculations of probabilities related to .
For the pdf, cdf and survival function of , we have similar notations as follows:
- : pdf of
- : cdf of
- : survival function of
In particular, we have some special actuarial notations for the cdf and survival function, as follows:
In actuarial notations, "" often refers to something related to Template:Colored em, while "" often refers to something related to Template:Colored em. In this context, this holds since refers to the probability for to Template:Colored em within time units, and refers to the probability for to Template:Colored em for time units.
For simplicity, if , we write as and as .
Using the relationship between and , we can develop some useful formulas for and , as follows: Template:Colored proposition
Proof. First, we have , in which since (), and so , and thus is a subset of .
It follows that .
We can also express the pdf of as follows: Template:Colored proposition
Proof. We have
Template:Colored remark Template:Colored example We have a special notation for the probability for to Template:Colored em ages and (), namely (we use "" here since this is related to death). Thus, we have by definition . We have the following proposition for another formula of . Template:Colored proposition
Proof.
Template:Colored remark Template:Colored example Template:Colored exercise
Curtate-future-lifetime of a life aged Template:Math
The Template:Colored em is just like the future lifetime in previous sections, except that it is Template:Colored em. Template:Colored definition Template:Colored remark Similarly, we would like to completely determine the distribution of , as in the case for . We can do this using cdf or probability mass function (pmf). Its pmf is given by the following proposition. Template:Colored proposition
Proof. The pmf of is
Proof. The cdf of is
Life tables
In a life table, the values of and other functions for different (integer) ages are tabulated. The values are assumed to be based on the survival distribution discussed in previous sections. In this section, we will discuss more functions appearing in a life table.
In previous sections, we have discussed time-until-death random variable for one person, and we will consider multiple people here. Suppose there are newborns. Let the indicator function Also let be the sum of all such indicator functions , i.e. . We can interpret as the number of survivors to age for the newborns.
We denote the Template:Colored em of by . Template:Colored proposition
Proof. Since (this is true for each life , since the survival distribution for the future lifetime of different lifes are assumed to be the same), equals .
As a corollary, ( is constant with respect to ). Also, .
Also, we can use to calculate probabilities like and , as follows:
,
and thus .
In a later section in which selection age is involved in the life table (select table), we will use these formulas to calculate these probabilities from such life table, to incorporate the effect of selection.
We have discussed about the number of Template:Colored em to age , and we will discuss the "opposite thing" in the following, namely the number of Template:Colored em to age (i.e. between age 0 and ), or in general, between age and .
We denote the Template:Colored em of such number of deaths by . Template:Colored proposition
Proof. We can define another indicator function for this context similarly (with value 1 if life dies between age and age and 0 otherwise). Then, the expected value of each indicator function equals the probability for one of the newborns to die between and , which is . With similar reasoning as in above, we have .
Template:Colored remark Apart from the life table functions and which are related to the expectation of Template:Colored em of survivors and deaths respectively, we will also discuss two more life table functions, that is related to the expectation of Template:Colored em. Template:Colored example Template:Colored remark Template:Colored exercise There are two types of expectation of life: one is discrete and another is continuous, and they are called curtate-expectation-of-life and complete-expectation-of-life respectively. Template:Colored definition Template:Colored definition Template:Colored proposition
Proof. We will use integration by parts. Now, it suffices to prove that and this is true since and as , so this limit either equals or 0. However, since the expected value exists (i.e. does not tend to infinity, or else the expectation of life does not make sense), this limit cannot equal , and so this limit is 0.
Template:Colored remark Template:Colored proposition
Proof. The previous proposition about uses integration by parts in the proof, and we can analogously use Template:Colored em (may be interpreted as a discrete analogue of integration by parts) in the proof. However, there is a simpler way to prove this proposition, where the summation is "split" appropriately: We can observe that this sum and the sum in the proposition represent the same thing, and thus the result follows. Template:Hide
Template:Colored remark Template:Colored exercise The following are recursion relations for and , which can be useful when we want to find the complete/curtate-expectation-of-life of given the expectation of a life with some other ages, say and .
We will state the recursion relations as a form of proposition, and then prove them formally. After the proof, we will try to give some intuitive explanations about the recursion relation for . Template:Colored proposition
Proof. In particular, we have .
An intuitive explanation of this recursion relation is as follows:
- for LHS, is the curtate-expectation-of-life of ;
- for RHS, is the curtate-expectation-of-life of , and we want to "transform" it to the expectation of . The first step is adding 1 to it, since this is the expectation with respect to , but we want the expectation from the perspective of , which is 1 year younger. But only this step is not enough, since "" assumes the life already lives for years, but for , the life is only assumed to live for years. Hence, we also need to multiply the probability for to live for one year, , to "get to" .
- Now, "the expectation of life from age onward" is done through . How about "the expectation of life from age to age "? Indeed, when the life dies within age and , . This means such "expectation of life" is zero.
Proof.
Template:Colored example Template:Colored exercise
Assumptions for fractional ages
Previously, we have discussed the continuous random variable and discrete random variable . A life table can specify the distribution of since the values of for different integer can be obtained from the life table. However, the life table is not enough to specify the distribution of , since we do not know the value of when is not an integer. Thus, in order to specify a distribution of using a life table, we need to make some assumptions about the fractional (non-integer) ages.
In actuarial science, three assumptions are widely used, namely Template:Colored em (UDD) (or linear interpolation), Template:Colored em (or exponential interpolation), and Template:Colored em (or Balducci) assumption (or harmonic interpolation). We will define them each using survival functions, as follows: Template:Colored definition Template:Colored definition Template:Colored remark Template:Colored definition Template:Colored remark Under UDD assumption, we have some "nice" and simple expressions for various probabilities related to mortality. We can obtain those expressions by substituting by the RHS of the equation mentioned in the assumption. Template:Colored example Template:Colored exercise
For the three assumption mentioned, there is a particularly "nice" and simple result for each of them, and we may use those "nice" results for the calculation in practice, rather than applying the definitions. The "nice" result for UDD assumption is mentioned in a previous example: when , .
The "nice" results for the other two assumptions are as follows:
Template:Colored theorem
Proof.
Proof.
An interesting result under the UDD assumption is related to the independence of two random variables.
To simplify the notations, from now on, we let and unless otherwise specified.
Define a continuous random variable by . That is, is the random variable representing the Template:Colored em lived in the year of death of . For example, if , then lives for half year in the year of death.
Then, and are independent under UDD assumption. This is because under UDD assumption. Also, we can observe that the cdf of is , which is the cdf of uniform distribution with support . This means follows the uniform distribution on under UDD assumption. Hence, and . This gives rise to results under UDD assumptions:
- .
- .
These two results give us an alternative way to calculate the mean and variance of where only Template:Colored em things are used in the calculation. However, we should be careful that these results hold Template:Colored em, so we cannot use these results without UDD assumption.
Laws of mortality
In this section, we will introduce some simple laws of mortality (i.e. some specified distributions for mortality). Some of these laws may be appropriate to model the human mortality for Template:Colored em ages, but it is commonly believed that none of these laws is appropriate to model the human mortality for Template:Colored em ages.
Indeed, if we want to model the human mortality using some probability distribution, we may need a Template:Colored em of distributions, since the mortality should be distributed in a different manner when human is in different ages, and thus different distributions should be used in different ages. To choose a suitable distribution for some ages, we may investigate the shape of corresponding empirical distribution based on the actual human mortality data, and pick a suitable distribution accordingly. For example, if the mortality increases exponentially for some ages, then we may select a distribution for which the force of mortality also increases exponentially.
In practice, to calculate the probabilities related to human mortality, we usually use life tables for the calculations. This is usually the case for insurance companies. Each insurance company has its own life table, based on the mortality data of, possibly its clients. Since such life table is constructed using the Template:Colored em human mortality data based on past experiences, the life table is usually deemed to be more accurate than a specific distribution.
Nevertheless, having a law for mortality allows us to simplify the calculation of the probabilities related to mortality.
Template:Colored definition Template:Colored remark Template:Colored exercise Template:Colored example Template:Colored definition Template:Colored definition Template:Colored remark Template:Colored definition
Select and ultimate table
When a person purchases a life insurance policy offered by an insurance company, he needs to give some personal information to the insurance company, e.g. some information about his health status. For the insurance company to decide whether it should sell the policy to the person, those information provided by that person is accessed through the process of Template:Colored em. For underwriting, the Template:Colored em check the information to see whether the risk in insuring that person is appropriate.
Without underwriting, it is likely that people will only purchase life insurance policy when they think they will die soon (e.g. they have a very serious disease), so that they will likely have early claims. In this case, the insurance company may need to pay a large amount of money and suffer a great loss in a short time, and then bankrupt. This shows the necessity of underwriting.
Basically, the "select" in the section title arises from the underwriting process, and when we say an individual is "selected" at age , he is underwritten at age (so the most recent information about the individual is known). Since there are some new information about the individual when he is underwritten (or selected), we will expect there is some update in his survival distribution, and therefore his probabilities related to mortality will change as well. Because of this, we need to have some changes in the actuarial notations, depending on the selection age.
In such actuarial notations, we usually add square brackets around the age at selection, and the numbers in the subscript change accordingly. For example, becomes if the selection age is 25, if the selection age is 12.
Since when a person is underwritten long time ago, he may have poorer health condition (e.g. getting older having some new diseases) from the time at which he is underwritten to now, we will intuitively expect that the longer time passed from the time at which a person aged is underwritten, the more likely that person will be die in the coming year. That is, [1].
The impact on the survival distribution from the selection age may decrease when the time passed from the selection is longer. Beyond a certain time period, say years, the ""'s at the same attained age (i.e. selection age plus the time passed from it to now) but different selection ages will be very close. In other words, (the condition on is to ensure that the selection age at LHS ) [2]. Such years is called the Template:Colored em. Because the ""'s mentioned above will be very close, all such ""'s will only be written as , without any square brackets (since the effect of selection is basically "gone", and so the square brackets are gone). For example, if the selection period is 2 years, then and will be both written as simply. However, we will not write and as , since these two ""'s are "quite different" because of the selection impact is still "quite large".
The following are some terms related to life table.
- An Template:Colored em is a life table in which the functions are only given for Template:Colored em.
- A Template:Colored em is a life table in which some function involves the age at selection.
- An Template:Colored em is usually appended to a select table as a last column to reflect the setting of select table. The combination of a select table and an ultimate table in such a way is called a Template:Colored em table.
For example, an excerpt of select-and-ultimate table with select period 2 years may look like:
| Age at selection, | |||
|---|---|---|---|
| 0 | |||
| 1 | |||
| 2 | |||
| ... | ... | ... | ... |
The last column is the ultimate table. We can observe that we do not need additional columns for etc., since we can already get such values in the "" value in a different row (with different ).
Given a select-and-ultimate table, we can do various calculations based on it. Template:Colored example Template:Nav Template:BookCat