Fundamental Actuarial Mathematics/Mortality Models

From testwiki
Revision as of 13:33, 8 December 2022 by imported>LeoChiukl (Curtate-future-lifetime of a life aged {{math|x}})
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Nav

Learning objectives

The Candidate will understand key concepts concerning parametric and non-parametric mortality models for individual lives.

Learning outcomes

The Candidate will be able to:

  1. Understand parametric survival models, life tables, and the relationships between them.
  2. Given a parametric survival model, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable.
  3. Identify and apply standard actuarial notation for future lifetime distributions and moments, including select and ultimate functions.
  4. Given a life table, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable, using appropriate fractional age assumptions where necessary.
  5. Understand and apply select life tables.
  6. Identify common features of population mortality curves.

Survival distributions

In the model discussed in this chapter, it describes the length of survival (or time until death) of an individual. Thus, the Template:Colored em will be the basic building block.

Age-at-death random variable

In this section, we will discuss a special case for the time-until-death random variable, in which the time until death is applicable to newborn (i.e. people aged zero). We denote this kind of random variable by T0. We can observe that T0 also represents the age at death, since the age is counted starting from the beginning of life.

Since time-until-death random variable is describing time, it is a continuous random variable. Also, time is nonnegative, so the support (or "domain") of time-until-death random variable is [0,).

To describe the time until death for newborn, we need to determine the distribution of T0 completely. There are several ways to do this.

You should have learnt about cdf and pdf when learning about probability, but may not have learnt about survival function and force of mortality. Thus, we will discuss them here.

Survival function

As suggested by the name "Template:Colored em function", we may guess that this function is somewhat related to survival. This is actually true. Take the time-until-death random variable T0 as an example, when the newborn survives for, say, t units of time, what is its probability? It is (T0>t) (or (T0t), but since T0 is continuous, it does not matter). This probability corresponding to the input t is actually the survival function, which is defined below: Template:Colored definition Template:Colored remark Template:Colored example

Force of mortality

In financial mathematics, you should have learnt about Template:Colored em, which can be interpreted as the Template:Colored em, and it is given by A(t)A(t) in which the notation has its usual meaning in financial mathematics. Why do we call it force of Template:Colored em? This is because the interest refers to an increase (or positive change) in the amount.

We may guess that the force of Template:Colored em is defined similarly, in the sense that it can also be interpreted as the relative rate of change of something. We know that the interest refers to change in amount, but what change does mortality refer to? Since mortality means the state of being susceptible to death, it refers to the Template:Colored em (or negative change) in survival rate, and the "higher" the mortality, the larger decrease in survival rate. Recall that the Template:Colored em is related to survival rate (probability of surviving for a certain time) in some sense. So, we can make use of survival function to define mortality.

However, there is a difference between force of Template:Colored em and force of Template:Colored em, namely for force of interest, interest refers to an Template:Colored em in amount, while for force of mortality, mortality refers to a Template:Colored em in survival rate. So the changes are in opposite direction, and thus if we define force of mortality in an exact analogous manner, its value will be negative (the relative rate of change will be negative). To make the force of mortality positive, we can define the Template:Colored em as follows: Template:Colored definition Template:Colored remark Template:Colored example Template:Colored remark Template:Colored exercise After that, we will introduce some propositions related to the Template:Colored em and Template:Colored em. Template:Colored proposition

Proof. exp(0tμxdx)=exp(0tS'0(x)S0(x)dx)=exp(0tS'0(x)S0(x)dx)=exp(0t1S0(x)d(S0(x)))=exp([ln|S0(x)0|]0t)=exp(ln(S0(t))ln(S0(0)=1))=exp(ln(S0(t))ln1=0)=exp(ln(S0(t)))=S0(t), as desired.

Template:Colored proposition

Proof. For simplicity of presentation, we will have some abuse of notations (the infinity is in limit sense) in the following, but the reasoning is still understandable. 0μtdt=[ln(S0(t))]0=lnS0()lnS0(0)=ln0=.

Template:Colored example

Future lifetime of a life aged Template:Math

Now, we Template:Colored em our discussion from future lifetime of a life aged zero (a newborn) to a life aged x (x0). For simplicity of presentation, we denote a life aged x by (x). Template:Colored remark Similarly, we denote the future lifetime of (x) by Tx (recall that we denote the future lifetime of (0) (newborn) by T0). We Template:Colored em the distribution of Tx mathematically (and quite naturally) as the Template:Colored em of T0x, Template:Colored em T0>x.

To understand this, consider the following reasoning: Refer to the following timeline:

                  death
    x       T_x   |
|---------|-------v
-------------------------
0         x                 t
|-----------------|
      T_0

We can observe that Tx=T0x if T0>x (or T0x, but since T0 is continuous, it does not matter). So, if T0>x, then Tx=T0x.

On the other hand, if T0<x, we have the following timeline:

      death       
    x   |       
|-------v-|
-------------------------
0         x                 t
|-------|
   T_0

In this case, Tx does not exist, since the person does not survive for x years, and thus will never be age x, so there is not (x), and therefore there is not Tx, future lifetime of (x). This shows the necessity of the condition T0>x.

From this definition, we have (Txt)=(T0xt|T0>x), (Tx>t)=(T0x>t|T0>x), etc.. This is quite important since it is the basis for the calculations of probabilities related to Tx.

For the pdf, cdf and survival function of Tx, we have similar notations as follows:

  • fx(t): pdf of Tx
  • Fx(t): cdf of Tx
  • Sx(t): survival function of Tx

In particular, we have some special actuarial notations for the cdf and survival function, as follows:

  • tqx=Fx(t)=(Txt)
  • tpx=Sx(t)=(Tx>t)=1tqx

In actuarial notations, "q" often refers to something related to Template:Colored em, while "p" often refers to something related to Template:Colored em. In this context, this holds since tqx refers to the probability for (x) to Template:Colored em within t time units, and tpx refers to the probability for (x) to Template:Colored em for t time units.

For simplicity, if t=1, we write 1qx as qx and 1px as px.

Using the relationship between Tx and T0, we can develop some useful formulas for tpx and tqx, as follows: Template:Colored proposition

Proof. First, we have tpx=(Tx>t)=(T0x>t|T0>x)=(T0>t+xT0>x)(T0>x)=(T0>t+x)(T0>x)=S0(t+x)S0(x), in which {T0>t+x}{T0>x}={T0>t+x} since t+x>x (t0), and so T0>t+xT0>x, and thus {T0>t+x} is a subset of {T0>x}.

It follows that tqx=1tpx=1S0(t+x)S0(x).

We can also express the pdf of Tx as follows: Template:Colored proposition

Proof. We have fx(t)=ddttqx=(1S0(t+x)S0(x))=S'0(t+x)S0(x)=S0(t+x)S0(x)S'0(t+x)S0(t+x)=tpxμx+t.

Template:Colored remark Template:Colored example We have a special notation for the probability for (x) to Template:Colored em ages x+t and x+t+u (x,t,u0), namely t|uqx (we use "q" here since this is related to death). Thus, we have by definition t|uqx=(t<Tx<t+u)). We have the following proposition for another formula of t|uqx. Template:Colored proposition

Proof. t|uqx=(t<Tx<t+u)=(Txt+u)(Txt)=t+uqxtqx=(1t+upx)(1tpx)=tpxt+upx=tpxS0(x+t+u)S0(x)=tpxS0(x+t+u)S0(x+t)(S0(x+t)S0(x))=tpxupx+t(tpx)=tpx(1upx+t)=tpx(uqx+t)

Template:Colored remark Template:Colored example Template:Colored exercise

Curtate-future-lifetime of a life aged Template:Math

The Template:Colored em is just like the future lifetime in previous sections, except that it is Template:Colored em. Template:Colored definition Template:Colored remark Similarly, we would like to completely determine the distribution of Kx, as in the case for Tx. We can do this using cdf or probability mass function (pmf). Its pmf is given by the following proposition. Template:Colored proposition

Proof. The pmf of Kx is (Kx=k)=(kTx<k+1)=(k<Tx<k+1)since Tx is continuous=k|qx by definition=kpx(qx+k)by proposition.

Template:Colored proposition

Proof. The cdf of Kx is (Kxk)=j=0kj|qx=0|qx+1|qx++k|qx=((x) dies between ages 0 and 1)+((x) dies between ages 1 and 2)++((x) dies between ages k and k+1)=((x) dies between ages 0 and k+1)=((x) dies within k+1 years)=k+1qx.

Template:Colored example

Life tables

In a life table, the values of qx and other functions for different (integer) ages x are tabulated. The values are assumed to be based on the survival distribution discussed in previous sections. In this section, we will discuss more functions appearing in a life table.

In previous sections, we have discussed time-until-death random variable for one person, and we will consider multiple people here. Suppose there are 0 newborns. Let the indicator function 𝟏j(x)={1,if life j survives to age x;0,otherwise. Also let (x) be the sum of all such indicator functions 𝟏j(x), i.e. (x)=j=10𝟏j(x). We can interpret (x) as the number of survivors to age x for the 0 newborns.

We denote the Template:Colored em of (x) by x. Template:Colored proposition

Proof. Since 𝔼[𝟏j(x)]=1(life j survives to age x)+0(life j does not survive to age x)=(life j survives to age x)=S0(x) (this is true for each life j, since the survival distribution for the future lifetime of different lifes are assumed to be the same), x equals 𝔼[(x)]=𝔼[j=10𝟏j(x)]=j=10𝔼[𝟏j(x)]=j=10S0(x)constant with respect to j=0S0(x).


As a corollary, 'x=(0S0(x))=0S'0(x) (0 is constant with respect to x). Also, 'xx=0S'0(x)0S0(x)=S'0(x)S0(x)=μx. Also, we can use x to calculate probabilities like tpx and tqx, as follows: tpx=S0(x+t)S0(x)=0S0(x+t)0S0(x)=x+tx, and thus tqx=1tpx=1x+tx. In a later section in which selection age is involved in the life table (select table), we will use these formulas to calculate these probabilities from such life table, to incorporate the effect of selection.

We have discussed about the number of Template:Colored em to age x, and we will discuss the "opposite thing" in the following, namely the number of Template:Colored em to age x (i.e. between age 0 and x), or in general, between age x and x+n.

We denote the Template:Colored em of such number of deaths by ndx. Template:Colored proposition

Proof. We can define another indicator function for this context similarly (with value 1 if life j dies between age x and age x+n and 0 otherwise). Then, the expected value of each indicator function equals the probability for one of the newborns to die between x and x+n, which is S0(x)S0(x+n). With similar reasoning as in above, we have ndx=0(S0(x)S0(x+n))=0S0(x)0S0(x+n)=xx+n.

Template:Colored remark Apart from the life table functions x and ndx which are related to the expectation of Template:Colored em of survivors and deaths respectively, we will also discuss two more life table functions, that is related to the expectation of Template:Colored em. Template:Colored example Template:Colored remark Template:Colored exercise There are two types of expectation of life: one is discrete and another is continuous, and they are called curtate-expectation-of-life and complete-expectation-of-life respectively. Template:Colored definition Template:Colored definition Template:Colored proposition

Proof. We will use integration by parts. ex=𝔼[Tx]=0tfx(t)dt=0td(Fx(t))=0td(tqx)=0td(1tpx)=0td(tpx)=[t(tpx)]00tpxdt=limt(ttpx)+0tpxdt Now, it suffices to prove that limt(ttpx)=0 and this is true since t and tpx=Sx(t)0 as t, so this limit either equals or 0. However, since the expected value exists (i.e. does not tend to infinity, or else the expectation of life does not make sense), this limit cannot equal , and so this limit is 0.

Template:Colored remark Template:Colored proposition

Proof. The previous proposition about ex uses integration by parts in the proof, and we can analogously use Template:Colored em (may be interpreted as a discrete analogue of integration by parts) in the proof. However, there is a simpler way to prove this proposition, where the summation is "split" appropriately: ex=𝔼[Kx]=k=0kkpxqx+k=k=0k(kpxk+1px)=k=1kkpxk=0kk+1px(kkpx=0 when k=0)=k=1kkpxk=1(k1)kpx(k=k+1)=k=1kkpxk=1kkpx=0+k=1kpx(the first two sums represent the same thing)=k=1kpx. We can observe that this sum and the sum in the proposition represent the same thing, and thus the result follows. Template:Hide

Template:Colored remark Template:Colored exercise The following are recursion relations for ex and ex, which can be useful when we want to find the complete/curtate-expectation-of-life of (x) given the expectation of a life with some other ages, say x+1 and x+2.

We will state the recursion relations as a form of proposition, and then prove them formally. After the proof, we will try to give some intuitive explanations about the recursion relation for ex. Template:Colored proposition

Proof. ex=k=1kpx=px+k=2kpx=px+k=2pxk1px+1=px+pxk=1kpx+1=px(1+ex+1). In particular, we have kpx=S0(x+k)S0(x)=S0(x+1)S0(x)S0(x+k)S0(x+1)=pxk1px+1.

An intuitive explanation of this recursion relation is as follows:

  • for LHS, ex is the curtate-expectation-of-life of (x);
  • for RHS, ex+1 is the curtate-expectation-of-life of (x+1), and we want to "transform" it to the expectation of (x). The first step is adding 1 to it, since this is the expectation with respect to (x+1), but we want the expectation from the perspective of (x), which is 1 year younger. But only this step is not enough, since "ex+1" assumes the life already lives for x+1 years, but for ex, the life is only assumed to live for x years. Hence, we also need to multiply the probability for (x) to live for one year, px, to "get to" ex+1.
  • Now, "the expectation of life from age x+1 onward" is done through ex+1. How about "the expectation of life from age x to age x+1"? Indeed, when the life dies within age x and x+1, K=0. This means such "expectation of life" is zero.

Template:Colored proposition

Proof. ex=0tpxdt=01tpxdt+1tpxdt=01tpxdt+px1t1px+1dt=01tpxdt+px0upx+1du(u=t1)=01tpxdt+pxex+1.

Template:Colored example Template:Colored exercise

Assumptions for fractional ages

Previously, we have discussed the continuous random variable Tx and discrete random variable Kx. A life table can specify the distribution of Kx since the values of qx for different integer x can be obtained from the life table. However, the life table is not enough to specify the distribution of Tx, since we do not know the value of qx when x is not an integer. Thus, in order to specify a distribution of Tx using a life table, we need to make some assumptions about the fractional (non-integer) ages.

In actuarial science, three assumptions are widely used, namely Template:Colored em (UDD) (or linear interpolation), Template:Colored em (or exponential interpolation), and Template:Colored em (or Balducci) assumption (or harmonic interpolation). We will define them each using survival functions, as follows: Template:Colored definition Template:Colored definition Template:Colored remark Template:Colored definition Template:Colored remark Under UDD assumption, we have some "nice" and simple expressions for various probabilities related to mortality. We can obtain those expressions by substituting S0(x+t) by the RHS of the equation mentioned in the assumption. Template:Colored example Template:Colored exercise


For the three assumption mentioned, there is a particularly "nice" and simple result for each of them, and we may use those "nice" results for the calculation in practice, rather than applying the definitions. The "nice" result for UDD assumption is mentioned in a previous example: when 0t1, tqx=t(qx). The "nice" results for the other two assumptions are as follows: Template:Colored theorem

Proof. tpx=S0(x+t)S0(x)=[S0(x)]1t[S0(x+1)]tS0(x)=[S0(x+1)]t[S0(x)]t=(S0(x+1)S0(x))t=pxt.

Template:Colored theorem

Proof. 1tqx+t=1S0(x+1)S0(x+t)=1S0(x+1)(1tS0(x)+tS0(x+1))=1(1t)S0(x+1)S0(x)t=1t(1t)px=(1t)(1px)=(1t)qx.

An interesting result under the UDD assumption is related to the independence of two random variables.

To simplify the notations, from now on, we let T=Tx and K=Kx unless otherwise specified.

Define a continuous random variable S by T=K+S,S(0,1). That is, S is the random variable representing the Template:Colored em lived in the year of death of (x). For example, if S=0.5, then (x) lives for half year in the year of death.

Then, K and S are independent under UDD assumption. This is because (K=k and Ss)=(kTk+s)=kpxsqx+k=kpxsqx+k=(k|qx)s=(K=k)(Ss) under UDD assumption. Also, we can observe that the cdf of S is (Ss)=s=s010, which is the cdf of uniform distribution with support (0,1). This means S follows the uniform distribution on (0,1) under UDD assumption. Hence, 𝔼[S]=12 and Var(S)=112. This gives rise to results under UDD assumptions:

  • ex=𝔼[T]=𝔼[K+S]=𝔼[K]+𝔼[S]=ex+12.
  • Var(T)=Var(T)= independence Var(K)+Var(S)=(k=1(2k1)kpx)(ex)2+112.

These two results give us an alternative way to calculate the mean and variance of T where only Template:Colored em things are used in the calculation. However, we should be careful that these results hold Template:Colored em, so we cannot use these results without UDD assumption.

Template:Colored exercise

Laws of mortality

In this section, we will introduce some simple laws of mortality (i.e. some specified distributions for mortality). Some of these laws may be appropriate to model the human mortality for Template:Colored em ages, but it is commonly believed that none of these laws is appropriate to model the human mortality for Template:Colored em ages.

Indeed, if we want to model the human mortality using some probability distribution, we may need a Template:Colored em of distributions, since the mortality should be distributed in a different manner when human is in different ages, and thus different distributions should be used in different ages. To choose a suitable distribution for some ages, we may investigate the shape of corresponding empirical distribution based on the actual human mortality data, and pick a suitable distribution accordingly. For example, if the mortality increases exponentially for some ages, then we may select a distribution for which the force of mortality also increases exponentially.

In practice, to calculate the probabilities related to human mortality, we usually use life tables for the calculations. This is usually the case for insurance companies. Each insurance company has its own life table, based on the mortality data of, possibly its clients. Since such life table is constructed using the Template:Colored em human mortality data based on past experiences, the life table is usually deemed to be more accurate than a specific distribution.

Nevertheless, having a law for mortality allows us to simplify the calculation of the probabilities related to mortality.

Template:Colored definition Template:Colored remark Template:Colored exercise Template:Colored example Template:Colored definition Template:Colored definition Template:Colored remark Template:Colored definition

Select and ultimate table

When a person purchases a life insurance policy offered by an insurance company, he needs to give some personal information to the insurance company, e.g. some information about his health status. For the insurance company to decide whether it should sell the policy to the person, those information provided by that person is accessed through the process of Template:Colored em. For underwriting, the Template:Colored em check the information to see whether the risk in insuring that person is appropriate.

Without underwriting, it is likely that people will only purchase life insurance policy when they think they will die soon (e.g. they have a very serious disease), so that they will likely have early claims. In this case, the insurance company may need to pay a large amount of money and suffer a great loss in a short time, and then bankrupt. This shows the necessity of underwriting.

Basically, the "select" in the section title arises from the underwriting process, and when we say an individual is "selected" at age x, he is underwritten at age x (so the most recent information about the individual is known). Since there are some new information about the individual when he is underwritten (or selected), we will expect there is some update in his survival distribution, and therefore his probabilities related to mortality will change as well. Because of this, we need to have some changes in the actuarial notations, depending on the selection age.

In such actuarial notations, we usually add square brackets around the age at selection, and the numbers in the subscript change accordingly. For example, q25 becomes q[25] if the selection age is 25, q[12]+13 if the selection age is 12.

Since when a person is underwritten long time ago, he may have poorer health condition (e.g. getting older having some new diseases) from the time at which he is underwritten to now, we will intuitively expect that the longer time passed from the time at which a person aged x is underwritten, the more likely that person will be die in the coming year. That is, q[0]+x>q[1]+x>>q[x1]+1>q[x] [1].

The impact on the survival distribution from the selection age may decrease when the time passed from the selection is longer. Beyond a certain time period, say r years, the "q"'s at the same attained age (i.e. selection age plus the time passed from it to now) but different selection ages will be very close. In other words, q[xj]+r+jq[x]+r,0jx (the condition on j is to ensure that the selection age at LHS xj0) [2]. Such r years is called the Template:Colored em. Because the "q"'s mentioned above will be very close, all such "q"'s will only be written as qx+r, without any square brackets (since the effect of selection is basically "gone", and so the square brackets are gone). For example, if the selection period is 2 years, then q[10]+2 and q[5]+7 will be both written as q12 simply. However, we will not write q[15]+1 and q[16] as q16, since these two "q"'s are "quite different" because of the selection impact is still "quite large".

The following are some terms related to life table.

For example, an excerpt of select-and-ultimate table with select period 2 years may look like:

Select-and-ultimate table with 2-year select period
Age at selection, x q[x] q[x]+1 q[x]+2qx+2
0 q[0] q[0]+1 q2
1 q[1] q[1]+1 q3
2 q[2] q[2]+1 q4
... ... ... ...

The last column is the ultimate table. We can observe that we do not need additional columns for qx+3,qx+4 etc., since we can already get such values in the "qx+2" value in a different row (with different x).

Given a select-and-ultimate table, we can do various calculations based on it. Template:Colored example Template:Nav Template:BookCat

  1. The number inside square brackets is integer since it refers to age.
  2. (Optional) To be more precise, the selection period is the smallest integer r such that |q[x]+rq[xj]+r+j|<ε (ε is a small positive constant. When it is smaller, the requirement is "stricter" and the "q"'s will be "closer".) for each j{0,1,,x}.