This note describes a case study on survival models, which can be used to measure the worsening of mortality caused by anti-selective lapses. This example demonstrates the power of survival models, which we have implemented within an R package called BenG. The package fits parametric survival models for any decrement, such as mortality or lapse, to generate portfolio-specific assumptions for actuarial modelling. Our case study incorporates multivariate analysis, which means we model multiple risk factors at the same time. Survival models in BenG can also be used to measure time trends.
Data
In this case study, we use mortality experience data for a Universal Life (UL) portfolio covering insured lives aged 18 to 99 years, observed over calendar years 2008 to 2017, inclusive.
The data contains multiple risk factors. In this report, we investigate the effects of gender, smoker category, and the Cost of Insurance (COI) type on mortality. Note that there are two COI types in this portfolio: Annual Renewable Term (ART) insurance where the COI increases annually, and Level (LVL) insurance where the COI remains constant over the entire premium payment period.
| Category | Level | Lives | Exposure | Decrements |
|---|---|---|---|---|
| All | All | 225,385 | 1,588,696.44 | 2,981 |
| Gender Smoker | FNS | 101,297 | 708,951.05 | 923 |
| FSM | 12,365 | 81,678.31 | 239 | |
| MNS | 92,904 | 675,565.39 | 1,339 | |
| MSM | 18,819 | 122,501.70 | 480 |
Besides the usual distinction between genders and smoking status, the data includes information about the life insurance products, i.e., the premium structure being either Level COI (LVL) or Annually Renewable Term (ART).
We also sort the lives insured into face amount bands, according to their policy size. The chart below demonstrates that the 9% of lives with the highest face amount make for 30% of the financial risk, demonstrating substantial concentration risk. Modelling amount-specific mortality is outside the scope of this short note, but can be readily implemented using the same techniques as outlined here.
| Category | Level | Lives | Exposure | Decrements |
|---|---|---|---|---|
| All | All | 225,385 | 1,588,696.4 | 2,981 |
| Face Amount Band | [0,100k) | 46,420 | 315,880.0 | 1,066 |
| [100k,150k) | 75,780 | 549,977.5 | 1,053 | |
| [150k,250k) | 37,643 | 259,123.0 | 348 | |
| [250k,500k) | 44,548 | 312,031.4 | 332 | |
| 500k+ | 20,994 | 151,684.5 | 182 |
Selecting Mortality Law
We fit mortality laws for the force of mortality \(\mu_x\) to data at the level of the individual using Maximum Likelihood Estimation. We then compare the model fit to the crude hazard rates by age, i.e., the ratio of deaths to central exposure by age-group. Comparing the fit to the crude rates visually and in terms of the deviance residuals, we can tell which model is most suitable. The AIC score also provides a quantitative measure for model choice.
A Gompertz model is equivalent to fitting a Generalised Linear Model (GLM) with a \(log\) link. Observe how this fails to model mortality for younger ages which experience the so-called “accident hump” of mortality.
\[\mu_{x} = e^{\alpha+\beta x}\] \[ \Leftrightarrow log(\mu_{x}) = \alpha+\beta x\]
Here the parameter \(\alpha\) is the INTERCEPT and \(\beta\) the SLOPE of the logarithmic force of mortality.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -11.6157 | 0.0903 | -128.57 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta | 0.0992 | 0.0014 | 69.97 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
The Perks model is also an example of a GLM, using the \(logit\) link function. It is simply the logistic extension to Gompertz’ model.
\[\mu_{x} = \frac{e^{\alpha + \beta x}}{1+e^{\alpha + \beta x}} \Leftrightarrow logit(\mu_{x}) = \alpha+\beta x\] Note that the force of mortality \(\mu_x\) tends towards \(1\), and thus \(q_x \to 0.6321\) when \(x \to \infty\).
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -11.6396 | 0.0918 | -126.74 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta | 0.0997 | 0.0014 | 68.85 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
A Makeham model, which has an additive parameter, models the younger ages’ mortality far better.
\[\mu_{x} = e^{\epsilon} +e^{\alpha + \beta x}\]
Note that the Makeham factor \(e^{\epsilon}\) breaks the linearity so that the Makeham mortality law cannot be modelled using GLMs.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -12.7400 | 0.1504 | -84.73 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta | 0.1147 | 0.0021 | 53.64 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon | -8.3400 | 0.1133 | -73.60 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
The Makeham-Beard model, proposed by R.E. Beard in 1954, generalises the Makeham-Perks model (Perks, 1932) adding the Beard parameter \(\rho\).
\[\mu_{x} = \frac{e^\epsilon + e^{\alpha + \beta x}}{1+e^{\alpha + \rho + \beta x}}\]
The BEARD parameter \(\rho\) governs the asymptotic mortality behaviour at advanced ages, because \(\mu_x \to e^{-\rho}\) for \(x \to \infty\). A postive estimate \(\rho \ge 0\) is equivalent to finding a mortality plateau. However, even a statistically significant negative estimate of the BEARD parameter \(\rho \lt 0\) indicates mortality deceleration.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -12.7528 | 0.1504 | -84.82 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta | 0.1149 | 0.0021 | 53.74 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon | -8.3315 | 0.1123 | -74.21 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| rho | -4.4720 | 0.0002 | -21086.55 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
Adding Risk Factors
Gender and smoker category are the most important risk factors to model. We do not split our data and create models for MNS, MSM, FNS, and FSM, separately - we create one single model that results in different \(\mu_x\)-rates based on risk factor combination.
Including risk factors in the model is done by estimating an adjustment to any of the standard parameters \(\theta = (\alpha, \beta, \epsilon, \rho)\), depending on whether a given life \(i\) is in risk group \(j\).
\[\theta_i = \theta_{baseline} + \sum_{j=1}^{m} z_{ij}\theta_j\]
The indicator parameter is unity, \(z_{i,j} = 1\), if person \(i\) is in risk group \(j\), otherwise \(z_{i,j}=0\).
We refer to adjustments to the INTERCEPT parameter \(\alpha\) as Main Effects, adjustments to the age parameter \(\beta\) as AGE interactions, and to the Makeham parameter as MAKEHAM interactions. Note that having different values for the BEARD parameter by risk group is not very common, however, because mortality differentials generally disappear at advanced ages. This phenomenon is referred to as mortality convergence.
We successively add main effects and interaction parameters to improve the model. The three main criteria for a better model are (1) that each new parameter has a statistically significant estimate, (2) that adding the parameter reduces the AIC score by at least four units, an (3) that the corresponding residual chart shows greater randomness than before inclusion of the new parameter.
This bottom-up method of model building is illustrated below for the risk factors GENDER and SMOKER category.
Reviewing the model results for our simple model without risk factors by gender and smoker categories in the charts below, we clearly see how male mortality would be under, and female mortality overestimated, just as the excess mortality for smokers would not be captured in the simple model.
Adding a main effect for GENDER in the form of an \(\alpha\) parameter already improves the model a lot, reducing the AIC score by more than 100 units. The residual charts on the right-hand side of the panel below show that for younger ages the gender differential is even greater than the age-independent main effect allows for.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -12.6353 | 0.1532 | -82.50 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.4249 | 0.0430 | -9.88 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| beta | 0.1159 | 0.0022 | 52.66 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon | -8.3191 | 0.1110 | -74.93 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
The inclusion of a MAKEHAM:GENDER interaction parameter improves the AIC by another 8 units. We also tested modelling the age-dependence of the gender differential using an AGE:GENDER interaction term, which turned out to be less effective.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -12.7051 | 0.1546 | -82.20 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.3677 | 0.0464 | -7.93 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| beta | 0.1165 | 0.0022 | 52.91 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon | -8.0327 | 0.1231 | -65.27 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.5581 | 0.1810 | -3.08 | 0.0020 | ** | 113,662 | 790,629.4 | 1,162 |
We deal with the smoker mortality differential in a similar way. First, we incorporate a main effect for SMOKER in the Makeham model. This already improves the AIC by more than 350 points. The residual charts show that this already accounts for most of the smoker differential.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -13.2198 | 0.1615 | -81.84 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.3399 | 0.0462 | -7.37 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| alpha.smokerCategory.SM | 1.1511 | 0.0492 | 23.38 | 0.0000 | *** | 31,184 | 204,180.0 | 719 |
| beta | 0.1212 | 0.0022 | 53.90 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon | -8.0471 | 0.1207 | -66.66 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.4434 | 0.1667 | -2.66 | 0.0078 | ** | 113,662 | 790,629.4 | 1,162 |
When we adjust the AGE parameter for smokers with an AGE:SMOKER interaction as shown in the table below as beta.smokerCategory.SM we find that the model marginally improves, with the AIC reduced by three units and the parameter estimate statistically significant with a p-value of \(0.046\). A qualitative difference in the residual fit is difficult to discern.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -13.3854 | 0.1845 | -72.54 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.3462 | 0.0464 | -7.46 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| alpha.smokerCategory.SM | 1.7056 | 0.2829 | 6.03 | 0.0000 | *** | 31,184 | 204,180.0 | 719 |
| beta | 0.1236 | 0.0026 | 47.83 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta.smokerCategory.SM | -0.0086 | 0.0043 | -1.99 | 0.0465 | * | 31,184 | 204,180.0 | 719 |
| epsilon | -8.0325 | 0.1192 | -67.41 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.4155 | 0.1626 | -2.55 | 0.0106 | * | 113,662 | 790,629.4 | 1,162 |
Selection effect
Just as we model risk factors by age, we can also model selection effects by policy duration. The chart of residuals for a model without selection plotted against policy duration below shows that a selection effect is present in the data.
To model the selection effect, we split the experience data for each life into phases. In this example, policy durations up to five years fall within \(Phase\ 1 := [0,5)\) and \(Phase\ 2 := [5, \infty)\) contains the exposure for policies five years or older. Using the DURATION parameter \(\gamma\) as a coefficient for policy duration \(t\), we can model simple log-linear select effects by phase according to the equation below.
\[\mu_{x,t} = e^{\epsilon} +e^{\alpha + \beta x + \gamma t}\]
The table below summarises the experience data by policy year phase. Note that we only observe a fraction of the total deaths during the first five policy years.
| Category | Level | Lives | Exposure | Decrements |
|---|---|---|---|---|
| All | All | 225,385 | 1,588,696.4 | 2,981 |
| Phase | 1 | 130,380 | 436,135.6 | 237 |
| 2 | 185,830 | 1,152,560.8 | 2,744 |
Again, we start off with a model that does not include DURATION and compare the crude hazard rates by duration with the rates predicted by the model built so far. The model clearly overestimates mortality for policy years 1 to 5, and also underestimates mortality for policy years 11+, as can be seen in the charts below.
| Category | Level | Lives | Exposure | Decrements |
|---|---|---|---|---|
| All | All | 225,385 | 1,588,696.4 | 2,981 |
| Phase | 1 | 130,380 | 436,135.6 | 237 |
| 2 | 185,830 | 1,152,560.8 | 2,744 |
Considering policies with Level COI and ART business separately, the residual charts below show that the selection effect is different for the two categories of business. ART business shows a steeper slope of increasing mortality rates than Level COI business.
| Category | Level | Lives | Exposure | Decrements |
|---|---|---|---|---|
| All | All | 225,385 | 1,588,696.4 | 2,981 |
| COI Type | ART | 123,429 | 793,575.1 | 1,095 |
| LVL | 101,956 | 795,121.4 | 1,886 | |
| Phase | 1 | 130,380 | 436,135.6 | 237 |
| 2 | 185,830 | 1,152,560.8 | 2,744 |
Including a DURATION parameter gamma and both a main effect alpha.phase.1 and a DURATION:PHASE interaction parameter gamma.phase.1 largely captures the selection effect, as shown in the charts below. The estimate for gamma.phase.1 shown in the table below is not statistically significant, but is included here for completeness.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -13.3999 | 0.1890 | -70.89 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.2975 | 0.0465 | -6.40 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| alpha.smokerCategory.SM | 1.6897 | 0.2843 | 5.94 | 0.0000 | *** | 31,184 | 204,180.0 | 719 |
| alpha.COIType.ART | 0.1224 | 0.0442 | 2.77 | 0.0056 | ** | 123,429 | 793,575.1 | 1,095 |
| alpha.phase.1 | -0.6025 | 0.2557 | -2.36 | 0.0185 | * | 130,380 | 436,135.6 | 237 |
| beta | 0.1187 | 0.0026 | 45.31 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta.smokerCategory.SM | -0.0089 | 0.0044 | -2.04 | 0.0414 | * | 31,184 | 204,180.0 | 719 |
| gamma | 0.0196 | 0.0037 | 5.33 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| gamma.phase.1 | 0.0892 | 0.0725 | 1.23 | 0.2184 | 130,380 | 436,135.6 | 237 | |
| epsilon | -8.0183 | 0.1165 | -68.82 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.4534 | 0.1617 | -2.80 | 0.0050 | ** | 113,662 | 790,629.4 | 1,162 |
As we had seen in the earlier residual charts that ART business experiences greater ongoing deterioration than Level COI business, we account for this phenomenon by including a parameter for the interaction between DURATION and COI-TYPE. This parameter gamma.COIType.ART turns out to be statistically significant, according to the table below. This means that we can objectively measure that the ongoing deterioration of mortality rates for the ART business is worse than for Level COI business.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -13.3095 | 0.1880 | -70.81 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.2989 | 0.0465 | -6.43 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| alpha.smokerCategory.SM | 1.6896 | 0.2838 | 5.95 | 0.0000 | *** | 31,184 | 204,180.0 | 719 |
| alpha.phase.1 | -0.3453 | 0.1141 | -3.03 | 0.0025 | ** | 130,380 | 436,135.6 | 237 |
| beta | 0.1184 | 0.0026 | 45.24 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta.smokerCategory.SM | -0.0090 | 0.0044 | -2.06 | 0.0396 | * | 31,184 | 204,180.0 | 719 |
| gamma | 0.0152 | 0.0040 | 3.80 | 0.0001 | *** | 225,385 | 1,588,696.4 | 2,981 |
| gamma.COIType.ART | 0.0080 | 0.0025 | 3.26 | 0.0011 | ** | 123,429 | 793,575.1 | 1,095 |
| epsilon | -8.0203 | 0.1170 | -68.58 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.4522 | 0.1620 | -2.79 | 0.0052 | ** | 113,662 | 790,629.4 | 1,162 |
Mortality Trend
Within the survival model framework we include mortality trends by treating calendar year \(y\) as another independent variable with a \(TIME\) coefficient \(\delta\).
\[\mu_{x,t,y} = e^{\epsilon} +e^{\alpha + \beta x + \gamma t + \delta (y \ -2000)}\]
As above, we compare the model so far with the crude hazard rates, here against calendar year. The residual chart on the right-hand side below, indicates a clear improvement trend, which we can account for with a single additional parameter.
Once the TIME parameter delta is included, the apparent trend in the residuals disappears. However, the inclusion of this parameter has an even greater impact: It unmasks the ongoing deterioration by policy year of the Level COI business. The strong trend of mortality improving at a rate of around \(3.4\%\) per year was masking the impact of anti-selective lapses, which our model is now able to fully capture.
| parameter | estimate | stdError | zScore | pValue | sig | subjects | exposure | decrements |
|---|---|---|---|---|---|---|---|---|
| alpha | -12.9698 | 0.2015 | -64.38 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| alpha.gender.F | -0.2873 | 0.0466 | -6.17 | 0.0000 | *** | 113,662 | 790,629.4 | 1,162 |
| alpha.smokerCategory.SM | 1.6638 | 0.2848 | 5.84 | 0.0000 | *** | 31,184 | 204,180.0 | 719 |
| alpha.phase.1 | -0.3155 | 0.1149 | -2.75 | 0.0060 | ** | 130,380 | 436,135.6 | 237 |
| beta | 0.1189 | 0.0026 | 45.33 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| beta.smokerCategory.SM | -0.0086 | 0.0044 | -1.97 | 0.0490 | * | 31,184 | 204,180.0 | 719 |
| gamma | 0.0217 | 0.0044 | 4.97 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| gamma.COIType.ART | 0.0062 | 0.0025 | 2.47 | 0.0134 | * | 123,429 | 793,575.1 | 1,095 |
| epsilon | -8.0079 | 0.1154 | -69.41 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
| epsilon.gender.F | -0.4664 | 0.1609 | -2.90 | 0.0037 | ** | 113,662 | 790,629.4 | 1,162 |
| delta | -0.0344 | 0.0080 | -4.28 | 0.0000 | *** | 225,385 | 1,588,696.4 | 2,981 |
Discussion and Outlook
The final model shown in the table above incorporates all our findings: It demonstrates that we have modelled the age-dependence of mortality, included the selection effect in early policy years, as well as the mortality trend by calendar year and anti-selective mortality deterioration, all in one model. Besides the “usual suspects” of gender and smoking status, we were able to incorporate COI Type as a risk factor. We demonstrate that lives covered by policies with annually increasing COI (ART) experience greater ongoing deterioration of mortality with policy duration than Level COI business, as the estimate for the DURATION:COI-TYPE interaction is statistically significant. It’s value implies that ART business experiences ongoing deterioration at a rate of around \(2.8\%\) per policy year, while the mortality of Level COI business worsens at a rate of around \(2.2\%\) per policy year. Both effects are masked by an ongoing mortality improvement trend of around \(3.4\%\) per year.
Furthermore, our modelling demonstrated that it would be inaccurate to simply use a mortality table with greater slope for ART business, as an AGE:COI-TYPE interaction was not significant in the presence of the duration parameter \(\gamma\).
A model for mortality rates which are to be used in an actuarial application should also properly account for differences in mortality by policy size, as shown in the exploratory analysis. Differentiating mortality by face amount band can be implemented in a similar way as shown for the other risk factors.
The Appendices show mortality rates \(q_x\) against attained age, as well as a review of crude hazard rates by each risk factor compared to the corresponding final model expected deaths.
Appendix: Mortality rates
The models developed in this note can be used to derive actuarial \(q_x\) type rates for any given valuation date. Note that we do not recommend applying the TIME trend \(\delta\) beyond the observation period of the mortality study.