Loss Data Analytics

9.1 Introduction to Applications of Credibility Theory

What premium should be charged to provide insurance? The answer depends upon the exposure to the risk of loss. A common method to compute an insurance premium is to rate an insured using a classification rating plan. A classification plan is used to select an insurance rate based on an insured’s rating characteristics such as geographic territory, age, etc. All classification rating plans use a limited set of criteria to group insureds into a “class” and there will be variation in the risk of loss among insureds within the class.

An experience rating plan attempts to capture some of the variation in the risk of loss among insureds within a rating class by using the insured’s own loss experience to complement the rate from the classification rating plan. One way to do this is to use a credibility weight $Z$ with $0\leq Z \leq 1$ to compute

\[\begin{equation*} \hat{R}=Z\bar{X}+(1-Z)M, \end{equation*}\]

where

\[\begin{eqnarray*} \hat{R}&=&\textrm{credibility weighted rate for risk,}\\ \bar{X}&=&\textrm{average loss for the risk over a specified time period,}\\ M&=&\textrm{the rate for the classification group, often called the manual rate.} \end{eqnarray*}\]

For a large risk whose loss experience is stable from year to year, $Z$ might be close to 1. For a smaller risk whose losses vary widely from year to year, $Z$ may be close to 0.
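The weighting formula above can be sketched in a few lines of Python. This is only an illustration: the function name `credibility_weighted_rate` and all dollar figures and $Z$ values below are hypothetical and are not taken from the text.

```python
def credibility_weighted_rate(z: float, xbar: float, manual_rate: float) -> float:
    """Return Z * xbar + (1 - Z) * M for a credibility weight 0 <= Z <= 1."""
    if not 0.0 <= z <= 1.0:
        raise ValueError("credibility weight Z must lie in [0, 1]")
    return z * xbar + (1.0 - z) * manual_rate


# Hypothetical figures, chosen only for illustration.
manual_rate = 1000.0  # M: manual (class) rate
xbar = 1400.0         # average observed loss for the risk

# A large, stable risk gets a weight near 1; a small, volatile risk a weight near 0.
large_stable_risk = credibility_weighted_rate(0.9, xbar, manual_rate)
small_volatile_risk = credibility_weighted_rate(0.1, xbar, manual_rate)

print(f"large stable risk:   {large_stable_risk:.2f}")
print(f"small volatile risk: {small_volatile_risk:.2f}")
```

With a weight near 1 the rate stays close to the risk's own experience, and with a weight near 0 it stays close to the manual rate.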

Credibility theory is also used for computing rates for individual classes within a classification rating plan. When classification plan rates are being determined, some or many of the groups may not have sufficient data to produce stable and reliable rates. The actual loss experience for a group will be assigned a credibility weight $Z$ and the complement of credibility $1-Z$ may be given to the average experience for risk across all classes. Or, if a class rating plan is being updated, the complement of credibility may be assigned to the current class rate. Credibility theory can also be applied to the calculation of expected frequencies and severities.

Computing numeric values for $Z$ requires analysis and understanding of the data. What are the variances in the number of losses and sizes of losses for risks? What is the variance between expected values across risks?

9.2 Limited Fluctuation Credibility

In this section, you learn how to:

Calculate full credibility standards for number of claims, average size of claims, and aggregate losses.
Explain how the relationship between the means and variances of the underlying distributions affects full credibility standards.
Determine the credibility weight $Z$ using the square-root partial credibility formula.

Limited fluctuation credibility, also called “classical credibility”, was given this name because the method explicitly attempts to limit fluctuations in estimates for claim frequencies, severities, or losses. For example, suppose that you want to estimate the expected number of claims for a group of risks in an insurance rating class. How many risks are needed in the class to ensure that a specified level of accuracy is attained in the estimate? First the question will be considered from the perspective of how many claims are needed.

9.2.1 Full Credibility for Claim Frequency

Let $N$ be a random variable representing the number of claims for a group of risks. The observed number of claims will be used to estimate $\mu_N=\mathrm{E}[N]$, the expected number of claims. How big does $\mu_N$ need to be to get a good estimate? One way to quantify the accuracy of the estimate would be a statement like: “The observed value of $N$ should be within 5$\%$ of $\mu_N$ at least 90$\%$ of the time.” Writing this as a mathematical expression would give $\Pr[0.95\mu_N\leq N \leq1.05\mu_N] \geq 0.90$. Generalizing this statement by letting $k$ replace 5$\%$ and probability $p$ replace 0.90 produces the accuracy requirement

\[\begin{equation} \Pr[(1-k)\mu_N\leq N \leq(1+k)\mu_N] \geq p. \tag{9.1} \end{equation}\]

The expected number of claims required for the probability on the left-hand side of (9.1) to equal $p$ is called the full credibility standard.

If the expected number of claims is greater than or equal to the full credibility standard then full credibility can be assigned to the data so $Z=1$. Usually the expected value $\mu_N$ is not known so full credibility will be assigned to the data if the actual observed value of $N$ is greater than or equal to the full credibility standard. The $k$ and $p$ values must be selected and the actuary may rely on experience, judgment, and other factors in making the choices.

Subtracting $\mu_N$ from each term in (9.1) and dividing by the standard deviation $\sigma_N$ of $N$ gives

\[\begin{equation} \Pr\left[\frac{-k\mu_N}{\sigma_N}\leq \frac{N-\mu_N}{\sigma_N} \leq \frac{k\mu_N}{\sigma_N}\right] \geq p. \tag{9.2} \end{equation}\]

For large values of $\mu_N=\mathrm{E}[N]$ it may be reasonable to approximate the distribution of the standardized variable $W=(N-\mu_N)/\sigma_N$ with the standard normal distribution.

Let $y_p$ be the value such that $\Pr[-y_p\leq W \leq y_p]=\Phi(y_p)-\Phi(-y_p)=p$ where $\Phi( )$ is the cumulative standard normal distribution. Because $\Phi(-y_p)=1-\Phi(y_p)$, the equality can be rewritten as $2\Phi(y_p)-1=p$. Solving for $y_p$ gives $y_p=\Phi^{-1}((p+1)/2)$ where $\Phi^{-1}( )$ is the inverse of the cumulative standard normal distribution.
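The quantile $y_p=\Phi^{-1}((p+1)/2)$ can be evaluated numerically. The sketch below uses `NormalDist.inv_cdf` from Python's standard `statistics` module; the helper name `y_p` and the sample probabilities are choices made here for illustration.

```python
from statistics import NormalDist


def y_p(p: float) -> float:
    """Return the standard normal value y with Phi(y) - Phi(-y) = p."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return NormalDist().inv_cdf((p + 1.0) / 2.0)


for p in (0.90, 0.95, 0.99):
    print(f"p = {p:.2f}: y_p = {y_p(p):.3f}")
# p = 0.90: y_p = 1.645
# p = 0.95: y_p = 1.960
# p = 0.99: y_p = 2.576
```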

Equation (9.2) will be satisfied if $k\mu_N/\sigma_N \geq y_p$ assuming the normal approximation. First we will consider this inequality for the case when $N$ has a Poisson distribution: $\Pr[N=n] = \lambda^n\textrm{e}^{-\lambda}/n!$. Because $\lambda=\mu_N=\sigma_N^2$ for the Poisson, taking square roots yields $\mu_N^{1/2}=\sigma_N$. So, $k\mu_N/\mu_N^{1/2} \geq y_p$ which is equivalent to $\mu_N \geq (y_p/k)^2$. Let’s define $\lambda_{kp}$ to be the value of $\mu_N$ for which equality holds. Then the full credibility standard for the Poisson distribution is

\[\begin{equation} \lambda_{kp} = \left(\frac{y_p}{k}\right)^2 \quad \textrm{with } y_p=\Phi^{-1}((p+1)/2). \tag{9.3} \end{equation}\]

If the expected number of claims $\mu_N$ is greater than or equal to $\lambda_{kp}$ then equation (9.1) is assumed to hold and full credibility can be assigned to the data. As noted previously, because $\mu_N$ is usually unknown, full credibility is given if the observed value of $N$ satisfies $N \geq \lambda_{kp}.$
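As a rough sketch, equation (9.3) and the decision rule $N \geq \lambda_{kp}$ could be coded as follows. The function names are hypothetical, and the inputs $k=0.05$, $p=0.90$ simply reuse the illustrative accuracy statement from earlier in this section.

```python
from statistics import NormalDist


def full_credibility_standard(k: float, p: float) -> float:
    """Poisson full credibility standard (y_p / k)^2 from equation (9.3)."""
    if k <= 0.0 or not 0.0 < p < 1.0:
        raise ValueError("need k > 0 and 0 < p < 1")
    y = NormalDist().inv_cdf((p + 1.0) / 2.0)
    return (y / k) ** 2


def has_full_credibility(observed_claims: int, k: float, p: float) -> bool:
    """Assign Z = 1 when the observed claim count reaches the standard."""
    return observed_claims >= full_credibility_standard(k, p)


standard = full_credibility_standard(k=0.05, p=0.90)
print(round(standard, 1))                     # about 1082.2
print(has_full_credibility(1200, 0.05, 0.90))  # True
print(has_full_credibility(800, 0.05, 0.90))   # False
```

Tightening $k$ or raising $p$ increases the standard, so more claims are needed before the data receive $Z=1$.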

Example 9.2.1. The full credibility standard is set so that the observed number of claims is to be within 5% of the expected value with probability $p=0.95$. If the number of claims has a Poisson distribution find the number of claims needed for full credibility.
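One way to work this example numerically is to apply equation (9.3) with $k=0.05$ and $p=0.95$. The sketch below also compares the result with the exact Poisson probability, a check that is not part of the example itself and is added here only to gauge the normal approximation; the helper names are hypothetical.

```python
import math
from statistics import NormalDist

k, p = 0.05, 0.95

# Equation (9.3): lambda_kp = (y_p / k)^2 with y_p = Phi^{-1}((p + 1) / 2).
y_p = NormalDist().inv_cdf((p + 1.0) / 2.0)
lambda_kp = (y_p / k) ** 2
claims_needed = math.ceil(lambda_kp)  # round up to a whole number of claims


def poisson_pmf(n: int, lam: float) -> float:
    """Poisson probability lam^n * exp(-lam) / n!, computed on the log scale."""
    return math.exp(n * math.log(lam) - lam - math.lgamma(n + 1))


def prob_within(lam: float, k: float) -> float:
    """Exact Pr[(1 - k) * lam <= N <= (1 + k) * lam] for N ~ Poisson(lam)."""
    lo = math.ceil((1.0 - k) * lam)
    hi = math.floor((1.0 + k) * lam)
    return sum(poisson_pmf(n, lam) for n in range(lo, hi + 1))


exact = prob_within(lambda_kp, k)

print(f"y_p           = {y_p:.3f}")        # 1.960
print(f"lambda_kp     = {lambda_kp:.2f}")  # 1536.58
print(f"claims needed = {claims_needed}")  # 1537
print(f"exact Poisson probability at lambda_kp = {exact:.4f}")
```

Under these inputs, about 1,537 claims are needed for full credibility. Using the rounded table value $y_p=1.96$ instead gives $(1.96/0.05)^2=1536.64$, which rounds up to the same count.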
