SEVERITY Procedure

Predefined Distributions

For the response variable Y, PROC SEVERITY assumes the model

upper Y tilde script upper F left-parenthesis normal upper Theta right-parenthesis

where script upper F is a continuous probability distribution with parameters normal upper Theta. The model hypothesizes that the observed response is generated from a stochastic process that is governed by the distribution script upper F. This model is usually referred to as the error model. Given a representative input sample of response variable values, PROC SEVERITY estimates the model parameters for any distribution script upper F and computes the statistics of fit for each model. This enables you to find the distribution that is most likely to generate the observed sample.

A set of predefined distributions is provided with the SEVERITY procedure. A summary of the distributions is provided in Table 3. For each distribution, the table lists the name of the distribution that should be used in the DIST statement, the parameters of the distribution along with their bounds, and the mathematical expressions for the probability density function (PDF) and cumulative distribution function (CDF) of the distribution.

All the predefined distributions, except LOGN and TWEEDIE, are parameterized such that their first parameter is the scale parameter. For LOGN, the first parameter mu is a log-transformed scale parameter. TWEEDIE does not have a scale parameter. The presence of scale parameter or a log-transformed scale parameter enables you to use all of the predefined distributions, except TWEEDIE, as a candidate for estimating regression effects.

A distribution model is associated with each predefined distribution. You can also define your own distribution model, which is a set of functions and subroutines that you define by using the FCMP procedure. For more information, see the section Defining a Severity Distribution Model with the FCMP Procedure.

Table 3: Predefined PROC SEVERITY Distributions

TheadName Distribution Parameters PDF (f) and CDF (F)
BURR Burr (Type XII) theta greater-than 0, alpha greater-than 0, f left-parenthesis x right-parenthesis equals StartFraction alpha gamma z Superscript gamma Baseline Over x left-parenthesis 1 plus z Superscript gamma Baseline right-parenthesis Superscript left-parenthesis alpha plus 1 right-parenthesis Baseline EndFraction
gamma greater-than 0 upper F left-parenthesis x right-parenthesis equals 1 minus left-parenthesis StartFraction 1 Over 1 plus z Superscript gamma Baseline EndFraction right-parenthesis Superscript alpha
EXP Exponential theta greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction 1 Over theta EndFraction e Superscript negative z
upper F left-parenthesis x right-parenthesis equals 1 minus e Superscript negative z
GAMMA Gamma theta greater-than 0, alpha greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction z Superscript alpha Baseline e Superscript negative z Baseline Over x normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction
upper F left-parenthesis x right-parenthesis equals StartFraction gamma left-parenthesis alpha comma z right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction
GPD Generalized theta greater-than 0, xi greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction 1 Over theta EndFraction left-parenthesis 1 plus xi z right-parenthesis Superscript negative 1 minus 1 slash xi
Pareto upper F left-parenthesis x right-parenthesis equals 1 minus left-parenthesis 1 plus xi z right-parenthesis Superscript negative 1 slash xi
IGAUSS Inverse Gaussian theta greater-than 0, alpha greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction 1 Over theta EndFraction StartRoot StartFraction alpha Over 2 pi z cubed EndFraction EndRoot e Superscript StartFraction minus alpha left-parenthesis z minus 1 right-parenthesis squared Over 2 z EndFraction
(Wald) upper F left-parenthesis x right-parenthesis equals normal upper Phi left-parenthesis left-parenthesis z minus 1 right-parenthesis StartRoot StartFraction alpha Over z EndFraction EndRoot right-parenthesis plus
normal upper Phi left-parenthesis minus left-parenthesis z plus 1 right-parenthesis StartRoot StartFraction alpha Over z EndFraction EndRoot right-parenthesis e Superscript 2 alpha
LOGN Lognormal mu (no bounds), f left-parenthesis x right-parenthesis equals StartFraction 1 Over x sigma StartRoot 2 pi EndRoot EndFraction e Superscript minus one-half left-parenthesis StartFraction log left-parenthesis x right-parenthesis minus mu Over sigma EndFraction right-parenthesis squared
sigma greater-than 0 upper F left-parenthesis x right-parenthesis equals normal upper Phi left-parenthesis StartFraction log left-parenthesis x right-parenthesis minus mu Over sigma EndFraction right-parenthesis
PARETO Pareto (Type II) theta greater-than 0, alpha greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction alpha theta Superscript alpha Baseline Over left-parenthesis x plus theta right-parenthesis Superscript alpha plus 1 Baseline EndFraction
upper F left-parenthesis x right-parenthesis equals 1 minus left-parenthesis StartFraction theta Over x plus theta EndFraction right-parenthesis Superscript alpha
TWEEDIE Tweedie** p greater-than 1, mu greater-than 0, f left-parenthesis x right-parenthesis equals a left-parenthesis x comma phi right-parenthesis exp left-bracket StartFraction 1 Over phi EndFraction left-parenthesis StartFraction x mu Superscript 1 minus p Baseline Over 1 minus p EndFraction minus kappa left-parenthesis mu comma p right-parenthesis right-parenthesis right-bracket
phi greater-than 0 upper F left-parenthesis x right-parenthesis equals integral Subscript 0 Superscript x Baseline f left-parenthesis t right-parenthesis d t
STWEEDIE Scaled Tweedie** theta greater-than 0, lamda greater-than 0, f left-parenthesis x right-parenthesis equals a left-parenthesis x comma theta comma lamda comma p right-parenthesis exp left-parenthesis minus StartFraction x Over theta EndFraction minus lamda right-parenthesis
1 less-than p less-than 2 upper F left-parenthesis x right-parenthesis equals integral Subscript 0 Superscript x Baseline f left-parenthesis t right-parenthesis d t
WEIBULL Weibull theta greater-than 0, tau greater-than 0 f left-parenthesis x right-parenthesis equals StartFraction 1 Over x EndFraction tau z Superscript tau Baseline e Superscript minus z Super Superscript tau
upper F left-parenthesis x right-parenthesis equals 1 minus e Superscript minus z Super Superscript tau
**For more information, see the section Tweedie Distributions.
Notes:
1. z equals x slash theta, wherever z is used.
2. theta denotes the scale parameter for all the distributions. For LOGN, log left-parenthesis theta right-parenthesis equals mu.
3. Parameters are listed in the order in which they are defined in the distribution model.
4. gamma left-parenthesis a comma b right-parenthesis equals integral Subscript 0 Superscript b Baseline t Superscript a minus 1 Baseline e Superscript negative t Baseline d t is the lower incomplete gamma function.
5. normal upper Phi left-parenthesis y right-parenthesis equals one-half left-parenthesis 1 plus normal e normal r normal f left-parenthesis StartFraction y Over StartRoot 2 EndRoot EndFraction right-parenthesis right-parenthesis is the standard normal CDF.


Tweedie Distributions

Tweedie distributions are a special case of the exponential dispersion family (Jørgensen 1987) with a property that the variance of the distribution is equal to phi mu Superscript p, where mu is the mean of the distribution, phi is a dispersion parameter, and p is an index parameter as discovered by Tweedie (1984). The distribution is defined for all values of p except for values of p in the open interval left-parenthesis 0 comma 1 right-parenthesis. Many important known distributions are a special case of Tweedie distributions including normal (p=0), Poisson (p=1), gamma (p=2), and the inverse Gaussian (p=3). Apart from these special cases, the probability density function (PDF) of the Tweedie distribution does not have an analytic expression. For p greater-than 1, it has the form (Dunn and Smyth 2005),

f left-parenthesis x semicolon mu comma phi comma p right-parenthesis equals a left-parenthesis x comma phi right-parenthesis exp left-bracket StartFraction 1 Over phi EndFraction left-parenthesis StartFraction x mu Superscript 1 minus p Baseline Over 1 minus p EndFraction minus kappa left-parenthesis mu comma p right-parenthesis right-parenthesis right-bracket

where kappa left-parenthesis mu comma p right-parenthesis equals mu Superscript 2 minus p Baseline slash left-parenthesis 2 minus p right-parenthesis for p not-equals 2 and kappa left-parenthesis mu comma p right-parenthesis equals log left-parenthesis mu right-parenthesis for p = 2. The function a left-parenthesis x comma phi right-parenthesis does not have an analytical expression. It is typically evaluated using series expansion methods described in Dunn and Smyth (2005).

For 1 less-than p less-than 2, the Tweedie distribution is a compound Poisson-gamma mixture distribution, which is the distribution of S defined as

upper S equals sigma-summation Underscript i equals 1 Overscript upper N Endscripts upper X Subscript i

where upper N tilde Poisson left-parenthesis lamda right-parenthesis and upper X Subscript i Baseline tilde gamma left-parenthesis alpha comma theta right-parenthesis are independent and identically distributed gamma random variables with shape parameter alpha and scale parameter theta. At X = 0, the density is a probability mass that is governed by the Poisson distribution, and for values of upper X greater-than 0, it is a mixture of gamma variates with Poisson mixing probability. The parameters lamda, alpha, and theta are related to the natural parameters mu, phi, and p of the Tweedie distribution as

StartLayout 1st Row 1st Column lamda 2nd Column equals StartFraction mu Superscript 2 minus p Baseline Over phi left-parenthesis 2 minus p right-parenthesis EndFraction 2nd Row 1st Column alpha 2nd Column equals StartFraction 2 minus p Over p minus 1 EndFraction 3rd Row 1st Column theta 2nd Column equals phi left-parenthesis p minus 1 right-parenthesis mu Superscript p minus 1 EndLayout

The mean of a Tweedie distribution is positive for p greater-than 1.

Two predefined versions of the Tweedie distribution are provided with the SEVERITY procedure. The first version, named TWEEDIE and defined for p greater-than 1, has the natural parameterization with parameters mu, phi, and p. The second version, named STWEEDIE and defined for 1 less-than p less-than 2, is the version with a scale parameter. It corresponds to the compound Poisson-gamma distribution with gamma scale parameter theta, Poisson mean parameter lamda, and the index parameter p. The index parameter decides the shape parameter alpha of the gamma distribution as

alpha equals StartFraction 2 minus p Over p minus 1 EndFraction

The parameters theta and lamda of the STWEEDIE distribution are related to the parameters mu and phi of the TWEEDIE distribution as

StartLayout 1st Row 1st Column mu 2nd Column equals lamda theta alpha 2nd Row 1st Column phi 2nd Column equals StartFraction left-parenthesis lamda theta alpha right-parenthesis Superscript 2 minus p Baseline Over lamda left-parenthesis 2 minus p right-parenthesis EndFraction equals StartFraction theta Over left-parenthesis p minus 1 right-parenthesis left-parenthesis lamda theta alpha right-parenthesis Superscript p minus 1 Baseline EndFraction EndLayout

You can fit either version when there are no regression variables. Each version has its own merits. If you fit the TWEEDIE version, you have the direct estimate of the overall mean of the distribution. If you are interested in the most practical range of the index parameter 1 less-than p less-than 2, then you can fit the STWEEDIE version, which provides you direct estimates of the Poisson and gamma components that comprise the distribution (an estimate of the gamma shape parameter alpha is easily obtained from the estimate of p).

If you want to estimate the effect of exogenous (regression) variables on the distribution, then you must use the STWEEDIE version, because PROC SEVERITY requires a distribution to have a scale parameter in order to estimate regression effects. For more information, see the section Estimating Regression Effects. The gamma scale parameter theta is the scale parameter of the STWEEDIE distribution. If you are interested in determining the effect of regression variables on the mean of the distribution, you can do so by first fitting the STWEEDIE distribution to determine the effect of the regression variables on the scale parameter theta. Then, you can easily estimate how the mean of the distribution mu is affected by the regression variables using the relationship mu equals c theta, where c equals lamda alpha equals lamda left-parenthesis 2 minus p right-parenthesis slash left-parenthesis p minus 1 right-parenthesis. The estimates of the regression parameters remain the same, whereas the estimate of the intercept parameter is adjusted by the estimates of the lamda and p parameters.

Parameter Initialization for Predefined Distributions

The parameters are initialized by using the method of moments for all the distributions, except for the gamma and the Weibull distributions. For the gamma distribution, approximate maximum likelihood estimates are used. For the Weibull distribution, the method of percentile matching is used.

Given n observations of the severity value y Subscript i (1 less-than-or-equal-to i less-than-or-equal-to n), the estimate of kth raw moment is denoted by m Subscript k and computed as

m Subscript k Baseline equals StartFraction 1 Over n EndFraction sigma-summation Underscript i equals 1 Overscript n Endscripts y Subscript i Superscript k

The 100pth percentile is denoted by pi Subscript p (0 less-than-or-equal-to p less-than-or-equal-to 1). By definition, pi Subscript p satisfies

upper F left-parenthesis pi Subscript p Baseline minus right-parenthesis less-than-or-equal-to p less-than-or-equal-to upper F left-parenthesis pi Subscript p Baseline right-parenthesis

where upper F left-parenthesis pi Subscript p Baseline minus right-parenthesis equals limit Underscript h down-arrow 0 Endscripts upper F left-parenthesis pi Subscript p Baseline minus h right-parenthesis. PROC SEVERITY uses the following practical method of computing pi Subscript p. Let ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y right-parenthesis denote the empirical distribution function (EDF) estimate at a severity value y. Let y Subscript p Superscript minus and y Subscript p Superscript plus denote two consecutive values in the ascending sequence of y values such that ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript minus Baseline right-parenthesis less-than p and ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript plus Baseline right-parenthesis greater-than-or-equal-to p. Then, the estimate ModifyingAbove pi With caret Subscript p is computed as

ModifyingAbove pi With caret Subscript p Baseline equals y Subscript p Superscript minus Baseline plus StartFraction p minus ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript minus Baseline right-parenthesis Over ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript plus Baseline right-parenthesis minus ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript minus Baseline right-parenthesis EndFraction left-parenthesis y Subscript p Superscript plus Baseline minus y Subscript p Superscript minus Baseline right-parenthesis

Let epsilon denote the smallest double-precision floating-point number such that 1 plus epsilon greater-than 1. This machine precision constant can be obtained by using the CONSTANT function in Base SAS software.

The details of how parameters are initialized for each predefined distribution are as follows:

BURR

Burr proposed 12 types of families of continuous distributions (Burr 1942; Rodriguez 2006). The predefined BURR distribution in PROC SEVERITY implements Burr’s Type XII distribution. The parameters are initialized by using the method of moments. The kth raw moment of the Burr distribution of Type XII is

upper E left-bracket upper X Superscript k Baseline right-bracket equals StartFraction theta Superscript k Baseline normal upper Gamma left-parenthesis 1 plus k slash gamma right-parenthesis normal upper Gamma left-parenthesis alpha minus k slash gamma right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction comma negative gamma less-than k less-than alpha gamma

Three moment equations upper E left-bracket upper X Superscript k Baseline right-bracket equals m Subscript k (k equals 1 comma 2 comma 3) need to be solved for initializing the three parameters of the distribution. In order to get an approximate closed form solution, the second shape parameter ModifyingAbove gamma With caret is initialized to a value of 2. If 2 m 3 minus 3 m 1 m 2 greater-than 0, then simplifying and solving the moment equations yields the following feasible set of initial values:

ModifyingAbove theta With caret equals StartRoot StartFraction m 2 m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction EndRoot comma ModifyingAbove alpha With caret equals 1 plus StartFraction m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction comma ModifyingAbove gamma With caret equals 2

If 2 m 3 minus 3 m 1 m 2 less-than epsilon, then the parameters are initialized as follows:

ModifyingAbove theta With caret equals StartRoot m 2 EndRoot comma ModifyingAbove alpha With caret equals 2 comma ModifyingAbove gamma With caret equals 2
EXP

The parameters are initialized by using the method of moments. The kth raw moment of the exponential distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals theta Superscript k Baseline normal upper Gamma left-parenthesis k plus 1 right-parenthesis comma k greater-than negative 1

Solving upper E left-bracket upper X right-bracket equals m 1 yields the initial value of ModifyingAbove theta With caret equals m 1.

GAMMA

The parameter alpha is initialized by using its approximate maximum likelihood (ML) estimate. For a set of n independent and identically distributed observations y Subscript i (1 less-than-or-equal-to i less-than-or-equal-to n) drawn from a gamma distribution, the log likelihood l is defined as follows:

StartLayout 1st Row 1st Column l 2nd Column equals sigma-summation Underscript i equals 1 Overscript n Endscripts log left-parenthesis y Subscript i Superscript alpha minus 1 Baseline StartFraction e Superscript minus y Super Subscript i Superscript slash theta Baseline Over theta Superscript alpha Baseline normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction right-parenthesis 2nd Row 1st Column Blank 2nd Column equals left-parenthesis alpha minus 1 right-parenthesis sigma-summation Underscript i equals 1 Overscript n Endscripts log left-parenthesis y Subscript i Baseline right-parenthesis minus StartFraction 1 Over theta EndFraction sigma-summation Underscript i equals 1 Overscript n Endscripts y Subscript i Baseline minus n alpha log left-parenthesis theta right-parenthesis minus n log left-parenthesis normal upper Gamma left-parenthesis alpha right-parenthesis right-parenthesis EndLayout

Using a shorter notation of sigma-summation to denote sigma-summation Underscript i equals 1 Overscript n Endscripts and solving the equation partial-differential l slash partial-differential theta equals 0 yields the following ML estimate of theta:

ModifyingAbove theta With caret equals StartFraction sigma-summation y Subscript i Baseline Over n alpha EndFraction equals StartFraction m 1 Over alpha EndFraction

Substituting this estimate in the expression of l and simplifying gives

l equals left-parenthesis alpha minus 1 right-parenthesis sigma-summation log left-parenthesis y Subscript i Baseline right-parenthesis minus n alpha minus n alpha log left-parenthesis m 1 right-parenthesis plus n alpha log left-parenthesis alpha right-parenthesis minus n log left-parenthesis normal upper Gamma left-parenthesis alpha right-parenthesis right-parenthesis

Let d be defined as follows:

d equals log left-parenthesis m 1 right-parenthesis minus StartFraction 1 Over n EndFraction sigma-summation log left-parenthesis y Subscript i Baseline right-parenthesis

Solving the equation partial-differential l slash partial-differential alpha equals 0 yields the following expression in terms of the digamma function, psi left-parenthesis alpha right-parenthesis:

log left-parenthesis alpha right-parenthesis minus psi left-parenthesis alpha right-parenthesis equals d

The digamma function can be approximated as follows:

ModifyingAbove psi With caret left-parenthesis alpha right-parenthesis almost-equals log left-parenthesis alpha right-parenthesis minus StartFraction 1 Over alpha EndFraction left-parenthesis 0.5 plus StartFraction 1 Over 12 alpha plus 2 EndFraction right-parenthesis

This approximation is within 1.4% of the true value for all the values of alpha greater-than 0 except when alpha is arbitrarily close to the positive root of the digamma function (which is approximately 1.461632). Even for the values of alpha that are close to the positive root, the absolute error between true and approximate values is still acceptable (StartAbsoluteValue ModifyingAbove psi With caret left-parenthesis alpha right-parenthesis minus psi left-parenthesis alpha right-parenthesis EndAbsoluteValue less-than 0.005 for alpha greater-than 1.07). Solving the equation that arises from this approximation yields the following estimate of alpha:

ModifyingAbove alpha With caret equals StartFraction 3 minus d plus StartRoot left-parenthesis d minus 3 right-parenthesis squared plus 24 d EndRoot Over 12 d EndFraction

If this approximate ML estimate is infeasible, then the method of moments is used. The kth raw moment of the gamma distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals theta Superscript k Baseline StartFraction normal upper Gamma left-parenthesis alpha plus k right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction comma k greater-than negative alpha

Solving upper E left-bracket upper X right-bracket equals m 1 and upper E left-bracket upper X squared right-bracket equals m 2 yields the following initial value for alpha:

ModifyingAbove alpha With caret equals StartFraction m 1 squared Over m 2 minus m 1 squared EndFraction

If m 2 minus m 1 squared less-than epsilon (almost zero sample variance), then alpha is initialized as follows:

ModifyingAbove alpha With caret equals 1

After computing the estimate of alpha, the estimate of theta is computed as follows:

ModifyingAbove theta With caret equals StartFraction m 1 Over ModifyingAbove alpha With caret EndFraction

Both the maximum likelihood method and the method of moments arrive at the same relationship between ModifyingAbove alpha With caret and ModifyingAbove theta With caret.

GPD

The parameters are initialized by using the method of moments. Notice that for xi greater-than 0, the CDF of the generalized Pareto distribution (GPD) is:

StartLayout 1st Row 1st Column upper F left-parenthesis x right-parenthesis 2nd Column equals 1 minus left-parenthesis 1 plus StartFraction xi x Over theta EndFraction right-parenthesis Superscript negative 1 slash xi Baseline 2nd Row 1st Column Blank 2nd Column equals 1 minus left-parenthesis StartFraction theta slash xi Over x plus theta slash xi EndFraction right-parenthesis Superscript 1 slash xi EndLayout

This is equivalent to a Pareto distribution with scale parameter theta 1 equals theta slash xi and shape parameter alpha equals 1 slash xi. Using this relationship, the parameter initialization method used for the PARETO distribution is used to get the following initial values for the parameters of the GPD distribution:

ModifyingAbove theta With caret equals StartFraction m 1 m 2 Over 2 left-parenthesis m 2 minus m 1 squared right-parenthesis EndFraction comma ModifyingAbove xi With caret equals StartFraction m 2 minus 2 m 1 squared Over 2 left-parenthesis m 2 minus m 1 squared right-parenthesis EndFraction

If m 2 minus m 1 squared less-than epsilon (almost zero sample variance) or m 2 minus 2 m 1 squared less-than epsilon, then the parameters are initialized as follows:

ModifyingAbove theta With caret equals StartFraction m 1 Over 2 EndFraction comma ModifyingAbove xi With caret equals one-half
IGAUSS

The parameters are initialized by using the method of moments. The standard parameterization of the inverse Gaussian distribution (also known as the Wald distribution), in terms of the location parameter mu and shape parameter lamda, is as follows (Klugman, Panjer, and Willmot 1998, p. 583):

StartLayout 1st Row 1st Column f left-parenthesis x right-parenthesis 2nd Column equals StartRoot StartFraction lamda Over 2 pi x cubed EndFraction EndRoot exp left-parenthesis StartFraction minus lamda left-parenthesis x minus mu right-parenthesis squared Over 2 mu squared x EndFraction right-parenthesis 2nd Row 1st Column upper F left-parenthesis x right-parenthesis 2nd Column equals normal upper Phi left-parenthesis left-parenthesis StartFraction x Over mu EndFraction minus 1 right-parenthesis StartRoot StartFraction lamda Over x EndFraction EndRoot right-parenthesis plus normal upper Phi left-parenthesis minus left-parenthesis StartFraction x Over mu EndFraction plus 1 right-parenthesis StartRoot StartFraction lamda Over x EndFraction EndRoot right-parenthesis exp left-parenthesis StartFraction 2 lamda Over mu EndFraction right-parenthesis EndLayout

For this parameterization, it is known that the mean is upper E left-bracket upper X right-bracket equals mu and the variance is normal upper V normal a normal r left-bracket upper X right-bracket equals mu cubed slash lamda, which yields the second raw moment as upper E left-bracket upper X squared right-bracket equals mu squared left-parenthesis 1 plus mu slash lamda right-parenthesis (computed by using upper E left-bracket upper X squared right-bracket equals normal upper V normal a normal r left-bracket upper X right-bracket plus left-parenthesis upper E left-bracket upper X right-bracket right-parenthesis squared).

The predefined IGAUSS distribution in PROC SEVERITY uses the following alternate parameterization to allow the distribution to have a scale parameter, theta:

StartLayout 1st Row 1st Column f left-parenthesis x right-parenthesis 2nd Column equals StartRoot StartFraction alpha theta Over 2 pi x cubed EndFraction EndRoot exp left-parenthesis StartFraction minus alpha left-parenthesis x minus theta right-parenthesis squared Over 2 x theta EndFraction right-parenthesis 2nd Row 1st Column upper F left-parenthesis x right-parenthesis 2nd Column equals normal upper Phi left-parenthesis left-parenthesis StartFraction x Over theta EndFraction minus 1 right-parenthesis StartRoot StartFraction alpha theta Over x EndFraction EndRoot right-parenthesis plus normal upper Phi left-parenthesis minus left-parenthesis StartFraction x Over theta EndFraction plus 1 right-parenthesis StartRoot StartFraction alpha theta Over x EndFraction EndRoot right-parenthesis exp left-parenthesis 2 alpha right-parenthesis EndLayout

The parameters theta (scale) and alpha (shape) of this alternate form are related to the parameters mu and lamda of the preceding form such that theta equals mu and alpha equals lamda slash mu. Using this relationship, the first and second raw moments of the IGAUSS distribution are

StartLayout 1st Row 1st Column upper E left-bracket upper X right-bracket 2nd Column equals theta 2nd Row 1st Column upper E left-bracket upper X squared right-bracket 2nd Column equals theta squared left-parenthesis 1 plus StartFraction 1 Over alpha EndFraction right-parenthesis EndLayout

Solving upper E left-bracket upper X right-bracket equals m 1 and upper E left-bracket upper X squared right-bracket equals m 2 yields the following initial values:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals StartFraction m 1 squared Over m 2 minus m 1 squared EndFraction

If m 2 minus m 1 squared less-than epsilon (almost zero sample variance), then the parameters are initialized as follows:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals 1
LOGN

The parameters are initialized by using the method of moments. The kth raw moment of the lognormal distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals exp left-parenthesis k mu plus StartFraction k squared sigma squared Over 2 EndFraction right-parenthesis

Solving upper E left-bracket upper X right-bracket equals m 1 and upper E left-bracket upper X squared right-bracket equals m 2 yields the following initial values:

ModifyingAbove mu With caret equals 2 log left-parenthesis m Baseline 1 right-parenthesis minus StartFraction log left-parenthesis m Baseline 2 right-parenthesis Over 2 EndFraction comma ModifyingAbove sigma With caret equals StartRoot log left-parenthesis m Baseline 2 right-parenthesis minus 2 log left-parenthesis m Baseline 1 right-parenthesis EndRoot
PARETO

The predefined PARETO distribution in PROC SEVERITY implements the Type II Pareto distribution with the location parameter set to 0. This predefined PARETO distribution is also known as the Lomax distribution. The parameters are initialized by using the method of moments. The kth raw moment of the Pareto distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals StartFraction theta Superscript k Baseline normal upper Gamma left-parenthesis k plus 1 right-parenthesis normal upper Gamma left-parenthesis alpha minus k right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction comma negative 1 less-than k less-than alpha

Solving upper E left-bracket upper X right-bracket equals m 1 and upper E left-bracket upper X squared right-bracket equals m 2 yields the following initial values:

ModifyingAbove theta With caret equals StartFraction m 1 m 2 Over m 2 minus 2 m 1 squared EndFraction comma ModifyingAbove alpha With caret equals StartFraction 2 left-parenthesis m 2 minus m 1 squared right-parenthesis Over m 2 minus 2 m 1 squared EndFraction

If m 2 minus m 1 squared less-than epsilon (almost zero sample variance) or m 2 minus 2 m 1 squared less-than epsilon, then the parameters are initialized as follows:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals 2
TWEEDIE

The parameter p is initialized by assuming that the sample is generated from a gamma distribution with shape parameter alpha and by computing ModifyingAbove p With caret equals StartFraction ModifyingAbove alpha With caret plus 2 Over ModifyingAbove alpha With caret plus 1 EndFraction. The initial value ModifyingAbove alpha With caret is obtained from using the method previously described for the GAMMA distribution. The parameter mu is the mean of the distribution. Hence, it is initialized to the sample mean as

ModifyingAbove mu With caret equals m 1

Variance of a Tweedie distribution is equal to phi mu Superscript p. Thus, the sample variance is used to initialize the value of phi as

ModifyingAbove phi With caret equals StartFraction m 2 minus m 1 squared Over ModifyingAbove mu With caret Superscript ModifyingAbove p With caret Baseline EndFraction
STWEEDIE

STWEEDIE is a compound Poisson-gamma mixture distribution with mean mu equals lamda theta alpha, where alpha is the shape parameter of the gamma random variables in the mixture and the parameter p is determined solely by alpha. First, the parameter p is initialized by assuming that the sample is generated from a gamma distribution with shape parameter alpha and by computing ModifyingAbove p With caret equals StartFraction ModifyingAbove alpha With caret plus 2 Over ModifyingAbove alpha With caret plus 1 EndFraction. The initial value ModifyingAbove alpha With caret is obtained from using the method previously described for the GAMMA distribution. As done for initializing the parameters of the TWEEDIE distribution, the sample mean and variance are used to compute the values ModifyingAbove mu With caret and ModifyingAbove phi With caret as

StartLayout 1st Row 1st Column ModifyingAbove mu With caret 2nd Column equals m 1 2nd Row 1st Column ModifyingAbove phi With caret 2nd Column equals StartFraction m 2 minus m 1 squared Over ModifyingAbove mu With caret Superscript ModifyingAbove p With caret Baseline EndFraction EndLayout

Based on the relationship between the parameters of TWEEDIE and STWEEDIE distributions described in the section Tweedie Distributions, values of theta and lamda are initialized as

StartLayout 1st Row 1st Column ModifyingAbove theta With caret 2nd Column equals ModifyingAbove phi With caret left-parenthesis ModifyingAbove p With caret minus 1 right-parenthesis ModifyingAbove mu With caret Superscript p minus 1 Baseline 2nd Row 1st Column ModifyingAbove lamda With caret 2nd Column equals StartFraction ModifyingAbove mu With caret Over ModifyingAbove theta With caret ModifyingAbove alpha With caret EndFraction EndLayout
WEIBULL

The parameters are initialized by using the percentile matching method. Let q Baseline 1 and q Baseline 3 denote the estimates of the 25th and 75th percentiles, respectively. Using the formula for the CDF of Weibull distribution, they can be written as

StartLayout 1st Row 1st Column 1 minus exp left-parenthesis minus left-parenthesis q Baseline 1 slash theta right-parenthesis Superscript tau Baseline right-parenthesis 2nd Column equals 0.25 2nd Row 1st Column 1 minus exp left-parenthesis minus left-parenthesis q Baseline 3 slash theta right-parenthesis Superscript tau Baseline right-parenthesis 2nd Column equals 0.75 EndLayout

Simplifying and solving these two equations yields the following initial values,

ModifyingAbove theta With caret equals exp left-parenthesis StartFraction r log left-parenthesis q Baseline 1 right-parenthesis minus log left-parenthesis q Baseline 3 right-parenthesis Over r minus 1 EndFraction right-parenthesis comma ModifyingAbove tau With caret equals StartFraction log left-parenthesis log left-parenthesis 4 right-parenthesis right-parenthesis Over log left-parenthesis q Baseline 3 right-parenthesis minus log left-parenthesis ModifyingAbove theta With caret right-parenthesis EndFraction

where r equals log left-parenthesis log left-parenthesis 4 right-parenthesis right-parenthesis slash log left-parenthesis log left-parenthesis 4 slash 3 right-parenthesis right-parenthesis. These initial values agree with those suggested in Klugman, Panjer, and Willmot (1998).

A summary of the initial values of all the parameters for all the predefined distributions is given in Table 4. The table also provides the names of the parameters to use in the INIT= option in the DIST statement if you want to provide a different initial value.

Table 4: Parameter Initialization for Predefined Distributions

Distribution Parameter Name for INIT Option Default Initial Value
BURR theta theta StartRoot StartFraction m 2 m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction EndRoot
alpha alpha 1 plus StartFraction m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction
gamma gamma 2
EXP theta theta m 1
GAMMA theta theta m 1 slash alpha
alpha alpha StartFraction 3 minus d plus StartRoot left-parenthesis d minus 3 right-parenthesis squared plus 24 d EndRoot Over 12 d EndFraction
GPD theta theta m 1 m 2 slash left-parenthesis 2 left-parenthesis m 2 minus m 1 squared right-parenthesis right-parenthesis
xi xi left-parenthesis m 2 minus 2 m 1 squared right-parenthesis slash left-parenthesis 2 left-parenthesis m 2 minus m 1 squared right-parenthesis right-parenthesis
IGAUSS theta theta m 1
alpha alpha m 1 squared slash left-parenthesis m 2 minus m 1 squared right-parenthesis
LOGN mu mu 2 log left-parenthesis m Baseline 1 right-parenthesis minus log left-parenthesis m Baseline 2 right-parenthesis slash 2
sigma sigma StartRoot log left-parenthesis m Baseline 2 right-parenthesis minus 2 log left-parenthesis m Baseline 1 right-parenthesis EndRoot
PARETO theta theta m 1 m 2 slash left-parenthesis m 2 minus 2 m 1 squared right-parenthesis
alpha alpha 2 left-parenthesis m 2 minus m 1 squared right-parenthesis slash left-parenthesis m 2 minus 2 m 1 squared right-parenthesis
TWEEDIE mu mu m 1
phi phi left-parenthesis m 2 minus m 1 squared right-parenthesis slash m 1 Superscript p
p p left-parenthesis alpha plus 2 right-parenthesis slash left-parenthesis alpha plus 1 right-parenthesis
where alpha equals StartFraction 3 minus d plus StartRoot left-parenthesis d minus 3 right-parenthesis squared plus 24 d EndRoot Over 12 d EndFraction
STWEEDIE theta theta left-parenthesis m 2 minus m 1 squared right-parenthesis left-parenthesis p minus 1 right-parenthesis slash m 1
lamda lambda m 1 squared slash left-parenthesis alpha left-parenthesis m 2 minus m 1 squared right-parenthesis left-parenthesis p minus 1 right-parenthesis right-parenthesis
p p left-parenthesis alpha plus 2 right-parenthesis slash left-parenthesis alpha plus 1 right-parenthesis
where alpha equals StartFraction 3 minus d plus StartRoot left-parenthesis d minus 3 right-parenthesis squared plus 24 d EndRoot Over 12 d EndFraction
WEIBULL theta theta exp left-parenthesis StartFraction r log left-parenthesis q Baseline 1 right-parenthesis minus log left-parenthesis q Baseline 3 right-parenthesis Over r minus 1 EndFraction right-parenthesis
tau tau log left-parenthesis log left-parenthesis 4 right-parenthesis right-parenthesis slash left-parenthesis log left-parenthesis q Baseline 3 right-parenthesis minus log left-parenthesis ModifyingAbove theta With caret right-parenthesis right-parenthesis
Notes:
1. m Subscript k denotes the kth raw moment.
2. d equals log left-parenthesis m 1 right-parenthesis minus left-parenthesis sigma-summation log left-parenthesis y Subscript i Baseline right-parenthesis right-parenthesis slash n
3. q Baseline 1 and q Baseline 3 denote the 25th and 75th percentiles, respectively.
4. r equals log left-parenthesis log left-parenthesis 4 right-parenthesis right-parenthesis slash log left-parenthesis log left-parenthesis 4 slash 3 right-parenthesis right-parenthesis


Last updated: June 19, 2025