SEVERITY Procedure

Predefined Distributions

For the response variable Y, PROC SEVERITY assumes the model

upper Y tilde script upper F left-parenthesis normal upper Theta right-parenthesis

where is a continuous probability distribution with parameters . The model hypothesizes that the observed response is generated from a stochastic process that is governed by the distribution . This model is usually referred to as the error model. Given a representative input sample of response variable values, PROC SEVERITY estimates the model parameters for any distribution and computes the statistics of fit for each model. This enables you to find the distribution that is most likely to generate the observed sample.

A set of predefined distributions is provided with the SEVERITY procedure. A summary of the distributions is provided in Table 3. For each distribution, the table lists the name of the distribution that should be used in the DIST statement, the parameters of the distribution along with their bounds, and the mathematical expressions for the probability density function (PDF) and cumulative distribution function (CDF) of the distribution.

All the predefined distributions, except LOGN and TWEEDIE, are parameterized such that their first parameter is the scale parameter. For LOGN, the first parameter is a log-transformed scale parameter. TWEEDIE does not have a scale parameter. The presence of scale parameter or a log-transformed scale parameter enables you to use all of the predefined distributions, except TWEEDIE, as a candidate for estimating regression effects.

A distribution model is associated with each predefined distribution. You can also define your own distribution model, which is a set of functions and subroutines that you define by using the FCMP procedure. For more information, see the section Defining a Severity Distribution Model with the FCMP Procedure.

Table 3: Predefined PROC SEVERITY Distributions

TheadName	Distribution	Parameters
BURR	Burr (Type XII)	, ,

EXP	Exponential

GAMMA	Gamma	,

GPD	Generalized	,
	Pareto
IGAUSS	Inverse Gaussian	,
	(Wald)

LOGN	Lognormal	(no bounds),

PARETO	Pareto (Type II)	,

TWEEDIE	Tweedie**	, ,

STWEEDIE	Scaled Tweedie**	, ,

WEIBULL	Weibull	,

**For more information, see the section Tweedie Distributions.
Notes:
1. , wherever z is used.
2. denotes the scale parameter for all the distributions. For LOGN, .
3. Parameters are listed in the order in which they are defined in the distribution model.
4. is the lower incomplete gamma function.
5. is the standard normal CDF.

Tweedie Distributions

Tweedie distributions are a special case of the exponential dispersion family (Jørgensen 1987) with a property that the variance of the distribution is equal to , where is the mean of the distribution, is a dispersion parameter, and p is an index parameter as discovered by Tweedie (1984). The distribution is defined for all values of p except for values of p in the open interval . Many important known distributions are a special case of Tweedie distributions including normal (p=0), Poisson (p=1), gamma (p=2), and the inverse Gaussian (p=3). Apart from these special cases, the probability density function (PDF) of the Tweedie distribution does not have an analytic expression. For , it has the form (Dunn and Smyth 2005),

f left-parenthesis x semicolon mu comma phi comma p right-parenthesis equals a left-parenthesis x comma phi right-parenthesis exp left-bracket StartFraction 1 Over phi EndFraction left-parenthesis StartFraction x mu Superscript 1 minus p Baseline Over 1 minus p EndFraction minus kappa left-parenthesis mu comma p right-parenthesis right-parenthesis right-bracket

where for and for p = 2. The function does not have an analytical expression. It is typically evaluated using series expansion methods described in Dunn and Smyth (2005).

For , the Tweedie distribution is a compound Poisson-gamma mixture distribution, which is the distribution of S defined as

upper S equals sigma-summation Underscript i equals 1 Overscript upper N Endscripts upper X Subscript i

where and are independent and identically distributed gamma random variables with shape parameter and scale parameter . At X = 0, the density is a probability mass that is governed by the Poisson distribution, and for values of , it is a mixture of gamma variates with Poisson mixing probability. The parameters , , and are related to the natural parameters , , and p of the Tweedie distribution as

StartLayout 1st Row 1st Column lamda 2nd Column equals StartFraction mu Superscript 2 minus p Baseline Over phi left-parenthesis 2 minus p right-parenthesis EndFraction 2nd Row 1st Column alpha 2nd Column equals StartFraction 2 minus p Over p minus 1 EndFraction 3rd Row 1st Column theta 2nd Column equals phi left-parenthesis p minus 1 right-parenthesis mu Superscript p minus 1 EndLayout

The mean of a Tweedie distribution is positive for .

Two predefined versions of the Tweedie distribution are provided with the SEVERITY procedure. The first version, named TWEEDIE and defined for , has the natural parameterization with parameters , , and p. The second version, named STWEEDIE and defined for , is the version with a scale parameter. It corresponds to the compound Poisson-gamma distribution with gamma scale parameter , Poisson mean parameter , and the index parameter p. The index parameter decides the shape parameter of the gamma distribution as

alpha equals StartFraction 2 minus p Over p minus 1 EndFraction

The parameters and of the STWEEDIE distribution are related to the parameters and of the TWEEDIE distribution as

StartLayout 1st Row 1st Column mu 2nd Column equals lamda theta alpha 2nd Row 1st Column phi 2nd Column equals StartFraction left-parenthesis lamda theta alpha right-parenthesis Superscript 2 minus p Baseline Over lamda left-parenthesis 2 minus p right-parenthesis EndFraction equals StartFraction theta Over left-parenthesis p minus 1 right-parenthesis left-parenthesis lamda theta alpha right-parenthesis Superscript p minus 1 Baseline EndFraction EndLayout

You can fit either version when there are no regression variables. Each version has its own merits. If you fit the TWEEDIE version, you have the direct estimate of the overall mean of the distribution. If you are interested in the most practical range of the index parameter , then you can fit the STWEEDIE version, which provides you direct estimates of the Poisson and gamma components that comprise the distribution (an estimate of the gamma shape parameter is easily obtained from the estimate of p).

If you want to estimate the effect of exogenous (regression) variables on the distribution, then you must use the STWEEDIE version, because PROC SEVERITY requires a distribution to have a scale parameter in order to estimate regression effects. For more information, see the section Estimating Regression Effects. The gamma scale parameter is the scale parameter of the STWEEDIE distribution. If you are interested in determining the effect of regression variables on the mean of the distribution, you can do so by first fitting the STWEEDIE distribution to determine the effect of the regression variables on the scale parameter . Then, you can easily estimate how the mean of the distribution is affected by the regression variables using the relationship , where . The estimates of the regression parameters remain the same, whereas the estimate of the intercept parameter is adjusted by the estimates of the and p parameters.

Parameter Initialization for Predefined Distributions

The parameters are initialized by using the method of moments for all the distributions, except for the gamma and the Weibull distributions. For the gamma distribution, approximate maximum likelihood estimates are used. For the Weibull distribution, the method of percentile matching is used.

Given n observations of the severity value (), the estimate of kth raw moment is denoted by and computed as

m Subscript k Baseline equals StartFraction 1 Over n EndFraction sigma-summation Underscript i equals 1 Overscript n Endscripts y Subscript i Superscript k

The 100pth percentile is denoted by (). By definition, satisfies

upper F left-parenthesis pi Subscript p Baseline minus right-parenthesis less-than-or-equal-to p less-than-or-equal-to upper F left-parenthesis pi Subscript p Baseline right-parenthesis

where . PROC SEVERITY uses the following practical method of computing . Let denote the empirical distribution function (EDF) estimate at a severity value y. Let and denote two consecutive values in the ascending sequence of y values such that and . Then, the estimate is computed as

ModifyingAbove pi With caret Subscript p Baseline equals y Subscript p Superscript minus Baseline plus StartFraction p minus ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript minus Baseline right-parenthesis Over ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript plus Baseline right-parenthesis minus ModifyingAbove upper F With caret Subscript n Baseline left-parenthesis y Subscript p Superscript minus Baseline right-parenthesis EndFraction left-parenthesis y Subscript p Superscript plus Baseline minus y Subscript p Superscript minus Baseline right-parenthesis

Let denote the smallest double-precision floating-point number such that . This machine precision constant can be obtained by using the CONSTANT function in Base SAS software.

The details of how parameters are initialized for each predefined distribution are as follows:

BURR

Burr proposed 12 types of families of continuous distributions (Burr 1942; Rodriguez 2006). The predefined BURR distribution in PROC SEVERITY implements Burr’s Type XII distribution. The parameters are initialized by using the method of moments. The kth raw moment of the Burr distribution of Type XII is

upper E left-bracket upper X Superscript k Baseline right-bracket equals StartFraction theta Superscript k Baseline normal upper Gamma left-parenthesis 1 plus k slash gamma right-parenthesis normal upper Gamma left-parenthesis alpha minus k slash gamma right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction comma negative gamma less-than k less-than alpha gamma

Three moment equations () need to be solved for initializing the three parameters of the distribution. In order to get an approximate closed form solution, the second shape parameter is initialized to a value of 2. If , then simplifying and solving the moment equations yields the following feasible set of initial values:

ModifyingAbove theta With caret equals StartRoot StartFraction m 2 m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction EndRoot comma ModifyingAbove alpha With caret equals 1 plus StartFraction m 3 Over 2 m 3 minus 3 m 1 m 2 EndFraction comma ModifyingAbove gamma With caret equals 2

If , then the parameters are initialized as follows:

ModifyingAbove theta With caret equals StartRoot m 2 EndRoot comma ModifyingAbove alpha With caret equals 2 comma ModifyingAbove gamma With caret equals 2

EXP

The parameters are initialized by using the method of moments. The kth raw moment of the exponential distribution is

Solving yields the initial value of .

GAMMA

The parameter is initialized by using its approximate maximum likelihood (ML) estimate. For a set of n independent and identically distributed observations () drawn from a gamma distribution, the log likelihood l is defined as follows:

Using a shorter notation of to denote and solving the equation yields the following ML estimate of :

ModifyingAbove theta With caret equals StartFraction sigma-summation y Subscript i Baseline Over n alpha EndFraction equals StartFraction m 1 Over alpha EndFraction

Substituting this estimate in the expression of l and simplifying gives

l equals left-parenthesis alpha minus 1 right-parenthesis sigma-summation log left-parenthesis y Subscript i Baseline right-parenthesis minus n alpha minus n alpha log left-parenthesis m 1 right-parenthesis plus n alpha log left-parenthesis alpha right-parenthesis minus n log left-parenthesis normal upper Gamma left-parenthesis alpha right-parenthesis right-parenthesis

Let d be defined as follows:

d equals log left-parenthesis m 1 right-parenthesis minus StartFraction 1 Over n EndFraction sigma-summation log left-parenthesis y Subscript i Baseline right-parenthesis

Solving the equation yields the following expression in terms of the digamma function, :

log left-parenthesis alpha right-parenthesis minus psi left-parenthesis alpha right-parenthesis equals d

The digamma function can be approximated as follows:

ModifyingAbove psi With caret left-parenthesis alpha right-parenthesis almost-equals log left-parenthesis alpha right-parenthesis minus StartFraction 1 Over alpha EndFraction left-parenthesis 0.5 plus StartFraction 1 Over 12 alpha plus 2 EndFraction right-parenthesis

This approximation is within 1.4% of the true value for all the values of except when is arbitrarily close to the positive root of the digamma function (which is approximately 1.461632). Even for the values of that are close to the positive root, the absolute error between true and approximate values is still acceptable ( for ). Solving the equation that arises from this approximation yields the following estimate of :

ModifyingAbove alpha With caret equals StartFraction 3 minus d plus StartRoot left-parenthesis d minus 3 right-parenthesis squared plus 24 d EndRoot Over 12 d EndFraction

If this approximate ML estimate is infeasible, then the method of moments is used. The kth raw moment of the gamma distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals theta Superscript k Baseline StartFraction normal upper Gamma left-parenthesis alpha plus k right-parenthesis Over normal upper Gamma left-parenthesis alpha right-parenthesis EndFraction comma k greater-than negative alpha

Solving and yields the following initial value for :

ModifyingAbove alpha With caret equals StartFraction m 1 squared Over m 2 minus m 1 squared EndFraction

If (almost zero sample variance), then is initialized as follows:

ModifyingAbove alpha With caret equals 1

After computing the estimate of , the estimate of is computed as follows:

ModifyingAbove theta With caret equals StartFraction m 1 Over ModifyingAbove alpha With caret EndFraction

Both the maximum likelihood method and the method of moments arrive at the same relationship between and .

GPD

The parameters are initialized by using the method of moments. Notice that for , the CDF of the generalized Pareto distribution (GPD) is:

This is equivalent to a Pareto distribution with scale parameter and shape parameter . Using this relationship, the parameter initialization method used for the PARETO distribution is used to get the following initial values for the parameters of the GPD distribution:

ModifyingAbove theta With caret equals StartFraction m 1 m 2 Over 2 left-parenthesis m 2 minus m 1 squared right-parenthesis EndFraction comma ModifyingAbove xi With caret equals StartFraction m 2 minus 2 m 1 squared Over 2 left-parenthesis m 2 minus m 1 squared right-parenthesis EndFraction

If (almost zero sample variance) or , then the parameters are initialized as follows:

ModifyingAbove theta With caret equals StartFraction m 1 Over 2 EndFraction comma ModifyingAbove xi With caret equals one-half

IGAUSS

The parameters are initialized by using the method of moments. The standard parameterization of the inverse Gaussian distribution (also known as the Wald distribution), in terms of the location parameter and shape parameter , is as follows (Klugman, Panjer, and Willmot 1998, p. 583):

StartLayout 1st Row 1st Column f left-parenthesis x right-parenthesis 2nd Column equals StartRoot StartFraction lamda Over 2 pi x cubed EndFraction EndRoot exp left-parenthesis StartFraction minus lamda left-parenthesis x minus mu right-parenthesis squared Over 2 mu squared x EndFraction right-parenthesis 2nd Row 1st Column upper F left-parenthesis x right-parenthesis 2nd Column equals normal upper Phi left-parenthesis left-parenthesis StartFraction x Over mu EndFraction minus 1 right-parenthesis StartRoot StartFraction lamda Over x EndFraction EndRoot right-parenthesis plus normal upper Phi left-parenthesis minus left-parenthesis StartFraction x Over mu EndFraction plus 1 right-parenthesis StartRoot StartFraction lamda Over x EndFraction EndRoot right-parenthesis exp left-parenthesis StartFraction 2 lamda Over mu EndFraction right-parenthesis EndLayout

For this parameterization, it is known that the mean is and the variance is , which yields the second raw moment as (computed by using ).

The predefined IGAUSS distribution in PROC SEVERITY uses the following alternate parameterization to allow the distribution to have a scale parameter, :

StartLayout 1st Row 1st Column f left-parenthesis x right-parenthesis 2nd Column equals StartRoot StartFraction alpha theta Over 2 pi x cubed EndFraction EndRoot exp left-parenthesis StartFraction minus alpha left-parenthesis x minus theta right-parenthesis squared Over 2 x theta EndFraction right-parenthesis 2nd Row 1st Column upper F left-parenthesis x right-parenthesis 2nd Column equals normal upper Phi left-parenthesis left-parenthesis StartFraction x Over theta EndFraction minus 1 right-parenthesis StartRoot StartFraction alpha theta Over x EndFraction EndRoot right-parenthesis plus normal upper Phi left-parenthesis minus left-parenthesis StartFraction x Over theta EndFraction plus 1 right-parenthesis StartRoot StartFraction alpha theta Over x EndFraction EndRoot right-parenthesis exp left-parenthesis 2 alpha right-parenthesis EndLayout

The parameters (scale) and (shape) of this alternate form are related to the parameters and of the preceding form such that and . Using this relationship, the first and second raw moments of the IGAUSS distribution are

StartLayout 1st Row 1st Column upper E left-bracket upper X right-bracket 2nd Column equals theta 2nd Row 1st Column upper E left-bracket upper X squared right-bracket 2nd Column equals theta squared left-parenthesis 1 plus StartFraction 1 Over alpha EndFraction right-parenthesis EndLayout

Solving and yields the following initial values:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals StartFraction m 1 squared Over m 2 minus m 1 squared EndFraction

If (almost zero sample variance), then the parameters are initialized as follows:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals 1

LOGN

The parameters are initialized by using the method of moments. The kth raw moment of the lognormal distribution is

upper E left-bracket upper X Superscript k Baseline right-bracket equals exp left-parenthesis k mu plus StartFraction k squared sigma squared Over 2 EndFraction right-parenthesis

Solving and yields the following initial values:

ModifyingAbove mu With caret equals 2 log left-parenthesis m Baseline 1 right-parenthesis minus StartFraction log left-parenthesis m Baseline 2 right-parenthesis Over 2 EndFraction comma ModifyingAbove sigma With caret equals StartRoot log left-parenthesis m Baseline 2 right-parenthesis minus 2 log left-parenthesis m Baseline 1 right-parenthesis EndRoot

PARETO

The predefined PARETO distribution in PROC SEVERITY implements the Type II Pareto distribution with the location parameter set to 0. This predefined PARETO distribution is also known as the Lomax distribution. The parameters are initialized by using the method of moments. The kth raw moment of the Pareto distribution is

Solving and yields the following initial values:

ModifyingAbove theta With caret equals StartFraction m 1 m 2 Over m 2 minus 2 m 1 squared EndFraction comma ModifyingAbove alpha With caret equals StartFraction 2 left-parenthesis m 2 minus m 1 squared right-parenthesis Over m 2 minus 2 m 1 squared EndFraction

If (almost zero sample variance) or , then the parameters are initialized as follows:

ModifyingAbove theta With caret equals m 1 comma ModifyingAbove alpha With caret equals 2

TWEEDIE

The parameter p is initialized by assuming that the sample is generated from a gamma distribution with shape parameter and by computing . The initial value is obtained from using the method previously described for the GAMMA distribution. The parameter is the mean of the distribution. Hence, it is initialized to the sample mean as

Variance of a Tweedie distribution is equal to . Thus, the sample variance is used to initialize the value of as

ModifyingAbove phi With caret equals StartFraction m 2 minus m 1 squared Over ModifyingAbove mu With caret Superscript ModifyingAbove p With caret Baseline EndFraction

STWEEDIE

STWEEDIE is a compound Poisson-gamma mixture distribution with mean , where is the shape parameter of the gamma random variables in the mixture and the parameter p is determined solely by . First, the parameter p is initialized by assuming that the sample is generated from a gamma distribution with shape parameter and by computing . The initial value is obtained from using the method previously described for the GAMMA distribution. As done for initializing the parameters of the TWEEDIE distribution, the sample mean and variance are used to compute the values and as

StartLayout 1st Row 1st Column ModifyingAbove mu With caret 2nd Column equals m 1 2nd Row 1st Column ModifyingAbove phi With caret 2nd Column equals StartFraction m 2 minus m 1 squared Over ModifyingAbove mu With caret Superscript ModifyingAbove p With caret Baseline EndFraction EndLayout

Based on the relationship between the parameters of TWEEDIE and STWEEDIE distributions described in the section Tweedie Distributions, values of and are initialized as

StartLayout 1st Row 1st Column ModifyingAbove theta With caret 2nd Column equals ModifyingAbove phi With caret left-parenthesis ModifyingAbove p With caret minus 1 right-parenthesis ModifyingAbove mu With caret Superscript p minus 1 Baseline 2nd Row 1st Column ModifyingAbove lamda With caret 2nd Column equals StartFraction ModifyingAbove mu With caret Over ModifyingAbove theta With caret ModifyingAbove alpha With caret EndFraction EndLayout

WEIBULL

The parameters are initialized by using the percentile matching method. Let and denote the estimates of the 25th and 75th percentiles, respectively. Using the formula for the CDF of Weibull distribution, they can be written as

StartLayout 1st Row 1st Column 1 minus exp left-parenthesis minus left-parenthesis q Baseline 1 slash theta right-parenthesis Superscript tau Baseline right-parenthesis 2nd Column equals 0.25 2nd Row 1st Column 1 minus exp left-parenthesis minus left-parenthesis q Baseline 3 slash theta right-parenthesis Superscript tau Baseline right-parenthesis 2nd Column equals 0.75 EndLayout

Simplifying and solving these two equations yields the following initial values,

ModifyingAbove theta With caret equals exp left-parenthesis StartFraction r log left-parenthesis q Baseline 1 right-parenthesis minus log left-parenthesis q Baseline 3 right-parenthesis Over r minus 1 EndFraction right-parenthesis comma ModifyingAbove tau With caret equals StartFraction log left-parenthesis log left-parenthesis 4 right-parenthesis right-parenthesis Over log left-parenthesis q Baseline 3 right-parenthesis minus log left-parenthesis ModifyingAbove theta With caret right-parenthesis EndFraction

where . These initial values agree with those suggested in Klugman, Panjer, and Willmot (1998).

A summary of the initial values of all the parameters for all the predefined distributions is given in Table 4. The table also provides the names of the parameters to use in the INIT= option in the DIST statement if you want to provide a different initial value.

Table 4: Parameter Initialization for Predefined Distributions

Distribution	Parameter	Name for INIT Option	Default Initial Value
BURR		theta
		alpha
		gamma	2
EXP		theta
GAMMA		theta
		alpha
GPD		theta
		xi
IGAUSS		theta
		alpha
LOGN		mu
		sigma
PARETO		theta
		alpha
TWEEDIE		mu
		phi
	p	p
			where
STWEEDIE		theta
		lambda
	p	p
			where
WEIBULL		theta
		tau
Notes:
1. denotes the kth raw moment.
2.
3. and denote the 25th and 75th percentiles, respectively.
4.

Last updated: June 19, 2025