SEVERITY Procedure

Censoring and Truncation

One of the key features of PROC SEVERITY is that it enables you to specify whether the severity event’s magnitude is observable and if it is observable, then whether the exact value of the magnitude is known. If an event is unobservable when the magnitude is in certain intervals, then it is referred to as a truncation effect. If the exact magnitude of the event is not known, but it is known to have a value in a certain interval, then it is referred to as a censoring effect.

PROC SEVERITY allows a severity event to be subject to any combination of the following four censoring and truncation effects:

  • Left-truncation: An event is said to be left-truncated if it is observed only when upper Y greater-than upper T Superscript l, where Y denotes the random variable for the magnitude and upper T Superscript l denotes a random variable for the truncation threshold. You can specify left-truncation using the LEFTTRUNCATED= option in the LOSS statement.

  • Right-truncation: An event is said to be right-truncated if it is observed only when upper Y less-than-or-equal-to upper T Superscript r, where Y denotes the random variable for the magnitude and upper T Superscript r denotes a random variable for the truncation threshold. You can specify right-truncation using the RIGHTTRUNCATED= option in the LOSS statement.

  • Left-censoring: An event is said to be left-censored if it is known that the magnitude is upper Y less-than-or-equal-to upper C Superscript l, but the exact value of Y is not known. upper C Superscript l is a random variable for the censoring limit. You can specify left-censoring using the LEFTCENSORED= option in the LOSS statement.

  • Right-censoring: An event is said to be right-censored if it is known that the magnitude is upper Y greater-than upper C Superscript r, but the exact value of Y is not known. upper C Superscript r is a random variable for the censoring limit. You can specify right-censoring using the RIGHTCENSORED= option in the LOSS statement.

For each effect, you can specify a different threshold or limit for each observation or specify a single threshold or limit that applies to all the observations.

If all four types of effects are present on an event, then the following relationship holds: upper T Superscript l Baseline less-than upper C Superscript r Baseline less-than-or-equal-to upper C Superscript l Baseline less-than-or-equal-to upper T Superscript r. PROC SEVERITY checks these relationships and writes a warning to the SAS log if any relationship is violated.

If you specify the response variable in the LOSS statement, then PROC SEVERITY also checks whether each observation satisfies the definitions of the specified censoring and truncation effects. If you specify left-truncation, then PROC SEVERITY ignores observations where upper Y less-than-or-equal-to upper T Superscript l, because such observations are not observable by definition. Similarly, if you specify right-truncation, then PROC SEVERITY ignores observations where upper Y greater-than upper T Superscript r. If you specify left-censoring, then PROC SEVERITY treats an observation with upper Y greater-than upper C Superscript l as uncensored and ignores the value of upper C Superscript l. The observations with upper Y less-than-or-equal-to upper C Superscript l are considered as left-censored, and the value of Y is ignored. If you specify right-censoring, then PROC SEVERITY treats an observation with upper Y less-than-or-equal-to upper C Superscript r as uncensored and ignores the value of upper C Superscript r. The observations with upper Y greater-than upper C Superscript r are considered as right-censored, and the value of Y is ignored. If you specify both left-censoring and right-censoring, it is referred to as interval-censoring. If upper C Superscript r Baseline less-than upper C Superscript l is satisfied for an observation, then it is considered as interval-censored and the value of the response variable is ignored. If upper C Superscript r Baseline equals upper C Superscript l for an observation, then PROC SEVERITY assumes that observation to be uncensored. If all the observations in a data set are censored in some form, then the specification of the response variable in the LOSS statement is optional, because the actual value of the response variable is not required for the purposes of estimating a model.

Specification of censoring and truncation affects the likelihood of the data (see the section Likelihood Function) and how the empirical distribution function (EDF) is estimated (see the section Empirical Distribution Function Estimation Methods).

Probability of Observability

For left-truncated data, PROC SEVERITY also enables you to provide additional information in the form of probability of observability by using the PROBOBSERVED= option. It is defined as the probability that the underlying severity event gets observed (and recorded) for the specified left-truncation threshold value. For example, if you specify a value of 0.75, then for every 75 observations recorded above a specified threshold, 25 more events have happened with a severity value less than or equal to the specified threshold. Although the exact severity value of those 25 events is not known, PROC SEVERITY can use the information about the number of those events.

In particular, for each left-truncated observation, PROC SEVERITY assumes a presence of left-parenthesis 1 minus p right-parenthesis slash p additional observations with y Subscript i Baseline equals t Subscript i. These additional observations are then used for computing the likelihood (see the section Probability of Observability and Likelihood) and an unconditional estimate of the empirical distribution function (see the section EDF Estimates and Truncation).

Truncation and Conditional CDF Estimates

If you specify left-truncation without the probability of observability or if you specify right-truncation, then the EDF estimates that are computed by all methods except the STANDARD method are conditional on the truncation information. For more information, see the section EDF Estimates and Truncation. In such cases, PROC SEVERITY uses conditional estimates of the CDF whenever they are used for computational or visual comparison with the EDF estimates.

Let t Subscript min Superscript l Baseline equals min Subscript i Baseline left-brace t Subscript i Superscript l Baseline right-brace be the smallest value of the left-truncation threshold (t Subscript i Superscript l is the left-truncation threshold for observation i) and t Subscript max Superscript r Baseline equals max Subscript i Baseline left-brace t Subscript i Superscript r Baseline right-brace be the largest value of the right-truncation threshold (t Subscript i Superscript r is the right-truncation threshold for observation i). If ModifyingAbove upper F With caret left-parenthesis y right-parenthesis denotes the unconditional estimate of the CDF at y, then the conditional estimate ModifyingAbove upper F With caret Superscript c Baseline left-parenthesis y right-parenthesis is computed as follows:

  • If you do not specify the probability of observability, then the EDF estimates are conditional on the left-truncation information. If an observation is both left-truncated and right-truncated, then

    ModifyingAbove upper F With caret Superscript c Baseline left-parenthesis y right-parenthesis equals StartFraction ModifyingAbove upper F With caret left-parenthesis y right-parenthesis minus ModifyingAbove upper F With caret left-parenthesis t Subscript min Superscript l Baseline right-parenthesis Over ModifyingAbove upper F With caret left-parenthesis t Subscript max Superscript r Baseline right-parenthesis minus ModifyingAbove upper F With caret left-parenthesis t Subscript min Superscript l Baseline right-parenthesis EndFraction

    If an observation is left-truncated but not right-truncated, then

    ModifyingAbove upper F With caret Superscript c Baseline left-parenthesis y right-parenthesis equals StartFraction ModifyingAbove upper F With caret left-parenthesis y right-parenthesis minus ModifyingAbove upper F With caret left-parenthesis t Subscript min Superscript l Baseline right-parenthesis Over 1 minus ModifyingAbove upper F With caret left-parenthesis t Subscript min Superscript l Baseline right-parenthesis EndFraction

    If an observation is right-truncated but not left-truncated, then

    ModifyingAbove upper F With caret Superscript c Baseline left-parenthesis y right-parenthesis equals StartFraction ModifyingAbove upper F With caret left-parenthesis y right-parenthesis Over ModifyingAbove upper F With caret left-parenthesis t Subscript max Superscript r Baseline right-parenthesis EndFraction
  • If you specify the probability of observability, then EDF estimates are not conditional on the left-truncation information. If an observation is not right-truncated, then the conditional estimate is the same as the unconditional estimate. If an observation is right-truncated, then the conditional estimate is computed as

    ModifyingAbove upper F With caret Superscript c Baseline left-parenthesis y right-parenthesis equals StartFraction ModifyingAbove upper F With caret left-parenthesis y right-parenthesis Over ModifyingAbove upper F With caret left-parenthesis t Subscript max Superscript r Baseline right-parenthesis EndFraction

If you specify regression effects, then ModifyingAbove upper F With caret left-parenthesis y right-parenthesis, ModifyingAbove upper F With caret left-parenthesis t Subscript min Superscript l Baseline right-parenthesis, and ModifyingAbove upper F With caret left-parenthesis t Subscript max Superscript r Baseline right-parenthesis are all computed from a mixture distribution, as described in the section CDF and PDF Estimates with Regression Effects.

Last updated: June 19, 2025