AUTOREG Procedure

GARCH Models

Consider the series y Subscript t, which follows the GARCH process. The conditional distribution of the series Y for time t is written

y Subscript t Baseline vertical-bar normal upper Psi Subscript t minus 1 Baseline tilde normal upper N left-parenthesis 0 comma h Subscript t Baseline right-parenthesis

where normal upper Psi Subscript t minus 1 denotes all available information at time t minus 1. The conditional variance h Subscript t is

h Subscript t Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline y Subscript t minus i Superscript 2 Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j

where

p greater-than-or-equal-to 0 comma q greater-than 0
omega greater-than 0 comma alpha Subscript i Baseline greater-than-or-equal-to 0 comma gamma Subscript j Baseline greater-than-or-equal-to 0

The GARCHleft-parenthesis p comma q right-parenthesis model reduces to the ARCHleft-parenthesis q right-parenthesis process when p equals 0. At least one of the ARCH parameters must be nonzero (q greater-than 0). The GARCH regression model can be written

y Subscript t Baseline equals bold x prime Subscript t Baseline beta plus epsilon Subscript t
epsilon Subscript t Baseline equals StartRoot h Subscript t Baseline EndRoot e Subscript t
h Subscript t Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline epsilon Subscript t minus i Superscript 2 Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j

where epsilon Subscript t Baseline tilde normal upper I normal upper N left-parenthesis 0 comma 1 right-parenthesis.

In addition, you can consider the model with disturbances following an autoregressive process and with the GARCH errors. The ARleft-parenthesis m right-parenthesis-GARCHleft-parenthesis p comma q right-parenthesis regression model is denoted

y Subscript t Baseline equals bold x prime Subscript t Baseline beta plus nu Subscript t
nu Subscript t Baseline equals epsilon Subscript t Baseline minus phi 1 nu Subscript t minus 1 Baseline minus midline-horizontal-ellipsis minus phi Subscript m Baseline nu Subscript t minus m
epsilon Subscript t Baseline equals StartRoot h Subscript t Baseline EndRoot e Subscript t
h Subscript t Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline epsilon Subscript t minus i Superscript 2 Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j

GARCH Estimation with Nelson-Cao Inequality Constraints

The GARCHleft-parenthesis p comma q right-parenthesis model is written in ARCH(normal infinity) form as

StartLayout 1st Row 1st Column h Subscript t 2nd Column equals 3rd Column left-parenthesis 1 minus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline upper B Superscript j Baseline right-parenthesis Superscript negative 1 Baseline left-bracket omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline epsilon Subscript t minus i Superscript 2 Baseline right-bracket 2nd Row 1st Column Blank 2nd Column equals 3rd Column omega Superscript asterisk Baseline plus sigma-summation Underscript i equals 1 Overscript normal infinity Endscripts phi Subscript i Baseline epsilon Subscript t minus i Superscript 2 EndLayout

where B is a backshift operator. Therefore, h Subscript t Baseline greater-than-or-equal-to 0 if omega Superscript asterisk Baseline greater-than-or-equal-to 0 and phi Subscript i Baseline greater-than-or-equal-to 0 comma for-all i. Assume that the roots of the following polynomial equation are inside the unit circle,

sigma-summation Underscript j equals 0 Overscript p Endscripts minus gamma Subscript j Baseline upper Z Superscript p minus j

where gamma 0 equals negative 1 and Z is a complex scalar. minus sigma-summation Underscript j equals 0 Overscript p Endscripts gamma Subscript j Baseline upper Z Superscript p minus j and sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline upper Z Superscript q minus i do not share common factors. Under these conditions, StartAbsoluteValue omega Superscript asterisk Baseline EndAbsoluteValue less-than normal infinity, StartAbsoluteValue phi Subscript i Baseline EndAbsoluteValue less-than normal infinity, and these coefficients of the ARCH(normal infinity) process are well defined.

Define n equals max left-parenthesis p comma q right-parenthesis. The coefficient phi Subscript i is written

StartLayout 1st Row 1st Column phi 0 2nd Column equals 3rd Column alpha 1 2nd Row 1st Column phi 1 2nd Column equals 3rd Column gamma 1 phi 0 plus alpha 2 3rd Row 1st Column midline-horizontal-ellipsis 4th Row 1st Column phi Subscript n minus 1 2nd Column equals 3rd Column gamma 1 phi Subscript n minus 2 plus gamma 2 phi Subscript n minus 3 plus midline-horizontal-ellipsis plus gamma Subscript n minus 1 Baseline phi 0 plus alpha Subscript n 5th Row 1st Column phi Subscript k 2nd Column equals 3rd Column gamma 1 phi Subscript k minus 1 Baseline plus gamma 2 phi Subscript k minus 2 Baseline plus midline-horizontal-ellipsis plus gamma Subscript n Baseline phi Subscript k minus n Baseline for k greater-than-or-equal-to n EndLayout

where alpha Subscript i Baseline equals 0 for i greater-than q and gamma Subscript j Baseline equals 0 for j greater-than p.

Nelson and Cao (1992) proposed the finite inequality constraints for GARCHleft-parenthesis 1 comma q right-parenthesis and GARCHleft-parenthesis 2 comma q right-parenthesis cases. However, it is not straightforward to derive the finite inequality constraints for the general GARCHleft-parenthesis p comma q right-parenthesis model.

For the GARCHleft-parenthesis 1 comma q right-parenthesis model, the nonlinear inequality constraints are

StartLayout 1st Row 1st Column omega 2nd Column greater-than-or-equal-to 3rd Column 0 2nd Row 1st Column gamma 1 2nd Column greater-than-or-equal-to 3rd Column 0 3rd Row 1st Column phi Subscript k 2nd Column greater-than-or-equal-to 3rd Column 0 normal f normal o normal r k equals 0 comma 1 comma ellipsis comma q minus 1 EndLayout

For the GARCHleft-parenthesis 2 comma q right-parenthesis model, the nonlinear inequality constraints are

StartLayout 1st Row 1st Column normal upper Delta Subscript i 2nd Column element-of 3rd Column normal upper R for i equals 1 comma 2 2nd Row 1st Column omega Superscript asterisk 2nd Column greater-than-or-equal-to 3rd Column 0 3rd Row 1st Column normal upper Delta 1 2nd Column greater-than 3rd Column 0 4th Row 1st Column sigma-summation Underscript j equals 0 Overscript q minus 1 Endscripts normal upper Delta 1 Superscript negative j Baseline alpha Subscript j plus 1 2nd Column greater-than 3rd Column 0 5th Row 1st Column phi Subscript k 2nd Column greater-than-or-equal-to 3rd Column 0 for k equals 0 comma 1 comma ellipsis comma q EndLayout

where normal upper Delta 1 and normal upper Delta 2 are the roots of left-parenthesis upper Z squared minus gamma 1 upper Z minus gamma 2 right-parenthesis.

For the GARCHleft-parenthesis p comma q right-parenthesis model with p greater-than 2, only max left-parenthesis q minus 1 comma p right-parenthesis plus 1 nonlinear inequality constraints (phi Subscript k Baseline greater-than-or-equal-to 0 for k equals 0 to max(q minus 1 comma p)) are imposed, together with the in-sample positivity constraints of the conditional variance h Subscript t.

IGARCH and Stationary GARCH Model

The condition sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline less-than 1 implies that the GARCH process is weakly stationary since the mean, variance, and autocovariance are finite and constant over time. When the GARCH process is stationary, the unconditional variance of epsilon Subscript t is computed as

bold upper V left-parenthesis epsilon Subscript t Baseline right-parenthesis equals StartFraction omega Over left-parenthesis 1 minus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline minus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline right-parenthesis EndFraction

where epsilon Subscript t Baseline equals StartRoot h Subscript t Baseline EndRoot e Subscript t and h Subscript t is the GARCHleft-parenthesis p comma q right-parenthesis conditional variance.

Sometimes the multistep forecasts of the variance do not approach the unconditional variance when the model is integrated in variance; that is, sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline equals 1.

The unconditional variance does not exist for the IGARCH model. However, it is interesting that the IGARCH model can be strongly stationary even though it is not weakly stationary. For more information, see Nelson (1990).

EGARCH Model

The EGARCH model was proposed by Nelson (1991). Nelson and Cao (1992) argue that the nonnegativity constraints in the linear GARCH model are too restrictive. The GARCH model imposes the nonnegative constraints on the parameters, alpha Subscript i and gamma Subscript j, while there are no restrictions on these parameters in the EGARCH model. In the EGARCH model, the conditional variance, h Subscript t, is an asymmetric function of lagged disturbances epsilon Subscript t minus i,

ln left-parenthesis h Subscript t Baseline right-parenthesis equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline g left-parenthesis z Subscript t minus i Baseline right-parenthesis plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline ln left-parenthesis h Subscript t minus j Baseline right-parenthesis

where

g left-parenthesis z Subscript t Baseline right-parenthesis equals theta z Subscript t Baseline plus gamma left-bracket StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue minus upper E StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue right-bracket
z Subscript t Baseline equals epsilon Subscript t Baseline slash StartRoot h Subscript t Baseline EndRoot

The coefficient of the second term in g left-parenthesis z Subscript t Baseline right-parenthesis is set to be 1 (gamma=1) in our formulation. Note that upper E StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue equals left-parenthesis 2 slash pi right-parenthesis Superscript 1 slash 2 if z Subscript t Baseline tilde normal upper N left-parenthesis 0 comma 1 right-parenthesis. The properties of the EGARCH model are summarized as follows:

  • The function g left-parenthesis z Subscript t Baseline right-parenthesis is linear in z Subscript t with slope coefficient theta plus 1 if z Subscript t is positive while g left-parenthesis z Subscript t Baseline right-parenthesis is linear in z Subscript t with slope coefficient theta minus 1 if z Subscript t is negative.

  • Suppose that theta equals 0. Large innovations increase the conditional variance if StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue minus upper E StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue greater-than 0 and decrease the conditional variance if StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue minus upper E StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue less-than 0.

  • Suppose that theta less-than 1. The innovation in variance, g left-parenthesis z Subscript t Baseline right-parenthesis, is positive if the innovations z Subscript t are less than left-parenthesis 2 slash pi right-parenthesis Superscript 1 slash 2 Baseline slash left-parenthesis theta minus 1 right-parenthesis. Therefore, the negative innovations in returns, epsilon Subscript t, cause the innovation to the conditional variance to be positive if theta is much less than 1.

The unconditional variance does not exist for the EGARCH model.

QGARCH, TGARCH, and PGARCH Models

As shown in many empirical studies, positive and negative innovations have different impacts on future volatility. There is a long list of variations of GARCH models that consider the asymmetricity. Three typical variations are the quadratic GARCH (QGARCH) model (Engle and Ng 1993), the threshold GARCH (TGARCH) model (Glosten, Jaganathan, and Runkle 1993; Zakoian 1994), and the power GARCH (PGARCH) model (Ding, Granger, and Engle 1993). For more information about the asymmetric GARCH models, see Engle and Ng (1993).

In the QGARCH model, the lagged errors’ centers are shifted from zero to some constant values:

h Subscript t Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline left-parenthesis epsilon Subscript t minus i Baseline minus psi Subscript i Baseline right-parenthesis squared plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j

In the TGARCH model, there is an extra slope coefficient for each lagged squared error,

h Subscript t Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts left-parenthesis alpha Subscript i Baseline plus 1 Subscript epsilon Sub Subscript t minus i Subscript less-than 0 Baseline psi Subscript i Baseline right-parenthesis epsilon Subscript t minus i Superscript 2 Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j

where the indicator function 1 Subscript epsilon Sub Subscript t Subscript less-than 0 is one if epsilon Subscript t Baseline less-than 0; otherwise, it is zero.

The PGARCH model not only considers the asymmetric effect but also provides another way to model the long memory property in the volatility,

h Subscript t Superscript lamda Baseline equals omega plus sigma-summation Underscript i equals 1 Overscript q Endscripts alpha Subscript i Baseline left-parenthesis StartAbsoluteValue epsilon Subscript t minus i Baseline EndAbsoluteValue minus psi Subscript i Baseline epsilon Subscript t minus i Baseline right-parenthesis Superscript 2 lamda Baseline plus sigma-summation Underscript j equals 1 Overscript p Endscripts gamma Subscript j Baseline h Subscript t minus j Superscript lamda

where lamda greater-than 0 and StartAbsoluteValue psi Subscript i Baseline EndAbsoluteValue less-than-or-equal-to 1 comma i equals 1 comma ellipsis comma q.

Note that the implemented TGARCH model is also well known as GJR-GARCH (Glosten, Jaganathan, and Runkle 1993), which is similar to the threshold GARCH model proposed by Zakoian (1994) but not exactly the same. In Zakoian’s model, the conditional standard deviation is a linear function of the past values of the white noise. Zakoian’s version can be regarded as a special case of the PGARCH model when lamda equals 1 slash 2. The unconditional variance does not exist for the QGARCH, TGARCH, and PGARCH models.

Using the HETERO Statement with GARCH Models

The HETERO statement can be combined with the GARCH= option in the MODEL statement to include input variables in the GARCH conditional variance model. For example, the GARCHleft-parenthesis 1 comma 1 right-parenthesis variance model with two dummy input variables, D1 and D2, is

StartLayout 1st Row 1st Column epsilon Subscript t 2nd Column equals 3rd Column StartRoot h Subscript t Baseline EndRoot e Subscript t 2nd Row 1st Column h Subscript t 2nd Column equals 3rd Column omega plus alpha 1 epsilon Subscript t minus 1 Superscript 2 plus gamma 1 h Subscript t minus 1 plus eta 1 upper D 1 Subscript t plus eta 2 upper D 2 Subscript t EndLayout

The following statements estimate this GARCH model:

proc autoreg data=one;
   model y = x z / garch=(p=1,q=1);
   hetero d1 d2;
run;

The parameters for the variables D1 and D2 can be constrained using the COEF= option. For example, the constraints eta 1 equals eta 2 equals 1 are imposed by the following statements:

proc autoreg data=one;
   model y = x z / garch=(p=1,q=1);
   hetero d1 d2 / coef=unit;
run;

For the EGARCH model, the input variables enter ln left-parenthesis h Subscript t Baseline right-parenthesis. For example, the EGARCHleft-parenthesis 1 comma 1 right-parenthesis model with two dummy input variables, D1 and D2, is

ln left-parenthesis h Subscript t Baseline right-parenthesis equals omega plus alpha 1 g left-parenthesis z Subscript t minus 1 Baseline right-parenthesis plus gamma 1 ln left-parenthesis h Subscript t minus 1 Baseline right-parenthesis plus eta 1 upper D 1 Subscript t Baseline plus eta 2 upper D 2 Subscript t

where

g left-parenthesis z Subscript t Baseline right-parenthesis equals theta z Subscript t Baseline plus gamma left-bracket StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue minus upper E StartAbsoluteValue z Subscript t Baseline EndAbsoluteValue right-bracket
z Subscript t Baseline equals epsilon Subscript t Baseline slash StartRoot h Subscript t Baseline EndRoot

The following statements estimate the EGARCH model:

proc autoreg data=one;
   model y = x z / garch=(p=1,q=1,type=egarch);
   hetero d1 d2;
run;

For the PGARCH model, the input variables enter h Subscript t Superscript lamda. For example, the PGARCHleft-parenthesis 1 comma 1 right-parenthesis model with two dummy input variables, D1 and D2, is

h Subscript t Superscript lamda Baseline equals omega plus alpha 1 left-parenthesis StartAbsoluteValue epsilon Subscript t minus 1 Baseline EndAbsoluteValue minus psi 1 epsilon Subscript t minus 1 Baseline right-parenthesis Superscript 2 lamda Baseline plus gamma Subscript j Baseline h Subscript t minus j Superscript lamda Baseline plus eta 1 upper D 1 Subscript t Baseline plus eta 2 upper D 2 Subscript t

The following statements estimate the PGARCH model:

proc autoreg data=one;
   model y = x z / garch=(p=1,q=1,type=pgarch);
   hetero d1 d2;
run;

GARCH-in-Mean

The GARCH-M model has the added regressor that is the conditional standard deviation,

y Subscript t Baseline equals bold x prime Subscript t Baseline beta plus delta StartRoot h Subscript t Baseline EndRoot plus epsilon Subscript t
epsilon Subscript t Baseline equals StartRoot h Subscript t Baseline EndRoot e Subscript t

where h Subscript t follows the ARCH or GARCH process.

Maximum Likelihood Estimation

The family of GARCH models are estimated using the maximum likelihood method. The log-likelihood function is computed from the product of all conditional densities of the prediction errors.

When e Subscript t is assumed to have a standard normal distribution (e Subscript t Baseline tilde normal upper N left-parenthesis 0 comma 1 right-parenthesis), the log-likelihood function is given by

l equals sigma-summation Underscript t equals 1 Overscript upper N Endscripts one-half left-bracket minus ln left-parenthesis 2 pi right-parenthesis minus ln left-parenthesis h Subscript t Baseline right-parenthesis minus StartFraction epsilon Subscript t Superscript 2 Baseline Over h Subscript t Baseline EndFraction right-bracket

where epsilon Subscript t Baseline equals y Subscript t Baseline minus bold x prime Subscript t Baseline beta and h Subscript t is the conditional variance. When the GARCHleft-parenthesis p comma q right-parenthesis-M model is estimated, epsilon Subscript t Baseline equals y Subscript t Baseline minus bold x prime Subscript t Baseline beta minus delta StartRoot h Subscript t Baseline EndRoot. When there are no regressors, the residuals epsilon Subscript t are denoted as y Subscript t or y Subscript t Baseline minus delta StartRoot h Subscript t Baseline EndRoot.

If e Subscript t has the standardized Student’s t distribution, the log-likelihood function for the conditional t distribution is

script l equals sigma-summation Underscript t equals 1 Overscript upper N Endscripts left-bracket ln left-parenthesis normal upper Gamma left-parenthesis StartFraction nu plus 1 Over 2 EndFraction right-parenthesis right-parenthesis minus ln left-parenthesis normal upper Gamma left-parenthesis StartFraction nu Over 2 EndFraction right-parenthesis right-parenthesis minus one-half ln left-parenthesis left-parenthesis nu minus 2 right-parenthesis pi h Subscript t Baseline right-parenthesis
minus one-half left-parenthesis nu plus 1 right-parenthesis ln left-parenthesis 1 plus StartFraction epsilon Subscript t Superscript 2 Baseline Over h Subscript t Baseline left-parenthesis nu minus 2 right-parenthesis EndFraction right-parenthesis right-bracket

where normal upper Gamma left-parenthesis dot right-parenthesis is the gamma function and nu is the degree of freedom (nu greater-than 2). Under the conditional t distribution, the additional parameter 1 slash nu is estimated. The log-likelihood function for the conditional t distribution converges to the log-likelihood function of the conditional normal GARCH model as 1 slash nu right-arrow 0.

The likelihood function is maximized via either the dual quasi-Newton or the trust region algorithm. The default is the dual quasi-Newton algorithm. The starting values for the regression parameters beta are obtained from the OLS estimates. When there are autoregressive parameters in the model, the initial values are obtained from the Yule-Walker estimates. The starting value 1.0 Superscript negative 6 is used for the GARCH process parameters. Computation of the conditional variance sequence h Subscript t requires values for h Subscript negative p plus 1 Baseline comma ellipsis comma h Baseline 0 and epsilon Subscript negative q plus 1 Superscript 2 Baseline comma ellipsis comma epsilon 0 squared. For the GARCH(p,q) model, epsilon Subscript negative q plus 1 Superscript 2 Baseline comma ellipsis comma epsilon 0 squared are set to the mean square error from the OLS estimation if no autoregressive model is specified, and set to the mean square error from the Yule-Walker estimation if an autoregressive model is specified. Values for h Subscript negative p plus 1 Baseline comma ellipsis comma h 0 are then set equal to the expected conditional variance, upper E left-parenthesis h Subscript t Baseline), which you compute by setting upper E left-parenthesis epsilon Subscript t Superscript 2 Baseline right-parenthesis to the initializing MSE for epsilon Subscript negative q plus 1 Superscript 2 Baseline comma ellipsis comma epsilon 0 squared. For the GARCH(1,1) model, this gives h 0 equals left-parenthesis omega plus alpha 1 asterisk normal upper M normal upper S normal upper E right-parenthesis slash left-parenthesis 1 minus gamma 1 right-parenthesis.

The variance-covariance matrix is computed using the Hessian matrix. The dual quasi-Newton method approximates the Hessian matrix while the quasi-Newton method gets an approximation of the inverse of Hessian. The trust region method uses the Hessian matrix obtained using numerical differentiation. When there are active constraints, that is, bold q left-parenthesis theta right-parenthesis equals bold 0, the variance-covariance matrix is given by

bold upper V left-parenthesis ModifyingAbove theta With caret right-parenthesis equals bold upper H Superscript negative 1 Baseline left-bracket bold upper I minus bold upper Q prime left-parenthesis bold upper Q bold upper H Superscript negative 1 Baseline bold upper Q prime right-parenthesis Superscript negative 1 Baseline bold upper Q bold upper H Superscript negative 1 Baseline right-bracket

where bold upper H equals minus partial-differential squared l slash partial-differential theta partial-differential theta prime and bold upper Q equals partial-differential bold q left-parenthesis theta right-parenthesis slash partial-differential theta prime. Therefore, the variance-covariance matrix without active constraints reduces to bold upper V left-parenthesis ModifyingAbove theta With caret right-parenthesis equals bold upper H Superscript negative 1.

Last updated: June 19, 2025