VARMAX Procedure

VAR and VARX Modeling

The pth-order VAR process is written as

StartLayout 1st Row bold y Subscript t Baseline minus bold-italic mu equals sigma-summation Underscript i equals 1 Overscript p Endscripts normal upper Phi Subscript i Baseline left-parenthesis bold y Subscript t minus i Baseline minus bold-italic mu right-parenthesis plus bold-italic epsilon Subscript t Baseline normal o normal r normal upper Phi left-parenthesis upper B right-parenthesis left-parenthesis bold y Subscript t Baseline minus bold-italic mu right-parenthesis equals bold-italic epsilon Subscript t Baseline EndLayout

with .

Equivalently, it can be written as

StartLayout 1st Row bold y Subscript t Baseline equals bold-italic delta plus sigma-summation Underscript i equals 1 Overscript p Endscripts normal upper Phi Subscript i Baseline bold y Subscript t minus i Baseline plus bold-italic epsilon Subscript t Baseline normal o normal r normal upper Phi left-parenthesis upper B right-parenthesis bold y Subscript t Baseline equals bold-italic delta plus bold-italic epsilon Subscript t Baseline EndLayout

with .

Stationarity

For stationarity, the VAR process must be expressible in the convergent causal infinite MA form as

StartLayout 1st Row bold y Subscript t Baseline equals bold-italic mu plus sigma-summation Underscript j equals 0 Overscript normal infinity Endscripts normal upper Psi Subscript j Baseline bold-italic epsilon Subscript t minus j EndLayout

where with , where denotes a norm for the matrix A such as . The matrix can be recursively obtained from the relation ; it is

StartLayout 1st Row normal upper Psi Subscript j Baseline equals normal upper Phi 1 normal upper Psi Subscript j minus 1 Baseline plus normal upper Phi 2 normal upper Psi Subscript j minus 2 Baseline plus midline-horizontal-ellipsis plus normal upper Phi Subscript p Baseline normal upper Psi Subscript j minus p EndLayout

where and for .

The stationarity condition is satisfied if all roots of are outside of the unit circle. The stationarity condition is equivalent to the condition in the corresponding VAR(1) representation, , that all eigenvalues of the companion matrix be less than one in absolute value, where , , and

StartLayout 1st Row normal upper Phi equals Start 5 By 5 Matrix 1st Row 1st Column normal upper Phi 1 2nd Column normal upper Phi 2 3rd Column midline-horizontal-ellipsis 4th Column normal upper Phi Subscript p minus 1 Baseline 5th Column normal upper Phi Subscript p Baseline 2nd Row 1st Column upper I Subscript k Baseline 2nd Column 0 3rd Column midline-horizontal-ellipsis 4th Column 0 5th Column 0 3rd Row 1st Column 0 2nd Column upper I Subscript k Baseline 3rd Column midline-horizontal-ellipsis 4th Column 0 5th Column 0 4th Row 1st Column vertical-ellipsis 2nd Column vertical-ellipsis 3rd Column down-right-diagonal-ellipsis 4th Column vertical-ellipsis 5th Column vertical-ellipsis 5th Row 1st Column 0 2nd Column 0 3rd Column midline-horizontal-ellipsis 4th Column upper I Subscript k Baseline 5th Column 0 EndMatrix EndLayout

If the stationarity condition is not satisfied, a nonstationary model (a differenced model or an error correction model) might be more appropriate.

The following statements estimate a VAR(1) model and use the ROOTS option to compute the characteristic polynomial roots:

proc varmax data=simul1;
   model y1 y2 / p=1 noint print=(roots);
run;

Figure 63 shows the output associated with the ROOTS option, which indicates that the series is stationary since the modulus of the eigenvalue is less than one.

Figure 63: Stationarity (ROOTS Option)

The VARMAX Procedure

Roots of AR Characteristic Polynomial
Index	Real	Imaginary	Modulus	Radian	Degree
1	0.77238	0.35899	0.8517	0.4351	24.9284
2	0.77238	-0.35899	0.8517	-0.4351	-24.9284

Parameter Estimation

Consider the stationary VAR(p) model

where are assumed to be available (for convenience of notation). This can be represented by the general form of the multivariate linear model,

StartLayout 1st Row upper Y equals upper X upper B plus upper E or bold y equals left-parenthesis upper X circled-times upper I Subscript k Baseline right-parenthesis bold-italic beta plus bold e EndLayout

where

StartLayout 1st Row 1st Column upper Y 2nd Column equals 3rd Column left-parenthesis bold y 1 comma ellipsis comma bold y Subscript upper T Baseline right-parenthesis prime 2nd Row 1st Column upper B 2nd Column equals 3rd Column left-parenthesis bold-italic delta comma normal upper Phi 1 comma ellipsis comma normal upper Phi Subscript p Baseline right-parenthesis prime 3rd Row 1st Column upper X 2nd Column equals 3rd Column left-parenthesis upper X 0 comma ellipsis comma upper X Subscript upper T minus 1 Baseline right-parenthesis prime 4th Row 1st Column upper X Subscript t 2nd Column equals 3rd Column left-parenthesis 1 comma bold y prime Subscript t Baseline comma ellipsis comma bold y prime Subscript t minus p plus 1 right-parenthesis prime 5th Row 1st Column upper E 2nd Column equals 3rd Column left-parenthesis bold-italic epsilon 1 comma ellipsis comma bold-italic epsilon Subscript upper T Baseline right-parenthesis prime 6th Row 1st Column bold y 2nd Column equals 3rd Column vec left-parenthesis upper Y Superscript prime Baseline right-parenthesis 7th Row 1st Column bold-italic beta 2nd Column equals 3rd Column vec left-parenthesis upper B Superscript prime Baseline right-parenthesis 8th Row 1st Column bold e 2nd Column equals 3rd Column vec left-parenthesis upper E prime right-parenthesis EndLayout

with vec denoting the column stacking operator.

The conditional least squares estimator of is

ModifyingAbove bold-italic beta With caret equals left-parenthesis left-parenthesis upper X prime upper X right-parenthesis Superscript negative 1 Baseline upper X prime circled-times upper I Subscript k Baseline right-parenthesis bold y

and the estimate of is

ModifyingAbove normal upper Sigma With caret equals left-parenthesis upper T minus left-parenthesis k p plus 1 right-parenthesis right-parenthesis Superscript negative 1 Baseline sigma-summation Underscript t equals 1 Overscript upper T Endscripts ModifyingAbove bold-italic epsilon Subscript t Baseline With caret ModifyingAbove bold-italic epsilon Subscript t Baseline With caret prime

where is the residual vectors. Consistency and asymptotic normality of the LS estimator are that

where converges in probability to and denotes convergence in distribution.

The (conditional) maximum likelihood estimator in the VAR(p) model is equal to the (conditional) least squares estimator on the assumption of normality of the error vectors.

Asymptotic Distributions of Impulse Response Functions

As before, vec denotes the column stacking operator and vech is the corresponding operator that stacks the elements on and below the diagonal. For any matrix A, the commutation matrix is defined as ; the duplication matrix is defined as ; the elimination matrix is defined as .

The asymptotic distribution of the impulse response function (Lütkepohl 1993) is

where and

upper G Subscript j Baseline equals StartFraction partial-differential normal v normal e normal c left-parenthesis normal upper Psi Subscript j Baseline right-parenthesis Over partial-differential bold-italic beta prime EndFraction equals sigma-summation Underscript i equals 0 Overscript j minus 1 Endscripts bold upper J left-parenthesis bold upper Phi prime right-parenthesis Superscript j minus 1 minus i Baseline circled-times normal upper Psi Subscript i

where is a matrix and is a companion matrix.

The asymptotic distribution of the accumulated impulse response function is

where .

The asymptotic distribution of the orthogonalized impulse response function is

where , , ,

upper H equals StartFraction partial-differential normal v normal e normal c left-parenthesis normal upper Psi 0 Superscript o Baseline right-parenthesis Over partial-differential bold-italic sigma prime EndFraction equals upper L prime Subscript k Baseline StartSet upper L Subscript k Baseline left-parenthesis upper I Subscript k squared Baseline plus upper K Subscript k Baseline right-parenthesis left-parenthesis normal upper Psi 0 Superscript o Baseline circled-times upper I Subscript k Baseline right-parenthesis upper L prime Subscript k EndSet Superscript negative 1

and with and .

Granger Causality Test

Let be arranged and partitioned in subgroups and with dimensions and , respectively (); that is, with the corresponding white noise process . Consider the VAR(p) model with partitioned coefficients for as follows:

StartLayout 1st Row Start 2 By 2 Matrix 1st Row 1st Column normal upper Phi 11 left-parenthesis upper B right-parenthesis 2nd Column normal upper Phi 12 left-parenthesis upper B right-parenthesis 2nd Row 1st Column normal upper Phi 21 left-parenthesis upper B right-parenthesis 2nd Column normal upper Phi 22 left-parenthesis upper B right-parenthesis EndMatrix StartBinomialOrMatrix bold y Subscript 1 t Baseline Choose bold y Subscript 2 t Baseline EndBinomialOrMatrix equals StartBinomialOrMatrix bold-italic delta 1 Choose bold-italic delta 2 EndBinomialOrMatrix plus StartBinomialOrMatrix bold-italic epsilon Subscript 1 t Baseline Choose bold-italic epsilon Subscript 2 t EndBinomialOrMatrix EndLayout

The variables are said to cause , but do not cause if . The implication of this model structure is that future values of the process are influenced only by its own past and not by the past of , where future values of are influenced by the past of both and . If the future are not influenced by the past values of , then it can be better to model separately from .

Consider testing , where C is a matrix of rank s and c is an s-dimensional vector where . Assuming that

you get the Wald statistic

For the Granger causality test, the matrix C consists of zeros or ones and c is the zero vector. For more information about the Granger causality test, see Lütkepohl (1993).

VARX Modeling

The vector autoregressive model with exogenous variables is called the VARX(p,s) model. The form of the VARX(p,s) model can be written as

The parameter estimates can be obtained by representing the general form of the multivariate linear model,

where

The conditional least squares estimator of can be obtained by using the same method in a VAR(p) modeling. If the multivariate linear model has different independent variables that correspond to dependent variables, the SUR (seemingly unrelated regression) method is used to improve the regression estimates.

The following example fits the ordinary regression model:

proc varmax data=one;
   model y1-y3 = x1-x5;
run;

This is equivalent to the REG procedure in the SAS/STAT software:

proc reg data=one;
   model y1 = x1-x5;
   model y2 = x1-x5;
   model y3 = x1-x5;
run;

The following example fits the second-order lagged regression model:

proc varmax data=two;
   model y1 y2 = x / xlag=2;
run;

This is equivalent to the REG procedure in the SAS/STAT software:

data three;
   set two;
   xlag1 = lag1(x);
   xlag2 = lag2(x);
run;

proc reg data=three;
   model y1 = x xlag1 xlag2;
   model y2 = x xlag1 xlag2;
run;

The following example fits the ordinary regression model with different regressors:

proc varmax data=one;
   model y1 = x1-x3, y2 = x2 x3;
run;

This is equivalent to the following SYSLIN procedure statements:

proc syslin data=one vardef=df sur;
   endogenous y1 y2;
   model y1 = x1-x3;
   model y2 = x2 x3;
run;

From the output in Figure 25 in the section Getting Started: VARMAX Procedure, you can see that the parameters, XL0_1_2, XL0_2_1, XL0_3_1, and XL0_3_2 associated with the exogenous variables, are not significant. The following example fits the VARX(1,0) model with different regressors:

proc varmax data=grunfeld;
   model y1 = x1, y2 = x2, y3 / p=1 print=(estimates);
run;

Figure 64: Parameter Estimates for the VARX(1, 0) Model

The VARMAX Procedure

XLag
Lag	Variable	x1	x2
0	y1	1.83231	_
	y2	_	2.42110
	y3	_	_

As you can see in Figure 64, the symbol ‘_’ in the elements of matrix corresponds to endogenous variables that do not take the denoted exogenous variables.

Last updated: June 19, 2025