VARMAX Procedure

Tentative Order Selection

Sample Cross-Covariance and Cross-Correlation Matrices

Given a stationary multivariate time series bold y Subscript t, cross-covariance matrices are

normal upper Gamma left-parenthesis l right-parenthesis equals normal upper E left-bracket left-parenthesis bold y Subscript t Baseline minus bold-italic mu right-parenthesis left-parenthesis bold y Subscript t plus l Baseline minus bold-italic mu right-parenthesis prime right-bracket

where bold-italic mu equals normal upper E left-parenthesis bold y Subscript t Baseline right-parenthesis, and cross-correlation matrices are

rho left-parenthesis l right-parenthesis equals upper D Superscript negative 1 Baseline normal upper Gamma left-parenthesis l right-parenthesis upper D Superscript negative 1

where D is a diagonal matrix with the standard deviations of the components of bold y Subscript t on the diagonal.

The sample cross-covariance matrix at lag l, denoted as upper C left-parenthesis l right-parenthesis, is computed as

ModifyingAbove normal upper Gamma With caret left-parenthesis l right-parenthesis equals upper C left-parenthesis l right-parenthesis equals StartFraction 1 Over upper T EndFraction sigma-summation Underscript t equals 1 Overscript upper T minus l Endscripts bold y overTilde Subscript t Baseline bold y overTilde prime Subscript t plus l

where bold y overTilde Subscript t is the centered data and upper T is the number of nonmissing observations. Thus, the (i, j) element of ModifyingAbove normal upper Gamma With caret left-parenthesis l right-parenthesis is ModifyingAbove gamma With caret Subscript i j Baseline left-parenthesis l right-parenthesis equals c Subscript i j Baseline left-parenthesis l right-parenthesis. The sample cross-correlation matrix at lag l is computed as

ModifyingAbove rho With caret Subscript i j Baseline left-parenthesis l right-parenthesis equals c Subscript i j Baseline left-parenthesis l right-parenthesis slash left-bracket c Subscript i i Baseline left-parenthesis 0 right-parenthesis c Subscript j j Baseline left-parenthesis 0 right-parenthesis right-bracket Superscript 1 slash 2 Baseline comma i comma j equals 1 comma ellipsis comma k

The following statements use the CORRY option to compute the sample cross-correlation matrices and their summary indicator plots in terms of plus comma minus comma and dot, where + indicates significant positive cross-correlations, minus indicates significant negative cross-correlations, and dot indicates insignificant cross-correlations:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3 print=(corry)
                 printform=univariate;
run;

Figure 58 shows the sample cross-correlation matrices of y Subscript 1 t and y Subscript 2 t. As shown, the sample autocorrelation functions for each variable decay quickly, but are significant with respect to two standard errors.

Figure 58: Cross-Correlations (CORRY Option)

The VARMAX Procedure

Cross Correlations of Dependent Series
by Variable
Variable Lag y1 y2
y1 0 1.00000 0.67041
  1 0.83143 0.84330
  2 0.56094 0.81972
  3 0.26629 0.66154
y2 0 0.67041 1.00000
  1 0.29707 0.77132
  2 -0.00936 0.48658
  3 -0.22058 0.22014

Schematic Representation
of Cross Correlations
Variable/Lag 0 1 2 3
y1 ++ ++ ++ ++
y2 ++ ++ .+ -+
+ is > 2*std error,  - is < -2*std error,  . is between


Partial Autoregressive Matrices

For each m equals 1 comma 2 comma ellipsis comma p, you can define a sequence of matrices normal upper Phi Subscript m m, which is called the partial autoregression matrices of lag m, as the solution for normal upper Phi Subscript m m to the Yule-Walker equations of order m,

StartLayout 1st Row  normal upper Gamma left-parenthesis l right-parenthesis equals sigma-summation Underscript i equals 1 Overscript m Endscripts normal upper Gamma left-parenthesis l minus i right-parenthesis normal upper Phi Subscript i m Superscript prime Baseline comma l equals 1 comma 2 comma ellipsis comma m EndLayout

The sequence of the partial autoregression matrices normal upper Phi Subscript m m of order m has the characteristic property that if the process follows the AR(p), then normal upper Phi Subscript p p Baseline equals normal upper Phi Subscript p and normal upper Phi Subscript m m Baseline equals 0 for m greater-than p. Hence, the matrices normal upper Phi Subscript m m have the cutoff property for a VAR(p) model, and so they can be useful in the identification of the order of a pure VAR model.

The following statements use the PARCOEF option to compute the partial autoregression matrices:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3
                 printform=univariate
                print=(corry parcoef pcorr
                       pcancorr roots);
run;

Figure 59 shows that the model can be obtained by an AR order m equals 1 since partial autoregression matrices are insignificant after lag 1 with respect to two standard errors. The matrix for lag 1 is the same as the Yule-Walker autoregressive matrix.

Figure 59: Partial Autoregression Matrices (PARCOEF Option)

The VARMAX Procedure

Partial Autoregression
Lag Variable y1 y2
1 y1 1.14844 -0.50954
  y2 0.54985 0.37409
2 y1 -0.00724 0.05138
  y2 0.02409 0.05909
3 y1 -0.02578 0.03885
  y2 -0.03720 0.10149

Schematic Representation
of Partial Autoregression
Variable/Lag 1 2 3
y1 +- .. ..
y2 ++ .. ..
+ is > 2*std error,  - is < -2*std error,  . is between


Partial Correlation Matrices

Define the forward autoregression

StartLayout 1st Row  bold y Subscript t Baseline equals sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Phi Subscript i comma m minus 1 Baseline bold y Subscript t minus i Baseline plus bold u Subscript m comma t EndLayout

and the backward autoregression

StartLayout 1st Row  bold y Subscript t minus m Baseline equals sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Phi Subscript i comma m minus 1 Superscript asterisk Baseline bold y Subscript t minus m plus i Baseline plus bold u Subscript m comma t minus m Superscript asterisk EndLayout

The matrices upper P left-parenthesis m right-parenthesis defined by Ansley and Newbold (1979) are given by

StartLayout 1st Row  upper P left-parenthesis m right-parenthesis equals normal upper Sigma Subscript m minus 1 Superscript asterisk 1 slash 2 Baseline normal upper Phi prime Subscript m m Baseline normal upper Sigma Subscript m minus 1 Superscript negative 1 slash 2 EndLayout

where

StartLayout 1st Row  normal upper Sigma Subscript m minus 1 Baseline equals normal upper C normal o normal v left-parenthesis bold u Subscript m comma t Baseline right-parenthesis equals normal upper Gamma left-parenthesis 0 right-parenthesis minus sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Gamma left-parenthesis negative i right-parenthesis normal upper Phi prime Subscript i comma m minus 1 EndLayout

and

StartLayout 1st Row  normal upper Sigma Subscript m minus 1 Superscript asterisk Baseline equals normal upper C normal o normal v left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline right-parenthesis equals normal upper Gamma left-parenthesis 0 right-parenthesis minus sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Gamma left-parenthesis m minus i right-parenthesis normal upper Phi Subscript m minus i comma m minus 1 Superscript asterisk prime EndLayout

upper P left-parenthesis m right-parenthesis are the partial cross-correlation matrices at lag m between the elements of bold y Subscript t and bold y Subscript t minus m, given bold y Subscript t minus 1 Baseline comma ellipsis comma bold y Subscript t minus m plus 1 Baseline. The matrices upper P left-parenthesis m right-parenthesis have the cutoff property for a VAR(p) model, and so they can be useful in the identification of the order of a pure VAR structure.

The following statements use the PCORR option to compute the partial cross-correlation matrices:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3
                 print=(pcorr)
                 printform=univariate;
run;

The partial cross-correlation matrices in Figure 60 are insignificant after lag 1 with respect to two standard errors. This indicates that an AR order of m equals 1 can be an appropriate choice.

Figure 60: Partial Correlations (PCORR Option)

The VARMAX Procedure

Partial Cross Correlations by Variable
Variable Lag y1 y2
y1 1 0.80348 0.42672
  2 0.00276 0.03978
  3 -0.01091 0.00032
y2 1 -0.30946 0.71906
  2 0.04676 0.07045
  3 0.01993 0.10676

Schematic Representation
of Partial Cross
Correlations
Variable/Lag 1 2 3
y1 ++ .. ..
y2 -+ .. ..
+ is > 2*std error,  - is < -2*std error,  . is between


Partial Canonical Correlation Matrices

The partial canonical correlations at lag m between the vectors bold y Subscript t and bold y Subscript t minus m, given bold y Subscript t minus 1 Baseline comma ellipsis comma bold y Subscript t minus m plus 1 Baseline, are 1 greater-than-or-equal-to rho 1 left-parenthesis m right-parenthesis greater-than-or-equal-to rho 2 left-parenthesis m right-parenthesis midline-horizontal-ellipsis greater-than-or-equal-to rho Subscript k Baseline left-parenthesis m right-parenthesis. The partial canonical correlations are the canonical correlations between the residual series bold u Subscript m comma t and bold u Subscript m comma t minus m Superscript asterisk, where bold u Subscript m comma t and bold u Subscript m comma t minus m Superscript asterisk are defined in the previous section. Thus, the squared partial canonical correlations rho Subscript i Superscript 2 Baseline left-parenthesis m right-parenthesis are the eigenvalues of the matrix

StartSet normal upper C normal o normal v left-parenthesis bold u Subscript m comma t Baseline right-parenthesis EndSet Superscript negative 1 Baseline normal upper E left-parenthesis bold u Subscript m comma t Baseline bold u Subscript m comma t minus m Superscript asterisk prime Baseline right-parenthesis StartSet normal upper C normal o normal v left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline right-parenthesis EndSet Superscript negative 1 Baseline normal upper E left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline bold u Subscript m comma t Superscript prime Baseline right-parenthesis equals normal upper Phi Subscript m m Superscript asterisk prime Baseline normal upper Phi Subscript m m Superscript prime

It follows that the test statistic to test for normal upper Phi Subscript m Baseline equals 0 in the VAR model of order m greater-than p is approximately

left-parenthesis upper T minus m right-parenthesis normal t normal r StartSet normal upper Phi Subscript m m Superscript asterisk prime Baseline normal upper Phi Subscript m m Superscript prime Baseline EndSet almost-equals left-parenthesis upper T minus m right-parenthesis sigma-summation Underscript i equals 1 Overscript k Endscripts rho Subscript i Superscript 2 Baseline left-parenthesis m right-parenthesis

and has an asymptotic chi-square distribution with k squared degrees of freedom for m greater-than p.

The following statements use the PCANCORR option to compute the partial canonical correlations:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3 print=(pcancorr);
run;

Figure 61 shows that the partial canonical correlations rho Subscript i Baseline left-parenthesis m right-parenthesis between bold y Subscript t and bold y Subscript t minus m are {0.918, 0.773}, {0.092, 0.018}, and {0.109, 0.011} for lags m equals1 to 3. After lag m equals1, the partial canonical correlations are insignificant with respect to the 0.05 significance level, indicating that an AR order of m equals 1 can be an appropriate choice.

Figure 61: Partial Canonical Correlations (PCANCORR Option)

The VARMAX Procedure

Partial Canonical Correlations
Lag Correlation1 Correlation2 DF Chi-Square Pr > ChiSq
1 0.91783 0.77335 4 142.61 <.0001
2 0.09171 0.01816 4 0.86 0.9307
3 0.10861 0.01078 4 1.16 0.8854


The Minimum Information Criterion (MINIC) Method

The minimum information criterion (MINIC) method can tentatively identify the orders of a VARMA(p,q) process (Spliid 1983; Koreisha and Pukkila 1989; Quinn 1980). The first step of this method is to obtain estimates of the innovations series, bold-italic epsilon Subscript t, from the VAR(p Subscript epsilon), where p Subscript epsilon is chosen sufficiently large. The choice of the autoregressive order, p Subscript epsilon, is determined by use of a selection criterion. From the selected VAR(p Subscript epsilon) model, you obtain estimates of residual series

bold-italic epsilon Subscript t Baseline overTilde equals bold y Subscript t Baseline minus sigma-summation Underscript i equals 1 Overscript p Subscript epsilon Baseline Endscripts ModifyingAbove normal upper Phi With caret Subscript i Superscript p Super Subscript epsilon Superscript Baseline bold y Subscript t minus i Baseline minus ModifyingAbove bold-italic delta With caret Superscript p Super Subscript epsilon Superscript Baseline comma t equals p Subscript epsilon Baseline plus 1 comma ellipsis comma upper T

In the second step, you select the order (p comma q) of the VARMA model for p in left-parenthesis p Subscript m i n Baseline colon p Subscript m a x Baseline right-parenthesis and q in left-parenthesis q Subscript m i n Baseline colon q Subscript m a x Baseline right-parenthesis

bold y Subscript t Baseline equals bold-italic delta plus sigma-summation Underscript i equals 1 Overscript p Endscripts normal upper Phi Subscript i Baseline bold y Subscript t minus i Baseline minus sigma-summation Underscript i equals 1 Overscript q Endscripts normal upper Theta Subscript i Baseline bold-italic epsilon overTilde Subscript t minus i Baseline plus bold-italic epsilon Subscript t

which minimizes a selection criterion like SBC or HQ.

According to Lütkepohl (1993), the information criteria, namely Akaike’s information criterion (AIC), the corrected Akaike’s information criterion (AICC), the final prediction error criterion (FPE), the Hannan-Quinn criterion (HQC), and the Schwarz Bayesian criterion (SBC), are defined as

StartLayout 1st Row 1st Column AIC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k slash upper T 2nd Row 1st Column AICC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k slash left-parenthesis upper T minus r Subscript b Baseline right-parenthesis 3rd Row 1st Column FPE 2nd Column equals 3rd Column left-parenthesis StartFraction upper T plus r Subscript b Baseline Over upper T minus r Subscript b Baseline EndFraction right-parenthesis Superscript k Baseline StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue 4th Row 1st Column HQC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k log left-parenthesis log left-parenthesis upper T right-parenthesis right-parenthesis slash upper T 5th Row 1st Column SBC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus r Subscript b Baseline k log left-parenthesis upper T right-parenthesis slash upper T EndLayout

where normal upper Sigma overTilde is the maximum likelihood estimate of the innovation covariance matrix normal upper Sigma, r Subscript b is the number of parameters in each mean equation, k is the number of dependent variables, and T is the number of observations used to estimate the model. Compared to the definitions of AIC, AICC, HQC, and SBC discussed in the section Multivariate Model Diagnostic Checks, the preceding definitions omit some constant terms and are normalized by T. More specifically, only the parameters in each of the mean equations are counted; the parameters in the innovation covariance matrix normal upper Sigma are not counted.

The following statements use the MINIC= option to compute a table that contains the information criterion associated with various AR and MA orders:

proc varmax data=simul1;
   model y1 y2 / p=1 noint minic=(p=3 q=3);
run;

Figure 62 shows the output associated with the MINIC= option. The criterion takes the smallest value at AR order 1.

Figure 62: MINIC= Option

The VARMAX Procedure

Minimum Information Criterion Based on AICC
Lag MA 0 MA 1 MA 2 MA 3
AR 0 3.3574947 3.0331352 2.7080996 2.3049869
AR 1 0.5544431 0.6146887 0.6771732 0.7517968
AR 2 0.6369334 0.6729736 0.7610413 0.8481559
AR 3 0.7235629 0.7551756 0.8053765 0.8654079


Last updated: June 19, 2025