VARMAX Procedure

Tentative Order Selection

Sample Cross-Covariance and Cross-Correlation Matrices

Given a stationary multivariate time series , cross-covariance matrices are

normal upper Gamma left-parenthesis l right-parenthesis equals normal upper E left-bracket left-parenthesis bold y Subscript t Baseline minus bold-italic mu right-parenthesis left-parenthesis bold y Subscript t plus l Baseline minus bold-italic mu right-parenthesis prime right-bracket

where , and cross-correlation matrices are

rho left-parenthesis l right-parenthesis equals upper D Superscript negative 1 Baseline normal upper Gamma left-parenthesis l right-parenthesis upper D Superscript negative 1

where D is a diagonal matrix with the standard deviations of the components of on the diagonal.

The sample cross-covariance matrix at lag l, denoted as , is computed as

ModifyingAbove normal upper Gamma With caret left-parenthesis l right-parenthesis equals upper C left-parenthesis l right-parenthesis equals StartFraction 1 Over upper T EndFraction sigma-summation Underscript t equals 1 Overscript upper T minus l Endscripts bold y overTilde Subscript t Baseline bold y overTilde prime Subscript t plus l

where is the centered data and is the number of nonmissing observations. Thus, the (i, j) element of is . The sample cross-correlation matrix at lag l is computed as

ModifyingAbove rho With caret Subscript i j Baseline left-parenthesis l right-parenthesis equals c Subscript i j Baseline left-parenthesis l right-parenthesis slash left-bracket c Subscript i i Baseline left-parenthesis 0 right-parenthesis c Subscript j j Baseline left-parenthesis 0 right-parenthesis right-bracket Superscript 1 slash 2 Baseline comma i comma j equals 1 comma ellipsis comma k

The following statements use the CORRY option to compute the sample cross-correlation matrices and their summary indicator plots in terms of and , where + indicates significant positive cross-correlations, indicates significant negative cross-correlations, and indicates insignificant cross-correlations:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3 print=(corry)
                 printform=univariate;
run;

Figure 58 shows the sample cross-correlation matrices of and . As shown, the sample autocorrelation functions for each variable decay quickly, but are significant with respect to two standard errors.

Figure 58: Cross-Correlations (CORRY Option)

The VARMAX Procedure

Cross Correlations of Dependent Series by Variable
Variable	Lag	y1	y2
y1	0	1.00000	0.67041
	1	0.83143	0.84330
	2	0.56094	0.81972
	3	0.26629	0.66154
y2	0	0.67041	1.00000
	1	0.29707	0.77132
	2	-0.00936	0.48658
	3	-0.22058	0.22014

Schematic Representation of Cross Correlations
Variable/Lag	0	1	2	3
y1	++	++	++	++
y2	++	++	.+	-+
+ is > 2std error, - is < -2std error, . is between

Partial Autoregressive Matrices

For each , you can define a sequence of matrices , which is called the partial autoregression matrices of lag m, as the solution for to the Yule-Walker equations of order m,

StartLayout 1st Row normal upper Gamma left-parenthesis l right-parenthesis equals sigma-summation Underscript i equals 1 Overscript m Endscripts normal upper Gamma left-parenthesis l minus i right-parenthesis normal upper Phi Subscript i m Superscript prime Baseline comma l equals 1 comma 2 comma ellipsis comma m EndLayout

The sequence of the partial autoregression matrices of order m has the characteristic property that if the process follows the AR(p), then and for . Hence, the matrices have the cutoff property for a VAR(p) model, and so they can be useful in the identification of the order of a pure VAR model.

The following statements use the PARCOEF option to compute the partial autoregression matrices:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3
                 printform=univariate
                print=(corry parcoef pcorr
                       pcancorr roots);
run;

Figure 59 shows that the model can be obtained by an AR order since partial autoregression matrices are insignificant after lag 1 with respect to two standard errors. The matrix for lag 1 is the same as the Yule-Walker autoregressive matrix.

Figure 59: Partial Autoregression Matrices (PARCOEF Option)

The VARMAX Procedure

Partial Autoregression
Lag	Variable	y1	y2
1	y1	1.14844	-0.50954
	y2	0.54985	0.37409
2	y1	-0.00724	0.05138
	y2	0.02409	0.05909
3	y1	-0.02578	0.03885
	y2	-0.03720	0.10149

Schematic Representation of Partial Autoregression
Variable/Lag	1	2	3
y1	+-	..	..
y2	++	..	..
+ is > 2std error, - is < -2std error, . is between

Partial Correlation Matrices

Define the forward autoregression

StartLayout 1st Row bold y Subscript t Baseline equals sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Phi Subscript i comma m minus 1 Baseline bold y Subscript t minus i Baseline plus bold u Subscript m comma t EndLayout

and the backward autoregression

StartLayout 1st Row bold y Subscript t minus m Baseline equals sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Phi Subscript i comma m minus 1 Superscript asterisk Baseline bold y Subscript t minus m plus i Baseline plus bold u Subscript m comma t minus m Superscript asterisk EndLayout

The matrices defined by Ansley and Newbold (1979) are given by

StartLayout 1st Row upper P left-parenthesis m right-parenthesis equals normal upper Sigma Subscript m minus 1 Superscript asterisk 1 slash 2 Baseline normal upper Phi prime Subscript m m Baseline normal upper Sigma Subscript m minus 1 Superscript negative 1 slash 2 EndLayout

where

StartLayout 1st Row normal upper Sigma Subscript m minus 1 Baseline equals normal upper C normal o normal v left-parenthesis bold u Subscript m comma t Baseline right-parenthesis equals normal upper Gamma left-parenthesis 0 right-parenthesis minus sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Gamma left-parenthesis negative i right-parenthesis normal upper Phi prime Subscript i comma m minus 1 EndLayout

and

StartLayout 1st Row normal upper Sigma Subscript m minus 1 Superscript asterisk Baseline equals normal upper C normal o normal v left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline right-parenthesis equals normal upper Gamma left-parenthesis 0 right-parenthesis minus sigma-summation Underscript i equals 1 Overscript m minus 1 Endscripts normal upper Gamma left-parenthesis m minus i right-parenthesis normal upper Phi Subscript m minus i comma m minus 1 Superscript asterisk prime EndLayout

are the partial cross-correlation matrices at lag m between the elements of and , given . The matrices have the cutoff property for a VAR(p) model, and so they can be useful in the identification of the order of a pure VAR structure.

The following statements use the PCORR option to compute the partial cross-correlation matrices:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3
                 print=(pcorr)
                 printform=univariate;
run;

The partial cross-correlation matrices in Figure 60 are insignificant after lag 1 with respect to two standard errors. This indicates that an AR order of can be an appropriate choice.

Figure 60: Partial Correlations (PCORR Option)

The VARMAX Procedure

Partial Cross Correlations by Variable
Variable	Lag	y1	y2
y1	1	0.80348	0.42672
	2	0.00276	0.03978
	3	-0.01091	0.00032
y2	1	-0.30946	0.71906
	2	0.04676	0.07045
	3	0.01993	0.10676

Schematic Representation of Partial Cross Correlations
Variable/Lag	1	2	3
y1	++	..	..
y2	-+	..	..
+ is > 2std error, - is < -2std error, . is between

Partial Canonical Correlation Matrices

The partial canonical correlations at lag m between the vectors and , given , are . The partial canonical correlations are the canonical correlations between the residual series and , where and are defined in the previous section. Thus, the squared partial canonical correlations are the eigenvalues of the matrix

StartSet normal upper C normal o normal v left-parenthesis bold u Subscript m comma t Baseline right-parenthesis EndSet Superscript negative 1 Baseline normal upper E left-parenthesis bold u Subscript m comma t Baseline bold u Subscript m comma t minus m Superscript asterisk prime Baseline right-parenthesis StartSet normal upper C normal o normal v left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline right-parenthesis EndSet Superscript negative 1 Baseline normal upper E left-parenthesis bold u Subscript m comma t minus m Superscript asterisk Baseline bold u Subscript m comma t Superscript prime Baseline right-parenthesis equals normal upper Phi Subscript m m Superscript asterisk prime Baseline normal upper Phi Subscript m m Superscript prime

It follows that the test statistic to test for in the VAR model of order is approximately

left-parenthesis upper T minus m right-parenthesis normal t normal r StartSet normal upper Phi Subscript m m Superscript asterisk prime Baseline normal upper Phi Subscript m m Superscript prime Baseline EndSet almost-equals left-parenthesis upper T minus m right-parenthesis sigma-summation Underscript i equals 1 Overscript k Endscripts rho Subscript i Superscript 2 Baseline left-parenthesis m right-parenthesis

and has an asymptotic chi-square distribution with degrees of freedom for .

The following statements use the PCANCORR option to compute the partial canonical correlations:

proc varmax data=simul1;
   model y1 y2 / p=1 noint lagmax=3 print=(pcancorr);
run;

Figure 61 shows that the partial canonical correlations between and are {0.918, 0.773}, {0.092, 0.018}, and {0.109, 0.011} for lags 1 to 3. After lag 1, the partial canonical correlations are insignificant with respect to the 0.05 significance level, indicating that an AR order of can be an appropriate choice.

Figure 61: Partial Canonical Correlations (PCANCORR Option)

The VARMAX Procedure

Partial Canonical Correlations
Lag	Correlation1	Correlation2	DF	Chi-Square	Pr > ChiSq
1	0.91783	0.77335	4	142.61	<.0001
2	0.09171	0.01816	4	0.86	0.9307
3	0.10861	0.01078	4	1.16	0.8854

The Minimum Information Criterion (MINIC) Method

The minimum information criterion (MINIC) method can tentatively identify the orders of a VARMA(p,q) process (Spliid 1983; Koreisha and Pukkila 1989; Quinn 1980). The first step of this method is to obtain estimates of the innovations series, , from the VAR(), where is chosen sufficiently large. The choice of the autoregressive order, , is determined by use of a selection criterion. From the selected VAR() model, you obtain estimates of residual series

bold-italic epsilon Subscript t Baseline overTilde equals bold y Subscript t Baseline minus sigma-summation Underscript i equals 1 Overscript p Subscript epsilon Baseline Endscripts ModifyingAbove normal upper Phi With caret Subscript i Superscript p Super Subscript epsilon Superscript Baseline bold y Subscript t minus i Baseline minus ModifyingAbove bold-italic delta With caret Superscript p Super Subscript epsilon Superscript Baseline comma t equals p Subscript epsilon Baseline plus 1 comma ellipsis comma upper T

In the second step, you select the order () of the VARMA model for p in and q in

bold y Subscript t Baseline equals bold-italic delta plus sigma-summation Underscript i equals 1 Overscript p Endscripts normal upper Phi Subscript i Baseline bold y Subscript t minus i Baseline minus sigma-summation Underscript i equals 1 Overscript q Endscripts normal upper Theta Subscript i Baseline bold-italic epsilon overTilde Subscript t minus i Baseline plus bold-italic epsilon Subscript t

which minimizes a selection criterion like SBC or HQ.

According to Lütkepohl (1993), the information criteria, namely Akaike’s information criterion (AIC), the corrected Akaike’s information criterion (AICC), the final prediction error criterion (FPE), the Hannan-Quinn criterion (HQC), and the Schwarz Bayesian criterion (SBC), are defined as

StartLayout 1st Row 1st Column AIC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k slash upper T 2nd Row 1st Column AICC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k slash left-parenthesis upper T minus r Subscript b Baseline right-parenthesis 3rd Row 1st Column FPE 2nd Column equals 3rd Column left-parenthesis StartFraction upper T plus r Subscript b Baseline Over upper T minus r Subscript b Baseline EndFraction right-parenthesis Superscript k Baseline StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue 4th Row 1st Column HQC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus 2 r Subscript b Baseline k log left-parenthesis log left-parenthesis upper T right-parenthesis right-parenthesis slash upper T 5th Row 1st Column SBC 2nd Column equals 3rd Column log left-parenthesis StartAbsoluteValue normal upper Sigma overTilde EndAbsoluteValue right-parenthesis plus r Subscript b Baseline k log left-parenthesis upper T right-parenthesis slash upper T EndLayout

where is the maximum likelihood estimate of the innovation covariance matrix , is the number of parameters in each mean equation, k is the number of dependent variables, and T is the number of observations used to estimate the model. Compared to the definitions of AIC, AICC, HQC, and SBC discussed in the section Multivariate Model Diagnostic Checks, the preceding definitions omit some constant terms and are normalized by T. More specifically, only the parameters in each of the mean equations are counted; the parameters in the innovation covariance matrix are not counted.

The following statements use the MINIC= option to compute a table that contains the information criterion associated with various AR and MA orders:

proc varmax data=simul1;
   model y1 y2 / p=1 noint minic=(p=3 q=3);
run;

Figure 62 shows the output associated with the MINIC= option. The criterion takes the smallest value at AR order 1.

Figure 62: MINIC= Option

The VARMAX Procedure

Minimum Information Criterion Based on AICC
Lag	MA 0	MA 1	MA 2	MA 3
AR 0	3.3574947	3.0331352	2.7080996	2.3049869
AR 1	0.5544431	0.6146887	0.6771732	0.7517968
AR 2	0.6369334	0.6729736	0.7610413	0.8481559
AR 3	0.7235629	0.7551756	0.8053765	0.8654079

Last updated: June 19, 2025