SSM Procedure

Models with Dependent Lags

Many useful time series models relate the present value of a response variable to its own lagged values and, in the multivariate case, the lagged values of other response variables in the model. In the SSM procedure, you can use the DEPLAG statement to specify the terms in the model that involve lagged response variables. These models apply only to the regular data type. This section describes the state space form of such models; for more information, see Harvey (1989, sec. 7.1.1). As an illustration, consider the following model, where the q-dimensional coefficient matrices and are either fully or partially known:

StartLayout 1st Row 1st Column bold upper Y Subscript t 2nd Column equals 3rd Column normal upper Phi normal upper Phi 1 bold upper Y Subscript t minus 1 plus normal upper Phi normal upper Phi 2 bold upper Y Subscript t minus 2 plus bold upper Z Subscript t Baseline alpha alpha Subscript t plus bold upper X Subscript t Baseline beta beta plus epsilon epsilon Subscript t 2nd Row 1st Column alpha alpha Subscript t plus 1 2nd Column equals 3rd Column bold upper T Subscript t Baseline alpha alpha Subscript t plus bold upper W Subscript t plus 1 Baseline gamma gamma plus bold c Subscript t plus 1 Baseline plus eta eta Subscript t plus 1 3rd Row 1st Column alpha alpha 1 2nd Column equals 3rd Column bold c 1 plus bold upper A 1 delta delta plus eta eta 1 EndLayout

Except for the presence of the terms that involve lagged response vectors ( and ) in the observation equation, the form of this model is the same as the standard state space form that is described in the section State Space Model and Notation. It turns out that this model can be expressed in the standard state space form by suitably enlarging the latent vectors in the state equation and by appropriately reorganizing the system matrices. The enlarged latent vectors and the corresponding system matrices are distinguished by the presence of dagger () as a superscript in the following reformulated model,

StartLayout 1st Row 1st Column bold upper Y Subscript t 2nd Column equals 3rd Column bold upper Z Subscript t Superscript dagger Baseline alpha alpha Subscript t Superscript dagger 2nd Row 1st Column alpha alpha Subscript t plus 1 Superscript dagger 2nd Column equals 3rd Column bold upper T Subscript t Superscript dagger Baseline alpha alpha Subscript t Superscript dagger plus bold upper W Subscript t plus 1 Superscript dagger Baseline gamma gamma Superscript dagger plus bold c Subscript t plus 1 Superscript dagger Baseline plus eta eta Subscript t plus 1 Superscript dagger 3rd Row 1st Column alpha alpha 1 Superscript dagger 2nd Column equals 3rd Column bold c 1 Superscript dagger Baseline plus bold upper A 1 Superscript dagger Baseline delta delta Superscript dagger plus eta eta 1 Superscript dagger EndLayout

where the following conditions are true (column vectors are displayed horizontally to save space):

The enlarged state vector () is formed by vertically stacking the old state vector (), the observation disturbance vector (), and the present and lagged response vectors ( and , respectively). That is, . Because is m-dimensional and , , and are q-dimensional, the dimension of is .
The new state regression vector () is formed by vertically stacking the old state regression vector () and the observation equation regression vector (). That is, .
The enlarged disturbance vector () is formed by vertically stacking the old state disturbance vector (), the observation disturbance vector (), the vector sum , and filling the rest of the vector with zeros. That is, .
The deterministic vector .
The last 2q elements of the initial state vector (), which correspond to , and , are taken to be diffuse (which means that the diffuse vector has 2q additional elements compared to ).

The new system matrices can be described in blockwise form in terms of the old system matrices as follows:

The -dimensional , where is either a -dimensional or -dimensional matrix of zeros and is a q-dimensional identity matrix.
The matrices (transition matrix) and (covariance of ) are

where denotes the covariance matrix (which is diagonal by design) of the observation error vector . Recall that the system matrices in the transition equation can depend on both t and even if the subscripts of and show dependence on t alone.
The matrix is

This state space form can be easily extended to account for higher-order lags.

Models that contain dependent lag terms must be used with care. Because the SSM procedure does not impose any special constraints on the lag coefficients (the elements of coefficient matrices ), the resulting models can often be explosive. For an example of a model with lagged response variables, see Example 33.13.

PROC SSM and PROC UCM (see Chapter 41, UCM Procedure) handle models that contain dependent lags in essentially the same way. However, there is one difference: if the model parameter vector contains unknown lag parameters, PROC UCM parameters are estimated by optimizing the nondiffuse part of the likelihood, whereas PROC SSM continues to use the full diffuse likelihood for parameter estimation.

Last updated: June 19, 2025