SSM Procedure

Models with Dependent Lags

Many useful time series models relate the present value of a response variable to its own lagged values and, in the multivariate case, the lagged values of other response variables in the model. In the SSM procedure, you can use the DEPLAG statement to specify the terms in the model that involve lagged response variables. These models apply only to the regular data type. This section describes the state space form of such models; for more information, see Harvey (1989, sec. 7.1.1). As an illustration, consider the following model, where the q-dimensional coefficient matrices normal upper Phi normal upper Phi 1 and normal upper Phi normal upper Phi 2 are either fully or partially known:

StartLayout 1st Row 1st Column bold upper Y Subscript t 2nd Column equals 3rd Column normal upper Phi normal upper Phi 1 bold upper Y Subscript t minus 1 plus normal upper Phi normal upper Phi 2 bold upper Y Subscript t minus 2 plus bold upper Z Subscript t Baseline alpha alpha Subscript t plus bold upper X Subscript t Baseline beta beta plus epsilon epsilon Subscript t 2nd Row 1st Column alpha alpha Subscript t plus 1 2nd Column equals 3rd Column bold upper T Subscript t Baseline alpha alpha Subscript t plus bold upper W Subscript t plus 1 Baseline gamma gamma plus bold c Subscript t plus 1 Baseline plus eta eta Subscript t plus 1 3rd Row 1st Column alpha alpha 1 2nd Column equals 3rd Column bold c 1 plus bold upper A 1 delta delta plus eta eta 1 EndLayout

Except for the presence of the terms that involve lagged response vectors (normal upper Phi normal upper Phi 1 bold upper Y Subscript t minus 1 and normal upper Phi normal upper Phi 2 bold upper Y Subscript t minus 2) in the observation equation, the form of this model is the same as the standard state space form that is described in the section State Space Model and Notation. It turns out that this model can be expressed in the standard state space form by suitably enlarging the latent vectors in the state equation and by appropriately reorganizing the system matrices. The enlarged latent vectors and the corresponding system matrices are distinguished by the presence of dagger (dagger) as a superscript in the following reformulated model,

StartLayout 1st Row 1st Column bold upper Y Subscript t 2nd Column equals 3rd Column bold upper Z Subscript t Superscript dagger Baseline alpha alpha Subscript t Superscript dagger 2nd Row 1st Column alpha alpha Subscript t plus 1 Superscript dagger 2nd Column equals 3rd Column bold upper T Subscript t Superscript dagger Baseline alpha alpha Subscript t Superscript dagger plus bold upper W Subscript t plus 1 Superscript dagger Baseline gamma gamma Superscript dagger plus bold c Subscript t plus 1 Superscript dagger Baseline plus eta eta Subscript t plus 1 Superscript dagger 3rd Row 1st Column alpha alpha 1 Superscript dagger 2nd Column equals 3rd Column bold c 1 Superscript dagger Baseline plus bold upper A 1 Superscript dagger Baseline delta delta Superscript dagger plus eta eta 1 Superscript dagger EndLayout

where the following conditions are true (column vectors are displayed horizontally to save space):

  • The enlarged state vector (alpha alpha Subscript t Superscript dagger) is formed by vertically stacking the old state vector (alpha alpha Subscript t), the observation disturbance vector (epsilon epsilon Subscript t), and the present and lagged response vectors (bold upper Y Subscript t and bold upper Y Subscript t minus 1, respectively). That is, alpha alpha Subscript t Superscript dagger Baseline equals left-bracket alpha alpha Subscript t Baseline epsilon epsilon Subscript t Baseline bold upper Y Subscript t Baseline bold upper Y Subscript t minus 1 Baseline right-bracket. Because alpha alpha Subscript t is m-dimensional and epsilon epsilon Subscript t, bold upper Y Subscript t, and bold upper Y Subscript t minus 1 are q-dimensional, the dimension of alpha alpha Subscript t Superscript dagger is m Superscript dagger Baseline equals left-parenthesis m plus 3 asterisk q right-parenthesis.

  • The new state regression vector (gamma gamma Superscript dagger) is formed by vertically stacking the old state regression vector (gamma gamma) and the observation equation regression vector (beta beta). That is, gamma gamma Superscript dagger Baseline equals left-bracket gamma gamma beta beta right-bracket.

  • The enlarged disturbance vector (eta eta Subscript t Superscript dagger) is formed by vertically stacking the old state disturbance vector (eta eta Subscript t), the observation disturbance vector (epsilon epsilon Subscript t), the vector sum left-parenthesis bold upper Z Subscript t Baseline eta eta Subscript t Baseline plus epsilon epsilon Subscript t Baseline right-parenthesis, and filling the rest of the vector with zeros. That is, eta eta Subscript t Superscript dagger Baseline equals left-bracket eta eta Subscript t Baseline epsilon epsilon Subscript t Baseline left-parenthesis bold upper Z Subscript t Baseline eta eta Subscript t Baseline plus epsilon epsilon Subscript t Baseline right-parenthesis Baseline bold 0 right-bracket.

  • The deterministic vector bold c Subscript t plus 1 Superscript dagger Baseline equals left-bracket bold c Subscript t plus 1 Baseline bold 0 bold upper Z Subscript t plus 1 Baseline bold c Subscript t plus 1 Baseline bold 0 right-bracket.

  • The last 2q elements of the initial state vector (alpha alpha 1 Superscript dagger), which correspond to bold upper Y 1, and bold upper Y 0, are taken to be diffuse (which means that the diffuse vector delta delta Superscript dagger has 2q additional elements compared to delta delta).

The new system matrices can be described in blockwise form in terms of the old system matrices as follows:

  • The q times left-parenthesis m plus 3 asterisk q right-parenthesis-dimensional bold upper Z Subscript t Superscript dagger Baseline equals left-bracket bold 0 bold 0 bold upper I Baseline bold 0 right-bracket, where bold 0 is either a q times m-dimensional or q times q-dimensional matrix of zeros and bold upper I is a q-dimensional identity matrix.

  • The m Superscript dagger Baseline times m Superscript dagger matrices bold upper T Subscript t Superscript dagger (transition matrix) and bold upper Q Subscript t Superscript dagger (covariance of eta eta Subscript t plus 1 Superscript dagger) are

    bold upper T Subscript t Superscript dagger Baseline equals Start 4 By 4 Matrix 1st Row 1st Column bold upper T Subscript t Baseline 2nd Column bold 0 3rd Column bold 0 4th Column bold 0 2nd Row 1st Column bold 0 2nd Column bold 0 3rd Column bold 0 4th Column bold 0 3rd Row 1st Column bold upper Z Subscript t plus 1 Baseline bold upper T Subscript t Baseline 2nd Column bold 0 3rd Column normal upper Phi normal upper Phi 1 4th Column normal upper Phi normal upper Phi 2 4th Row 1st Column bold 0 2nd Column bold 0 3rd Column bold upper I 4th Column bold 0 EndMatrix and bold upper Q Subscript t Superscript dagger Baseline equals Start 4 By 4 Matrix 1st Row 1st Column bold upper Q Subscript t Baseline 2nd Column bold 0 3rd Column bold upper Q Subscript t Baseline bold upper Z Subscript t plus 1 Superscript Super Superscript prime Superscript Baseline 4th Column bold 0 2nd Row 1st Column bold 0 2nd Column normal upper Sigma normal upper Sigma Subscript t plus 1 Baseline 3rd Column normal upper Sigma normal upper Sigma Subscript t plus 1 Baseline 4th Column bold 0 3rd Row 1st Column bold upper Z Subscript t plus 1 Baseline bold upper Q Subscript t Baseline 2nd Column normal upper Sigma normal upper Sigma Subscript t plus 1 Baseline 3rd Column left-parenthesis bold upper Z Subscript t plus 1 Baseline bold upper Q Subscript t Baseline bold upper Z Subscript t plus 1 Superscript Super Superscript prime Superscript Baseline plus normal upper Sigma normal upper Sigma Subscript t plus 1 Baseline right-parenthesis 4th Column bold 0 4th Row 1st Column bold 0 2nd Column bold 0 3rd Column bold 0 4th Column bold 0 EndMatrix

    where normal upper Sigma normal upper Sigma Subscript t denotes the covariance matrix (which is diagonal by design) of the observation error vector epsilon epsilon Subscript t. Recall that the system matrices in the transition equation can depend on both t and t plus 1 even if the subscripts of bold upper T and bold upper Q show dependence on t alone.

  • The m Superscript dagger Baseline times left-parenthesis k plus g right-parenthesis matrix bold upper W Subscript t Superscript dagger is

    bold upper W Subscript t plus 1 Superscript dagger Baseline equals Start 4 By 2 Matrix 1st Row 1st Column bold upper W Subscript t plus 1 Baseline 2nd Column bold 0 2nd Row 1st Column bold 0 2nd Column bold 0 3rd Row 1st Column bold upper Z Subscript t plus 1 Baseline bold upper W Subscript t plus 1 Baseline 2nd Column bold upper X Subscript t plus 1 Baseline 4th Row 1st Column bold 0 2nd Column bold 0 EndMatrix

This state space form can be easily extended to account for higher-order lags.

Models that contain dependent lag terms must be used with care. Because the SSM procedure does not impose any special constraints on the lag coefficients (the elements of coefficient matrices normal upper Phi normal upper Phi 1 comma normal upper Phi normal upper Phi 2 comma and so on), the resulting models can often be explosive. For an example of a model with lagged response variables, see Example 33.13.

PROC SSM and PROC UCM (see ChapterĀ 41, UCM Procedure) handle models that contain dependent lags in essentially the same way. However, there is one difference: if the model parameter vector contains unknown lag parameters, PROC UCM parameters are estimated by optimizing the nondiffuse part of the likelihood, whereas PROC SSM continues to use the full diffuse likelihood for parameter estimation.

Last updated: June 19, 2025