SSM Procedure

State Space Model and Notation

The (linear) state space model is described in the literature in a few different ways and with varying degree of generality. The description given in this section loosely follows the description given in Durbin and Koopman (2012, chap. 6, sec. 4). This formulation of SSM is quite general; in particular, it includes nonstationary SSMs with time-varying system matrices and state equations with a diffuse initial condition (these terms are defined later in this subsection).

Suppose that observations are collected in a sequential fashion (indexed by a numeric variable ) on some variables: the vector , which denotes the q-variate response values, and the k-dimensional vector , which denotes the predictors. Suppose that the observation instances are . The possibility that multiple observations are taken at a particular instance is not ruled out, and the successive observation instances do not need to be regularly spaced—that is, does not need to equal . For , suppose () denotes the number of observations recorded at instance . For notational simplicity, an integer-valued secondary index t is used to index the data so that corresponds to , corresponds to , and so on. Consider the following model:

StartLayout 1st Row 1st Column bold upper Y Subscript t 2nd Column equals bold upper Z Subscript t Baseline alpha alpha Subscript t Baseline plus bold upper X Subscript t Baseline beta beta plus epsilon epsilon Subscript t Baseline 3rd Column Observation equation 2nd Row 1st Column alpha alpha Subscript t plus 1 2nd Column equals bold upper T Subscript t Baseline alpha alpha Subscript t Baseline plus bold upper W Subscript t plus 1 Baseline gamma gamma plus bold c Subscript t plus 1 Baseline plus eta eta Subscript t plus 1 Baseline 3rd Column State transition equation 3rd Row 1st Column alpha alpha 1 2nd Column equals bold c 1 plus bold upper A bold 1 delta delta plus bold upper W 1 gamma gamma plus eta eta 1 3rd Column Initial condition EndLayout

The following list describes these equations:

The observation equation describes the relationship between the -dimensional response vector and the unobserved vectors , , and . The q-variate responses are vertically stacked in a column to form this -dimensional response vector . The m-dimensional vectors are called states, the k-dimensional vector is the regression coefficient vector associated with predictors , and the -dimensional vectors are called the observation disturbances. The matrices (of dimension ) and (of dimension ) correspond to the state effect and the regression effect, respectively. The elements of are assumed to be fully known. The states and the disturbances are random sequences. It is assumed that is a sequence of independent, zero-mean, Gaussian random vectors with diagonal covariances, with the diagonal elements denoted by .
The state sequence is assumed to follow a Markovian structure described by the state transition equation and the associated initial condition.
The state transition equation postulates that a new instance of the state, , is obtained by multiplying its previous instance, , by an m-dimensional square matrix (called the state transition matrix) and by adding three more terms: a known nonrandom vector (called the state input); a regression term , where is an -dimensional design matrix with fully known elements and is the g-dimensional regression vector; and a random disturbance vector . The m-dimensional state disturbance vectors are assumed to be independent, zero-mean, Gaussian random vectors with covariances (not necessarily diagonal).
The initial condition describes the starting condition of the state evolution equation. The starting state vector is assumed to be partially diffuse: it is the sum of a known nonrandom vector , a mean-zero Gaussian vector , and the terms and . represents the contribution from a d-dimensional diffuse vector (a diffuse vector is a Gaussian vector with infinite covariance). The observation and state regression vectors and are also assumed to be diffuse. The matrix is assumed to be completely known.
The observation disturbances and the state disturbances (for ) are assumed to be mutually independent. Either the elements of the matrices , , and and the diagonal elements of the observation disturbance covariances are assumed to be completely known, or some of them can be functions of a small set of unknown parameters (to be estimated from the data). Suppose that this unknown set of parameters is denoted by .
The d-dimensional diffuse vector from the state initial condition together with the observation and state regression vectors and constitute the overall -dimensional diffuse initial condition of the model. For more information, see the section Filtering, Smoothing, Likelihood, and Structural Break Detection.

Although this description of the state space model might appear involved, it conveniently covers many variants of the SSMs that are encountered in practice and precisely describes the most general case that can be handled by the SSM procedure. An important restriction about the preceding description of the model formulation is that it assumes that the matrices and that appear in the observation equation and the state equation respectively are free of unknown parameters and that the covariances of the observation disturbances are diagonal. In most practical situations, the model under consideration can be easily reformulated to a statistically equivalent form that conforms to this restriction.

Note: The transition matrix in the state equation relates the state at time t with the state at time . In many situations, such as when the observations are taken at irregular time intervals, depends on information at both t and . Therefore, it is more appropriate to denote the transition matrix as . However, for simplicity, the former notation is used throughout this chapter. The same comment applies to the covariance matrix of the disturbance term .

For easy reference, Table 4 summarizes the information contained in the SSM equations.

Table 4: State Space Model: Notation

Notation	Description
	Distinct index values at which the observations are recorded
n	Number of distinct index instances
	Number of observations recorded at index ,
q	Number of response variables in the model
	Vertically stacked vector of response values recorded at
	Total number of response values in the data set
k	Number of predictor (regressor) variables in the observation equation
	matrix of predictor values recorded at
	k-dimensional regression vector that is associated with the predictors
	-dimensional observation disturbance vector with diagonal covariance
m	Dimension of the state vectors
	m-dimensional state vector
	matrix that is associated with in the observation equation
	state transition matrix
	m-dimensional state input vector
	design matrix associated with , the state regression vector
	g-dimensional state regression vector
	m-dimensional state disturbance vector
d	Dimension of the diffuse vector in the state initial condition
,	Diffuse vector in the state initial condition
	constant matrix associated with
	m-dimensional state disturbance vector in the initial condition
	Parameter vector

Last updated: June 19, 2025