Provides actions for fitting linear, generalized linear, and logistic models
Fits linear regression models using the method of least squares.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
 code |
casOut |
writes SAS DATA step code for computing predicted values of the fitted model |
|
 output |
required parametercasOut |
creates a table on the server that contains observationwise statistics, which are computed after fitting the model. |
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
 store |
— |
stores regression models to a binary large object (BLOB). |
specifies the significance level to use for the construction of all confidence intervals.
| Default | 0.05 |
|---|---|
| Range | (0, 1) |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
specifies that the analysis not be performed if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
names the classification variables to be used as explanatory variables in the analysis.
For more information about class subparameters, see class Parameter (Shared Concepts).
For more information about specifying the class parameter, see the common classStatement parameter (Appendix A: Common Parameters).
| Alias | classVars |
|---|
lists options that apply to all classification variables.
For more information about specifying the classGlobalOpts parameter, see the common classopts parameter (Appendix A: Common Parameters).
when set to False, suppresses the display of class levels.
| Default | TRUE |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | FALSE |
|---|
writes SAS DATA step code for computing predicted values of the fitted model
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
defines a set of variables that are treated as a single effect that has multiple degrees of freedom.
For more information, see Collection Effects (Shared Concepts).
The collection value can be one or more of the following:
when set to True, requests a table that shows additional details that are related to this effect.
| Default | FALSE |
|---|
specifies the name of the effect.
specifies a set of variables that are treated as a single effect that has multiple degrees of freedom. The columns in the design matrix that are contributed by a collection effect are the design columns of its constituent variables in the order in which they appear in the definition of the collection effect.
specifies a list of results tables to send to the client for display.
For more information about display subparameters, see display Parameter (Shared Concepts).
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
names the numeric variable that contains the frequency of occurrence of each observation.
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies that models not be fit if the number of parameters exceeds the specified value.
| Minimum value | 0 |
|---|
names the dependent variable, explanatory effects, and model options.
For information about model specification, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The glmmodel value can be one or more of the following:
when set to FALSE, ignores the information from the last stop step.
| Default | TRUE |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | FALSE |
|---|
specifies one or more variables to use as response variables in the model. Not all models support more than one response variable.
| Aliases | depVar |
|---|---|
| target |
names the response variable.
specifies a list of effects that define the model. Each term in this list is made up of variables specified in the vars parameter and their interaction (which can be NONE, CROSS, or BAR). When the interaction is BAR, it can be limited by the maxInteract parameter.
For more information, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The effect value can be one or more of the following:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
specifies the entry variable.
specifies effects to include at the start of the selection process for the specified selection method. Included effects are never dropped during the selection process. If you specify n, where n is a positive integer, then the included effects consist of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, models missing values by using extra model effects. These effects consist of dummy variables that take the value 1 when the value of a continuous model variable involved in the effect is missing, and take the value 0 otherwise. The missing value in the original model effect is replaced by the average value of the effect for the nonmissing values. For classification variables, missing values are treated as valid levels.
For more information, see Informative Missingness (Shared Concepts).
| Default | FALSE |
|---|
when set to True, does not include the intercept term in the model.
| Default | FALSE |
|---|
specifies the ridge constant values for ridge regression.
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | FALSE |
|---|
specifies effects to use to begin the selection process in the FORWARD, FORWARDSWAP, and STEPWISE selection methods. If you specify n, where n is a positive integer, then the starting model consists of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, produces standardized regression coefficients. A standardized regression coefficient is computed by dividing a parameter estimate by the ratio of the sample standard deviation of the dependent variable to the sample standard deviation of the regressor.
| Default | FALSE |
|---|
when set to True, produces tolerance values for the estimates. Tolerance for a variable is defined as 1 - R-square, where R-square is obtained from the regression of the variable on all other regressors in the model.
| Default | FALSE |
|---|
when set to True, produces variance inflation factors with the parameter estimates. Variance inflation is the reciprocal of tolerance.
| Default | FALSE |
|---|
Crossproducts
| Default | FALSE |
|---|
Scaled Crossproducts
| Default | FALSE |
|---|
Unscaled Crossproducts
| Default | FALSE |
|---|
uses one or more classification variables specified in the vars parameter in such a way that each observation can be associated with one or more levels of the union of the levels of the classification variables.
For more information, see Multimember Effects (Shared Concepts).
For more information about specifying the multimember parameter, see the common multimember parameter (Appendix A: Common Parameters).
limits the display of class levels. The value 0 suppresses all levels.
| Minimum value | 0 |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
creates a table on the server that contains observationwise statistics, which are computed after fitting the model.
For more information about the available statistics, see Predicted Values and Regression Diagnostics (GENSELECT Procedure in SAS Visual Statistics: Procedures).
For more information, see OUTPUT Statement (GENSELECT Procedure in SAS Visual Statistics: Procedures).
The glmOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
names the Cook's D influence statistic.
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
names the standard influence of the observation on covariance of betas. The COVRATIO statistic measures the change in the determinant of the covariance matrix of the estimates by deleting the ith observation.
names the scaled measure of the change in the predicted value for the ith observation and is calculated by deleting the ith observation. A large value indicates that the observation is very influential in its neighborhood of the X space.
names the leverage of the observation.
names the lower bound of a confidence interval for an individual prediction.
names the lower bound of a confidence interval for the expected value of the dependent variable.
names the likelihood displacement.
names the predicted value. If you do not specify any output statistics, then by default the predicted value is named Pred.
names the ith residual divided by 1 - h, where h is the leverage, and where the model has been refit without the ith observation.
names the residual, calculated as ACTUAL minus PREDICTED.
identifies the training, validation, and test roles for the observations.
names the studentized residual with the current observation deleted.
names the standard error of the individual predicted value.
names the standard error of the mean predicted value.
names the standard error of the residual.
names the studentized residuals, which are the residuals divided by their standard errors.
names the upper bound of a confidence interval for an individual prediction.
names the upper bound of a confidence interval for the expected value of the dependent variable.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies whether to add raw and formatted values of classification variables in the ParameterEstimates table.
| Default | RAW |
|---|
specifies the fractions of the data to be used for validation and testing.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
names the variable and its values used to partition the data into training, validation, and testing roles.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
| Long form | partByVar={name="variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies a polynomial effect. All specified variables must be numeric. A design matrix column is generated for each term of the specified polynomial. By default, each of these terms is treated as a separate effect for the purpose of model building.
For more information, see Polynomial Effects (Shared Concepts).
For more information about specifying the polynomial parameter, see the common polynomial parameter (Appendix A: Common Parameters).
| Alias | poly |
|---|
specifies the method and options for performing model selection.
For more information, see selection Parameter (Shared Concepts).
| Long form | selection={method="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE"} |
|---|---|
| Shortcut form | selection="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE" |
The selectionStatement value can be one or more of the following:
when set to True, applies adaptive weights to each of the coefficients in the LASSO method.
| Default | FALSE |
|---|
specifies options to perform best-subset selection.
The bestOptions value can be one or more of the following:
specifies the maximum number of subset models to display.
| Minimum value | 0 |
|---|
when set to True, requests estimated regression coefficients for each subset model.
| Alias | beta |
|---|---|
| Default | FALSE |
when set to True, adds Akaike's information criterion to the selection summary.
| Alias | aic |
|---|---|
| Default | FALSE |
when set to True, adds the Bayesian information criterion to the selection summary.
| Alias | bic |
|---|---|
| Default | FALSE |
when set to True, adds estimated mean square error of prediction to the selection summary.
| Alias | gmsep |
|---|---|
| Default | FALSE |
when set to True, adds final prediction error to the selection summary.
| Alias | jp |
|---|---|
| Default | FALSE |
when set to True, adds mean square error to the selection summary.
| Alias | mse |
|---|---|
| Default | FALSE |
when set to True, adds Amemiya's prediction criterion to the selection summary.
| Alias | pc |
|---|---|
| Default | FALSE |
when set to True, adds root mean square error to the selection summary.
| Alias | rmse |
|---|---|
| Default | FALSE |
when set to True, adds the Schwarz Bayesian criterion to the selection summary.
| Alias | sbc |
|---|---|
| Default | FALSE |
when set to True, adds SP to the selection summary.
| Alias | sp |
|---|---|
| Default | FALSE |
when set to True, adds error sum of squares to the selection summary.
| Alias | sse |
|---|---|
| Default | FALSE |
specifies the true standard deviation of the error term to use in computing the CP and BIC statistics.
| Default | 0 |
|---|
specifies the maximum number of candidates to display at each step of the selection process, when the detail level ALL is specified.
specifies the criterion for choosing the model. The specified criterion is evaluated at each step of the selection process, and the model that yields the best value of the criterion is chosen.
For more information, see the discussion of the choose subparameter (Shared Concepts).
when set to True, evaluates (during stepwise selection) the selection criterion for all models in which an effect currently in the model is dropped or an effect not yet in the model is added. The effect whose removal from or addition to the model yields the maximum improvement to the selection criterion is dropped or added.
| Default | FALSE |
|---|
specifies the level of detail to produce about the selection process.
For more information, see the description of the details subparameter (Shared Concepts).
| Default | SUMMARY |
|---|
specifies options to use in performing elastic net selection methods.
The enOptions value can be one or more of the following:
specifies the absolute function difference convergence criterion.
| Alias | abstol |
|---|---|
| Default | 1E-08 |
| Minimum value | 0 |
specifies the relative function difference convergence criterion.
| Alias | fTol |
|---|---|
| Minimum value | 0 |
specifies the relative gradient convergence criterion.
| Alias | gTol |
|---|---|
| Minimum value | 0 |
specifies the regularization parameters in the elastic net selection method.
specifies the elastic net mixing parameter.
specifies the number of regularization parameters in the elastic net selection method.
| Alias | nLambda |
|---|---|
| Default | 0 |
| Minimum value | 0 |
specifies the scaling factor to use in computing minimum regularization parameter.
| Range | (0, 1) |
|---|
specifies a solver for elastic net selection.
when set to True, applies scaling to beta in the elastic net selection method.
| Default | FALSE |
|---|
specifies the number of iterations to use in the elastic net selection method.
| Default | 50 |
|---|
specifies options to use in performing the folded concave penalized (FCP) selection methods.
The fcpOptions value can be one or more of the following:
specifies the alpha value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the big M constant in the mixed integer linear programming (MILP) solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the tolerance for truncating estimated coefficients.
| Alias | coefficientTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the tolerance for integer variables in the mixed integer linear programming (MILP) solver.
| Alias | integerTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the fixed lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the lambda searching grid in the SCAD and MCP selection methods.
specifies the upper bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the number of iterations to use in searching for the best alpha value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum number of iterations to use in searching for the best lambda value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the time limit allowed for the optimization solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the lower bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the minimum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
when set to True, applies normalization in computing the crossproducts matrix.
| Default | TRUE |
|---|
specifies the solver to use in the SCAD and MCP selection methods.
specifies the exponent of the power transformation that is applied to the parameters in forming the adaptive weights.
| Default | 1 |
|---|
specifies whether and how to apply the model hierarchy requirement. Model hierarchy refers to the requirement that, for any term to be in the model, all model effects that are contained in the term must be present in the model.
For more information, see the description of the hierarchy subparameter (Shared Concepts).
| Default | DEFAULT |
|---|
specifies the coefficients in the relaxed LASSO method.
specifies the L2 parameter in the elastic net selection method.
| Default | 0 |
|---|
specifies the upper bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | maxL2 |
|---|---|
| Default | 1 |
specifies the lower bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | minL2 |
|---|---|
| Default | 0 |
when set to True, specifies a hybrid version of the LAR or LASSO method, in which the sequence of models is determined by the LAR or LASSO method but the coefficients of the parameters for the model at any step are determined by using ordinary least squares.
| Default | FALSE |
|---|
specifies the maximum number of effects in any model to consider during the selection process. This parameter is ignored for backward selection.
specifies the maximum number of selection steps to perform.
specifies the model selection method.
For more information, see Model Selection Methods (Shared Concepts).
| Default | STEPWISE |
|---|
specifies the minimum number of effects in any model to consider during backward selection.
when set to True, shows effects in parameter estimates tables in the order in which they were added to the model.
| Default | FALSE |
|---|
when set to True, produces coefficientProgression and selectionSummaryForPlots tables that you can use to produce selection diagnostic plots.
For more information about the plots you can create, see Model Selection Plots (Shared Concepts in SAS Visual Statistics: Procedures).
| Default | FALSE |
|---|
when set to True, applies the relaxed LASSO method.
| Default | FALSE |
|---|
specifies the criterion to use in determining the order in which effects enter or leave at each step of the selection method. This parameter does not apply to LAR or LASSO selection.
For more information, see the discussion of the select subparameter (Shared Concepts).
specifies the significance level for entry when the significance level is used as the select or stop criterion.
| Alias | sle |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies the significance level for removal when the significance level is used as the select or stop criterion.
| Alias | sls |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies a criterion that to use for stopping the selection process. If you do not specify a stop criterion, then the criterion that is used to select the model is also used as the stop criterion.
For more information, see the discussion of the stop subparameter (Shared Concepts).
specifies the number of consecutive steps at which the stop criterion must worsen in order for a local extremum to be detected.
For more information, see the description of the stopHorizon subparameter (Shared Concepts).
| Default | 3 |
|---|
expands variables into spline bases whose form depends on the specified parameters.
For more information, see Spline Effects (Shared Concepts).
For more information about specifying the spline parameter, see the common spline parameter (Appendix A: Common Parameters).
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | FALSE |
|---|
stores regression models to a binary large object (BLOB).
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | savestate |
|---|
specifies the input data table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
names the numeric variable to use to perform a weighted analysis of the data.
Fits linear regression models using the method of least squares.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
 code |
casOut |
writes SAS DATA step code for computing predicted values of the fitted model |
|
 output |
required parametercasOut |
creates a table on the server that contains observationwise statistics, which are computed after fitting the model. |
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
 store |
— |
stores regression models to a binary large object (BLOB). |
specifies the significance level to use for the construction of all confidence intervals.
| Default | 0.05 |
|---|---|
| Range | (0, 1) |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
specifies that the analysis not be performed if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
names the classification variables to be used as explanatory variables in the analysis.
For more information about class subparameters, see class Parameter (Shared Concepts).
For more information about specifying the class parameter, see the common classStatement parameter (Appendix A: Common Parameters).
| Alias | classVars |
|---|
lists options that apply to all classification variables.
For more information about specifying the classGlobalOpts parameter, see the common classopts parameter (Appendix A: Common Parameters).
when set to False, suppresses the display of class levels.
| Default | true |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | false |
|---|
writes SAS DATA step code for computing predicted values of the fitted model
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
defines a set of variables that are treated as a single effect that has multiple degrees of freedom.
For more information, see Collection Effects (Shared Concepts).
The collection value can be one or more of the following:
when set to True, requests a table that shows additional details that are related to this effect.
| Default | false |
|---|
specifies the name of the effect.
specifies a set of variables that are treated as a single effect that has multiple degrees of freedom. The columns in the design matrix that are contributed by a collection effect are the design columns of its constituent variables in the order in which they appear in the definition of the collection effect.
specifies a list of results tables to send to the client for display.
For more information about display subparameters, see display Parameter (Shared Concepts).
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
names the numeric variable that contains the frequency of occurrence of each observation.
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies that models not be fit if the number of parameters exceeds the specified value.
| Minimum value | 0 |
|---|
names the dependent variable, explanatory effects, and model options.
For information about model specification, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The glmmodel value can be one or more of the following:
when set to FALSE, ignores the information from the last stop step.
| Default | true |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | false |
|---|
specifies one or more variables to use as response variables in the model. Not all models support more than one response variable.
| Aliases | depVar |
|---|---|
| target |
names the response variable.
specifies a list of effects that define the model. Each term in this list is made up of variables specified in the vars parameter and their interaction (which can be NONE, CROSS, or BAR). When the interaction is BAR, it can be limited by the maxInteract parameter.
For more information, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The effect value can be one or more of the following:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
specifies the entry variable.
specifies effects to include at the start of the selection process for the specified selection method. Included effects are never dropped during the selection process. If you specify n, where n is a positive integer, then the included effects consist of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, models missing values by using extra model effects. These effects consist of dummy variables that take the value 1 when the value of a continuous model variable involved in the effect is missing, and take the value 0 otherwise. The missing value in the original model effect is replaced by the average value of the effect for the nonmissing values. For classification variables, missing values are treated as valid levels.
For more information, see Informative Missingness (Shared Concepts).
| Default | false |
|---|
when set to True, does not include the intercept term in the model.
| Default | false |
|---|
specifies the ridge constant values for ridge regression.
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | false |
|---|
specifies effects to use to begin the selection process in the FORWARD, FORWARDSWAP, and STEPWISE selection methods. If you specify n, where n is a positive integer, then the starting model consists of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, produces standardized regression coefficients. A standardized regression coefficient is computed by dividing a parameter estimate by the ratio of the sample standard deviation of the dependent variable to the sample standard deviation of the regressor.
| Default | false |
|---|
when set to True, produces tolerance values for the estimates. Tolerance for a variable is defined as 1 - R-square, where R-square is obtained from the regression of the variable on all other regressors in the model.
| Default | false |
|---|
when set to True, produces variance inflation factors with the parameter estimates. Variance inflation is the reciprocal of tolerance.
| Default | false |
|---|
Crossproducts
| Default | false |
|---|
Scaled Crossproducts
| Default | false |
|---|
Unscaled Crossproducts
| Default | false |
|---|
uses one or more classification variables specified in the vars parameter in such a way that each observation can be associated with one or more levels of the union of the levels of the classification variables.
For more information, see Multimember Effects (Shared Concepts).
For more information about specifying the multimember parameter, see the common multimember parameter (Appendix A: Common Parameters).
limits the display of class levels. The value 0 suppresses all levels.
| Minimum value | 0 |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
creates a table on the server that contains observationwise statistics, which are computed after fitting the model.
For more information about the available statistics, see Predicted Values and Regression Diagnostics (GENSELECT Procedure in SAS Visual Statistics: Procedures).
For more information, see OUTPUT Statement (GENSELECT Procedure in SAS Visual Statistics: Procedures).
The glmOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
names the Cook's D influence statistic.
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
names the standard influence of the observation on covariance of betas. The COVRATIO statistic measures the change in the determinant of the covariance matrix of the estimates by deleting the ith observation.
names the scaled measure of the change in the predicted value for the ith observation and is calculated by deleting the ith observation. A large value indicates that the observation is very influential in its neighborhood of the X space.
names the leverage of the observation.
names the lower bound of a confidence interval for an individual prediction.
names the lower bound of a confidence interval for the expected value of the dependent variable.
names the likelihood displacement.
names the predicted value. If you do not specify any output statistics, then by default the predicted value is named Pred.
names the ith residual divided by 1 - h, where h is the leverage, and where the model has been refit without the ith observation.
names the residual, calculated as ACTUAL minus PREDICTED.
identifies the training, validation, and test roles for the observations.
names the studentized residual with the current observation deleted.
names the standard error of the individual predicted value.
names the standard error of the mean predicted value.
names the standard error of the residual.
names the studentized residuals, which are the residuals divided by their standard errors.
names the upper bound of a confidence interval for an individual prediction.
names the upper bound of a confidence interval for the expected value of the dependent variable.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies whether to add raw and formatted values of classification variables in the ParameterEstimates table.
| Default | RAW |
|---|
specifies the fractions of the data to be used for validation and testing.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
names the variable and its values used to partition the data into training, validation, and testing roles.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
| Long form | partByVar={name="variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies a polynomial effect. All specified variables must be numeric. A design matrix column is generated for each term of the specified polynomial. By default, each of these terms is treated as a separate effect for the purpose of model building.
For more information, see Polynomial Effects (Shared Concepts).
For more information about specifying the polynomial parameter, see the common polynomial parameter (Appendix A: Common Parameters).
| Alias | poly |
|---|
specifies the method and options for performing model selection.
For more information, see selection Parameter (Shared Concepts).
| Long form | selection={method="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE"} |
|---|---|
| Shortcut form | selection="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE" |
The selectionStatement value can be one or more of the following:
when set to True, applies adaptive weights to each of the coefficients in the LASSO method.
| Default | false |
|---|
specifies options to perform best-subset selection.
The bestOptions value can be one or more of the following:
specifies the maximum number of subset models to display.
| Minimum value | 0 |
|---|
when set to True, requests estimated regression coefficients for each subset model.
| Alias | beta |
|---|---|
| Default | false |
when set to True, adds Akaike's information criterion to the selection summary.
| Alias | aic |
|---|---|
| Default | false |
when set to True, adds the Bayesian information criterion to the selection summary.
| Alias | bic |
|---|---|
| Default | false |
when set to True, adds estimated mean square error of prediction to the selection summary.
| Alias | gmsep |
|---|---|
| Default | false |
when set to True, adds final prediction error to the selection summary.
| Alias | jp |
|---|---|
| Default | false |
when set to True, adds mean square error to the selection summary.
| Alias | mse |
|---|---|
| Default | false |
when set to True, adds Amemiya's prediction criterion to the selection summary.
| Alias | pc |
|---|---|
| Default | false |
when set to True, adds root mean square error to the selection summary.
| Alias | rmse |
|---|---|
| Default | false |
when set to True, adds the Schwarz Bayesian criterion to the selection summary.
| Alias | sbc |
|---|---|
| Default | false |
when set to True, adds SP to the selection summary.
| Alias | sp |
|---|---|
| Default | false |
when set to True, adds error sum of squares to the selection summary.
| Alias | sse |
|---|---|
| Default | false |
specifies the true standard deviation of the error term to use in computing the CP and BIC statistics.
| Default | 0 |
|---|
specifies the maximum number of candidates to display at each step of the selection process, when the detail level ALL is specified.
specifies the criterion for choosing the model. The specified criterion is evaluated at each step of the selection process, and the model that yields the best value of the criterion is chosen.
For more information, see the discussion of the choose subparameter (Shared Concepts).
when set to True, evaluates (during stepwise selection) the selection criterion for all models in which an effect currently in the model is dropped or an effect not yet in the model is added. The effect whose removal from or addition to the model yields the maximum improvement to the selection criterion is dropped or added.
| Default | false |
|---|
specifies the level of detail to produce about the selection process.
For more information, see the description of the details subparameter (Shared Concepts).
| Default | SUMMARY |
|---|
specifies options to use in performing elastic net selection methods.
The enOptions value can be one or more of the following:
specifies the absolute function difference convergence criterion.
| Alias | abstol |
|---|---|
| Default | 1E-08 |
| Minimum value | 0 |
specifies the relative function difference convergence criterion.
| Alias | fTol |
|---|---|
| Minimum value | 0 |
specifies the relative gradient convergence criterion.
| Alias | gTol |
|---|---|
| Minimum value | 0 |
specifies the regularization parameters in the elastic net selection method.
specifies the elastic net mixing parameter.
specifies the number of regularization parameters in the elastic net selection method.
| Alias | nLambda |
|---|---|
| Default | 0 |
| Minimum value | 0 |
specifies the scaling factor to use in computing minimum regularization parameter.
| Range | (0, 1) |
|---|
specifies a solver for elastic net selection.
when set to True, applies scaling to beta in the elastic net selection method.
| Default | false |
|---|
specifies the number of iterations to use in the elastic net selection method.
| Default | 50 |
|---|
specifies options to use in performing the folded concave penalized (FCP) selection methods.
The fcpOptions value can be one or more of the following:
specifies the alpha value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the big M constant in the mixed integer linear programming (MILP) solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the tolerance for truncating estimated coefficients.
| Alias | coefficientTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the tolerance for integer variables in the mixed integer linear programming (MILP) solver.
| Alias | integerTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the fixed lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the lambda searching grid in the SCAD and MCP selection methods.
specifies the upper bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the number of iterations to use in searching for the best alpha value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum number of iterations to use in searching for the best lambda value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the time limit allowed for the optimization solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the lower bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the minimum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
when set to True, applies normalization in computing the crossproducts matrix.
| Default | true |
|---|
specifies the solver to use in the SCAD and MCP selection methods.
specifies the exponent of the power transformation that is applied to the parameters in forming the adaptive weights.
| Default | 1 |
|---|
specifies whether and how to apply the model hierarchy requirement. Model hierarchy refers to the requirement that, for any term to be in the model, all model effects that are contained in the term must be present in the model.
For more information, see the description of the hierarchy subparameter (Shared Concepts).
| Default | DEFAULT |
|---|
specifies the coefficients in the relaxed LASSO method.
specifies the L2 parameter in the elastic net selection method.
| Default | 0 |
|---|
specifies the upper bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | maxL2 |
|---|---|
| Default | 1 |
specifies the lower bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | minL2 |
|---|---|
| Default | 0 |
when set to True, specifies a hybrid version of the LAR or LASSO method, in which the sequence of models is determined by the LAR or LASSO method but the coefficients of the parameters for the model at any step are determined by using ordinary least squares.
| Default | false |
|---|
specifies the maximum number of effects in any model to consider during the selection process. This parameter is ignored for backward selection.
specifies the maximum number of selection steps to perform.
specifies the model selection method.
For more information, see Model Selection Methods (Shared Concepts).
| Default | STEPWISE |
|---|
specifies the minimum number of effects in any model to consider during backward selection.
when set to True, shows effects in parameter estimates tables in the order in which they were added to the model.
| Default | false |
|---|
when set to True, produces coefficientProgression and selectionSummaryForPlots tables that you can use to produce selection diagnostic plots.
For more information about the plots you can create, see Model Selection Plots (Shared Concepts in SAS Visual Statistics: Procedures).
| Default | false |
|---|
when set to True, applies the relaxed LASSO method.
| Default | false |
|---|
specifies the criterion to use in determining the order in which effects enter or leave at each step of the selection method. This parameter does not apply to LAR or LASSO selection.
For more information, see the discussion of the select subparameter (Shared Concepts).
specifies the significance level for entry when the significance level is used as the select or stop criterion.
| Alias | sle |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies the significance level for removal when the significance level is used as the select or stop criterion.
| Alias | sls |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies a criterion that to use for stopping the selection process. If you do not specify a stop criterion, then the criterion that is used to select the model is also used as the stop criterion.
For more information, see the discussion of the stop subparameter (Shared Concepts).
specifies the number of consecutive steps at which the stop criterion must worsen in order for a local extremum to be detected.
For more information, see the description of the stopHorizon subparameter (Shared Concepts).
| Default | 3 |
|---|
expands variables into spline bases whose form depends on the specified parameters.
For more information, see Spline Effects (Shared Concepts).
For more information about specifying the spline parameter, see the common spline parameter (Appendix A: Common Parameters).
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | false |
|---|
stores regression models to a binary large object (BLOB).
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | savestate |
|---|
specifies the input data table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
names the numeric variable to use to perform a weighted analysis of the data.
Fits linear regression models using the method of least squares.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
 code |
casOut |
writes SAS DATA step code for computing predicted values of the fitted model |
|
 output |
required parametercasOut |
creates a table on the server that contains observationwise statistics, which are computed after fitting the model. |
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
 store |
— |
stores regression models to a binary large object (BLOB). |
specifies the significance level to use for the construction of all confidence intervals.
| Default | 0.05 |
|---|---|
| Range | (0, 1) |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
specifies that the analysis not be performed if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
names the classification variables to be used as explanatory variables in the analysis.
For more information about class subparameters, see class Parameter (Shared Concepts).
For more information about specifying the class parameter, see the common classStatement parameter (Appendix A: Common Parameters).
| Alias | classVars |
|---|
lists options that apply to all classification variables.
For more information about specifying the classGlobalOpts parameter, see the common classopts parameter (Appendix A: Common Parameters).
when set to False, suppresses the display of class levels.
| Default | True |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | False |
|---|
writes SAS DATA step code for computing predicted values of the fitted model
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
defines a set of variables that are treated as a single effect that has multiple degrees of freedom.
For more information, see Collection Effects (Shared Concepts).
The collection value can be one or more of the following:
when set to True, requests a table that shows additional details that are related to this effect.
| Default | False |
|---|
specifies the name of the effect.
specifies a set of variables that are treated as a single effect that has multiple degrees of freedom. The columns in the design matrix that are contributed by a collection effect are the design columns of its constituent variables in the order in which they appear in the definition of the collection effect.
specifies a list of results tables to send to the client for display.
For more information about display subparameters, see display Parameter (Shared Concepts).
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
names the numeric variable that contains the frequency of occurrence of each observation.
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies that models not be fit if the number of parameters exceeds the specified value.
| Minimum value | 0 |
|---|
names the dependent variable, explanatory effects, and model options.
For information about model specification, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The glmmodel value can be one or more of the following:
when set to FALSE, ignores the information from the last stop step.
| Default | True |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | False |
|---|
specifies one or more variables to use as response variables in the model. Not all models support more than one response variable.
| Aliases | depVar |
|---|---|
| target |
names the response variable.
specifies a list of effects that define the model. Each term in this list is made up of variables specified in the vars parameter and their interaction (which can be NONE, CROSS, or BAR). When the interaction is BAR, it can be limited by the maxInteract parameter.
For more information, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The effect value can be one or more of the following:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
specifies the entry variable.
specifies effects to include at the start of the selection process for the specified selection method. Included effects are never dropped during the selection process. If you specify n, where n is a positive integer, then the included effects consist of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, models missing values by using extra model effects. These effects consist of dummy variables that take the value 1 when the value of a continuous model variable involved in the effect is missing, and take the value 0 otherwise. The missing value in the original model effect is replaced by the average value of the effect for the nonmissing values. For classification variables, missing values are treated as valid levels.
For more information, see Informative Missingness (Shared Concepts).
| Default | False |
|---|
when set to True, does not include the intercept term in the model.
| Default | False |
|---|
specifies the ridge constant values for ridge regression.
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | False |
|---|
specifies effects to use to begin the selection process in the FORWARD, FORWARDSWAP, and STEPWISE selection methods. If you specify n, where n is a positive integer, then the starting model consists of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, produces standardized regression coefficients. A standardized regression coefficient is computed by dividing a parameter estimate by the ratio of the sample standard deviation of the dependent variable to the sample standard deviation of the regressor.
| Default | False |
|---|
when set to True, produces tolerance values for the estimates. Tolerance for a variable is defined as 1 - R-square, where R-square is obtained from the regression of the variable on all other regressors in the model.
| Default | False |
|---|
when set to True, produces variance inflation factors with the parameter estimates. Variance inflation is the reciprocal of tolerance.
| Default | False |
|---|
Crossproducts
| Default | False |
|---|
Scaled Crossproducts
| Default | False |
|---|
Unscaled Crossproducts
| Default | False |
|---|
uses one or more classification variables specified in the vars parameter in such a way that each observation can be associated with one or more levels of the union of the levels of the classification variables.
For more information, see Multimember Effects (Shared Concepts).
For more information about specifying the multimember parameter, see the common multimember parameter (Appendix A: Common Parameters).
limits the display of class levels. The value 0 suppresses all levels.
| Minimum value | 0 |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
creates a table on the server that contains observationwise statistics, which are computed after fitting the model.
For more information about the available statistics, see Predicted Values and Regression Diagnostics (GENSELECT Procedure in SAS Visual Statistics: Procedures).
For more information, see OUTPUT Statement (GENSELECT Procedure in SAS Visual Statistics: Procedures).
The glmOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
names the Cook's D influence statistic.
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
names the standard influence of the observation on covariance of betas. The COVRATIO statistic measures the change in the determinant of the covariance matrix of the estimates by deleting the ith observation.
names the scaled measure of the change in the predicted value for the ith observation and is calculated by deleting the ith observation. A large value indicates that the observation is very influential in its neighborhood of the X space.
names the leverage of the observation.
names the lower bound of a confidence interval for an individual prediction.
names the lower bound of a confidence interval for the expected value of the dependent variable.
names the likelihood displacement.
names the predicted value. If you do not specify any output statistics, then by default the predicted value is named Pred.
names the ith residual divided by 1 - h, where h is the leverage, and where the model has been refit without the ith observation.
names the residual, calculated as ACTUAL minus PREDICTED.
identifies the training, validation, and test roles for the observations.
names the studentized residual with the current observation deleted.
names the standard error of the individual predicted value.
names the standard error of the mean predicted value.
names the standard error of the residual.
names the studentized residuals, which are the residuals divided by their standard errors.
names the upper bound of a confidence interval for an individual prediction.
names the upper bound of a confidence interval for the expected value of the dependent variable.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies whether to add raw and formatted values of classification variables in the ParameterEstimates table.
| Default | RAW |
|---|
specifies the fractions of the data to be used for validation and testing.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
names the variable and its values used to partition the data into training, validation, and testing roles.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
| Long form | partByVar={"name":"variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies a polynomial effect. All specified variables must be numeric. A design matrix column is generated for each term of the specified polynomial. By default, each of these terms is treated as a separate effect for the purpose of model building.
For more information, see Polynomial Effects (Shared Concepts).
For more information about specifying the polynomial parameter, see the common polynomial parameter (Appendix A: Common Parameters).
| Alias | poly |
|---|
specifies the method and options for performing model selection.
For more information, see selection Parameter (Shared Concepts).
| Long form | selection={"method":"BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE"} |
|---|---|
| Shortcut form | selection="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE" |
The selectionStatement value can be one or more of the following:
when set to True, applies adaptive weights to each of the coefficients in the LASSO method.
| Default | False |
|---|
specifies options to perform best-subset selection.
The bestOptions value can be one or more of the following:
specifies the maximum number of subset models to display.
| Minimum value | 0 |
|---|
when set to True, requests estimated regression coefficients for each subset model.
| Alias | beta |
|---|---|
| Default | False |
when set to True, adds Akaike's information criterion to the selection summary.
| Alias | aic |
|---|---|
| Default | False |
when set to True, adds the Bayesian information criterion to the selection summary.
| Alias | bic |
|---|---|
| Default | False |
when set to True, adds estimated mean square error of prediction to the selection summary.
| Alias | gmsep |
|---|---|
| Default | False |
when set to True, adds final prediction error to the selection summary.
| Alias | jp |
|---|---|
| Default | False |
when set to True, adds mean square error to the selection summary.
| Alias | mse |
|---|---|
| Default | False |
when set to True, adds Amemiya's prediction criterion to the selection summary.
| Alias | pc |
|---|---|
| Default | False |
when set to True, adds root mean square error to the selection summary.
| Alias | rmse |
|---|---|
| Default | False |
when set to True, adds the Schwarz Bayesian criterion to the selection summary.
| Alias | sbc |
|---|---|
| Default | False |
when set to True, adds SP to the selection summary.
| Alias | sp |
|---|---|
| Default | False |
when set to True, adds error sum of squares to the selection summary.
| Alias | sse |
|---|---|
| Default | False |
specifies the true standard deviation of the error term to use in computing the CP and BIC statistics.
| Default | 0 |
|---|
specifies the maximum number of candidates to display at each step of the selection process, when the detail level ALL is specified.
specifies the criterion for choosing the model. The specified criterion is evaluated at each step of the selection process, and the model that yields the best value of the criterion is chosen.
For more information, see the discussion of the choose subparameter (Shared Concepts).
when set to True, evaluates (during stepwise selection) the selection criterion for all models in which an effect currently in the model is dropped or an effect not yet in the model is added. The effect whose removal from or addition to the model yields the maximum improvement to the selection criterion is dropped or added.
| Default | False |
|---|
specifies the level of detail to produce about the selection process.
For more information, see the description of the details subparameter (Shared Concepts).
| Default | SUMMARY |
|---|
specifies options to use in performing elastic net selection methods.
The enOptions value can be one or more of the following:
specifies the absolute function difference convergence criterion.
| Alias | abstol |
|---|---|
| Default | 1E-08 |
| Minimum value | 0 |
specifies the relative function difference convergence criterion.
| Alias | fTol |
|---|---|
| Minimum value | 0 |
specifies the relative gradient convergence criterion.
| Alias | gTol |
|---|---|
| Minimum value | 0 |
specifies the regularization parameters in the elastic net selection method.
specifies the elastic net mixing parameter.
specifies the number of regularization parameters in the elastic net selection method.
| Alias | nLambda |
|---|---|
| Default | 0 |
| Minimum value | 0 |
specifies the scaling factor to use in computing minimum regularization parameter.
| Range | (0, 1) |
|---|
specifies a solver for elastic net selection.
when set to True, applies scaling to beta in the elastic net selection method.
| Default | False |
|---|
specifies the number of iterations to use in the elastic net selection method.
| Default | 50 |
|---|
specifies options to use in performing the folded concave penalized (FCP) selection methods.
The fcpOptions value can be one or more of the following:
specifies the alpha value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the big M constant in the mixed integer linear programming (MILP) solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the tolerance for truncating estimated coefficients.
| Alias | coefficientTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the tolerance for integer variables in the mixed integer linear programming (MILP) solver.
| Alias | integerTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the fixed lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the lambda searching grid in the SCAD and MCP selection methods.
specifies the upper bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the number of iterations to use in searching for the best alpha value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum number of iterations to use in searching for the best lambda value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the time limit allowed for the optimization solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the lower bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the minimum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
when set to True, applies normalization in computing the crossproducts matrix.
| Default | True |
|---|
specifies the solver to use in the SCAD and MCP selection methods.
specifies the exponent of the power transformation that is applied to the parameters in forming the adaptive weights.
| Default | 1 |
|---|
specifies whether and how to apply the model hierarchy requirement. Model hierarchy refers to the requirement that, for any term to be in the model, all model effects that are contained in the term must be present in the model.
For more information, see the description of the hierarchy subparameter (Shared Concepts).
| Default | DEFAULT |
|---|
specifies the coefficients in the relaxed LASSO method.
specifies the L2 parameter in the elastic net selection method.
| Default | 0 |
|---|
specifies the upper bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | maxL2 |
|---|---|
| Default | 1 |
specifies the lower bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | minL2 |
|---|---|
| Default | 0 |
when set to True, specifies a hybrid version of the LAR or LASSO method, in which the sequence of models is determined by the LAR or LASSO method but the coefficients of the parameters for the model at any step are determined by using ordinary least squares.
| Default | False |
|---|
specifies the maximum number of effects in any model to consider during the selection process. This parameter is ignored for backward selection.
specifies the maximum number of selection steps to perform.
specifies the model selection method.
For more information, see Model Selection Methods (Shared Concepts).
| Default | STEPWISE |
|---|
specifies the minimum number of effects in any model to consider during backward selection.
when set to True, shows effects in parameter estimates tables in the order in which they were added to the model.
| Default | False |
|---|
when set to True, produces coefficientProgression and selectionSummaryForPlots tables that you can use to produce selection diagnostic plots.
For more information about the plots you can create, see Model Selection Plots (Shared Concepts in SAS Visual Statistics: Procedures).
| Default | False |
|---|
when set to True, applies the relaxed LASSO method.
| Default | False |
|---|
specifies the criterion to use in determining the order in which effects enter or leave at each step of the selection method. This parameter does not apply to LAR or LASSO selection.
For more information, see the discussion of the select subparameter (Shared Concepts).
specifies the significance level for entry when the significance level is used as the select or stop criterion.
| Alias | sle |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies the significance level for removal when the significance level is used as the select or stop criterion.
| Alias | sls |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies a criterion that to use for stopping the selection process. If you do not specify a stop criterion, then the criterion that is used to select the model is also used as the stop criterion.
For more information, see the discussion of the stop subparameter (Shared Concepts).
specifies the number of consecutive steps at which the stop criterion must worsen in order for a local extremum to be detected.
For more information, see the description of the stopHorizon subparameter (Shared Concepts).
| Default | 3 |
|---|
expands variables into spline bases whose form depends on the specified parameters.
For more information, see Spline Effects (Shared Concepts).
For more information about specifying the spline parameter, see the common spline parameter (Appendix A: Common Parameters).
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | False |
|---|
stores regression models to a binary large object (BLOB).
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | savestate |
|---|
specifies the input data table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
names the numeric variable to use to perform a weighted analysis of the data.
Fits linear regression models using the method of least squares.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
 code |
casOut |
writes SAS DATA step code for computing predicted values of the fitted model |
|
 output |
required parametercasOut |
creates a table on the server that contains observationwise statistics, which are computed after fitting the model. |
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
 store |
— |
stores regression models to a binary large object (BLOB). |
specifies the significance level to use for the construction of all confidence intervals.
| Default | 0.05 |
|---|---|
| Range | (0, 1) |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
specifies that the analysis not be performed if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
names the classification variables to be used as explanatory variables in the analysis.
For more information about class subparameters, see class Parameter (Shared Concepts).
For more information about specifying the class parameter, see the common classStatement parameter (Appendix A: Common Parameters).
| Alias | classVars |
|---|
lists options that apply to all classification variables.
For more information about specifying the classGlobalOpts parameter, see the common classopts parameter (Appendix A: Common Parameters).
when set to False, suppresses the display of class levels.
| Default | TRUE |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | FALSE |
|---|
writes SAS DATA step code for computing predicted values of the fitted model
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
defines a set of variables that are treated as a single effect that has multiple degrees of freedom.
For more information, see Collection Effects (Shared Concepts).
The collection value can be one or more of the following:
when set to True, requests a table that shows additional details that are related to this effect.
| Default | FALSE |
|---|
specifies the name of the effect.
specifies a set of variables that are treated as a single effect that has multiple degrees of freedom. The columns in the design matrix that are contributed by a collection effect are the design columns of its constituent variables in the order in which they appear in the definition of the collection effect.
specifies a list of results tables to send to the client for display.
For more information about display subparameters, see display Parameter (Shared Concepts).
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
names the numeric variable that contains the frequency of occurrence of each observation.
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies that models not be fit if the number of parameters exceeds the specified value.
| Minimum value | 0 |
|---|
names the dependent variable, explanatory effects, and model options.
For information about model specification, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The glmmodel value can be one or more of the following:
when set to FALSE, ignores the information from the last stop step.
| Default | TRUE |
|---|
when set to True, displays upper and lower confidence limits for the parameter estimates.
| Default | FALSE |
|---|
specifies one or more variables to use as response variables in the model. Not all models support more than one response variable.
| Aliases | depVar |
|---|---|
| target |
names the response variable.
specifies a list of effects that define the model. Each term in this list is made up of variables specified in the vars parameter and their interaction (which can be NONE, CROSS, or BAR). When the interaction is BAR, it can be limited by the maxInteract parameter.
For more information, see Introduction (Specifying Linear Models for SAS Viya Analytical Actions).
The effect value can be one or more of the following:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
specifies the entry variable.
specifies effects to include at the start of the selection process for the specified selection method. Included effects are never dropped during the selection process. If you specify n, where n is a positive integer, then the included effects consist of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, models missing values by using extra model effects. These effects consist of dummy variables that take the value 1 when the value of a continuous model variable involved in the effect is missing, and take the value 0 otherwise. The missing value in the original model effect is replaced by the average value of the effect for the nonmissing values. For classification variables, missing values are treated as valid levels.
For more information, see Informative Missingness (Shared Concepts).
| Default | FALSE |
|---|
when set to True, does not include the intercept term in the model.
| Default | FALSE |
|---|
specifies the ridge constant values for ridge regression.
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | FALSE |
|---|
specifies effects to use to begin the selection process in the FORWARD, FORWARDSWAP, and STEPWISE selection methods. If you specify n, where n is a positive integer, then the starting model consists of the first n effects of the model specification.
The effect value is specified as follows:
specifies the type of interaction for the variables.
| Alias | interact |
|---|---|
| Default | NONE |
eliminates interaction effects whose order is higher than the specified integer value when used in conjunction with the BAR interaction.
specifies the variables to be nested within the term that is defined by the vars parameter. For terms with a BAR or CROSS interaction, the nest corresponds to the last variable in the vars parameter. For terms with no interaction, the nest is distributed across all variables that are listed in the vars parameter.
specifies the variables to use in defining a term of the effect. You must specify at least one variable.
when set to True, produces standardized regression coefficients. A standardized regression coefficient is computed by dividing a parameter estimate by the ratio of the sample standard deviation of the dependent variable to the sample standard deviation of the regressor.
| Default | FALSE |
|---|
when set to True, produces tolerance values for the estimates. Tolerance for a variable is defined as 1 - R-square, where R-square is obtained from the regression of the variable on all other regressors in the model.
| Default | FALSE |
|---|
when set to True, produces variance inflation factors with the parameter estimates. Variance inflation is the reciprocal of tolerance.
| Default | FALSE |
|---|
Crossproducts
| Default | FALSE |
|---|
Scaled Crossproducts
| Default | FALSE |
|---|
Unscaled Crossproducts
| Default | FALSE |
|---|
uses one or more classification variables specified in the vars parameter in such a way that each observation can be associated with one or more levels of the union of the levels of the classification variables.
For more information, see Multimember Effects (Shared Concepts).
For more information about specifying the multimember parameter, see the common multimember parameter (Appendix A: Common Parameters).
limits the display of class levels. The value 0 suppresses all levels.
| Minimum value | 0 |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
creates a table on the server that contains observationwise statistics, which are computed after fitting the model.
For more information about the available statistics, see Predicted Values and Regression Diagnostics (GENSELECT Procedure in SAS Visual Statistics: Procedures).
For more information, see OUTPUT Statement (GENSELECT Procedure in SAS Visual Statistics: Procedures).
The glmOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
names the Cook's D influence statistic.
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
names the standard influence of the observation on covariance of betas. The COVRATIO statistic measures the change in the determinant of the covariance matrix of the estimates by deleting the ith observation.
names the scaled measure of the change in the predicted value for the ith observation and is calculated by deleting the ith observation. A large value indicates that the observation is very influential in its neighborhood of the X space.
names the leverage of the observation.
names the lower bound of a confidence interval for an individual prediction.
names the lower bound of a confidence interval for the expected value of the dependent variable.
names the likelihood displacement.
names the predicted value. If you do not specify any output statistics, then by default the predicted value is named Pred.
names the ith residual divided by 1 - h, where h is the leverage, and where the model has been refit without the ith observation.
names the residual, calculated as ACTUAL minus PREDICTED.
identifies the training, validation, and test roles for the observations.
names the studentized residual with the current observation deleted.
names the standard error of the individual predicted value.
names the standard error of the mean predicted value.
names the standard error of the residual.
names the studentized residuals, which are the residuals divided by their standard errors.
names the upper bound of a confidence interval for an individual prediction.
names the upper bound of a confidence interval for the expected value of the dependent variable.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies whether to add raw and formatted values of classification variables in the ParameterEstimates table.
| Default | RAW |
|---|
specifies the fractions of the data to be used for validation and testing.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
names the variable and its values used to partition the data into training, validation, and testing roles.
For more information, see partByFrac and partByVar Partitioning Parameters (Shared Concepts).
| Long form | partByVar=list(name="variable-name") |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies a polynomial effect. All specified variables must be numeric. A design matrix column is generated for each term of the specified polynomial. By default, each of these terms is treated as a separate effect for the purpose of model building.
For more information, see Polynomial Effects (Shared Concepts).
For more information about specifying the polynomial parameter, see the common polynomial parameter (Appendix A: Common Parameters).
| Alias | poly |
|---|
specifies the method and options for performing model selection.
For more information, see selection Parameter (Shared Concepts).
| Long form | selection=list(method="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE") |
|---|---|
| Shortcut form | selection="BACKWARD" | "BESTSUBSET" | "ELASTICNET" | "FORWARD" | "FORWARDSWAP" | "LAR" | "LASSO" | "MCP" | "NONE" | "SCAD" | "STEPWISE" |
The selectionStatement value can be one or more of the following:
when set to True, applies adaptive weights to each of the coefficients in the LASSO method.
| Default | FALSE |
|---|
specifies options to perform best-subset selection.
The bestOptions value can be one or more of the following:
specifies the maximum number of subset models to display.
| Minimum value | 0 |
|---|
when set to True, requests estimated regression coefficients for each subset model.
| Alias | beta |
|---|---|
| Default | FALSE |
when set to True, adds Akaike's information criterion to the selection summary.
| Alias | aic |
|---|---|
| Default | FALSE |
when set to True, adds the Bayesian information criterion to the selection summary.
| Alias | bic |
|---|---|
| Default | FALSE |
when set to True, adds estimated mean square error of prediction to the selection summary.
| Alias | gmsep |
|---|---|
| Default | FALSE |
when set to True, adds final prediction error to the selection summary.
| Alias | jp |
|---|---|
| Default | FALSE |
when set to True, adds mean square error to the selection summary.
| Alias | mse |
|---|---|
| Default | FALSE |
when set to True, adds Amemiya's prediction criterion to the selection summary.
| Alias | pc |
|---|---|
| Default | FALSE |
when set to True, adds root mean square error to the selection summary.
| Alias | rmse |
|---|---|
| Default | FALSE |
when set to True, adds the Schwarz Bayesian criterion to the selection summary.
| Alias | sbc |
|---|---|
| Default | FALSE |
when set to True, adds SP to the selection summary.
| Alias | sp |
|---|---|
| Default | FALSE |
when set to True, adds error sum of squares to the selection summary.
| Alias | sse |
|---|---|
| Default | FALSE |
specifies the true standard deviation of the error term to use in computing the CP and BIC statistics.
| Default | 0 |
|---|
specifies the maximum number of candidates to display at each step of the selection process, when the detail level ALL is specified.
specifies the criterion for choosing the model. The specified criterion is evaluated at each step of the selection process, and the model that yields the best value of the criterion is chosen.
For more information, see the discussion of the choose subparameter (Shared Concepts).
when set to True, evaluates (during stepwise selection) the selection criterion for all models in which an effect currently in the model is dropped or an effect not yet in the model is added. The effect whose removal from or addition to the model yields the maximum improvement to the selection criterion is dropped or added.
| Default | FALSE |
|---|
specifies the level of detail to produce about the selection process.
For more information, see the description of the details subparameter (Shared Concepts).
| Default | SUMMARY |
|---|
specifies options to use in performing elastic net selection methods.
The enOptions value can be one or more of the following:
specifies the absolute function difference convergence criterion.
| Alias | abstol |
|---|---|
| Default | 1E-08 |
| Minimum value | 0 |
specifies the relative function difference convergence criterion.
| Alias | fTol |
|---|---|
| Minimum value | 0 |
specifies the relative gradient convergence criterion.
| Alias | gTol |
|---|---|
| Minimum value | 0 |
specifies the regularization parameters in the elastic net selection method.
specifies the elastic net mixing parameter.
specifies the number of regularization parameters in the elastic net selection method.
| Alias | nLambda |
|---|---|
| Default | 0 |
| Minimum value | 0 |
specifies the scaling factor to use in computing minimum regularization parameter.
| Range | (0, 1) |
|---|
specifies a solver for elastic net selection.
when set to True, applies scaling to beta in the elastic net selection method.
| Default | FALSE |
|---|
specifies the number of iterations to use in the elastic net selection method.
| Default | 50 |
|---|
specifies options to use in performing the folded concave penalized (FCP) selection methods.
The fcpOptions value can be one or more of the following:
specifies the alpha value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the big M constant in the mixed integer linear programming (MILP) solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the tolerance for truncating estimated coefficients.
| Alias | coefficientTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the tolerance for integer variables in the mixed integer linear programming (MILP) solver.
| Alias | integerTolerance |
|---|---|
| Minimum value (exclusive) | 0 |
specifies the fixed lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the lambda searching grid in the SCAD and MCP selection methods.
specifies the upper bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the number of iterations to use in searching for the best alpha value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum number of iterations to use in searching for the best lambda value in the SCAD and MCP selection methods.
| Minimum value | 2 |
|---|
specifies the maximum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the time limit allowed for the optimization solver.
| Minimum value (exclusive) | 0 |
|---|
specifies the lower bound to use in searching for alpha in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
specifies the minimum lambda value in the SCAD and MCP selection methods.
| Minimum value (exclusive) | 0 |
|---|
when set to True, applies normalization in computing the crossproducts matrix.
| Default | TRUE |
|---|
specifies the solver to use in the SCAD and MCP selection methods.
specifies the exponent of the power transformation that is applied to the parameters in forming the adaptive weights.
| Default | 1 |
|---|
specifies whether and how to apply the model hierarchy requirement. Model hierarchy refers to the requirement that, for any term to be in the model, all model effects that are contained in the term must be present in the model.
For more information, see the description of the hierarchy subparameter (Shared Concepts).
| Default | DEFAULT |
|---|
specifies the coefficients in the relaxed LASSO method.
specifies the L2 parameter in the elastic net selection method.
| Default | 0 |
|---|
specifies the upper bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | maxL2 |
|---|---|
| Default | 1 |
specifies the lower bound to use in searching for the L2 parameter in the elastic net selection method.
| Alias | minL2 |
|---|---|
| Default | 0 |
when set to True, specifies a hybrid version of the LAR or LASSO method, in which the sequence of models is determined by the LAR or LASSO method but the coefficients of the parameters for the model at any step are determined by using ordinary least squares.
| Default | FALSE |
|---|
specifies the maximum number of effects in any model to consider during the selection process. This parameter is ignored for backward selection.
specifies the maximum number of selection steps to perform.
specifies the model selection method.
For more information, see Model Selection Methods (Shared Concepts).
| Default | STEPWISE |
|---|
specifies the minimum number of effects in any model to consider during backward selection.
when set to True, shows effects in parameter estimates tables in the order in which they were added to the model.
| Default | FALSE |
|---|
when set to True, produces coefficientProgression and selectionSummaryForPlots tables that you can use to produce selection diagnostic plots.
For more information about the plots you can create, see Model Selection Plots (Shared Concepts in SAS Visual Statistics: Procedures).
| Default | FALSE |
|---|
when set to True, applies the relaxed LASSO method.
| Default | FALSE |
|---|
specifies the criterion to use in determining the order in which effects enter or leave at each step of the selection method. This parameter does not apply to LAR or LASSO selection.
For more information, see the discussion of the select subparameter (Shared Concepts).
specifies the significance level for entry when the significance level is used as the select or stop criterion.
| Alias | sle |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies the significance level for removal when the significance level is used as the select or stop criterion.
| Alias | sls |
|---|---|
| Default | 0.05 |
| Range | (0, 1) |
specifies a criterion that to use for stopping the selection process. If you do not specify a stop criterion, then the criterion that is used to select the model is also used as the stop criterion.
For more information, see the discussion of the stop subparameter (Shared Concepts).
specifies the number of consecutive steps at which the stop criterion must worsen in order for a local extremum to be detected.
For more information, see the description of the stopHorizon subparameter (Shared Concepts).
| Default | 3 |
|---|
expands variables into spline bases whose form depends on the specified parameters.
For more information, see Spline Effects (Shared Concepts).
For more information about specifying the spline parameter, see the common spline parameter (Appendix A: Common Parameters).
when set to True, performs a model analysis of variance based on type III sums of squares.
| Default | FALSE |
|---|
stores regression models to a binary large object (BLOB).
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | savestate |
|---|
specifies the input data table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
names the numeric variable to use to perform a weighted analysis of the data.