Provides actions for explaining already trained models.
Uses linear models to explain already trained models. Supports global linear surrogates as well as the local methods: LIME and KERNEL SHAP.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code. |
|
|
— |
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables. |
|
|
required parameterquery |
— |
specifies the query input data table, which contains the query observation. |
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the output table to contain the generated data that are created during the explanation process. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
specifies the ID variables that were used in the creation of the analytic store model. This parameter applies only when an analytic store model is used as the model table.
| Aliases | id |
|---|---|
| copyVars | |
| copyVar |
specifies the bin width (relative to the standard deviation) to use to create the binary encoding for interval variables.
| Default | 0.1 |
|---|---|
| Minimum value | 1E-06 |
specifies the DATA step code that describes the model or DS2 code that is used along with an analytic store that is specified in the modelTable or modelTables parameter.
| Alias | codeString |
|---|
specifies the method and options to use for data generation.
| Long form | dataGeneration={method="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION"} |
|---|---|
| Shortcut form | dataGeneration="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION" |
The dataGenOptions value can be one or more of the following:
specifies what types of variables to map to a binary encoding. The default is ALL if the value of the preset parameter is KERNELSHAP. Otherwise the default is NONE.
| Default | NONE |
|---|
specifies the method of data generation. The default is NONE if the value of the preset parameter is GLOBALREG. The default is REFERENCEDISTRIBUTION if the value of the preset parameter is KERNELSHAP. The default is QUERYCENTERED if the value of the preset parameter is LIME.
specifies the number of observations to generate. The default is the minimum of 500 times the number of variables that are listed in the input parameter and 1,000,000.
| Minimum value | 2 |
|---|
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the options to use for distance calculation between the observations and the query.
| Long form | distance={measure="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM"} |
|---|---|
| Shortcut form | distance="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM" |
The distanceOptions value can be one or more of the following:
specifies the denominator in the exponential kernel when the value of the measure parameter is EUCLIDEAN or NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the method of distance calculation. The default is UNIFORM if the value of the preset parameter is GLOBALREG. The default is SHAPKERNEL if the value of the preset parameter is KERNELSHAP. The default is NORMEUCLIDEAN if the value of the preset parameter is LIME.
uses the Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
uses the normalized Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
specifies the distance calculation weighting that balances interval and nominal inputs. The default is a function based upon the input data, described in the details. The default is 1 if the value of the measure parameter is NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the explainer type and options to use.
| Long form | explainer={method="LASSOREGRESSION" | "REGRESSION"} |
|---|---|
| Shortcut form | explainer="LASSOREGRESSION" | "REGRESSION" |
The explainerOptions value can be one or more of the following:
specifies the maximum number of effects that the lasso regression can select.
| Minimum value | 1 |
|---|
specifies the method to use to explain the input model. The default is LASSOREGRESSION if the value of the preset parameter is LIME. The default is REGRESSION otherwise.
specifies the minimum number of effects that the lasso regression can select.
| Minimum value | 0 |
|---|
names the numeric variable that contains the frequency of occurrence of each observation.
specifies the output table to contain the generated data that are created during the explanation process.
For more information about specifying the generatedOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
when set to True, treats missing values of nominal variables as a valid level.
| Default | FALSE |
|---|
specifies the model input variables to use in the analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies a value to control the amount of notes to print to the client log. A value of 0 prints only warnings and errors; a value of 1 prints some notes; a value of 2 prints many notes.
| Default | 1 |
|---|---|
| Range | 0–2 |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code.
| Long form | modelTable={name="table-name"} |
|---|---|
| Shortcut form | modelTable="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables.
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the nominal model input variables to use in the analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
lists the names of results tables to save as CAS tables on the server.
| Alias | displayOut |
|---|
The outputTables value can be one or more of the following:
when set to True, an output CAS table is automatically created for each result table. The name of the created output CAS table is the same as the corresponding result table name. This parameter when set to true takes precedence over the names parameter.
| Default | FALSE |
|---|
specifies the results tables to save as CAS tables on the server. By default, each created CAS table uses the same name as the results table. To use a different name, specify a casouttable value for the corresponding results table name.
The casouttable value is specified as follows:
specifies the name of the caslib for the output table.
when set to True, applies data compression to the table.
| Default | FALSE |
|---|
specifies the list of variables to create indexes for in the output data.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the maximum amount of memory, in bytes, that each thread should allocate for in-memory blocks before converting to a memory-mapped file. Files are written in the directories that are specified in the CAS_DISK_CACHE environment variable.
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
|---|
specifies the memory format for the output table.
| Default | INHERIT |
|---|
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | FALSE |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | FALSE |
|---|
specifies the number of copies of the table to make for fault tolerance. Larger values result in slower performance and use more memory, but provide high availability for data in the event of a node failure. Data redundancy applies to distributed servers only.
| Default | 1 |
|---|---|
| Minimum value | 0 |
Specifies the Table Redistribution Policy when the number of worker pods increases on a running CAS server.
specifies the number of bytes to use for blocks in the output table. The blocks are read by threads. Gradually increase this value when you have a large table with millions or billions of rows and you are tuning for performance. Larger values can increase performance with indexed tables. However, if the value is too large, then you can cause thread starvation due to too few blocks for threads to work on.
| Alias | blockSize |
|---|---|
| Default | 1048576 |
| Minimum value | 0 |
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
specifies to add a timestamp column to the table. Support for timeStamp is action-specific. Specify the value in the form that is appropriate for your session locale.
specifies one or more expressions for subsetting the output data. When multiple expressions are specified, the expressions are effectively combined using AND to form the final output filter. If an expression contains quoted values, use nested quotation marks.
when set to True, replicates CAS tables on all nodes. This parameter applies to all tables.
| Default | FALSE |
|---|
when set to True, overwrites an existing CAS table of the same name. This parameter applies to all tables specified in the names parameter unless overridden by the replace parameter in individual casouttable specifications.
| Default | TRUE |
|---|
specifies the variable that contains the model's predictions. The variable typically has a P_ prefix.
defines default values for the binaryEncoding subparameter of the dataGeneration parameter, the method subparameter of the dataGeneration parameter, the measure subparameter of the distance parameter, and the method subparameter of the explainer parameter.
| Default | LIME |
|---|
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to NONE, the default value of the measure subparameter of the distance parameter to UNIFORM, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to ALL, the default value of the method subparameter of the dataGeneration parameter to REFERENCEDISTRIBUTION, the default value of the measure subparameter of the distance parameter to SHAPKERNEL, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to QUERYCENTERED, the default value of the measure subparameter of the distance parameter to NORMEUCLIDEAN, and the default value of the method subparameter of the explainer parameter to LASSOREGRESSION.
specifies the query input data table, which contains the query observation.
| Long form | query={name="table-name"} |
|---|---|
| Shortcut form | query="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | FALSE |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | FALSE |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the seed value for random number generation.
| Default | 0 |
|---|
specifies the input data table.
| Long form | table={name="table-name"} |
|---|---|
| Shortcut form | table="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | FALSE |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | FALSE |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
names the numeric variable to use to perform a weighted analysis of the data.
Uses linear models to explain already trained models. Supports global linear surrogates as well as the local methods: LIME and KERNEL SHAP.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code. |
|
|
— |
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables. |
|
|
required parameterquery |
— |
specifies the query input data table, which contains the query observation. |
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the output table to contain the generated data that are created during the explanation process. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
specifies the ID variables that were used in the creation of the analytic store model. This parameter applies only when an analytic store model is used as the model table.
| Aliases | id |
|---|---|
| copyVars | |
| copyVar |
specifies the bin width (relative to the standard deviation) to use to create the binary encoding for interval variables.
| Default | 0.1 |
|---|---|
| Minimum value | 1E-06 |
specifies the DATA step code that describes the model or DS2 code that is used along with an analytic store that is specified in the modelTable or modelTables parameter.
| Alias | codeString |
|---|
specifies the method and options to use for data generation.
| Long form | dataGeneration={method="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION"} |
|---|---|
| Shortcut form | dataGeneration="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION" |
The dataGenOptions value can be one or more of the following:
specifies what types of variables to map to a binary encoding. The default is ALL if the value of the preset parameter is KERNELSHAP. Otherwise the default is NONE.
| Default | NONE |
|---|
specifies the method of data generation. The default is NONE if the value of the preset parameter is GLOBALREG. The default is REFERENCEDISTRIBUTION if the value of the preset parameter is KERNELSHAP. The default is QUERYCENTERED if the value of the preset parameter is LIME.
specifies the number of observations to generate. The default is the minimum of 500 times the number of variables that are listed in the input parameter and 1,000,000.
| Minimum value | 2 |
|---|
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the options to use for distance calculation between the observations and the query.
| Long form | distance={measure="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM"} |
|---|---|
| Shortcut form | distance="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM" |
The distanceOptions value can be one or more of the following:
specifies the denominator in the exponential kernel when the value of the measure parameter is EUCLIDEAN or NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the method of distance calculation. The default is UNIFORM if the value of the preset parameter is GLOBALREG. The default is SHAPKERNEL if the value of the preset parameter is KERNELSHAP. The default is NORMEUCLIDEAN if the value of the preset parameter is LIME.
uses the Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
uses the normalized Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
specifies the distance calculation weighting that balances interval and nominal inputs. The default is a function based upon the input data, described in the details. The default is 1 if the value of the measure parameter is NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the explainer type and options to use.
| Long form | explainer={method="LASSOREGRESSION" | "REGRESSION"} |
|---|---|
| Shortcut form | explainer="LASSOREGRESSION" | "REGRESSION" |
The explainerOptions value can be one or more of the following:
specifies the maximum number of effects that the lasso regression can select.
| Minimum value | 1 |
|---|
specifies the method to use to explain the input model. The default is LASSOREGRESSION if the value of the preset parameter is LIME. The default is REGRESSION otherwise.
specifies the minimum number of effects that the lasso regression can select.
| Minimum value | 0 |
|---|
names the numeric variable that contains the frequency of occurrence of each observation.
specifies the output table to contain the generated data that are created during the explanation process.
For more information about specifying the generatedOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
when set to True, treats missing values of nominal variables as a valid level.
| Default | false |
|---|
specifies the model input variables to use in the analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies a value to control the amount of notes to print to the client log. A value of 0 prints only warnings and errors; a value of 1 prints some notes; a value of 2 prints many notes.
| Default | 1 |
|---|---|
| Range | 0–2 |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code.
| Long form | modelTable={name="table-name"} |
|---|---|
| Shortcut form | modelTable="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables.
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the nominal model input variables to use in the analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
lists the names of results tables to save as CAS tables on the server.
| Alias | displayOut |
|---|
The outputTables value can be one or more of the following:
when set to True, an output CAS table is automatically created for each result table. The name of the created output CAS table is the same as the corresponding result table name. This parameter when set to true takes precedence over the names parameter.
| Default | false |
|---|
specifies the results tables to save as CAS tables on the server. By default, each created CAS table uses the same name as the results table. To use a different name, specify a casouttable value for the corresponding results table name.
The casouttable value is specified as follows:
specifies the name of the caslib for the output table.
when set to True, applies data compression to the table.
| Default | false |
|---|
specifies the list of variables to create indexes for in the output data.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the maximum amount of memory, in bytes, that each thread should allocate for in-memory blocks before converting to a memory-mapped file. Files are written in the directories that are specified in the CAS_DISK_CACHE environment variable.
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
|---|
specifies the memory format for the output table.
| Default | INHERIT |
|---|
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | false |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | false |
|---|
specifies the number of copies of the table to make for fault tolerance. Larger values result in slower performance and use more memory, but provide high availability for data in the event of a node failure. Data redundancy applies to distributed servers only.
| Default | 1 |
|---|---|
| Minimum value | 0 |
Specifies the Table Redistribution Policy when the number of worker pods increases on a running CAS server.
specifies the number of bytes to use for blocks in the output table. The blocks are read by threads. Gradually increase this value when you have a large table with millions or billions of rows and you are tuning for performance. Larger values can increase performance with indexed tables. However, if the value is too large, then you can cause thread starvation due to too few blocks for threads to work on.
| Alias | blockSize |
|---|---|
| Default | 1048576 |
| Minimum value | 0 |
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
specifies to add a timestamp column to the table. Support for timeStamp is action-specific. Specify the value in the form that is appropriate for your session locale.
specifies one or more expressions for subsetting the output data. When multiple expressions are specified, the expressions are effectively combined using AND to form the final output filter. If an expression contains quoted values, use nested quotation marks.
when set to True, replicates CAS tables on all nodes. This parameter applies to all tables.
| Default | false |
|---|
when set to True, overwrites an existing CAS table of the same name. This parameter applies to all tables specified in the names parameter unless overridden by the replace parameter in individual casouttable specifications.
| Default | true |
|---|
specifies the variable that contains the model's predictions. The variable typically has a P_ prefix.
defines default values for the binaryEncoding subparameter of the dataGeneration parameter, the method subparameter of the dataGeneration parameter, the measure subparameter of the distance parameter, and the method subparameter of the explainer parameter.
| Default | LIME |
|---|
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to NONE, the default value of the measure subparameter of the distance parameter to UNIFORM, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to ALL, the default value of the method subparameter of the dataGeneration parameter to REFERENCEDISTRIBUTION, the default value of the measure subparameter of the distance parameter to SHAPKERNEL, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to QUERYCENTERED, the default value of the measure subparameter of the distance parameter to NORMEUCLIDEAN, and the default value of the method subparameter of the explainer parameter to LASSOREGRESSION.
specifies the query input data table, which contains the query observation.
| Long form | query={name="table-name"} |
|---|---|
| Shortcut form | query="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | false |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | false |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the seed value for random number generation.
| Default | 0 |
|---|
specifies the input data table.
| Long form | table={name="table-name"} |
|---|---|
| Shortcut form | table="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | false |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | false |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
names the numeric variable to use to perform a weighted analysis of the data.
Uses linear models to explain already trained models. Supports global linear surrogates as well as the local methods: LIME and KERNEL SHAP.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code. |
|
|
— |
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables. |
|
|
required parameterquery |
— |
specifies the query input data table, which contains the query observation. |
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the output table to contain the generated data that are created during the explanation process. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
specifies the ID variables that were used in the creation of the analytic store model. This parameter applies only when an analytic store model is used as the model table.
| Aliases | id |
|---|---|
| copyVars | |
| copyVar |
specifies the bin width (relative to the standard deviation) to use to create the binary encoding for interval variables.
| Default | 0.1 |
|---|---|
| Minimum value | 1E-06 |
specifies the DATA step code that describes the model or DS2 code that is used along with an analytic store that is specified in the modelTable or modelTables parameter.
| Alias | codeString |
|---|
specifies the method and options to use for data generation.
| Long form | dataGeneration={"method":"NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION"} |
|---|---|
| Shortcut form | dataGeneration="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION" |
The dataGenOptions value can be one or more of the following:
specifies what types of variables to map to a binary encoding. The default is ALL if the value of the preset parameter is KERNELSHAP. Otherwise the default is NONE.
| Default | NONE |
|---|
specifies the method of data generation. The default is NONE if the value of the preset parameter is GLOBALREG. The default is REFERENCEDISTRIBUTION if the value of the preset parameter is KERNELSHAP. The default is QUERYCENTERED if the value of the preset parameter is LIME.
specifies the number of observations to generate. The default is the minimum of 500 times the number of variables that are listed in the input parameter and 1,000,000.
| Minimum value | 2 |
|---|
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the options to use for distance calculation between the observations and the query.
| Long form | distance={"measure":"EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM"} |
|---|---|
| Shortcut form | distance="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM" |
The distanceOptions value can be one or more of the following:
specifies the denominator in the exponential kernel when the value of the measure parameter is EUCLIDEAN or NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the method of distance calculation. The default is UNIFORM if the value of the preset parameter is GLOBALREG. The default is SHAPKERNEL if the value of the preset parameter is KERNELSHAP. The default is NORMEUCLIDEAN if the value of the preset parameter is LIME.
uses the Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
uses the normalized Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
specifies the distance calculation weighting that balances interval and nominal inputs. The default is a function based upon the input data, described in the details. The default is 1 if the value of the measure parameter is NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the explainer type and options to use.
| Long form | explainer={"method":"LASSOREGRESSION" | "REGRESSION"} |
|---|---|
| Shortcut form | explainer="LASSOREGRESSION" | "REGRESSION" |
The explainerOptions value can be one or more of the following:
specifies the maximum number of effects that the lasso regression can select.
| Minimum value | 1 |
|---|
specifies the method to use to explain the input model. The default is LASSOREGRESSION if the value of the preset parameter is LIME. The default is REGRESSION otherwise.
specifies the minimum number of effects that the lasso regression can select.
| Minimum value | 0 |
|---|
names the numeric variable that contains the frequency of occurrence of each observation.
specifies the output table to contain the generated data that are created during the explanation process.
For more information about specifying the generatedOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
when set to True, treats missing values of nominal variables as a valid level.
| Default | False |
|---|
specifies the model input variables to use in the analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies a value to control the amount of notes to print to the client log. A value of 0 prints only warnings and errors; a value of 1 prints some notes; a value of 2 prints many notes.
| Default | 1 |
|---|---|
| Range | 0–2 |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code.
| Long form | modelTable={"name":"table-name"} |
|---|---|
| Shortcut form | modelTable="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables.
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the nominal model input variables to use in the analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
lists the names of results tables to save as CAS tables on the server.
| Alias | displayOut |
|---|
The outputTables value can be one or more of the following:
when set to True, an output CAS table is automatically created for each result table. The name of the created output CAS table is the same as the corresponding result table name. This parameter when set to true takes precedence over the names parameter.
| Default | False |
|---|
specifies the results tables to save as CAS tables on the server. By default, each created CAS table uses the same name as the results table. To use a different name, specify a casouttable value for the corresponding results table name.
The casouttable value is specified as follows:
specifies the name of the caslib for the output table.
when set to True, applies data compression to the table.
| Default | False |
|---|
specifies the list of variables to create indexes for in the output data.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the maximum amount of memory, in bytes, that each thread should allocate for in-memory blocks before converting to a memory-mapped file. Files are written in the directories that are specified in the CAS_DISK_CACHE environment variable.
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
|---|
specifies the memory format for the output table.
| Default | INHERIT |
|---|
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | False |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | False |
|---|
specifies the number of copies of the table to make for fault tolerance. Larger values result in slower performance and use more memory, but provide high availability for data in the event of a node failure. Data redundancy applies to distributed servers only.
| Default | 1 |
|---|---|
| Minimum value | 0 |
Specifies the Table Redistribution Policy when the number of worker pods increases on a running CAS server.
specifies the number of bytes to use for blocks in the output table. The blocks are read by threads. Gradually increase this value when you have a large table with millions or billions of rows and you are tuning for performance. Larger values can increase performance with indexed tables. However, if the value is too large, then you can cause thread starvation due to too few blocks for threads to work on.
| Alias | blockSize |
|---|---|
| Default | 1048576 |
| Minimum value | 0 |
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
specifies to add a timestamp column to the table. Support for timeStamp is action-specific. Specify the value in the form that is appropriate for your session locale.
specifies one or more expressions for subsetting the output data. When multiple expressions are specified, the expressions are effectively combined using AND to form the final output filter. If an expression contains quoted values, use nested quotation marks.
when set to True, replicates CAS tables on all nodes. This parameter applies to all tables.
| Default | False |
|---|
when set to True, overwrites an existing CAS table of the same name. This parameter applies to all tables specified in the names parameter unless overridden by the replace parameter in individual casouttable specifications.
| Default | True |
|---|
specifies the variable that contains the model's predictions. The variable typically has a P_ prefix.
defines default values for the binaryEncoding subparameter of the dataGeneration parameter, the method subparameter of the dataGeneration parameter, the measure subparameter of the distance parameter, and the method subparameter of the explainer parameter.
| Default | LIME |
|---|
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to NONE, the default value of the measure subparameter of the distance parameter to UNIFORM, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to ALL, the default value of the method subparameter of the dataGeneration parameter to REFERENCEDISTRIBUTION, the default value of the measure subparameter of the distance parameter to SHAPKERNEL, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to QUERYCENTERED, the default value of the measure subparameter of the distance parameter to NORMEUCLIDEAN, and the default value of the method subparameter of the explainer parameter to LASSOREGRESSION.
specifies the query input data table, which contains the query observation.
| Long form | query={"name":"table-name"} |
|---|---|
| Shortcut form | query="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | False |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | False |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the seed value for random number generation.
| Default | 0 |
|---|
specifies the input data table.
| Long form | table={"name":"table-name"} |
|---|---|
| Shortcut form | table="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | False |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | False |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import_ |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
names the numeric variable to use to perform a weighted analysis of the data.
Uses linear models to explain already trained models. Supports global linear surrogates as well as the local methods: LIME and KERNEL SHAP.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code. |
|
|
— |
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables. |
|
|
required parameterquery |
— |
specifies the query input data table, which contains the query observation. |
|
required parametertable |
— |
specifies the input data table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the output table to contain the generated data that are created during the explanation process. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
specifies the ID variables that were used in the creation of the analytic store model. This parameter applies only when an analytic store model is used as the model table.
| Aliases | id |
|---|---|
| copyVars | |
| copyVar |
specifies the bin width (relative to the standard deviation) to use to create the binary encoding for interval variables.
| Default | 0.1 |
|---|---|
| Minimum value | 1E-06 |
specifies the DATA step code that describes the model or DS2 code that is used along with an analytic store that is specified in the modelTable or modelTables parameter.
| Alias | codeString |
|---|
specifies the method and options to use for data generation.
| Long form | dataGeneration=list(method="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION") |
|---|---|
| Shortcut form | dataGeneration="NONE" | "QUERYCENTERED" | "REFERENCEDISTRIBUTION" |
The dataGenOptions value can be one or more of the following:
specifies what types of variables to map to a binary encoding. The default is ALL if the value of the preset parameter is KERNELSHAP. Otherwise the default is NONE.
| Default | NONE |
|---|
specifies the method of data generation. The default is NONE if the value of the preset parameter is GLOBALREG. The default is REFERENCEDISTRIBUTION if the value of the preset parameter is KERNELSHAP. The default is QUERYCENTERED if the value of the preset parameter is LIME.
specifies the number of observations to generate. The default is the minimum of 500 times the number of variables that are listed in the input parameter and 1,000,000.
| Minimum value | 2 |
|---|
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the options to use for distance calculation between the observations and the query.
| Long form | distance=list(measure="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM") |
|---|---|
| Shortcut form | distance="EUCLIDEAN" | "NORMEUCLIDEAN" | "SHAPKERNEL" | "UNIFORM" |
The distanceOptions value can be one or more of the following:
specifies the denominator in the exponential kernel when the value of the measure parameter is EUCLIDEAN or NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the method of distance calculation. The default is UNIFORM if the value of the preset parameter is GLOBALREG. The default is SHAPKERNEL if the value of the preset parameter is KERNELSHAP. The default is NORMEUCLIDEAN if the value of the preset parameter is LIME.
uses the Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
uses the normalized Euclidean distance in the exponential kernel to calculate weights for the weighted regression.
specifies the distance calculation weighting that balances interval and nominal inputs. The default is a function based upon the input data, described in the details. The default is 1 if the value of the measure parameter is NORMEUCLIDEAN.
| Minimum value | 1E-06 |
|---|
specifies the explainer type and options to use.
| Long form | explainer=list(method="LASSOREGRESSION" | "REGRESSION") |
|---|---|
| Shortcut form | explainer="LASSOREGRESSION" | "REGRESSION" |
The explainerOptions value can be one or more of the following:
specifies the maximum number of effects that the lasso regression can select.
| Minimum value | 1 |
|---|
specifies the method to use to explain the input model. The default is LASSOREGRESSION if the value of the preset parameter is LIME. The default is REGRESSION otherwise.
specifies the minimum number of effects that the lasso regression can select.
| Minimum value | 0 |
|---|
names the numeric variable that contains the frequency of occurrence of each observation.
specifies the output table to contain the generated data that are created during the explanation process.
For more information about specifying the generatedOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
when set to True, treats missing values of nominal variables as a valid level.
| Default | FALSE |
|---|
specifies the model input variables to use in the analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies a value to control the amount of notes to print to the client log. A value of 0 prints only warnings and errors; a value of 1 prints some notes; a value of 2 prints many notes.
| Default | 1 |
|---|---|
| Range | 0–2 |
specifies the input table that contains the model to explain. This table must contain an analytic store or DATA step score code.
| Long form | modelTable=list(name="table-name") |
|---|---|
| Shortcut form | modelTable="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the input tables that contain the model to explain. These table must contain analytic stores, and you must also specify in the code parameter the DS2 code that accompanies these tables.
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the name of the input table.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the nominal model input variables to use in the analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
lists the names of results tables to save as CAS tables on the server.
| Alias | displayOut |
|---|
The outputTables value can be one or more of the following:
when set to True, an output CAS table is automatically created for each result table. The name of the created output CAS table is the same as the corresponding result table name. This parameter when set to true takes precedence over the names parameter.
| Default | FALSE |
|---|
specifies the results tables to save as CAS tables on the server. By default, each created CAS table uses the same name as the results table. To use a different name, specify a casouttable value for the corresponding results table name.
The casouttable value is specified as follows:
specifies the name of the caslib for the output table.
when set to True, applies data compression to the table.
| Default | FALSE |
|---|
specifies the list of variables to create indexes for in the output data.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the maximum amount of memory, in bytes, that each thread should allocate for in-memory blocks before converting to a memory-mapped file. Files are written in the directories that are specified in the CAS_DISK_CACHE environment variable.
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
|---|
specifies the memory format for the output table.
| Default | INHERIT |
|---|
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | FALSE |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | FALSE |
|---|
specifies the number of copies of the table to make for fault tolerance. Larger values result in slower performance and use more memory, but provide high availability for data in the event of a node failure. Data redundancy applies to distributed servers only.
| Default | 1 |
|---|---|
| Minimum value | 0 |
Specifies the Table Redistribution Policy when the number of worker pods increases on a running CAS server.
specifies the number of bytes to use for blocks in the output table. The blocks are read by threads. Gradually increase this value when you have a large table with millions or billions of rows and you are tuning for performance. Larger values can increase performance with indexed tables. However, if the value is too large, then you can cause thread starvation due to too few blocks for threads to work on.
| Alias | blockSize |
|---|---|
| Default | 1048576 |
| Minimum value | 0 |
| TIP | You can enclose the value in quotation marks and specify B, K, M, G, or T as a suffix to indicate the units. For example, "8M" specifies eight megabytes. |
specifies to add a timestamp column to the table. Support for timeStamp is action-specific. Specify the value in the form that is appropriate for your session locale.
specifies one or more expressions for subsetting the output data. When multiple expressions are specified, the expressions are effectively combined using AND to form the final output filter. If an expression contains quoted values, use nested quotation marks.
when set to True, replicates CAS tables on all nodes. This parameter applies to all tables.
| Default | FALSE |
|---|
when set to True, overwrites an existing CAS table of the same name. This parameter applies to all tables specified in the names parameter unless overridden by the replace parameter in individual casouttable specifications.
| Default | TRUE |
|---|
specifies the variable that contains the model's predictions. The variable typically has a P_ prefix.
defines default values for the binaryEncoding subparameter of the dataGeneration parameter, the method subparameter of the dataGeneration parameter, the measure subparameter of the distance parameter, and the method subparameter of the explainer parameter.
| Default | LIME |
|---|
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to NONE, the default value of the measure subparameter of the distance parameter to UNIFORM, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to ALL, the default value of the method subparameter of the dataGeneration parameter to REFERENCEDISTRIBUTION, the default value of the measure subparameter of the distance parameter to SHAPKERNEL, and the default value of the method subparameter of the explainer parameter to REGRESSION.
sets the default value of the binaryEncoding subparameter of the dataGeneration parameter to NONE, the default value of the method subparameter of the dataGeneration parameter to QUERYCENTERED, the default value of the measure subparameter of the distance parameter to NORMEUCLIDEAN, and the default value of the method subparameter of the explainer parameter to LASSOREGRESSION.
specifies the query input data table, which contains the query observation.
| Long form | query=list(name="table-name") |
|---|---|
| Shortcut form | query="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | FALSE |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | FALSE |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
specifies the seed value for random number generation.
| Default | 0 |
|---|
specifies the input data table.
| Long form | table=list(name="table-name") |
|---|---|
| Shortcut form | table="table-name" |
The castable value can be one or more of the following:
specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.
when set to True, creates the computed variables when the table is loaded instead of when the action begins.
| Alias | compOnDemand |
|---|---|
| Default | FALSE |
specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.
| Alias | compVars |
|---|
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for each computed variable that you include in the computedVars parameter.
| Alias | compPgm |
|---|
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the input table.
when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.
| Default | FALSE |
|---|
specifies the variables to use in the action.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the input data.
specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.
The groupbytable value can be one or more of the following:
specifies the caslib for the filter table. By default, the active caslib is used.
specifies data source options.
| Aliases | options |
|---|---|
| dataSource |
For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).
specifies the settings for reading a table from a data source.
| Alias | import |
|---|
For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).
specifies the name of the filter table.
specifies the variable names to use from the filter table.
The casinvardesc value can be one or more of the following:
specifies the format to apply to the variable.
specifies the length of the format field plus the length of the format precision.
specifies the descriptive label for the variable.
specifies the name for the variable.
specifies the length of the format precision.
specifies the length of the format field.
specifies an expression for subsetting the data from the filter table.
names the numeric variable to use to perform a weighted analysis of the data.