Uses Bayesian network models to classify the target variable
Bayesian Net Classifier Action.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the name of the input table that specifies the links to be included in and excluded from the network. |
|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
||
|
— |
specifies the name of the output table for the network structure and the probability distributions. |
|
|
required parametercasOut |
creates an output table to contain the predicted target values of the input table. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the table in which to save the model for future scoring. |
specifies the significance level for independence tests by using chi-square or G-square statistics. If you want to choose the best model among several, you can specify up to five numbers, separated by spaces. If you specify multiple numbers but you do not specify the value True for the bestModel parameter, the action uses the first number and ignores the remaining numbers.
| Default | 0.05 |
|---|---|
| Requirement | The specified values must be unique. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
when set to True, selects the best model.
| Default | FALSE |
|---|
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
Code Group
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the frequency variable.
specifies the variables to copy to the generated table.
specifies the method for independence tests.
| Default | CHIGSQUARE |
|---|
uses the chi-square statistic, the G-square statistic, and the normalized mutual information for independence tests. A variable is independent of the target if the p-values of both the chi-square and the G-square statistics are greater than the value of the alpha parameter and the normalized mutual information is less than the value of the miAlpha parameter.
uses both the chi-square and G-square statistics for independence tests. A variable is independent of the target if the p-values of both the chi-square and G-square statistics are greater than the value of the alpha parameter.
uses the chi-square statistic for independence tests. A variable is independent of the target if the p-value of the statistic is greater than the value of the alpha parameter.
specifies the name of the input table that specifies the links to be included in and excluded from the network.
For more information about specifying the inNetwork parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | inNet |
|---|
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies the maximum number of parents for each node in the network. If you specify the value True for the bestModel parameter, the action tries all values from 1 to the value of this parameter to find the best setting; otherwise, the specified value is used as the maximum number of parents.
| Default | 5 |
|---|---|
| Range | 1–16 |
specifies the significance level for independence tests that use mutual information.
| Default | 0.05 |
|---|---|
| Range | 0–1 |
specifies how to handle missing values for nominal variables.
| Default | IGNORE |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
specifies the binning number for interval variables.
| Default | 5 |
|---|---|
| Range | 2–1024 |
specifies the name of the output table for the network structure and the probability distributions.
For more information about specifying the outNetwork parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | outNet |
|---|
creates an output table to contain the predicted target values of the input table.
The BnetOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
renames the generated column _ROLE_ in the output data table to the specified role name.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the structure learning methods. If you want the action to choose between the two methods, you can specify both BESTONE and BESTSET and also specify the value True for the bestModel parameter. If you specify both methods but you do not specify the value True for the bestModel parameter, the action uses the first specified method and ignores the other.
| Default | BESTSET |
|---|
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
| Long form | partByVar={name="variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies the initial screening for the input variables. If you want the action to choose the best model with or without prescreening, you can specify {"ZERO","ONE"} or {"ONE","ZERO"} for the parameter and also specify the value True for the bestModel parameter. If you specify both ONE and ZERO but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the other.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
when set to True, generates names for the predicted target variable and the predicted probability variables.
| Default | FALSE |
|---|
| Default | TRUE |
|---|
specifies the table in which to save the model for future scoring.
| Long form | saveState={name="table-name"} |
|---|---|
| Shortcut form | saveState="table-name" |
The casouttable value can be one or more of the following:
specifies the name of the caslib for the output table.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the memory format for the output table.
| Default | INHERIT |
|---|
use the duplicate value reduction memory format. This memory format can reduce the memory consumption and file size when the input data contains duplicate values.
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | FALSE |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | FALSE |
|---|
specifies the network structure types. Together with the maxParents parameter, this parameter determines which network structure the action learns from the training data. If you want the action to choose the best structure among several structures, you can specify multiple values in any combination, separated by spaces, and also specify the value True for the bestModel parameter. If you specify multiple structures but you do not specify the value True for the bestModel parameter, the first value that you specify is used and the rest are ignored.
| Alias | structure |
|---|---|
| Default | PC |
| Requirement | The specified values must be unique. |
learns a general Bayesian network. If the value "GENERAL" is specified for the structures parameter, the action learns a general Bayesian network. A general Bayesian network removes the requirement of a direct connection between the target variable and the input variables that are selected to be in the network.
| Alias | GN |
|---|
learns the Markov blanket of the target variable. The Markov blanket includes the parents, the children, and other parents of the children. After learning the Markov blanket, the action further determines the parents of the target, the links from the parents to the children, and the links among the children. When you specify the value "MB" for the structures parameter, the action learns the Markov blanket regardless of the values of the preScreening and the varSelect parameters.
learns a naive Bayesian network structure (that is, the target has a direct link to each input variable). If you specify the value 1 for maxParents, the structure being trained is a naive Bayesian network. If you specify a value greater than 1 for maxParents, the structure is a Bayesian network-augmented naive Bayesian network.
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
specifies how to select input variables beyond prescreening. If you specify the value "ONE", "TWO", or "THREE", the action automatically tests each input variable for unconditional independence of the target regardless of the value of the preScreening parameter. If no variables are left at a particular variable selection level, the action rolls back to the previous level. For example, if you specify "THREE" and there are no variables in the Markov blanket of the target, the action uses the variables from the previous level "TWO". If you want to choose the best model among different levels of variable selection, you can specify any combination of values for this parameter and also specify the value True for the bestModel parameter. If you specify multiple values for the varSelect parameter but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the remaining values.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
tests each input variable for conditional independence of the target variable given any other input variable. This type of selection rejects all variables that become conditionally independent of the target variable given any other input variable.
determines the Markov blanket of the target variable and uses only the variables in the Markov blanket.
Bayesian Net Classifier Action.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the name of the input table that specifies the links to be included in and excluded from the network. |
|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
||
|
— |
specifies the name of the output table for the network structure and the probability distributions. |
|
|
required parametercasOut |
creates an output table to contain the predicted target values of the input table. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the table in which to save the model for future scoring. |
specifies the significance level for independence tests by using chi-square or G-square statistics. If you want to choose the best model among several, you can specify up to five numbers, separated by spaces. If you specify multiple numbers but you do not specify the value True for the bestModel parameter, the action uses the first number and ignores the remaining numbers.
| Default | 0.05 |
|---|---|
| Requirement | The specified values must be unique. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
when set to True, selects the best model.
| Default | false |
|---|
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
Code Group
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the frequency variable.
specifies the variables to copy to the generated table.
specifies the method for independence tests.
| Default | CHIGSQUARE |
|---|
uses the chi-square statistic, the G-square statistic, and the normalized mutual information for independence tests. A variable is independent of the target if the p-values of both the chi-square and the G-square statistics are greater than the value of the alpha parameter and the normalized mutual information is less than the value of the miAlpha parameter.
uses both the chi-square and G-square statistics for independence tests. A variable is independent of the target if the p-values of both the chi-square and G-square statistics are greater than the value of the alpha parameter.
uses the chi-square statistic for independence tests. A variable is independent of the target if the p-value of the statistic is greater than the value of the alpha parameter.
specifies the name of the input table that specifies the links to be included in and excluded from the network.
For more information about specifying the inNetwork parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | inNet |
|---|
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies the maximum number of parents for each node in the network. If you specify the value True for the bestModel parameter, the action tries all values from 1 to the value of this parameter to find the best setting; otherwise, the specified value is used as the maximum number of parents.
| Default | 5 |
|---|---|
| Range | 1–16 |
specifies the significance level for independence tests that use mutual information.
| Default | 0.05 |
|---|---|
| Range | 0–1 |
specifies how to handle missing values for nominal variables.
| Default | IGNORE |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
specifies the binning number for interval variables.
| Default | 5 |
|---|---|
| Range | 2–1024 |
specifies the name of the output table for the network structure and the probability distributions.
For more information about specifying the outNetwork parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | outNet |
|---|
creates an output table to contain the predicted target values of the input table.
The BnetOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
renames the generated column _ROLE_ in the output data table to the specified role name.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the structure learning methods. If you want the action to choose between the two methods, you can specify both BESTONE and BESTSET and also specify the value True for the bestModel parameter. If you specify both methods but you do not specify the value True for the bestModel parameter, the action uses the first specified method and ignores the other.
| Default | BESTSET |
|---|
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
| Long form | partByVar={name="variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies the initial screening for the input variables. If you want the action to choose the best model with or without prescreening, you can specify {"ZERO","ONE"} or {"ONE","ZERO"} for the parameter and also specify the value True for the bestModel parameter. If you specify both ONE and ZERO but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the other.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
when set to True, generates names for the predicted target variable and the predicted probability variables.
| Default | false |
|---|
| Default | true |
|---|
specifies the table in which to save the model for future scoring.
| Long form | saveState={name="table-name"} |
|---|---|
| Shortcut form | saveState="table-name" |
The casouttable value can be one or more of the following:
specifies the name of the caslib for the output table.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the memory format for the output table.
| Default | INHERIT |
|---|
use the duplicate value reduction memory format. This memory format can reduce the memory consumption and file size when the input data contains duplicate values.
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | false |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | false |
|---|
specifies the network structure types. Together with the maxParents parameter, this parameter determines which network structure the action learns from the training data. If you want the action to choose the best structure among several structures, you can specify multiple values in any combination, separated by spaces, and also specify the value True for the bestModel parameter. If you specify multiple structures but you do not specify the value True for the bestModel parameter, the first value that you specify is used and the rest are ignored.
| Alias | structure |
|---|---|
| Default | PC |
| Requirement | The specified values must be unique. |
learns a general Bayesian network. If the value "GENERAL" is specified for the structures parameter, the action learns a general Bayesian network. A general Bayesian network removes the requirement of a direct connection between the target variable and the input variables that are selected to be in the network.
| Alias | GN |
|---|
learns the Markov blanket of the target variable. The Markov blanket includes the parents, the children, and other parents of the children. After learning the Markov blanket, the action further determines the parents of the target, the links from the parents to the children, and the links among the children. When you specify the value "MB" for the structures parameter, the action learns the Markov blanket regardless of the values of the preScreening and the varSelect parameters.
learns a naive Bayesian network structure (that is, the target has a direct link to each input variable). If you specify the value 1 for maxParents, the structure being trained is a naive Bayesian network. If you specify a value greater than 1 for maxParents, the structure is a Bayesian network-augmented naive Bayesian network.
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
specifies how to select input variables beyond prescreening. If you specify the value "ONE", "TWO", or "THREE", the action automatically tests each input variable for unconditional independence of the target regardless of the value of the preScreening parameter. If no variables are left at a particular variable selection level, the action rolls back to the previous level. For example, if you specify "THREE" and there are no variables in the Markov blanket of the target, the action uses the variables from the previous level "TWO". If you want to choose the best model among different levels of variable selection, you can specify any combination of values for this parameter and also specify the value True for the bestModel parameter. If you specify multiple values for the varSelect parameter but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the remaining values.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
tests each input variable for conditional independence of the target variable given any other input variable. This type of selection rejects all variables that become conditionally independent of the target variable given any other input variable.
determines the Markov blanket of the target variable and uses only the variables in the Markov blanket.
Bayesian Net Classifier Action.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the name of the input table that specifies the links to be included in and excluded from the network. |
|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
||
|
— |
specifies the name of the output table for the network structure and the probability distributions. |
|
|
required parametercasOut |
creates an output table to contain the predicted target values of the input table. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the table in which to save the model for future scoring. |
specifies the significance level for independence tests by using chi-square or G-square statistics. If you want to choose the best model among several, you can specify up to five numbers, separated by spaces. If you specify multiple numbers but you do not specify the value True for the bestModel parameter, the action uses the first number and ignores the remaining numbers.
| Default | 0.05 |
|---|---|
| Requirement | The specified values must be unique. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
when set to True, selects the best model.
| Default | False |
|---|
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
Code Group
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the frequency variable.
specifies the variables to copy to the generated table.
specifies the method for independence tests.
| Default | CHIGSQUARE |
|---|
uses the chi-square statistic, the G-square statistic, and the normalized mutual information for independence tests. A variable is independent of the target if the p-values of both the chi-square and the G-square statistics are greater than the value of the alpha parameter and the normalized mutual information is less than the value of the miAlpha parameter.
uses both the chi-square and G-square statistics for independence tests. A variable is independent of the target if the p-values of both the chi-square and G-square statistics are greater than the value of the alpha parameter.
uses the chi-square statistic for independence tests. A variable is independent of the target if the p-value of the statistic is greater than the value of the alpha parameter.
specifies the name of the input table that specifies the links to be included in and excluded from the network.
For more information about specifying the inNetwork parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | inNet |
|---|
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies the maximum number of parents for each node in the network. If you specify the value True for the bestModel parameter, the action tries all values from 1 to the value of this parameter to find the best setting; otherwise, the specified value is used as the maximum number of parents.
| Default | 5 |
|---|---|
| Range | 1–16 |
specifies the significance level for independence tests that use mutual information.
| Default | 0.05 |
|---|---|
| Range | 0–1 |
specifies how to handle missing values for nominal variables.
| Default | IGNORE |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
specifies the binning number for interval variables.
| Default | 5 |
|---|---|
| Range | 2–1024 |
specifies the name of the output table for the network structure and the probability distributions.
For more information about specifying the outNetwork parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | outNet |
|---|
creates an output table to contain the predicted target values of the input table.
The BnetOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
renames the generated column _ROLE_ in the output data table to the specified role name.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the structure learning methods. If you want the action to choose between the two methods, you can specify both BESTONE and BESTSET and also specify the value True for the bestModel parameter. If you specify both methods but you do not specify the value True for the bestModel parameter, the action uses the first specified method and ignores the other.
| Default | BESTSET |
|---|
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
| Long form | partByVar={"name":"variable-name"} |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies the initial screening for the input variables. If you want the action to choose the best model with or without prescreening, you can specify {"ZERO","ONE"} or {"ONE","ZERO"} for the parameter and also specify the value True for the bestModel parameter. If you specify both ONE and ZERO but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the other.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
when set to True, generates names for the predicted target variable and the predicted probability variables.
| Default | False |
|---|
| Default | True |
|---|
specifies the table in which to save the model for future scoring.
| Long form | saveState={"name":"table-name"} |
|---|---|
| Shortcut form | saveState="table-name" |
The casouttable value can be one or more of the following:
specifies the name of the caslib for the output table.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the memory format for the output table.
| Default | INHERIT |
|---|
use the duplicate value reduction memory format. This memory format can reduce the memory consumption and file size when the input data contains duplicate values.
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | False |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | False |
|---|
specifies the network structure types. Together with the maxParents parameter, this parameter determines which network structure the action learns from the training data. If you want the action to choose the best structure among several structures, you can specify multiple values in any combination, separated by spaces, and also specify the value True for the bestModel parameter. If you specify multiple structures but you do not specify the value True for the bestModel parameter, the first value that you specify is used and the rest are ignored.
| Alias | structure |
|---|---|
| Default | PC |
| Requirement | The specified values must be unique. |
learns a general Bayesian network. If the value "GENERAL" is specified for the structures parameter, the action learns a general Bayesian network. A general Bayesian network removes the requirement of a direct connection between the target variable and the input variables that are selected to be in the network.
| Alias | GN |
|---|
learns the Markov blanket of the target variable. The Markov blanket includes the parents, the children, and other parents of the children. After learning the Markov blanket, the action further determines the parents of the target, the links from the parents to the children, and the links among the children. When you specify the value "MB" for the structures parameter, the action learns the Markov blanket regardless of the values of the preScreening and the varSelect parameters.
learns a naive Bayesian network structure (that is, the target has a direct link to each input variable). If you specify the value 1 for maxParents, the structure being trained is a naive Bayesian network. If you specify a value greater than 1 for maxParents, the structure is a Bayesian network-augmented naive Bayesian network.
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
specifies how to select input variables beyond prescreening. If you specify the value "ONE", "TWO", or "THREE", the action automatically tests each input variable for unconditional independence of the target regardless of the value of the preScreening parameter. If no variables are left at a particular variable selection level, the action rolls back to the previous level. For example, if you specify "THREE" and there are no variables in the Markov blanket of the target, the action uses the variables from the previous level "TWO". If you want to choose the best model among different levels of variable selection, you can specify any combination of values for this parameter and also specify the value True for the bestModel parameter. If you specify multiple values for the varSelect parameter but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the remaining values.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
tests each input variable for conditional independence of the target variable given any other input variable. This type of selection rejects all variables that become conditionally independent of the target variable given any other input variable.
determines the Markov blanket of the target variable and uses only the variables in the Markov blanket.
Bayesian Net Classifier Action.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
— |
specifies the name of the input table that specifies the links to be included in and excluded from the network. |
|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
||
|
— |
specifies the name of the output table for the network structure and the probability distributions. |
|
|
required parametercasOut |
creates an output table to contain the predicted target values of the input table. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the table in which to save the model for future scoring. |
specifies the significance level for independence tests by using chi-square or G-square statistics. If you want to choose the best model among several, you can specify up to five numbers, separated by spaces. If you specify multiple numbers but you do not specify the value True for the bestModel parameter, the action uses the first number and ignores the remaining numbers.
| Default | 0.05 |
|---|---|
| Requirement | The specified values must be unique. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | attribute |
|---|
when set to True, selects the best model.
| Default | FALSE |
|---|
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
Code Group
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies the frequency variable.
specifies the variables to copy to the generated table.
specifies the method for independence tests.
| Default | CHIGSQUARE |
|---|
uses the chi-square statistic, the G-square statistic, and the normalized mutual information for independence tests. A variable is independent of the target if the p-values of both the chi-square and the G-square statistics are greater than the value of the alpha parameter and the normalized mutual information is less than the value of the miAlpha parameter.
uses both the chi-square and G-square statistics for independence tests. A variable is independent of the target if the p-values of both the chi-square and G-square statistics are greater than the value of the alpha parameter.
uses the chi-square statistic for independence tests. A variable is independent of the target if the p-value of the statistic is greater than the value of the alpha parameter.
specifies the name of the input table that specifies the links to be included in and excluded from the network.
For more information about specifying the inNetwork parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | inNet |
|---|
specifies variables to use for analysis.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | input |
|---|
specifies the maximum number of parents for each node in the network. If you specify the value True for the bestModel parameter, the action tries all values from 1 to the value of this parameter to find the best setting; otherwise, the specified value is used as the maximum number of parents.
| Default | 5 |
|---|---|
| Range | 1–16 |
specifies the significance level for independence tests that use mutual information.
| Default | 0.05 |
|---|---|
| Range | 0–1 |
specifies how to handle missing values for nominal variables.
| Default | IGNORE |
|---|
specifies nominal variables to use for analysis.
For more information about specifying the nominals parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Alias | nominal |
|---|
specifies the binning number for interval variables.
| Default | 5 |
|---|---|
| Range | 2–1024 |
specifies the name of the output table for the network structure and the probability distributions.
For more information about specifying the outNetwork parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | outNet |
|---|
creates an output table to contain the predicted target values of the input table.
The BnetOutputStatement value can be one or more of the following:
specifies the settings for an output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a list of one or more variables to be copied from the input table to the output table. You can alternatively specify the value ALL, ALL_MODEL, or ALL_NUMERIC, which respectively copies all variables, all variables used in the modeling, or all numeric variables from the input table to the output table.
renames the generated column _ROLE_ in the output data table to the specified role name.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the structure learning methods. If you want the action to choose between the two methods, you can specify both BESTONE and BESTSET and also specify the value True for the bestModel parameter. If you specify both methods but you do not specify the value True for the bestModel parameter, the action uses the first specified method and ignores the other.
| Default | BESTSET |
|---|
The partByFracStatement value can be one or more of the following:
specifies the seed to use in the random number generator that is used for partitioning the data.
| Default | 0 |
|---|
randomly assigns the specified proportion of observations in the input table to the testing role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Range | 0–1 |
|---|
randomly assigns the specified proportion of observations in the input table to the validation role. The sum of the fractions that are specified in the test and validate parameters must be less than 1.
| Alias | valid |
|---|---|
| Range | 0–1 |
| Long form | partByVar=list(name="variable-name") |
|---|---|
| Shortcut form | partByVar="variable-name" |
The partByVarStatement value can be one or more of the following:
names the variable in the input table whose values are used to assign roles to each observation.
specifies the formatted value of the variable that is used to assign observations to the testing role.
specifies the formatted value of the variable that is used to assign observations to the training role. If you do not specify the train parameter, then all observations whose roles are not determined by the test and validate parameters are assigned to training.
specifies the formatted value of the variable that is used to assign observations to the validation role.
| Alias | valid |
|---|
specifies the initial screening for the input variables. If you want the action to choose the best model with or without prescreening, you can specify {"ZERO","ONE"} or {"ONE","ZERO"} for the parameter and also specify the value True for the bestModel parameter. If you specify both ONE and ZERO but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the other.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
when set to True, generates names for the predicted target variable and the predicted probability variables.
| Default | FALSE |
|---|
| Default | TRUE |
|---|
specifies the table in which to save the model for future scoring.
| Long form | saveState=list(name="table-name") |
|---|---|
| Shortcut form | saveState="table-name" |
The casouttable value can be one or more of the following:
specifies the name of the caslib for the output table.
specifies the descriptive label to associate with the table.
specifies the number of seconds to keep the table in memory after it is last accessed. The table is dropped if it is not accessed for the specified number of seconds.
| Default | 0 |
|---|---|
| Minimum value | 0 |
specifies the memory format for the output table.
| Default | INHERIT |
|---|
use the duplicate value reduction memory format. This memory format can reduce the memory consumption and file size when the input data contains duplicate values.
specifies the name for the output table.
when set to True, adds the output table with a global scope. This enables other sessions to access the table, subject to access controls. The target caslib must also have a global scope.
| Default | FALSE |
|---|
when set to True, overwrites an existing table that has the same name.
| Default | FALSE |
|---|
specifies the network structure types. Together with the maxParents parameter, this parameter determines which network structure the action learns from the training data. If you want the action to choose the best structure among several structures, you can specify multiple values in any combination, separated by spaces, and also specify the value True for the bestModel parameter. If you specify multiple structures but you do not specify the value True for the bestModel parameter, the first value that you specify is used and the rest are ignored.
| Alias | structure |
|---|---|
| Default | PC |
| Requirement | The specified values must be unique. |
learns a general Bayesian network. If the value "GENERAL" is specified for the structures parameter, the action learns a general Bayesian network. A general Bayesian network removes the requirement of a direct connection between the target variable and the input variables that are selected to be in the network.
| Alias | GN |
|---|
learns the Markov blanket of the target variable. The Markov blanket includes the parents, the children, and other parents of the children. After learning the Markov blanket, the action further determines the parents of the target, the links from the parents to the children, and the links among the children. When you specify the value "MB" for the structures parameter, the action learns the Markov blanket regardless of the values of the preScreening and the varSelect parameters.
learns a naive Bayesian network structure (that is, the target has a direct link to each input variable). If you specify the value 1 for maxParents, the structure being trained is a naive Bayesian network. If you specify a value greater than 1 for maxParents, the structure is a Bayesian network-augmented naive Bayesian network.
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variable to use for analysis.
specifies how to select input variables beyond prescreening. If you specify the value "ONE", "TWO", or "THREE", the action automatically tests each input variable for unconditional independence of the target regardless of the value of the preScreening parameter. If no variables are left at a particular variable selection level, the action rolls back to the previous level. For example, if you specify "THREE" and there are no variables in the Markov blanket of the target, the action uses the variables from the previous level "TWO". If you want to choose the best model among different levels of variable selection, you can specify any combination of values for this parameter and also specify the value True for the bestModel parameter. If you specify multiple values for the varSelect parameter but you do not specify the value True for the bestModel parameter, the action uses the first specified value and ignores the remaining values.
| Default | ONE |
|---|---|
| Requirement | The specified values must be unique. |
tests each input variable for conditional independence of the target variable given any other input variable. This type of selection rejects all variables that become conditionally independent of the target variable given any other input variable.
determines the Markov blanket of the target variable and uses only the variables in the Markov blanket.