Provides actions for Boolean rule extraction and scoring
Extracts Boolean rules from text.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
table |
specifies the information about the document table. |
|
|
required parametertable |
— |
specifies the input data table for rule extraction. |
|
table |
specifies the information about the terms table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
rules, ruleTerms, candidateTerms |
specifies the data tables that contain the results generated by the action. |
specifies the data tables that contain the results generated by the action.
The output_parm value can be one or more of the following:
specifies a data table to contain the terms that have been selected for rule creation.
For more information about specifying the candidateTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the rules that have been generated by the action for each category.
For more information about specifying the rules parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the terms in each rule that is generated by the action.
For more information about specifying the ruleTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the document ID.
| Default | "_document_" |
|---|
specifies the information about the document table.
The docinfo_parm value can be one or more of the following:
specifies the values of target variables that are considered as positive events; incompatible with MULTICLASS target type.
specifies the variable that contains the document ID.
specifies the input data table that contains information about documents.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variables.
specifies the type of the target variables, either BINARY or MULTICLASS.
| Default | BINARY |
|---|
specifies the minimum g-score needed for a negative term to be considered for rule extraction.
| Alias | gNeg |
|---|---|
| Default | 8 |
| Range | 0–32767 |
specifies the minimum g-score needed for a positive.
| Alias | gPos |
|---|---|
| Default | 8 |
| Range | 1–32767 |
specifies the number of term candidates to be selected for each category.
| Default | 500 |
|---|---|
| Range | 0–32767 |
specifies the k-in value for k-best search in the term ensemble process for creating rules.
| Default | 150 |
|---|---|
| Range | 0–32767 |
specifies the k-out value for k-best search in the rule ensemble process for creating a rule set.
| Default | 50 |
|---|---|
| Range | 0–32767 |
specifies the minimum number of documents in which a term needs to appear in order for the term to be used for creating a rule.
| Default | 3 |
|---|---|
| Range | 0–32767 |
specifies the m value for computing estimated precision for negative terms.
| Alias | mNeg |
|---|---|
| Default | 4 |
| Range | 0–32767 |
specifies the m value for computing estimated precision for positive terms.
| Alias | mPos |
|---|---|
| Default | 2 |
| Range | 1–32767 |
specifies the number of threads to be used per node. The value must be an integer.
| Default | 0 |
|---|---|
| Range | 0–1024 |
specifies the input data table for rule extraction.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the term ID.
| Default | "_termnum_" |
|---|
specifies the information about the terms table.
The terminfo_parm value can be one or more of the following:
specifies the variable that contains the term ID.
specifies the variable that contains the text of the terms. This variable must be a text variable.
specifies the input data table that contains information about terms.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies whether to use the old variable names used in HPBOOLRULE
| Aliases | legacyName |
|---|---|
| legacyNames | |
| Default | FALSE |
Extracts Boolean rules from text.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
table |
specifies the information about the document table. |
|
|
required parametertable |
— |
specifies the input data table for rule extraction. |
|
table |
specifies the information about the terms table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
rules, ruleTerms, candidateTerms |
specifies the data tables that contain the results generated by the action. |
specifies the data tables that contain the results generated by the action.
The output_parm value can be one or more of the following:
specifies a data table to contain the terms that have been selected for rule creation.
For more information about specifying the candidateTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the rules that have been generated by the action for each category.
For more information about specifying the rules parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the terms in each rule that is generated by the action.
For more information about specifying the ruleTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the document ID.
| Default | "_document_" |
|---|
specifies the information about the document table.
The docinfo_parm value can be one or more of the following:
specifies the values of target variables that are considered as positive events; incompatible with MULTICLASS target type.
specifies the variable that contains the document ID.
specifies the input data table that contains information about documents.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variables.
specifies the type of the target variables, either BINARY or MULTICLASS.
| Default | BINARY |
|---|
specifies the minimum g-score needed for a negative term to be considered for rule extraction.
| Alias | gNeg |
|---|---|
| Default | 8 |
| Range | 0–32767 |
specifies the minimum g-score needed for a positive.
| Alias | gPos |
|---|---|
| Default | 8 |
| Range | 1–32767 |
specifies the number of term candidates to be selected for each category.
| Default | 500 |
|---|---|
| Range | 0–32767 |
specifies the k-in value for k-best search in the term ensemble process for creating rules.
| Default | 150 |
|---|---|
| Range | 0–32767 |
specifies the k-out value for k-best search in the rule ensemble process for creating a rule set.
| Default | 50 |
|---|---|
| Range | 0–32767 |
specifies the minimum number of documents in which a term needs to appear in order for the term to be used for creating a rule.
| Default | 3 |
|---|---|
| Range | 0–32767 |
specifies the m value for computing estimated precision for negative terms.
| Alias | mNeg |
|---|---|
| Default | 4 |
| Range | 0–32767 |
specifies the m value for computing estimated precision for positive terms.
| Alias | mPos |
|---|---|
| Default | 2 |
| Range | 1–32767 |
specifies the number of threads to be used per node. The value must be an integer.
| Default | 0 |
|---|---|
| Range | 0–1024 |
specifies the input data table for rule extraction.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the term ID.
| Default | "_termnum_" |
|---|
specifies the information about the terms table.
The terminfo_parm value can be one or more of the following:
specifies the variable that contains the term ID.
specifies the variable that contains the text of the terms. This variable must be a text variable.
specifies the input data table that contains information about terms.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies whether to use the old variable names used in HPBOOLRULE
| Aliases | legacyName |
|---|---|
| legacyNames | |
| Default | false |
Extracts Boolean rules from text.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
table |
specifies the information about the document table. |
|
|
required parametertable |
— |
specifies the input data table for rule extraction. |
|
table |
specifies the information about the terms table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
rules, ruleTerms, candidateTerms |
specifies the data tables that contain the results generated by the action. |
specifies the data tables that contain the results generated by the action.
The output_parm value can be one or more of the following:
specifies a data table to contain the terms that have been selected for rule creation.
For more information about specifying the candidateTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the rules that have been generated by the action for each category.
For more information about specifying the rules parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the terms in each rule that is generated by the action.
For more information about specifying the ruleTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the document ID.
| Default | "_document_" |
|---|
specifies the information about the document table.
The docinfo_parm value can be one or more of the following:
specifies the values of target variables that are considered as positive events; incompatible with MULTICLASS target type.
specifies the variable that contains the document ID.
specifies the input data table that contains information about documents.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variables.
specifies the type of the target variables, either BINARY or MULTICLASS.
| Default | BINARY |
|---|
specifies the minimum g-score needed for a negative term to be considered for rule extraction.
| Alias | gNeg |
|---|---|
| Default | 8 |
| Range | 0–32767 |
specifies the minimum g-score needed for a positive.
| Alias | gPos |
|---|---|
| Default | 8 |
| Range | 1–32767 |
specifies the number of term candidates to be selected for each category.
| Default | 500 |
|---|---|
| Range | 0–32767 |
specifies the k-in value for k-best search in the term ensemble process for creating rules.
| Default | 150 |
|---|---|
| Range | 0–32767 |
specifies the k-out value for k-best search in the rule ensemble process for creating a rule set.
| Default | 50 |
|---|---|
| Range | 0–32767 |
specifies the minimum number of documents in which a term needs to appear in order for the term to be used for creating a rule.
| Default | 3 |
|---|---|
| Range | 0–32767 |
specifies the m value for computing estimated precision for negative terms.
| Alias | mNeg |
|---|---|
| Default | 4 |
| Range | 0–32767 |
specifies the m value for computing estimated precision for positive terms.
| Alias | mPos |
|---|---|
| Default | 2 |
| Range | 1–32767 |
specifies the number of threads to be used per node. The value must be an integer.
| Default | 0 |
|---|---|
| Range | 0–1024 |
specifies the input data table for rule extraction.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the term ID.
| Default | "_termnum_" |
|---|
specifies the information about the terms table.
The terminfo_parm value can be one or more of the following:
specifies the variable that contains the term ID.
specifies the variable that contains the text of the terms. This variable must be a text variable.
specifies the input data table that contains information about terms.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies whether to use the old variable names used in HPBOOLRULE
| Aliases | legacyName |
|---|---|
| legacyNames | |
| Default | False |
Extracts Boolean rules from text.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
table |
specifies the information about the document table. |
|
|
required parametertable |
— |
specifies the input data table for rule extraction. |
|
table |
specifies the information about the terms table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
rules, ruleTerms, candidateTerms |
specifies the data tables that contain the results generated by the action. |
specifies the data tables that contain the results generated by the action.
The output_parm value can be one or more of the following:
specifies a data table to contain the terms that have been selected for rule creation.
For more information about specifying the candidateTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the rules that have been generated by the action for each category.
For more information about specifying the rules parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a data table to contain the terms in each rule that is generated by the action.
For more information about specifying the ruleTerms parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the document ID.
| Default | "_document_" |
|---|
specifies the information about the document table.
The docinfo_parm value can be one or more of the following:
specifies the values of target variables that are considered as positive events; incompatible with MULTICLASS target type.
specifies the variable that contains the document ID.
specifies the input data table that contains information about documents.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the target variables.
specifies the type of the target variables, either BINARY or MULTICLASS.
| Default | BINARY |
|---|
specifies the minimum g-score needed for a negative term to be considered for rule extraction.
| Alias | gNeg |
|---|---|
| Default | 8 |
| Range | 0–32767 |
specifies the minimum g-score needed for a positive.
| Alias | gPos |
|---|---|
| Default | 8 |
| Range | 1–32767 |
specifies the number of term candidates to be selected for each category.
| Default | 500 |
|---|---|
| Range | 0–32767 |
specifies the k-in value for k-best search in the term ensemble process for creating rules.
| Default | 150 |
|---|---|
| Range | 0–32767 |
specifies the k-out value for k-best search in the rule ensemble process for creating a rule set.
| Default | 50 |
|---|---|
| Range | 0–32767 |
specifies the minimum number of documents in which a term needs to appear in order for the term to be used for creating a rule.
| Default | 3 |
|---|---|
| Range | 0–32767 |
specifies the m value for computing estimated precision for negative terms.
| Alias | mNeg |
|---|---|
| Default | 4 |
| Range | 0–32767 |
specifies the m value for computing estimated precision for positive terms.
| Alias | mPos |
|---|---|
| Default | 2 |
| Range | 1–32767 |
specifies the number of threads to be used per node. The value must be an integer.
| Default | 0 |
|---|---|
| Range | 0–1024 |
specifies the input data table for rule extraction.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the variable in the TABLE= data table that contains the term ID.
| Default | "_termnum_" |
|---|
specifies the information about the terms table.
The terminfo_parm value can be one or more of the following:
specifies the variable that contains the term ID.
specifies the variable that contains the text of the terms. This variable must be a text variable.
specifies the input data table that contains information about terms.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies whether to use the old variable names used in HPBOOLRULE
| Aliases | legacyName |
|---|---|
| legacyNames | |
| Default | FALSE |